Project description:BACKGROUND: Human genetic diversity observed in Indian subcontinent is second only to that of Africa. This implies an early settlement and demographic growth soon after the first 'Out-of-Africa' dispersal of anatomically modern humans in Late Pleistocene. In contrast to this perspective, linguistic diversity in India has been thought to derive from more recent population movements and episodes of contact. With the exception of Dravidian, which origin and relatedness to other language phyla is obscure, all the language families in India can be linked to language families spoken in different regions of Eurasia. Mitochondrial DNA and Y chromosome evidence has supported largely local evolution of the genetic lineages of the majority of Dravidian and Indo-European speaking populations, but there is no consensus yet on the question of whether the Munda (Austro-Asiatic) speaking populations originated in India or derive from a relatively recent migration from further East. RESULTS: Here, we report the analysis of 35 novel complete mtDNA sequences from India which refine the structure of Indian-specific varieties of haplogroup R. Detailed analysis of haplogroup R7, coupled with a survey of approximately 12,000 mtDNAs from caste and tribal groups over the entire Indian subcontinent, reveals that one of its more recently derived branches (R7a1), is particularly frequent among Munda-speaking tribal groups. This branch is nested within diverse R7 lineages found among Dravidian and Indo-European speakers of India. We have inferred from this that a subset of Munda-speaking groups have acquired R7 relatively recently. Furthermore, we find that the distribution of R7a1 within the Munda-speakers is largely restricted to one of the sub-branches (Kherwari) of northern Munda languages. This evidence does not support the hypothesis that the Austro-Asiatic speakers are the primary source of the R7 variation. Statistical analyses suggest a significant correlation between genetic variation and geography, rather than between genes and languages. CONCLUSION: Our high-resolution phylogeographic study, involving diverse linguistic groups in India, suggests that the high frequency of mtDNA haplogroup R7 among Munda speaking populations of India can be explained best by gene flow from linguistically different populations of Indian subcontinent. The conclusion is based on the observation that among Indo-Europeans, and particularly in Dravidians, the haplogroup is, despite its lower frequency, phylogenetically more divergent, while among the Munda speakers only one sub-clade of R7, i.e. R7a1, can be observed. It is noteworthy that though R7 is autochthonous to India, and arises from the root of hg R, its distribution and phylogeography in India is not uniform. This suggests the more ancient establishment of an autochthonous matrilineal genetic structure, and that isolation in the Pleistocene, lineage loss through drift, and endogamy of prehistoric and historic groups have greatly inhibited genetic homogenization and geographical uniformity.
Project description:BACKGROUND:Although our microbial community and genomes (the human microbiome) outnumber our genome by several orders of magnitude, to what extent the human host genetic complement informs the microbiota composition is not clear. The Human Microbiome Project (HMP) Consortium established a unique population-scale framework with which to characterize the relationship of microbial community structure with their human hosts. A wide variety of taxa and metabolic pathways have been shown to be differentially distributed by virtue of race/ethnicity in the HMP. Given that mtDNA haplogroups are the maternally derived ancestral genomic markers and mitochondria's role as the generator for cellular ATP, characterizing the relationship between human mtDNA genomic variants and microbiome profiles becomes of potential marked biologic and clinical interest. RESULTS:We leveraged sequencing data from the HMP to investigate the association between microbiome community structures with its own host mtDNA variants. 15 haplogroups and 631 mtDNA nucleotide polymorphisms (mean sequencing depth of 280X on the mitochondria genome) from 89 individuals participating in the HMP were accurately identified. 16S rRNA (V3-V5 region) sequencing generated microbiome taxonomy profiles and whole genome shotgun sequencing generated metabolic profiles from various body sites were treated as traits to conduct association analysis between haplogroups and host clinical metadata through linear regression. The mtSNPs of individuals with European haplogroups were associated with microbiome profiles using PLINK quantitative trait associations with permutation and adjusted for multiple comparisons. We observe that among 139 stool and 59 vaginal posterior fornix samples, several haplogroups show significant association with specific microbiota (q-value < 0.05) as well as their aggregate community structure (Chi-square with Monte Carlo, p < 0.005), which confirmed and expanded previous research on the association of race and ethnicity with microbiome profile. Our results further indicate that mtDNA variations may render different microbiome profiles, possibly through an inflammatory response to different levels of reactive oxygen species activity. CONCLUSIONS:These data provide initial evidence for the association between host ancestral genome with the structure of its microbiome.
Project description:Current mitochondrial DNA (mtDNA) haplogroup classification tools map reads to a single reference genome and perform inference based on the detected mutations to this reference. This approach biases haplogroup assignments towards the reference and prohibits accurate calculations of the uncertainty in assignment. We present HaploCart, a probabilistic mtDNA haplogroup classifier which uses a pangenomic reference graph framework together with principles of Bayesian inference. We demonstrate that our approach significantly outperforms available tools by being more robust to lower coverage or incomplete consensus sequences and producing phylogenetically-aware confidence scores that are unbiased towards any haplogroup. HaploCart is available both as a command-line tool and through a user-friendly web interface. The C++ program accepts as input consensus FASTA, FASTQ, or GAM files, and outputs a text file with the haplogroup assignments of the samples along with the level of confidence in the assignments. Our work considerably reduces the amount of data required to obtain a confident mitochondrial haplogroup assignment.
Project description:BackgroundThe phylogeny of the indigenous Indian-specific mitochondrial DNA (mtDNA) haplogroups have been determined and refined in previous reports. Similar to mtDNA superhaplogroups M and N, a profusion of reports are also available for superhaplogroup R. However, there is a dearth of information on South Asian subhaplogroups in particular, including R8. Therefore, we ought to access the genealogy and pre-historic expansion of haplogroup R8 which is considered one of the autochthonous lineages of South Asia.Methodology/principal findingsUpon screening the mtDNA of 5,836 individuals belonging to 104 distinct ethnic populations of the Indian subcontinent, we found 54 individuals with the HVS-I motif that defines the R8 haplogroup. Complete mtDNA sequencing of these 54 individuals revealed two deep-rooted subclades: R8a and R8b. Furthermore, these subclades split into several fine subclades. An isofrequency contour map detected the highest frequency of R8 in the state of Orissa. Spearman's rank correlation analysis suggests significant correlation of R8 occurrence with geography.Conclusions/significanceThe coalescent age of newly-characterized subclades of R8, R8a (15.4+/-7.2 Kya) and R8b (25.7+/-10.2 Kya) indicates that the initial maternal colonization of this haplogroup occurred during the middle and upper Paleolithic period, roughly around 40 to 45 Kya. These results signify that the southern part of Orissa currently inhabited by Munda speakers is likely the origin of these autochthonous maternal deep-rooted haplogroups. Our high-resolution study on the genesis of R8 haplogroup provides ample evidence of its deep-rooted ancestry among the Orissa (Austro-Asiatic) tribes.
Project description:The immune response plays a key role in the disease development of the organism, while immune function serves as an important indicator for animal models evaluation. The tree shrew (Tupaia belangeri chinensis), as a new laboratory animal with a close genetic relationship with primates, has been used to construct various disease models. However, the immune system of tree shrews, especially anatomical descriptions of lymph nodes, is still relatively unknown. In this study, a total of 16 different lymph nodes were identified, including superficial lymph nodes and deep lymph nodes. Superficial lymph nodes were located in the head and neck region (submandibular lymph node, parotid lymph node, deep and superficial cervical lymph nodes) and at the forelimb (axillary and accessory axillary lymph nodes, subscapular lymph node) and hindlimb (popliteal, sciatic, and inguinal lymph nodes). Deep lymph nodes comprise mediastinal lymph nodes located in thoracic cavity and abdominal lymph nodes that are mainly located in each mesentery (mesenteric, gastric, pancreatic-duodenal, renal lymph nodes) or along the major vessels (iliac lymph nodes). In addition, we described the spleen and thymus of the tree shrew, as well as two lymphoid tissues in the top wall of the nasal cavity and the oropharynx. This study mainly describes the tree shrew immune system from an anatomical and histopathological perspective and provides fundamental research references for the establishment of various animal models of tree shrews.
Project description:BACKGROUND: Recent genome-wide association studies searching for candidate susceptibility loci for common complex diseases such as type 2 diabetes mellitus (T2DM) and its common complications have uncovered novel disease-associated genes. Nevertheless these large-scale population screens often overlook the tremendous variation in the mitochondrial genome (mtDNA) and its involvement in complex disorders. RESULTS: We have analyzed the mitochondrial DNA (mtDNA) genetic variability in Ashkenazi (Ash), Sephardic (Seph) and North African (NAF) Jewish populations (total n = 1179). Our analysis showed significant differences (p < 0.001) in the distribution of mtDNA genetic backgrounds (haplogroups) among the studied populations. To test whether these differences alter the pattern of disease susceptibility, we have screened our three Jewish populations for an association of mtDNA genetic haplogroups with T2DM complications. Our results identified population-specific susceptibility factors of which the best example is the Ashkenazi Jewish specific haplogroup N1b1, having an apparent protective effect against T2DM complications in Ash (p = 0.006), being absent in the NAF population and under-represented in the Seph population. We have generated and analyzed whole mtDNA sequences from the disease associated haplogroups revealing mutations in highly conserved positions that are good candidates to explain the phenotypic effect of these genetic backgrounds. CONCLUSION: Our findings support the possibility that recent bottleneck events leading to over-representation of minor mtDNA alleles in specific genetic isolates, could result in population-specific susceptibility loci to complex disorders.
Project description:The association between a geographical region and an mtDNA haplogroup(s) has provided the basis for using mtDNA haplogroups to infer an individual's place of origin and genetic ancestry. Although it is well known that ancestry inferences using mtDNA haplogroups and those using genome-wide markers are frequently discrepant, little empirical information exists on the magnitude and scope of such discrepancies between multiple mtDNA haplogroups and worldwide populations. We compared genetic-ancestry inferences made by mtDNA-haplogroup membership to those made by autosomal SNPs in ∼940 samples of the Human Genome Diversity Panel and recently admixed populations from the 1000 Genomes Project. Continental-ancestry proportions often varied widely among individuals sharing the same mtDNA haplogroup. For only half of mtDNA haplogroups did the highest average continental-ancestry proportion match the highest continental-ancestry proportion of a majority of individuals with that haplogroup. Prediction of an individual's mtDNA haplogroup from his or her continental-ancestry proportions was often incorrect. Collectively, these results indicate that for most individuals in the worldwide populations sampled, mtDNA-haplogroup membership provides limited information about either continental ancestry or continental region of origin.
Project description:The crystal proteins of Bacillus thuringiensis have been extensively studied because of their pesticidal properties and their high natural levels of production. The increasingly rapid characterization of new crystal protein genes, triggered by an effort to discover proteins with new pesticidal properties, has resulted in a variety of sequences and activities that no longer fit the original nomenclature system proposed in 1989. Bacillus thuringiensis pesticidal crystal protein (Cry and Cyt) nomenclature was initially based on insecticidal activity for the primary ranking criterion. Many exceptions to this systematic arrangement have become apparent, however, making the nomenclature system inconsistent. Additionally, the original nomenclature, with four activity-based primary ranks for 13 genes, did not anticipate the current 73 holotype sequences that form many more than the original four subgroups. A new nomenclature, based on hierarchical clustering using amino acid sequence identity, is proposed. Roman numerals have been exchanged for Arabic numerals in the primary rank (e.g., Cry1Aa) to better accommodate the large number of expected new sequences. In this proposal, 133 crystal proteins comprising 24 primary ranks are systematically arranged.
Project description:Mutations of both nuclear and mitochondrial DNA (mtDNA)-encoded mitochondrial proteins can cause cardiomyopathy associated with mitochondrial dysfunction. Hence, the cardiac phenotype of nuclear DNA mitochondrial mutations might be modulated by mtDNA variation. We studied a 13-generation Mennonite pedigree with autosomal recessive myopathy and cardiomyopathy due to an SLC25A4 frameshift null mutation (c.523delC, p.Q175RfsX38), which codes for the heart-muscle isoform of the adenine nucleotide translocator-1. Ten homozygous null (adenine nucleotide translocator-1(-/-)) patients monitored over a median of 6 years had a phenotype of progressive myocardial thickening, hyperalaninemia, lactic acidosis, exercise intolerance, and persistent adrenergic activation. Electrocardiography and echocardiography with velocity vector imaging revealed abnormal contractile mechanics, myocardial repolarization abnormalities, and impaired left ventricular relaxation. End-stage heart disease was characterized by massive, symmetric, concentric cardiac hypertrophy; widespread cardiomyocyte degeneration; overabundant and structurally abnormal mitochondria; extensive subendocardial interstitial fibrosis; and marked hypertrophy of arteriolar smooth muscle. Substantial variability in the progression and severity of heart disease segregated with maternal lineage, and sequencing of mtDNA from five maternal lineages revealed two major European haplogroups, U and H. Patients with the haplogroup U mtDNAs had more rapid and severe cardiomyopathy than those with haplogroup H.
Project description:BackgroundAurochs (Bos primigenius) were distributed throughout large parts of Eurasia and Northern Africa during the late Pleistocene and the early Holocene, and all modern cattle are derived from the aurochs. Although the mtDNA haplogroups of most modern cattle belong to haplogroups T and I, several additional haplogroups (P, Q, R, C and E) have been identified in modern cattle and aurochs. Haplogroup P was the most common haplogroup in European aurochs, but so far, it has been identified in only three of >3,000 submitted haplotypes of modern Asian cattle.MethodologyWe sequenced the complete mtDNA D-loop region of 181 Japanese Shorthorn cattle and analyzed these together with representative bovine mtDNA sequences. The haplotype P of Japanese Shorthorn cattle was analyzed along with that of 36 previously published European aurochs and three modern Asian cattle sequences using the hypervariable 410 bp of the D-loop region.ConclusionsWe detected the mtDNA haplogroup P in Japanese Shorthorn cattle with an extremely high frequency (83/181). Phylogenetic networks revealed two main clusters, designated as Pa for haplogroup P in European aurochs and Pc in modern Asian cattle. We also report the genetic diversity of haplogroup P compared with the sequences of extinct aurochs. No shared haplotypes are observed between the European aurochs and the modern Asian cattle. This finding suggests the possibility of local and secondary introgression events of haplogroup P in northeast Asian cattle, and will contribute to a better understanding of its origin and genetic diversity.