Genomic linkage map of the human blood fluke Schistosoma mansoni.
ABSTRACT: Schistosoma mansoni is a blood fluke that infects approximately 90 million people. The complete life cycle of this parasite can be maintained in the laboratory, making this one of the few experimentally tractable human helminth infections, and a rich literature reveals heritable variation in important biomedical traits such as virulence, host-specificity, transmission and drug resistance. However, there is a current lack of tools needed to study S. mansoni's molecular, quantitative, and population genetics. Our goal was to construct a genetic linkage map for S. mansoni, and thus provide a new resource that will help stimulate research on this neglected pathogen.We genotyped grandparents, parents and 88 progeny to construct a 5.6 cM linkage map containing 243 microsatellites positioned on 203 of the largest scaffolds in the genome sequence. The map allows 70% of the estimated 300 Mb genome to be ordered on chromosomes, and highlights where scaffolds have been incorrectly assembled. The markers fall into eight main linkage groups, consistent with seven pairs of autosomes and one pair of sex chromosomes, and we were able to anchor linkage groups to chromosomes using fluorescent in situ hybridization. The genome measures 1,228.6 cM. Marker segregation reveals higher female recombination, confirms ZW inheritance patterns, and identifies recombination hotspots and regions of segregation distortion.The genetic linkage map presented here is the first for S. mansoni and the first for a species in the phylum Platyhelminthes. The map provides the critical tool necessary for quantitative genetic analysis, aids genome assembly, and furnishes a framework for comparative flatworm genomics and field-based molecular epidemiological studies.
Project description:Construction of high-density genetic linkage maps is crucially important for quantitative trait loci (QTL) studies, and they are more useful when integrated with physical maps. Such integrated maps are valuable genome resources for fine mapping of QTL, comparative genomics, and accurate and efficient whole-genome assembly. Previously, we established both linkage maps and a physical map for channel catfish, Ictalurus punctatus, the dominant aquaculture species in the United States. Here we added 2030 BAC end sequence (BES)-derived microsatellites from 1481 physical map contigs, as well as markers from singleton BES, ESTs, anonymous microsatellites, and SNPs, to construct a second-generation linkage map. Average marker density across the 29 linkage groups reached 1.4 cM/marker. The increased marker density highlighted variations in recombination rates within and among catfish chromosomes. This work effectively anchored 44.8% of the catfish BAC physical map contigs, covering ~52.8% of the genome. The genome size was estimated to be 2546 cM on the linkage map, and the calculated physical distance per centimorgan was 393 Kb. This integrated map should enable comparative studies with teleost model species as well as provide a framework for ordering and assembling whole-genome scaffolds.
Project description:SNPs are the most abundant polymorphism type, and have been explored in many crop genomic studies, including rice and maize. SNP discovery in allotetraploid cotton genomes has lagged behind that of other crops due to their complexity and polyploidy. In this study, genome-wide SNPs are detected systematically using next-generation sequencing and efficient SNP genotyping methods, and used to construct a linkage map and characterize the structural variations in polyploid cotton genomes.We construct an ultra-dense inter-specific genetic map comprising 4,999,048 SNP loci distributed unevenly in 26 allotetraploid cotton linkage groups and covering 4,042 cM. The map is used to order tetraploid cotton genome scaffolds for accurate assembly of G. hirsutum acc. TM-1. Recombination rates and hotspots are identified across the cotton genome by comparing the assembled draft sequence and the genetic map. Using this map, genome rearrangements and centromeric regions are identified in tetraploid cotton by combining information from the publicly-available G. raimondii genome with fluorescent in situ hybridization analysis.We report the genotype-by-sequencing method used to identify millions of SNPs between G. hirsutum and G. barbadense. We construct and use an ultra-dense SNP map to correct sequence mis-assemblies, merge scaffolds into pseudomolecules corresponding to chromosomes, detect genome rearrangements, and identify centromeric regions in allotetraploid cottons. We find that the centromeric retro-element sequence of tetraploid cotton derived from the D subgenome progenitor might have invaded the A subgenome centromeres after allotetrapolyploid formation. This study serves as a valuable genomic resource for genetic research and breeding of cotton.
Project description:BACKGROUND: Cotton, with a large genome, is an important crop throughout the world. A high-density genetic linkage map is the prerequisite for cotton genetics and breeding. A genetic map based on simple polymerase chain reaction markers will be efficient for marker-assisted breeding in cotton, and markers from transcribed sequences have more chance to target genes related to traits. To construct a genome-wide, functional marker-based genetic linkage map in cotton, we isolated and mapped expressed sequence tag-simple sequence repeats (EST-SSRs) from cotton ESTs derived from the A1, D5, (AD)1, and (AD)2 genome. RESULTS: A total of 3177 new EST-SSRs developed in our laboratory and other newly released SSRs were used to enrich our interspecific BC1 genetic linkage map. A total of 547 loci and 911 loci were obtained from our EST-SSRs and the newly released SSRs, respectively. The 1458 loci together with our previously published data were used to construct an updated genetic linkage map. The final map included 2316 loci on the 26 cotton chromosomes, 4418.9 cM in total length and 1.91 cM in average distance between adjacent markers. To our knowledge, this map is one of the three most dense linkage maps in cotton. Twenty-one segregation distortion regions (SDRs) were found in this map; three segregation distorted chromosomes, Chr02, Chr16, and Chr18, were identified with 99.9% of distorted markers segregating toward the heterozygous allele. Functional analysis of SSR sequences showed that 1633 loci of this map (70.6%) were transcribed loci and 1332 loci (57.5%) were translated loci. CONCLUSIONS: This map lays groundwork for further genetic analyses of important quantitative traits, marker-assisted selection, and genome organization architecture in cotton as well as for comparative genomics between cotton and other species. The segregation distorted chromosomes can be a guide to identify segregation distortion loci in cotton. The annotation of SSR sequences identified frequent and rare gene ontology items on each chromosome, which is helpful to discover functions of cotton chromosomes.
Project description:High-density genetic maps are useful to precisely localize QTL or genes that might be used to improve traits of nutritional and/or economical importance in crops. However, high-density genetic maps are lacking for most wild relatives of crop species, including wheat. Aegilops umbellulata is a wild relative of wheat known for its potential as a source of biotic and abiotic stress resistance genes. In this work, we have developed a framework consensus genetic map using two biparental populations derived from accessions PI 298905, PI 542369, PI 5422375, and PI 554395. The framework map comprised 3009 genotype-by-sequence SNPs with a total map size of 948.72 cM. On average, there were three SNPs per centimorgan for each chromosome. Chromosome 1U was the shortest (66.5 cM), with only 81 SNPs, whereas the remaining chromosomes had between 391 and 591 SNP markers. A total of 2395 unmapped SNPs were added to the linkage maps through a recombination frequency approach, and increased the number of SNPs placed on the consensus map to a total of 5404 markers. Segregation distortion was disproportionally high for chromosome 1U for both populations used to construct component linkage maps, and thus segregation distortion could be one of the probable reasons for the exceptionally reduced linkage size for chromosome 1U. From comparative analysis, Aeumbellulata chromosomes except 4U showed moderate to strong collinearity with corresponding homeologous chromosomes of hexaploid wheat and barley. The present consensus map may serve as a reference map in QTL mapping and validation projects, and also in genome assembly to develop a reference genome sequence for Ae. umbellulata.
Project description:BACKGROUND: Brassica rapa is an economically important crop and a model plant for studies concerning polyploidization and the evolution of extreme morphology. The multinational B. rapa Genome Sequencing Project (BrGSP) was launched in 2003. In 2008, next generation sequencing technology was used to sequence the B. rapa genome. Several maps concerning B. rapa pseudochromosome assembly have been published but their coverage of the genome is incomplete, anchoring approximately 73.6% of the scaffolds on to chromosomes. Therefore, a new genetic map to aid pseudochromosome assembly is required. RESULTS: This study concerns the construction of a reference genetic linkage map for Brassica rapa, forming the backbone for anchoring sequence scaffolds of the B. rapa genome resulting from recent sequencing efforts. One hundred and nineteen doubled haploid (DH) lines derived from microspore cultures of an F1 cross between a Chinese cabbage (B. rapa ssp. pekinensis) DH line (Z16) and a rapid cycling inbred line (L144) were used to construct the linkage map. PCR-based insertion/deletion (InDel) markers were developed by re-sequencing the two parental lines. The map comprises a total of 507 markers including 415 InDels and 92 SSRs. Alignment and orientation using SSR markers in common with existing B. rapa linkage maps allowed ten linkage groups to be identified, designated A01-A10. The total length of the linkage map was 1234.2 cM, with an average distance of 2.43 cM between adjacent marker loci. The lengths of linkage groups ranged from 71.5 cM to 188.5 cM for A08 and A09, respectively. Using the developed linkage map, 152 scaffolds were anchored on to the chromosomes, encompassing more than 82.9% of the B. rapa genome. Taken together with the previously available linkage maps, 183 scaffolds were anchored on to the chromosomes and the total coverage of the genome was 88.9%. CONCLUSIONS: The development of this linkage map is vital for the integration of genome sequences and genetic information, and provides a useful resource for the international Brassica research community.
Project description:BACKGROUND: Introgressive hybridization is an important evolutionary process that can lead to the creation of novel genome structures and thus potentially new genetic variation for selection to act upon. On the other hand, hybridization with introduced species can threaten native species, such as cutthroat trout (Oncorhynchus clarkii) following the introduction of rainbow trout (O. mykiss). Neither the evolutionary consequences nor conservation implications of rainbow trout introgression in cutthroat trout is well understood. Therefore, we generated a genetic linkage map for rainbow-Yellowstone cutthroat trout (O. clarkii bouvieri) hybrids to evaluate genome processes that may help explain how introgression affects hybrid genome evolution. RESULTS: The hybrid map closely aligned with the rainbow trout map (a cutthroat trout map does not exist), sharing all but one linkage group. This linkage group (RYHyb20) represented a fusion between an acrocentric (Omy28) and a metacentric chromosome (Omy20) in rainbow trout. Additional mapping in Yellowstone cutthroat trout indicated the two rainbow trout homologues were fused in the Yellowstone genome. Variation in the number of hybrid linkage groups (28 or 29) likely depended on a Robertsonian rearrangement polymorphism within the rainbow trout stock. Comparison between the female-merged F? map and a female consensus rainbow trout map revealed that introgression suppressed recombination across large genomic regions in 5 hybrid linkage groups. Two of these linkage groups (RYHyb20 and RYHyb25_29) contained confirmed chromosome rearrangements between rainbow and Yellowstone cutthroat trout indicating that rearrangements may suppress recombination. The frequency of allelic and genotypic segregation distortion varied among parents and families, suggesting few incompatibilities exist between rainbow and Yellowstone cutthroat trout genomes. CONCLUSIONS: Chromosome rearrangements suppressed recombination in the hybrids. This result supports several previous findings demonstrating that recombination suppression restricts gene flow between chromosomes that differ by arrangement. Conservation of synteny and map order between the hybrid and rainbow trout maps and minimal segregation distortion in the hybrids suggest rainbow and Yellowstone cutthroat trout genomes freely introgress across chromosomes with similar arrangement. Taken together, these results suggest that rearrangements impede introgression. Recombination suppression across rearrangements could enable large portions of non-recombined chromosomes to persist within admixed populations.
Project description:Two independent genome projects for the honey bee, a microsatellite linkage map and a genome sequence assembly, interactively produced an almost complete organization of the euchromatic genome. Assembly 4.0 now includes 626 scaffolds that were ordered and oriented into chromosomes according to the framework provided by the third-generation linkage map (AmelMap3). Each construct was used to control the quality of the other. The co-linearity of markers in the sequence and the map is almost perfect and argues in favor of the high quality of both.
Project description:High density genetic maps are a reliable tool for genetic dissection of complex plant traits. Mapping resolution is often hampered by the variable crossover and non-crossover events occurring across the genome, with pericentromeric regions (pCENR) showing highly suppressed recombination rates. The efficiency of linkage mapping can further be improved by characterizing and understanding the distribution of recombinational activity along individual chromosomes. In order to evaluate the genome wide recombination rate in common beans (Phaseolus vulgaris L.) we developed a SNP-based linkage map using the genotype-by-sequencing approach with a 188 recombinant inbred line family generated from an inter gene pool cross (Andean x Mesoamerican). We identified 1,112 SNPs that were subsequently used to construct a robust linkage map with 11 groups, comprising 513 recombinationally unique marker loci spanning 943 cM (LOD 3.0). Comparative analysis showed that the linkage map spanned >95% of the physical map, indicating that the map is almost saturated. Evaluation of genome-wide recombination rate indicated that at least 45% of the genome is highly recombinationally suppressed, and allowed us to estimate locations of pCENRs. We observed an average recombination rate of 0.25 cM/Mb in pCENRs as compared to the rest of genome that showed 3.72 cM/Mb. However, several hot spots of recombination were also detected with recombination rates reaching as high as 34 cM/Mb. Hotspots were mostly found towards the end of chromosomes, which also happened to be gene-rich regions. Analyzing relationships between linkage and physical map indicated a punctuated distribution of recombinational hot spots across the genome.
Project description:Detailed linkage and recombination rate maps are necessary to use the full potential of genome sequencing and population genomic analyses. We used a custom collared flycatcher 50 K SNP array to develop a high-density linkage map with 37 262 markers assigned to 34 linkage groups in 33 autosomes and the Z chromosome. The best-order map contained 4215 markers, with a total distance of 3132 cM and a mean genetic distance between markers of 0.12 cM. Facilitated by the array being designed to include markers from most scaffolds, we obtained a second-generation assembly of the flycatcher genome that approaches full chromosome sequences (N50 super-scaffold size 20.2 Mb and with 1.042 Gb (of 1.116 Gb) anchored to and mostly ordered and oriented along chromosomes). We found that flycatcher and zebra finch chromosomes are entirely syntenic but that inversions at mean rates of 1.5-2.0 event (6.6-7.5 Mb) per My have changed the organization within chromosomes, rates high enough for inversions to potentially have been involved with many speciation events during avian evolution. The mean recombination rate was 3.1 cM/Mb and correlated closely with chromosome size, from 2 cM/Mb for chromosomes >100 Mb to >10 cM/Mb for chromosomes <10 Mb. This size dependence seemed entirely due to an obligate recombination event per chromosome; if 50 cM was subtracted from the genetic lengths of chromosomes, the rate per physical unit DNA was constant across chromosomes. Flycatcher recombination rate showed similar variation along chromosomes as chicken but lacked the large interior recombination deserts characteristic of zebra finch chromosomes.
Project description:The genome of potato, a major global food crop, was recently sequenced. The work presented here details the integration of the potato reference genome (DM) with a new sequence-tagged site marker-based linkage map and other physical and genetic maps of potato and the closely related species tomato. Primary anchoring of the DM genome assembly was accomplished by the use of a diploid segregating population, which was genotyped with several types of molecular genetic markers to construct a new ~936 cM linkage map comprising 2469 marker loci. In silico anchoring approaches used genetic and physical maps from the diploid potato genotype RH89-039-16 (RH) and tomato. This combined approach has allowed 951 superscaffolds to be ordered into pseudomolecules corresponding to the 12 potato chromosomes. These pseudomolecules represent 674 Mb (~93%) of the 723 Mb genome assembly and 37,482 (~96%) of the 39,031 predicted genes. The superscaffold order and orientation within the pseudomolecules are closely collinear with independently constructed high density linkage maps. Comparisons between marker distribution and physical location reveal regions of greater and lesser recombination, as well as regions exhibiting significant segregation distortion. The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal "pseudomolecules".