Project description:Sequencing the coding regions, the exome, of the human genome is one of the major current strategies to identify low frequency and rare variants associated with human disease traits. So far, the most widely used commercial exome capture reagents have mainly targeted the consensus coding sequence (CCDS) database. We report the design of an extended set of targets for capturing the complete human exome, based on annotation from the GENCODE consortium. The extended set covers an additional 5594 genes and 10.3 Mb compared with the current CCDS-based sets. The additional regions include potential disease genes previously inaccessible to exome resequencing studies, such as 43 genes linked to ion channel activity and 70 genes linked to protein kinase activity. In total, the new GENCODE exome set developed here covers 47.9 Mb and performed well in sequence capture experiments. In the sample set used in this study, we identified over 5000 SNP variants more in the GENCODE exome target (24%) than in the CCDS-based exome sequencing.
Project description:ObjectivesCerebral palsy (CP) is the most common childhood motor disability, yet its link to single-gene disorders is under-characterized. To explore the genetic landscape of CP, we conducted whole exome sequencing (WES) in a cohort of patients with CP.MethodsWe performed comprehensive phenotyping and WES on a prospective cohort of individuals with cryptogenic CP (who meet criteria for CP; have no risk factors), non-cryptogenic CP (who meet criteria for CP; have at least one risk factor), and CP masqueraders (who could be diagnosed with CP, but have regression/progressive symptoms). We characterized motor phenotypes, ascertained medical comorbidities, and classified brain MRIs. We analyzed WES data using an institutional pipeline.ResultsWe included 50 probands in this analysis (20 females, 30 males). Twenty-four had cryptogenic CP, 20 had non-cryptogenic CP, five had CP masquerader classification, and one had unknown classification. Hypotonic-ataxic subtype showed a difference in prevalence across the classification groups (p = 0.01). Twenty-six percent of participants (13/50) had a pathogenic/likely pathogenic variant in 13 unique genes (ECHS1, SATB2, ZMYM2, ADAT3, COL4A1, THOC2, SLC16A2, SPAST, POLR2A, GNAO1, PDHX, ACADM, ATL1), including one patient with two genetic disorders (ACADM, PDHX) and two patients with a SPAST-related disorder. The CP masquerader category had the highest diagnostic yield (n = 3/5, 60%), followed by the cryptogenic CP category (n = 7/24, 29%). Fifteen percent of patients with non-cryptogenic CP (n = 3/20) had a Mendelian disorder on WES.InterpretationWES demonstrated a significant prevalence of Mendelian disorders in individuals clinically diagnosed with CP, including in individuals with known CP risk factors.
Project description:ImportanceCerebral palsy is a common neurodevelopmental disorder affecting movement and posture that often co-occurs with other neurodevelopmental disorders. Individual cases of cerebral palsy are often attributed to birth asphyxia; however, recent studies indicate that asphyxia accounts for less than 10% of cerebral palsy cases.ObjectiveTo determine the molecular diagnostic yield of exome sequencing (prevalence of pathogenic and likely pathogenic variants) in individuals with cerebral palsy.Design, setting, and participantsA retrospective cohort study of patients with cerebral palsy that included a clinical laboratory referral cohort with data accrued between 2012 and 2018 and a health care-based cohort with data accrued between 2007 and 2017.ExposuresExome sequencing with copy number variant detection.Main outcomes and measuresThe primary outcome was the molecular diagnostic yield of exome sequencing.ResultsAmong 1345 patients from the clinical laboratory referral cohort, the median age was 8.8 years (interquartile range, 4.4-14.7 years; range, 0.1-66 years) and 601 (45%) were female. Among 181 patients in the health care-based cohort, the median age was 41.9 years (interquartile range, 28.0-59.6 years; range, 4.8-89 years) and 96 (53%) were female. The molecular diagnostic yield of exome sequencing was 32.7% (95% CI, 30.2%-35.2%) in the clinical laboratory referral cohort and 10.5% (95% CI, 6.0%-15.0%) in the health care-based cohort. The molecular diagnostic yield ranged from 11.2% (95% CI, 6.4%-16.2%) for patients without intellectual disability, epilepsy, or autism spectrum disorder to 32.9% (95% CI, 25.7%-40.1%) for patients with all 3 comorbidities. Pathogenic and likely pathogenic variants were identified in 229 genes (29.5% of 1526 patients); 86 genes were mutated in 2 or more patients (20.1% of 1526 patients) and 10 genes with mutations were independently identified in both cohorts (2.9% of 1526 patients).Conclusions and relevanceAmong 2 cohorts of patients with cerebral palsy who underwent exome sequencing, the prevalence of pathogenic and likely pathogenic variants was 32.7% in a cohort that predominantly consisted of pediatric patients and 10.5% in a cohort that predominantly consisted of adult patients. Further research is needed to understand the clinical implications of these findings.
Project description:Over the next few years, the efficient use of next-generation sequencing (NGS) in human genetics research will depend heavily upon the effective mechanisms for the selective enrichment of genomic regions of interest. Recently, comprehensive exome capture arrays have become available for targeting approximately 33 Mb or approximately 180,000 coding exons across the human genome. Selective genomic enrichment of the human exome offers an attractive option for new experimental designs aiming to quickly identify potential disease-associated genetic variants, especially in family-based studies. We have evaluated a 2.1 M feature human exome capture array on eight individuals from a three-generation family pedigree. We were able to cover up to 98% of the targeted bases at a long-read sequence read depth of > or = 3, 86% at a read depth of > or = 10, and over 50% of all targets were covered with > or = 20 reads. We identified up to 14,284 SNPs and small indels per individual exome, with up to 1,679 of these representing putative novel polymorphisms. Applying the conservative genotype calling approach HCDiff, the average rate of detection of a variant allele based on Illumina 1 M BeadChips genotypes was 95.2% at > or = 10x sequence. Further, we propose an advantageous genotype calling strategy for low covered targets that empirically determines cut-off thresholds at a given coverage depth based on existing genotype data. Application of this method was able to detect >99% of SNPs covered > or = 8x. Our results offer guidance for "real-world" applications in human genetics and provide further evidence that microarray-based exome capture is an efficient and reliable method to enrich for chromosomal regions of interest in next-generation sequencing experiments.
Project description:We used a RainDance Technologies (RDT) expanded content library to enrich the human X chromosome exome (2.5 Mb) from 26 male samples followed by Illumina sequencing. Our multiplex primer library covered 98.05% of the human X chromosome exome in a single tube with 11,845 different PCR amplicons. Illumina sequencing of 24 male samples showed coverage for 97% of the targeted sequences. Sequence from 2 HapMap samples confirmed missing data rates of 2-3% at sites successfully typed by the HapMap project, with an accuracy of at least ~99.5% as compared to reported HapMap genotypes. Our demonstration that a RDT expanded content library can efficiently enrich and enable the routine sequencing of the human X chromosome exome suggests a wide variety of potential research and clinical applications for this platform.
Project description:Genomic technologies, such as whole-exome sequencing, are a powerful tool in genetic research. Such testing yields a great deal of incidental medical information, or medical information not related to the primary research target. We describe the management of incidental medical information derived from whole-exome sequencing in the research context. We performed whole-exome sequencing on a monozygotic twin pair in which only 1 child was affected with congenital anomalies and applied an institutional review board-approved algorithm to determine what genetic information would be returned. Whole-exome sequencing identified 79525 genetic variants in the twins. Here, we focus on novel variants. After filtering artifacts and excluding known single nucleotide polymorphisms and variants not predicted to be pathogenic, the twins had 32 novel variants in 32 genes that were felt to be likely to be associated with human disease. Eighteen of these novel variants were associated with recessive disease and 18 were associated with dominantly manifesting conditions (variants in some genes were potentially associated with both recessive and dominant conditions), but only 1 variant ultimately met our institutional review board-approved criteria for return of information to the research participants.
Project description:Next generation sequencing can be used to search for Mendelian disease genes in an unbiased manner by sequencing the entire protein-coding sequence, known as the exome, or even the entire human genome. Identifying the pathogenic mutation amongst thousands to millions of genomic variants is a major challenge, and novel variant prioritization strategies are required. The choice of these strategies depends on the availability of well-phenotyped patients and family members, the mode of inheritance, the severity of the disease and its population frequency. In this review, we discuss the current strategies for Mendelian disease gene identification by exome resequencing. We conclude that exome strategies are successful and identify new Mendelian disease genes in approximately 60% of the projects. Improvements in bioinformatics as well as in sequencing technology will likely increase the success rate even further. Exome sequencing is likely to become the most commonly used tool for Mendelian disease gene identification for the coming years.
Project description:Germline mutations in FASL and FAS impair Fas-dependent apoptosis and cause recessively or dominantly inherited autoimmune lymphoproliferative syndrome (ALPS). Patients with ALPS typically present with no other clinical phenotype. We investigated a large, consanguineous, multiplex kindred in which biological features of ALPS were found in the context of severe bacterial and viral disease, recurrent hepatopathy and encephalopathy, and cardiac malformations. By a combination of genome-wide linkage and whole-exome sequencing, we identified a homozygous missense mutation in FADD, encoding the Fas-associated death domain protein (FADD), in the patients. This FADD mutation decreases steady-state protein levels and impairs Fas-dependent apoptosis in vitro, accounting for biological ALPS phenotypes in vivo. It also impairs Fas-independent signaling pathways. The observed bacterial infections result partly from functional hyposplenism, and viral infections result from impaired interferon immunity. We describe here a complex clinical disorder, its genetic basis, and some of the key mechanisms underlying its pathogenesis. Our findings highlight the key role of FADD in Fas-dependent and Fas-independent signaling pathways in humans.