Concordance between whole-exome sequencing and clinical Sanger sequencing: implications for patient care.
Ontology highlight
ABSTRACT: The clinical translation of next-generation sequencing has created a paradigm shift in the diagnostic assessment of individuals with suspected rare genetic diseases. Whole-exome sequencing (WES) simultaneously examines the majority of the coding portion of the genome and is rapidly becoming accepted as an efficient alternative to clinical Sanger sequencing for diagnosing genetically heterogeneous disorders. Among reports of the clinical and diagnostic utility of WES, few studies to date have directly compared its concordance to Sanger sequencing, which is considered the clinical "gold standard". We performed a direct comparison of 391 coding and noncoding polymorphisms and variants of unknown significance identified by clinical Sanger sequencing to the WES results of 26 patients. Of the 150 well-covered coding variants identified by Sanger sequencing, 146 (97.3%) were also reported by WES. Nine genes were excluded from the comparison due to consistently low coverage in WES, which might be attributed to the use of older exome capture kits. We performed confirmatory Sanger sequencing of discordant variants; including five variants with discordant bases and four with discordant zygosity. Confirmatory Sanger sequencing supported the original Sanger report for three of the five discordant bases, one was shown to be a false positive supporting the WES data, and one result differed from both the Sanger and WES data. Two of the discordant zygosity results supported Sanger and the other two supported WES data. We report high concordance for well-covered coding variants, supporting the use of WES as a screening tool for heterogeneous disorders, and recommend the use of supplementary Sanger sequencing for poorly-covered genes when the clinical suspicion is high. Importantly, despite remaining difficulties with achieving complete coverage of the whole exome, 10 (38.5%) of the 26 compared patients were diagnosed through WES.
Project description:Next generation sequencing of circulating tumor DNA (ctDNA) has been used as a noninvasive alternative for cancer diagnosis and characterization of tumor mutational landscape. However, low ctDNA fraction and other factors can limit the ability of ctDNA analysis to capture tumor-specific and actionable variants. In this study, whole-exome sequencings (WES) were performed on paired ctDNA and tumor biopsy in 15 cancer patients to assess the extent of concordance between mutational profiles derived from the two source materials. We found that up to 16.4% ctDNA fraction can still be insufficient for detecting tumor-specific variants and that good concordance with tumor biopsy is consistently achieved at higher ctDNA fractions. Most importantly, ctDNA analysis can consistently capture tumor heterogeneity and detect key cancer-related genes even in a patient with both primary and metastatic tumors.
Project description:PurposeTriple-negative breast cancer (TNBC) is a molecularly complex and heterogeneous breast cancer subtype with distinct biological features and clinical behavior. Although TNBC is associated with an increased risk of metastasis and recurrence, the molecular mechanisms underlying TNBC metastasis remain unclear. We performed whole-exome sequencing (WES) analysis of primary TNBC and paired recurrent tumors to investigate the genetic profile of TNBC.MethodsGenomic DNA extracted from 35 formalin-fixed paraffin-embedded tissue samples from 26 TNBC patients was subjected to WES. Of these, 15 were primary tumors that did not have recurrence, and 11 were primary tumors that had recurrence (nine paired primary and recurrent tumors). Tumors were analyzed for single-nucleotide variants and insertions/deletions.ResultsThe tumor mutational burden (TMB) was 7.6 variants/megabase in primary tumors that recurred (n = 9); 8.2 variants/megabase in corresponding recurrent tumors (n = 9); and 7.3 variants/megabase in primary tumors that did not recur (n = 15). MUC3A was the most frequently mutated gene in all groups. Mutations in MAP3K1 and MUC16 were more common in our dataset. No alterations in PI3KCA were detected in our dataset.ConclusionsWe found similar mutational profiles between primary and paired recurrent tumors, suggesting that genomic features may be retained during local recurrence.
Project description:BackgroundWhole exome sequencing (WES) is widely adopted in clinical and research settings; however, one of the practical concerns is the potential false negatives due to incomplete breadth and depth of coverage for several exons in clinically implicated genes. In some cases, a targeted gene panel testing may be a dependable option to ascertain true negatives for genomic variants in known disease-associated genes. We developed a web-based tool to quickly gauge whether all genes of interest would be reliably covered by WES or whether targeted gene panel testing should be considered instead to minimize false negatives in candidate genes.ResultsWEScover is a novel web application that provides an intuitive user interface for discovering breadth and depth of coverage across population-scale WES datasets, searching either by phenotype, by targeted gene panel(s) or by gene(s). Moreover, the application shows metrics from the Genome Aggregation Database to provide gene-centric view on breadth of coverage.ConclusionsWEScover allows users to efficiently query genes and phenotypes for the coverage of associated exons by WES and recommends use of panel tests for the genes with potential incomplete coverage by WES.
Project description:A patient-derived epithelium-only colon rectal organoid, also referred to as a colonoid, was generated from an adenoma (associated with a resection surgery of an invasive moderately differentiated colorectal adenocarcinoma) as part of the development of an on-going organoid biobank at the Michigan Medicine Translational Tissue Modeling Laboratory (TTML, www.UmichTTML.org). The genomic variant signature of this adenoma colonoid was characterized using whole exome sequencing in order to access genomic concordance between the source patient tissue (adenoma and histologically normal tissue 10 cm from lesion) and the in vitro culture, as well as to access genomic stability of the culture over time at 2 and 6.5 months in culture.
Project description:Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
Project description:BackgroundTo facilitate the clinical implementation of genomic medicine by next-generation sequencing, it will be critically important to obtain accurate and consistent variant calls on personal genomes. Multiple software tools for variant calling are available, but it is unclear how comparable these tools are or what their relative merits in real-world scenarios might be.MethodsWe sequenced 15 exomes from four families using commercial kits (Illumina HiSeq 2000 platform and Agilent SureSelect version 2 capture kit), with approximately 120X mean coverage. We analyzed the raw data using near-default parameters with five different alignment and variant-calling pipelines (SOAP, BWA-GATK, BWA-SNVer, GNUMAP, and BWA-SAMtools). We additionally sequenced a single whole genome using the sequencing and analysis pipeline from Complete Genomics (CG), with 95% of the exome region being covered by 20 or more reads per base. Finally, we validated 919 single-nucleotide variations (SNVs) and 841 insertions and deletions (indels), including similar fractions of GATK-only, SOAP-only, and shared calls, on the MiSeq platform by amplicon sequencing with approximately 5000X mean coverage.ResultsSNV concordance between five Illumina pipelines across all 15 exomes was 57.4%, while 0.5 to 5.1% of variants were called as unique to each pipeline. Indel concordance was only 26.8% between three indel-calling pipelines, even after left-normalizing and intervalizing genomic coordinates by 20 base pairs. There were 11% of CG variants falling within targeted regions in exome sequencing that were not called by any of the Illumina-based exome analysis pipelines. Based on targeted amplicon sequencing on the MiSeq platform, 97.1%, 60.2%, and 99.1% of the GATK-only, SOAP-only and shared SNVs could be validated, but only 54.0%, 44.6%, and 78.1% of the GATK-only, SOAP-only and shared indels could be validated. Additionally, our analysis of two families (one with four individuals and the other with seven), demonstrated additional accuracy gained in variant discovery by having access to genetic data from a multi-generational family.ConclusionsOur results suggest that more caution should be exercised in genomic medicine settings when analyzing individual genomes, including interpreting positive and negative findings with scrutiny, especially for indels. We advocate for renewed collection and sequencing of multi-generational families to increase the overall accuracy of whole genomes.
Project description:Genomic technologies, such as whole-exome sequencing, are a powerful tool in genetic research. Such testing yields a great deal of incidental medical information, or medical information not related to the primary research target. We describe the management of incidental medical information derived from whole-exome sequencing in the research context. We performed whole-exome sequencing on a monozygotic twin pair in which only 1 child was affected with congenital anomalies and applied an institutional review board-approved algorithm to determine what genetic information would be returned. Whole-exome sequencing identified 79525 genetic variants in the twins. Here, we focus on novel variants. After filtering artifacts and excluding known single nucleotide polymorphisms and variants not predicted to be pathogenic, the twins had 32 novel variants in 32 genes that were felt to be likely to be associated with human disease. Eighteen of these novel variants were associated with recessive disease and 18 were associated with dominantly manifesting conditions (variants in some genes were potentially associated with both recessive and dominant conditions), but only 1 variant ultimately met our institutional review board-approved criteria for return of information to the research participants.
Project description:As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available.
Project description:BackgroundClinical diagnostic whole-exome sequencing (WES) is a powerful tool for patients with undiagnosed genetic disorders. To demonstrate the clinical utility, we surveyed healthcare providers (HCP) about changes in medical management and treatment, diagnostic testing, reproductive planning, and use of educational services subsequent to WES testing.MethodsFor a period of 18 months, an 18-question survey was sent to HCPs attached to the WES reports. We analyzed the molecular diagnosis, patient clinical features, and the medical management changes reported in the returned surveys.ResultsA total of 62 (2.2% of 2,876) surveys were returned, consisting of 37.1% patients with a positive or likely positive pathogenic alteration, 51.6% negative results, 9.7% uncertain findings, and 1 patient (1.6%) with a novel candidate finding. Overall, 100% of the HCPs of patients with positive or likely positive WES results (n = 23) and HCPs of patients with uncertain WES results (n = 6) responded positively to one of the 18 queries. Of note, 37.5% of the HCPs of patients with negative WES results (n = 32) responded positively to at least one query.ConclusionOverall, these data clearly demonstrate the clinical utility of WES by demonstrating the impact on medical management irrespective of the exome result.