<HashMap><database>biostudies-literature</database><scores/><additional><omics_type>Unknown</omics_type><submitter>Sui Y</submitter><funding>NICHD NIH HHS</funding><funding>NIMH NIH HHS</funding><funding>NINDS NIH HHS</funding><funding>NIH HHS</funding><pubmed_abstract>Autism spectrum disorders (ASDs) are genetically and phenotypically heterogeneous and the majority of cases still remain genetically unresolved. To better understand large-effect pathogenic variation, we generated long-read sequencing data to construct phased and near-complete genome assemblies (average contig N50=43 Mbp, QV=56) for 189 individuals from 51 families with unsolved cases of autism. We applied read- and assembly-based strategies to facilitate comprehensive characterization of &lt;i>de novo&lt;/i> mutations (DNMs), structural variants (SVs), and DNA methylation profiles. Merging common SVs obtained from long-read pangenome controls, we efficiently filtered >97% of common SVs exclusive to 87 offspring. We find no evidence of increased autosomal SV burden for probands when compared to unaffected siblings yet note a trend for an increase of SV burden on the X chromosome among affected females. We establish a workflow to prioritize potential pathogenic variants by integrating autism risk genes and putative noncoding regulatory elements defined from ATAC-seq and CUT&amp;Tag data from the developing cortex. In total, we identified three pathogenic variants in &lt;i>TBL1XR1&lt;/i>, &lt;i>MECP2&lt;/i>, and &lt;i>SYNGAP1&lt;/i>, as well as nine candidate &lt;i>de novo&lt;/i> and biparental homozygous SVs, most of which were missed by short-read sequencing. Our work highlights the potential of phased genomes to discover complex more pathogenic mutations and the power of the pangenome to restrict the focus on an increasingly smaller number of SVs for clinical evaluation.</pubmed_abstract><journal>medRxiv : the preprint server for health sciences</journal><pagination>2025.07.21.25331932</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC12330438</full_dataset_link><repository>biostudies-literature</repository><pubmed_title>Pangenome discovery of missing autism variants.</pubmed_title><pmcid>PMC12330438</pmcid><funding_grant_id>R01 NS057819</funding_grant_id><funding_grant_id>F32 HD116501</funding_grant_id><funding_grant_id>DP5 OD033357</funding_grant_id><funding_grant_id>R01 MH101221</funding_grant_id><pubmed_authors>Koundinya N</pubmed_authors><pubmed_authors>Wu M</pubmed_authors><pubmed_authors>Nowakowski TJ</pubmed_authors><pubmed_authors>Karunakaran D</pubmed_authors><pubmed_authors>Human Pangenome Reference Consortium (HPRC)</pubmed_authors><pubmed_authors>Hennick K</pubmed_authors><pubmed_authors>Cherry F</pubmed_authors><pubmed_authors>Polo Prieto RA</pubmed_authors><pubmed_authors>Wang T</pubmed_authors><pubmed_authors>Kwon Y</pubmed_authors><pubmed_authors>Wong I</pubmed_authors><pubmed_authors>Suter B</pubmed_authors><pubmed_authors>Wertz J</pubmed_authors><pubmed_authors>Hoekzema K</pubmed_authors><pubmed_authors>Chakravarti A</pubmed_authors><pubmed_authors>Sui Y</pubmed_authors><pubmed_authors>Garcia GH</pubmed_authors><pubmed_authors>Berk-Rauch H</pubmed_authors><pubmed_authors>Munson KM</pubmed_authors><pubmed_authors>Pehlivan D</pubmed_authors><pubmed_authors>Zoghbi HY</pubmed_authors><pubmed_authors>Meyer-Schuman R</pubmed_authors><pubmed_authors>Harvey WT</pubmed_authors><pubmed_authors>Miller DE</pubmed_authors><pubmed_authors>Eichler EE</pubmed_authors><pubmed_authors>Lin J</pubmed_authors><pubmed_authors>Noyes MD</pubmed_authors><pubmed_authors>Gustafson JA</pubmed_authors><pubmed_authors>Knuth J</pubmed_authors></additional><is_claimable>false</is_claimable><name>Pangenome discovery of missing autism variants.</name><description>Autism spectrum disorders (ASDs) are genetically and phenotypically heterogeneous and the majority of cases still remain genetically unresolved. To better understand large-effect pathogenic variation, we generated long-read sequencing data to construct phased and near-complete genome assemblies (average contig N50=43 Mbp, QV=56) for 189 individuals from 51 families with unsolved cases of autism. We applied read- and assembly-based strategies to facilitate comprehensive characterization of &lt;i>de novo&lt;/i> mutations (DNMs), structural variants (SVs), and DNA methylation profiles. Merging common SVs obtained from long-read pangenome controls, we efficiently filtered >97% of common SVs exclusive to 87 offspring. We find no evidence of increased autosomal SV burden for probands when compared to unaffected siblings yet note a trend for an increase of SV burden on the X chromosome among affected females. We establish a workflow to prioritize potential pathogenic variants by integrating autism risk genes and putative noncoding regulatory elements defined from ATAC-seq and CUT&amp;Tag data from the developing cortex. In total, we identified three pathogenic variants in &lt;i>TBL1XR1&lt;/i>, &lt;i>MECP2&lt;/i>, and &lt;i>SYNGAP1&lt;/i>, as well as nine candidate &lt;i>de novo&lt;/i> and biparental homozygous SVs, most of which were missed by short-read sequencing. Our work highlights the potential of phased genomes to discover complex more pathogenic mutations and the power of the pangenome to restrict the focus on an increasingly smaller number of SVs for clinical evaluation.</description><dates><release>2025-01-01T00:00:00Z</release><publication>2025 Jul</publication><modification>2026-04-07T13:43:39.649Z</modification><creation>2026-04-07T13:34:18.821Z</creation></dates><accession>S-EPMC12330438</accession><cross_references><pubmed>40778144</pubmed><doi>10.1101/2025.07.21.25331932</doi></cross_references></HashMap>