<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Mao Y</submitter><funding>Oregon National Primate Research Center</funding><funding>Howard Hughes Medical Institute</funding><funding>NIDA NIH HHS</funding><funding>NIAID NIH HHS</funding><funding>NHGRI NIH HHS</funding><funding>National Natural Science Foundation of China</funding><funding>National Institutes of Health</funding><funding>NIH HHS</funding><funding>NIGMS NIH HHS</funding><funding>Shanghai Jiao Tong University</funding><pagination>1547-1562.e13</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC10947866</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>187(6)</volume><pubmed_abstract>We sequenced and assembled using multiple long-read sequencing technologies the genomes of chimpanzee, bonobo, gorilla, orangutan, gibbon, macaque, owl monkey, and marmoset. We identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. We estimate that 819.47 Mbp or ∼27% of the genome has been affected by SVs across primate evolution. We identify 1,607 structurally divergent regions wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost (e.g., CARD, C4, and OLAH gene families) and additional lineage-specific genes are generated (e.g., CKAP2, VPS36, ACBD7, and NEK5 paralogs), becoming targets of rapid chromosomal diversification and positive selection (e.g., RGPD gene family). High-fidelity long-read sequencing has made these dynamic regions of the genome accessible for sequence-level analyses within and between primate species.</pubmed_abstract><journal>Cell</journal><pubmed_title>Structurally divergent and recurrently mutated regions of primate genomes.</pubmed_title><pmcid>PMC10947866</pmcid><funding_grant_id>R01 HG010485</funding_grant_id><funding_grant_id>P51 OD011092</funding_grant_id><funding_grant_id>R01 AI137011</funding_grant_id><funding_grant_id>U24 HG011853</funding_grant_id><funding_grant_id>K99 GM147352</funding_grant_id><funding_grant_id>R00 GM147352</funding_grant_id><funding_grant_id>R01 HG002385</funding_grant_id><funding_grant_id>U01 HG010961</funding_grant_id><funding_grant_id>R01 HG010169</funding_grant_id><funding_grant_id>R01 HG010333</funding_grant_id><funding_grant_id>DP1 AI175471</funding_grant_id><funding_grant_id>DP1 DA046108</funding_grant_id><funding_grant_id>R01 OD034046</funding_grant_id><funding_grant_id>U41 HG010972</funding_grant_id><pubmed_authors>Audano PA</pubmed_authors><pubmed_authors>Fair T</pubmed_authors><pubmed_authors>Paten B</pubmed_authors><pubmed_authors>Yoo D</pubmed_authors><pubmed_authors>Haukness M</pubmed_authors><pubmed_authors>Feng G</pubmed_authors><pubmed_authors>Wei X</pubmed_authors><pubmed_authors>Pollen AA</pubmed_authors><pubmed_authors>Zhang S</pubmed_authors><pubmed_authors>Del Rosario R</pubmed_authors><pubmed_authors>Shi Y</pubmed_authors><pubmed_authors>Logsdon GA</pubmed_authors><pubmed_authors>Sun Q</pubmed_authors><pubmed_authors>Sawyer SL</pubmed_authors><pubmed_authors>Lewis AP</pubmed_authors><pubmed_authors>Warren WC</pubmed_authors><pubmed_authors>Mao Y</pubmed_authors><pubmed_authors>Dishuck PC</pubmed_authors><pubmed_authors>Gordon DS</pubmed_authors><pubmed_authors>Bakken TE</pubmed_authors><pubmed_authors>Hoekzema K</pubmed_authors><pubmed_authors>Fattor WT</pubmed_authors><pubmed_authors>Jeong H</pubmed_authors><pubmed_authors>Porubsky D</pubmed_authors><pubmed_authors>Bauer VL</pubmed_authors><pubmed_authors>Carbone L</pubmed_authors><pubmed_authors>Munson KM</pubmed_authors><pubmed_authors>Yang X</pubmed_authors><pubmed_authors>Rozanski A</pubmed_authors><pubmed_authors>Harvey WT</pubmed_authors><pubmed_authors>Eichler EE</pubmed_authors><pubmed_authors>Wilkerson GK</pubmed_authors><pubmed_authors>Lu Q</pubmed_authors></additional><is_claimable>false</is_claimable><name>Structurally divergent and recurrently mutated regions of primate genomes.</name><description>We sequenced and assembled using multiple long-read sequencing technologies the genomes of chimpanzee, bonobo, gorilla, orangutan, gibbon, macaque, owl monkey, and marmoset. We identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. We estimate that 819.47 Mbp or ∼27% of the genome has been affected by SVs across primate evolution. We identify 1,607 structurally divergent regions wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost (e.g., CARD, C4, and OLAH gene families) and additional lineage-specific genes are generated (e.g., CKAP2, VPS36, ACBD7, and NEK5 paralogs), becoming targets of rapid chromosomal diversification and positive selection (e.g., RGPD gene family). High-fidelity long-read sequencing has made these dynamic regions of the genome accessible for sequence-level analyses within and between primate species.</description><dates><release>2024-01-01T00:00:00Z</release><publication>2024 Mar</publication><modification>2026-06-01T19:20:20.801Z</modification><creation>2025-04-04T09:07:10.61Z</creation></dates><accession>S-EPMC10947866</accession><cross_references><pubmed>38428424</pubmed><doi>10.1016/j.cell.2024.01.052</doi></cross_references></HashMap>