<HashMap><database>biostudies-literature</database><scores><citationCount>0</citationCount><reanalysisCount>0</reanalysisCount><viewCount>44</viewCount><searchCount>0</searchCount></scores><additional><submitter>Bonfield JK</submitter><funding>Wellcome Trust</funding><pagination>337-339</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC6330002</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>35(2)</volume><pubmed_abstract>&lt;h4>Motivation&lt;/h4>The bulk of space taken up by NGS sequencing CRAM files consists of per-base quality values. Most of these are unnecessary for variant calling, offering an opportunity for space saving.&lt;h4>Results&lt;/h4>On the Syndip test set, a 17 fold reduction in the quality storage portion of a CRAM file can be achieved while maintaining variant calling accuracy. The size reduction of an entire CRAM file varied from 2.2 to 7.4 fold, depending on the non-quality content of the original file (see Supplementary Material S6 for details).&lt;h4>Availability and implementation&lt;/h4>Crumble is OpenSource and can be obtained from https://github.com/jkbonfield/crumble.&lt;h4>Supplementary information&lt;/h4>Supplementary data are available at Bioinformatics online.</pubmed_abstract><journal>Bioinformatics (Oxford, England)</journal><pubmed_title>Crumble: reference free lossy compression of sequence quality values.</pubmed_title><pmcid>PMC6330002</pmcid><funding_grant_id>207492/Z/17/Z</funding_grant_id><funding_grant_id>WT098051</funding_grant_id><pubmed_authors>McCarthy SA</pubmed_authors><pubmed_authors>Bonfield JK</pubmed_authors><pubmed_authors>Durbin R</pubmed_authors><view_count>44</view_count></additional><is_claimable>false</is_claimable><name>Crumble: reference free lossy compression of sequence quality values.</name><description>&lt;h4>Motivation&lt;/h4>The bulk of space taken up by NGS sequencing CRAM files consists of per-base quality values. Most of these are unnecessary for variant calling, offering an opportunity for space saving.&lt;h4>Results&lt;/h4>On the Syndip test set, a 17 fold reduction in the quality storage portion of a CRAM file can be achieved while maintaining variant calling accuracy. The size reduction of an entire CRAM file varied from 2.2 to 7.4 fold, depending on the non-quality content of the original file (see Supplementary Material S6 for details).&lt;h4>Availability and implementation&lt;/h4>Crumble is OpenSource and can be obtained from https://github.com/jkbonfield/crumble.&lt;h4>Supplementary information&lt;/h4>Supplementary data are available at Bioinformatics online.</description><dates><release>2019-01-01T00:00:00Z</release><publication>2019 Jan</publication><modification>2021-02-20T23:37:48Z</modification><creation>2019-03-26T22:37:10Z</creation></dates><accession>S-EPMC6330002</accession><cross_references><pubmed>29992288</pubmed><doi>10.1093/bioinformatics/bty608</doi></cross_references></HashMap>