<HashMap><database>biostudies-literature</database><scores/><additional><omics_type>Unknown</omics_type><volume>12(1)</volume><submitter>Lv Z</submitter><pubmed_abstract>Under the strategic objectives of carbon peaking and carbon neutrality, energy transition driven by new quality productive forces has emerged as a central theme in China's energy development. Among these, the intelligent sorting and analysis of raw coal using deep learning constitute a pivotal technical process. However, the progress of intelligent coal preparation in China has been constrained by the absence of accurate and large-scale data. To address this gap, this study introduces DsCGF, a large-scale, open-source raw coal image dataset. Over the past five years, extensive raw coal image samples were systematically collected and meticulously annotated from three representative mining regions in China, resulting in a dataset comprising over 270,000 visible-light images. These images are annotated at multiple levels, targeting three primary categories: coal, gangue, and foreign objects, and are designed for three core computer vision tasks: image classification, object detection, and instance segmentation. Comprehensive evaluation results indicate that the DsCGF can effectively support further research into the intelligent sorting of raw coal.</pubmed_abstract><journal>Scientific data</journal><pagination>403</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC11890867</full_dataset_link><repository>biostudies-literature</repository><pubmed_title>A large-scale open image dataset for deep learning-enabled intelligent sorting and analyzing of raw coal.</pubmed_title><pmcid>PMC11890867</pmcid><pubmed_authors>Lv Z</pubmed_authors><pubmed_authors>Sun M</pubmed_authors><pubmed_authors>Sha T</pubmed_authors><pubmed_authors>Fan Y</pubmed_authors><pubmed_authors>Cui Y</pubmed_authors><pubmed_authors>Lv H</pubmed_authors><pubmed_authors>Wang W</pubmed_authors><pubmed_authors>Tu Y</pubmed_authors><pubmed_authors>Wu Y</pubmed_authors><pubmed_authors>Xu Z</pubmed_authors></additional><is_claimable>false</is_claimable><name>A large-scale open image dataset for deep learning-enabled intelligent sorting and analyzing of raw coal.</name><description>Under the strategic objectives of carbon peaking and carbon neutrality, energy transition driven by new quality productive forces has emerged as a central theme in China's energy development. Among these, the intelligent sorting and analysis of raw coal using deep learning constitute a pivotal technical process. However, the progress of intelligent coal preparation in China has been constrained by the absence of accurate and large-scale data. To address this gap, this study introduces DsCGF, a large-scale, open-source raw coal image dataset. Over the past five years, extensive raw coal image samples were systematically collected and meticulously annotated from three representative mining regions in China, resulting in a dataset comprising over 270,000 visible-light images. These images are annotated at multiple levels, targeting three primary categories: coal, gangue, and foreign objects, and are designed for three core computer vision tasks: image classification, object detection, and instance segmentation. Comprehensive evaluation results indicate that the DsCGF can effectively support further research into the intelligent sorting of raw coal.</description><dates><release>2025-01-01T00:00:00Z</release><publication>2025 Mar</publication><modification>2025-04-04T08:22:27.854Z</modification><creation>2025-04-04T08:22:27.854Z</creation></dates><accession>S-EPMC11890867</accession><cross_references><pubmed>40057526</pubmed><doi>10.1038/s41597-025-04719-0</doi></cross_references></HashMap>