<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Tian Y</submitter><funding>Natural Science Foundation of Hunan Province (Hunan Provincial Natural Science Foundation)</funding><funding>National Science Foundation of China | Major Research Plan</funding><funding>National Natural Science Foundation of China</funding><funding>National Science Foundation of China | Young Scientists Fund</funding><funding>Natural Science Foundation of Hunan Province</funding><funding>National Natural Science Foundation of China (National Science Foundation of China)</funding><pagination>59</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC12856017</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>9(1)</volume><pubmed_abstract>Adverse drug reactions (ADRs) represent a significant cause of morbidity and mortality in children, who face distinct pharmacological vulnerabilities due to unique physiological development. Current pediatric drug safety research is hindered by limited clinical data and adult-focused studies, creating evidence gaps. We developed a comprehensive computational approach for pediatric pharmacovigilance, integrating consensus-driven signal detection, multi-level biological features, and interpretable machine learning. Using 1.4 million FDA Adverse Event Reporting System reports, we constructed the largest curated pediatric drug-ADR dataset. Severity-specific thresholds and voting across four algorithms (PRR, ROR, BCPNN, and EBGM) optimized ADR identification. Multi-level biological fingerprints spanning molecular, target, and network domains combined with XGBoost significantly improved predictive performance (ROC AUC: 0.7177), especially for imbalanced scenarios. Cross-domain analyses revealed that models trained on adult data exhibit poor generalization to pediatric contexts, confirming that adverse reactions in children cannot be reliably predicted using adult data. Our approach successfully identified established and novel pediatric-specific ADRs with strong literature support. Collectively, this work establishes methodological innovations for pediatric pharmacovigilance, bridges a critical evidence gap in pediatric drug safety, and delivers practical tools for clinical and regulatory decision-making.</pubmed_abstract><journal>Communications chemistry</journal><pubmed_title>Machine learning prediction of pediatric adverse drug reactions using consensus-derived scarce data.</pubmed_title><pmcid>PMC12856017</pmcid><funding_grant_id>22173118, 22220102001</funding_grant_id><funding_grant_id>2024JJ6554</funding_grant_id><funding_grant_id>82304316</funding_grant_id><funding_grant_id>2025JJ60651</funding_grant_id><funding_grant_id>2023ZD0507104</funding_grant_id><funding_grant_id>22307112</funding_grant_id><pubmed_authors>Yi J</pubmed_authors><pubmed_authors>Tian Y</pubmed_authors><pubmed_authors>Cao D</pubmed_authors><pubmed_authors>Jiang D</pubmed_authors><pubmed_authors>Li K</pubmed_authors><pubmed_authors>Peng J</pubmed_authors><pubmed_authors>Deng Y</pubmed_authors></additional><is_claimable>false</is_claimable><name>Machine learning prediction of pediatric adverse drug reactions using consensus-derived scarce data.</name><description>Adverse drug reactions (ADRs) represent a significant cause of morbidity and mortality in children, who face distinct pharmacological vulnerabilities due to unique physiological development. Current pediatric drug safety research is hindered by limited clinical data and adult-focused studies, creating evidence gaps. We developed a comprehensive computational approach for pediatric pharmacovigilance, integrating consensus-driven signal detection, multi-level biological features, and interpretable machine learning. Using 1.4 million FDA Adverse Event Reporting System reports, we constructed the largest curated pediatric drug-ADR dataset. Severity-specific thresholds and voting across four algorithms (PRR, ROR, BCPNN, and EBGM) optimized ADR identification. Multi-level biological fingerprints spanning molecular, target, and network domains combined with XGBoost significantly improved predictive performance (ROC AUC: 0.7177), especially for imbalanced scenarios. Cross-domain analyses revealed that models trained on adult data exhibit poor generalization to pediatric contexts, confirming that adverse reactions in children cannot be reliably predicted using adult data. Our approach successfully identified established and novel pediatric-specific ADRs with strong literature support. Collectively, this work establishes methodological innovations for pediatric pharmacovigilance, bridges a critical evidence gap in pediatric drug safety, and delivers practical tools for clinical and regulatory decision-making.</description><dates><release>2025-01-01T00:00:00Z</release><publication>2025 Dec</publication><modification>2026-06-16T03:08:42.333Z</modification><creation>2026-06-16T03:06:38.619Z</creation></dates><accession>S-EPMC12856017</accession><cross_references><pubmed>41449233</pubmed><doi>10.1038/s42004-025-01865-9</doi></cross_references></HashMap>