Dataset Information

Dataset of solution-based inorganic materials synthesis procedures extracted from the scientific literature.


ABSTRACT: The development of a materials synthesis route is usually based on heuristics and experience. A possible new approach would be to apply data-driven approaches to learn the patterns of synthesis from past experience and use them to predict the syntheses of novel materials. However, this route is impeded by the lack of a large-scale database of synthesis formulations. In this work, we applied advanced machine learning and natural language processing techniques to construct a dataset of 35,675 solution-based synthesis procedures extracted from the scientific literature. Each procedure contains essential synthesis information including the precursors and target materials, their quantities, and the synthesis actions and corresponding attributes. Every procedure is also augmented with the reaction formula. Through this work, we are making freely available the first large dataset of solution-based inorganic materials synthesis procedures.
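
The abstract lists what each procedure record carries: precursors and target materials, their quantities, synthesis actions with attributes, and a reaction formula. A minimal Python sketch of such a record follows. All class names, field names, and the example values are hypothetical and chosen for illustration only; they are not the dataset's actual schema and the example is not an entry from it.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Material:
    formula: str                    # chemical formula, e.g. "TiO2"
    amount: Optional[float] = None  # quantity, if the paper reports one
    unit: Optional[str] = None      # unit of the quantity, e.g. "mmol" or "mL"


@dataclass
class Action:
    name: str                                       # synthesis action, e.g. "mix" or "heat"
    attributes: dict = field(default_factory=dict)  # e.g. {"temperature_C": 180}


@dataclass
class Procedure:
    targets: list       # list of Material
    precursors: list    # list of Material
    actions: list       # ordered list of Action
    reaction: str = ""  # reaction formula attached to the procedure


def precursor_formulas(proc):
    """Return the distinct precursor formulas of a procedure, sorted."""
    return sorted({m.formula for m in proc.precursors})


# Invented example record (hypothetical values, not taken from the dataset).
example = Procedure(
    targets=[Material("TiO2")],
    precursors=[Material("TiCl4", 1.0, "mmol"), Material("H2O", 20.0, "mL")],
    actions=[Action("mix"), Action("heat", {"temperature_C": 180, "time_h": 12})],
    reaction="TiCl4 + 2 H2O -> TiO2 + 4 HCl",
)
```

Keeping actions as an ordered list with free-form attributes is one possible design; it preserves the step sequence while leaving room for conditions that differ from action to action.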

SUBMITTER: Wang Z 

PROVIDER: S-EPMC9132903 | biostudies-literature | 2022 May

REPOSITORIES: biostudies-literature

Publications

Dataset of solution-based inorganic materials synthesis procedures extracted from the scientific literature.

Zheren Wang, Olga Kononova, Kevin Cruse, Tanjin He, Haoyan Huo, Yuxing Fei, Yan Zeng, Yingzhi Sun, Zijian Cai, Wenhao Sun, Gerbrand Ceder

Scientific Data, 25 May 2022, issue 1


Similar Datasets

S-EPMC10256153 | biostudies-literature
S-EPMC6794279 | biostudies-literature
S-EPMC5930398 | biostudies-literature
S-EPMC8772125 | biostudies-literature
S-EPMC8724131 | biostudies-literature
S-EPMC10060421 | biostudies-literature
S-EPMC9587980 | biostudies-literature
S-EPMC7141037 | biostudies-literature
S-EPMC10793203 | biostudies-literature
S-EPMC10490488 | biostudies-literature