Unknown

Dataset Information

0

A thermoelectric materials database auto-generated from the scientific literature using ChemDataExtractor.


ABSTRACT: An auto-generated thermoelectric-materials database is presented, containing 22,805 data records, automatically generated from the scientific literature, spanning 10,641 unique extracted chemical names. Each record contains a chemical entity and one of the seminal thermoelectric properties: thermoelectric figure of merit, ZT; thermal conductivity, κ; Seebeck coefficient, S; electrical conductivity, σ; power factor, PF; each linked to their corresponding recorded temperature, T. The database was auto-generated using the automatic sentence-parsing capabilities of the chemistry-aware, natural language processing toolkit, ChemDataExtractor 2.0, adapted for application in the thermoelectric-materials domain, following a rule-based sentence-simplification step. Data were mined from the text of 60,843 scientific papers that were sourced from three scientific publishers: Elsevier, the Royal Society of Chemistry, and Springer. To the best of our knowledge, this is the first automatically-generated database of thermoelectric materials and their properties from existing literature. The database was evaluated to have a precision of 82.25% and has been made publicly available to facilitate the application of data science in the thermoelectric-materials domain, for analysis, design, and prediction.

SUBMITTER: Sierepeklis O 

PROVIDER: S-EPMC9587980 | biostudies-literature | 2022 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

A thermoelectric materials database auto-generated from the scientific literature using ChemDataExtractor.

Sierepeklis Odysseas O   Cole Jacqueline M JM  

Scientific data 20221022 1


An auto-generated thermoelectric-materials database is presented, containing 22,805 data records, automatically generated from the scientific literature, spanning 10,641 unique extracted chemical names. Each record contains a chemical entity and one of the seminal thermoelectric properties: thermoelectric figure of merit, ZT; thermal conductivity, κ; Seebeck coefficient, S; electrical conductivity, σ; power factor, PF; each linked to their corresponding recorded temperature, T. The database was  ...[more]

Similar Datasets

| S-EPMC7411033 | biostudies-literature
| S-EPMC10794197 | biostudies-literature
| S-EPMC9065101 | biostudies-literature
| S-EPMC9065060 | biostudies-literature
| S-EPMC9205998 | biostudies-literature
| S-EPMC6007086 | biostudies-literature
| S-EPMC10210167 | biostudies-literature
| S-EPMC10305376 | biostudies-literature
| S-EPMC9132903 | biostudies-literature
| S-EPMC10256153 | biostudies-literature