Unknown

Dataset Information

0

Automatic feature engineering for catalyst design using small data without prior knowledge of target catalysis.


ABSTRACT: The empirical aspect of descriptor design in catalyst informatics, particularly when confronted with limited data, necessitates adequate prior knowledge for delving into unknown territories, thus presenting a logical contradiction. This study introduces a technique for automatic feature engineering (AFE) that works on small catalyst datasets, without reliance on specific assumptions or pre-existing knowledge about the target catalysis when designing descriptors and building machine-learning models. This technique generates numerous features through mathematical operations on general physicochemical features of catalytic components and extracts relevant features for the desired catalysis, essentially screening numerous hypotheses on a machine. AFE yields reasonable regression results for three types of heterogeneous catalysis: oxidative coupling of methane (OCM), conversion of ethanol to butadiene, and three-way catalysis, where only the training set is swapped. Moreover, through the application of active learning that combines AFE and high-throughput experimentation for OCM, we successfully visualize the machine's process of acquiring precise recognition of the catalyst design. Thus, AFE is a versatile technique for data-driven catalysis research and a key step towards fully automated catalyst discoveries.

SUBMITTER: Taniike T 

PROVIDER: S-EPMC10786848 | biostudies-literature | 2024 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Automatic feature engineering for catalyst design using small data without prior knowledge of target catalysis.

Taniike Toshiaki T   Fujiwara Aya A   Nakanowatari Sunao S   García-Escobar Fernando F   Takahashi Keisuke K  

Communications chemistry 20240112 1


The empirical aspect of descriptor design in catalyst informatics, particularly when confronted with limited data, necessitates adequate prior knowledge for delving into unknown territories, thus presenting a logical contradiction. This study introduces a technique for automatic feature engineering (AFE) that works on small catalyst datasets, without reliance on specific assumptions or pre-existing knowledge about the target catalysis when designing descriptors and building machine-learning mode  ...[more]

Similar Datasets

| S-EPMC6051992 | biostudies-literature
| S-EPMC1162696 | biostudies-other
| S-EPMC8040869 | biostudies-literature
| S-EPMC5875469 | biostudies-literature
| S-EPMC11365982 | biostudies-literature
| S-EPMC5719493 | biostudies-literature
| S-EPMC4380987 | biostudies-literature
| S-EPMC11447918 | biostudies-literature
| S-EPMC6614886 | biostudies-literature
2016-11-08 | GSE73638 | GEO