Ontology highlight
ABSTRACT:
SUBMITTER: Handsel J
PROVIDER: S-EPMC8496104 | biostudies-literature | 2021 Oct
REPOSITORIES: biostudies-literature

Handsel Jennifer J Matthews Brian B Knight Nicola J NJ Coles Simon J SJ
Journal of cheminformatics 20211007 1
We present a sequence-to-sequence machine learning model for predicting the IUPAC name of a chemical from its standard International Chemical Identifier (InChI). The model uses two stacks of transformers in an encoder-decoder architecture, a setup similar to the neural networks used in state-of-the-art machine translation. Unlike neural machine translation, which usually tokenizes input and output into words or sub-words, our model processes the InChI and predicts the IUPAC name character by cha ...[more]