Dataset Information

Unique identifiers for small molecules enable rigorous labeling of their atoms.


ABSTRACT: Rigorous characterization of small organic molecules in terms of their structural and biological properties is vital to biomedical research. The three-dimensional structure of a molecule, its 'photo ID', is inefficient for searching and matching tasks. Instead, identifiers play a key role in accessing compound data. Unique and reproducible molecule and atom identifiers are required to ensure the correct cross-referencing of properties associated with compounds archived in databases. The best approach to this requirement is the International Chemical Identifier (InChI). However, the current implementation of InChI fails to provide a complete standard for atom nomenclature, and incorrect use of the InChI standard has resulted in the proliferation of non-unique identifiers. We propose a methodology and associated software tools, named ALATIS, that overcomes these shortcomings. ALATIS is an adaptation of InChI, which operates fully within the InChI convention to provide unique and reproducible molecule and all atom identifiers. ALATIS includes an InChI extension for unique atom labeling of symmetric molecules. ALATIS forms the basis for improving reproducibility and unifying cross-referencing across databases.

SUBMITTER: Dashti H 

PROVIDER: S-EPMC5441290 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

Publications

Unique identifiers for small molecules enable rigorous labeling of their atoms.

Hesam Dashti, William M. Westler, John L. Markley, Hamid R. Eghbalnia

Scientific Data, 2017-05-23


Similar Datasets

| S-EPMC6157883 | biostudies-other
| S-EPMC7471316 | biostudies-literature
2011-11-21 | E-MTAB-816 | biostudies-arrayexpress
2023-06-01 | GSE218903 | GEO
| PRJEB2713 | ENA
2012-04-07 | E-GEOD-36246 | biostudies-arrayexpress
| S-EPMC8047766 | biostudies-literature
| S-EPMC5658704 | biostudies-literature
| S-EPMC6044086 | biostudies-literature
2023-06-01 | GSE218899 | GEO