Unknown

Dataset Information

0

Visualizing chemical space networks with RDKit and NetworkX.


ABSTRACT: This article demonstrates how to create Chemical Space Networks (CSNs) using a Python RDKit and NetworkX workflow. CSNs are a type of network visualization that depict compounds as nodes connected by edges, defined as a pairwise relationship such as a 2D fingerprint similarity value. A step by step approach is presented for creating two different CSNs in this manuscript, one based on RDKit 2D fingerprint Tanimoto similarity values, and another based on maximum common substructure similarity values. Several different CSN visualization features are included in the tutorial including methods to represent nodes with color based on bioactivity attribute value, edges with different line styles based on similarity value, as well as replacing the circle nodes with 2D structure depictions. Finally, some common network property and analysis calculations are presented including the clustering coefficient, degree assortativity, and modularity. All code is provided in the form of Jupyter Notebooks and is available on GitHub with a permissive BSD-3 open-source license: https://github.com/vfscalfani/CSN_tutorial.

SUBMITTER: Scalfani VF 

PROVIDER: S-EPMC9798653 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Visualizing chemical space networks with RDKit and NetworkX.

Scalfani Vincent F VF   Patel Vishank D VD   Fernandez Avery M AM  

Journal of cheminformatics 20221228 1


This article demonstrates how to create Chemical Space Networks (CSNs) using a Python RDKit and NetworkX workflow. CSNs are a type of network visualization that depict compounds as nodes connected by edges, defined as a pairwise relationship such as a 2D fingerprint similarity value. A step by step approach is presented for creating two different CSNs in this manuscript, one based on RDKit 2D fingerprint Tanimoto similarity values, and another based on maximum common substructure similarity valu  ...[more]

Similar Datasets

| S-EPMC11492315 | biostudies-literature
| S-EPMC9060647 | biostudies-literature
| S-EPMC5538041 | biostudies-literature
| S-EPMC7278228 | biostudies-literature
| S-EPMC6013758 | biostudies-literature
| S-EPMC11263914 | biostudies-literature
| S-EPMC2481276 | biostudies-literature
| S-EPMC6156459 | biostudies-literature
| S-EPMC9787733 | biostudies-literature
| S-EPMC6287642 | biostudies-literature