Unknown

Dataset Information

0

DoubleHelix: nucleic acid sequence identification, assignment and validation tool for cryo-EM and crystal structure models.


ABSTRACT: Sequence assignment is a key step of the model building process in both cryogenic electron microscopy (cryo-EM) and macromolecular crystallography (MX). If the assignment fails, it can result in difficult to identify errors affecting the interpretation of a model. There are many model validation strategies that help experimentalists in this step of protein model building, but they are virtually non-existent for nucleic acids. Here, I present doubleHelix-a comprehensive method for assignment, identification, and validation of nucleic acid sequences in structures determined using cryo-EM and MX. The method combines a neural network classifier of nucleobase identities and a sequence-independent secondary structure assignment approach. I show that the presented method can successfully assist sequence-assignment step in nucleic-acid model building at lower resolutions, where visual map interpretation is very difficult. Moreover, I present examples of sequence assignment errors detected using doubleHelix in cryo-EM and MX structures of ribosomes deposited in the Protein Data Bank, which escaped the scrutiny of available model-validation approaches. The doubleHelix program source code is available under BSD-3 license at https://gitlab.com/gchojnowski/doublehelix.

SUBMITTER: Chojnowski G 

PROVIDER: S-EPMC10450167 | biostudies-literature | 2023 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

DoubleHelix: nucleic acid sequence identification, assignment and validation tool for cryo-EM and crystal structure models.

Chojnowski Grzegorz G  

Nucleic acids research 20230801 15


Sequence assignment is a key step of the model building process in both cryogenic electron microscopy (cryo-EM) and macromolecular crystallography (MX). If the assignment fails, it can result in difficult to identify errors affecting the interpretation of a model. There are many model validation strategies that help experimentalists in this step of protein model building, but they are virtually non-existent for nucleic acids. Here, I present doubleHelix-a comprehensive method for assignment, ide  ...[more]

Similar Datasets

| S-EPMC9248842 | biostudies-literature
| S-EPMC10306063 | biostudies-literature
| S-EPMC6130467 | biostudies-literature
| S-EPMC3690725 | biostudies-literature
| S-EPMC3670386 | biostudies-literature
| S-EPMC8411978 | biostudies-literature
| S-EPMC7467112 | biostudies-literature
| S-EPMC6760665 | biostudies-literature
| S-EPMC3577369 | biostudies-literature
| S-EPMC3696195 | biostudies-literature