Dataset Information

Sequence physical properties encode the global organization of protein structure space.


ABSTRACT: It is demonstrated that, properly represented, the amino acid composition of protein sequences contains the information necessary to delineate the global properties of protein structure space. A numerical representation of amino acid sequence in terms of a set of property factors is used, and the values of those property factors are averaged over individual sequences and then over sets of sequences belonging to structurally defined groups. These sequence sets then can be viewed as points in a 10-dimensional space, and the organization of that space, determined only by sequence properties, is similar at both local and global scales to that of the space of protein structures determined previously.
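The two-stage averaging described in the abstract can be illustrated with a minimal sketch. It assumes 10 property factors per amino acid, matching the 10-dimensional space mentioned above. The factor values here are random placeholders, not the published property factors, and the function names, the example sequences, and the use of Euclidean distance are illustrative assumptions rather than the author's method.

```python
import random
from statistics import fmean

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
N_FACTORS = 10  # assumed to match the 10-dimensional space in the abstract


def make_placeholder_factors(seed=0):
    """Return made-up factor values per amino acid (NOT real property factors)."""
    rng = random.Random(seed)
    return {aa: [rng.uniform(-1.0, 1.0) for _ in range(N_FACTORS)]
            for aa in AMINO_ACIDS}


def sequence_vector(seq, factors):
    """Average each property factor over the residues of one sequence."""
    rows = [factors[aa] for aa in seq if aa in factors]
    if not rows:
        raise ValueError("sequence contains no recognised residues")
    return [fmean(col) for col in zip(*rows)]


def group_vector(seqs, factors):
    """Average the per-sequence vectors over a structurally defined group."""
    vectors = [sequence_vector(s, factors) for s in seqs]
    return [fmean(col) for col in zip(*vectors)]


def distance(u, v):
    """Euclidean distance between two group points (an assumed metric)."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5


factors = make_placeholder_factors()
# Hypothetical groups with hypothetical sequences, for illustration only.
groups = {
    "group_a": ["ACDEFGHIK", "ACDEFGLMN"],
    "group_b": ["PQRSTVWY", "PQRSTVWYAC"],
}
points = {name: group_vector(seqs, factors) for name, seqs in groups.items()}
print(len(points["group_a"]), distance(points["group_a"], points["group_b"]))
```

Because each sequence is reduced to an average over its residues, the resulting vector depends only on amino acid composition and not on residue order, which is the sense in which the abstract speaks of composition carrying the information.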

SUBMITTER: Rackovsky S 

PROVIDER: S-EPMC2732808 | biostudies-literature | 2009 Aug

REPOSITORIES: biostudies-literature

Publications

Sequence physical properties encode the global organization of protein structure space.

Rackovsky S

Proceedings of the National Academy of Sciences of the United States of America, 2009-08-12, issue 34


Similar Datasets

| S-EPMC7345901 | biostudies-literature
| S-EPMC4908355 | biostudies-literature
| S-EPMC9684540 | biostudies-literature
| S-EPMC548596 | biostudies-literature
| S-EPMC7988036 | biostudies-literature
| S-EPMC10260930 | biostudies-literature
| S-EPMC5738032 | biostudies-literature
| S-EPMC10081344 | biostudies-literature
| S-EPMC10846622 | biostudies-literature