dorsal/arxiv
Informational Way to Protein Alphabet: Entropic Classification of Amino Acids
| Authors | A. N. Gorban, M. Kudryashev, T. Popova |
|---|---|
| Categories | q-bio.BM, physics.bio-ph, q-bio.QM |
| ArXiv ID | q-bio/0501019 |
| URL | https://arxiv.org/abs/q-bio/0501019 |
Abstract
What are proteins made from, as the working parts of the living cell's protein machines? To answer this question, we need a technology to disassemble proteins into elementary functional details and to prepare a lumped description of such details. This lumped description might have multiple material realizations (in amino acids). Our hypothesis is that an informational approach to this problem is possible. We propose a method of hierarchical classification that makes the primary structure of a protein maximally non-random. The first steps of the suggested research program are realized: the method and the analysis of the optimal informational binary protein alphabet. The general method is used to answer several specific questions, for example: (i) Is there a syntactic difference between Globular and Membrane proteins? (ii) Are proteins random sequences of amino acids (a long discussion)? For these questions, the answers are as follows: (i) There exists a significant syntactic difference between Globular and Membrane proteins, and this difference is described; (ii) Amino acid sequences in proteins are definitely not random.
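The idea of a binary alphabet that makes a sequence "maximally non-random" can be illustrated with a small sketch. The abstract does not state which measure of non-randomness is used, so the code below assumes one plausible choice: the mutual information between neighbouring symbols after every amino acid has been mapped to one of two classes. The function names (`reduce_sequence`, `adjacent_mutual_information`, `best_binary_alphabet`) and the exhaustive search are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def reduce_sequence(seq, group_one):
    """Map each residue to '1' if it belongs to group_one, else '0'."""
    return "".join("1" if aa in group_one else "0" for aa in seq)


def adjacent_mutual_information(symbols):
    """Mutual information (in bits) between neighbouring symbols.

    Assumption: this stands in for the 'non-randomness' score; it is zero
    when adjacent symbols are statistically independent.
    """
    pairs = Counter(zip(symbols, symbols[1:]))
    total = sum(pairs.values())
    if total == 0:
        return 0.0
    left, right = Counter(), Counter()
    for (a, b), n in pairs.items():
        left[a] += n
        right[b] += n
    mi = 0.0
    for (a, b), n in pairs.items():
        # p(a,b) / (p(a) * p(b)) simplifies to n * total / (left[a] * right[b])
        mi += (n / total) * math.log2(n * total / (left[a] * right[b]))
    return mi


def best_binary_alphabet(seq, letters):
    """Exhaustively search two-class partitions of `letters`.

    The first letter is always kept in class '0', so each partition is
    visited once. The search grows as 2**(len(letters) - 1); it is only
    practical for small alphabets and is meant as an illustration.
    """
    rest = letters[1:]
    best_group, best_score = None, -1.0
    for size in range(1, len(rest) + 1):
        for combo in combinations(rest, size):
            group = frozenset(combo)
            score = adjacent_mutual_information(reduce_sequence(seq, group))
            if score > best_score:
                best_group, best_score = group, score
    return best_group, best_score


if __name__ == "__main__":
    # Toy sequence over a three-letter alphabet, not real protein data.
    toy = "ALGLALGLALGLAL"
    group, score = best_binary_alphabet(toy, "AGL")
    print(sorted(group), round(score, 3))
```

For the full 20-letter amino acid alphabet the exhaustive search visits about half a million partitions per sequence, so a heuristic or greedy search would likely be needed in practice.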
{
"annotation_id": "a922cf37-523b-4d70-a611-8598bb0014c1",
"date_created": "2026-03-02T18:01:31.545000Z",
"date_modified": "2026-03-02T18:01:31.545000Z",
"file_hash": "61470b23bbdd1ee151e007c6704c856c1e6547ae68a53e58103da5b73679b129",
"private": false,
"record": {
"abstract": "What are proteins made from, as the working parts of the living cells protein\nmachines? To answer this question, we need a technology to disassemble proteins\nonto elementary func-tional details and to prepare lumped description of such\ndetails. This lumped description might have a multiple material realization (in\namino acids). Our hypothesis is that informational approach to this problem is\npossible. We propose a way of hierarchical classification that makes the\nprimary structure of protein maximally non-random. The first steps of the\nsuggested research program are realized: the method and the analysis of optimal\ninformational protein binary alphabet. The general method is used to answer\nseveral specific questions, for example: (i) Is there a syntactic difference\nbetween Globular and Membrane proteins? (ii) Are proteins random sequences of\namino acids (a long discussion)? For these questions, the answers are as\nfollows: (i) There exists significant syntactic difference between Globular and\nMembrane proteins, and this difference is described; (ii) Amino acid sequences\nin proteins are definitely not random.",
"arxiv_id": "q-bio/0501019",
"authors": [
"A. N. Gorban",
"M. Kudryashev",
"T. Popova"
],
"categories": [
"q-bio.BM",
"physics.bio-ph",
"q-bio.QM"
],
"title": "Informational Way to Protein Alphabet: Entropic Classification of Amino Acids",
"url": "https://arxiv.org/abs/q-bio/0501019"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "825e0982-f357-4385-a6f0-06731b5ce711",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}