dorsal/arxiv
Protein Structure and Evolutionary History Determine Sequence Space Topology
| Authors | Boris Shakhnovich, Eric Deeds, Charles Delisi, Eugene Shakhnovich |
|---|---|
| Categories | q-bio.BM, q-bio.GN |
| ArXiv ID | q-bio/0404040 |
| URL | https://arxiv.org/abs/q-bio/0404040 |
Abstract
Understanding the observed variability in the number of homologs of a gene is a very important, unsolved problem that has broad implications for research into co-evolution of structure and function, gene duplication, pseudogene formation and possibly for emerging diseases. Here we attempt to define and elucidate the reasons behind this observed unevenness in sequence space. We present evidence that sequence variability and functional diversity of a gene or fold family is influenced by certain quantitative characteristics of the protein structure that reflect potential for sequence plasticity i.e. the ability to accept mutation without losing thermodynamic stability.
{
"annotation_id": "fa0b824e-ad22-4c71-88b0-7868972ce6fa",
"date_created": "2026-03-02T18:01:31.019000Z",
"date_modified": "2026-03-02T18:01:31.019000Z",
"file_hash": "d55659909939c64857f5b452d2d721ce636de785b528897ae4a9c8a7728b335e",
"private": false,
"record": {
"abstract": "Understanding the observed variability in the number of homologs of a gene is\na very important, unsolved problem that has broad implications for research\ninto co-evolution of structure and function, gene duplication, pseudogene\nformation and possibly for emerging diseases. Here we attempt to define and\nelucidate the reasons behind this observed unevenness in sequence space. We\npresent evidence that sequence variability and functional diversity of a gene\nor fold family is influenced by certain quantitative characteristics of the\nprotein structure that reflect potential for sequence plasticity i.e. the\nability to accept mutation without losing thermodynamic stability.",
"arxiv_id": "q-bio/0404040",
"authors": [
"Boris Shakhnovich",
"Eric Deeds",
"Charles Delisi",
"Eugene Shakhnovich"
],
"categories": [
"q-bio.BM",
"q-bio.GN"
],
"title": "Protein Structure and Evolutionary History Determine Sequence Space Topology",
"url": "https://arxiv.org/abs/q-bio/0404040"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "444b4e12-cd07-46e7-945b-3d3bdefe806c",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}