dorsal/arxiv
A simple stochastic model for the evolution of protein lengths
| Authors | C. Destri, C. Miccio |
|---|---|
| Categories | q-bio.PE, q-bio.QM |
| ArXiv ID | q-bio/0703054 |
| URL | https://arxiv.org/abs/q-bio/0703054 |
| DOI | 10.1103/PhysRevE.76.011924 |
Abstract
We analyse a simple discrete-time stochastic process for the theoretical modeling of the evolution of protein lengths. At every step of the process a new protein is produced as a modification of one of the proteins already existing, and its length is assumed to be a random variable which depends only on the length of the originating protein. Thus a Random Recursive Tree (RRT) is produced over the natural integers. If (quasi) scale invariance is assumed, the length distribution in a single history tends to a lognormal form with a specific signature of the deviations from exact Gaussianity. Comparison with the very large SIMAP protein database shows good agreement.
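The process described in the abstract can be illustrated with a minimal sketch. The abstract does not give the transition rule, so everything specific here is an assumption made for illustration: the parent is chosen uniformly at random among existing proteins (the standard random recursive tree rule), scale invariance is approximated by multiplying the parent length by a lognormal factor, and the names `simulate_lengths`, `log_moments`, `root_length`, and `sigma` are hypothetical rather than taken from the paper.

```python
import math
import random


def simulate_lengths(n_steps, root_length=300, sigma=0.3, seed=0):
    """Grow one history of protein lengths as a random recursive tree.

    Assumed rule (not from the paper): each step picks an existing protein
    uniformly at random and creates a child whose length is the parent's
    length times exp(N(0, sigma^2)), rounded to an integer of at least 1.
    """
    rng = random.Random(seed)
    lengths = [root_length]
    for _ in range(n_steps):
        parent = rng.choice(lengths)               # uniform choice of parent
        factor = math.exp(rng.gauss(0.0, sigma))   # multiplicative change
        child = max(1, round(parent * factor))     # stay on natural integers
        lengths.append(child)
    return lengths


def log_moments(lengths):
    """Return the mean and (population) variance of the log-lengths."""
    logs = [math.log(x) for x in lengths]
    n = len(logs)
    mean = sum(logs) / n
    var = sum((v - mean) ** 2 for v in logs) / n
    return mean, var


if __name__ == "__main__":
    history = simulate_lengths(10_000)
    mean, var = log_moments(history)
    print(f"proteins: {len(history)}, mean log-length: {mean:.3f}, variance: {var:.3f}")
```

Under these assumptions, a histogram of `math.log(x)` over one history should look roughly bell-shaped, which is the lognormal tendency the abstract refers to. The sketch does not attempt to reproduce the specific deviations from Gaussianity or the comparison with SIMAP.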
{
"annotation_id": "8082b888-1f57-48f3-b7e1-ed2212030fe8",
"date_created": "2026-03-02T18:01:34.806000Z",
"date_modified": "2026-03-02T18:01:34.806000Z",
"file_hash": "71834cd2070d87ef5a0cadead5986f1d3893cf7e61b2873cd4d83393bc2aaceb",
"private": false,
"record": {
"abstract": "We analyse a simple discrete-time stochastic process for the theoretical\nmodeling of the evolution of protein lengths. At every step of the process a\nnew protein is produced as a modification of one of the proteins already\nexisting and its length is assumed to be random variable which depends only on\nthe length of the originating protein. Thus a Random Recursive Trees (RRT) is\nproduced over the natural integers. If (quasi) scale invariance is assumed, the\nlength distribution in a single history tends to a lognormal form with a\nspecific signature of the deviations from exact gaussianity. Comparison with\nthe very large SIMAP protein database shows good agreement.",
"arxiv_id": "q-bio/0703054",
"authors": [
"C. Destri",
"C. Miccio"
],
"categories": [
"q-bio.PE",
"q-bio.QM"
],
"doi": "10.1103/PhysRevE.76.011924",
"title": "A simple stochastic model for the evolution of protein lengths",
"url": "https://arxiv.org/abs/q-bio/0703054"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "5594a62e-b46a-4456-baba-197fd3e29128",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}