dorsal/arxiv
View SchemaNon-extensive Trends in the Size Distribution of Coding and Non-coding DNA Sequences in the Human Genome
| Authors | Th. Oikonomou, A. Provata |
|---|---|
| Categories | |
| ArXiv ID | q-bio/0510021 |
| URL | https://arxiv.org/abs/q-bio/0510021 |
| DOI | 10.1140/epjb/e2006-00121-2 |
Abstract
We study the primary DNA structure of four of the most completely sequenced human chromosomes (including chromosome 19 which is the most dense in coding), using Non-extensive Statistics. We show that the exponents governing the decay of the coding size distributions vary between $5.2 \le r \le 5.7$ for the short scales and $1.45 \le q \le 1.50$ for the large scales. On the contrary, the exponents governing the decay of the non-coding size distributions in these four chromosomes, take the values $2.4 \le r \le 3.2$ for the short scales and $1.50 \le q \le 1.72$ for the large scales. This quantitative difference, in particular in the tail exponent $q$, indicates that the non-coding (coding) size distributions have long (short) range correlations. This non-trivial difference in the DNA statistics is attributed to the non-conservative (conservative) evolution dynamics acting on the non-coding (coding) DNA sequences.
{
"annotation_id": "0d8b37ff-ec18-4fe9-894e-7690bd964e37",
"date_created": "2026-03-02T18:01:32.331000Z",
"date_modified": "2026-03-02T18:01:32.331000Z",
"file_hash": "31e68c103d2cdc7aa4dbe934347f9adcd103af45795527bb77ea69c67598aa89",
"private": false,
"record": {
"abstract": "We study the primary DNA structure of four of the most completely sequenced\nhuman chromosomes (including chromosome 19 which is the most dense in coding),\nusing Non-extensive Statistics. We show that the exponents governing the decay\nof the coding size distributions vary between $5.2 \\le r \\le 5.7$ for the short\nscales and $1.45 \\le q \\le 1.50$ for the large scales. On the contrary, the\nexponents governing the decay of the non-coding size distributions in these\nfour chromosomes, take the values $2.4 \\le r \\le 3.2$ for the short scales and\n$1.50 \\le q \\le 1.72$ for the large scales. This quantitative difference, in\nparticular in the tail exponent $q$, indicates that the non-coding (coding)\nsize distributions have long (short) range correlations. This non-trivial\ndifference in the DNA statistics is attributed to the non-conservative\n(conservative) evolution dynamics acting on the non-coding (coding) DNA\nsequences.",
"arxiv_id": "q-bio/0510021",
"authors": [
"Th. Oikonomou",
"A. Provata"
],
"categories": [
"q-bio.GN"
],
"doi": "10.1140/epjb/e2006-00121-2",
"title": "Non-extensive Trends in the Size Distribution of Coding and Non-coding DNA Sequences in the Human Genome",
"url": "https://arxiv.org/abs/q-bio/0510021"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "e364d3dc-a415-469e-ab69-83568cbffd66",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}