dorsal/arxiv
View SchemaLong range correlations in DNA sequences
| Authors | A. K. Mohanty, A. V. S. S. Narayana Rao |
|---|---|
| Categories | |
| ArXiv ID | physics/0202075 |
| URL | https://arxiv.org/abs/physics/0202075 |
Abstract
The so called long range correlation properties of DNA sequences are studied using the variance analyses of the density distribution of a single or a group of nucleotides in a model independent way. This new method which was suggested earlier has been applied to extract slope parameters that characterize the correlation properties for several intron containing and intron less DNA sequences. An important aspect of all the DNA sequences is the properties of complimentarity by virtue of which any two complimentary distributions (like GA is complimentary to TC or G is complimentary to ATC) have identical fluctuations at all scales although their distribution functions need not be identical. Due to this complimentarity, the famous DNA walk representation whose statistical interpretation is still unresolved is shown to be a special case of the present formalism with a density distribution corresponding to a purine or a pyrimidine group. Another interesting aspect of most of the DNA sequences is that the factorial moments as a function of length exceed unity around a region where the variance versus length in a log-log plot shows a bending. This is a pure phenomenological observation which is found for several DNA sequences with a few exception. Therefore, this length scale has been used as an approximate measure to exclude the bending regions from the slope analyses. The asymmetries in the nucleotide contents or the patchy structure as a possible origin of the long range correlations has also been investigated.
{
"annotation_id": "942d84ed-dd95-4ff2-9d64-130337fb6a77",
"date_created": "2026-03-02T18:00:39.774000Z",
"date_modified": "2026-03-02T18:00:39.774000Z",
"file_hash": "36862df47b087111f57149af4e53e7102182efa3167326db07e630ab4081ea6e",
"private": false,
"record": {
"abstract": "The so called long range correlation properties of DNA sequences are studied\nusing the variance analyses of the density distribution of a single or a group\nof nucleotides in a model independent way. This new method which was suggested\nearlier has been applied to extract slope parameters that characterize the\ncorrelation properties for several intron containing and intron less DNA\nsequences. An important aspect of all the DNA sequences is the properties of\ncomplimentarity by virtue of which any two complimentary distributions (like GA\nis complimentary to TC or G is complimentary to ATC) have identical\nfluctuations at all scales although their distribution functions need not be\nidentical. Due to this complimentarity, the famous DNA walk representation\nwhose statistical interpretation is still unresolved is shown to be a special\ncase of the present formalism with a density distribution corresponding to a\npurine or a pyrimidine group. Another interesting aspect of most of the DNA\nsequences is that the factorial moments as a function of length exceed unity\naround a region where the variance versus length in a log-log plot shows a\nbending. This is a pure phenomenological observation which is found for several\nDNA sequences with a few exception. Therefore, this length scale has been used\nas an approximate measure to exclude the bending regions from the slope\nanalyses. The asymmetries in the nucleotide contents or the patchy structure as\na possible origin of the long range correlations has also been investigated.",
"arxiv_id": "physics/0202075",
"authors": [
"A. K. Mohanty",
"A. V. S. S. Narayana Rao"
],
"categories": [
"physics.bio-ph",
"q-bio"
],
"title": "Long range correlations in DNA sequences",
"url": "https://arxiv.org/abs/physics/0202075"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "1f90482d-7b8f-4c8e-a823-e0cf264bd0cd",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}