dorsal/arxiv
View SchemaMeasure representation and multifractal analysis of complete genomes
| Authors | Zu-Guo Yu, Vo Anh, Ka-Sing Lau |
|---|---|
| Categories | |
| ArXiv ID | physics/0108055 |
| URL | https://arxiv.org/abs/physics/0108055 |
| DOI | 10.1103/PhysRevE.64.031903 |
| Journal | Phys. Rev. E, Vol. 64, 031903 (2001) |
Abstract
This paper introduces the notion of measure representation of DNA sequences. Spectral analysis and multifractal analysis are then performed on the measure representations of a large number of complete genomes. The main aim of this paper is to discuss the multifractal property of the measure representation and the classification of bacteria. From the measure representations and the values of the $D_{q}$ spectra and related $C_{q}$ curves, it is concluded that these complete genomes are not random sequences. In fact, spectral analyses performed indicate that these measure representations considered as time series, exhibit strong long-range correlation. For substrings with length K=8, the $D_{q}$ spectra of all organisms studied are multifractal-like and sufficiently smooth for the $C_{q}$ curves to be meaningful. The $C_{q}$ curves of all bacteria resemble a classical phase transition at a critical point. But the 'analogous' phase transitions of chromosomes of non-bacteria organisms are different. Apart from Chromosome 1 of {\it C. elegans}, they exhibit the shape of double-peaked specific heat function.
{
"annotation_id": "6f9bb7bb-dd17-48d8-b1ee-b2a94aed2d69",
"date_created": "2026-03-02T18:00:35.384000Z",
"date_modified": "2026-03-02T18:00:35.384000Z",
"file_hash": "adf4a0b6d13d090eb8e3017a946f28b7eead891b1304cade965e3a74be677249",
"private": false,
"record": {
"abstract": "This paper introduces the notion of measure representation of DNA sequences.\nSpectral analysis and multifractal analysis are then performed on the measure\nrepresentations of a large number of complete genomes. The main aim of this\npaper is to discuss the multifractal property of the measure representation and\nthe classification of bacteria. From the measure representations and the values\nof the $D_{q}$ spectra and related $C_{q}$ curves, it is concluded that these\ncomplete genomes are not random sequences. In fact, spectral analyses performed\nindicate that these measure representations considered as time series, exhibit\nstrong long-range correlation. For substrings with length K=8, the $D_{q}$\nspectra of all organisms studied are multifractal-like and sufficiently smooth\nfor the $C_{q}$ curves to be meaningful. The $C_{q}$ curves of all bacteria\nresemble a classical phase transition at a critical point. But the \u0027analogous\u0027\nphase transitions of chromosomes of non-bacteria organisms are different. Apart\nfrom Chromosome 1 of {\\it C. elegans}, they exhibit the shape of double-peaked\nspecific heat function.",
"arxiv_id": "physics/0108055",
"authors": [
"Zu-Guo Yu",
"Vo Anh",
"Ka-Sing Lau"
],
"categories": [
"physics.bio-ph",
"q-bio"
],
"doi": "10.1103/PhysRevE.64.031903",
"journal_ref": "Phys. Rev. E, Vol. 64, 031903 (2001)",
"title": "Measure representation and multifractal analysis of complete genomes",
"url": "https://arxiv.org/abs/physics/0108055"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "e2c6d9f3-e7e6-4390-891c-8be641f5e7de",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}