dorsal/arxiv
View SchemaRecognition of an organism from fragments of its complete genome
| Authors | V. V. Anh, K. S. Lau, Z. G. Yu |
|---|---|
| Categories | |
| ArXiv ID | physics/0207060 |
| URL | https://arxiv.org/abs/physics/0207060 |
| DOI | 10.1103/PhysRevE.66.031910 |
| Journal | Phys. Rev. E, vol. 66, (2002) 031910. |
Abstract
This paper considers the problem of matching fragment to organism using its complete genome. Our method is based on the probability measure representation of a genome. We first demonstrate that these probability measures can be modelled as recurrent iterated function systems (RIFS) consisting of four contractive similarities. Our hypothesis is that the multifractal characteristic of the probability measure of a complete genome, as captured by the RIFS, is preserved in its reasonably long fragments. We compute the RIFS of fragments of various lengths and random starting points, and compare with that of the original sequence for recognition using the Euclidean distance. A demonstration on five randomly selected organisms supports the above hypothesis.
{
"annotation_id": "d0da82f6-3bc0-413c-b3e9-e8558d42df14",
"date_created": "2026-03-02T18:00:39.650000Z",
"date_modified": "2026-03-02T18:00:39.650000Z",
"file_hash": "8591d7884c165c26594cc965a491ebd9af9563c379dd736a2c30c7dac81b2503",
"private": false,
"record": {
"abstract": "This paper considers the problem of matching fragment to organism using its\ncomplete genome. Our method is based on the probability measure representation\nof a genome. We first demonstrate that these probability measures can be\nmodelled as recurrent iterated function systems (RIFS) consisting of four\ncontractive similarities. Our hypothesis is that the multifractal\ncharacteristic of the probability measure of a complete genome, as captured by\nthe RIFS, is preserved in its reasonably long fragments. We compute the RIFS of\nfragments of various lengths and random starting points, and compare with that\nof the original sequence for recognition using the Euclidean distance. A\ndemonstration on five randomly selected organisms supports the above\nhypothesis.",
"arxiv_id": "physics/0207060",
"authors": [
"V. V. Anh",
"K. S. Lau",
"Z. G. Yu"
],
"categories": [
"physics.bio-ph",
"q-bio"
],
"doi": "10.1103/PhysRevE.66.031910",
"journal_ref": "Phys. Rev. E, vol. 66, (2002) 031910.",
"title": "Recognition of an organism from fragments of its complete genome",
"url": "https://arxiv.org/abs/physics/0207060"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "6a124225-cccb-46d0-801a-2c1cc5707efe",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}