dorsal/arxiv
View SchemaQuasireplicas and universal lengths of microbial genomes
| Authors | Li-Ching Hsieh, Chang-Heng Chang, Liaofu Luo, Fengmin Ji, Hoong-Chien Lee |
|---|---|
| Categories | |
| ArXiv ID | physics/0309006 |
| URL | https://arxiv.org/abs/physics/0309006 |
Abstract
Statistical analysis of distributions of occurrence frequencies of short words in 108 microbial complete genomes reveals the existence of a set of universal "root-sequence lengths" shared by all microbial genomes. These lengths and their universality give powerful clues to the way microbial genomes are grown. We show that the observed genomic properties are explained by a model for genome growth in which primitive genomes grew mainly by maximally stochastic duplications of short segments from an initial length of about 200 nucleotides (nt) to a length of about one million nt typical of microbial genomes. The relevance of the result of this study to the nature of simultaneous random growth and information acquisition by genomes, to the so-called RNA world in which life evolved before the rise of proteins and enzymes and to several other topics are discussed.
{
"annotation_id": "8648358a-bfea-4755-ad47-d32afad2885e",
"date_created": "2026-03-02T18:00:46.557000Z",
"date_modified": "2026-03-02T18:00:46.557000Z",
"file_hash": "223fb363f30cb37f4668e96f690338c78849787fdf1d6a08578c8953e169bdff",
"private": false,
"record": {
"abstract": "Statistical analysis of distributions of occurrence frequencies of short\nwords in 108 microbial complete genomes reveals the existence of a set of\nuniversal \"root-sequence lengths\" shared by all microbial genomes. These\nlengths and their universality give powerful clues to the way microbial genomes\nare grown. We show that the observed genomic properties are explained by a\nmodel for genome growth in which primitive genomes grew mainly by maximally\nstochastic duplications of short segments from an initial length of about 200\nnucleotides (nt) to a length of about one million nt typical of microbial\ngenomes. The relevance of the result of this study to the nature of\nsimultaneous random growth and information acquisition by genomes, to the\nso-called RNA world in which life evolved before the rise of proteins and\nenzymes and to several other topics are discussed.",
"arxiv_id": "physics/0309006",
"authors": [
"Li-Ching Hsieh",
"Chang-Heng Chang",
"Liaofu Luo",
"Fengmin Ji",
"Hoong-Chien Lee"
],
"categories": [
"physics.bio-ph",
"q-bio.GN"
],
"title": "Quasireplicas and universal lengths of microbial genomes",
"url": "https://arxiv.org/abs/physics/0309006"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "fcb1a97a-bce2-44d5-9ce3-04630b810770",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}