dorsal/arxiv
View SchemaDNA Segmentation as A Model Selection Process
| Authors | Wentian Li |
|---|---|
| Categories | |
| ArXiv ID | physics/0104027 |
| URL | https://arxiv.org/abs/physics/0104027 |
| Journal | in RECOMB01: Proceedings of the Fifth Annual International Conference on Computational Biology, pp.201-210 (ACM Press, 2001) |
Abstract
Previous divide-and-conquer segmentation analyses of DNA sequences do not provide a satisfactory stopping criterion for the recursion. This paper proposes that segmentation be considered as a model selection process. Using the tools in model selection, a limit for the stopping criterion on the relaxed end can be determined. The Bayesian information criterion, in particular, provides a much more stringent stopping criterion than what is currently used. Such a stringent criterion can be used to delineate larger DNA domains. A relationship between the stopping criterion and the average domain size is empirically determined, which may aid in the determination of isochore borders.
{
"annotation_id": "931f3981-fa1c-45ca-84ce-538536dcd781",
"date_created": "2026-03-02T18:00:35.933000Z",
"date_modified": "2026-03-02T18:00:35.933000Z",
"file_hash": "ceed6a2002d01b7e2a0e92bbc6e01dc9425dc9dd18a93caf2a8dcc9cc4925113",
"private": false,
"record": {
"abstract": "Previous divide-and-conquer segmentation analyses of DNA sequences do not\nprovide a satisfactory stopping criterion for the recursion. This paper\nproposes that segmentation be considered as a model selection process. Using\nthe tools in model selection, a limit for the stopping criterion on the relaxed\nend can be determined. The Bayesian information criterion, in particular,\nprovides a much more stringent stopping criterion than what is currently used.\nSuch a stringent criterion can be used to delineate larger DNA domains. A\nrelationship between the stopping criterion and the average domain size is\nempirically determined, which may aid in the determination of isochore borders.",
"arxiv_id": "physics/0104027",
"authors": [
"Wentian Li"
],
"categories": [
"physics.bio-ph",
"physics.data-an",
"q-bio.GN"
],
"journal_ref": "in RECOMB01: Proceedings of the Fifth Annual International\n Conference on Computational Biology, pp.201-210 (ACM Press, 2001)",
"title": "DNA Segmentation as A Model Selection Process",
"url": "https://arxiv.org/abs/physics/0104027"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "a3185a67-27db-4092-9f8f-ffa4884ee22d",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}