dorsal/arxiv
View SchemaPattern overlapping decomposition by Cumulative Local Cross-Correlation
| Authors | Simon Kogan |
|---|---|
| Categories | |
| ArXiv ID | q-bio/0310033 |
| URL | https://arxiv.org/abs/q-bio/0310033 |
Abstract
Background Nucleotide sequences contain multiple codes responsible for organism's functioning and structure. They can be investigated by various signal processing methods. These techniques are well suited for indication of frequently encountered sequence motifs (i.e., repeats). However, if there are two or more codes containing the same motif, the local nucleotide distribution (i.e., profile), resulting from sequence alignment by the motif position, will represent overlapping of the code patterns. Results The novel algorithm for decomposition of pattern overlapping is proposed. It is capable to work with dispersed repeats as well. The algorithm is based on cross-correlation procedure applied locally in a cumulative fashion. Its sensitivity was tested on human genomic sequences. Conclusions Cumulative Local Cross-Correlation was successfully used to decompose overlapping of nucleotide patterns in human genomic sequences. Being very general technique (as general as cross-correlation), it can be easily adopted in other signal processing applications and naturally extended for multidimensional cases. Software implementation of the algorithm is available on request from the authors.
{
"annotation_id": "2cac9c5f-9e33-4fca-b56f-ed8cc1b453a1",
"date_created": "2026-03-02T18:01:28.772000Z",
"date_modified": "2026-03-02T18:01:28.772000Z",
"file_hash": "7763a8622869d63c28ca8ef42229fb6fd8d23765bbf4c59cd683d085de18b473",
"private": false,
"record": {
"abstract": "Background\n Nucleotide sequences contain multiple codes responsible for organism\u0027s\nfunctioning and structure. They can be investigated by various signal\nprocessing methods. These techniques are well suited for indication of\nfrequently encountered sequence motifs (i.e., repeats). However, if there are\ntwo or more codes containing the same motif, the local nucleotide distribution\n(i.e., profile), resulting from sequence alignment by the motif position, will\nrepresent overlapping of the code patterns.\n Results\n The novel algorithm for decomposition of pattern overlapping is proposed. It\nis capable to work with dispersed repeats as well. The algorithm is based on\ncross-correlation procedure applied locally in a cumulative fashion. Its\nsensitivity was tested on human genomic sequences.\n Conclusions\n Cumulative Local Cross-Correlation was successfully used to decompose\noverlapping of nucleotide patterns in human genomic sequences. Being very\ngeneral technique (as general as cross-correlation), it can be easily adopted\nin other signal processing applications and naturally extended for\nmultidimensional cases.\n Software implementation of the algorithm is available on request from the\nauthors.",
"arxiv_id": "q-bio/0310033",
"authors": [
"Simon Kogan"
],
"categories": [
"q-bio.QM",
"q-bio.BM"
],
"title": "Pattern overlapping decomposition by Cumulative Local Cross-Correlation",
"url": "https://arxiv.org/abs/q-bio/0310033"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "4ad09e3f-fe30-4a21-874a-32c7225a979d",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}