dorsal/arxiv
View SchemaOn data analysis and variable selection: the minimum entropy analysis
| Authors | Chih-Yuan Tseng, Chien-Chih CHen |
|---|---|
| Categories | |
| ArXiv ID | physics/0609250 |
| URL | https://arxiv.org/abs/physics/0609250 |
Abstract
In this work, we present a minimum entropy analysis scheme for variable selection and preliminary data analysis. The variable selection can be achieved by the increasing preference of variables. We show such a preference to has a unqiue form, which is given by the entropy of models associated with variables. Evaluating the entropy provides a complete ranking scheme of variables. This scheme not only indicates preferred variables but also may reveal the system's nature and properties. We illustrate the proposed scheme to analyze a set of geological data for three carbonate rock units in Texas and Oklahoma, and compare to the discriminant function analysis. The result suggests this scheme to provide a quick and robust analysis, and the use in data analysis is promising.
{
"annotation_id": "9f7cdfbd-d7ea-460a-8b6c-5ff9039e37f0",
"date_created": "2026-03-02T18:01:14.465000Z",
"date_modified": "2026-03-02T18:01:14.465000Z",
"file_hash": "f108cf7e7a7d9f3bccff81a83678d3f1380e38e52a331e641ed52e27d58d019c",
"private": false,
"record": {
"abstract": "In this work, we present a minimum entropy analysis scheme for variable\nselection and preliminary data analysis. The variable selection can be achieved\nby the increasing preference of variables. We show such a preference to has a\nunqiue form, which is given by the entropy of models associated with variables.\nEvaluating the entropy provides a complete ranking scheme of variables. This\nscheme not only indicates preferred variables but also may reveal the system\u0027s\nnature and properties. We illustrate the proposed scheme to analyze a set of\ngeological data for three carbonate rock units in Texas and Oklahoma, and\ncompare to the discriminant function analysis. The result suggests this scheme\nto provide a quick and robust analysis, and the use in data analysis is\npromising.",
"arxiv_id": "physics/0609250",
"authors": [
"Chih-Yuan Tseng",
"Chien-Chih CHen"
],
"categories": [
"physics.data-an",
"physics.geo-ph"
],
"title": "On data analysis and variable selection: the minimum entropy analysis",
"url": "https://arxiv.org/abs/physics/0609250"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "5c985bf8-a90a-4bf0-be83-e23ed20d94cd",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}