dorsal/arxiv
View SchemaVirtual Data in CMS Analysis
| Authors | A. Arbree, P. Avery, D. Bourilkov, R. Cavanaugh, J. Rodriguez, G. Graham, M. Wilde, Y. Zhao |
|---|---|
| Categories | |
| ArXiv ID | physics/0306008 |
| URL | https://arxiv.org/abs/physics/0306008 |
| Journal | ECONFC0303241:TUAT010,2003 |
Abstract
The use of virtual data for enhancing the collaboration between large groups of scientists is explored in several ways: - by defining ``virtual'' parameter spaces which can be searched and shared in an organized way by a collaboration of scientists in the course of their analysis; - by providing a mechanism to log the provenance of results and the ability to trace them back to the various stages in the analysis of real or simulated data; - by creating ``check points'' in the course of an analysis to permit collaborators to explore their own analysis branches by refining selections, improving the signal to background ratio, varying the estimation of parameters, etc.; - by facilitating the audit of an analysis and the reproduction of its results by a different group, or in a peer review context. We describe a prototype for the analysis of data from the CMS experiment based on the virtual data system Chimera and the object-oriented data analysis framework ROOT. The Chimera system is used to chain together several steps in the analysis process including the Monte Carlo generation of data, the simulation of detector response, the reconstruction of physics objects and their subsequent analysis, histogramming and visualization using the ROOT framework.
{
"annotation_id": "6fec9eb3-ad6f-4bd5-8c0f-93ea7c8a6f31",
"date_created": "2026-03-02T18:00:43Z",
"date_modified": "2026-03-02T18:00:43Z",
"file_hash": "a8474a767de9ac5de3b527b2e33f52ef57d3ba51282db908358a294297acbb31",
"private": false,
"record": {
"abstract": "The use of virtual data for enhancing the collaboration between large groups\nof scientists is explored in several ways:\n - by defining ``virtual\u0027\u0027 parameter spaces which can be searched and shared\nin an organized way by a collaboration of scientists in the course of their\nanalysis;\n - by providing a mechanism to log the provenance of results and the ability\nto trace them back to the various stages in the analysis of real or simulated\ndata;\n - by creating ``check points\u0027\u0027 in the course of an analysis to permit\ncollaborators to explore their own analysis branches by refining selections,\nimproving the signal to background ratio, varying the estimation of parameters,\netc.;\n - by facilitating the audit of an analysis and the reproduction of its\nresults by a different group, or in a peer review context.\n We describe a prototype for the analysis of data from the CMS experiment\nbased on the virtual data system Chimera and the object-oriented data analysis\nframework ROOT. The Chimera system is used to chain together several steps in\nthe analysis process including the Monte Carlo generation of data, the\nsimulation of detector response, the reconstruction of physics objects and\ntheir subsequent analysis, histogramming and visualization using the ROOT\nframework.",
"arxiv_id": "physics/0306008",
"authors": [
"A. Arbree",
"P. Avery",
"D. Bourilkov",
"R. Cavanaugh",
"J. Rodriguez",
"G. Graham",
"M. Wilde",
"Y. Zhao"
],
"categories": [
"physics.data-an",
"hep-ex"
],
"journal_ref": "ECONFC0303241:TUAT010,2003",
"title": "Virtual Data in CMS Analysis",
"url": "https://arxiv.org/abs/physics/0306008"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "975c8aa1-c406-4e13-bd88-15e48ce8889a",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}