dorsal/arxiv
View SchemaThe CAVES Project - Exploring Virtual Data Concepts for Data Analysis
| Authors | Dimitri Bourilkov |
|---|---|
| Categories | |
| ArXiv ID | physics/0401007 |
| URL | https://arxiv.org/abs/physics/0401007 |
Abstract
The Collaborative Analysis Versioning Environment System (CAVES) project concentrates on the interactions between users performing data and/or computing intensive analyses on large data sets, as encountered in many contemporary scientific disciplines. In modern science increasingly larger groups of researchers collaborate on a given topic over extended periods of time. The logging and sharing of knowledge about how analyses are performed or how results are obtained is important throughout the lifetime of a project. Here is where virtual data concepts play a major role. The ability to seamlessly log, exchange and reproduce results and the methods, algorithms and computer programs used in obtaining them enhances in a qualitative way the level of collaboration in a group or between groups in larger organizations. The CAVES project takes a pragmatic approach in assessing the needs of a community of scientists by building series of prototypes with increasing sophistication. In extending the functionality of existing data analysis packages with virtual data capabilities these prototypes provide an easy and habitual entry point for researchers to explore virtual data concepts in real life applications and to provide valuable feedback for refining the system design. The architecture is modular based on Web, Grid and other services which can be plugged in as desired. As a proof of principle we build a first system by extending the very popular data analysis framework ROOT, widely used in high energy physics and other fields, making it virtual data enabled.
{
"annotation_id": "29ec3131-22c0-4b25-9bc2-393c404733f0",
"date_created": "2026-03-02T18:00:46.911000Z",
"date_modified": "2026-03-02T18:00:46.911000Z",
"file_hash": "fe90d3d72c97872ab37a99d0d18e01eba9eb8bca3f2f47ac5e9529c25a508b63",
"private": false,
"record": {
"abstract": "The Collaborative Analysis Versioning Environment System (CAVES) project\nconcentrates on the interactions between users performing data and/or computing\nintensive analyses on large data sets, as encountered in many contemporary\nscientific disciplines. In modern science increasingly larger groups of\nresearchers collaborate on a given topic over extended periods of time. The\nlogging and sharing of knowledge about how analyses are performed or how\nresults are obtained is important throughout the lifetime of a project. Here is\nwhere virtual data concepts play a major role. The ability to seamlessly log,\nexchange and reproduce results and the methods, algorithms and computer\nprograms used in obtaining them enhances in a qualitative way the level of\ncollaboration in a group or between groups in larger organizations.\n The CAVES project takes a pragmatic approach in assessing the needs of a\ncommunity of scientists by building series of prototypes with increasing\nsophistication. In extending the functionality of existing data analysis\npackages with virtual data capabilities these prototypes provide an easy and\nhabitual entry point for researchers to explore virtual data concepts in real\nlife applications and to provide valuable feedback for refining the system\ndesign. The architecture is modular based on Web, Grid and other services which\ncan be plugged in as desired. As a proof of principle we build a first system\nby extending the very popular data analysis framework ROOT, widely used in high\nenergy physics and other fields, making it virtual data enabled.",
"arxiv_id": "physics/0401007",
"authors": [
"Dimitri Bourilkov"
],
"categories": [
"physics.data-an",
"hep-ex"
],
"title": "The CAVES Project - Exploring Virtual Data Concepts for Data Analysis",
"url": "https://arxiv.org/abs/physics/0401007"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "693eadfa-e00b-4fc9-9798-bb2da3f4bdb6",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}