dorsal/arxiv
View SchemaMulti-Terabyte EIDE Disk Arrays running Linux RAID5
| Authors | D. A. Sanders, L. M. Cremaldi, V. Eschenburg, R. Godang, M. D. Joy, D. J. Summers, D. L. Petravick |
|---|---|
| Categories | |
| ArXiv ID | physics/0411188 |
| URL | https://arxiv.org/abs/physics/0411188 |
Abstract
High-energy physics experiments are currently recording large amounts of data and in a few years will be recording prodigious quantities of data. New methods must be developed to handle this data and make analysis at universities possible. Grid Computing is one method; however, the data must be cached at the various Grid nodes. We examine some storage techniques that exploit recent developments in commodity hardware. Disk arrays using RAID level 5 (RAID-5) include both parity and striping. The striping improves access speed. The parity protects data in the event of a single disk failure, but not in the case of multiple disk failures. We report on tests of dual-processor Linux Software RAID-5 arrays and Hardware RAID-5 arrays using a 12-disk 3ware controller, in conjunction with 250 and 300 GB disks, for use in offline high-energy physics data analysis. The price of IDE disks is now less than $1/GB. These RAID-5 disk arrays can be scaled to sizes affordable to small institutions and used when fast random access at low cost is important.
{
"annotation_id": "b8619f25-bafa-48c4-9061-ca98000db619",
"date_created": "2026-03-02T18:00:53.631000Z",
"date_modified": "2026-03-02T18:00:53.631000Z",
"file_hash": "6b74011b65d8729251e894ce1051217f2c4a286ea4cab828fc0770071414a170",
"private": false,
"record": {
"abstract": "High-energy physics experiments are currently recording large amounts of data\nand in a few years will be recording prodigious quantities of data. New methods\nmust be developed to handle this data and make analysis at universities\npossible. Grid Computing is one method; however, the data must be cached at the\nvarious Grid nodes. We examine some storage techniques that exploit recent\ndevelopments in commodity hardware. Disk arrays using RAID level 5 (RAID-5)\ninclude both parity and striping. The striping improves access speed. The\nparity protects data in the event of a single disk failure, but not in the case\nof multiple disk failures.\n We report on tests of dual-processor Linux Software RAID-5 arrays and\nHardware RAID-5 arrays using a 12-disk 3ware controller, in conjunction with\n250 and 300 GB disks, for use in offline high-energy physics data analysis. The\nprice of IDE disks is now less than $1/GB. These RAID-5 disk arrays can be\nscaled to sizes affordable to small institutions and used when fast random\naccess at low cost is important.",
"arxiv_id": "physics/0411188",
"authors": [
"D. A. Sanders",
"L. M. Cremaldi",
"V. Eschenburg",
"R. Godang",
"M. D. Joy",
"D. J. Summers",
"D. L. Petravick"
],
"categories": [
"physics.data-an",
"hep-ex",
"physics.comp-ph",
"physics.ins-det"
],
"title": "Multi-Terabyte EIDE Disk Arrays running Linux RAID5",
"url": "https://arxiv.org/abs/physics/0411188"
},
"schema_id": "dorsal/arxiv",
"source": {
"execution_id": "33765f9b-7e7e-4c49-9531-61abdd26d329",
"id": "arXiv Dataset IDs",
"type": "Model",
"variant": "snapshot-2026-03-01",
"version": "0.1.0"
},
"user_id": 1000002
}