Interpretation: single-fault experiments
Linux and Windows take opposite approaches to managing benign and transient faults
- these faults do not necessarily imply a failing disk
  - Tertiary Disk: 368/368 disks had transient SCSI errors and 13/368 had transient hardware errors, but only 2/368 needed replacing
- Linux is paranoid and stops using a disk on any error
  - fragile: system is more vulnerable to multiple faults
  - but no chance of a slowly failing disk degrading performance
- Windows ignores most benign/transient faults
  - robust: less likely to lose data, more disk-efficient
  - but less likely to catch slowly failing disks and remove them
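
The two policies can be contrasted with a toy model. This is only a sketch of the behaviour described in the bullets, not the actual Linux or Windows software RAID code; the names `Disk`, `paranoid_policy`, `tolerant_policy`, and `replay` are hypothetical, and treating every non-transient fault as fatal under the tolerant policy is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class Disk:
    """Hypothetical disk state: error count and whether the array still uses it."""
    errors: int = 0
    active: bool = True


def paranoid_policy(disk: Disk, transient: bool) -> None:
    """Linux-style (per the slide): stop using the disk on any error."""
    disk.errors += 1
    disk.active = False


def tolerant_policy(disk: Disk, transient: bool) -> None:
    """Windows-style (per the slide): ignore transient faults.

    Assumption: a non-transient fault removes the disk.
    """
    disk.errors += 1
    if not transient:
        disk.active = False


def replay(policy: Callable[[Disk, bool], None], faults: Iterable[bool]) -> Disk:
    """Apply a fault sequence (True = transient) until the disk is dropped."""
    disk = Disk()
    for transient in faults:
        if not disk.active:
            break  # a dropped disk sees no further I/O, so no further faults
        policy(disk, transient)
    return disk


if __name__ == "__main__":
    transient_only = [True, True, True]
    print("paranoid:", replay(paranoid_policy, transient_only))
    print("tolerant:", replay(tolerant_policy, transient_only))
```

With a run of purely transient faults, the paranoid policy drops the disk at the first one (leaving the array exposed to a second fault), while the tolerant policy keeps the disk in service and simply accumulates errors, which is also why it can miss a disk that is failing slowly.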