Summary and Conclusion
Disks don’t fail very often
- In the 10 months of logs, only two disks failed
- We have only 2 data points for these conclusions!
We can predict disk failures and other kinds of failures with enough time to do something about it
There are correlations between the logged messages:
- Hardware Failure Messages on one disk device propagates as Time Out Messages on:
- not only the failing disk,
- but also other disks on the same SCSI bus