The Need for 24x7 Availability
Today’s widely deployed systems can’t provide 24x7 fault- and performance-tolerance
- they rely on manual administration
- static data and application partitioning
- human detection of and response to most anomalous behaviors and changes in system environment
- human administrators are too expensive, too slow, too prone to mistakes
- Jim Gray reports 42% of Tandem failures due to administrator error (in 1985)
Tomorrow’s ever-growing infrastructure systems need to be self-maintaining
- self-maintaining systems anticipate problems and handle them as they arise, automatically