Failing disk: return to normal operation
Global fault handling mechanism...
- Rebuilds data redundancy
- By allocating space for a new replica on a functioning disk and copying data to it from existing replicas
- Using an application-specific data replication mechanism
- Where to allocate new replicas, how to copy data, how to lay out data for new replicas, how to update global directory
- Example in upcoming slide
Life returns to normal
- Degree of fault-tolerance has been restored
Failed component can be replaced during regularly-scheduled maintenance