An open-source cluster management framework designed to automate the provisioning, management, and scaling of distributed systems.
Its primary responsibilities are:
- Resource management
- Fault tolerance
- Scalability
- Lifecycle management
It automates failure detection, leader election, and resource partition management.
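
To make these three automated tasks concrete, here is a minimal sketch in Python. It is a toy in-memory model, not the framework's actual API: the `ToyCluster` class, its method names, the heartbeat timeout, the "smallest name wins" election rule, and the round-robin partition placement are all assumptions chosen for illustration. A real system would typically rely on a coordination service and a more careful rebalancing strategy.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ToyCluster:
    """Hypothetical in-memory model; not the framework's real API."""
    num_partitions: int
    timeout: float  # seconds without a heartbeat before a node counts as failed
    last_seen: Dict[str, float] = field(default_factory=dict)

    def heartbeat(self, node: str, now: float) -> None:
        # Record the time of the most recent heartbeat from this node.
        self.last_seen[node] = now

    def live_nodes(self, now: float) -> List[str]:
        # Failure detection: a node is live if its last heartbeat is recent enough.
        return sorted(n for n, t in self.last_seen.items() if now - t <= self.timeout)

    def leader(self, now: float) -> Optional[str]:
        # Leader election (toy rule): the smallest node name among live nodes.
        live = self.live_nodes(now)
        return live[0] if live else None

    def assignment(self, now: float) -> Dict[int, str]:
        # Partition management: spread partitions round-robin over live nodes.
        live = self.live_nodes(now)
        if not live:
            return {}
        return {p: live[p % len(live)] for p in range(self.num_partitions)}


cluster = ToyCluster(num_partitions=4, timeout=5.0)
for name in ("node-a", "node-b", "node-c"):
    cluster.heartbeat(name, now=0.0)

# At t=3 every node is still within the timeout.
before = cluster.assignment(now=3.0)
old_leader = cluster.leader(now=3.0)

# node-b and node-c keep sending heartbeats; node-a goes silent.
cluster.heartbeat("node-b", now=4.0)
cluster.heartbeat("node-c", now=4.0)

# At t=8 node-a has timed out, so leadership and its partitions move.
after = cluster.assignment(now=8.0)
new_leader = cluster.leader(now=8.0)
```

Time is passed in explicitly as `now` so the example is deterministic. Once `node-a` stops sending heartbeats, it drops out of the live set, and both the leader and the partition assignment are recomputed from the remaining nodes without manual intervention.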