South Jakarta, South Jakarta City, Jakarta, Indonesia
- Steering the management of SLAs, SLOs, and SLIs for optimal service delivery.
- Monitoring the production area to make sure all services are running smoothly.
- Crafting comprehensive and easy-to-understand documentation detailing services and infrastructure.
- Designing informative runbooks that demystify incident mitigation and root cause analysis.
- Designing custom alerts and dashboards to empower engineers with real-time monitoring tools.
- Pioneering the research and development of innovative monitoring methodologies and tools.
- Performing meticulous housekeeping on all alert triggers, monitoring systems, and dashboards to ensure a clean workspace.
- Constantly seeking enhancements and efficiencies from a TechOps perspective.
- Unearthing, analyzing, and diligently following up on the root causes of incidents, providing comprehensive reports to the engineering manager.