The SRE’s primary goals is to ensure the platform reliability, scalability, and availability. Key success metrics include system health and availability, ability to scale infrastructure quickly and appropriately, cost to run services, quality and platform health. Responsibilities: develop internal automation - monitoring, setup, statistics setup automatic systems to control infrastructure monitor live production systems health first-aid reaction to infrastructure / platform failures deal with pr
3 years of experience required
No management responsibility