动 Site Reliability Engineer~Singapore, Singapore On-duty and stability operation Responsible for the monitoring and alarming, emergency response, capacity planning and new IDC deployment of the multi-region media processing platform. Analyze on-duty data, use indicators such as SLI, SLO, MTTR , etc... to make decisions, improve alarm rules, shorten fault recovery time, and optimize system stability and on-duty experience. Platform development Develop an automatic analysis system for accidents to assist decision-making and handling measures during the emergency response process, such as load reduction, rate limiting or traffic switching. Develop an
Full-time / Interested in working remotely
National Taipei University of Technology・
資訊工程學系