We're looking for a Site Reliability Engineer (SRE) to join a fast-paced team in a leading internet company. In this role, you'll ensure our systems are stable, perform well, and can scale as needed. You’ll bridge the gap between development and operations, combining software and systems engineering to maintain large-scale, reliable systems.
If you’re passionate about making systems more reliable and efficient, this could be the perfect fit for you.
Responsibilities:
- Maintain the reliability and availability of our systems and services, ensuring smooth 24/7 operations.
- Monitor and optimize system performance to meet service level agreements (SLAs).
- Lead incident management and perform root cause analysis to prevent future issues.
- Develop automation tools to improve deployment, monitoring, and operational efficiency.
- Work with development teams to design scalable, reliable systems and assist in new feature development.
- Communicate with stakeholders in China to ensure clear understanding of technical needs and solutions.
- Drive continuous improvement in system architecture, processes, and tools as we scale.
Qualifications:
- 4-7 years of experience as a Site Reliability Engineer, DevOps Engineer, or in a similar role.
- Proficiency in English and Chinese (both written and spoken). You’ll be working closely with stakeholders in China and Taiwan, so you need to be fluent in both.
- Strong skills in programming/scripting (e.g., Python, Go, Shell).
- Strong hands-on experience with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes).
- Hands-on experience with CI/CD pipelines, configuration management, and monitoring/logging tools.
- Exposure to Machine Learning applications and models is highly advantageous
- Strong problem-solving skills and the ability to troubleshoot complex distributed systems.
- Excellent communication skills for working with cross-functional teams and stakeholders.
- Ability to work effectively in a multicultural environment.
License No.22S1076 | EA Reg.: R1330171