Description of the Company’s background
Xiaomi Corporation was founded in April 2010 and listed on the Main Board of the Hong Kong Stock Exchange on July 9, 2018. Xiaomi is a consumer electronics and smart manufacturing company with smartphones and smart hardware connected by an IoT platform at its core.
Embracing our vision of “Make friends with users and be the coolest company in the users’ hearts”, Xiaomi continuously pursues innovations, high-quality user experience and operational efficiency. The company relentlessly builds amazing products with honest prices to let everyone in the world enjoy a better life through innovative technology.
Xiaomi is one of the world's leading smartphone companies. The company’s market share in terms of smartphone shipments ranked no. 2 globally in the second quarter of 2021. The company has also established the world’s leading consumer AIoT (AI+IoT) platform, with 374.5 million smart devices connected to its platform (excluding smartphones and laptops) as of 31 March 31, 2021, excluding smartphones and laptops. Xiaomi products are present in more than 100 countries and regions around the world.
Job Description
1. Responsible for resource delivery, system capacity management, monitoring, fault handling and other daily operations related to internal systems and Internet-facing services.
2. Participate in the review of technical and system design plans, understand the technology architecture and principles, identify risks proactively and provide professional solutions.
3. Communicate with relevant stakeholders to provide feedback and promote timely improvements according to existing problems in the business operating environment.
4. Participate in the design and development of internal system features and contribute to the optimization of the system based on BU’s requests.
5. Responsible for operational data analysis, quality analysis, budgeting and other operation work.
6. You will participate in the team’s on-call rotation (24x7), assist with triaging, and addressing production issues, and respond to incidents.
Job Requirements
1. Bachelor's degree in Computer Science, Engineering or related field, or equivalent practical experience.
2. Experience in Linux or Unix-like operating systems. Strong command of computer science fundamentals: data structures, algorithms, programming languages, distributed systems, and information retrieval.
3. Experience in one of the following: Python, Go or shell scripting. Experience in Public Cloud, AWS and/or Azure.
4. Familiarity with Incident Response programs and processes; including triaging and resolving production incidents at an organization with challenging SLAs and customer expectations.
5. A passion for reliability, scaling patterns, up-time, and availability. Flexibility to work non-business hours that may include weekends and/or holidays.
6. Bonus Points: Experience maintaining Internet-facing production-grade application.