As a Junior Site Reliability Engineer, you will assist in operating and maintaining LANDI Global's infrastructure. Your responsibilities will include supporting the platform's reliability and performance while learning from senior engineers.
Key Responsibilities:
- Help build and maintain platform infrastructure across various environments.
- Collaborate with the R&D team to ensure platform availability and scalability.
- Assist in implementing monitoring and alerting systems for timely issue resolution.
- Support the maintenance of Disaster Recovery plans for business continuity.
- Analyze performance metrics and contribute to cost optimization strategies.
- Participate in automated testing and CI/CD processes, and help improve deployment efficiency.
- Help manage incident reporting and change management processes.
- Provide operational support for platforms and assist with production issues.
- Participate in a 24/7 on-call rotation.
- Support environment deployments for new client onboarding.
Qualifications:
- Bachelor’s degree in Computer Science or a related field, or equivalent experience.
- Basic understanding of cloud technologies (AWS, Azure).
- Familiarity with Linux/Unix operating systems and scripting languages.
- Exposure to monitoring tools (e.g., Prometheus, Grafana) and CI/CD tools (e.g., Jenkins).
- Strong communication skills in English.
Preferred Skills:
- Basic experience with configuration management tools (e.g., Ansible).
- Familiarity with SQL databases (e.g., PostgreSQL, MySQL) and load balancers (e.g., Nginx).