As a Junior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our organization's application systems and services. You will work at the intersection of software engineering and IT operations, applying best practices to make those systems more resilient and scalable. Your responsibilities will include designing and building resilient systems, monitoring for issues, and automating incident response and prevention.
Responsibilities:
- Research, assess, and implement new technologies and practices to improve the robustness (i.e., reliability and scalability) of our application systems.
- Use scripting languages to automate tasks and enhance system performance.
- Manage and maintain Linux servers and associated infrastructure components, ensuring uptime, security, and scalability.
- Participate in system architecture design, deployment, and ongoing maintenance, with an emphasis on security best practices.
- Troubleshoot and resolve issues related to software, servers, and infrastructure.
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- At least 1-2 years of work experience in a role related to system administration, DevOps, or software engineering.
- Proficiency in scripting languages (e.g., Python, Shell, Perl).
- Experience in system administration (sysadmin) and server management, particularly with Linux.
- Experience in designing and building reliable, scalable systems.
- Knowledge of monitoring and alerting solutions (e.g., Prometheus, Grafana).
- Effective communication and collaboration abilities within a team environment.