This role involves implementing and maintaining CI/CD pipelines, managing cloud infrastructure, and providing operational support.
Responsibilities
- Implement and manage continuous integration and continuous deployment (CI/CD) pipelines to streamline software delivery.
- Utilize cloud platforms Azure, or GCP to manage and optimize cloud infrastructure.
- Automate deployment, monitoring, and infrastructure management tasks to improve efficiency and reliability.
- Continuously monitor system performance and reliability, responding to incidents as needed.
- Work closely with development teams to ensure smooth and reliable software releases.
- Ensure security best practices are followed across infrastructure and applications.
- Create and maintain detailed documentation of processes and procedures.
Skills Requirement
- Bachelor’s degree in Computer Science/Information Technology/Engineering/related field.
- 5-8 years of experience in a DevOps or as an SRE.
- Proficiency with cloud platforms (Azure, GCP).
- Strong scripting skills - Python, Bash, Shell.
- Deep understanding of CI/CD principles and tools (e.g., Jenkins, GitLab CI/CD).
- Experience with containerization and orchestration tools (Docker, Kubernetes).
- Knowledge of version control systems (e.g., Git).
- Familiarity with infrastructure as code (IaC) tools (e.g., Terraform, Ansible).
- Proficiency with monitoring tools like Prometheus, Grafana, and the ELK stack (Elasticsearch, Logstash, Kibana) to gain insights into system performance and reliability.
- Experience with managing and optimizing databases (SQL and NoSQL) to ensure data integrity and performance.
- Strong understanding of networking concepts, including DNS, load balancing, and network security. Knowledge of network configurations and service discovery tools (e.g., Consul, Envoy).
- Familiar with programming languages such as Go, Java, or Ruby.
- Experience with configuration management tools like Chef, Puppet, or SaltStack.
- Good in incident response and management, including root cause analysis and post-mortem documentation.
- Understanding of distributed computing and cloud-native applications.
- Excellent problem-solving and troubleshooting skills.