Job Summary:
We are seeking a skilled and experienced Operations Engineer to join our team. The ideal candidate will be responsible for managing and optimizing cloud-based infrastructure and services across AWS and Azure platforms. This role requires proficiency in Infrastructure-as-Code (IaC) tools, cloud resource management, automation, monitoring, and ensuring security and compliance across cloud environments.
Roles and Responsibilities:
Infrastructure Management: Deploy, configure, and manage cloud resources using IaC tools such as Terraform or CloudFormation. Monitor and maintain cloud infrastructure to ensure optimal performance, availability, and security. Implement automation and orchestration strategies to streamline deployment and scaling processes.
Cloud Services Administration: Administer various cloud services, including compute instances, storage solutions, databases, networking components, and serverless offerings hosted on AWS and Azure. Optimize resource utilization to ensure efficient cloud service delivery and effective cost management.
Monitoring and Incident Response: Set up monitoring and alerting mechanisms to promptly identify and address performance issues and security vulnerabilities. Collaborate with the incident response team to troubleshoot and resolve operational incidents, ensuring minimal service disruption.
Security and Compliance: Implement security best practices to protect cloud environments from potential threats and vulnerabilities. Assist in conducting security audits and assessments to maintain compliance with industry regulations and standards.
Collaboration and Documentation: Collaborate with cross-functional teams (DevOps, development, security) to understand requirements and align cloud infrastructure accordingly. Maintain comprehensive documentation related to cloud infrastructure, processes, and procedures.
Continuous Improvement: Stay up to date with industry trends and advancements in cloud technologies. Identify opportunities for automation, optimization, and process improvement to enhance cloud operations.
Qualifications and Requirements:
Education:
Bachelor's degree in computer science, Information Technology, or a related field, or equivalent experience.
Experience:
At least 3-5 years of experience in cloud operations, managing cloud-based infrastructure, and deploying cloud services.
Cloud Platforms:
Proficiency in cloud platforms such as AWS and/or Microsoft Azure.
Infrastructure-as-Code:
Strong understanding of Infrastructure-as-Code (IaC) principles and tools (e.g., Terraform, CloudFormation).
Scripting/Automation:
Familiarity with scripting and automation using languages such as Python, PowerShell, Bash, etc.
Monitoring:
Experience with monitoring and log aggregation tools for proactive issue detection (e.g., CloudWatch, Azure Monitor).
Security Best Practices:
Knowledge of security best practices and experience implementing security controls in cloud environments.
Problem Solving:
Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
Communication:
Strong communication and collaboration skills to work effectively with diverse teams.
Certifications:
AWS Cloud Practitioner, AWS Architect certifications are a plus.
Skills:
Python and associated libraries (e.g., pandas, pyspark)
Frameworks – React/FLASK