Email to: [email protected] (89-001-DevOps Specialist - System Automation Specialist [Project: Cloud File Transfer (CFT)])
Project: Cloud File Transfer (CFT)
Responsibilities:
• Spearhead cloud operations with a strong focus on monitoring, performance tuning, and release management within AWS environments.
• Ensure L2 incident management and escalation procedures are robust and proactive, prioritizing multiple issues effectively.
• Coordinate with internal and external teams to swiftly resolve application and security incidents in line with SLAs.
• Develop and refine operational support processes, including daily checklists, work dashboards, and communication protocols to maintain clear timelines and issue tracking.
• Regularly analyse operational metrics, report on cloud system status, and provide insightful updates to stakeholders.
• Exhibit excellent communication skills to convey key findings and maintain strong relationships across the board.
• Lead change management initiatives by assessing impacts thoroughly, crafting strategies, and developing risk mitigation measures.
• Oversee a validation team tasked with rigorous QA and security assessments to ensure stakeholder changes are thoroughly vetted before release.
• Organize and execute maintenance schedules and system upgrades to optimize cloud infrastructure performance, liaising with vendors and teams for seamless cloud environment stability.
• Set and evolve OKRs and SLAs, striving for continuous enhancement of cloud operation performance.
Experience and Skills:
• Degree or equivalent in Computer Science, Information Technology, or related fields, supplemented by relevant experience.
• At least 2 years of hands-on management of public cloud services, preferably AWS.
• Acute problem-solving skills within varied cloud infrastructures and applications.• Exceptional customer service acumen, with a strong sense of urgency and detail-oriented approach to issue resolution.
• Track record in developing and enforcing IT processes, procedures, and policies.
• Proficient in managing cloud production environments and instituting preventative measures to mitigate potential business impact.
• Competent in operational cloud technology activities, including impact assessments and service improvement execution.
Key Technologies:
• Experience with infrastructure as code, specifically Terraform, for efficient resource provisioning and management.
• Proficiency in GitLab for continuous integration/continuous deployment (CI/CD) pipelines and version control.
• Strong understanding of AWS services and architecture, underpinning the majority of our cloud operations.
As a DevOps Specialist, you need to bring to the team:
• Dedication for automation, standardization and best practices
• Excellent understanding of Software Development Life Cycle, Test Driven Development, Continuous Integration and Continuous Delivery.
• Experience working with high availability, high performance, high security, multidata centre systems and hybrid cloud environments.
• Demonstrable skills in three or more programing/scripting languages.
• Experience with version control systems such as Git.
• Experience with such as GPC, GCC (i.e. AWS, Azure, Google Cloud).
• Ability to troubleshoot complex issues ranging from system resource to application stack traces.
• Comfortable with Agile methodologies and working closely with product development teams.
• Strong on collaboration and communication including documentation.
• Degree or Diploma in Computer Science, Computer or Electronics Engineering, Information Technology or related disciplines.
Experience required:
• Experience in one or more automated provisioning tools such as Vagrant, Ansible, Puppet, Chef.
• Experience in one or more automated infrastructure testing tools such as Serverspec.
• Experience in one or more Cloud infrastructure such as OpenStack, CloudStack, vSphere.
• Knowledge of RPM file deployment, management and design.
• Knowledge of disaster recovery, system backup and restore.
• Experience in one or more virtualization technologies (KVM, Xen, VMware, Hyper-V).
• Knowledge of container technologies such as Docker, LXC.