We are looking for an experienced Delivery Lead specializing in Oracle Cloud Infrastructure (OCI), AWS, and Oracle Database Administration to lead cloud operations and support. This role requires deep technical expertise in cloud infrastructure and database management, with a focus on optimizing performance, maintaining security compliance, managing costs, and responding effectively to incidents. The Delivery Lead will work closely with cross-functional teams to ensure seamless and efficient cloud operations, high service reliability, and compliance with operational standards.
Key Responsibilities:
- Cloud Monitoring and Alerting:
- Implement comprehensive monitoring for OCI (primary), Oracle Database, and AWS infrastructure.
- Define alert thresholds, escalation policies, and proactive monitoring to ensure system health and uptime.
- Oracle Database Administration:
- Perform administration tasks for Oracle Databases hosted on OCI, including setup, configuration, performance tuning, and troubleshooting.
- Implement high-availability configurations, ensure data integrity, and conduct regular performance assessments.
- Security and Compliance:
- Establish robust security practices, including logging, threat detection, and compliance monitoring.
- Enforce IAM and user permissions following best practices to secure cloud and database environments.
- Backup and Disaster Recovery (DR):
- Automate backup and recovery processes, ensuring database and cloud data resilience.
- Design, test, and validate DR plans regularly to ensure quick recovery and minimize downtime.
- Cost Management and Optimization:
- Regularly review cloud and database usage, identify cost-saving opportunities, and implement FinOps practices to streamline financial operations.
- Performance Monitoring and Optimization:
- Define metrics and performance benchmarks for cloud infrastructure and database performance.
- Collaborate with application and database teams to optimize performance and troubleshoot issues as needed.
- Incident Response and Root Cause Analysis:
- Develop and refine incident response processes, performing root cause analysis and applying quick resolutions to meet SLA requirements.
- Escalate incidents to upper-level support or third-party vendors as required and ensure prompt workarounds.
- Documentation and Knowledge Sharing:
- Document cloud and database architecture, resource configurations, and troubleshooting procedures.
- Maintain updated runbooks, facilitate knowledge transfer sessions, and compile handbooks for handling known issues.
- Routine Maintenance:
- Apply security patches and software updates for cloud and Oracle Database environments, ensuring they meet the latest security and performance standards.
- Vendor and Stakeholder Collaboration:
- Collaborate with managed service vendors and internal teams to implement infrastructure changes and verify application performance.
- Work with FPT to continuously refine severity definitions based on system behaviour and feedback.
Required Skills and Qualifications:
- Bachelors degree in computer science, Information Technology, or a related field.
- 5-7+ years of experience in cloud operations with a focus on OCI, AWS, and Oracle Database Administration.
- Strong knowledge of cloud monitoring tools, security compliance frameworks, and IAM policies.
- Proficiency in Oracle Database administration including installation, configuration, patching, tuning, and backup/recovery.
- Experience in cost optimization (FinOps) and incident management.
- Proven ability to develop and execute Disaster Recovery plans.
- Excellent documentation, communication, and collaboration skills for efficient teamwork.
Preferred Qualifications:
- Certifications in OCI, AWS, and Oracle Database Administration (e.g., OCI certifications, Oracle Database certifications, AWS Certified Solutions Architect).
- Experience with infrastructure-as-code and automation frameworks.