Key Responsibilities
1. 24x7 Operational Support:
· Ensure continuous system availability and lead system recovery during outages.
2. Technical Expertise:
· Investigate and resolve complex technical issues efficiently.
· Leverage subject matter expertise for timely problem resolution.
3. Proactive Monitoring and Automation:
· Monitor system health and detect issues early.
· Automate routine monitoring tasks to enhance system stability and resiliency.
4. System Maintenance and Growth Management:
· Perform regular health and capacity assessments for the production environment.
· Plan for capacity growth driven by Business-As-Usual (BAU) and project-related demands.
5. Incident Management:
· Oversee end-to-end incident handling, including investigation, service recovery, and follow-up actions.
· Collaborate with stakeholders to resolve incidents within agreed SLAs.
6. Collaboration with Infrastructure Teams:
· Partner with Middleware teams, Database Administrators (DBAs), and System Administrators (SAs) to troubleshoot and maintain infrastructure.
7. Application BAU and Projects:
· Manage BAU tasks such as running Disaster Recovery exercises, responding to audit queries, and updating Standard Operating Procedure (SOP) documentation.
· Handle system upgrades for End-of-Service/End-of-Life (EOS/EOL) components.
8. Governance and Compliance:
· Enforce strong governance for Change Management, Data Protection, and Privileged ID access control.
9. Shift-Based Work:
· Provide on-site operational support according to a 5-day weekly shift roster.
· Compensatory weekdays off are provided for shifts worked on weekends or public holidays.
Skills and Requirements
• Technical Proficiency:
· Strong understanding of systems, applications, and infrastructure.
· Expertise in problem detection, troubleshooting, and automation.
• Communication:
· Ability to collaborate effectively with cross-functional teams and business stakeholders.
· Excellent incident reporting and documentation skills.
• Adaptability:
· Comfortable working in a high-pressure, 24x7 environment.
· Willingness to work flexible shifts and handle critical system events.
• Governance Awareness:
· Familiarity with change management processes and compliance standards.