Job Responsibilities:
• Responsible for maturing and expanding the Bank’s IT Application Performance Monitoring program. The program's goal is to improve application performance, availability, and resiliency through real-time monitoring.
• Drive and oversee infrastructure and application-related monitoring initiatives to connect the dots and achieve end-to-end visibility of IT systems.
• Partner with application developers and production support teams to determine the monitoring enhancement areas and champion solutions.
• Collaborate with a cross-functional team of dev, ops, and architects to understand complex application architectures (including Cloud and Kubernetes) to implement a practical top-down monitoring approach.
• Participate in strategy and implementation discussions on redesigning the monitoring environment and modernizing it in line with current technology trends.
• Establish KPIs around monitoring and availability, track and report progress, and periodically communicate organizational value to management and senior stakeholders.
• The candidate must have at least 15 years of experience in the enterprise monitoring and observability domain, preferably in the banking industry.
• At least 10 years of hands-on experience with one or more industry-leading tools:
o Enterprise monitoring/observability – ITRS Geneos, Prometheus/Grafana, AppDynamics, Dynatrace, etc.
o Automation scripting - Ansible, Terraform, PowerShell, etc.
o Enterprise logging platforms - Splunk, Elasticsearch, etc.
• Familiarity with workflow management and ITSM tools such as Remedy or ServiceNow
• Good understanding of the 3 pillars of observability - metrics, logs, and traces.
• Ability to visualize and design monitoring dashboards that give IT Operations a single pane of glass.
• Good understanding of Agile (Scrum or Kanban) and its real-world implementation, along with exposure to the associated tool sets (JIRA/Confluence).
• Technical background, with hands-on experience in Open Systems, Virtualization, Storage and Networking technologies.
• The ideal candidate will also have exposure to Cloud technologies, SRE, and DevOps concepts, with a focus on automation.
Job Requirements:
• Bachelor's or Master's degree in Computer Science / IT
• Related professional/technical qualification will be advantageous although not mandatory
• Self-directed self-learner who displays resilience and discipline
• Excellent analytical and communication skills.
• Good planning and organizing skills with structured thinking and innovative solutions
• Strategic forward-thinking approach to challenges with outstanding influencing, negotiating and persuasion skills
• Able to leverage strong professional relationships to drive and deliver the right business outcomes.
• With the ability to look at issues from multiple angles, the candidate should be able to appreciate diverse perspectives and complexity.