Responsibilities:
• Take up technical tasks in the ITSM queues to resolve day-to-day BAU / Project Requests and Incidents
• Plan, prepare and execute Change technical tasks in the ITSM queues
• Respond to and resolve alerts from the monitoring management system
• Build, provision and harden servers/clusters according to the security standards
• Troubleshoot, configure and resolve requests / incidents linked to Hardware, Operating System, Hypervisor and High-Availability services in the operating environment
• Deploy and operate storage, network and OS/cluster configurable items, and infrastructure management tools
• Plan, prepare and execute server OS patching and reboots
• Plan, prepare and execute server hardware moves in the Data Center, including racking and cabling setup
• Perform basic to intermediate root cause analysis in Incident and Problem tasks
• Identify, assess and remediate vulnerabilities reported by security sources (scan report, pen test, audit finding, etc.)
• Plan, prepare, execute and support Data Center maintenance activities
• Plan, prepare, execute and support Disaster Recovery / Business Continuity exercises
• Coordinate with internal clients/partners and external vendors to deliver on hardware break/fix cases, software cases, etc.
• Respond to and own L1/L2 escalations
• Take part in task-force resolution squads for priority incident management / crisis management cases
• Document, review, maintain and share technical information and write-ups (primarily SOPs) as part of Knowledge Management
• Extract and prepare data needed for reports and dashboards (capacity planning, health checks, IT controls, compliance, audit, etc.)
• Engage in Service Improvement reviews and action plans
Requirements:
• Bachelor’s or Master’s degree in Information Technology or a related field
• Minimum 10 years of relevant hands-on work experience
• Experience with the patch deployment process (planning, scheduling and rollout) following Microsoft guidelines
• Experience working in an SLA-driven environment
• Flexibility to work shifts, weekends and on-call rotations
Hands-on technical expertise in:
· Operating System – Windows Server 2008 / 2012 / 2016 / 2019
· Hypervisor – Hyper-V and, to a lesser extent, VMware
· High-Availability solution – Microsoft Cluster Service (multi-node geo-cluster)
· Hardware – HPE ProLiant DL / BladeSystem c7000 / Synergy / SimpliVity / Apollo
· Running and operating an enterprise Windows ecosystem (physical and virtual servers) in a heterogeneous environment with Storage, Network, Database and Middleware services
· Running and operating CLX, RAID Manager and HORCM to manage disks and storage operations on the server/host side
· Active Directory management (OU/GPO/Forest and domain)
· Scripting / automation experience, with proficiency in PowerShell
· Work with tools such as Altiris (inventory and patch management), CyberArk (privileged access management), OMI suite (events/alerts management)
· Troubleshooting skills for Windows OS, Hyper-V and VMware issues
· Knowledge of SCVMM and Virtual Center
· Performance analysis and root cause analysis for critical and major incidents
· Hands-on experience with monitoring tools such as HP OMi, NNM, Tivoli, etc.
· Server deployment and configuration, BIOS upgrades and firmware upgrades
· Driving critical incidents to resolution with the Incident Management team and coordinating with other teams and relevant stakeholders
· Knowledge of and/or hands-on experience with emerging technologies (Cloud IaaS, HPE Synergy) is an advantage