We are looking for a Systems Engineer (Application Infrastructure):
Planning
- Assist in Technology roadmap and planning
- Proactive Capacity Management & Planning
- Perform growth projections on infrastructure resources (Application servers, webservers, API Gateways and middleware).
- Proactively monitor capacity and usage, and plan for required upgrades
- Perform capacity upgrades by scaling up resources or scaling out with additional servers
- Proactive Availability Management and Planning
- Perform availability review on infrastructure resources (Application servers, webservers, API Gateways and middleware)
- Identify and recommend the improvements needed to meet the availability requirements expected by the business
- Implement approved availability improvements
- Assist in planning and, where possible, consolidating all change activities that require downtime, in order to reduce the disruption window to production
- Assist in planning hardware/software/firmware/middleware updates, refreshes or replacements to ensure they remain on vendor/manufacturer-supported versions
- Perform Systems Security hardening
- Perform High Availability (HA)/Failover and Backup/Recovery configuration and setup
- Build, update, verify and execute DRP/BCP (Disaster Recovery Plan and Business Continuity Plan) procedures
System and Service Administration
- Monitor the health status of multiple environments 24x7
- Perform access control management
- Identify and document dependencies and interfaces between network, systems, applications and databases to ensure the applications and associated Service Level Agreements (SLA) can be met
- Support periodic audit review exercises by providing the required information and performing the remediation actions needed to close all audit findings
- Track and update inventory, assets and licenses to ensure compliance
- Maintain technical documentation for configurations/setups
- Ensure the data integrity, configurations and data/applications synchronization of DEV/UAT/Production/DR or other established environments
- Ensure the various system interfaces, system flows and data exchanges between components and servers are functioning
- Ensure data replication (application/database or other products) stays in sync across the UAT/Production/DR databases
- Prepare and set up test environments for applications (setup/patching/upgrade/enhancements or new projects)
- Ensure system-, application- and data-level backups and recoveries complete successfully (based on the backup/recovery schedule)
- Ensure proper logging and archival, and complete a successful tape restoration test at least once every 6 months.
- Perform scheduled periodic BCP/DRP exercises (switch over and switch back and testing)
Operations Support
- Perform after-hours emergency work as scheduled or required.
- Investigate issues within 30 minutes of an incident alert/activation
- Observe daily behaviour trends and take proactive action to remediate issues
- Perform proactive incident prevention actions
- Incident Management
- Effective and timely participation, management and resolution
- Problem Management
- Effective and timely participation to resolve root cause and provide recommendation(s) to solve technical problems
- Release Management
- Perform the release tasks required to deploy code changes/enhancements into UAT environments
- Perform post implementation review of release tasks and update completion status into ITSM tool
- Change Management
- Comply with the Change Release Procedure for Production, DR and UAT environments, as stated in the IT Change Management System Guideline
- Perform code changes/enhancement deployment (Production/DR environments)
- Perform periodic/scheduled maintenance work
- Perform/manage feasibility studies and recommendations for hardware (firmware)/software version upgrades, patch fixes and security updates (inclusive of testing, change management, rollout planning and implementation)
- Operations Enhancement and Efficiency
- Proactively review current operational tasks, processes and procedures (inclusive of BCP and DR). Identify areas for improvement and streamline mundane tasks through simplification and automation to improve operational efficiency.
- Review and propose new or better ways to monitor and perform health checks of the current environment to ensure availability and effective/efficient issue resolution.
- Documentation
- Perform documentation review and updates to ensure procedures are maintained and current.
Project Management and Vendor Management
- Provide/review solutions and options in terms of systems architecture and configuration
- Project manage infrastructure related projects (e.g. server refresh, new infrastructure services) when assigned
- Participate as a project member in key application and/or infrastructure projects
- Provide Application Infrastructure support to Application team and various internal teams during solution/project deployments
- Manage vendors/contractors to ensure implementation is in accordance with stipulated requirements, service/system SLAs are met and project deliverables are provided completely and on time
- Assess and provide the infrastructure level requirements (e.g. system licenses, software and versions, etc.)
- Identify and assess risks, provide/review risk mitigating measures associated with the technical solutions and the migration plan
- Plan infrastructure-level capacity requirements (CPUs, memory, disk), leveraging the existing setup where possible
- Provide and/or assess migration plan from existing setup to new proposed solution
- Provide the 24x7 operation support plan after the project implementation transitions into operational phase
- Ensure required health-checks are updated
REQUIREMENTS:
- Minimum of 3 years’ experience;
- Candidates with hands-on experience with RedHat, Linux and JBoss will be prioritised;
- Experience in supporting, maintaining, performance tuning and implementing middleware applications and Linux systems;
- Degree in IT or Computer Science or equivalent education.