The System Administrator/Engineer (SA/E) is responsible for effective provisioning, installation/configuration, operation, and maintenance of systems hardware and software and related infrastructure. This individual participates in technical research and development to enable continuing innovation within the infrastructure. This individual ensures that system hardware, operating systems, software systems, and related procedures adhere to organizational values, enabling staff, volunteers, and Partners.
This individual is accountable for the following systems: Linux (Red Hat Enterprise Linux), Windows, and UNIX (AIX and Solaris). Middleware includes WebSphere Application Servers and WebSphere Message Queue. Backup software includes Tivoli Storage Manager, which supports the MSI/LTA infrastructure.
Responsibilities on these systems include operations and support, maintenance, and research and development to ensure continual innovation.
Job Description
- Install new/rebuild existing servers and configure hardware, peripherals, services, settings, directories, storage, etc. in accordance with standards and project/operational requirements
- Develop and maintain installation and configuration procedures
- Research and recommend innovative and, where possible, automated approaches for system administration tasks. Identify approaches that leverage our resources and provide economies of scale
- Perform daily system monitoring, verifying the integrity and availability of all hardware, server resources, systems and key processes, reviewing system and application logs, and verifying completion of scheduled jobs such as backups, and related administration tasks
- Perform daily backup operations, ensuring all required file systems and system data are successfully backed up to the appropriate media, recovery tapes or disks are created, and media is recycled and sent off site as necessary
- Provide 24x7 technical support on request from various constituencies. Investigate and troubleshoot issues, and provide professional recommendations and solutions
- Develop an in-depth knowledge of the system’s infrastructure and its integration with the various applications. Provide professional consultations and solutions as necessary
- Repair and recover from hardware or software failures. Coordinate and communicate with impacted constituencies
- Provide Desktop support and recovery services
- Manage, maintain, and support middleware and other application software implementations and solutions (WebSphere Application Server, WebSphere Message Queue, IBM Integration Bus, IBM MessageSight, IBM DataStage)
- Develop new and/or maintain existing backup policies and schedules according to operational and business requirements
- Apply OS patches and upgrades on a regular basis, and upgrade administrative tools and utilities. Configure or add new services as necessary
- Review and evaluate software security and its related risks. Plan and implement security patching as needed
- Script application deployment, daily monitoring and automation
- Upgrade and configure system software that supports LTA/MSI infrastructure per project or operational needs
- Maintain operational, configuration, or other procedures and documents
- Perform periodic performance reporting to support capacity planning
- Perform ongoing performance tuning, hardware upgrades, and resource optimization as required. Configure CPU, memory, and disk partitions as required
- Evaluate, develop and implement new system enhancements per request from various constituencies
- Plan and deploy new SeP application software upgrades / Business Objects as required
- Perform incident management, vendor management and reporting
- Monitor and coordinate recovery procedures between the affected constituencies and the vendor. Provide technical assistance as required
- Participate in Disaster Recovery (DR) planning and in the execution of DR procedures during simulated dry runs and actual disaster events
Requirements
- Working knowledge of basic cloud infrastructure is an added advantage.
Technical Expertise and Knowledge include:
- OS: Windows, Linux, AIX
- Middleware: WebSphere Application Servers, WebSphere Message Queue, Tivoli Directory Services, WebSphere Network Deployment Manager, Edge Server, IBM Integration Bus, IBM MessageSight, IBM DataStage
- Backup software: Tivoli Storage Manager
- Storage: V3000, V5000, V7000/V9000 SAN
- Hypervisor: VMware
- Others: Tape Library, BOE, Web Server, HACMP, HMC, SAN Switch, UPS, Samba, FlashCopy, VIOS, Red Hat Virtualization
- Windows Desktop installation, configuration and support
- WebSphere, IIB, Application Deployment
- Unix shell and VB scripting
- Cloud Platform Expertise: Experience with one or more cloud platforms such as AWS, Azure, Google Cloud, or similar providers
- Infrastructure as Code (IaC): Proficiency in using tools like Terraform, CloudFormation, or Ansible to manage cloud infrastructure
- Cloud Resource Management: Experience in deploying, managing, and monitoring cloud-based resources (e.g., virtual machines, storage, networking)
- Automation & Scripting: Strong scripting skills (e.g., Python, PowerShell, Bash) to automate tasks in a cloud environment
- Cloud Security: Understanding of cloud security best practices (e.g., IAM roles, security groups, firewall configurations)
- Monitoring & Performance: Familiarity with cloud monitoring tools (e.g., AWS CloudWatch, Azure Monitor) to track performance and troubleshoot issues
- Scalability & Cost Management: Experience in scaling cloud infrastructure dynamically and optimizing costs using reserved instances, spot instances, and autoscaling