Roles & Responsibilities
- Design, develop, and enhance system software, including kernels, drivers, libraries, and utilities.
- Analyze system performance and implement optimizations to improve efficiency and reduce resource consumption.
- Diagnose and resolve complex system-level issues, including hardware-software interactions, memory management, and system crashes.
- Create and maintain scripts for system automation, deployment, and monitoring.
- Implement and maintain security best practices at the system level, including secure coding, access control, and vulnerability management.
- You partner with various technology teams to design and deliver a reliable, scalable, secure, and performant Red Hat Linux Platform.
- You participate in a 24/7 on-call rotation and drive improvements using SRE practices.
- You actively participate in toil elimination, observability and monitoring improvements, knowledge management, error budget compliance, deployment designs and testing.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- Proficiency in programming languages such as C, C++, and assembly language.
- Install, Maintain, Upgrade and Patch LINUX servers in the organization.
- Troubleshoot and fix system and software/hardware issues.
- Support and maintain High Availability of system using clustering software.
- Secure the systems by following published hardening guidelines.
- Assist in audit and compliance tasks.
- Perform Disaster Recovery activities.
- Know at least one of {Python, PowerShell, Ruby, Java, C++, C#, Go} at an intermediate level.
- Experience with infrastructure as code (Terraform or CloudFormation).
- Knowledge of configuration management systems like Ansible, Puppet, Chef