1-3 years of experience, including experience with Unix, SQL, and monitoring tools
Incident and Problem management.
Shifts: morning (7:30am-4:30pm), afternoon (2pm-11pm), night (10pm-7:30am)
Shifts rotate every 2 weeks, so you'll stay on a single shift for each 2-week period.
You'll also need to be flexible, since the schedule may change given that this is a new team.
• Work in shifts to provide 24/7 on-site support.
• Incident and Problem management.
• Should have knowledge of SRE best practices and be able to adhere to SRE guidelines in day-to-day work.
• Apply root cause analysis techniques to determine the cause of complex system issues and resolve them.
• Perform post-resolution follow-ups to ensure problems have been adequately resolved.
• Communicate application problems and issues to key stakeholders, including management, development teams, end users, and unit leaders.
• Work with on-site and offshore teams across multiple technologies/applications.
• Continuous improvement of the system, e.g. removal of toil, job automation, performance tuning.
• Proactive management of production services by measuring and monitoring availability, latency, throughput, user journeys and overall system health.
• Good to have: knowledge of and experience with UNIX and SQL.