x
Get our mobile app
Fast & easy access to Jobstore
Use App
Congratulations!
You just received a job recommendation!
check it out now
Browse Jobs
Companies
Campus Hiring
Download App
Jobs in Singapore   »   Jobs in Singapore   »   Engineering Job   »   Sr. SW Engineer (Site Reliability Engineer)
 banner picture 1  banner picture 2  banner picture 3

Sr. SW Engineer (Site Reliability Engineer)

Visa

Visa company logo

Sr. SW Engineer (Site Reliability Engineering)

Company Description

Visa is a world leader in digital payments, facilitating more than 215 billion payments transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable and secure payments network, enabling individuals, businesses and economies to thrive.
When you join Visa, you join a culture of purpose and belonging – where your growth is priority, your identity is embraced, and the work you do matters. We believe that economies that include everyone everywhere, uplift everyone everywhere. Your work will have a direct impact on billions of people around the world – helping unlock financial access to enable the future of money movement.

Platform Products Technology group in VISA is one team that strongly works towards next-gen payments and believes in its slogan It's Everywhere You Want to Be, for making payments accessible everywhere and for everyone. This group innovates technology that improves the lives of millions of people around the world for the payment ecosystem. The desired candidate will be part of this journey of our team and will be contributing to achieve the same. This role is in Site Reliability Engineering (SRE) team which focusses on the digital products from reliability, availability, performance, and efficiency perspective.

Responsibilities

  • Engage with product, architects, developers, Certification, Project management, Operations & Infrastructure teams from the start of the SDLC phase.

  • Become subject matter expert for the assigned product verticals. Analyze complex systems from a reliability and resilience perspective.

  • Run the production environment by monitoring availability and taking a holistic view of system health. Use ELK, Grafana, and Splunk for monitoring application-specific logs, visualizing metrics, creating dashboards, and alerts.

    issue occurred and own them completely for end-to-end closure.

  • Performing functional analysis of products by gathering and analyzing metrics from both operating systems and applications to assist in performance tuning and fault finding – integration/operational challenges.

    staff and management during the incident and change management process.

    availability

Join Visa: A Network Working for Everyone.

Job Description

  • Understanding the end-to-end product topology from infrastructure and application perspective.

  • Build/Design automation script for manual process.

  • Build/Design test script using ML/Python for API based products.

  • Identify sources of instability in large-scale distributed systems and drive operational excellence. Dive deep and understand every

  • Performing code bug fixes in production and recommending any architectural improvements during issue/incident analysis.

  • Work closely with development and product teams on suggesting new features and enhancements based on live issues.

  • Drive down the burden of toil with tooling and automation to achieve operational efficiency and smoother customer experience.

  • Apply AI techniques to improve system reliability and efficiency.

  • Technical consultancy for monitoring, incidents, and problem management. Lead technical bridges and interact with both technical

  • Participate in on-call support.

  • Engage with tech and non-tech partners on regular basis to analyze functional and technical in-depth solutions.

  • Understanding new changes in production systems and assessing its risk from application perspective for driving reliability and

  • Have some level of network engineering understanding to assist in incident/issue triaging.

  • Provide guidance and technical expertise to junior team members.

  • Excellent problem-solving skills and attention to detail.

  • Strong communication skills and ability to work effectively in a team.

  • Collaborate with the team to define SRE practices and identify areas for improvement.

 

  • This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

 

✱   This job post has expired   ✱

Sharing is Caring

Know others who would be interested in this job?