Sr. Site Reliability Engineer
1 month ago
About The RoleWe're looking for a curious and innovative Product Reliability Engineer to join our Middleware team at Visa. Here, you'll be part of a d..
About The Role
We're looking for a curious and innovative Product Reliability Engineer to join our Middleware team at Visa. Here, you'll be part of a diverse group of problem-solvers who ensure billions of transactions flow seamlessly across the world's largest payment network.
What You'll Do
- Design for Reliability: Architect and implement solutions that keep Visa's middleware services running with Always On availability
- Automate Everything: Create intelligent automation for monitoring, deployment, and incident response
- Lead Investigations: Use your detective skills to solve complex technical puzzles and prevent future incidents
- Drive Innovation: Contribute to our evolution from traditional middleware to cloud-native solutions
- Collaborate Globally: Work with talented engineers across the world to build, support, and deploy middleware services.
Why You'll Love It
- Real Impact: Your code will help process millions of transactions, enabling commerce worldwide
- Growth Opportunities: Regular learning sessions, mentorship programs, and exposure to cutting-edge technology
- Work-Life Integration: Hybrid work model (2-3 days in office) with flexible scheduling
- Inclusive Culture: Join a team that actively promotes diverse perspectives and collaborative problem-solving
Your Experience & Skills
We encourage you to apply even if you don't meet every requirement. We value potential and enthusiasm over perfection.
Core Skills (Some combination of:)
- 3+ years of experience with modern middleware technologies. These might include (Tomcat, Apache, Springboot, SQS, JBoss, IBM MQ, IBM DataPower, Hazelcast, Flink, Connect Direct, SSL)
- Understanding of Linux/Unix systems, networking, cloud platforms (AWS, Azure, GCP), containerization (Kubernetes, Docker), and infrastructure-as-code tools (Terraform, Ansible).
- Proficiency with monitoring tools (Prometheus, Grafana, Datadog, etc.), logging systems (ELK stack, Splunk), and tracing tools (Jaeger, Zipkin).
- Proven track record of automating complex tasks and processes to improve efficiency and reliability using Python, Go, Java, or similar.
Technical Areas You'll Grow In:
- Cloud & System Architecture: Design scalable, resilient systems across hybrid cloud platforms (AWS, GCP, Azure)
- AI/ML Operations: Support and optimize ML model deployment pipelines and monitoring systems
- Observability & Performance: Master advanced monitoring, tracing, and performance optimization techniques
- Automation & Intelligence: Build smart alerting systems and automated remediation workflows
- Distributed Systems: Design and maintain globally distributed payment processing systems
What Makes You Thrive:
- You're energized by solving complex problems
- You believe in automation over manual processes
- You enjoy mentoring others and sharing knowledge
- You're comfortable with ambiguity and rapid change
- You value building reliable systems over quick fixes
This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.
Official account of Jobstore.