• Should have a solid technical skill with diversity of thought and creative solutions that are in the best interests of our customers globally.
• Develop, test, and debug automated tasks (Apps, Systems, Infrastructure)
• Troubleshoot priority incidents, facilitate blameless post-mortems.
• Work with development teams throughout the software life cycle ensuring sustainable software releases.
• Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions.
• Build and drive adoption for greater self-healing and resiliency patterns.
• Lead and participate in performance tests; identify bottlenecks, opportunities for optimization, and capacity demands.
• Adhere to firm-wide architecture standards, risk management and security policies.
• Team player and ability to work in Global Team setup, product owners and business team to develop, build & support application.
• Communicate and collaborate on development items with global team, as well as raise/work to resolve issues impacting development.
• Postproduction application support
• Participate in quality assurance, peer reviews and code reviews Qualifications.
• Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
• A successful history of manipulating, processing, and extracting value from large datasets.
• Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
• Experience supporting and working with cross-functional teams in a dynamic environment.
Experienced deploying application in a containerized environment using Docker/Kubernetes/PCF/Openshift/AWS etc.
In-Depth OS experience (RHEL, Ubuntu, Windows Server) with strong debugging, troubleshooting, and problem-solving skills.
Experience in site reliability engineering in one of the following languages: Python, Java, PowerShell, shell scripting.
Hand-on experience with cloud-based technologies and tools especially in deployment, monitoring and operations, such as Prometheus, Splunk, Elasticsearch, Grafana
Strong working knowledge of modern development technologies and tools such Agile, CI/CD, Git, Terraform and Jenkins.
Good understanding of networking protocols and cybersecurity best practices in public cloud environment
Advanced working SQL knowledge and experience working with RDBMS, Hadoop, and NoSQL DB.
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
Build processes supporting data transformation, data structures, metadata, dependency, and workload management.