- Bring solid technical skills, diversity of thought, and creative solutions that serve the best interests of our customers globally.
- Develop, test, and debug automated tasks (apps, systems, infrastructure).
- Troubleshoot priority incidents and facilitate blameless post-mortems.
- Work with development teams throughout the software life cycle to ensure sustainable software releases.
- Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions.
- Build self-healing and resiliency patterns and drive their adoption.
- Lead and participate in performance tests; identify bottlenecks, opportunities for optimization, and capacity demands.
- Adhere to firm-wide architecture standards, risk management and security policies.
- Work as a team player in a global team setup, partnering with product owners and business teams to develop, build, and support applications.
- Communicate and collaborate on development items with the global team, and raise and work to resolve issues impacting development.
- Provide post-production application support.
- Participate in quality assurance, peer reviews, and code reviews.

Qualifications
- Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
- A successful history of manipulating, processing, and extracting value from large datasets.
- Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
- Experience supporting and working with cross-functional teams in a dynamic environment.
- Up to date with the latest technologies in the industry.
- Well-spoken, articulate, and emotionally intelligent.
- Well connected to industry members; attends developer and tech meetups.
- Strong understanding of DevOps practices, tools, and techniques.
- Degree in Computer Science, IT, or a related field.
- Experience deploying applications in containerized environments using Docker, Kubernetes, PCF, OpenShift, AWS, etc.
- In-depth OS experience (RHEL, Ubuntu, Windows Server) with strong debugging, troubleshooting, and problem-solving skills.
- Experience in site reliability engineering using one of the following languages: Python, Java, PowerShell, or shell scripting.
- Hands-on experience with cloud-based technologies and tools, especially for deployment, monitoring, and operations, such as Prometheus, Splunk, Elasticsearch, and Grafana.
- Strong working knowledge of modern development practices and tools such as Agile, CI/CD, Git, Terraform, and Jenkins.
- Good understanding of networking protocols and cybersecurity best practices in public cloud environments.
- Advanced working knowledge of SQL and experience with RDBMS, Hadoop, and NoSQL databases.
- Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.