Who We Are:
OnebyZero is a data-first customer strategy firm offering end-to-end business transformation. Headquartered in Singapore, with a local presence in 6 countries in ASEAN and India and a team of expert practitioners and technologists, we deliver sustainable business outcomes using AI/ML.
Duration: 6-8 weeks
The Role: Data Engineer
Key Responsibilities
● Design and implement scalable ETL/ELT pipelines.
● Automate data workflows and ensure fault-tolerant execution.
● Support real-time data ingestion and streaming.
● Design, implement, and optimize large-scale data engineering and analytics solutions.
● Develop Python scripts for extracting, transforming, and loading data.
● Assist with production issues in data warehouses, such as data loading, transformations, and translations.
● Set up and manage data lakes and warehouses.
● Optimize data storage for performance and cost.
● Ensure high-performance data access during AI/ML training.
● Implement data security best practices (encryption, access control).
● Ensure data governance, compliance, and privacy.
Basic Qualifications
● 3+ years of experience in data engineering, ETL/ELT pipeline development, and data management.
● Proven hands-on experience with large-scale data processing, real-time data streaming, and data warehousing.
● Strong understanding of data modeling, data architecture, and database design.
● Hands-on experience with data partitioning, indexing, and query optimization.
● Proficiency in Python, Java, or Scala for data manipulation and automation.
● Experience with Apache Hadoop, Spark, or similar big data frameworks on AWS.
● Understanding of enterprise data management concepts (Data Ingestion, Data Lake, Data Warehouse, Data Engineering, Data Sharing, Data Applications).
● Working knowledge of software engineering best practices.
Preferred Qualifications
● Hands-on experience working with Glue, Apache4
● Experience with GenAI tools and frameworks
● Experience implementing AWS services in a variety of distributed computing and enterprise environments
● Proficiency in at least one language such as C++, Java, Scala, or Python
What We Offer:
● An opportunity to be part of an agile, highly proficient and experienced Data/AI/ML team
● An opportunity to work on challenging data science and machine learning problems with customers and see your work deployed in action
● A fast-paced software development environment that uses the latest open-source tools across the development stack