Avensys is a reputed global IT professional services company headquartered in Singapore. Our service spectrum includes enterprise solution consulting, business intelligence, business process automation and managed services. Given our decade of success we have evolved to become one of the top trusted providers in Singapore and service a client base across banking and financial services, insurance, information technology, healthcare, retail, and supply chain.
We are currently looking to hire Cloud Data Engineer (AWS, Databricks, and Informatica IDMC)
This is an exciting opportunity to expand your skill set, achieve job satisfaction and work-life balance. More details as below.
Responsibilities:
A Cloud Data Engineer specializing in AWS, Databricks, and Informatica IDMC is responsible for building and maintaining a robust, integrated, and governed data infrastructure that leverages the strengths of the analytics platforms to extract valuable insights from data while ensuring data security, compliance, and high-quality data management.
· Design and architect data storage solutions, including databases, data lakes, and warehouses, using AWS services such as Amazon S3, Amazon RDS, Amazon Redshift, and Amazon DynamoDB, along with Databricks' Delta Lake. Integrate Informatica IDMC for metadata management and data cataloging.
· Create, manage, and optimize data pipelines for ingesting, processing, and transforming data using AWS services like AWS Glue, AWS Data Pipeline, and AWS Lambda, Databricks for advanced data processing, and Informatica IDMC for data integration and quality.
· Integrate data from various sources, both internal and external, into AWS and Databricks environments, ensuring data consistency and quality, while leveraging Informatica IDMC for data integration, transformation, and governance.
· Develop ETL (Extract, Transform, Load) processes to cleanse, transform, and enrich data, making it suitable for analytical purposes using Databricks' Spark capabilities and Informatica IDMC for data transformation and quality.
· Monitor and optimize data processing and query performance in both AWS and Databricks environments, making necessary adjustments to meet performance and scalability requirements. Utilize Informatica IDMC for optimizing data workflows.
· Implement security best practices and data encryption methods to protect sensitive data in both AWS and Databricks, while ensuring compliance with data privacy regulations. Employ Informatica IDMC for data governance and compliance.
· Implement automation for routine tasks, such as data ingestion, transformation, and monitoring, using AWS services like AWS Step Functions, AWS Lambda, Databricks Jobs, and Informatica IDMC for workflow automation.
· Maintain clear and comprehensive documentation of data infrastructure, pipelines, and configurations in both AWS and Databricks environments, with metadata management facilitated by Informatica IDMC.
· Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand data requirements and deliver appropriate solutions across AWS, Databricks, and Informatica IDMC.
· Identify and resolve data-related issues and provide support to ensure data availability and integrity in both AWS, Databricks, and Informatica IDMC environments.
· Optimize AWS, Databricks, and Informatica resource usage to control costs while meeting performance and scalability requirements.
· Stay up-to-date with AWS, Databricks, Informatica IDMC services, and data engineering best practices to recommend and implement new technologies and techniques.
Requirements / Qualifications
· Bachelor’s or master’s degree in computer science, data engineering, or a related field.
· Minimum 5 years of experience in data engineering, with expertise in AWS services, Databricks, and/or Informatica IDMC.
· Proficiency in programming languages such as Python, Java, or Scala for building data pipelines.
· Evaluate potential technical solutions and make recommendations to resolve data issues especially on performance assessment for complex data transformations and long running data processes.
· Strong knowledge of SQL and NoSQL databases.
· Familiarity with data modeling and schema design.
· Excellent problem-solving and analytical skills.
· Strong communication and collaboration skills.
· AWS certifications (e.g., AWS Certified Data Analytics - Specialty, AWS Certified Data Analytics - Specialty), Databricks certifications, and Informatica certifications are a plus.
Preferred Skills:
· Experience with big data technologies like Apache Spark and Hadoop on Databricks.
· Knowledge of containerization and orchestration tools like Docker and Kubernetes.
· Familiarity with data visualization tools like Tableau or Power BI.
· Understanding of DevOps principles for managing and deploying data pipelines.
· Experience with version control systems (e.g., Git) and CI/CD pipelines.
· Knowledge of data governance and data cataloguing tools, especially Informatica IDMC.
WHAT’S ON OFFER
You will be remunerated with an excellent base salary and entitled to attractive company benefits. Additionally, you will get the opportunity to enjoy a fun and collaborative work environment, alongside a strong career progression.
To submit your application, please apply online or email your UPDATED CV in Microsoft Word format to [email protected]. Your interest will be treated with strict confidentiality.
CONSULTANT DETAILS
Consultant Name: Preethi Kanthappan
Reg No: R1765546
Avensys Consulting Pte Ltd
EA Licence 12C5759
Privacy Statement: Data collected will be used for recruitment purposes only. Personal data provided will be used strictly in accordance with the relevant data protection law and Avensys' privacy policy.