Company Overview:
Meinhardt Digital Technology Solutions is a new business unit of Meinhardt Group, a global construction engineering consultancy with 61 offices worldwide. We are dedicated to leveraging cutting-edge technology to drive innovation in the construction engineering industry.
Position Overview:
We are seeking a Data Engineer with a strong focus on pipeline development. In this role, you will design and implement data workflows for AI applications used within the construction engineering industry. You will be part of a dynamic team developing AI applications and data-driven solutions, collaborating closely with data engineers and software engineers, and reporting directly to the Lead Data Scientist. This is an excellent opportunity for individuals passionate about backend development and data engineering for AI.
Responsibilities:
Data Engineering
- Collect, clean, and pre-process structured and unstructured data from various sources within the construction domain
- Design and manage workflows for data ingestion pipelines across cloud and Databricks services
- Monitor and debug ingestion pipelines, implementing alerts and error handling for continuous ingestion requests from a frontend
- Develop innovative approaches to ingesting and organizing unstructured data, including images, PDFs, written reports, and other free-text elements
- Use vector databases to store, persist, and retrieve embeddings from large vector collections
- Perform exploratory data analysis to identify patterns, trends, and anomalies
- Architect tools and dashboards that transform NLP-related data into interpretable numerical analysis during and following ingestion
- Create clear and insightful visualizations to communicate complex findings to non-technical stakeholders
AI Model Development
- Assist in the development and implementation of Large Language Models and Natural Language Processing techniques to address construction- and asset/facilities management-related challenges
- Collaborate with senior data scientists to build RAG pipelines and fine-tuned models for products serving internal and client needs
Data Privacy and Security
- Ensure compliance with data privacy regulations and cloud security protocols so that domain-locked data meets cross-border data flow restrictions
What We Are Looking For:
- Bachelor’s degree in Data Science, Computer Science, Statistics, Applied Mathematics, or a related field, or equivalent experience
- At least 1 year of experience in data engineering, AI model development, and natural language processing
- Strong programming skills in Python
- Knowledge of data engineering and data pre-processing techniques
- Knowledge of vector stores and databases in relational and non-relational formats
- Familiarity with data engineering workflows and tools, particularly in cloud environments (Azure, Databricks)
- Ability to propose and test creative solutions to natural language processing and computer vision problems
- Effective communication skills to convey technical concepts to non-technical stakeholders
- Experience with SQL and NoSQL databases
- Familiarity with agile development methodologies
- Good interpersonal skills
Nice to Have:
- Experience with construction/infrastructure industry data and processes
- Experience using managed workflows and collaborative workspaces (Databricks)
- Experience in AI product development
Note to Potential Candidates:
We understand that no candidate will meet every single requirement listed. Studies show that women and members of underrepresented groups often hesitate to apply unless they meet all criteria. We want to assure you that your unique experiences and perspectives are valued here. If this role interests you, we strongly encourage you to apply. Passion and potential are key to us, and we believe diverse backgrounds strengthen our team.