Requirements:
Cloud Services: Familiarity with AWS and Azure, such as AWS EC2, S3, MWAA/Airflow, AWS Lambda, Azure Blob Storage, Data Factory, Synapse Analytics, etc.
Machine Learning Platform:
- Familiarity with the architecture, features, and best practices of the Databricks platform, Databricks certification or relevant certifications are a plus.
- Proficiency in PySpark, Python, SQL, and other big data and machine learning technologies; practical project experience is preferred.
- Familiarity with concepts and methods related to data, such as data warehouses, data lakes, data pipelines, data quality, data security, and data governance.
- Understanding of machine learning principles, processes, and application scenarios; experience in training, deploying, automating, and managing machine learning models using Databricks is a plus.