Good programming skills in Python
Must possess experience or interest in training and deploying machine learning models end to end.
Good understanding of basic statistics and deep learning techniques
Prior experience in data integration, profiling, validation and cleansing
Proficiency with SQL
Software delivery
Strong understanding of agile methodologies
Strong understanding of CI/CD
Strong understanding of test-driven development
Comfortable using version control and enterprise task management tools.
Data Engineering
Experience in deploying TensorFlow and PyTorch into production
Building data pipelines and architecture
Should have extensive experience with relational and NO-SQL databases
Should have proficiency in handling both structured and unstructured data sources
Strong experience deploying applications to cloud platforms like Azure, AWS
Interest in building efficient batch and streaming data engineering pipelines
Exposure to distributed data processing platforms like Spark