Vulcan AI is looking for a highly skilled and motivated Data Scientist specializing in Vision Transformer and Foundational Models to join our AI tech team.
As a Data Scientist focusing on Vision Transformer and Foundational Models, you will handle researching, developing, and implementing innovative machine learning solutions in computer vision. You will play a pivotal role in driving innovation and advancing our capabilities in using these models for various applications across several types of computer vision applications (remote sensing / satellite imagery, real time applications on phone and other edge devices, industrial defect, and quality check etc.).
Responsibilities:
- Research, design, and develop state-of-the-art vision transformer models and foundational architectures to solve complex problems in computer vision.
- Collaborate closely with cross-functional teams to understand business requirements and translate them into scalable and efficient machine learning solutions.
- Experiment with novel techniques in computer vision including vision transformer-based models, self-supervised vision models, model distillation with student /teacher model etc., to find the best solution for given AI use case
- Able to optimize inference model using platforms such as tflite, OpenVino and TensorRT to deploy models on relevant edge platform (phone, CPU based edge device etc.)
Requirements:
- Master's or Ph.D. in Computer Science, Data Science, or a related field with a focus on machine learning, computer vision, or artificial intelligence.
- Proven experience (> 3 years) in researching, developing, and implementing vision transformer models and foundational architectures for computer vision applications.
- Proficiency in TensorFlow and PyTorch, and strong programming skills in Python or other relevant languages.
- In-depth understanding of convolutional neural networks (CNNs), attention mechanisms, and hands-on experience with vision transformer architectures.
- Solid foundation in statistical analysis, data manipulation, and visualization techniques.
- Excellent problem-solving skills with a track record of delivering impactful solutions in the field of computer vision.
Preferred Qualifications:
- Experience working with large-scale datasets and distributed training of large models on multi-gpu setups
- Experience in computer vision and relevant signal processing techniques using OpenCV (e.g. morphological operations, FFT etc.)
- Publications in reputable conferences/journals highlighting contributions to the field of computer vision, generative AI and transformer-based architectures
- Knowledge of deploying machine learning models in production environments and familiarity with cloud platforms like AWS, GCP, or Azure.