Salary up to S$200,000 per annum
Location: Pasir Panjang
Responsibilities:
- Oversee the implementation and enhancement of distributed training methods across multi-GPU and multi-node environments.
- Apply advanced optimization techniques (e.g., SGD, Adam, Adagrad) to accelerate learning processes while ensuring model accuracy.
- Lead initiatives in model compression, pruning, and quantization to reduce model size and boost computational efficiency.
- Innovate in knowledge distillation to transfer learning from larger models to more efficient, smaller models.
- Optimize resource utilization through micro-tuning strategies, such as prompt-based tuning and parameter-efficient methods.
- Implement mixed precision training and hardware-specific optimizations (e.g., CUDA, Tensor Cores) to maximize hardware acceleration.
- Manage hyperparameter tuning using automated tools to achieve optimal model configurations.
- Collaborate with cross-functional teams to apply large model technology to real-world business applications, integrating cutting-edge research into production systems.
- Design and optimize large language models, develop fine-tuning strategies, and streamline training processes.
- Explore and implement deep learning architectures like Seq2Seq and Transformer, including advanced techniques such as Fine-tuning, Prompt Engineering, and Soft Prompting (SFT).
- Develop systems for efficient model training and deployment, involving data preprocessing, parallel training, and resource management.
- Establish performance evaluation systems and monitor training metrics to ensure high model quality and efficient iterations.
Requirements:
- Bachelor’s degree or higher in Computer Science, Artificial Intelligence, Mathematics, or a related field.
- Minimum of 5 years of experience in AI, with at least 3 years focused on large-scale language model development and optimization, with proven successful projects.
- Proficient in deep learning theories, experienced with PyTorch and TensorFlow, and skilled in model fine-tuning and SFT.
- Strong skills in algorithm design and optimization, with experience in large-scale data processing and high-performance computing.
- Demonstrated leadership, teamwork, communication, and project management abilities, with the capability to track international research trends and drive innovation.
HOW TO APPLY:
Interested applicants, please submit your updated resume in PDF/MS Word format to [email protected]
Please state your availability, current & expected salaries for processing purpose.
All applications will be treated in the strictest confidence.
We regret that only shortlisted candidates will be notified.
Brandon Lim | EA Registration No: R21102894
EPS Consultants Pte Ltd| 95C5630