Responsibilities
- Development of AI compiler framework, high performance kernel authoring and acceleration onto next generation of hardware architectures.
- Contribute to the development of the industry-leading machine learning framework core compilers to support new state of the art inference and training machine learning/AI accelerators and optimize their performance.
- Collaborating with AI research scientists to accelerate the next generation of deep learning models such as recommendation systems, computer vision, or natural language processing.
- Performance tuning & optimizations of deep learning frameworks.
- Model optimization by developing the pruning & quantization algorithms and hardware neural architecture search technique.
Qualifications
- Bachelor’s or Master’s degree in computer science, machine learning, mathematics, physics, electrical engineering or related field.
- Experience in C/C++, Python, or other related programming language
- Experience in accelerating deep learning models or libraries on hardware architectures.
- Experience with Post Training Quantization (PTQ), Quantization Aware Training (QAT) and other quantization techniques and strategies
- Experience working with machine learning frameworks such as PyTorch, TensorFlow, ONNX etc.
- Ability to speak and write in English at a business level.
- Experience of Product Owner of scrum team is plus.
By sending us your personal data and CV, you are deemed to consent to PERSOLKELLY Singapore Pte Ltd and its affiliates to collect, use and disclose your personal data for account creation in GO and the purposes set out in the Privacy Policy : https://www.persolkelly.com.sg/policies.
You acknowledge that you have read, understood, and agree with GO’s Terms of Use, https://go.persolkelly.com/Tac and the Privacy Policy.
If you wish to withdraw your consent, please email us at [email protected]. Please feel free to contact us if you have any queries.
EA Reg. ID: R22109658 (Teng Yong Yao)
EA License No.: 01C4394