Responsibilities
· Development of AI compiler framework, high performance kernel authoring and acceleration onto next generation of hardware architectures.
· Contribute to the development of the industry-leading machine learning framework core compilers to support new state of the art inference and training machine learning/AI accelerators and optimize their performance.
· Analyze deep learning networks, develop & implement compiler optimization algorithms.
· Collaborating with AI research scientists to accelerate the next generation of deep learning models such as recommendation systems, computer vision, or natural language processing.
· AI compiler kernel development, including graph optimization, tiling and memory allocation.
Qualifications
· Bachelor’s or Master’s degree in computer science, machine learning, mathematics, physics, electrical engineering or related field.
· Experience in C/C++, Python, or other related programming language
· Experience in accelerating deep learning models or libraries on hardware architectures.
· Experience working with machine learning frameworks such as PyTorch, TensorFlow, ONNX etc.
· Experience in intermediate IR such as LLVM.
· Experience of Product Owner of scrum team is plus.