Key Responsibilities:
- Design and develop AI systems(i.e. training acceleration framework, inference speedup framework or integration of framework) that meet business requirements, including natural language processing, computer vision, machine learning, and deep learning algorithms.
- Analyze and evaluate existing AI systems and propose improvements for increased performance, accuracy, and scalability.
- Collaborate with cross-functional teams, including software engineers, data scientists, and product managers, to integrate AI solutions into existing products and services.
- Stay up-to-date with the latest AI technologies and research, and evaluate their potential impact on our business and industry.
- Ensure that all AI systems are secure, reliable, and scalable, and comply with regulatory requirements.
- Develop and maintain documentation and user guides for AI systems.
Requirements:
- Proficient in TensorFlow/PyTorch, Ray/DeepSpeed/NVIDIA Megatron/Colossal-AI, and familiar with the internal operating mechanisms of these systems.
- Familiar with various optimization algorithms and model architectures, proficient in Python or C++ optimization algorithm libraries, including various classical algorithms and models based on gradients (BERT, GPT-3, Swin Transformer, ViT, MLP-Mixer).
- Familiar with SaaS, architecture, compilers, networks, CUDA, and other related knowledge or relevant project experience.
- Strong programming and engineering implementation abilities. Those who have won programming competition awards or published high-quality papers will be given priority in recruitment.
- Bachelor's degree or above from 211, 985, or overseas well-known universities, majoring in computer science, software engineering, electronic information, automation, mathematics, physics, or other artificial intelligence-related majors.