Responsibilities:
- Develop distributed artificial intelligence systems and deploy them on large-scale clusters or clouds.
- Develop and optimize algorithm systems for specific scenarios and problems, and deliver solutions that can be applied in those scenarios.
- Participate in designing and optimizing the integration of artificial intelligence technology with existing tools to improve product performance.
- Write high-quality scientific papers, with the opportunity to serve as first author on important papers.
Requirements:
- Proficient in TensorFlow/PyTorch and Ray/DeepSpeed/NVIDIA Megatron, and familiar with the internal workings of these systems.
- Familiar with a range of optimization algorithms and model architectures, and proficient with Python or C++ optimization libraries, including classical algorithms and gradient-based models (e.g., BERT, GPT-3, Swin Transformer, ViT, MLP-Mixer).
- Familiar with SaaS, architecture, compilers, networks, CUDA, and related areas, or have relevant project experience.
- Strong programming and engineering implementation abilities. Candidates who have won programming competition awards or published high-quality papers will be given priority.
- Bachelor's degree or above from a Project 211 or Project 985 university or a well-known overseas university, majoring in computer science, software engineering, electronic information, automation, mathematics, physics, or another artificial intelligence-related field.