Job Overview:
We are seeking a Mobile AI Inference Engineer to join our team. In this role, you will optimize and deploy AI models on mobile platforms such as iOS and Android to improve the performance and responsiveness of AI-driven applications.
Responsibilities:
- Optimize and deploy AI models on mobile devices to improve application performance and response time.
- Collaborate with team members to analyze requirements, contribute to architecture design, and implement code for mobile inference engines.
- Stay current with the latest advances in AI for mobile applications, drawing on both academic research and industry developments.
Requirements:
- Bachelor’s degree or higher in Computer Science or a related field; at least 2 years of relevant professional experience is preferred.
- Understanding of AI acceleration technologies, including model quantization and pruning.
- Familiarity with deep learning frameworks, notably PyTorch.
- Experience with mobile AI acceleration frameworks.