Summary
We are seeking an AI/ML Researcher to lead the research and development of large language model (LLM) solutions. The ideal candidate will have a minimum of 2 years of experience in NLP research and a deep understanding of core LLM components, including LLM modeling, information retrieval, entity-relation extraction, agentic frameworks, and reasoning. The role involves leveraging AI/ML frameworks, such as TensorFlow and PyTorch, to develop cutting-edge generative LLM solutions. Responsibilities include designing and conducting scientific experiments and using NLP techniques and LLM core components to develop novel LLM solutions in cloud or on-premises environments. The candidate will also be expected to collaborate with the team, participate in code reviews and architecture discussions, and create comprehensive documentation. The ideal candidate should possess a strong foundation in statistical analysis, machine learning, deep learning, generative AI, and text representation techniques. Additionally, the candidate should have excellent communication skills, a strong work ethic, and a passion for delivering high-quality, innovative solutions.
Responsibilities:
- Lead the design of research and experiment methodology, specifically for LLM core components.
- Conduct independent research and report experiment results and research findings to stakeholders, e.g., ...
- Collaborate with the engineering team on data engineering tasks, including data collection, preprocessing, quality control, and augmentation, to curate the high-quality datasets essential for model training.
- Use analytical techniques to extract meaningful insights from large datasets.
- Communicate regularly with the supervisor and research leads on scientific progress, insights, and research direction.
- Publishing research papers in reputable journals and conferences is valued but is not a strict requirement for this role.
- Contribute to other program-related projects.
- Collaborate with the engineering team to translate researched technology (low TRL) into engineered solutions (high TRL).
- Candidates applying for senior staff positions are expected to mentor team members (or cross-team members) and provide technical advice.
Requirements:
- PhD or Master's degree in Computer Science, Computer Engineering, Statistics, Engineering, or a related field.
- 2-4 years of experience in research and development, specifically in one or more of the following areas:
  - Natural language processing, specifically LLM core components, e.g., joint entity-relation extraction, information retrieval, prompt engineering, reasoning, sentiment analysis, and LLM vulnerabilities.
  - Generative AI, specifically synthetic data generation.
- Demonstrate an analytical mindset and strong problem-solving skills.
- Keep abreast of the latest advancements in program-related techniques and methodologies.
- Exhibit a strong drive to deliver optimal solutions within tight timelines.
- Strong communication and organizational skills.
- Ability to produce comprehensive documentation of technical specifications, architectural designs, and best practices.