Responsibilities
About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we work towards every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.
About the team
As a core member of our LLM Global Data Team, you will be at the heart of our coding operations. This role offers a unique opportunity to gain first-hand experience of the intricacies of training Large Language Models (LLMs) with diverse data sets. Through our carefully designed rotation program, you will see how different verticals of high-quality data are crafted and used. Upon completing the rotation, you will contribute to coding initiatives for data generation and quality assurance, paving your way to lead, train, or oversee large-scale coding data QA and operation projects.
Your Role Will Involve:
1. Perform quality assurance and develop case studies to tackle intricate data challenges involving coding, with occasional work in mathematics.
2. Collaborate with product managers and algorithmic engineers to identify the most effective coding data for improving our LLMs.
3. Engage in data research to craft strategic insights that guide our data production.
4. Gain direct experience in human feedback data production to understand the synergy between humans and data in LLM training.
Qualifications
1. Bachelor's degree in Computer Science, Information Science, or a related technical discipline.
2. Proficiency in one or more programming languages, including but not limited to Python, Java, Go, and C.
3. Strong communication and problem-solving skills, effective execution and enforcement, and skill in writing documentation.
Preferred Qualifications:
1. Experience with large-scale codebases or advanced coding skills in algorithm optimisation.
2. Experience in operations and technical writing.
3. Proven leadership skills, including mentoring team members and facilitating the swift onboarding of new hires.
4. Strong sense of responsibility and the ability to adapt to a high-intensity work environment.
5. A deep interest in LLMs, human behaviour, and user experience. The ideal candidate is an enthusiastic learner who finds engagement with diverse case studies and annotators stimulating.
Note: This role requires a written test prior to interviews.
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.