Recruit Detail
Find out more about the work we do, the experience and skills we can bring to the table, and our terms and conditions.
Company Name
CyberAgent, Inc.
Job Type
[AI Lab] Research Engineer (Reinforcement Learning)
Work Detail
■Mission The Reinforcement Learning team is working on a wide range of projects, from theoretical research to solving real-world problems, with the aim of "practical applications of reinforcement learning." This effort does not end with publishing papers, but also aims to create practical value, and we are particularly focused on reinforcement learning in the field of generative AI. In this field, the development and experimentation of Large Language Models (LLM) and Reinforcement Learning from Human Feedback (RLHF) will be the focus. The Research Engineer (Reinforcement Learning) will work with the research scientists throughout the entire process from algorithm development to implementation, and will play an important role in the engineering side of the project. ■Major tasks Research and development of reinforcement learning and RLHF technologies to improve the performance of language models Implementation of algorithms, experiments, and result analysis Data collection, preprocessing, and construction of datasets Creation of demos and prototypes
Ideal Profile
[Desired Skills] - Deep understanding and practical experience in reinforcement learning or language generation AI - Programming ability using Python [Preferred] - Experience in experiments and application development using language generation models - Experience using the Huggingface Transformers library - Development experience using container technology (Docker, etc.) - Experience designing and implementing demonstration experiments [Desired Profile] - Someone who is passionate about the social implementation of reinforcement learning and language generation AI and wants to contribute to its practical application - Someone who has the self-reliance to solve problems through trial and error based on data analysis - Someone who can actively discuss with other members and collaborate to continuously improve experiments and implementations
Work Location
150-6121 Shibuya Scramble Square 22F, 2-24-12 Shibuya, Shibuya-ku, Tokyo
Phd. Stating Salary
Negotiable Preferential treatment will be given according to our company regulations, taking into consideration experience and ability.