LabBase Doctor・PostDoctor-CyberAgent, Inc.-[AI Lab] Research Engineer (Reinforcement Learning)

Recruit Detail

Find out more about the work we do, the experience and skills we can bring to the table, and our terms and conditions.

Company Name

CyberAgent, Inc.

Job Type

[AI Lab] Research Engineer (Reinforcement Learning)

Work Detail

■Mission The Reinforcement Learning Team is working on a wide range of projects, from theoretical research to solving real-world problems, with the aim of "practical applications of reinforcement learning." These efforts go beyond publishing papers and aim to create practical value, with a particular focus on reinforcement learning in the field of generative AI. In this field, the team's focus is on the development and experimentation of Large Language Models (LLM) and Reinforcement Learning from Human Feedback (RLHF). The Research Engineer (Reinforcement Learning) will collaborate with research scientists throughout the entire process, from algorithm development to implementation, and play a key role in the engineering aspects of the project. ■Major Tasks Research and development of reinforcement learning and RLHF technologies to improve the performance of language models Algorithm implementation, experiments, and result analysis Data collection, preprocessing, and dataset construction Creating demos and prototypes

Ideal Profile

[Desired Skills] - Deep understanding and practical experience in reinforcement learning or language generation AI - Python programming skills [Preferred] - Experience in experimenting and developing applications using language generation models - Experience using the Huggingface Transformers library - Development experience using container technologies (e.g., Docker) - Experience designing and implementing proof-of-concept experiments [Desired Profile] - Passionate about the social implementation of reinforcement learning and language generation AI and wanting to contribute to their practical application - Self-motivated and able to solve problems through trial and error based on data analysis - Ability to actively discuss and collaborate with other team members to continuously improve experiments and implementations

Work Location

150-6121 Shibuya Scramble Square 22F, 2-24-12 Shibuya, Shibuya-ku, Tokyo

Phd. Stating Salary

Negotiable. We will give preferential treatment according to our company regulations, taking into consideration experience and ability.

External Site

https://hrmos.co/pages/cyberagent-group/jobs/1694993421304987678