Snapshot
Are you a Research Engineer with a passion for Reinforcement Learning and Multimodality? Join Google DeepMind's Frontier AI Unit! We are seeking a researcher to help us make learning efficient through conversational environments. While text-based reasoning has shown immense promise, we are moving the frontier toward image-grounded, multimodal, and retrieval-augmented conversational setups. You will bridge the gap between conversational learning and the visual domain, applying the latest RL methods to create scalable, semi-verifiable environments that power the next generation of our models (e.g., Gemini).
About us
Google DeepMind: Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we're working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
Frontier AI Unit: The Frontier AI Unit is responsible for building and scaling the next generation of our core models. Within this group, our team focuses on "conversationality" as a mechanism for efficient learning. We believe that learning conversationally transfers between environments. We are moving beyond Chain-of-Thought (CoT) and text-only setups to build multimodal, multi-turn reasoning capabilities, leveraging an ecosystem of autoraters and autousers to scale environment creation.
The role
We have strong evidence that conversational environments lead to better learning in a transferable way. However, we need to go beyond text. As a Research Engineer, you will play a pivotal role in expanding Meta Reinforcement Learning to multimodal setups. You will help us leapfrog current industry benchmarks by extending our focus from verifiable domains to semi-verifiable, multimodal domains (e.g., Lens, Image-grounded reasoning).
This is an ecosystem play: you will leverage our advantages in autoraters and autousers to scale the creation of these conversational environments. You will be the bridge between the core conversational work and the specifics of grounding in the visual domain, moving our training infra from static data towards dynamic, multi-turn environments.
Key responsibilities
MNCJobs.ch will not be responsible for any payment made to a third-party. All Terms of Use are applicable.