Job Description
Overview
Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.
Responsibilities
Bridge the gap between research and production by turning cutting-edge algorithms into scalable training systems. You will design and optimize the core infrastructure behind frontier AI models — from reinforcement learning training loops and distributed GPU training to massive-scale data pipelines. Our systems train models across thousands of GPUs and process petabyte-scale datasets. We care deeply about numerical stability, throughput, and reproducibility. This team owns and evolves the core infrastructure behind our training systems.
We Focus On
- Reinforcement learning training infrastructure
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application