Job Description
About The Team
Reliable services are what enables OpenAI to train the best AI models in the world and to bring the promise of safe, effective AI to the world. The SRE team in research is responsible for defining, measuring, and improving the reliability of the research platform. The SRE team works closely with the supercomputing and hardware health teams to improve the functioning of the existing research platform and build the future platform. The research platform is the platform used to conduct basic AI research and to train the next generation of models. This is the team that helps make the infrastructure enabling progress at the world’s leading AI lab.
About The Role
As OpenAI continues to grow, we are building a team focused on the reliability of the research platform and enabling our systems to scale. Our success depends on our ability to quickly iterate on research ideas while also ensuring that the underlying platform is performant, usable, an...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application