Job Description
Senior Software Engineer, ML Platform (Stability & Infrastructure)
London
About Iso
Isomorphic Labs (IsoLabs) launched in 2021 to advance human health by building AI models that accelerate scientific discovery.
Your Impact
You will play a pivotal role in ensuring the reliability and scalability of the foundations making our AI work possible.
What You Will Do
- Own the end-to-end strategy for platform reliability, focusing on accelerator (GPU/TPU) infrastructure and workload orchestration.
- Lead reliability work for our global job scheduler, designing and implementing a robust “test harness” to validate infrastructure upgrades.
- Architect and optimize next-generation inference services to address scaling limits and maintain high-throughput performance.
- Overhaul logging and monitoring systems to provide proactive alerting and telemetry that identifies failures before they impact research...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application