Job Description
At Fractile, we’re taking a revolutionary approach to computing to run the world’s largest language models 100x faster than existing systems. Our fast-growing team is working at the cutting edge of the latest AI developments in both hardware and software. Want to get involved?
We are looking for Senior ML Runtime Engineers with experience of key ML software ecosystem components to work on inference server integrations and the runtime stack of our ground-breaking AI accelerators. You can be based in either our London office or Bristol, the choice is yours.
In this role, you will:
- Integrate Fractile's innovative AI acceleration hardware with leading open source projects like PyTorch, vLLM, and SGLang
- Develop our underlying high-performance Rust runtime
- Work with hardware, lower-level software, and ML engineers in a highly collaborative hardware-software co-design methodology
It would be great if you have:
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application