Job Description
Role Title: Platform / SRE Engineer
Location: Sheffield (3 days a week onsite is mandatory)
Duration: 30/11/2026
Rate: 525p/d via Umbrella
Role Description:
Own deployment, observability, reliability, cost control, and production operations for the AI helpdesk platform.
Key responsibilities
- Build and manage CI/CD pipelines, infrastructure, and runtime environments for AI services.
- Deploy and operate model-serving, orchestration, and application workloads.
- Implement monitoring, tracing, alerting, logging, and operational dashboards.
- Manage scaling, release processes, rollback mechanisms, and production support.
- Optimize inference cost, latency, uptime, and system reliability.
- Create runbooks, incident response processes, and operational standards.
Required skills
- Strong experience in DevOps, SRE.
- Experience with Docke...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application