Full Time Posted June 01, 2026

Job Description

We are seeking a Senior Technical Lead Site Reliability Engineer to own the reliability, scalability, performance, and operational integrity of critical production services. This role is accountable for the full service lifecycle, from design and deployment readiness through production operations, incident response, and continuous improvement. Reliability is a core engineering responsibility, requiring strong software engineering skills and the ability to operate autonomously across AWS, hybrid data centers, and customer-hosted environments.


· Own production services end to end. Accountable for reliability, availability, scalability, performance, and operational health.


· Define and manage SLIs and SLOs, using error budgets to guide delivery decisions.


· Influence service and system design to improve fault tolerance, observability, and operational sustainability.


· Debug complex production issues across application code, services and infrastructure using s...
