Full-time Posted June 03, 2026
Apply Now

Job Description

Join a skilled team as a Senior Site Reliability Engineer, leveraging your expertise in Azure Kubernetes Service and observability tools like Dynatrace and Splunk. Deliver high-impact solutions to enhance system reliability and performance.
As a critical member in this role, you will design observability-as-code solutions using Terraform to create effective monitoring pipelines and dashboards. Your responsibilities will encompass driving real-time performance insights, troubleshooting complex production incidents, and automating operational tasks to build resilient systems. You will collaborate with cross-functional teams to ensure service excellence and reliability.
Key Responsibilities:
• Design observability-as-code solutions with Terraform
• Drive improvements using Dynatrace, ELK, and Splunk
• Instrument applications for comprehensive observability
• Troubleshoot complex incidents in production environments
• Lead incident response and blameless postmortems

Apply for This Position

Ready to take the next step? Click the button below to submit your application.

Submit Application