Job Description
Join a skilled team as a Senior Site Reliability Engineer, leveraging your expertise in Azure Kubernetes Service and observability tools like Dynatrace and Splunk. Deliver high-impact solutions to enhance system reliability and performance.
As a critical member in this role, you will design observability-as-code solutions using Terraform to create effective monitoring pipelines and dashboards. Your responsibilities will encompass driving real-time performance insights, troubleshooting complex production incidents, and automating operational tasks to build resilient systems. You will collaborate with cross-functional teams to ensure service excellence and reliability.
Key Responsibilities: • Design observability-as-code solutions with Terraform • Drive improvements using Dynatrace, ELK, and Splunk • Instrument applications for comprehensive observability • Troubleshoot complex incidents in production environments • Lead incident response and blameless postmortems
<...
As a critical member in this role, you will design observability-as-code solutions using Terraform to create effective monitoring pipelines and dashboards. Your responsibilities will encompass driving real-time performance insights, troubleshooting complex production incidents, and automating operational tasks to build resilient systems. You will collaborate with cross-functional teams to ensure service excellence and reliability.
Key Responsibilities: • Design observability-as-code solutions with Terraform • Drive improvements using Dynatrace, ELK, and Splunk • Instrument applications for comprehensive observability • Troubleshoot complex incidents in production environments • Lead incident response and blameless postmortems
<...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application