Job Description
As a Site Reliability Engineer - Application Support, you will:
- Ensure System Reliability & Availability: Monitor, troubleshoot, and maintain critical backend applications and infrastructure to meet SLA/SLO targets and ensure high availability of trading platforms
- Implement SRE Best Practices: Design and implement monitoring, alerting, and observability solutions using tools like Grafana, Dynatrace, and Elasticsearch to proactively identify and resolve issues
- Automate Operations: Develop automation scripts and tools using Linux shell scripting and Python to reduce manual intervention, improve system efficiency, and eliminate toil
- Manage Cloud Infrastructure: Work with AWS services and Terraform to provision, manage, and optimize cloud infrastructure while ensuring cost efficiency and security
- Container Orchestration: Manage and t...