Job Description
You will be responsible for Eyes-on-Glass Monitoring, Triage & Incident Ownership, Troubleshooting & Restoration, Cross-Team Collaboration, Platform & Application Stack Awareness, and Service Quality & Process Excellence.
• Triage & Incident Ownership
o Perform rapid intake, triage, and prioritization of alerts, tickets, and incidents.
o Act as Incident Owner during high-severity events, ensuring clear communication, timely updates, and swift restoration of service.
o Maintain accurate, real-time incident timelines and post-incident documentation.
• Troubleshooting & Restoration
o Execute root-cause isolation across application, middleware, APIs, data, and infrastructure layers.
o Use observability/monitoring tools (e.g., Kibana, Dynatrace, CloudWatch, Grafana) to correlate logs, metrics, and traces and to identify anomalies, performance bottlenecks, and failure patterns.
o Perform targeted mitigations, rollbacks, config fixes, and coordinate hotfixes to restore service quic...