Job Description
We are currently looking for an SRE/DevOps Engineer in Canada. This role sits at the frontline of enterprise platform reliability, ensuring the stability, availability, and performance of large‑scale cloud and hybrid systems. You will act as the first line of response for incidents across modern infrastructure environments, including Kubernetes, APIs, databases, and cloud‑native services. Working in a highly operational and collaborative setting, you will monitor systems, execute runbooks, and support rapid incident resolution to minimize downtime. The position combines hands‑on technical troubleshooting with structured operational processes, where precision and communication are critical. You will contribute directly to service reliability by identifying issues, escalating intelligently, improving documentation, and spotting automation opportunities.
Accountabilities
Monitor system health across cloud and on‑prem environments using observability tools such as dashboards, logs, ...