Job Description
About the Role
As a Principal AI/ML Infrastructure Engineer , you will be instrumental in designing and implementing cutting‑edge AI/ML‑powered solutions to optimize and automate our infrastructure across GCP and multi‑cloud environments.
Responsibilities
- Design and implement AI/ML‑powered solutions for infrastructure use cases, including predictive autoscaling, anomaly detection, intelligent cost optimization, and automated remediation across GCP and multi‑cloud environments.
- Build and maintain AI‑driven monitoring and observability systems that correlate logs, metrics, and traces to surface root causes, predict bottlenecks, and reduce mean time to resolution (MTTR).
- Develop and operate automated incident response workflows using AI‑powered playbooks that diagnose, contain, and resolve infrastructure issues with minimal manual intervention.
- Integrate AI tooling into CI/CD pipelines to improve deployment reliability,...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application