Permanent Posted June 14, 2026
Apply Now

Job Description

Job Responsibilities:

  • Operate and maintain AWS-native services in production (e.g. Lambda, ECS, EKS, Redshift, GuardDuty, WAF, etc.).
  • Ensure high availability, uptime, and secure cloud operations.
  • Monitor infrastructure, manage alerts, and respond to production incidents.
  • Design and maintain Infrastructure-as-Code (IaC) using Terraform, CloudFormation, and Ansible.
  • Troubleshoot deployment pipelines and resolve environment drift.
  • Manage OS lifecycle and patching (RHEL, Windows Server) using AWS Patch Manager, WSUS, YUM/DNF.
  • Track and manage SSL certificate renewals and End-of-Life (EOL) components.
  • Integrate tools like NGINX into observability and monitoring stacks.
  • Maintain documentation including runbooks, patch logs, audit artifacts, and post-mortem reports.
  • Collaborate with cross-functional SRE and engineering teams.
  • Provide mentorship to junior engineers and support continuous improveme...
  • Apply for This Position

    Ready to take the next step? Click the button below to submit your application.

    Submit Application