xelys jobs xelys jobs

Senior Site Reliability Engineer

Cloudbeds

full-remoteseniorpermanentdevopsbackend United States 2 days ago via LinkedIn
145,000 - 165,000 USD/annual

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

Site Reliability EngineeringAWSKubernetesEKSTerraformGitOpsArgoCDObservabilityIncident ManagementGrafana

About the role

Role Overview

As a Senior Site Reliability Engineer (SRE) at Cloudbeds, you’ll ensure the reliability and performance of a global hospitality platform. You’ll architect and operate scalable systems on AWS, improve automation and observability, and lead incident response practices to keep services running 24/7.

Responsibilities

  • Design and implement reliable, scalable AWS architecture.
  • Maintain and support highly loaded Kubernetes (EKS) clusters and infrastructure components.
  • Support CI/CD using ArgoCD and GitOps.
  • Automate deployments with Terraform (Infrastructure as Code).
  • Build and continuously improve monitoring and observability (Grafana, Prometheus, Datadog, CloudWatch).
  • Participate in incident management and perform root cause analysis (RCA).
  • Optimize performance, troubleshoot production issues, and improve reliability targets.
  • Collaborate with engineering teams on monitoring best practices.
  • Partner with security teams to implement and maintain security best practices.
  • Join an infrastructure support rotation and provide guidance to other teams.

Requirements

  • 5+ years experience as an SRE/DevOps in the AWS ecosystem.
  • 5+ years with Kubernetes (EKS) and Helm.
  • Experience building CI/CD pipelines using ArgoCD (GitOps) and GitHub Actions.
  • Strong Terraform IaC experience.
  • Observability/monitoring experience with Grafana, Prometheus, Datadog, CloudWatch.
  • Incident management and full-stack troubleshooting, performance analysis, and RCA.
  • Web application systems experience: Nginx, Ingress controllers, load balancing, and CDNs.
  • Database and middleware experience: MySQL, PostgreSQL, Aurora, Redis, Memcached, SQS.
  • Networking fundamentals: VPC, Security Groups, Network ACLs.
  • Ability to work remotely and manage time across a global team.
  • Strong written and verbal communication in English.
  • Bachelor’s degree in Computer Science or equivalent experience.

Bonus Skills

  • Advanced Database Administration experience (Aurora, MySQL, PostgreSQL).
  • Experience in a PCI-compliant environment.
  • Experience with Kong API Gateway.

About Cloudbeds

Cloudbeds is a hospitality technology company building an intelligently designed platform for property management. Its software helps hotels and hotel groups streamline operations and improve commercial strategy through integrations with hundreds of partners, processing billions of bookings annually.

Scraped 4/15/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.