xelys jobs xelys jobs

Senior Site Reliability Engineer

Cloudbeds

full-remoteseniorpermanentdevops United States 3 days ago via LinkedIn
145,000 - 165,000 USD/annual

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

AWSKubernetesEKSTerraformArgoCDGitOpsCI/CDGrafanaPrometheusIncident Management

About the role

Role Overview

As a Senior Site Reliability Engineer (SRE) at Cloudbeds, you will ensure the reliability and performance of the hospitality platform. You’ll architect and operate scalable AWS infrastructure that supports high transaction volumes globally, while driving automation, resilience, and continuous improvement.

Responsibilities

  • Design and implement reliable, scalable AWS architecture for organizational needs.
  • Operate and maintain Kubernetes (EKS) clusters and related infrastructure components.
  • Support CI/CD using ArgoCD and GitOps.
  • Automate deployments with Terraform (Infrastructure as Code).
  • Build and improve observability and monitoring using Grafana, Prometheus, Datadog, and CloudWatch.
  • Participate in incident management and perform root cause analysis (RCA) to minimize service impact.
  • Optimize performance and troubleshoot issues.
  • Collaborate with engineering teams to define monitoring best practices and meet reliability targets.
  • Work with security teams to implement and maintain security best practices.
  • Provide guidance via an infrastructure support rotation.

Requirements

  • 5+ years in DevOps or SRE within the AWS ecosystem.
  • 5+ years with Kubernetes (EKS) and Helm.
  • Experience designing/building CI/CD pipelines with ArgoCD and GitHub Actions.
  • Terraform infrastructure-as-code experience.
  • Observability/monitoring experience with Grafana, Prometheus, Datadog, CloudWatch.
  • Incident management, full-stack troubleshooting, performance analysis, and RCA experience.
  • Experience with web application systems: Nginx, Ingress controllers, load balancing, CDNs.
  • Database and middleware experience: MySQL, PostgreSQL, Aurora, Redis, Memcached, SQS.
  • Strong networking skills: VPC, Security Groups, Network ACLs.
  • Ability to work remotely and manage time across a global team.
  • English communication skills (written and verbal).
  • Bachelor’s degree in Computer Science or equivalent experience.

Bonus Skills

  • Advanced database administration experience (Aurora, MySQL, PostgreSQL).
  • Experience in a PCI-compliant environment.
  • Experience with Kong API Gateway.

About Cloudbeds

Cloudbeds builds an intelligent hospitality platform (hotel PMS) used by properties across 150 countries. The company helps hoteliers improve operations and commercial strategy through a unified system that integrates with hundreds of partners. Cloudbeds operates with a completely remote team and focuses on reliability, performance, and modern cloud technologies.

Scraped 4/16/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.