Senior Site Reliability Engineer
Cloudbeds
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
About the role
Role Overview
As a Senior Site Reliability Engineer (SRE) at Cloudbeds, you will ensure the reliability and performance of the hospitality platform. You’ll architect and operate scalable AWS infrastructure that supports high transaction volumes globally, while driving automation, resilience, and continuous improvement.
Responsibilities
- Design and implement reliable, scalable AWS architecture for organizational needs.
- Operate and maintain Kubernetes (EKS) clusters and related infrastructure components.
- Support CI/CD using ArgoCD and GitOps.
- Automate deployments with Terraform (Infrastructure as Code).
- Build and improve observability and monitoring using Grafana, Prometheus, Datadog, and CloudWatch.
- Participate in incident management and perform root cause analysis (RCA) to minimize service impact.
- Optimize performance and troubleshoot issues.
- Collaborate with engineering teams to define monitoring best practices and meet reliability targets.
- Work with security teams to implement and maintain security best practices.
- Provide guidance via an infrastructure support rotation.
Requirements
- 5+ years in DevOps or SRE within the AWS ecosystem.
- 5+ years with Kubernetes (EKS) and Helm.
- Experience designing/building CI/CD pipelines with ArgoCD and GitHub Actions.
- Terraform infrastructure-as-code experience.
- Observability/monitoring experience with Grafana, Prometheus, Datadog, CloudWatch.
- Incident management, full-stack troubleshooting, performance analysis, and RCA experience.
- Experience with web application systems: Nginx, Ingress controllers, load balancing, CDNs.
- Database and middleware experience: MySQL, PostgreSQL, Aurora, Redis, Memcached, SQS.
- Strong networking skills: VPC, Security Groups, Network ACLs.
- Ability to work remotely and manage time across a global team.
- English communication skills (written and verbal).
- Bachelor’s degree in Computer Science or equivalent experience.
Bonus Skills
- Advanced database administration experience (Aurora, MySQL, PostgreSQL).
- Experience in a PCI-compliant environment.
- Experience with Kong API Gateway.
About Cloudbeds
Cloudbeds builds an intelligent hospitality platform (hotel PMS) used by properties across 150 countries. The company helps hoteliers improve operations and commercial strategy through a unified system that integrates with hundreds of partners. Cloudbeds operates with a completely remote team and focuses on reliability, performance, and modern cloud technologies.
Scraped 4/16/2026