Senior Site Reliability Engineer
Cloudbeds
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
About the role
Role Overview
As a Senior Site Reliability Engineer (SRE) at Cloudbeds, you will be responsible for the reliability and performance of the platform powering hospitality properties worldwide. You’ll architect and implement scalable AWS solutions, strengthen automation and resilience, and help ensure high transaction availability across the globe.
Responsibilities
- Design and implement reliable, scalable AWS architectures for organizational needs
- Maintain and support highly loaded Kubernetes (EKS) clusters and related infrastructure components
- Support CI/CD using ArgoCD and GitOps
- Automate deployments with Terraform (infrastructure-as-code)
- Build and continuously improve Observability and Monitoring (Grafana, Prometheus, Datadog, CloudWatch)
- Participate in Incident Management and Root Cause Analysis (RCA) to minimize service impact
- Optimize performance and troubleshoot production issues
- Collaborate with engineering teams on monitoring best practices and reliability targets
- Collaborate with security teams to maintain security best practices
- Join an infrastructure support rotation to support and guide other engineering teams
Requirements
- 5+ years experience as a DevOps/SRE in the AWS ecosystem
- 5+ years experience with Kubernetes (EKS) and Helm charts
- Experience building/supporting CI/CD pipelines with ArgoCD and GitHub Actions
- Strong Terraform infrastructure-as-code experience
- Observability/monitoring experience with Grafana, Prometheus, Datadog, and CloudWatch
- Incident management, troubleshooting, performance analysis, and RCA experience
- Experience with web application systems such as Nginx, Ingress controllers, load balancing, and CDNs
- Database and middleware experience: MySQL, PostgreSQL, Aurora, plus Redis, Memcached, SQS
- Good networking skills: VPC, Security Groups, Network ACLs
- Ability to work remotely and manage time with a global team; strong English communication
- Bachelor’s degree in Computer Science or equivalent experience
Bonus Skills
- Advanced Database Administration (Aurora, MySQL, PostgreSQL)
- Experience in a PCI-compliant environment
- Experience with Kong API Gateway
About Cloudbeds
Cloudbeds is a hospitality technology company building a PMS (property management system) platform used by properties across more than 150 countries. The platform processes billions of bookings annually and integrates with hundreds of partners, enabling hotels to improve operations and commercial strategy. Cloudbeds operates with a fully remote team and focuses on reliability, automation, and AI-powered solutions.
Scraped 5/14/2026