Senior Site Reliability Engineer
Cloudbeds
full-remoteseniorpermanentdevopsbackend United States 2 days ago via LinkedIn
145,000 - 165,000 USD/annual
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
AWSSite Reliability EngineeringKubernetesEKSTerraformArgoCDGitOpsObservabilityPrometheusDatadog
About the role
Role: Senior Site Reliability Engineer (SRE)
As a Senior Site Reliability Engineer at Cloudbeds, you’ll be responsible for the reliability and performance of a globally used hospitality platform, helping ensure high-volume transactions run smoothly 24/7.
Responsibilities
- Design, implement, and maintain reliable, scalable AWS architecture.
- Support and operate highly loaded Kubernetes (EKS) clusters and related infrastructure components.
- Improve and support CI/CD using ArgoCD and GitOps.
- Automate deployments and infrastructure using Terraform (IaC).
- Build and continuously improve Observability/Monitoring systems using Grafana, Prometheus, Datadog, and CloudWatch.
- Participate in Incident Management and conduct Root Cause Analysis (RCA) to minimize service impact.
- Optimize performance, troubleshoot production issues, and support reliability best practices.
- Collaborate with development and security teams to ensure monitoring and security standards are met.
- Join an infrastructure support rotation to provide guidance across engineering teams.
Requirements
- 5+ years of DevOps/SRE experience in the AWS ecosystem.
- 5+ years with Kubernetes (EKS) and Helm.
- Experience with CI/CD pipelines using ArgoCD and GitHub Actions.
- Hands-on Terraform infrastructure-as-code experience.
- Observability and monitoring experience with Grafana, Prometheus, Datadog, and CloudWatch.
- Incident management, full-stack troubleshooting, performance analysis, and RCA experience.
- Web application systems experience: Nginx, Ingress controllers, load balancing, and CDN.
- Database and middleware experience: MySQL/PostgreSQL/Aurora, Redis/Memcached, and SQS.
- Strong networking knowledge (VPC, Security Groups, Network ACLs).
- Ability to work remotely and manage time with a global team; strong English communication.
- Bachelor’s degree in Computer Science or equivalent experience.
Nice to Have
- Advanced database administration (Aurora, MySQL, PostgreSQL).
- Experience in PCI-compliant environments.
- Experience with Kong API Gateway.
About Cloudbeds
Cloudbeds is a hospitality technology company building an intelligently designed platform that powers property operations for hotels and hotel groups worldwide. The platform supports bookings and integrates with hundreds of partners, enabling hotels to improve operations and commercial strategy. Cloudbeds operates with a completely remote team.
Scraped 4/15/2026