Site Reliability Engineer
Talener
full-remote, senior, permanent, devops, backend
Orlando, FL
Yesterday via LinkedIn
110,000 - 130,000 USD per year
Tags
Site Reliability Engineering, SRE, Azure, Kubernetes, Docker, Terraform, Observability, Datadog, Incident Response, CI/CD
About the role
Role Overview
Talener is seeking a Site Reliability Engineer (SRE) to help scale and support a cloud-based platform. The position is remote within the U.S. You’ll strengthen reliability, operational efficiency, observability, and automation across production environments, with a focus on incident response and continuous improvement.
Key Responsibilities
- Ensure reliability, scalability, performance, and security of cloud infrastructure and applications
- Monitor, troubleshoot, and resolve production issues across distributed systems
- Lead incident response, perform root cause analysis, and run blameless post-mortems
- Build and maintain operational runbooks and automated remediation workflows
- Develop and improve observability/telemetry for proactive monitoring and alerting
- Collaborate with engineering, DevOps, QA, security, and operations to improve platform health and deployment processes
- Support infrastructure automation and configuration management
- Apply Infrastructure-as-Code (IaC) and improve CI/CD operational workflows
- Promote reliability engineering and operational excellence best practices
- Participate in an on-call rotation (including occasional off-hours support for West Coast operations)
Required Qualifications
- 5+ years in SRE, DevOps, cloud infrastructure, or related disciplines
- Strong production troubleshooting and support experience
- Hands-on observability/monitoring experience with Datadog, New Relic, or similar tools
- Experience with Azure and modern containerized infrastructure
- Knowledge of Docker and Kubernetes
- Experience with IaC tools such as Terraform, Terragrunt, or OpenTofu
- Scripting/automation with PowerShell, Python, JavaScript, or similar
- Experience with Git and CI/CD tooling (e.g., Azure DevOps)
- Understanding of cloud security principles, compliance, and operational best practices
- Strong collaboration and communication in Agile environments
Preferred Qualifications
- Experience improving operational visibility (telemetry, dashboards, reports, alerting)
- Experience evolving incident response processes and operational tooling
- Interest in mentoring and promoting operational excellence across teams
- Strong problem-solving mindset with emphasis on continuous improvement and automation
About Talener
Talener is a healthcare technology organization focused on building and scaling a high-impact cloud platform to improve healthcare delivery. The role supports platform reliability, operational efficiency, observability, and automation across production environments.
Scraped 6/13/2026