xelys jobs

Site Reliability Engineer

Talener

full-remote, senior, permanent, devops, backend; Orlando, FL; Yesterday via LinkedIn
110,000 - 130,000 USD per year


Tags

Site Reliability Engineering, SRE, Azure, Kubernetes, Docker, Terraform, Observability, Datadog, Incident Response, CI/CD

About the role

Role Overview

Talener is seeking a Site Reliability Engineer (SRE) to help scale and support a cloud-based platform in a remote (U.S.) setting. You’ll strengthen reliability, operational efficiency, observability, and automation across production environments, with a focus on incident response and continuous improvement.

Key Responsibilities

  • Ensure reliability, scalability, performance, and security of cloud infrastructure and applications
  • Monitor, troubleshoot, and resolve production issues across distributed systems
  • Lead incident response, perform root cause analysis, and run blameless post-mortems
  • Build and maintain operational runbooks and automated remediation workflows
  • Develop and improve observability/telemetry for proactive monitoring and alerting
  • Collaborate with engineering, DevOps, QA, security, and operations to improve platform health and deployment processes
  • Support infrastructure automation and configuration management
  • Apply Infrastructure-as-Code (IaC) and improve CI/CD operational workflows
  • Promote reliability engineering and operational excellence best practices
  • Participate in an on-call rotation (including occasional off-hours support for West Coast operations)

Required Qualifications

  • 5+ years in SRE, DevOps, cloud infrastructure, or related disciplines
  • Strong production troubleshooting and support experience
  • Hands-on observability/monitoring experience with Datadog, New Relic, or similar tools
  • Experience with Azure and modern containerized infrastructure
  • Knowledge of Docker and Kubernetes
  • Experience with IaC tools such as Terraform, Terragrunt, or OpenTofu
  • Scripting/automation with PowerShell, Python, JavaScript, or similar
  • Experience with Git and CI/CD tooling (e.g., Azure DevOps)
  • Understanding of cloud security principles, compliance, and operational best practices
  • Strong collaboration and communication in Agile environments

Preferred Qualifications

  • Experience improving operational visibility (telemetry, dashboards, reports, alerting)
  • Experience evolving incident response processes and operational tooling
  • Interest in mentoring and promoting operational excellence across teams
  • Strong problem-solving mindset with emphasis on continuous improvement and automation

About Talener

Talener is a healthcare technology organization focused on building and scaling a high-impact cloud platform to improve healthcare delivery. The role supports platform reliability, operational efficiency, observability, and automation across production environments.

Scraped 6/13/2026
