Senior Site Reliability Engineer
UJET
senior, permanent, devops | Austin, TX, US | 3 days ago via RemoteOK
Tags
Site Reliability Engineering, SRE, SLIs/SLOs, Error Budgets, Observability, Incident Response, Postmortems, Automation, Cloud-native
About the role
Role Overview
UJET is hiring a Senior Site Reliability Engineer to build and scale a high-impact SRE function. You will act as a technical leader improving system reliability, reducing operational toil, and establishing engineering best practices across production services.
Responsibilities
- Lead efforts to improve system reliability, scalability, and performance across critical services
- Define and implement SLIs/SLOs and error budgets to guide engineering priorities
- Design and develop observability systems: metrics, logging, tracing, and alerting (with minimal alert noise)
- Lead incident response and serve as incident commander when needed
- Run postmortems focused on systemic causes; ensure corrective actions are completed
- Identify and eliminate operational toil via automation, tooling, and workflow improvements
- Partner with product and platform teams on architecture decisions and production readiness
Requirements
- Strong technical leadership skills with experience improving reliability in production environments
- Deep experience defining reliability targets (SLIs/SLOs, error budgets)
- Ability to build effective observability and alerting practices
- Proven experience leading incident response and conducting high-quality postmortems
Nice-to-haves
- Experience building SRE processes and tooling end-to-end
- Experience with reliability-oriented platform/production readiness practices
About UJET
UJET provides an AI-powered contact center platform built on a cloud-native architecture. It uses advanced AI and multimodality to automate customer interactions, improve operational efficiency, and deliver actionable insights with a CRM-first approach designed to avoid storing PII.
Scraped 4/19/2026