Lead Site Reliability Engineer
Alteryx
full-remoteleadpermanentdevopssecurity Full remote - Irvine, US Today via WTTJ
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
Site Reliability Engineering (SRE)SLOsSLAsKubernetesGitOpsArgo CDMulti-region ArchitecturesObservabilityIncident ManagementChaos Engineering
About the role
Role Overview
Join Alteryx as a Lead Site Reliability Engineer (SRE). You’ll own reliability outcomes for a modern multi-region SaaS platform serving enterprise customers, defining reliability strategy and mentoring senior engineers.
Key Responsibilities
- Define and drive reliability strategy for control/data systems, including multi-region resilience and split-plane architectures.
- Establish and operationalize SLOs, SLAs, and error budgets, ensuring they inform engineering planning and decisions.
- Lead measurable improvements in:
- MTTR reduction
- Incident prevention
- Overall service health
- Mentor and provide hands-on technical leadership, influencing cross-team decisions.
Requirements
- Leadership experience mentoring senior engineers and influencing cross-team decisions.
- Proven track record improving SLOs, MTTR, and system reliability at scale.
- Strong background in multi-region and split-plane architectures (control-plane / data-plane).
- Infrastructure as Code and cloud platforms.
- Experience with:
- Kubernetes (multi-cluster)
- CI/CD
- GitOps (e.g., Argo CD)
- Proficiency in one or more languages such as Python, Java, C++, or JavaScript.
- 6+ years leading delivery of complex distributed systems or SaaS platforms.
- Expertise in disaster recovery, resilience, and security best practices.
- Deep experience with SLO/SLA design, observability, and incident management.
- Experience with chaos engineering and large-scale reliability automation.
Nice to Have
- Background in enterprise SaaS platforms and split-plane architectures.
- Experience leveraging modern observability platforms such as Datadog or Grafana.
About Alteryx
Alteryx is an analytics company focused on helping enterprises manage and analyze data through modern software platforms. The company builds and operates a multi-region SaaS offering that serves enterprise customers and prioritizes reliability and service health.
Scraped 5/12/2026