xelys jobs xelys jobs

Lead Site Reliability Engineer

Alteryx

full-remoteleadpermanentdevopssecurity Full remote - Irvine, US Today via WTTJ

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

Site Reliability Engineering (SRE)SLOsSLAsKubernetesGitOpsArgo CDMulti-region ArchitecturesObservabilityIncident ManagementChaos Engineering

About the role

Role Overview

Join Alteryx as a Lead Site Reliability Engineer (SRE). You’ll own reliability outcomes for a modern multi-region SaaS platform serving enterprise customers, defining reliability strategy and mentoring senior engineers.

Key Responsibilities

  • Define and drive reliability strategy for control/data systems, including multi-region resilience and split-plane architectures.
  • Establish and operationalize SLOs, SLAs, and error budgets, ensuring they inform engineering planning and decisions.
  • Lead measurable improvements in:
    • MTTR reduction
    • Incident prevention
    • Overall service health
  • Mentor and provide hands-on technical leadership, influencing cross-team decisions.

Requirements

  • Leadership experience mentoring senior engineers and influencing cross-team decisions.
  • Proven track record improving SLOs, MTTR, and system reliability at scale.
  • Strong background in multi-region and split-plane architectures (control-plane / data-plane).
  • Infrastructure as Code and cloud platforms.
  • Experience with:
    • Kubernetes (multi-cluster)
    • CI/CD
    • GitOps (e.g., Argo CD)
  • Proficiency in one or more languages such as Python, Java, C++, or JavaScript.
  • 6+ years leading delivery of complex distributed systems or SaaS platforms.
  • Expertise in disaster recovery, resilience, and security best practices.
  • Deep experience with SLO/SLA design, observability, and incident management.
  • Experience with chaos engineering and large-scale reliability automation.

Nice to Have

  • Background in enterprise SaaS platforms and split-plane architectures.
  • Experience leveraging modern observability platforms such as Datadog or Grafana.

About Alteryx

Alteryx is an analytics company focused on helping enterprises manage and analyze data through modern software platforms. The company builds and operates a multi-region SaaS offering that serves enterprise customers and prioritizes reliability and service health.

Scraped 5/12/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.