Site Reliability Engineer 5 - Live SRE
Netflix
About the role
Role Overview
As a Site Reliability Engineer (Live SRE), you’ll support Netflix live streaming events by managing and validating cloud traffic flows, especially through API Gateway and inter-process communication (IPC) between microservices. You’ll run load and performance testing, build observability, and help improve scalability to handle sudden traffic spikes (notably the “thundering herd” problem).
Responsibilities
- Drive continual improvement in observability, monitoring, and scalability for live streaming traffic.
- Solve the thundering herd problem by improving behavior around cloud traffic (API Gateway and IPC between microservices).
- Implement, automate, execute, and analyze functional, performance, resilience, and fault injection testing for live streaming delivery.
- Write/review code, develop documentation, and debug complex cross-system issues.
- Coordinate with multiple stakeholders to ensure smooth live-streaming event execution.
- Participate in an on-call rotation and be available for flexible hours aligned to event schedules.
Requirements
- 5+ years of service reliability or operations experience running large-scale, high-performance internet services, with a strong focus on traffic at scale.
- Proven knowledge of L4 load balancers, HTTP caching, and reverse proxy technologies.
- Expert-level understanding of Unix/Linux and TCP/IP fundamentals.
- Strong networking/protocol knowledge, especially DNS, TLS, and HTTP/HTTPS.
- Proficiency in a programming language such as Go, Python, or Rust.
- Experience with real-time and big data analytics processing, e.g. Kafka, time series databases, and Presto/Trino or Spark SQL.
- Strong communication and collaboration skills in a partner-heavy environment.
Preferred
- B.S. in Computer Science, Electrical Engineering, or Computer Engineering (or equivalent experience).
About Netflix
Netflix is a global entertainment company focused on streaming and innovative storytelling at massive scale. Its engineering teams build and operate cloud services and live streaming infrastructure to keep availability high during high-traffic events.
Scraped 6/20/2026