Site Reliability Engineer (Senior or Staff), Observability
MongoDB
hybridseniorpermanentdevopsbackend United States 45 days ago via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
ObservabilitySite Reliability Engineering (SRE)MetricsLoggingDistributed SystemsMonitoring and AlertingOn-callPost-mortemsHTTPTLS
About the role
Role Overview
Site Reliability Engineering (SRE) Observability team within MongoDB’s Platform Engineering. You will build and operate the observability stack (metrics, logging, tracing) used across engineering teams, including the telemetry pipeline and monitoring/alerting infrastructure.
Responsibilities
- Set standards and a vision for MongoDB’s mission-critical observability platform.
- Design, architect, build, and deliver core observability services in collaboration with SWE and SRE partners.
- Monitor and troubleshoot services spanning multiple cloud providers and global infrastructure.
- Improve reliability by making services resilient, fault-tolerant, and self-healing.
- Define and configure key metrics to detect incidents and measure health, availability, and performance.
- Participate in a week-long on-call rotation and run blameless post-mortems.
- Optimize observability capabilities for cost, ease of use, and maintainability.
Requirements
- Experience running mission-critical services at scale.
- Strong experience with observability for large-scale distributed systems.
- Understanding of information security issues.
- Proficiency in at least one modern programming language beyond basic scripting.
- Solid grasp of web and network protocols/standards (e.g., HTTP, TLS, DNS).
- Bachelor’s degree in Computer Science or equivalent experience.
Nice to Haves
- Experience with at least one major cloud provider (AWS, Google Compute, or Microsoft Azure).
- Experience in Kubernetes-based environments (Kubernetes clusters).
Location / Work Model
- Hybrid in NYC HQ, or fully remote from locations in Eastern or Central time zones (US).
About MongoDB
MongoDB is a software company providing a unified, cloud-native database platform designed for globally distributed workloads. Its MongoDB Atlas service is multi-cloud and supports organizations modernizing legacy systems and building AI-ready applications.
Scraped 4/1/2026