xelys jobs

Site Reliability Engineer (Senior or Staff), Observability

MongoDB

Hybrid, senior, permanent, devops, backend. United States. 45 days ago via LinkedIn

Tags

Observability, Site Reliability Engineering (SRE), Metrics, Logging, Distributed Systems, Monitoring and Alerting, On-call, Post-mortems, HTTP, TLS

About the role

Role Overview

This role sits on the Site Reliability Engineering (SRE) Observability team within MongoDB’s Platform Engineering. You will build and operate the observability stack (metrics, logging, tracing) used across engineering teams, including the telemetry pipeline and the monitoring and alerting infrastructure.

Responsibilities

  • Set standards and a vision for MongoDB’s mission-critical observability platform.
  • Design, architect, build, and deliver core observability services in collaboration with SWE and SRE partners.
  • Monitor and troubleshoot services spanning multiple cloud providers and global infrastructure.
  • Improve reliability by making services resilient, fault-tolerant, and self-healing.
  • Define and configure key metrics to detect incidents and measure health, availability, and performance.
  • Participate in a week-long on-call rotation and run blameless post-mortems.
  • Optimize observability capabilities for cost, ease of use, and maintainability.

Requirements

  • Experience running mission-critical services at scale.
  • Strong experience with observability for large-scale distributed systems.
  • Understanding of information security issues.
  • Proficiency in at least one modern programming language beyond basic scripting.
  • Solid grasp of web and network protocols/standards (e.g., HTTP, TLS, DNS).
  • Bachelor’s degree in Computer Science or equivalent experience.

Nice to Haves

  • Experience with at least one major cloud provider (AWS, Google Cloud, or Microsoft Azure).
  • Experience in Kubernetes-based environments.

Location / Work Model

  • Hybrid at the NYC HQ, or fully remote from US locations in the Eastern or Central time zones.

About MongoDB

MongoDB is a software company providing a unified, cloud-native database platform designed for globally distributed workloads. Its MongoDB Atlas service is multi-cloud and supports organizations modernizing legacy systems and building AI-ready applications.

Scraped 4/1/2026
