xelys jobs

Government Site Reliability Engineer

Veeam

full-remote, mid, permanent, devops, backend. Full remote. Today via WTTJ

Tags

Site Reliability Engineering (SRE), TypeScript, Terraform, Kubernetes, Azure, CI/CD, Observability, Prometheus, Grafana, OpenTelemetry, Regulated Environments

About the role

Role Overview

Join Veeam as a Government Site Reliability Engineer to support Veeam Data Cloud (SaaS) in Government and Sovereign Cloud environments. You will work with senior engineers to execute reliability work, respond to incidents, and maintain the operational foundation of the team.

Key Responsibilities

  • Incident response: triage, investigation, mitigation, and postmortems.
  • Reliability management: implement and maintain SLIs, SLOs, and error budgets.
  • Observability: drive monitoring and system visibility to improve reliability.
  • Infrastructure & delivery:
    • Use Infrastructure as Code (IaC) and CI/CD pipelines.
    • Support deployment tooling in compliance-restricted environments.
    • Contribute to testing, canary deployments, and release validation workflows.
  • Collaboration: partner with engineering, security, compliance, and operations teams on reliability improvements.
  • Communication: clearly communicate system behavior, risk, and status.

Required Qualifications

  • Strong programming skills in TypeScript/JavaScript, Go, Java, C#, or similar.
  • Experience with IaC tools: Terraform, Terragrunt, or Pulumi.
  • Experience with Kubernetes (container orchestration).
  • Cloud infrastructure experience on Azure or a comparable provider.
  • 3+ years in software engineering, including at least 1 year in SRE/Platform Engineering/DevOps for cloud-hosted services.
  • Ability to read and understand code to investigate system behavior.
  • Strong written and verbal communication.
  • Solid understanding of distributed systems and networking fundamentals.
  • Monitoring/observability experience with tools such as Prometheus, Grafana, OpenTelemetry, or the ELK stack.
  • CI/CD tooling experience (e.g., GitHub Actions, Azure DevOps, GitLab CI, ArgoCD).
  • Familiarity with regulated/compliance-oriented environments (e.g., FedRAMP, CMMC, PCI-DSS, HIPAA) and how compliance constrains operations.
  • Experience in Government or Sovereign Cloud environments (e.g., Azure Government, AWS GovCloud).
  • Background with SaaS or multi-tenant systems.

Nice-to-Haves

  • Familiarity with chaos engineering, resilience testing, or load testing.
  • Experience building/improving reliability practices on a team.
  • Exposure to AI-first development workflows using LLM-powered tools for automation, code generation, or documentation.

About Veeam

Veeam is a data management solutions provider focused on protecting, managing, and delivering data across modern cloud and enterprise environments. The role is centered on Veeam Data Cloud, a SaaS platform, with a specific emphasis on Government and Sovereign Cloud operations.

Scraped 6/14/2026
