Government Site Reliability Engineer
Veeam
full-remotemidpermanentdevopsbackend Full remote Today via WTTJ
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
Site Reliability Engineering (SRE)TypeScriptTerraformKubernetesAzureCI/CDObservabilityPrometheusGrafanaOpenTelemetryRegulated Environments
About the role
Role Overview
Join Veeam as a Government Site Reliability Engineer to support Veeam Data Cloud (SaaS) in Government and Sovereign Cloud environments. You will work with senior engineers to execute reliability work, respond to incidents, and maintain the operational foundation of the team.
Key Responsibilities
- Incident response: triage, investigation, mitigation, and postmortems.
- Reliability management: implement and maintain SLIs, SLOs, and error budgets.
- Observability: drive monitoring and system visibility to improve reliability.
- Infrastructure & delivery:
- Use Infrastructure as Code (IaC) and CI/CD pipelines.
- Support deployment tooling in compliance-restricted environments.
- Contribute to testing, canary deployments, and release validation workflows.
- Collaboration: partner with engineering, security, compliance, and operations teams on reliability improvements.
- Communication: clearly communicate system behavior, risk, and status.
Required Qualifications
- Strong programming skills in TypeScript/JavaScript, Go, Java, C#, or similar.
- Experience with IaC tools: Terraform, Terragrunt, or Pulumi.
- Experience with Kubernetes (container orchestration).
- Cloud infrastructure experience on Azure or a comparable provider.
- 3+ years in software engineering, including at least 1 year in SRE/Platform Engineering/DevOps for cloud-hosted services.
- Ability to read and understand code to investigate system behavior.
- Strong written and verbal communication.
- Solid understanding of distributed systems and networking fundamentals.
- Monitoring/observability experience with tools such as Prometheus, Grafana, OpenTelemetry, and ELK stack.
- CI/CD tooling experience (e.g., GitHub Actions, Azure DevOps, GitLab CI, ArgoCD).
- Familiarity with regulated/compliance-oriented environments (e.g., FedRAMP, CMMC, PCI-DSS, HIPAA) and how compliance constrains operations.
- Experience in Government or Sovereign Cloud environments (e.g., Azure Government, AWS GovCloud).
- Background with SaaS or multi-tenant systems.
Nice-to-Haves
- Familiarity with chaos engineering, resilience testing, or load testing.
- Experience building/improving reliability practices on a team.
- Exposure to AI-first development workflows using LLM-powered tools for automation, code generation, or documentation.
About Veeam
Veeam is a data management solutions provider focused on protecting, managing, and delivering data across modern cloud and enterprise environments. The role is centered on Veeam Data Cloud, a SaaS platform, with a specific emphasis on Government and Sovereign Cloud operations.
Scraped 6/14/2026