xelys jobs xelys jobs

Senior Site Reliability Engineer (Remote Build)

Remote

full-remoteseniorpermanentbackenddevops Full remote Today via WTTJ

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

Site Reliability EngineeringTerraformKubernetesAWSLinuxObservabilityDatadogPrometheusCI/CDMulti-tenant Platforms

About the role

Role Overview

Join Remote as a Senior Site Reliability Engineer (Remote Build). You will drive operational excellence and the infrastructure strategy for Remote Build’s platform, partnering with leadership, product managers, engineers, and customer success to ensure scalability and reliability from day one.

Key Responsibilities

  • Infrastructure as Code (IaC): Design, implement, and maintain IaC patterns using Terraform and Kubernetes to support standard connectors and custom builds.
  • Observability: Build and maintain monitoring, logging, and alerting systems.
  • Reliability Leadership: Lead incident response, run post-mortems, and drive continuous improvement in reliability.
  • Security & Compliance: Work with the Security team to embed security across the Build infrastructure and meet compliance requirements across 100+ jurisdictions.
  • Performance & Cost: Optimize system performance and costs.
  • Developer Experience: Improve developer experience through better tooling, processes, and documentation (e.g., runbooks).

Required Profile

  • Scripting & Systems Knowledge: Strong bash scripting; comfortable debugging system-level issues, reading logs, and understanding Linux kernel basics.
  • Communication: Able to explain complex infrastructure decisions clearly to engineers and non-technical stakeholders; write clear runbooks and documentation.
  • Kubernetes & AWS: Deep hands-on experience running Kubernetes in production plus solid AWS fundamentals (compute, networking, storage, managed services).
  • Infrastructure as Code: Proficiency with Terraform (or similar). Write IaC code rather than using console clicks.
  • Senior SRE/DevOps Experience: Demonstrated SRE/DevOps/SysOps experience and experience operating production systems at scale.
  • CI/CD & Deployment Automation: Real experience operating tools such as GitLab, GitHub Actions, Jenkins; understand deployment strategies, rollbacks, and safety mechanisms.
  • Multi-tenancy: Experience scaling multi-tenant platforms.
  • Observability Tooling: Depth with tools like Datadog, Prometheus, ELK, Grafana (or similar).
  • Artifact/Registry Management: Experience with ECR and container registries (e.g., Docker Hub).
  • Consultancy Experience: Experience in consultancy settings.
  • Backend Language: Experience with 1+ backend programming language (e.g., Elixir, Python, Go, Java, Node.js).

Nice-to-Haves

  • Not explicitly listed beyond the required profile (consultancy experience and backend language are included as requirements).

About Remote

Remote is a global HR and payroll platform provider. It supports distributed teams and enterprises with payroll operations and related tooling, and operates platform infrastructure used by customers worldwide.

Scraped 6/20/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.