xelys jobs xelys jobs

Staff Software Engineer (Platform, SysEng)

Grafana Labs

full-remoteleadpermanentbackenddevops Full remote Today via WTTJ

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

GoPythonKubernetesTerraformDistributed SystemsSLOs/SLIsSystem DesignReliability EngineeringCloud-NativeAI-Assisted Development

About the role

Role overview

Join Grafana Labs as a Staff Software Engineer (Platform, SysEng) within the Platform SysEng squad. You will help improve performance, reliability, and efficiency as the company scales its cloud infrastructure in a remote-first environment.

Key missions & responsibilities

  • Conceive, design, and deliver platform projects that improve performance, reliability, and efficiency.
  • Collaborate cross-functionally to reduce new region build timelines and meet customer demands.
  • Own reliability and performance end-to-end, including:
    • Defining SLOs/SLIs
    • Capacity planning
    • Performance tuning
    • Driving reliability work from design through execution
  • Work across the full software lifecycle:
    • Write design docs
    • Incorporate developer feedback
    • Ensure integration testing

Requirements

  • Strong interest in distributed systems and operating code in production.
  • Holistic development mindset: see the big picture while focusing on details.
  • Experience delivering and operating large distributed systems across multiple teams, with demonstrated technical leadership.
  • Comfort with remote-first collaboration and clear written/verbal communication.
  • Demonstrable system design skills and understanding of tradeoffs involving:
    • Latency, consistency, availability, scaling, and cost
  • Strong engineering fundamentals:
    • Write clear, maintainable, well-tested code and lead technical designs
    • Primary experience expected with Go; other languages like Python/C/C++/Rust are transferable
  • Hands-on experience with cloud-native architectures, including:
    • Microservices
    • Containers / Kubernetes
    • IaC (Infrastructure as Code)
    • Operational practices for keeping systems healthy

Nice-to-haves

  • Comfort with AI-assisted/agentic development and integrating AI developer tools into workflows.
  • Experience with Tanka and/or Jsonnet.
  • Experience with Terraform and/or Crossplane.
  • Familiarity with Kubernetes scheduling and tools like Karpenter.
  • Prior work in open source or community-based projects.

Team culture

  • Consensus-driven decision-making with collaborative, kind, and respectful communication.
  • Room to grow with strong team knowledge sharing.

About Grafana Labs

Grafana Labs builds open source observability software, including tools for monitoring, metrics, logs, and dashboards. The company operates in the cloud infrastructure and platform ecosystem and develops solutions used by engineering teams worldwide. Their culture emphasizes open-source contribution and reliability at scale.

Scraped 6/14/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.