Senior Software Engineer (Kubernetes and Distributed Systems, Radian Arc (EMEA), Contractor or FTE)
Submer
full-remote, senior, permanent, backend, devops
Full remote, today via WTTJ
Tags
Go, Kubernetes, Kubernetes Operators, Distributed Systems, Linux, Prometheus, Grafana, High Availability, Fault Tolerance, Distributed AI Workloads
About the role
Role overview
Join Radian Arc as a Senior Software Engineer working on the platform control plane for GPU cloud infrastructure and distributed AI workloads. You will build Kubernetes-native components and ensure the platform’s reliability, operability, and seamless integration with underlying infrastructure.
Key missions
- Design and develop the platform control plane to manage GPU cloud infrastructure and distributed AI workloads.
- Build APIs, services, and Kubernetes-native operators to automate infrastructure lifecycle management.
- Provide primitives to run large-scale AI workloads across multiple regions.
- Engineer for high availability and fault tolerance.
- Implement observability, monitoring, and alerting for platform services.
Responsibilities (implied)
- Collaborate with multiple teams to integrate platform services with infrastructure.
- Participate in on-call rotations and incident response.
Requirements
- 5+ years building distributed systems or infrastructure platforms.
- Strong Linux systems knowledge.
- Strong programming experience in Go.
- Experience participating in on-call rotations and handling incidents.
- Experience building Kubernetes operators and controllers.
- Understanding of networking and storage infrastructure used by distributed systems.
- Strong analytical and problem-solving skills.
- Experience designing and operating distributed systems at scale.
- Experience building reliable, highly available infrastructure services.
- Understanding of distributed state management and service coordination.
- Experience operating high-availability production systems.
- Familiarity with multi-tenant Kubernetes environments.
- Observability tooling experience with Prometheus and Grafana.
- Ability to troubleshoot complex production systems.
- Strong understanding of Kubernetes internals and control plane architecture.
Nice to have
- Not explicitly stated beyond the above skill set.
Work arrangement / location
- Full remote, based in EMEA.
- Option for contractor or full-time employment.
About Submer
Submer is a fast-growing scale-up building technology platforms with a focus on impactful, real-world outcomes. In this role, you will join its Radian Arc initiative to design infrastructure software for managing GPU cloud environments and distributed AI workloads across regions.
Scraped 6/20/2026