xelys jobs xelys jobs

Infrastructure Engineer (Kubernetes/Golang)

Bayside Solutions

hybridseniorcontractdevopsbackend Cupertino, CA Yesterday via LinkedIn
114,400 - 135,200 USD/annual

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

KubernetesGolangDistributed SystemsOn-PremBatch ProcessingPrometheusGrafanaCI/CDObservabilityGitOps

About the role

Role Overview

Infrastructure Engineer focused on Kubernetes (on-prem) and Golang for data infrastructure and distributed systems. You will own production operations for large-scale Kubernetes clusters and build tooling and platforms that support high-volume batch processing.

Responsibilities

  • Provide daily operational support for on-prem Kubernetes clusters (reliability, availability, scalability, performance).
  • Develop and maintain a batch orchestration platform running 100,000+ jobs/day.
  • Build migration tooling to move job configurations and workloads to new data platforms.
  • Design and implement distributed data systems for high availability, resilience, and performance.
  • Write high-performance Go (including concurrency and distributed systems patterns) for automation and platform services.
  • Create operational automation scripts and tooling using Bash and Python for observability and infrastructure workflows.
  • Implement and improve CI/CD pipelines and DevOps standards across Kubernetes environments.
  • Set up and optimize Prometheus/Grafana and monitoring/alerting for full-stack observability.
  • Troubleshoot Kubernetes workloads and distributed systems in production.
  • Participate in system design reviews and cross-team technical discussions.
  • Collaborate with Platform, SRE, and Data Engineering teams to improve resilience and operational efficiency.

Requirements

  • 8+ years of experience in Data Infrastructure (Kubernetes), Platform Engineering, DevOps, or SRE.
  • Deep hands-on Kubernetes operational experience, especially on-prem.
  • Strong Golang proficiency (core language, concurrency, distributed systems patterns).
  • Experience designing/supporting distributed systems at scale.
  • Strong scripting skills (Bash/Shell; Python is a plus).
  • Experience with Prometheus, Grafana, and end-to-end observability.
  • Solid understanding of Linux, networking, container runtimes, and CI/CD.
  • Experience operating systems handling tens of thousands of jobs.
  • Strong debugging/analytical skills for distributed systems.
  • Excellent communication and cross-team collaboration.
  • High ownership with a focus on reliability and operational excellence.

Preferred Qualifications

  • Experience with hybrid cloud + on-prem Kubernetes architectures.
  • Familiarity with service mesh (Istio, Linkerd) and advanced Kubernetes networking.
  • Exposure to data engineering workflows and/or batch processing frameworks.
  • Experience with GitOps tooling (ArgoCD, Helm, Kustomize).
  • Knowledge of infrastructure security: RBAC, certificates, cluster hardening.
  • Experience supporting critical production systems at very large scale.

Location / Work Arrangement

  • Cupertino, CA and Remote.

About Bayside Solutions

Bayside Solutions is a technology services firm supporting engineering hiring for infrastructure and platform roles. The posting focuses on operating and evolving Kubernetes-based data and distributed systems for production workloads. The work spans on-prem Kubernetes, observability, automation, and CI/CD practices.

Scraped 6/19/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.