Infrastructure Engineer (Kubernetes/Golang)
Bayside Solutions
Hybrid, Senior, Contract, DevOps, Backend | Cupertino, CA | Yesterday via LinkedIn
114,400 - 135,200 USD per year
Tags
Kubernetes, Golang, Distributed Systems, On-Prem, Batch Processing, Prometheus, Grafana, CI/CD, Observability, GitOps
About the role
Role Overview
Infrastructure Engineer focused on Kubernetes (on-prem) and Golang for data infrastructure and distributed systems. You will own production operations for large-scale Kubernetes clusters and build tooling and platforms that support high-volume batch processing.
Responsibilities
- Provide daily operational support for on-prem Kubernetes clusters (reliability, availability, scalability, performance).
- Develop and maintain a batch orchestration platform running 100,000+ jobs/day.
- Build migration tooling to move job configurations and workloads to new data platforms.
- Design and implement distributed data systems for high availability, resilience, and performance.
- Write high-performance Go (including concurrency and distributed systems patterns) for automation and platform services.
- Create operational automation scripts and tooling using Bash and Python for observability and infrastructure workflows.
- Implement and improve CI/CD pipelines and DevOps standards across Kubernetes environments.
- Set up and optimize Prometheus/Grafana and monitoring/alerting for full-stack observability.
- Troubleshoot Kubernetes workloads and distributed systems in production.
- Participate in system design reviews and cross-team technical discussions.
- Collaborate with Platform, SRE, and Data Engineering teams to improve resilience and operational efficiency.
Requirements
- 8+ years of experience in Data Infrastructure (Kubernetes), Platform Engineering, DevOps, or SRE.
- Deep hands-on Kubernetes operational experience, especially on-prem.
- Strong Golang proficiency (core language, concurrency, distributed systems patterns).
- Experience designing/supporting distributed systems at scale.
- Strong scripting skills (Bash/Shell; Python is a plus).
- Experience with Prometheus, Grafana, and end-to-end observability.
- Solid understanding of Linux, networking, container runtimes, and CI/CD.
- Experience running production systems that handle tens of thousands of jobs.
- Strong debugging/analytical skills for distributed systems.
- Excellent communication and cross-team collaboration.
- High ownership with a focus on reliability and operational excellence.
Preferred Qualifications
- Experience with hybrid cloud + on-prem Kubernetes architectures.
- Familiarity with service mesh (Istio, Linkerd) and advanced Kubernetes networking.
- Exposure to data engineering workflows and/or batch processing frameworks.
- Experience with GitOps tooling (ArgoCD, Helm, Kustomize).
- Knowledge of infrastructure security: RBAC, certificates, cluster hardening.
- Experience supporting critical production systems at very large scale.
Location / Work Arrangement
- Hybrid: Cupertino, CA and remote.
About Bayside Solutions
Bayside Solutions is a technology services firm supporting engineering hiring for infrastructure and platform roles. The posting focuses on operating and evolving Kubernetes-based data and distributed systems for production workloads. The work spans on-prem Kubernetes, observability, automation, and CI/CD practices.
Scraped 6/19/2026