DevOps Engineer
Scale.jobs
midpermanentdevops New York, NY 4 days ago via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
DevOpsAWSTerraformCI/CDKubernetesPrometheusGrafanaELK StackDockerInfrastructure as Code
About the role
Role Overview
Design, build, and maintain core cloud infrastructure and deployment pipelines for global, highly available services. You will automate operational work, improve reliability, and deliver self-service platform capabilities to product engineering teams.
Responsibilities
- Provision, manage, and scale AWS infrastructure using Terraform or CloudFormation via Infrastructure-as-Code (IaC)
- Build and maintain CI/CD pipelines using GitHub Actions, GitLab CI, or Jenkins
- Orchestrate containerized workloads with Kubernetes (including Helm chart creation, ingress control, and cluster upgrades)
- Implement monitoring, logging, and alerting using Prometheus, Grafana, and the ELK stack
- Participate in an on-call rotation; lead root-cause analyses (RCAs) and drive post-mortem improvements
- Optimize cloud cost and utilization via automated scaling, instance right-sizing, and architectural reviews
- Partner with software engineers to optimize application performance and architect containerized deployments
Requirements
- 3–6 years of experience in DevOps, Site Reliability Engineering, or Systems Engineering managing production workloads
- Strong scripting/automation skills with Python, Go, or Bash
- Deep hands-on experience with AWS (VPC, IAM, EC2, RDS, EKS)
- Production experience managing containers with Docker and Kubernetes
- Solid networking fundamentals (TCP/IP, DNS, SSL/TLS, load balancing, CDN configurations)
Bonus
- Service mesh experience (Istio, Linkerd)
- Compliance frameworks (SOC2, HIPAA)
- Experience managing high-volume SQL/NoSQL databases
About Scale.jobs
Scale.jobs is hiring for a DevOps/SRE role focused on building and operating cloud infrastructure and deployment pipelines that support global, highly available services. The work centers on automation, reliability, and providing self-service platform tooling for product engineering teams.
Scraped 6/16/2026