Site Reliability Engineer
Empower
middevops United States Today via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
Site Reliability EngineeringAWSKubernetesEKSTerraformInfrastructure as CodeCI/CDGitOpsObservabilityIncident Management
About the role
Role Overview
Empower is hiring a Site Reliability Engineer (SRE) to help ensure the reliability, scalability, and performance of its financial services platform. You’ll support production systems at massive scale, improve operational excellence, and partner with engineering teams to maintain high availability and strong observability in a regulated environment.
What You Will Do
- Own operational excellence for assigned systems and services; support projects across teams
- Participate in on-call rotations, respond to incidents, troubleshoot complex system/deployment issues, and drive resolution
- Run postmortems, perform root cause analysis, and implement preventive measures
- Define service level indicators (SLIs); build proactive monitoring and alerting
- Manage observability for Kubernetes environments, including AWS EKS
- Build, maintain, and optimize Infrastructure as Code across multiple AWS environments
- Manage and optimize EKS clusters for availability, resilience, and scalability of containerized apps
- Collaborate with development teams on releases and implement scalable, resilient services using GitOps and progressive delivery
- Maintain and improve CI/CD pipelines and automation to reduce toil and improve operational efficiency
- Lead capacity planning and right-sizing for performance and reliability
- Document critical systems, runbooks, and architecture decisions; mentor entry-level SREs
What You Will Bring
- Bachelor’s degree (CS/IT or related) or equivalent practical experience
- 2–4 years in SRE/DevOps/Systems Engineering
- Experience with AWS high availability/resiliency, including EKS, EC2, RDS, S3, and VPC
- Production experience with Kubernetes and containerization (e.g., Docker)
- Proficiency with Infrastructure as Code frameworks such as Terraform and CloudFormation
- Observability experience (incident detection/response implications)
- CI/CD familiarity; experience with GitLab CI or Jenkins or equivalent
- Networking fundamentals and troubleshooting
- Familiarity with GitOps, incident management, and on-call practices
- Strong problem-solving and judgment
What Will Set You Apart
- Experience in financial services or other regulated industries
- Knowledge of compliance frameworks like SOC 2 or PCI DSS
- Experience with observability/APM tools such as Datadog, AppDynamics, or New Relic
- Strong programming skills in shell, Go, Python, or similar
- Experience supporting Java Spring Boot applications
About Empower
Empower is a financial services company focused on helping customers achieve financial freedom. It operates in a regulated environment and supports large-scale production systems serving millions of customers.
Scraped 4/7/2026