xelys jobs xelys jobs

Site Reliability Engineer (SRE)

PeopleFinders

seniorpermanentdevopssecurity Sacramento, CA 45 days ago via LinkedIn
100,000 - 130,000 USD/annual

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

AWSKubernetesGitLab CI/CDCloudflarePrometheusGrafanaCloudWatchInfrastructure as CodeDockerSite Reliability Engineering

About the role

Role Overview

PeopleFinders is hiring a Site Reliability Engineer (SRE) to join a small, high-impact infrastructure team. You’ll own infrastructure projects end-to-end with minimal oversight, working across AWS, Kubernetes, CI/CD, networking, CDNs, security, and observability to keep the platform performant, secure, and highly available.

Key Responsibilities

  • AWS infrastructure: Architect, maintain, and troubleshoot VPCs, subnets, routing, security groups, and related cloud networking components.
  • Kubernetes reliability: Manage and optimize Kubernetes environments running Dockerized applications for reliability, performance, and scalability.
  • CI/CD: Build, maintain, and improve GitLab CI/CD pipelines for testing, deployments, and infrastructure workflows.
  • CDNs & anti-bot: Configure and manage CDNs (especially Cloudflare) and implement bot mitigation / anti-scraping controls.
  • Monitoring & observability: Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, and CloudWatch.
  • Production support: Collaborate with engineering teams to support deployments and diagnose issues.
  • Reliability improvements: Use automation, GitOps, and Infrastructure-as-Code (IaC) to improve operability.
  • Incident response: Investigate incidents urgently and deliver corrective actions under tight timelines.

Required Qualifications

  • 4+ years as an SRE, DevOps Engineer, or Systems Engineer.
  • Strong hands-on Linux experience (Ubuntu/CentOS) and Windows Server administration.
  • Proficiency with Docker and containerized architectures.
  • Deep experience with GitLab CI/CD pipelines.
  • Solid understanding of CDNs (e.g., Cloudflare/Fastly/Akamai) and techniques to mitigate scraping/bot traffic.
  • Strong networking fundamentals: DNS, TLS, routing, firewalls, load balancing.
  • Familiarity with infrastructure security for public-facing systems.
  • Experience with monitoring/logging tools (e.g., BetterStack, Datadog, Prometheus, Grafana, ELK).
  • Ability to thrive in a small team: adapt quickly and manage multiple priorities; strong ownership mindset.

Preferred Qualifications

  • Experience with Kubernetes or other orchestration platforms.
  • Knowledge of IaC tools such as Terraform, Ansible, or Pulumi.
  • Scripting/programming in Bash, Python, or Go.
  • Experience with WAFs, bot mitigation systems, and rate-limiting.
  • Familiarity with Zero Trust or modern access-management concepts.

Additional Desired Skills

  • Hands-on Cloudflare experience (firewall rules, bot management, workers, caching strategies).
  • Experience protecting high-value web apps against scraping/automated attacks.
  • Experience working in fully remote/distributed engineering teams.
  • Background supporting organizations with cloud-managed infrastructure.

Soft Skills

  • Excellent communication and cross-team collaboration.
  • Ability to work independently and make sound technical decisions.
  • Strong problem-solving skills and willingness to dive into complex issues.
  • Flexibility to pivot technologies or priorities as needs evolve.

About PeopleFinders

PeopleFinders.com is an online consumer service that helps people and businesses locate, contact, and verify information. The company has become a major owner and distributor of public records data through a large network of websites, operating as a data-driven web platform business.

Scraped 4/1/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.