xelys jobs xelys jobs

Site Reliability Engineer

Orange Logic

seniorpermanentdevopsbackend United States Yesterday via LinkedIn

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

Site Reliability EngineeringScriptingPythonTerraformKubernetesAWSAzureGCPObservabilityIncident Response

About the role

Role Overview

Site Reliability Engineer (SRE) responsible for the availability, reliability, and optimal performance of Orange Logic’s critical platform services and applications across a global, cloud-based infrastructure.

Responsibilities

  • Application & System Support
    • Monitor, administer, and troubleshoot application performance and infrastructure health using observability tools.
    • Respond to alerts/outages/system degradations; execute recovery procedures.
    • Perform root cause analysis and support post-incident reviews.
    • Provide front-end and back-end application support and consult stakeholders on performance improvements.
  • Infrastructure Management & Automation
    • Manage infrastructure using Infrastructure as Code (IaC) (e.g., Terraform, Ansible, Puppet).
    • Administer cloud-native services on AWS, Azure, or Google Cloud (e.g., EC2, S3, RDS, Kubernetes).
    • Develop automation scripts for deployments, configuration management, and repetitive tasks.
    • Ensure reliable code migration across environments.
  • Monitoring & Observability
    • Deploy and maintain monitoring/observability tools such as Prometheus, Grafana, and ELK.
    • Implement proactive alerting and improve system visibility.
  • Operations & Reliability
    • Plan and execute change procedures with minimal disruption.
    • Support scheduled maintenance (patching, updates, server health checks).
    • Participate in an on-call rotation (evenings/weekends).
  • Continuous Improvement
    • Collaborate with Development, Infrastructure, and Production Support to improve performance and scalability.
    • Identify and implement process improvements for reliability and deployment efficiency.

Requirements

  • Bachelor’s or Master’s in Computer Science, Engineering, or related field.
  • 8+ years experience in SRE, DevOps, or production engineering.
  • Strong knowledge of distributed systems, cloud platforms (AWS/Azure/GCP), and containerized environments (Docker/Kubernetes).
  • Proficient in SQL and scripting (Python, Bash, PowerShell).
  • Extensive experience with observability stacks and automated alerting.
  • Familiarity with web protocols, networking fundamentals, and API performance optimization.
  • Ability to lead cross-functional initiatives and influence without authority.
  • Excellent written/verbal communication.
  • Experience mentoring engineers and conducting architectural or reliability reviews.

Perks

  • Competitive salary; medical/dental/vision insurance
  • Life & disability insurance
  • 401(k) & Roth with 4% employer match (fully vested)
  • 20 days PTO; 8 weeks parental leave

About Orange Logic

Orange Logic is a software company focused on solving complex content challenges through an intelligent Digital Asset Management (DAM) platform called the Orange Logic Platform. The platform helps organizations manage, access, and leverage digital assets across industries. The company emphasizes innovation and impact through strong engineering and collaboration.

Scraped 4/15/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.