xelys jobs

DevOps Engineer

Careflow

full-remote, senior, permanent, devops
New York, NY
Yesterday via LinkedIn

Tags

Google Cloud Platform (GCP), DevOps, Site Reliability Engineering (SRE), CI/CD, Monitoring, Logging, Observability, Cloud Security, Incident Response, Infrastructure Automation

About the role

Role Overview

Careflow is hiring an experienced DevOps Engineer to own and improve its cloud infrastructure, security, observability, and operational reliability. The role requires hands-on work across deployments, monitoring, and troubleshooting, plus application-level debugging when needed.

What You’ll Do

Cloud Infrastructure & Operations

  • Manage and maintain the Google Cloud Platform (GCP) environment
  • Design and improve infrastructure for scalability, reliability, and cost efficiency
  • Handle networking, compute, databases, storage, and related cloud services
  • Monitor system health and address performance bottlenecks proactively

Monitoring, Logging & Observability

  • Build and maintain centralized logging and monitoring
  • Create dashboards and alerts for system health, application performance, and critical workflows
  • Establish operational metrics and usage tracking
  • Lead incident response and perform root cause analysis

Security & Compliance

  • Implement security best practices across infrastructure and applications
  • Manage identity/access controls, secrets management, and environment security
  • Support security reviews and vulnerability remediation
  • Assist with compliance initiatives and audit readiness

CI/CD & Automation

  • Improve deployment pipelines and release processes
  • Automate infrastructure provisioning and operational workflows
  • Enhance development environments and deployment reliability
  • Reduce manual operational tasks via automation

Reliability Engineering

  • Improve uptime, resiliency, backups, and disaster recovery
  • Define service-level objectives and operational standards
  • Drive platform stability and performance improvements

Cross-Functional Support

  • Partner with engineering, product, and leadership teams
  • Provide technical guidance on infrastructure and operational considerations
  • Participate in an on-call/operational support rotation

Bonus Responsibilities

  • Troubleshoot and fix application-level issues when needed
  • Contribute code improvements and bug fixes
  • Assist with performance optimization and debugging

Required Qualifications

  • 5+ years of DevOps, Site Reliability Engineering, Cloud Engineering, or related experience
  • Strong hands-on experience with Google Cloud Platform (GCP)
  • Experience building and maintaining CI/CD pipelines
  • Strong understanding of monitoring, logging, and alerting systems
  • Experience with cloud security best practices (posting cuts off before full list)

Success Metrics (First 90 Days)

  • Take ownership of GCP infrastructure and environments
  • Establish visibility into performance, reliability, and usage metrics
  • Improve monitoring, alerting, and incident response processes
  • Identify and reduce security and operational risks
  • Reduce infrastructure-related issues and deployment friction
  • Become a trusted technical resource for reliability and operations

About Careflow

Careflow is a software company building and operating a growing cloud-based platform. The role focuses on ensuring the platform’s infrastructure reliability, security, observability, and operational excellence on Google Cloud Platform (GCP).

Scraped 6/20/2026
