xelys jobs xelys jobs

DevOps Engineer

Careflow

full-remoteseniorpermanentdevopsbackend Houston, TX 3 days ago via LinkedIn

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

Google Cloud Platform (GCP)DevOpsSite Reliability Engineering (SRE)CI/CDMonitoringLoggingObservabilityIncident ResponseSecurity & ComplianceDisaster Recovery

About the role

Role overview

Careflow is hiring an experienced DevOps Engineer to own and improve cloud infrastructure, security, observability, and operational reliability for a growing platform. The role combines infrastructure ownership with hands-on troubleshooting across the stack, including the option to debug application-level issues when needed.

What you’ll do

Cloud Infrastructure & Operations

  • Manage and maintain the Google Cloud Platform (GCP) environment
  • Design and improve infrastructure for scalability, reliability, and cost efficiency
  • Oversee networking, compute, databases, storage, and other cloud services
  • Monitor system health and proactively address performance bottlenecks

Monitoring, Logging & Observability

  • Build and maintain centralized logging and monitoring solutions
  • Create dashboards and alerts for system health, application performance, and business-critical workflows
  • Establish operational metrics and usage tracking across the platform
  • Lead incident response and root cause analysis

Security & Compliance

  • Implement and maintain security best practices across infrastructure and applications
  • Manage identity and access controls, secrets management, and environment security
  • Conduct security reviews and help remediate vulnerabilities
  • Support compliance initiatives and audit readiness

CI/CD & Automation

  • Improve deployment pipelines and release processes
  • Automate infrastructure provisioning and operational workflows
  • Improve development environments and deployment reliability
  • Reduce manual operational tasks via automation

Reliability Engineering

  • Improve uptime, resiliency, backup strategies, and disaster recovery
  • Define service-level objectives (SLOs) and operational standards
  • Drive improvements in platform stability and performance

Cross-functional support

  • Partner with engineering, product, and leadership teams
  • Provide technical guidance on infrastructure and operational considerations
  • Participate in an on-call and operational support rotation

Bonus responsibilities

  • Troubleshoot and fix application-level issues when needed
  • Contribute code improvements and bug fixes
  • Assist with performance optimization and debugging

Requirements

  • 5+ years in DevOps, Site Reliability Engineering, Cloud Engineering, or related experience
  • Strong hands-on experience with Google Cloud Platform (GCP)
  • Experience building and maintaining CI/CD pipelines
  • Strong understanding of monitoring, logging, and alerting systems

Success in your first 90 days

  • Gain ownership of GCP infrastructure and environments
  • Establish visibility into performance, reliability, and usage metrics
  • Improve monitoring, alerting, and incident response processes
  • Identify and address security and operational risks
  • Reduce infrastructure-related issues and deployment friction
  • Become a trusted resource for reliability and operational excellence

Role details

  • Title: DevOps Engineer
  • Employment type: Full-time
  • Location: Fully remote
  • Schedule: Flexible; availability for Saturday coverage with a weekday day off in exchange
  • Reports to: Lead Architect

About Careflow

Careflow is a software company focused on building and operating a growing platform that requires high reliability, security, and scalability. The DevOps team supports cloud infrastructure, observability, and operational excellence as the company expands its systems and deployment workflows.

Scraped 6/19/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.