DevOps Engineer
Careflow
full-remoteseniorpermanentdevops New York, NY Yesterday via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
Google Cloud Platform (GCP)DevOpsSite Reliability Engineering (SRE)CI/CDMonitoringLoggingObservabilityCloud SecurityIncident ResponseInfrastructure Automation
About the role
Role Overview
Careflow is hiring an experienced DevOps Engineer to own and improve cloud infrastructure, security, observability, and operational reliability. The role requires hands-on work across deployments, monitoring, troubleshooting, and (when needed) application-level debugging.
What You’ll Do
Cloud Infrastructure & Operations
- Manage and maintain the Google Cloud Platform (GCP) environment
- Design and improve infrastructure for scalability, reliability, and cost efficiency
- Handle networking, compute, databases, storage, and related cloud services
- Monitor system health and address performance bottlenecks proactively
Monitoring, Logging & Observability
- Build and maintain centralized logging and monitoring
- Create dashboards and alerts for system health, application performance, and critical workflows
- Establish operational metrics and usage tracking
- Lead incident response and perform root cause analysis
Security & Compliance
- Implement security best practices across infrastructure and applications
- Manage identity/access controls, secrets management, and environment security
- Support security reviews and vulnerability remediation
- Assist with compliance initiatives and audit readiness
CI/CD & Automation
- Improve deployment pipelines and release processes
- Automate infrastructure provisioning and operational workflows
- Enhance development environments and deployment reliability
- Reduce manual operational tasks via automation
Reliability Engineering
- Improve uptime, resiliency, backups, and disaster recovery
- Define service-level objectives and operational standards
- Drive platform stability and performance improvements
Cross-Functional Support
- Partner with engineering, product, and leadership teams
- Provide technical guidance on infrastructure and operational considerations
- Participate in an on-call/operational support rotation
Bonus Responsibilities
- Troubleshoot and fix application-level issues when needed
- Contribute code improvements and bug fixes
- Assist with performance optimization and debugging
Required Qualifications
- 5+ years of DevOps, Site Reliability Engineering, Cloud Engineering, or related experience
- Strong hands-on experience with Google Cloud Platform (GCP)
- Experience building and maintaining CI/CD pipelines
- Strong understanding of monitoring, logging, and alerting systems
- Experience with cloud security best practices (posting cuts off before full list)
Success Metrics (First 90 Days)
- Take ownership of GCP infrastructure and environments
- Establish visibility into performance, reliability, and usage metrics
- Improve monitoring, alerting, and incident response processes
- Identify and reduce security and operational risks
- Reduce infrastructure-related issues and deployment friction
- Become a trusted technical resource for reliability and operations
About Careflow
Careflow is a software company building and operating a growing cloud-based platform. The role focuses on ensuring the platform’s infrastructure reliability, security, observability, and operational excellence on Google Cloud Platform (GCP).
Scraped 6/20/2026