Principal MLOps Engineer
Raft
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
About the role
Role overview
Raft is hiring a Principal MLOps Engineer to design, deploy, and mature mission-critical AI and data platforms. The role focuses on building secure, scalable end-to-end ML platforms and MLOps infrastructure that support model development, evaluation, deployment, monitoring, and lifecycle management across cloud and constrained environments.
Responsibilities
- Design, build, and maintain secure, scalable MLOps infrastructure and deployment pipelines for production ML systems
- Mature internal ML platform capabilities: model packaging, model registry/catalog workflows, deployment, monitoring, and operational support
- Deploy and manage ML workloads on Kubernetes, including GPU-enabled clusters
- Support model serving/inference infrastructure for multiple ML use cases (traditional ML, computer vision, speech/audio, and LLM systems)
- Build and maintain CI/CD workflows for ML services, model artifacts, and platform components
- Partner with ML engineers, software engineers, and product teams to move models from experimentation to reliable production deployment
- Improve observability, reliability, security, and maintainability across ML infrastructure and services
- Evaluate and standardize runtime patterns, serving frameworks, and deployment architectures for production workloads
- Contribute to infrastructure decisions across edge, on-prem, and cloud deployment environments
- Support compliance-driven deployment and secure software supply chain practices for defense environments
Requirements
- 7+ years hands-on experience in software engineering/platform engineering/DevOps/MLOps or related technical roles
- 5+ years experience with Docker and Kubernetes in production
- 5+ years experience supporting enterprise cloud infrastructure or applications in AWS, Azure, or similar
Nice-to-haves / implied by the posting
- Experience provisioning and operating GPU-enabled infrastructure and production model serving systems
- Familiarity with secure production operations, supply-chain security, and defense compliance practices
About Raft
Raft is a customer-obsessed defense tech company building AI/ML and data solutions for U.S. military and government agencies. It focuses on autonomous data fusion, agentic AI, distributed data systems, and cloud-native platforms that process large volumes of real-time sensor data into usable intelligence.
Scraped 5/14/2026