xelys jobs

Principal MLOps Engineer

Raft

lead, permanent, devops, backend | Colorado Springs, CO | Yesterday via LinkedIn

Tags

MLOps, Kubernetes, Docker, CI/CD, AWS, Azure, Model Serving, Model Lifecycle Management, Observability, Secure Software Supply Chain

About the role

Raft is building mission-critical AI and data platforms for the Department of Defense, including an end-to-end ML platform that supports model development, evaluation, deployment, monitoring, and lifecycle management across cloud and constrained environments.

Role overview

In this role, you will design, deploy, and mature Raft’s ML platform and MLOps infrastructure, working across Kubernetes-based deployments, GPU-enabled infrastructure, model serving systems, CI/CD pipelines, and secure production operations.

Responsibilities

  • Design, build, and maintain secure, scalable MLOps infrastructure and deployment pipelines for production ML
  • Mature internal ML platform capabilities, including:
    • model packaging
    • model registry/catalog workflows
    • deployment, monitoring, and operational support
  • Deploy and manage ML workloads on Kubernetes, including GPU-enabled clusters
  • Support model serving and inference infrastructure for multiple ML use cases (traditional ML, computer vision, speech/audio, and LLM-based systems)
  • Build and maintain CI/CD workflows for ML services, model artifacts, and platform components
  • Partner with ML engineers and product/software teams to move models from experimentation to reliable operations
  • Improve observability, reliability, security, and maintainability of ML infrastructure and services
  • Help evaluate and standardize runtime patterns, serving frameworks, and deployment architectures
  • Contribute to infrastructure decisions spanning edge, on-prem, and cloud deployment environments
  • Support compliance-driven deployment practices and secure software supply chain requirements in defense environments

Requirements

  • 7+ years of hands-on experience in software engineering, platform engineering, DevOps, MLOps, or related technical roles
  • 5+ years with Docker and Kubernetes in production
  • 5+ years supporting enterprise cloud infrastructure and applications in AWS, Azure, or a similar platform

Location / eligibility

  • U.S.-based role that requires U.S. citizenship; work must be performed within the continental U.S.

About Raft

Raft is a customer-obsessed, non-traditional defense technology company focused on delivering AI/ML and data solutions to U.S. military and government agencies. It specializes in autonomous data fusion and agentic AI, building cloud-native platforms and distributed data systems that support mission-critical operations.

Scraped 4/23/2026
