xelys jobs xelys jobs

Principal MLOps Engineer

Raft

leadpermanentdevopsbackend United States Today via LinkedIn

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

MLOpsMachine LearningKubernetesDockerCI/CDModel ServingAWSAzureObservabilitySecure Software Supply Chain

About the role

Role overview

Raft is hiring a Principal MLOps Engineer to design, deploy, and mature mission-critical AI and data platforms. The role focuses on building secure, scalable end-to-end ML platforms and MLOps infrastructure that support model development, evaluation, deployment, monitoring, and lifecycle management across cloud and constrained environments.

Responsibilities

  • Design, build, and maintain secure, scalable MLOps infrastructure and deployment pipelines for production ML systems
  • Mature internal ML platform capabilities: model packaging, model registry/catalog workflows, deployment, monitoring, and operational support
  • Deploy and manage ML workloads on Kubernetes, including GPU-enabled clusters
  • Support model serving/inference infrastructure for multiple ML use cases (traditional ML, computer vision, speech/audio, and LLM systems)
  • Build and maintain CI/CD workflows for ML services, model artifacts, and platform components
  • Partner with ML engineers, software engineers, and product teams to move models from experimentation to reliable production deployment
  • Improve observability, reliability, security, and maintainability across ML infrastructure and services
  • Evaluate and standardize runtime patterns, serving frameworks, and deployment architectures for production workloads
  • Contribute to infrastructure decisions across edge, on-prem, and cloud deployment environments
  • Support compliance-driven deployment and secure software supply chain practices for defense environments

Requirements

  • 7+ years hands-on experience in software engineering/platform engineering/DevOps/MLOps or related technical roles
  • 5+ years experience with Docker and Kubernetes in production
  • 5+ years experience supporting enterprise cloud infrastructure or applications in AWS, Azure, or similar

Nice-to-haves / implied by the posting

  • Experience provisioning and operating GPU-enabled infrastructure and production model serving systems
  • Familiarity with secure production operations, supply-chain security, and defense compliance practices

About Raft

Raft is a customer-obsessed defense tech company building AI/ML and data solutions for U.S. military and government agencies. It focuses on autonomous data fusion, agentic AI, distributed data systems, and cloud-native platforms that process large volumes of real-time sensor data into usable intelligence.

Scraped 5/14/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.