xelys jobs xelys jobs

Principal MLOps Engineer

Raft

leadpermanentdevopsbackend United States 4 days ago via LinkedIn

See how well this job matches your profile

Sign up to get an AI match score and generate a tailored application in seconds.

Get your match score

Tags

MLOpsKubernetesDockerCI/CDAWSAzureModel ServingLLMsObservabilitySecure Supply Chain

About the role

Role Overview

Raft is hiring a Principal MLOps Engineer for a U.S.-based position supporting mission-critical AI and data platforms for the Department of Defense. The role focuses on designing, deploying, and maturing Raft’s end-to-end ML platform and MLOps infrastructure across cloud and constrained environments.

Responsibilities

  • Design, build, and maintain secure, scalable MLOps infrastructure and deployment pipelines for production ML systems
  • Mature internal ML platform capabilities across the model lifecycle, including:
    • model packaging
    • registry/catalog workflows
    • deployment, monitoring, and operational support
  • Deploy and manage ML workloads on Kubernetes, including GPU-enabled clusters
  • Support model serving and inference infrastructure for multiple ML domains, including:
    • traditional ML
    • computer vision
    • speech/audio
    • LLM-based systems
  • Build and maintain CI/CD workflows for ML services, model artifacts, and platform components
  • Partner with ML engineers, software engineers, and product teams to move models from experimentation to reliable production
  • Improve observability, reliability, security, and maintainability across ML infrastructure and services
  • Help evaluate and standardize runtime patterns, serving frameworks, and deployment architectures
  • Contribute to infrastructure decisions across edge, on-prem, and cloud environments
  • Support compliance-driven and secure software supply chain practices in defense settings
  • Provide hands-on customer support in advanced DoD environments

Requirements

  • 7+ years hands-on experience in software engineering, platform engineering, DevOps, MLOps, or related technical roles
  • 5+ years experience with Docker and Kubernetes in production
  • 5+ years experience supporting enterprise cloud infrastructure or applications in AWS, Azure, or similar environments

Nice-to-haves

  • Strong fit for candidates who understand both:
    • the infrastructure needed to run ML systems in production
    • the practical needs of ML engineers building and deploying models

Location / Eligibility

  • Position requires U.S. citizenship
  • Work must be conducted within the continental U.S.

About Raft

Raft is a customer-obsessed non-traditional defense tech company providing AI/ML and data solutions to U.S. military and government agencies. It focuses on autonomous data fusion and agentic AI, building mission-critical distributed data systems and cloud-native platforms that operate at scale for time-sensitive decision-making.

Scraped 4/19/2026

xelys jobs xelys jobs

Built for remote job seekers. Powered by AI.