xelys jobs

MLOps Engineer (Relocation to Serbia)

Akvelon, Inc.

hybrid · mid · contract · backend · devops · Georgia · 2 days ago via LinkedIn

Tags

MLOps, Kubernetes, AKS, Terraform, Python, LLM Platforms, Kubeflow Pipelines, Argo Workflows, MLflow, Observability

About the role

Role Overview

You will build and operate an internal AI platform that helps developers ship AI-powered (LLM-based) services faster. The platform covers model connectivity, prompt testing and evaluation, monitoring and observability, and the underlying AI infrastructure layer, with the goal of improving developer experience (DevEx) and reducing time-to-market.

Responsibilities

  • Build and operate AI platform infrastructure for faster delivery of LLM-based services
  • Implement and maintain Kubernetes-based runtime environments for AI workloads (including AKS)
  • Manage infrastructure with Terraform (modules, multi-environment setups, and CI/CD automation)
  • Support LLM workflows, including RAG, agents, prompt experimentation, evaluations, and deployment patterns
  • Integrate and operate AI tooling such as Azure AI Foundry, LiteLLM, Langfuse, and MLflow
  • Orchestrate ML/LLM pipelines using Kubeflow Pipelines and/or Argo Workflows
  • Improve reliability and observability (monitoring, logging, tracing, cost/performance signals)
  • Partner with developers to improve DevEx (APIs, templates, documentation, automation, “golden paths”)

Requirements

  • Strong hands-on production experience with Kubernetes (preferably AKS)
  • Solid Terraform expertise (IaC best practices, multi-environment setups)
  • Experience supporting ML/LLM workloads in a platform or MLOps/DevOps context
  • Proficiency in Python for automation, scripting, and supporting APIs/evaluation tooling
  • Understanding of CI/CD, release processes, and production-grade operations
  • Ability to deliver pragmatically under tight timelines

Nice to Have

  • Experience building internal developer platforms (“paved roads”)
  • Familiarity with LLM evaluation frameworks, prompt testing workflows, and LLM observability
  • Exposure to RAG architectures, vector databases, and agentic patterns
  • Experience with Kubeflow, Argo, and ML lifecycle tooling

Contract / Team / Location

  • Long-term B2B contract
  • Team of 5, with 3 AI Platform Engineers being added
  • Remote work from Croatia, Poland, Portugal, or Serbia with European working hours (occasional meetings up to 10:00 AM PST for US overlap)

About Akvelon, Inc.

Akvelon, Inc. is a technology services company that delivers engineering and platform solutions for clients. The role described focuses on building and operating an internal AI platform, spanning MLOps/LLM infrastructure, orchestration, monitoring, and developer enablement.

Scraped 4/16/2026
