New Release • Packt Publishing • May 2026

Operational AI with Docker

By Ajeet Singh Raina and Harsh Manvar • Packt Publishing • 1st Edition

Operational AI with Docker — book cover by Ajeet Singh Raina and Harsh Manvar, published by PacktJust Released

Why this book

The missing manual for shipping AI in production.

Production-First

Stop building prototypes. Learn the operational patterns that take AI from notebook to production.

Docker-Native

Master Docker Model Runner, MCP Gateway, Docker Agent, and Sandboxes — the new AI stack from Docker.

Battle-Tested

Every chapter includes working code, real architectures, and lessons from production deployments.

About the book

Run production-grade GenAI workloads by containerizing, serving, and scaling LLMs, agents, and multi-model pipelines with Docker, MCP, and Kubernetes for cloud platforms.

Modern AI systems don’t fail at modeling; they fail in production. Moving from experiments to reliable, scalable systems requires more than notebooks and scripts. It requires infrastructure.

Operational AI with Docker shows you how to build, deploy, and operate AI systems that work beyond a single machine. You’ll learn how to use Docker as a consistent runtime for machine learning workflows, package models as reproducible artifacts, and run them reliably across environments.

Starting with containerized machine learning, you’ll progress to model serving, AI deployment, and scalable infrastructure using Kubernetes. You’ll implement production-ready patterns for resource management, autoscaling, observability, and performance tuning, ensuring your AI workloads remain stable under real-world conditions.

The book goes beyond traditional MLOps by introducing agentic AI systems, including autonomous agents, multi-agent architectures, and secure execution environments. You’ll also explore modern integration patterns using the Model Context Protocol (MCP), enabling AI systems to interact safely with tools, APIs, and data sources.

By the end of this book, you’ll be able to design and operate production AI systems that are reproducible, scalable, and ready for real-world deployment using Docker and Kubernetes.

What you'll learn

From local inference to production deployment.

  • Containerize GenAI services using Docker images, registries, and Compose-based deployment stacks
  • Package and distribute models as OCI artifacts for repeatable builds and controlled promotions across environments
  • Choose GGUF quantization levels to balance cost, latency, and accuracy for cloud and hybrid runtimes
  • Serve LLMs via Docker Model Runner with an OpenAI-compatible API suitable for internal platforms
  • Integrate tools and data securely using MCP and Docker MCP Gateway with least-privilege access patterns
  • Run ML container models on Kubernetes with autoscaling, observability, and performance tuning
  • Build autonomous AI agents and multi-agent architectures for production workloads
  • Operate agentic AI services across cloud, hybrid, and edge environments

Who this book is for

Built for the engineers shipping AI to production.

Cloud EngineersDevOps EngineersSREsPlatform Engineers

Cloud engineers, DevOps engineers, SREs, and platform engineers who need to deploy, operate, and scale GenAI workloads using Docker and Kubernetes on cloud, hybrid, or edge environments. You should be comfortable with the command line and basic service operations; prior Docker or Kubernetes exposure is helpful but not required.

Table of contents

9 chapters. Production focus throughout.

  1. CH 01Docker Desktop — The Runtime Foundation for AI/ML Workflows
  2. CH 02Understanding AI Models in Docker
  3. CH 03Model Service with Docker Model Runner
  4. CH 04Docker Offload for AI and ML Workflows
  5. CH 05Running ML Container Models on Kubernetes
  6. CH 06Protocol-Based AI Integration with MCP
  7. CH 07Building Autonomous AI Agents
  8. CH 08Multi-Model and Multi-Agent Architectures
  9. CH 09Advanced Agent Orchestration

Meet the authors

Practitioners, not pundits.

Ajeet Singh Raina

Developer Advocate at Docker

Developer Advocate at Docker and an early Docker adopter who has authored 600+ blogs on containerization, cloud-native technologies, and DevOps. He leads a large Docker community ecosystem and organizes initiatives such as Kubetools, sharing practical guidance across Docker, Kubernetes, IoT, and AI/ML operations.

Harsh Manvar

Senior Software Engineer • Docker Captain

Senior Software Engineer with over a decade of experience in software engineering and DevOps. A Docker Captain, Google Developer Expert, CNCF Ambassador, and Google Champion Innovator, he focuses on building scalable, reliable cloud-native systems and is a top contributor in the Kubernetes space on Stack Overflow.

Reader reviews

What practitioners are saying.

5.0 out of 5 on Amazon · Read all reviews

"Highly recommended if you're tired of tutorials that stop at "it ran on my machine.""

I've been running local LLMs and experimenting with agents for a while, but getting them to run reliably at scale has always been a pain. This book finally bridges that gap. The authors blend Docker fundamentals with real AI workflows — not just "here's a Dockerfile," but practical patterns for containerizing LLMs, quantizing models, and serving them efficiently. One of the most practical "LLMOps in the real world" resources I've found.
VladimirVerified Purchase • Paperback • May 3, 2026

"Essential AI Guide for 2026"

Operational AI with Docker is not only a hands-on guide, but it's also a journey. You're in for a ride to learn different aspects of agentic AI development. I read the book cover to cover, and it's very rewarding. Ajeet and Harsh need no introduction — they did an awesome job teaching cutting-edge technology and tools that have not yet been discovered by many.
Mohammad Ali ARABIVerified Purchase • Paperback • May 11, 2026

"Strong Practical Guide for AI + Docker Workflows"

This book does a great job of connecting modern AI workflows with real-world Docker and Kubernetes practices. Instead of staying theoretical, it focuses on practical topics like containerizing AI workloads, running models locally, orchestration, observability, and multi-agent systems. The writing is clear and hands-on, and the examples feel relevant to how teams are actually deploying AI services today.
Christopher WestVerified Purchase • Paperback • May 8, 2026

"Great AI deployment with Docker"

A comprehensive, hands-on guide for engineers and developers looking to deploy, scale, and manage AI/ML services using Docker and Kubernetes. Filled with real-world examples, step-by-step tutorials, and configuration samples. An essential resource for anyone responsible for moving AI models from notebooks to scalable, secure, and maintainable production systems.
Guy GormanVerified Purchase • Kindle • April 30, 2026

Free preview

Read a free sample.

Preview Operational AI with Docker instantly on Amazon — or drop your email and we'll send you a sample.

Prefer it in your inbox?

No spam. Unsubscribe anytime.

Where to buy

Available everywhere books are sold.

FAQ

Questions, answered.

Ready to ship AI that actually runs in production?

Join thousands of developers building the next generation of AI applications with Docker.