Key Features
- Build and deploy your first Generative AI workload on Kubernetes with confidence
- Learn to optimize costly resources such as GPUs using fractional allocation, Spot Instances, and automation
- Gain hands-on insights into observability, infrastructure automation, and scaling Generative AI workloads
- Purchase of the print or Kindle book includes a free PDF eBook
Who this book is for
This book is for solutions architects, product managers, engineering leads, DevOps teams, GenAI developers, and AI engineers. It's also suitable for students and academics learning about GenAI, Kubernetes, and cloud-native technologies. A basic understanding of cloud computing and AI concepts is needed, but no prior knowledge of Kubernetes is required.