Generative AI with Kubernetes: Implementing secure and observable AI infrastructure to deliver reliable AI applications
Par :Formats :
- Compatible avec une lecture sur My Vivlio (smartphone, tablette, ordinateur)
- Compatible avec une lecture sur liseuses Vivlio
- Pour les liseuses autres que Vivlio, vous devez utiliser le logiciel Adobe Digital Edition. Non compatible avec la lecture sur les liseuses Kindle, Remarkable et Sony
- Non compatible avec un achat hors France métropolitaine

Notre partenaire de plateforme de lecture numérique où vous retrouverez l'ensemble de vos ebooks gratuitement
- FormatePub
- ISBN978-93-6589-282-6
- EAN9789365892826
- Date de parution28/02/2025
- Protection num.Adobe DRM
- Infos supplémentairesepub
- ÉditeurBPB Publications
Résumé
The book covers essential technical implementations from ML fundamentals through advanced deployment strategies, focusing on practical patterns. Core topics include Kubernetes-native GPU scheduling and resource management, MLOps pipeline architectures using Kubeflow/MLflow, and advanced model serving patterns. It details data management architectures, vector databases, and RAG systems, alongside monitoring solutions with Prometheus/Grafana.
Finally, we will look at some advanced concerns for production in the realm of security and data reliability. After reading this book, you will be equipped with a broad knowledge of the end-to-end generative AI pipeline and how Kubernetes can be leveraged to run your generative AI workloads at scale in the real-world. KEY FEATURES ? Learn how Kubernetes can help you run your generative AI workloads.? Using hands-on examples, you will work with real-world foundational models and a variety of tools and capabilities in the K8s ecosystem.? A broad survey of both generative AI and Kubernetes in one book.
WHAT YOU WILL LEARN? How to evaluate and compare models for new applications and use cases.? How Kubernetes can add reliability and scale to your AI applications.? What does an AI delivery pipeline contain and how to start one.? How AI models encode words and work with natural language.? How prompting and refinement techniques can improve results.? How to use your own data to augment AI responses.
WHO THIS BOOK IS FORThis book is for teams building new applications or new functionality with generative AI, but want to better understand the infrastructure needed to bring their AI applications to production. This book is also for shared services, infrastructure, or cybersecurity teams who provide platforms and infrastructure for application, or product development.