OFFRE LISEUSES

Une liseuse achetée = une housse offerte* jusqu'au 21 juin

Small Language Models for Edge AI: Deploying Efficient LLMs on IoT and Embedded Devices

Par : Rylan Vesper
Offrir maintenant
Ou planifier dans votre panier
Disponible dans votre compte client Decitre ou Furet du Nord dès validation de votre commande. Le format ePub est :
  • Compatible avec une lecture sur My Vivlio (smartphone, tablette, ordinateur)
  • Compatible avec une lecture sur liseuses Vivlio
  • Pour les liseuses autres que Vivlio, vous devez utiliser le logiciel Adobe Digital Edition. Non compatible avec la lecture sur les liseuses Kindle, Remarkable et Sony
Logo Vivlio, qui est-ce ?

Notre partenaire de plateforme de lecture numérique où vous retrouverez l'ensemble de vos ebooks gratuitement

Pour en savoir plus sur nos ebooks, consultez notre aide en ligne ici
C'est si simple ! Lisez votre ebook avec l'app Vivlio sur votre tablette, mobile ou ordinateur :
Google PlayApp Store
  • FormatePub
  • ISBN8233969775
  • EAN9798233969775
  • Date de parution23/03/2026
  • Protection num.pas de protection
  • Infos supplémentairesepub
  • ÉditeurLinda Balsamo

Résumé

Big language models are impressive. They're also power-hungry, cloud-dependent, and about as edge-friendly as a data center strapped to a toaster. This book is for everyone who's asked the obvious question: What if we made language models small enough to actually live on devices? Small Language Models for Edge AI is a practical, motivating, and occasionally sarcastic guide to running efficient language models on IoT and embedded systems-where memory is tiny, power is precious, and latency is not a suggestion. Inside, you'll learn how modern language models actually work (without drowning in theory), then quickly move into the real-world engineering problems that matter: shrinking models, optimizing inference, managing memory, and squeezing performance out of hardware that was never designed to run chatbots.
From microcontrollers and SoCs to NPUs and edge accelerators, this book shows how to design models that respect hardware limits instead of ignoring them. You'll explore proven techniques like quantization, pruning, knowledge distillation, and low-rank adaptation-explained clearly and applied specifically to edge AI. We'll cover toolchains such as TensorFlow Lite and ONNX, walk through efficient deployment pipelines, and discuss how to fine-tune and personalize models without setting your device on fire (thermally or emotionally). Security, privacy, and reliability are treated as first-class citizens, not afterthoughts.
Learn why on-device language models are a win for privacy, how to protect models from extraction, and what it takes to deploy responsibly at scale. Real-world case studies-from smart homes to industrial IoT, wearables, robotics, and automotive systems-ground the concepts in practical experience. Finally, we'll look ahead: ultra-low-power models, hardware-model co-design, federated learning, and where edge language intelligence is actually going (hint: smaller, smarter, and everywhere). This book is written for embedded engineers, AI practitioners, system architects, and curious builders who want language models that ship, not just demo.
If you're tired of cloud dependency, excited by efficient AI, and ready to make language models work in the real world, this book is for you. Small models. Big impact. Welcome to the edge.