Hands-On LLM Serving and Optimization: Hosting LLMs at Scale 1st Edition

★★★★★ 4.9 126 reviews

$68.32
Price when purchased online
Free shipping Free 30-day returns

Sold and shipped by parkfuels.com


Product details

Management number 222070571
Release Date 2026/05/04
List Price $27.33
Model Number 222070571

Large language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era.

Without proper optimization, however, LLMs can be expensive and slow to serve. Hands-On LLM Serving and Optimization is a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.

In this hands-on, engineering-focused book, authors Chi Wang and Peiheng Hu combine practical examples, code, and strategies for building robust, performant, and cost-efficient AI token factories. Whether you’re building the LLM inference infrastructure or the applications that consume it, a deep understanding of LLM serving will make you a more effective, future-ready engineer as AI transforms how we work and build.

Learn the foundations of model serving with core concepts, design paradigms, and industry best practices
Understand the common challenges of hosting LLMs at scale
Balance latency and throughput to meet the demands of AI applications and business requirements
Host LLMs cost-effectively with practical, code-backed techniques

ISBN13 979-8341621497
Edition 1st
Language English
Publisher O'Reilly Media
Dimensions 7 x 2 x 9.19 inches
Item Weight 1.31 pounds
Print length 371 pages
Publication date June 2, 2026


Customer ratings & reviews

4.9 out of 5 ★★★★★
126 ratings | 52 reviews
5 stars: 89% (112)
4 stars: 1% (1)
3 stars: 0% (0)
2 stars: 0% (0)
1 star: 10% (13)

There are currently no written reviews for this product.