Tag: vllm

Reproducing vLLM and LMCache KV Cache Reuse on a CPU-Only MacBook

July 3, 2026

I became interested in LMCache because it sits in the part of LLM serving that feels both very practical and very under-discussed: KV cache movement.

vllm
lmcache
kv cache
cpu
uv