I became interested in LMCache because it sits in the part of LLM serving that feels both very practical and very under-discussed: KV cache movement.