ShadowServe Boosts LLM Serving with Interference‑Free KV Cache Fetching
ShadowServe offloads KV cache fetching to a SmartNIC, eliminating interference on the host. At ≤20 Gbps bandwidth, it lowers TPOT by up to 2.2× and TTFT by up to 1.38×, raising throughput by ~1.35×. getnews.me/shadowserve-boosts-llm-s... #shadowserve #smartnic