
feat(M32d): KV cache for qwen3_moe inference path — 19× speedup #1832

Merged
noahgift merged 3 commits into main from feat/m32d-moe-kv-cache
May 20, 2026

Commits

Commits on May 20, 2026