model-release
Meet the xMADified Gemma 2 (9B): High Pe...
By xMAD.ai
Date: Nov 01 2024