Research
70% Size, 100% Accuracy
KV Cache is 1 Bit Per Channel
NoMAD-Attention
Accelerating Inference
Sketch to Adapt
Contact
Blog
Llama Agents
Sign Up
model-release
Home
model-release
model-release
Meet the xMADified Gemma 2 (9B): High Pe...
By xMAD.ai
Date
Nov 01 2024