NeurIPS 24: KV Cache is 1 Bit Per Channel · NoMAD-Attention: Accelerating Inference
Blog
techsolutions | Lightweight Llama - Steps to Make It Eve... | By xMAD.ai | Dec 04 2024
techsolutions | This is the Last Mile of LLMs, and It Ma... | By xMAD.ai | Nov 15 2024
techsolutions | The Overlooked AI Expense That Can Cost ... | By xMAD.ai | Nov 12 2024
techsolutions | Bringing Generative AI Within Reach: How... | By xMAD.ai | Nov 05 2024
model-release | Meet the xMADified Gemma 2 (9B): High Pe... | By xMAD.ai | Nov 01 2024
academia | Revolutionizing AI Adaptation with SpaLL... | By xMAD.ai | Oct 31 2024
techsolutions | The Hidden Flaw in Your AI Strategy: Thi... | By xMAD.ai | Aug 02 2024
Discover How Model Quantization Can Dras... | By xMAD.ai | Aug 02 2024
We achieved 1000 tokens per second on a ... | By xMAD.ai | Aug 02 2024