Meta · 2026 · 04 · 08 · Model · ~2 min read

Meta released Llama 5

Meta's open-weights flagship got a generational upgrade. 600B+ parameter mixture-of-experts, 5-million-token context window, native video and audio, and a community licence that finally got friendlier for commercial use. Llama 4's promised 'Behemoth' never shipped — Meta jumped straight here.

What's actually new

  • Native video and audio. Earlier Llamas only handled text and images. Llama 5 reads and produces both formats natively.
  • 5-million-token context. Longer than Gemini 3.1's two million. Still degrades past around 1M, but the headline number is real.
  • Recursive self-improvement training. Meta says the model generates its own high-quality training data — a technique that quietly took over the field through 2025.
  • Updated Llama Community License. Friendlier for medium-and-large companies than the Llama 4 terms.

If you want more

Worth knowing~30s
  • Behemoth never shipped. Meta promised the 2-trillion-parameter Llama 4 sibling for over a year, then quietly skipped to Llama 5 — saving face by calling it a strategic pivot.
  • 'Recursive self-improvement' is loaded language. It's a real technique, not the AGI threshold the marketing implied.
  • Inference cost is real. Mixture-of-experts makes the per-question cost lower, but you still need serious GPUs to host it.
Who should care~20s

Companies running their own AI infrastructure who want long-context video and audio without sending data to OpenAI or Anthropic. Privacy-sensitive teams. Researchers studying mixture-of-experts at scale. Anyone whose 'open AI' bet had been Llama 4.

What to do about it~20s

If you're running Llama 4 in production, plan a Llama 5 evaluation in your real workload. Don't believe the launch-day comparisons — Meta's earlier model launches taught us to test on YOUR tasks. The 5M context is the underrated upgrade if your work involves long documents.

Honest take~45s

Llama 5 was Meta's 'we're back' moment after the awkward Behemoth saga. The strategic pivot from 'one giant teacher model' to 'one big multi-modal flagship' was the right call — Behemoth was clearly not paying off. The bigger story is that open-source AI now matches closed-source on most things you'd actually buy, which is structurally cheaper and structurally less locked-in. The leading edge still belongs to closed labs; the production tier doesn't.

Other recent model updates

Sources

Last verified · 2026 · 05 · 05 · Found a fact wrong? corrections@aguidetocloud.com