The 2024 Frontier Race

Claude 3, Gemini, and the frontier model competition.

Claude 3 Family

Anthropic’s March 2024 release introduced a three-tier model system — Haiku, Sonnet, and Opus — all with 200K context windows, with Opus becoming the first model to credibly challenge GPT-4’s supremacy across major benchmarks.

Gemini 1.5

Google DeepMind’s Gemini 1.5, released in February 2024, introduced a Mixture of Experts architecture with an unprecedented 1 million token context window — later extended to 2 million — fundamentally redefining what it means to give a model “enough context.”

GPT-4o

OpenAI’s GPT-4o (“Omni”), released in May 2024, was the first truly unified multimodal model — trained end-to-end to accept and generate text, audio, images, and video through a single neural network, at 2x the speed and half the cost of GPT-4 Turbo.

Claude 3.5 Sonnet

Released on June 20, 2024, Claude 3.5 Sonnet shattered the assumption that mid-tier models must be inferior — it outperformed Claude 3 Opus on nearly every benchmark at 2x the speed and lower cost, becoming the most influential single model release of 2024.

LLaMA 3 and LLaMA 3.1

Meta’s LLaMA 3 (April 2024) and LLaMA 3.1 (July 2024) proved that open-weight models could compete at the absolute frontier, with the 405B parameter model rivaling GPT-4o and Claude 3.5 Sonnet while being freely available for download.

LLaMA 3.2: Multimodal and Edge Models

Meta’s LLaMA 3.2 (September 2024) brought vision capabilities to the open-weight LLaMA family for the first time with 11B and 90B multimodal models, while also releasing tiny 1B and 3B text models for on-device deployment — and LLaMA 3.3 later showed a 70B model could match the 405B.

Grok and xAI

Elon Musk’s xAI built Grok from zero to frontier-competitive in under two years, open-sourcing the 314B parameter Grok-1, scaling on the massive Colossus GPU cluster, and reaching the top of LMArena rankings by late 2025 — embodying the “move fast, scale hard” philosophy.

PaLM 2 and the Gemini Evolution

Google’s journey from PaLM (540B dense, 2022) through PaLM 2 (Chinchilla-optimal, 2023) to Gemini 1.0 (2023) and Gemini 1.5 (MoE, 2024) traces the company’s strategic pivot from “scale the biggest model” to “scale efficiently with MoE and long context.”

Mistral Large and Enterprise Expansion

Mistral AI expanded from its scrappy open-source origins into a full enterprise AI platform through 2024, releasing Mistral Large 2 (123B dense), Codestral (22B code specialist), Pixtral (12B multimodal), and Mistral Nemo (12B) — establishing Europe’s first credible frontier AI lab.