Cost Revolution & Global Competition

DeepSeek, cost reduction, and international competition.

DeepSeek V2: Multi-head Latent Attention

DeepSeek V2 introduced Multi-head Latent Attention (MLA), a novel attention mechanism that compressed the KV cache by 93.3%, making frontier-quality inference dramatically cheaper and signaling that architectural innovation could substitute for brute-force compute.

DeepSeek V3: Frontier Quality at Startup Cost

DeepSeek V3 matched Claude 3.5 Sonnet and GPT-4o across most benchmarks while training for just $5.576 million, combining innovations in FP8 training, multi-token prediction, and efficient MoE routing to shatter assumptions about the cost of frontier AI.

The DeepSeek Cost Revolution

DeepSeek demonstrated through V2, V3, and R1 that frontier AI could be built for a fraction of Western lab budgets, triggering a trillion-dollar market shock and forcing the entire industry to rethink the relationship between compute spending and AI capability.

Qwen 1 and 2: Alibaba’s Ascent

Alibaba’s Qwen model family evolved from a competent bilingual system in 2023 to a leading open-weight family by late 2024, demonstrating that consistent iteration on data quality and architecture could close the gap with Western frontier models.

Qwen 3: The Open Frontier Challenger

Qwen 3 brought hybrid thinking, MoE scaling, and 119-language support to the open-weight ecosystem, challenging the notion that frontier reasoning required closed, proprietary models.

Chinese AI Labs: The Global Competition Landscape

Beyond DeepSeek and Qwen, a diverse ecosystem of Chinese AI labs emerged between 2023 and 2025, collectively challenging Western dominance through architectural innovation, massive domestic deployment, and creative adaptation to chip export restrictions.