Cost Revolution & Global Competition
DeepSeek, cost reduction, and international competition.
DeepSeek V2: Multi-head Latent Attention
DeepSeek V2 introduced Multi-head Latent Attention (MLA), a novel attention mechanism that compressed the KV cache by 93.3%, making frontier-quality inference dramatically cheaper and signaling that architectural innovation could substitute for brute-force compute.
DeepSeek V3: Frontier Quality at Startup Cost
DeepSeek V3 matched Claude 3.5 Sonnet and GPT-4o across most benchmarks while training for just $5.576 million, combining innovations in FP8 training, multi-token prediction, and efficient MoE routing to shatter assumptions about the cost of frontier AI.
The DeepSeek Cost Revolution
DeepSeek demonstrated through V2, V3, and R1 that frontier AI could be built for a fraction of Western lab budgets, triggering a trillion-dollar market shock and forcing the entire industry to rethink the relationship between compute spending and AI capability.
Qwen 1 and 2: Alibaba’s Ascent
Alibaba’s Qwen model family evolved from a competent bilingual system in 2023 to a leading open-weight family by late 2024, demonstrating that consistent iteration on data quality and architecture could close the gap with Western frontier models.
Qwen 3: The Open Frontier Challenger
Qwen 3 brought hybrid thinking, MoE scaling, and 119-language support to the open-weight ecosystem, challenging the notion that frontier reasoning required closed, proprietary models.
Chinese AI Labs: The Global Competition Landscape
Beyond DeepSeek and Qwen, a diverse ecosystem of Chinese AI labs emerged between 2023 and 2025, collectively challenging Western dominance through architectural innovation, massive domestic deployment, and creative adaptation to chip export restrictions.