China's AI Model Lands on Nature Cover! DeepSeek Reveals R1 Training Cost Just $2 Million
DeepSeek R1's cover feature in Nature sparks debate over training costs, scientific evaluation, and discourse power.
Releases, papers, SOTA benchmarks
Ant Group and Renmin University unveil a diffusion language model roadmap, exploring the potential paradigm of combining Mixture-of-Experts with diffusion for language modeling.
A company billed as 'Europe's OpenAI' is under fire for allegedly distilling data from DeepSeek and fabricating results, triggering a significant credibility crisis.
GPT-5 pricing and price wars: API rates seen as pressure on competitors, sparking debate over 'commoditization' of models.
OpenAI launches GPT-5 with unified routing and fast-response modes to capture the default chat entry point. Day-one routing glitches spark user experience controversies.
Google's IMO-level math reasoning model is now open for trial. Its performance at the Math Olympiad level has become a focal point of summer discussions.
DeepSeek's papers on sparse attention and reasoning systems, their academic recognition, and the company's cost transparency fuel the 'China vs. Silicon Valley' narrative.
Chinese large-model 'industry papers' and cost-reduction practices come to light: training and inference costs, distillation, and open-source strategies become the focus of public debate.
Tencent's RLVER applies reinforcement learning to open-domain emotional tasks.
A 4B-parameter model trained with reinforcement learning rivals large models on math reasoning.