Shanghai AI Lab Surpasses DeepSeek in Math Reasoning Without Distilling R1, Using RL to Break Limits
Reinforcement learning breakthroughs enable math reasoning surpassing DeepSeek without distilling R1. This 2025-2026 AI industry extension topic complements existing timelines.