Kimi Paper Reveals Inference Architecture, Handling 80% of Traffic
Kimi paper reveals inference architecture handling 80% of traffic: AI forward-looking topic (batch3, outside main timeline Jan 2025–May 2026).
Browse AI news across models, agents, media, industry, and compute policy.
Kimi paper reveals inference architecture handling 80% of traffic: AI forward-looking topic (batch3, outside main timeline Jan 2025–May 2026).
HKU and ByteDance have released an open-source autoregressive text-to-image model, enabling Llama to generate images. Online demos are now available.
StepFun's valuation has doubled in six months, shifting the large model startup landscape into a new 'Big Six' configuration.
Yao Class prodigies collaborate on the sequel to the hit game, introducing AI tools for workplace efficiency.
Kaiming He leads his first team post-MIT hiring on a new AI generation project, with double Olympiad gold medalist Mingyang Deng participating. This work falls outside the main 2025-2026 publication timeline.
Jensen Huang's vision of a robot world still requires AI data to be trained. This is an AI forward-looking topic (batch 3, outside the main timeline of Jan 2025–May 2026).
A new pure MLP architecture from Tsinghua University and Ant Group outperforms Transformers in short- and long-term time series forecasting.
A new domestic generative AI model matches AlphaFold3 in performance, simultaneously predicting antigen-antibody complex structures and enabling de novo antibody design.
Surpassing Devin! Led by the Yao Class, they set a new record in large model programming. (AI Forward Agenda, batch 3; published outside the main timeline of Jan 2025–May 2026).
BaiChuan unveils a new model that tops Chinese benchmarks alongside its first AI assistant, Bai Xiaoying, dubbed the most search-savvy. This falls outside the main 2025-2026 timeline.