Frontier Models

Frontier Models Jun 06, 2025 · Elena Volkov · ~5 min read

Gemini's New Version Tops Arena Leaderboard, But Is Jailbroken Shortly After Release

Multimodal failures and model personality issues: the model's interaction style during code debugging and a series of security jailbreaks raise questions about product reliability.

Frontier Models May 21, 2025 · Lin Mei Huang · ~13 min read

Google's Annual Showcase: All AI Models Upgraded; Gemini 2.5 Dominates Top Spots, New Video/Image Models Debut

At Google I/O, the entire Gemini suite was upgraded. The Gemini 2.5 line, the video (Veo) and image models, and the developer APIs all received a dense round of updates on the same day.

Frontier Models May 21, 2025 · James Hayes · ~13 min read

Google's Annual Power Move: All AI Models Upgraded; Gemini 2.5 Dominates Rankings, New Video/Image Models Debut

Gemini 2.5 family GA and Flash-Lite introduce segmented SKUs along the latency-cost curve to target enterprise and edge scenarios.

Frontier Models May 16, 2025 · Amara Okonkwo · ~15 min read

GPT-4V Only Reaches Level-2? Global First Multimodal Generalist Ranking Released, General-Level Sets New Paradigm for Evaluating Multimodal AI

Anthropic releases Claude 4 (Opus/Sonnet), focusing on long-horizon autonomous tasks and coding scenarios, while simultaneously raising API pricing tiers.

Frontier Models May 06, 2025 · James Hayes · ~4 min read

Large Models Fail En Masse! New Chinese Web Search Test Shows GPT-4o Accuracy at Just 6.2%

Search experience revolution: 'AI Mode' expands in the US, coexisting with traditional link lists, sparking debate over traffic distribution and ad landscapes.

Frontier Models Apr 17, 2025 · James Hayes · ~7 min read

Testing o3/o4-mini: Solving Euler's Problem in 3 Minutes Proves OpenAI's Top Model Lives Up to Its Reputation

OpenAI releases reasoning models like o3 and o4-mini, emphasizing tool use and scientific/programming logic. Third-party reproductions and hallucination rates spark debate.

Frontier Models Mar 29, 2025 · David Kowalski · ~9 min read

Toward Swarm Intelligence: BAAI Unveils First Cross-Embodiment Brain-Cerebellum Collaboration Framework and Open-Source Embodied AI Brain

BAAI releases a robot brain-cerebellum collaboration framework and open-source ecosystem, advancing real-world data integration and swarm intelligence narratives.

Frontier Models Mar 26, 2025 · David Kowalski · ~2 min read