AI model intelligence feed
Frontier model releases, benchmarks, and price changes — in one live feed.
Built for AI engineers, founders, and analysts who need to know what changed before everyone else. Tracks the mainstream models and agents — releases, benchmarks, leaderboards, capital activity, and provider status across the frontier labs and the open-source race.
Recent releases
Claude 4.8 Opus
released 2026-05-28
Anthropic's flagship hybrid-reasoning model — Opus 4.8 pushes coding and AI-agent workflows further (agentic coding 69.2% vs 64.3% on 4.7; multidisciplinary reasoning 57.9% vs 54.7%). 1M-token context, $5/$25 per MTok pricing held flat from Opus 4.7. Introduces Dynamic Workflows for Claude Code (research preview). GA on Anthropic API (`claude-opus-4-8`), AWS Bedrock, Google Vertex, Microsoft Foundry, and GitHub Copilot.
- Context: 1,000,000 tokens
- License: proprietary
Gemini 3.5
released 2026-05-20
Google DeepMind's next-gen Gemini — positioned as "frontier intelligence with action". Built for complex agentic workflows. Announced at Google I/O 2026.
- Context: 2,000,000 tokens
- License: proprietary
Qwen3.7-Max
released 2026-05-20
Alibaba's flagship agent model: 1M-token context, extended-thinking mode, and 56.6 on the Artificial Analysis Intelligence Index v4.0 (5th overall, #1 among Chinese models). Scores 50.8% on Terminal-Bench Hard. Designed for long-horizon agent workloads (hundreds to thousands of steps). Closed-weight, $2.50/$7.50 per 1M tokens.
- Context: 1,000,000 tokens
- License: proprietary
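To make the per-token prices quoted above concrete, here is a minimal sketch that prices a single request at the Claude 4.8 Opus ($5/$25) and Qwen3.7-Max ($2.50/$7.50) rates. It assumes each pair reads as input price / output price per 1M tokens, which is the usual convention but is not stated in the feed. The workload size (200,000 input tokens, 20,000 output tokens) and the `Pricing` class are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """USD prices per 1M tokens (assumed to be input/output, in that order)."""
    input_per_mtok: float
    output_per_mtok: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        # Scale each token count by its per-million price, then sum.
        total = input_tokens * self.input_per_mtok + output_tokens * self.output_per_mtok
        return total / 1_000_000


# Prices as quoted in the feed entries above.
PRICES = {
    "Claude 4.8 Opus": Pricing(input_per_mtok=5.00, output_per_mtok=25.00),
    "Qwen3.7-Max": Pricing(input_per_mtok=2.50, output_per_mtok=7.50),
}

# Hypothetical request: a long prompt with a moderate completion.
INPUT_TOKENS = 200_000
OUTPUT_TOKENS = 20_000

for name, pricing in PRICES.items():
    print(f"{name}: ${pricing.cost(INPUT_TOKENS, OUTPUT_TOKENS):.2f}")
```

Under these assumptions the same request costs $1.50 on Opus 4.8 and $0.65 on Qwen3.7-Max. Real bills may differ where providers apply caching discounts or long-context surcharges, neither of which this sketch models.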
Gemini Omni
released 2026-05-20
Multimodal Gemini variant introduced at Google I/O 2026 — unified text, image, audio, and video processing in a single model.
- Context: 1,000,000 tokens
- License: proprietary
GPT-5.5
released 2026-04-23
OpenAI bills GPT-5.5 as its smartest and most intuitive model. It succeeds GPT-5 with major gains in coding, research, and document workflows. A GPT-5.5 Pro variant is also available for heavier reasoning.
- Context: 400,000 tokens
- License: proprietary
DeepSeek-V4-Flash
released 2026-04-22
Smaller, faster sibling to DeepSeek-V4-Pro. Same 1M context window with a much lighter 284B / 13B-active MoE.
- Context: 1,000,000 tokens
- Params: 284B (13B active)
- License: MIT
- Source: open
AI agents
The deployed-product layer atop raw models: coding agents, browser agents, autonomous assistants.
Replit Agent
Replit · coding
Replit's in-browser coding agent. Agent 4 (Mar 2026) introduced parallel task forking that auto-resolves merge conflicts ~90% of the time.
Hermes Agent
Nous Research · general · open source
Open-source AI agent from Nous Research with a built-in learning loop — creates skills from experience, persists knowledge, builds a model of its user across sessions. ~60K stars in two months.
Cursor
Anysphere · coding
AI-native code editor, fork of VS Code. Hit $2B ARR in February 2026. Composer mode for natural-language multi-file refactors.
Codex
OpenAI · coding
OpenAI's coding agent — runs in CLI, IDE, and cloud-hosted runtimes. Drives the Plus/Pro coding workflow.
Claude Code
Anthropic · coding
Anthropic's CLI coding agent. Top of the SWE-bench Verified leaderboard at 80.8% on Opus 4.6. Lives in the terminal — reads files, executes commands, navigates repos, applies multi-file edits autonomously.
OpenClaw
Peter Steinberger · general · open source
Open-source autonomous AI agent (formerly Clawdbot, Moltbot — lobster theme). Self-hosted; supports Claude, GPT, KIMI, MiMo, Qwen 3, Llama 4, Mistral, plus local models via Ollama. ~347K GitHub stars.
Latest news
- release
OpenAI SDK v2.44.0
v2.44.0 was released on 2026-06-24 with a bug fix in auth to prioritize the first auth header. This matters because header precedence can affect which credentials are used when multiple auth headers are present, reducing ambiguity and unexpected authentication behavior.
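The precedence rule described above can be illustrated with a small sketch. This is not the OpenAI SDK's code, and the feed does not say which header names are involved. The function `first_auth_header`, the use of `Authorization` as the header name, and the token values are all hypothetical. It only shows what "the first auth header wins" means when a request carries more than one.

```python
from typing import Iterable, Optional, Tuple


def first_auth_header(
    headers: Iterable[Tuple[str, str]],
    name: str = "authorization",
) -> Optional[str]:
    """Return the value of the first header matching `name` (case-insensitive).

    Later headers with the same name are ignored, so the result does not
    depend on how many duplicates follow the first one.
    """
    wanted = name.lower()
    for key, value in headers:
        if key.lower() == wanted:
            return value
    return None


# Hypothetical request headers with two competing credentials.
request_headers = [
    ("Authorization", "Bearer first-token"),
    ("Content-Type", "application/json"),
    ("authorization", "Bearer second-token"),
]

print(first_auth_header(request_headers))
```

Keeping headers as an ordered list of pairs, rather than a dictionary, is what makes "first" well defined here: a plain dictionary would silently keep only one entry per key.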
- release
Anthropic SDK v0.112.0
v0.112.0, released on 2026-06-24, adds client support for streaming `system.message` events and includes API updates for a new refusal category and sending a User Profile ID in request headers. The release also fixes the memory tool so it creates parent directories with the correct permissions, a small but important filesystem-safety change.
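The parent-directory fix mentioned above touches a common filesystem pitfall, sketched here in plain Python. This is not the Anthropic SDK's implementation: the helper name `ensure_private_parents`, the `0o700` mode, and the directory layout are assumptions for illustration. The sketch creates each missing directory one level at a time and sets its mode explicitly, because `os.makedirs` does not apply its `mode` argument to newly created intermediate directories (Python 3.7+) and `os.mkdir` modes are masked by the process umask. It assumes a POSIX filesystem.

```python
import os
import stat
import tempfile
from pathlib import Path


def ensure_private_parents(file_path: Path, root: Path, mode: int = 0o700) -> None:
    """Create missing directories between `root` and `file_path`'s parent.

    Each directory this function creates gets exactly `mode`.
    Directories that already exist are left untouched.
    Raises ValueError if `file_path` resolves outside `root`.
    """
    root = root.resolve()
    parent = file_path.resolve().parent
    relative = parent.relative_to(root)  # ValueError if outside root
    current = root
    for part in relative.parts:
        current = current / part
        try:
            os.mkdir(current, mode)
        except FileExistsError:
            continue
        # mkdir's mode is filtered by the umask, so set it explicitly.
        os.chmod(current, mode)


# Usage: create nested parents for a hypothetical memory file.
with tempfile.TemporaryDirectory() as tmp:
    base = Path(tmp)
    target = base / "memories" / "projects" / "notes.md"
    ensure_private_parents(target, base)
    for directory in (base / "memories", base / "memories" / "projects"):
        print(directory.name, oct(stat.S_IMODE(directory.stat().st_mode)))
```

Whether owner-only permissions are the "correct" ones for a given deployment is a judgment call. The point of the sketch is only that every level of the path needs its mode set deliberately.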
Introducing computer use in Gemini 3.5 Flash
Google introduced computer-use capabilities in Gemini 3.5 Flash, enabling the model to interact with computer interfaces as part of its workflow. It matters because this moves Gemini from text and image generation toward agentic task execution, a step toward automating multi-step actions in software.
- infra
Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel
NVIDIA NeMo AutoModel is being highlighted for accelerating Transformer fine-tuning workflows. It matters because faster fine-tuning can shorten iteration cycles for adapting large models to new tasks and datasets.
- capital
OpenAI reveals its first AI processor: Jalapeño
OpenAI revealed its first AI server chip, Jalapeño, an “intelligence processor” built with Broadcom and designed as an ASIC for AI inference in current and future large language models. It matters because it marks OpenAI’s move into custom silicon for serving products like ChatGPT and Codex, following its chip partnership announcement just nine months earlier.
- capital
Exclusive: XCures Lands $46M Series B To Clean Up Messy Medical Records With AI
xCures raised a $46 million Series B led by Innovius Capital, with iGrow, Spring Mountain Capital and existing backers participating, bringing total funding to more than $76 million and valuing the 2018 startup at $127 million post-money. The company, which pivoted from direct-to-consumer cancer decision support after wrestling with faxed and FedExed records, is betting its “Clinical Clarity Engine” can turn messy patient data into decision-ready clinical intelligence faster than transport-focused rivals.
- capital
Why Ex-Meta CTO Mike Schroepfer Says It’s A Great Time To Build A Hard Tech Company: ‘Infrastructure Is The Moat’
Mike Schroepfer, Meta’s former CTO, founded Gigascale Capital in 2023 and just raised a $250 million first institutional fund to back companies rebuilding the physical economy, with more than 25 portfolio investments and check sizes from $1 million to $10 million. He says AI is making software cheaper while shifting the moat to infrastructure like power, compute, manufacturing and supply chains, and argues the combination of rising demand and falling costs in areas like solar, batteries and electrolyzers makes this a rare moment for hard-tech startups.
- infra
OpenAI and Broadcom unveil LLM-optimized inference chip
OpenAI and Broadcom unveiled Jalapeño, a custom AI chip designed for LLM inference to improve performance, efficiency, and scale across AI systems. A chip optimized specifically for inference could lower serving costs and increase throughput for large models, which is increasingly important as deployment demand grows.
Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World
FFASR introduced a new leaderboard for benchmarking automatic speech recognition in real-world conditions. It matters because it gives researchers and practitioners a way to compare ASR systems on practical, noisy, and diverse audio rather than only on controlled test sets.
- capital
Anthropic Backer Menlo Ventures Raises $3B In New Funds To Back AI Startups Across Stages
Menlo Ventures raised $3 billion in new capital, its largest fundraise in 50 years, split between Menlo Ventures XVII for seed and Series A deals and Menlo Inflection IV for Series B+ AI startups across enterprise, healthcare, and consumer. The firm pointed to its early 2023 investment in Anthropic—now valued at $965 billion and planning a 2026 IPO—as the basis for its AI strategy and a potential path to its biggest exit ever.