Software.
Google I/O 2026: Gemini 3.5 Flash GA (AA Intelligence Index 55.3, Terminal-Bench 2.1 76.2%, ~280 tok/sec — beats prior-gen Gemini 3.1 Pro on most agent benchmarks at 4x speed; list price $1.50/$9 per Mtok), Gemini Omni Flash (native multimodal create-and-edit with grounded video out, immediately in Gemini app + Flow + YouTube Shorts), Antigravity 2.0 standalone agent IDE with managed Linux sandboxes, TPU 8t/8i dual-chip 8th gen, and Managed Agents API
blog.google/innovation-and-ai/technology/developers-tools/google-io-2026-developer-highlights, deepmind.google model card, artificialanalysis.ai, simonwillison.net
Alibaba Cloud Summit ships Qwen3.7-Max (AA Intelligence Index 56.6 — first Chinese model in the global top 5, ahead of Gemini 3.5 Flash; 1M context, demonstrated 35-hour autonomous 1,158-tool-call run), Qwen3.7-Plus-Preview (multimodal sibling), homegrown Zhenwu M890 AI chip (3x predecessor), and a rebuilt full-stack agentic cloud platform
alibabacloud.com/blog/alibaba-unveils-new-ai-chip-flagship-model-and-rebuilt-cloud-stack-ai-for-agentic-era, marktechpost.com, Artificial Analysis
Cohere ships Command A+ under Apache 2.0 — first-ever fully-permissive license from Cohere on a 218B/25B-active MoE that runs on 2x H100s; τ²-Bench Telecom jumped 37% (Command A Reasoning) to 85%, Terminal-Bench Hard 3% to 25%, AA-Omniscience Non-Hallucination #1 at 86%; 48 languages with citation grounding for sovereign / on-prem RAG deployments
cohere.com/blog/command-a-plus, venturebeat.com, artificialanalysis.ai, MarkTechPost
Microsoft Research Fara1.5 family (4B / 9B / 27B, open weights, fine-tuned on Qwen3.5) — 27B variant hits 72% Online-Mind2Web, beating OpenAI Operator (58.3%) and Gemini 2.5 Computer Use (57.3%); released with MagenticLite sandboxed browser plus FaraGen1.5 synthetic-data pipeline; collapses per-task cost for browser-agent fleets
microsoft.com/en-us/research/articles/fara1-5-computer-use-agent, decrypt.co, Azure AI Foundry
Anthropic publishes Project Glasswing initial update — Claude Security beta (Opus 4.7-powered) patched 2,100 vulnerabilities in three weeks; Anthropic reiterates Mythos will NOT release publicly until 'far stronger safeguards' exist despite UK AISI showing Mythos Preview at 73% on expert CTFs and first to autonomously complete the 32-step 'Last Ones' corporate intrusion (3 of 10 runs)
anthropic.com/research/glasswing-initial-update, aisi.gov.uk evaluation-of-claude-mythos-previews-cyber-capabilities
What this means
Two simultaneous arcs: (1) speed-intelligence Pareto frontier reset — Gemini 3.5 Flash now beats prior-gen Pro at 4x throughput, so architects should re-baseline cost-per-task this quarter rather than waiting for Pro-tier price drops; (2) agent-platform lock-in becomes the procurement decision — every flagship this week (Antigravity 2.0, Codex Goal mode, Claude Managed Agents sandboxes, Grok Build, Command A+) is engineered for sustained agentic loops, with the harness (AGENTS.md, SKILL.md, sandbox runtime) now the multi-year commit rather than the model. Boards should treat federal procurement and capability-gating (Anthropic's continued Mythos non-release; METR's first Frontier Risk Report May 19) as durable vendor-risk dimensions, not capability-fade speculation.