Anthropic's native Claude Platform is now generally available through AWS. It brings authentication via IAM, auditing via CloudTrail, billing on your existing AWS invoice, and day-one access to every native API feature: Managed Agents, Skills, code execution, the MCP connector, the Console, and Opus 4.7 / Sonnet 4.6 / Haiku 4.5.
News
What's happening in web development and AI.
Three new realtime audio models in the API. GPT-Realtime-2 brings GPT-5-class reasoning to voice with 128K context, parallel tool calls, adjustable reasoning, and a 26-point lift in Zillow's adversarial call benchmark. Plus a 70-language live translator and a streaming transcriber.
Anthropic's 2026 Dev Conference shipped four updates to Claude Managed Agents: Multi-Agent Orchestration (~33% cheaper), Memory (Global + Personal markdown), Dreaming (research preview—agents review past sessions and rewrite their own memory between runs), and Outcomes (coming soon).
Claude Code's five-hour limits doubled across Pro, Max, Team, and Enterprise. Peak-hour throttling on Pro and Max has been removed, and Opus API rate limits have been raised. The added capacity comes from a SpaceX Colossus 1 deal: 300+ MW and 220,000+ NVIDIA GPUs within the month.
Cloudflare ships @cloudflare/dynamic-workflows—a 300-line MIT library that lets a single Worker dispatch durable workflow runs to per-tenant code. Closes the gap between dynamic deployment and durable execution, with CI/CD as the headline use case.
GPT-5.5 hits 82.7% on Terminal-Bench 2.0, 58.6% on SWE-Bench Pro, and 78.7% on OSWorld-Verified—matching GPT-5.4 latency while using fewer tokens. Rolling out today in ChatGPT and Codex; API at $5/$30 per 1M tokens with a 1M context window.
Codex-powered shared agents that handle long-running team workflows, runnable in ChatGPT or Slack, governed by org permissions. Five launch-day agent patterns include software review, lead outreach, and weekly metrics. Free until May 6, then credit-based.
Vertex AI becomes the Gemini Enterprise Agent Platform with graph-based ADK, Memory Bank, A2A orchestration, Agent Identity, Agent Gateway, Model Armor, and Agent Simulation. Plus an Agent Gallery with validated partner agents from Salesforce, ServiceNow, Adobe, and more.
Two autonomous research agents in the Gemini API—one tuned for speed, one for depth via extended test-time compute. Both ship with MCP support, native chart and infographic generation, multimodal grounding, and collaborative planning.
Anthropic Labs ships Claude Design, a Claude Opus 4.7-powered tool for designs, prototypes, slides, and one-pagers. Brand systems baked in from your codebase, inline edits and custom sliders, and a one-instruction handoff bundle for Claude Code.
A purpose-built reasoning model for biology, drug discovery, and translational medicine, plus a free Life Sciences research plugin for Codex that orchestrates 50+ public databases and tools. Beats GPT-5.4 on 6 of 11 LABBench2 tasks.
Codex adds background computer use on Mac, an in-app browser, gpt-image-1.5, 90+ plugins mixing skills and MCP, GitHub PR review support, multi-terminal and alpha SSH devboxes, memory preview, and deeper automations.
Opus 4.7 is generally available with stronger long-horizon software engineering, higher-resolution vision for screenshots and diagrams, a new xhigh reasoning tier, and API task budgets—pricing unchanged from Opus 4.6.
Meta ships Muse Spark from its new Superintelligence Labs — a multimodal reasoning model with multi-agent orchestration, 10x compute efficiency over Llama 4, and a Contemplating mode that hits 58% on Humanity's Last Exam.
JetBrains surveyed 10,000 developers worldwide. Claude Code grew 6x in nine months to match Cursor at 18% adoption, Copilot's growth stalled at 29%, and staff+ engineers are the fastest adopters of agentic workflows.
Cursor ships a new interface built from scratch around AI agents. Run parallel agents across repos, hand off between local and cloud, compare models with /best-of-n, and annotate UI elements in Design Mode.
Cloudflare ships EmDash, an open-source TypeScript CMS built on Astro that sandboxes plugins in Worker isolates, scales to zero, and ships with MCP and AI agent tooling built in. MIT-licensed and targeting a new generation of developers.
JetBrains announces Central, an open system that connects coding agents from any ecosystem — Claude, Codex, Gemini CLI — with governance, execution infrastructure, and shared semantic context. Code With Me is retired.
Anthropic ships auto mode for Claude Code — a two-layer classifier system that approves safe actions and blocks dangerous ones, replacing the approval fatigue that pushed developers to skip permissions entirely.
Spline ships Omma, a generative AI canvas that unifies 3D, motion, animation, and UI into a single natural language workflow. Build production-ready interactive experiences in minutes, not weeks.
Anthropic ships computer use in Claude Cowork and Claude Code, letting Claude point, click, and navigate your screen. Paired with Dispatch, you can assign work from your phone and walk away.
Google ships the Antigravity coding agent in AI Studio with built-in Firebase integration. Build multiplayer apps, add databases and auth, and connect to real-world services — all from a single prompt.
WordPress.com adds 19 write capabilities to its MCP integration, letting AI agents draft posts, build pages, manage comments, and organize content — all with approval safeguards.
Workers AI now serves frontier-scale open-source models with a 256k context window. Cloudflare cut its own security agent costs by 77% and ships prefix caching, session affinity, and async APIs for agentic workloads.
Cursor builds its own frontier-level coding model with continued pretraining and RL. Scores 61.3 on CursorBench and 73.7 on SWE-bench Multilingual, priced at $0.50/M input tokens.
Next.js bundles AGENTS.md in create-next-app for 100% eval pass rates, forwards browser errors to the terminal for agent debugging, and adds experimental Agent DevTools.
Netlify launches Agent Runners, letting teams start web projects from prompts using Claude Code, Codex, or Gemini CLI — with live apps on production infrastructure in minutes.
A new Google Labs preview that converts design mockups into production-ready front-end code with surprising fidelity.
GitHub's MCP Server can now scan code changes for exposed secrets before committing, letting AI coding agents catch credential leaks in real time.
A new plugin for Claude Code and Cursor injects 47 skills and platform-specific knowledge directly into the agent's context.
Perplexity's answer to OpenClaw: a Mac mini-based AI agent that orchestrates 20+ models across 400+ apps, positioned as the "serious" alternative with full audit trails and a kill switch.
The world's most popular editor shifts from monthly to weekly stable releases, enabled by AI agents handling code review, issue triage, and validation.
The YC president open-sources a collection of task-specific prompt sets designed for LLM-powered development workflows.
Event-driven coding agents that trigger from Slack, GitHub, and PagerDuty, with no human prompt required. Cursor also demonstrated agents building a browser from scratch over a week with no human involvement.
An AI agent built in an hour with Claude Code reached 250,000 GitHub stars in 60 days, obliterating React's decade-long record. Its creator then joined OpenAI.
One engineer and Claude AI produced vinext across 800+ sessions in 7 days for $1,100 in API tokens. The result: a drop-in Next.js replacement with 4.4x faster builds, 57% smaller bundles, and single-command deploys to Cloudflare Workers.