NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark
Establishes the frame — a new agentic-infrastructure benchmark (AgentPerf) that Blackwell leads in its first published round.
4 items · 2 sources · 4 days
Operational story trace
Follow in this browser to see new updates on your Live feed.
Latest change
Anthropic's Claude models are now generally available in Microsoft Foundry on Azure running on NVIDIA GB300 Blackwell Ultra GPUs, putting a frontier model on Blackwell Ultra as a managed Azure option.
Through June, NVIDIA positioned Blackwell as the substrate for agentic AI: it topped the first agentic-infrastructure benchmark (AgentPerf from Artificial Analysis) and swept MLPerf Training 6.0. Attention then moved from training to inference, where DFlash speculative decoding was reported to lift throughput up to 15x on Blackwell.
Arc
Establishes the frame — a new agentic-infrastructure benchmark (AgentPerf) that Blackwell leads in its first published round.
Extends the leadership claim from agentic inference to training with an MLPerf Training 6.0 sweep.
Pivots the thread from training to inference, citing speculative decoding (DFlash) for up to 15x serving gains on Blackwell.
Turns benchmark leadership into product availability — Claude generally available on Azure GB300 Blackwell Ultra via Microsoft Foundry.
Establishes the frame — a new agentic-infrastructure benchmark (AgentPerf) that Blackwell leads in its first published round.
Extends the leadership claim from agentic inference to training with an MLPerf Training 6.0 sweep.
Pivots the thread from training to inference, citing speculative decoding (DFlash) for up to 15x serving gains on Blackwell.
Turns benchmark leadership into product availability — Claude generally available on Azure GB300 Blackwell Ultra via Microsoft Foundry.
What to watch — open questions
Storylines are threaded mechanically from the feed: stories that share a distinctive anchor across multiple days and sources. Each item links to its original source. The evidence trace, current state, and open questions are written by the editor routine and refreshed whenever a new beat lands.