📰 Story

vllm_releases · Jun 12, 2026 · release

← Live feed 📰 Daily recap 🗓️ Weekly recap 🔔 RSS

vllm v0.23.0

Release highlights

  • DeepSeek-V4 matures across backends : Following its introduction in v0.22.0, DeepSeek-V4 received another large hardening and optimization pass. Its sparse M...
  • Model Runner V2 expands to more dense models : MRv2 is now selected by default for Llama and Mistral dense models in addition to Qwen3. It gained a FlashInfe...
  • Rust frontend grows up : The experimental Rust frontend added a streaming generate endpoint , dynamic LoRA endpoints , /version and /server_info endpoints, a...
Read the original at github.com →Open in live feed

Related stories 4 items