Story

triton_releases · Jun 26, 2026 · release

Source brief

Release 2.70.0 corresponding to NGC container 26.06

github.comJun 26, 2026
original source linked

Release highlights

  • (Breaking) Removed deprecated Windows support : Dropped the Windows server build and removed Windows from the core build and documentation
  • (Breaking) Python client BF16 handling : The client now requires ml_dtypes.bfloat16 arrays for BF16 input/output tensors and no longer casts to/from np.float...
  • TensorRT backend : Added multi-device (multi-GPU) inference support
Feed lens
agent

Continue reading

Read the original at github.com →Open in live feed

Earlier in this thread 1 item