Story
triton_releases · Jun 26, 2026 · release
github.comJun 26, 2026
original source linked
Release highlights
- (Breaking) Removed deprecated Windows support : Dropped the Windows server build and removed Windows from the core build and documentation
- (Breaking) Python client BF16 handling : The client now requires ml_dtypes.bfloat16 arrays for BF16 input/output tensors and no longer casts to/from np.float...
- TensorRT backend : Added multi-device (multi-GPU) inference support
Feed lens
agent
Continue reading