Release notes from TensorRT

TensorRT 10.15 Release

2026-02-03T22:22:41Z

For more information, see the TensorRT 10.15 Release Notes:

Sample changes

Added 2 safety samples sampleSafeMNIST, and sampleSafePluginV3 to demonstrate how to use TensorRT with the safety workflow.
Added trtSafeExec to accompany the safety workflow release.
Added python/stream_writer to showcase how to serialize a TensorRT engine directly to a custom stream using the IStreamWriter interface, rather than writing to a file or a contiguous memory buffer.
Added python/strongly_type_autocast to demonstrate how to convert FP32 ONNX models to mixed precision (FP32-FP16) using ModelOpt's AutoCast tool and subsequently building the engine with TensorRT's Strong Typing mode.
Added sampleCudla to demonstrate how to use the cuDLA API to run TensorRT engines on the Deep Learning Accelerator (DLA) hardware, which is available on NVIDIA Jetson and DRIVE platforms.
Deprecated sampleCharRNN.

Plugin changes

Deprecated bertQKVToContextPlugin and will be removed in a future release. No alternatives are planned to be provided.

Parser changes

Added support for RotaryEmbedding, RMSNormalization and TensorScatter for improved LLM model support
Added more specialized quantization ops for models quantized through TensorRT ModelOptimizer.
Added kREPORT_CAPABILITY_DLA flag to enable per-node validation when building DLA engines through TensorRT.
Added kENABLE_PLUGIN_OVERRIDE flag to enable TensorRT plugin override for nodes that share names with user plugins.
Improved error reporting for models with multiple subgraphs, such as Loop or Scan nodes.

Demo changes

demoDiffusion: Stable Diffusion 1.5, 2.0 and 2.1 pipelines have been deprecated and removed.

TensorRT 10.14 Release

2026-01-28T17:30:19Z

10.14 GA - 2025-11-7

Sample changes
- Replace all pycuda usages with cuda-python APIs
- Removed the efficientnet samples
- Deprecated tensorflow_object_detection and efficientdet samples
- Samples will no longer be released with the packages. The TensorRT GitHub repository will be the single source.
Parsers:
- Added support for the Attention operator
- Improved refit for ConstantOfShape nodes
Demos
- demoDiffusion:
  - Added support for the Cosmos-Predict2 text2image and video2world pipelines

TensorRT 10.13.3 Release

2025-09-09T00:16:35Z

See the TensorRT 10.13.3 Release Notes for more information.

Added support for TensorRT API Capture and Replay feature, see the developer guide for more information.

Demo changes

Added support for Flux Kontext pipeline.

TensorRT 10.13.2 Release

2025-08-19T16:44:08Z

10.13.2 GA - 2025-8-18

For more information, see the 10.13.2 release notes.

Added support for CUDA 13.0, dropped support for CUDA 11.X
Dropped support for Ubuntu 20.04
Dropped support for Python versions < 3.10 for samples and demos

TensorRT 10.13 Release

2025-07-24T22:00:51Z

10.13.0 GA - 2025-7-24

Plugin changes
- Fixed a division-by-zero error in geluPlugin that occured when the bias is omitted.
- Completed transition away from using static plugin field/attribute member variables in standard plugins. There's no such need since presently, TRT does not access field information after plugin creators are destructed (deregistered from the plugin registry), nor does access such information without a creator instance.
Sample changes
- Deprecated the yolov3_onnx sample due to unstable url of yolo weights.
- Updated the 1_run_onnx_with_tensorrt and 2_construct_network_with_layer_apis samples to use cuda-python instead of PyCUDA for latest GPU/CUDA support.
Parser changes
- Decreased memory usage when importing models with external weights
- Added loadModelProto, loadInitializer and parseModelProto APIs for IParser. These APIs are meant to be used to load user initializers when parsing ONNX models.
- Added loadModelProto, loadInitializer and refitModelProto APIs for IParserRefitter. These APIs are meant to be used to load user initializers when refitting ONNX models.
- Deprecated IParser::parseWithWeightDescriptors.

TensorRT 10.12 Release

2025-06-18T21:41:29Z

10.12.0 GA - 2025-6-10

Key Features and Updates:

Plugin changes
- Migrated IPluginV2-descendent version 1 of cropAndResizeDynamic, to version 2, which implements IPluginV3.
- Note: The newer versions preserve the attributes and I/O of the corresponding older plugin version. The older plugin versions are deprecated and will be removed in a future release
- Deprecated the listed versions of the following plugins:
  - DecodeBbox3DPlugin (version 1)
  - DetectionLayer_TRT (version 1)
  - EfficientNMS_TRT (version 1)
  - FlattenConcat_TRT (version 1)
  - GenerateDetection_TRT (version 1)
  - GridAnchor_TRT (version 1)
  - GroupNormalizationPlugin (version 1)
  - InstanceNormalization_TRT (version 2)
  - ModulatedDeformConv2d (version 1)
  - MultilevelCropAndResize_TRT (version 1)
  - MultilevelProposeROI_TRT (version 1)
  - RPROI_TRT (version 1)
  - PillarScatterPlugin (version 1)
  - PriorBox_TRT (version 1)
  - ProposalLayer_TRT (version 1)
  - ProposalDynamic (version 1)
  - Region_TRT (version 1)
  - Reorg_TRT (version 2)
  - ResizeNearest_TRT (version 1)
  - ScatterND (version 1)
  - VoxelGeneratorPlugin (version 1)
Demo changes
- Added Image-to-Image support for Stable Diffusion v3.5-large ControlNet models.
- Enabled download of pre-exported ONNX models for the Stable Diffusion v3.5-large pipeline.
Sample changes
- Added two refactored python samples 1_run_onnx_with_tensorrt and 2_construct_network_with_layer_apis
Parser changes
- Added support for integer-typed base tensors for Pow operations
- Added support for custom MXFP8 quantization operations
- Added support for ellipses, diagonal, and broadcasting in Einsum operations

TensorRT 10.11 Release

2025-05-21T22:59:38Z

10.11.0 GA - 2025-5-21

Key Features and Updates:

Plugin changes
- Migrated IPluginV2-descendent version 1 of modulatedDeformConvPlugin, to version 2, which implements IPluginV3.
- Migrated IPluginV2-descendent version 1 of DisentangledAttention_TRT, to version 2, which implements IPluginV3.
- Migrated IPluginV2-descendent version 1 of MultiscaleDeformableAttnPlugin_TRT, to version 2, which implements IPluginV3.
- Note: The newer versions preserve the attributes and I/O of the corresponding older plugin version. The older plugin versions are deprecated and will be removed in a future release.
Demo changes
- demoDiffusion
  - Added support for Stable Diffusion 3.5-medium and 3.5-large pipelines in BF16 and FP16 precisions.
Parser changes
- Added kENABLE_UINT8_AND_ASYMMETRIC_QUANTIZATION_DLA parser flag to enable UINT8 asymmetric quantization on engines targeting DLA.
- Removed restriction that inputs to RandomNormalLike and RandomUniformLike must be tensors.
- Clarified limitations of scan outputs for Loop nodes.

TensorRT OSS v10.10.0

2025-05-09T22:27:41Z

10.10.0 GA

For more information, see the TensorRT 10.10.0 release notes.

Key Features and Updates:

Demo changes
- demoDiffusion
  - Added fp16 and fp8 LoRA support for demo diffusion’s SDXL and FLUX pipeline.
  - Added fp16 ControlNet support for demo diffusion’s SDXL pipeline.
Plugin changes
- Deprecated the enum classes PluginVersion & PluginCreatorVersion. PluginVersion & PluginCreatorVersion are used only in relation to IPluginV2-descendent plugin interfaces, which are all deprecated.
- Added the following APIs that enable users to obtain a list of all Plugin Creators hierarchically registered to a TensorRT Plugin Registry (C++, Python) instance.
  - C++ API: IPluginRegistry::getAllCreatorsRecursive()
  - Python API: IPluginRegistry.all_creators_recursive
Parser changes
- Cleaned up log spam when the ONNX network contained a mixture Plugins and LocalFunctions
- UINT8 constants are now properly imported for QuantizeLinear & DequantizeLinear nodes
- Plugin fallback importer now also reads its namespace from a Node's domain field
Sample changes
- Added support for the python_plugin sample to compile targets to Blackwell.

TensorRT OSS v10.9.0

2025-03-11T22:00:06Z

10.9.0 GA

For more information, see the TensorRT 10.9.0 release notes.

Key Features and Updates:

Demo changes
- demoDiffusion
  - Added Canny ControlNet support for the SDXL pipeline
Plugin changes
- Added a readme to the GroupNormalization plugin (GroupNormalizationPlugin) - 4314
- Fixed bug in CustomQKVToConte mxtPluginDynamic version 3 where SM 100 was not considered a supported platform.
Parser changes
- Added support for Python AOT plugins
- Added support for opset 21 GroupNorm - 4336
- Fixed support for opset 18+ ScatterND
Sample changes
- Added a new sample dds_faster_rcnn which demonstrates how to handle data-dependent shaped outputs with IOutputAllocator.
Fixed issues:
- Fixed streamReaderV2 Python API performance issue - 4327

TensorRT OSS v10.8.0

2025-02-01T01:09:15Z

10.8.0 GA

For more information, see the TensorRT 10.8.0 release notes.

Key Features and Updates:

Demo changes
- demoDiffusion
  - Added Image-to-Image support for Flux-1.dev and Flux.1-schnell pipelines.
  - Added ControlNet support for FLUX.1-Canny-dev and FLUX.1-Depth-dev pipelines. Native FP8 quantization is also supported for these pipelines.
  - Added support for ONNX model export only mode. See --onnx-export-only.
  - Added FP16, BF16, FP8, and FP4 support for all Flux Pipelines.
Plugin changes
- Added SM 100 and SM 120 support to bertQKVToContextPlugin. This enables demo/BERT on Blackwell GPUs.
Sample changes
- Added a new sampleEditableTimingCache to demonstrate how to build an engine with the desired tactics by modifying the timing cache.
- Deleted the sampleAlgorithmSelector sample.
- Fixed sampleOnnxMNIST by updating the correct INT8 dynamic range.
Parser changes
- Added support for FLOAT4E2M1 types for quantized networks.
- Added support for dynamic axes and improved performance of CumSum operations.
- Fixed the import of local functions when their input tensor names aliased one from an outside scope.
- Added support for Pow ops with integer-typed exponent values.
Fixed issues
- Fixed segmentation of boolean constant nodes - 4224.
- Fixed accuracy issue when multiple optimization profiles were defined 4250.