TensorRT
Development & Tools
NVIDIA runtime for high-performance inference
What is TensorRT?
Optimizes and compiles models for NVIDIA GPUs, improving latency and throughput with mixed precision and layer fusion.
Real-World Examples
- •Deploying CNNs for real-time inference on GPUs
Related Terms
Learn more about concepts related to TensorRT