Quantisierung
Quantisieren von Netzparametern zu Datentypen mit reduzierter Präzision; Vorbereiten von Deep-Learning-Netzen für Festkomma-Codegenerierung
Quantisieren Sie die Gewichte, Verzerrungen und Aktivierungen von Schichten auf skalierte Ganzzahl-Datentypen mit reduzierter Genauigkeit. Aus diesem quantisierten Netz können Sie daraufhin C/C++, CUDA®- oder HDL-Code zur Bereitstellung auf GPU, FPGA oder CPU generieren.
Einen detaillierten Überblick über die in der Deep Learning Toolbox™ Model Compression Library verfügbaren Komprimierungstechniken finden Sie unter Reduce Memory Footprint of Deep Neural Networks.
Funktionen
dlquantizer | Quantize a deep neural network to 8-bit scaled integer data types |
dlquantizationOptions | Options for quantizing a trained deep neural network |
prepareNetwork | Prepare deep neural network for quantization (Seit R2024b) |
calibrate | Simulate and collect ranges of a deep neural network |
quantize | Quantize deep neural network (Seit R2022a) |
validate | Quantize and validate a deep neural network |
quantizationDetails | Display quantization details for a neural network (Seit R2022a) |
estimateNetworkMetrics | Estimate network metrics for specific layers of a neural network (Seit R2022a) |
equalizeLayers | Equalize layer parameters of deep neural network (Seit R2022b) |
exportNetworkToSimulink | Generate Simulink model that contains deep learning layer blocks and subsystems that correspond to deep learning layer objects (Seit R2024b) |
Apps
| Deep Network Quantizer | Quantize deep neural network to 8-bit scaled integer data types |
Themen
Quantisierung verstehen
- Quantization of Deep Neural Networks
Learn about deep learning quantization tools and workflows. - Data Types and Scaling for Quantization of Deep Neural Networks
Understand effects of quantization and how to visualize dynamic ranges of network convolution layers.
Workflows vor der Bereitstellung
- Prepare Data for Quantizing Networks
Learn about supported data formats for quantization workflows. - Quantize Multiple-Input Network Using Image and Feature Data
Quantize a network with multiple inputs. - Export Quantized Networks to Simulink and Generate Code
Export a quantized neural network to Simulink and generate code from the exported model. - Quantization-Aware Training with Pseudo-Quantization Noise
Perform quantization-aware training with pseudo-quantization noise on the MobileNet-V2 network. (Seit R2026a)
Bereitstellung
- Quantize Semantic Segmentation Network and Generate CUDA Code
Quantize a convolutional neural network trained for semantic segmentation and generate CUDA code. - Classify Images on FPGA by Using Quantized GoogLeNet Network (Deep Learning HDL Toolbox)
This example shows how to use the Deep Learning HDL Toolbox™ to deploy a quantized GoogleNet network to classify an image. - Compress Image Classification Network for Deployment to Resource-Constrained Embedded Devices
Reduce the memory footprint and computation requirements of an image classification network for deployment to resource-constrained embedded devices such as the Raspberry Pi®.
Erwägungen
- Quantization Workflow System Requirements
See what products are required for the quantization of deep neural networks. - Supported Layers for Quantization
Learn which deep neural network layers are supported for quantization.






