Deep Learning INT8 Quantization

Calibrate, validate, and deploy quantized pretrained series deep learning networks

Increase throughput, reduce resource utilization, and deploy larger networks onto smaller target boards by quantizing your deep learning networks.

After calibrating your pretrained series network by collecting instrumentation data, quantize your series network and validate the accuracy of your quantized network. Once the quantized network has been validated, generate code for and deploy the quantized network.

Functions

Quantization and Validation

`dlquantizationOptions`	Options for quantizing a trained deep neural network
`dlquantizer`	Quantize a deep neural network to 8-bit scaled integer data types
`calibrate`	Simulate and collect ranges of a deep neural network
`validate`	Quantize and validate a deep neural network
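
The functions above are typically used in sequence. The following is a minimal sketch rather than a definitive recipe. It assumes that the quantization prerequisites are installed and that a supported board is connected. The MAT-file `pretrainedSeriesNet.mat`, its variable `net`, the folders `calibrationImages` and `validationImages`, the `zcu102_int8` bitstream, and the Xilinx Ethernet target are all placeholders for your own setup.

```matlab
% Load a pretrained series network. The MAT-file and the variable name
% `net` are hypothetical placeholders; substitute your own trained network.
data = load('pretrainedSeriesNet.mat');
net = data.net;

% Build calibration and validation datastores resized to the network
% input size. The folder names are placeholders.
inputSize = net.Layers(1).InputSize(1:2);
calImds = imageDatastore('calibrationImages', ...
    'IncludeSubfolders', true, 'LabelSource', 'foldernames');
valImds = imageDatastore('validationImages', ...
    'IncludeSubfolders', true, 'LabelSource', 'foldernames');
calData = augmentedImageDatastore(inputSize, calImds);
valData = augmentedImageDatastore(inputSize, valImds);

% Create a quantizer object for the FPGA execution environment.
quantObj = dlquantizer(net, 'ExecutionEnvironment', 'FPGA');

% Calibrate: exercise the network with the calibration data and
% collect the dynamic ranges.
calResults = calibrate(quantObj, calData);

% Validation on an FPGA needs a bitstream and a target board
% (both assumed here).
hTarget = dlhdl.Target('Xilinx', 'Interface', 'Ethernet');
quantOpts = dlquantizationOptions('Bitstream', 'zcu102_int8', ...
    'Target', hTarget);

% Validate: quantize the network and compare it with the original.
valResults = validate(quantObj, valData, quantOpts);
disp(valResults.MetricResults.Result)
```

Calibration and validation use separate data sets in this sketch so that the reported accuracy is not measured on the images used to collect the ranges.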

Code Generation and Deployment

`dlhdl.Workflow`	Configure deployment workflow for deep learning neural network
`dlhdl.Target`	Configure interface to target board for workflow deployment
`compile`	Compile workflow object
`deploy`	Deploy the specified neural network to the target FPGA board
`estimate`	Estimate performance of specified deep learning network and bitstream for target device board
`predict`	Run inference on deployed network and profile speed of neural network deployed on specified target device
`release`	Release the connection to the target device
`validateConnection`	Validate SSH connection and deployed bitstream
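
A deployment session built from the functions above might look like the following sketch. It assumes that `quantObj` is a `dlquantizer` object that has already been calibrated for the FPGA execution environment, and that `inputImg` is an image already resized to the network input size. Both names, the `zcu102_int8` bitstream, and the Xilinx Ethernet target are placeholders, not requirements.

```matlab
% Assumes `quantObj` (calibrated dlquantizer object) and `inputImg`
% (image matching the network input size) already exist in the workspace.
% Both names are placeholders.

% Configure the interface to the target board (vendor and interface assumed).
hTarget = dlhdl.Target('Xilinx', 'Interface', 'Ethernet');

% Create the workflow object from the quantized network, an INT8
% bitstream, and the target.
hW = dlhdl.Workflow('Network', quantObj, ...
    'Bitstream', 'zcu102_int8', 'Target', hTarget);

% Compile the network for the chosen bitstream, then program the board.
hW.compile;
hW.deploy;

% Run inference on the deployed network and profile its speed.
[prediction, speed] = hW.predict(single(inputImg), 'Profile', 'on');

% Pick the highest-scoring class index from the returned scores.
[~, classIdx] = max(prediction);

% Release the connection to the target device when finished.
hTarget.release;
```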

Topics

Get Started

Supported Networks, Layers and Boards

Pretrained deep learning networks and network layers for which code can be generated by Deep Learning HDL Toolbox™.

Quantization of Deep Neural Networks

Understand effects of quantization and how to visualize dynamic ranges of network convolution layers.

Quantization Workflow

Quantization Workflow Prerequisites

Products required for the quantization of deep learning networks.

Calibration

Simulate your pretrained series network and collect the dynamic range of weights and biases.

Validation

Quantize and validate your pretrained series deep learning network.

Code Generation and Deployment

Generate code and deploy your quantized pretrained series deep learning network.

Tutorials

Deploy Quantized Neural Network

Deploy a pretrained quantized series network.

Quantize Neural Network for FPGA Execution Environment

Compare the accuracy of a pretrained series network with that of its quantized version.
