CN113128116B - Pure integer quantization method for lightweight neural network - Google Patents
- Publication number
- CN113128116B (application CN202110421738.5A)
- Authority
- CN
- China
- Prior art keywords
- feature map
- channel
- weights
- layer
- maximum value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0495—Quantised networks; Sparse networks; Compressed networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Image Processing (AREA)
Abstract
The application provides a pure integer quantization method for a lightweight neural network, comprising the following steps: obtaining the maximum value of the pixel values of each channel of the current-layer feature map; dividing the pixel value of each pixel of each channel of the feature map by the t-th power of the corresponding channel maximum, where t ∈ [0,1]; multiplying the value of each channel of the weights by the maximum value of the corresponding feature-map channel; and convolving the processed feature map with the processed weights to obtain the next-layer feature map. The proposed algorithm is verified on SkyNet and MobileNet respectively: it achieves lossless INT8 quantization on SkyNet and the highest quantization accuracy on MobileNet v2.
Description
Technical Field
The application relates to a quantization method for a lightweight neural network.
Background
In recent years, a great deal of work has explored quantization techniques for traditional models, but applying these techniques to lightweight networks incurs a significant loss of accuracy. For example, Jacob Benoit et al., "Quantization and training of neural networks for efficient integer-arithmetic-only inference" (CVPR, pages 2704-2713, 2018), report ImageNet accuracy dropping from 73.03% to 0.1% when quantizing MobileNet v2; Raghuraman Krishnamoorthi, "Quantizing deep convolutional networks for efficient inference: A whitepaper" (CoRR, abs/1806.08342, 2018), reports a 2% accuracy loss. To recover this lost accuracy, many works employ retraining or quantization-aware training, but these techniques are time-consuming and require access to the training dataset. Nagel et al. propose the DFQ algorithm to address this problem; they argue that differences in weight distribution cause traditional quantization methods to perform poorly on models employing depthwise separable convolution. They therefore propose cross-layer weight equalization, which adjusts the balance of weights between different layers. However, this technique applies only to network models with ReLU as the activation function, whereas most current lightweight networks employ ReLU6, and directly replacing ReLU6 with ReLU again causes a significant loss of accuracy. Moreover, the method of Nagel et al. is not suitable for pure integer quantization.
Disclosure of Invention
The technical problem the application aims to solve is as follows: simply combining lightweight neural network techniques with quantization techniques results either in significant accuracy degradation or in long retraining time; furthermore, many quantization methods quantize only the weights and feature maps, while the bias and quantization coefficients remain floating-point numbers, which is very unfriendly to ASIC/FPGA implementations.
In order to solve the above technical problems, the technical scheme of the application provides a pure integer quantization method for a lightweight neural network, comprising the following steps:
Step 1: let the feature map have N channels, N ≥ 1, and obtain the maximum value of the pixel values of each channel of the current-layer feature map;
Step 2: the pixels of each channel of the feature map are processed as follows:
the pixel value of each pixel of the nth channel of the feature map is divided by the t-th power of the maximum value of the nth channel obtained in step 1, where t ∈ [0,1];
there are N groups of weights corresponding to the N channels of the next-layer feature map, each group consisting of N weights corresponding to the N channels of the current-layer feature map, and each group of weights is processed as follows:
the N weights in the nth group are each multiplied by the maximum value of the corresponding channel among the N channel maxima obtained in step 1;
Step 3: the feature map processed in step 2 is convolved with the N groups of weights processed in step 2 to obtain the next-layer feature map.
Preferably, when t = 0, no imbalance transfer is performed; when t = 1, the imbalance among the channels of the current-layer feature map is completely transferred to the weights of the following layer.
Preferably, the current layer is any layer except the last layer in the lightweight neural network.
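Steps 1 to 3 above can be sketched in NumPy (a minimal illustration, not the patent's reference implementation; the array shapes and function name are assumptions). Scaling each weight channel by the same factor max^t that was divided out of the matching feature-map channel keeps the convolution output unchanged for any t:

```python
import numpy as np

def imbalance_transfer(fmap, weights, t=1.0):
    """Transfer inter-channel imbalance of a feature map into the
    next layer's weights (illustrative sketch of steps 1-3).

    fmap:    current-layer feature map, shape (C_in, H, W)
    weights: next-layer conv weights, shape (C_out, C_in, kH, kW)
    t:       imbalance-transfer coefficient in [0, 1]
    """
    # Step 1: per-channel maximum of the feature map
    ch_max = np.abs(fmap).max(axis=(1, 2))          # shape (C_in,)
    scale = ch_max ** t
    # Step 2a: divide each feature-map channel by max^t ...
    fmap_eq = fmap / scale[:, None, None]
    # Step 2b: ... and multiply the matching input channel of every
    # weight group by the same factor, so the product is unchanged
    w_eq = weights * scale[None, :, None, None]
    return fmap_eq, w_eq
```

For a 1×1 convolution this invariance can be checked directly: the outputs before and after the transfer agree to floating-point precision, while the rescaled feature map is far better balanced across channels.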
The proposed algorithm is verified on SkyNet and MobileNet respectively: it achieves lossless INT8 quantization on SkyNet and the highest quantization accuracy on MobileNet v2.
Drawings
FIG. 1 is a schematic diagram of a 1×1 convolution with imbalance transfer.
Detailed Description
The application will be further illustrated with reference to specific examples. It is to be understood that these examples are illustrative of the present application and are not intended to limit the scope of the present application. Furthermore, it should be understood that various changes and modifications can be made by one skilled in the art after reading the teachings of the present application, and such equivalents are intended to fall within the scope of the application as defined in the appended claims.
The inventors analyzed and modeled the quantization flow of neural networks and found that the balance of a tensor can serve as a predictor of its quantization error. Guided by this index, the application proposes a tunable imbalance transfer algorithm to optimize the quantization error of the feature map, with the following specific content:
in view of the current neural network computing mode, the weights can be quantized channel by channel, and the feature images can only be quantized layer by layer, so that the quantization error of the weights is smaller, but the quantization error of the feature images is larger.
The application divides the pixel value of each pixel of each channel of the current-layer feature map by the maximum pixel value of the channel in which it is located, and then performs quantization, thereby achieving the equivalent of channel-by-channel quantization. To keep the computation result unchanged, the value of each channel of the weights convolved with the feature map is multiplied by the maximum value of the corresponding feature-map channel. In this way, the imbalance among the channels of the current-layer feature map is transferred entirely to the weights of the following layer.
In practice, however, transferring all of the inter-channel imbalance of the feature map is not the optimal solution. To adjust the degree of imbalance transfer, the application introduces a hyperparameter, the imbalance transfer coefficient t: the pixel value of each pixel of each channel of the feature map is divided by the t-th power of the maximum pixel value of its channel, with t ranging from 0 to 1. When t = 0, no imbalance is transferred; when t = 1, all of the imbalance is transferred as described above. By tuning t, the optimal quantization accuracy can be obtained. This operation applies to any network model and any convolution kernel size.
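A hypothetical selection loop for t might look as follows, using the relative per-tensor quantization error of the rescaled feature map as a cheap proxy score (the names and the proxy metric are assumptions; a full pipeline would also score the rescaled weights and end-to-end model accuracy):

```python
import numpy as np

def rel_quant_error(x, bits=8):
    """Relative error of symmetric per-tensor quantize-dequantize."""
    scale = np.abs(x).max() / (2 ** (bits - 1) - 1)
    q = np.round(x / scale) * scale
    return np.abs(q - x).mean() / np.abs(x).mean()

def sweep_t(fmap, ts):
    """Feature-map quantization error for each candidate coefficient t."""
    ch_max = np.abs(fmap).max(axis=(1, 2))        # per-channel maxima
    errors = []
    for t in ts:
        # partial imbalance transfer: divide each channel by max^t
        fmap_eq = fmap / (ch_max ** t)[:, None, None]
        errors.append(rel_quant_error(fmap_eq))
    return errors
```

On a strongly imbalanced feature map the error shrinks as t grows toward 1, since every channel is brought to a comparable range before a single per-tensor scale is applied.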
Fig. 1 shows a schematic diagram of imbalance transfer for a 1×1 convolution; tensors circled by dotted lines share the same quantization coefficient. The pixel value of each pixel of each channel of A1 is divided by the maximum pixel value of its channel, and the corresponding channel of W2 is multiplied by the same maximum, so the computation result is unchanged while the balance of A1 is greatly increased. At the same time, the balance of the weights is not significantly reduced. The quantization error of the feature map is therefore reduced, improving the accuracy of the quantized model.
Claims (2)
1. A pure integer quantization method for a lightweight neural network, comprising the following steps:
Step 1: let the feature map have N channels, N ≥ 1, and obtain the maximum value of the pixel values of each channel of the current-layer feature map;
Step 2: the pixels of each channel of the feature map are processed as follows:
the pixel value of each pixel of the nth channel of the feature map is divided by the t-th power of the maximum value of the nth channel obtained in step 1, where t ∈ [0,1];
there are N groups of weights corresponding to the N channels of the next-layer feature map, each group consisting of N weights corresponding to the N channels of the current-layer feature map, and each group of weights is processed as follows:
the N weights in the nth group are each multiplied by the maximum value of the corresponding channel among the N channel maxima obtained in step 1;
Step 3: the feature map processed in step 2 is convolved with the N groups of weights processed in step 2 to obtain the next-layer feature map;
when t = 0, no imbalance transfer is performed; when t = 1, the imbalance among the channels of the current-layer feature map is completely transferred to the weights of the following layer.
2. The pure integer quantization method as defined in claim 1, wherein the current layer is any layer of the lightweight neural network other than the last layer.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110421738.5A CN113128116B (en) | 2021-04-20 | 2021-04-20 | Pure integer quantization method for lightweight neural network |
PCT/CN2021/119513 WO2022222369A1 (en) | 2021-04-20 | 2021-09-22 | Integer-only quantification method for lightweight neural network |
US17/799,933 US11934954B2 (en) | 2021-04-20 | 2021-09-22 | Pure integer quantization method for lightweight neural network (LNN) |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110421738.5A CN113128116B (en) | 2021-04-20 | 2021-04-20 | Pure integer quantization method for lightweight neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113128116A CN113128116A (en) | 2021-07-16 |
CN113128116B true CN113128116B (en) | 2023-09-26 |
Family
ID=76779184
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110421738.5A Active CN113128116B (en) | 2021-04-20 | 2021-04-20 | Pure integer quantization method for lightweight neural network |
Country Status (3)
Country | Link |
---|---|
US (1) | US11934954B2 (en) |
CN (1) | CN113128116B (en) |
WO (1) | WO2022222369A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113128116B (en) * | 2021-04-20 | 2023-09-26 | 上海科技大学 | Pure integer quantization method for lightweight neural network |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105528589A (en) * | 2015-12-31 | 2016-04-27 | 上海科技大学 | Single image crowd counting algorithm based on multi-column convolutional neural network |
WO2018073975A1 (en) * | 2016-10-21 | 2018-04-26 | Nec Corporation | Improved sparse convolution neural network |
CN110930320A (en) * | 2019-11-06 | 2020-03-27 | 南京邮电大学 | Image defogging method based on lightweight convolutional neural network |
CN111402143A (en) * | 2020-06-03 | 2020-07-10 | 腾讯科技(深圳)有限公司 | Image processing method, device, equipment and computer readable storage medium |
CN111937010A (en) * | 2018-03-23 | 2020-11-13 | 亚马逊技术股份有限公司 | Accelerated quantized multiplication and addition operations |
CN112560355A (en) * | 2021-02-22 | 2021-03-26 | 常州微亿智造科技有限公司 | Method and device for predicting Mach number of wind tunnel based on convolutional neural network |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000074850A2 (en) * | 1999-06-03 | 2000-12-14 | University Of Washington | Microfluidic devices for transverse electrophoresis and isoelectric focusing |
WO2005048185A1 (en) * | 2003-11-17 | 2005-05-26 | Auckland University Of Technology | Transductive neuro fuzzy inference method for personalised modelling |
KR102601604B1 (en) * | 2017-08-04 | 2023-11-13 | 삼성전자주식회사 | Method and apparatus for quantizing parameter of neural network |
JP6977864B2 (en) * | 2018-03-02 | 2021-12-08 | 日本電気株式会社 | Inference device, convolution operation execution method and program |
US11755880B2 (en) * | 2018-03-09 | 2023-09-12 | Canon Kabushiki Kaisha | Method and apparatus for optimizing and applying multilayer neural network model, and storage medium |
US10527699B1 (en) | 2018-08-01 | 2020-01-07 | The Board Of Trustees Of The Leland Stanford Junior University | Unsupervised deep learning for multi-channel MRI model estimation |
US11704555B2 (en) * | 2019-06-24 | 2023-07-18 | Baidu Usa Llc | Batch normalization layer fusion and quantization method for model inference in AI neural network engine |
CN111311538B (en) | 2019-12-28 | 2023-06-06 | 北京工业大学 | Multi-scale lightweight road pavement detection method based on convolutional neural network |
US11477464B2 (en) * | 2020-09-16 | 2022-10-18 | Qualcomm Incorporated | End-to-end neural network based video coding |
CN112418397B (en) | 2020-11-19 | 2021-10-26 | 重庆邮电大学 | Image classification method based on lightweight convolutional neural network |
CN112488070A (en) | 2020-12-21 | 2021-03-12 | 上海交通大学 | Neural network compression method for remote sensing image target detection |
CN113128116B (en) | 2021-04-20 | 2023-09-26 | 上海科技大学 | Pure integer quantization method for lightweight neural network |
2021
- 2021-04-20 CN CN202110421738.5A patent/CN113128116B/en active Active
- 2021-09-22 WO PCT/CN2021/119513 patent/WO2022222369A1/en active Application Filing
- 2021-09-22 US US17/799,933 patent/US11934954B2/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105528589A (en) * | 2015-12-31 | 2016-04-27 | 上海科技大学 | Single image crowd counting algorithm based on multi-column convolutional neural network |
WO2018073975A1 (en) * | 2016-10-21 | 2018-04-26 | Nec Corporation | Improved sparse convolution neural network |
CN111937010A (en) * | 2018-03-23 | 2020-11-13 | 亚马逊技术股份有限公司 | Accelerated quantized multiplication and addition operations |
CN110930320A (en) * | 2019-11-06 | 2020-03-27 | 南京邮电大学 | Image defogging method based on lightweight convolutional neural network |
CN111402143A (en) * | 2020-06-03 | 2020-07-10 | 腾讯科技(深圳)有限公司 | Image processing method, device, equipment and computer readable storage medium |
CN112560355A (en) * | 2021-02-22 | 2021-03-26 | 常州微亿智造科技有限公司 | Method and device for predicting Mach number of wind tunnel based on convolutional neural network |
Non-Patent Citations (1)
Title |
---|
Design and Implementation of a Real-Time Dehazing Hardware Accelerator Based on Image Fusion; Liu Guanyu; 《信息科技》 (Information Technology); full text * |
Also Published As
Publication number | Publication date |
---|---|
US11934954B2 (en) | 2024-03-19 |
WO2022222369A1 (en) | 2022-10-27 |
US20230196095A1 (en) | 2023-06-22 |
CN113128116A (en) | 2021-07-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111260022B (en) | Full INT8 fixed-point quantization method for convolutional neural network | |
CN109087273B (en) | Image restoration method, storage medium and system based on enhanced neural network | |
CN113011571B (en) | INT8 offline quantization and integer inference method based on Transformer model | |
CN113052868B (en) | Method and device for training matting model and image matting | |
CN111612147A (en) | Quantization method of deep convolutional network | |
JP2020119518A (en) | Method and device for transforming cnn layers to optimize cnn parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization | |
CN111696149A (en) | Quantization method for stereo matching algorithm based on CNN | |
CN113128116B (en) | Pure integer quantization method for lightweight neural network | |
CN114139683A (en) | Neural network accelerator model quantization method | |
US20200372340A1 (en) | Neural network parameter optimization method and neural network computing method and apparatus suitable for hardware implementation | |
CN111985495A (en) | Model deployment method, device, system and storage medium | |
CN112465844A (en) | Multi-class loss function for image semantic segmentation and design method thereof | |
US11531884B2 (en) | Separate quantization method of forming combination of 4-bit and 8-bit data of neural network | |
CN112465140A (en) | Convolutional neural network model compression method based on packet channel fusion | |
CN108322749A (en) | The coefficient optimization method of RDOQ, the accelerating method and device of RDOQ | |
WO2018076331A1 (en) | Neural network training method and apparatus | |
CN114708496A (en) | Remote sensing change detection method based on improved spatial pooling pyramid | |
CN112183726A (en) | Neural network full-quantization method and system | |
CN110837885B (en) | Sigmoid function fitting method based on probability distribution | |
CN112446487A (en) | Method, device, system and storage medium for training and applying neural network model | |
CN116524173A (en) | Deep learning network model optimization method based on parameter quantization | |
CN116227563A (en) | Convolutional neural network compression and acceleration method based on data quantization | |
CN115062690A (en) | Bearing fault diagnosis method based on domain adaptive network | |
WO2022247368A1 (en) | Methods, systems, and mediafor low-bit neural networks using bit shift operations | |
CN110751259A (en) | Network layer operation method and device in deep neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||