CN113780550A - Convolutional neural network pruning method and device for quantizing feature map similarity - Google Patents
Convolutional neural network pruning method and device for quantizing feature map similarity
- Publication number
- CN113780550A (Application CN202110977310.9A)
- Authority
- CN
- China
- Prior art keywords
- pruning
- model
- similarity
- neural network
- feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Abstract
The invention discloses a convolutional neural network pruning method and device that quantize feature map similarity, in the technical field of computer science. By quantifying the "information similarity" between feature maps, the convolution kernels corresponding to feature maps carrying similar information are pruned away; the model is then fine-tuned, and the process is iterated layer by layer to obtain a new model, reducing both the number of parameters and the storage size of the model. The accuracy of the final pruned model is essentially unchanged, while the parameter count is greatly reduced after pruning, so the model occupies less memory, requires less computing power, and computes faster; it can therefore be deployed on edge devices with limited computing resources in a more optimized way, and deep neural networks can be better applied in scenarios with limited computing resources, real-time online processing, and the like.
Description
Technical Field
The invention relates to the technical field of computer science, and in particular to a convolutional neural network pruning method and device for quantizing feature map similarity.
Background
As the performance of deep neural network models has grown, so have the depth and width of the networks, bringing the corresponding drawbacks of high storage and high power consumption, which severely restrict the application of deep neural networks in scenarios with limited computing resources or real-time online processing. A deep neural network model with parameters in the millions stores a large amount of redundant information. Compressing the original deep network model, reducing its parameters, and making it lightweight — so that it can be applied to edge devices with limited computing resources without loss of accuracy — has therefore become a focus of current attention.
For making neural networks lightweight, previously proposed methods mainly include parameter pruning, parameter sharing, low-rank decomposition, and knowledge distillation.
The non-patent literature (Mingbao Lin, et al., "HRank: Filter Pruning using High-Rank Feature Map," Proc. CVPR, 2020) uses the idea of statistical averaging to prune the corresponding convolution kernels by calculating the rank of each feature map, but it does not consider the similarity between feature maps.
Reducing the parameters of a deep neural network to make it lightweight, so that the network can be accurately applied to edge devices with limited computing resources, is currently the most important problem to be solved.
Disclosure of Invention
The invention provides a convolutional neural network pruning method and device that quantize feature map similarity, addressing the prior-art problems that a network cannot be accurately deployed on edge devices with limited computing resources and that convolutional neural networks need to be made lightweight.
To solve these technical problems, the invention provides the following technical scheme.
In one aspect, the invention provides a convolutional neural network pruning method for quantizing feature map similarity, comprising the following steps:
S1: According to the pruning compression ratio, perform a pre-pruning calculation on the neural network model to be pruned to obtain the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network.
S2: Input the pictures of the picture data set into the model pruned up to layer L_{i-1}, and prune layer L_i: quantize the similarity between feature maps (Feature_map) by the SSIM method, and determine Delete_i according to the quantized feature-map similarities.
S3: Prune with the resulting Delete_i to obtain the pruned model Model_i, and fine-tune Model_i; increment i and repeat steps S2-S3 until the pruning operation is completed for all convolutional layers L_1…L_n, obtaining the final pruned convolutional neural network Model_n.
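The three steps above form a layer-by-layer loop. A minimal sketch of the control flow, with `prune_layer` and `fine_tune` as hypothetical stand-ins for the per-layer operations of S2-S3 (they are not names used by the patent):

```python
def prune_network(model, layer_filter_counts, compression_rate, prune_layer, fine_tune):
    """Layer-by-layer pruning loop (S1-S3 sketch).

    layer_filter_counts: original number of convolution kernels N_i per layer.
    prune_layer(model, i, k) -> model with k filters removed from layer i (S2).
    fine_tune(model)         -> retrained model (S3).
    """
    # S1: pre-pruning calculation of N_i2 for every layer
    n_prune = [int(compression_rate * n) for n in layer_filter_counts]
    # S2-S3: prune and fine-tune each convolutional layer in turn
    for i, k in enumerate(n_prune):
        model = prune_layer(model, i, k)   # remove the k most redundant filters
        model = fine_tune(model)           # recover accuracy before the next layer
    return model
```

The fine-tuning after every layer, rather than once at the end, is what lets each subsequent similarity measurement run on an already-recovered model.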
Optionally, in step S1, the pre-pruning calculation performed on the neural network model to be pruned according to the pruning compression ratio, to obtain the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network (the well-trained model to be pruned being Model_0), comprises:
S11: Establish a picture data set Train for model pruning, where Train contains M pictures; N_0 denotes the number of channels of a picture, and X_0, Y_0 denote its height and width. Determine the total number of convolutional layers n.
S12: Determine the total number of convolution kernels N_i in each layer, and perform the pruning calculation on N_i according to the compression ratio to obtain the number N_i2 of pruned convolution kernels per layer.
Optionally, in step S12, performing the pruning calculation on the original number of convolution kernels N_i according to the compression ratio, to obtain the number N_i2 of pruned convolution kernels per layer, comprises:
Pre-prune the layers of the convolutional neural network in order 1 to n; each layer originally has N_i filters, of which N_i2 are cut after pruning. The set of N_i filters is {Filter_(i,1), …, Filter_(i,N_i)}, where K_i denotes the height and width of the convolution kernels.
Optionally, in step S2, inputting the pictures of the picture data set into the model pruned up to layer L_{i-1}, pruning layer L_i, quantizing the similarity between feature maps (Feature_map) by the SSIM method, and determining Delete_i according to the quantized feature-map similarities comprises:
S21: When convolutional layer L_i is to be pruned, the pruned model Model_{i-1} is available. Input the k-th picture of the picture data set into Model_{i-1}; the input and output of convolutional layer L_i are I_(i,k) and O_(i,k) respectively.
S23: Starting from i = 1, judge whether i ≤ n. If not, the iteration is finished and the lightweight model is output. If yes, define the set Delete_i as empty, set k = 1, and input the M pictures of Train into the pruned model.
S24: Take out the M Feature_map sets generated at the i-th layer for the M pictures of Train. Judge whether k ≤ M; if yes, calculate the similarity of any two Feature_maps in the k-th Feature_map set and the rank of each Feature_map in the k-th set, then increment k.
S25: Judge whether k > M; if not, repeat S24-S25. If yes, compute the statistical average similarity and the average rank, and arrange them from high to low to form the set SSIM_i and the set Rank_i.
S27: Judge whether the number of elements in Delete_i is less than N_i2. If yes, apply the preset screening conditions: when, in a highly similar pair, the feature map of Filter_(i,n) has the lower average rank, put Filter_(i,n) into Delete_i; when the feature map of Filter_(i,m) has the lower average rank, put Filter_(i,m) into Delete_i.
If not, the set Delete_i is obtained.
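The selection in S24-S27 can be sketched as follows — a simplified reading, assuming the rule "for the most similar pairs, delete the member whose feature map has the lower average rank" (the exact threshold conditions are given as formulas in the patent and are not reproduced here):

```python
import numpy as np

def select_filters_to_delete(avg_ssim, avg_rank, n_prune):
    """Pick filters to prune for one layer.

    avg_ssim: (N, N) symmetric matrix of average pairwise feature-map SSIM.
    avg_rank: (N,) vector of average feature-map ranks.
    """
    n = len(avg_rank)
    # enumerate all filter pairs, most similar first (the sorted SSIM_i set)
    pairs = [(a, b) for a in range(n) for b in range(a + 1, n)]
    pairs.sort(key=lambda p: avg_ssim[p], reverse=True)
    delete = []
    for a, b in pairs:
        if len(delete) >= n_prune:          # Delete_i has reached N_i2 elements
            break
        victim = a if avg_rank[a] <= avg_rank[b] else b  # lower rank = more redundant
        if victim not in delete:
            delete.append(victim)
    return delete
```

Because similar pairs are visited from highest SSIM downward, the most redundant filters are collected first, and the loop stops as soon as N_i2 filters have been gathered.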
Optionally, in step S3, pruning with the obtained Delete_i to obtain the pruned model Model_i and fine-tuning Model_i comprises:
S31: Prune away the filters corresponding to the Delete_i set, obtaining the model Model_i.
S32: Retrain and fine-tune the obtained Model_i, i.e. Model_i = FineTune(Model_{i-1} - Delete_i).
In one aspect, the invention provides a convolutional neural network pruning device for quantizing feature map similarity, applied to the method described above and comprising:
A pruning pre-judging module, for performing the pre-pruning calculation on the neural network model to be pruned according to the pruning compression ratio, obtaining the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network.
A similarity calculation module, for inputting the pictures of the picture data set into the model pruned up to layer L_{i-1}, pruning layer L_i, quantizing the similarity between feature maps (Feature_map) by the SSIM method, and determining Delete_i according to the quantized feature-map similarities.
A pruning result calculation module, for pruning with the obtained Delete_i to obtain the pruned model Model_i, fine-tuning Model_i, incrementing i, and repeating the calculation until the pruning operation is completed for all convolutional layers L_1…L_n, obtaining the final pruned convolutional neural network Model_n.
Optionally, the similarity calculation module deletes redundant information in the convolutional neural network through the similarity between Feature_maps.
Optionally, the similarity between Feature_maps is determined by inputting the picture data set into the convolutional network to obtain data and calculating the statistical average of that data.
Optionally, an average rank calculation module clips the corresponding filter for feature maps with higher similarity and lower rank.
Optionally, the preset condition is: Filter_(i,m) and Filter_(i,n) both belong to Delete_i.
The technical scheme of the embodiments of the invention has at least the following beneficial effects:
In this scheme, the invention provides a convolutional neural network pruning method and device that quantize feature map similarity. By quantifying the information similarity between feature maps, the convolution kernels corresponding to feature maps with similar information are pruned away; fine tuning is then performed and the process iterated layer by layer. The accuracy of the resulting new model differs from that of the original model by less than 1%, while the parameter count and computation can be greatly reduced as required, shrinking the storage size of the model, making the deep neural network lightweight, and allowing deployment on edge devices with limited computing resources.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed for the description of the embodiments are briefly introduced below. Obviously, the drawings in the following description show only some embodiments of the present invention; for those skilled in the art, other drawings can be obtained from these drawings without creative effort.
FIG. 1 is a general flowchart of a convolutional neural network pruning method to quantify feature map similarity according to an embodiment of the present invention;
FIG. 2 is a detailed flowchart of a convolutional neural network pruning method for quantifying similarity of feature maps according to an embodiment of the present invention;
FIG. 3 is a diagram illustrating a visualization result of convolutional layer output provided by an embodiment of the present invention;
fig. 4 is a schematic diagram of a process for deleting and optimizing redundant convolution kernels in a neural network convolution layer according to an embodiment of the present invention.
Detailed Description
In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments.
As shown in fig. 1, an embodiment of the present invention provides a convolutional neural network pruning method for quantizing feature map similarity, including:
S1: According to the pruning compression ratio, perform a pre-pruning calculation on the neural network model to be pruned to obtain the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network.
S2: Input the pictures of the picture data set into the model pruned up to layer L_{i-1}, and prune layer L_i: quantize the similarity between feature maps (Feature_map) by the SSIM method, and determine Delete_i according to the quantized feature-map similarities.
S3: Prune with the resulting Delete_i to obtain the pruned model Model_i, and fine-tune Model_i; increment i and repeat steps S2-S3 until the pruning operation is completed for all convolutional layers L_1…L_n, obtaining the final pruned convolutional neural network Model_n.
Take the VGG16 (Visual Geometry Group 16, a network with 16 weight layers) neural network commonly used in image classification as an example: VGG16 contains 13 convolutional layers and 3 fully connected layers. The local features of the image to be classified are extracted by the parameters of the 13 convolutional layers, and the image is classified by the parameters of the 3 fully connected layers.
When extracting features — for layer Conv1-1 of VGG16, for example — the input dimension is 224×224×3; with 64 filters of dimension 3×3, the set of output feature maps has dimension 224×224×64.
All the extracted local features are then reassembled through the weight matrix of the fully connected layers of the VGG network; each position in the resulting output represents the predicted probability of the corresponding class, and classification is completed by judging which class has the highest probability. If the last convolution yields a feature map of 7×7×512, the fully connected layers flatten it into a one-dimensional vector and map it to a 1×4096 vector — the feature matrix T that provides the input to the classifier — and, based on the probabilities derived from this matrix, image classification is completed. The invention quantizes the information similarity between feature maps, prunes away the convolution kernels corresponding to feature maps with similar information, then fine-tunes and iterates layer by layer to obtain a new model, reducing the storage size of model parameters and of the model itself, so that the network can be accurately applied to edge devices with limited computing resources.
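As a quick sanity check on the dimensions above (a sketch; the layer shapes used are the standard VGG16 ones):

```python
import numpy as np

# Final VGG16 convolutional output: a 7 x 7 spatial grid with 512 channels.
feature_map = np.zeros((7, 7, 512))

# The fully connected stage first flattens this tensor into one long vector,
# which the first FC layer then maps down to 4096 values.
flat = feature_map.reshape(-1)
print(flat.shape[0])  # 7 * 7 * 512 = 25088
```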
As shown in fig. 2, in step S1 the pre-pruning calculation is performed on the neural network model to be pruned according to the pruning compression ratio, obtaining the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network; the well-trained model to be pruned is Model_0. The method comprises:
S11: Establish a picture data set Train for model pruning, where Train contains M pictures; N_0 denotes the number of channels of a picture, and X_0, Y_0 denote its height and width. Determine the total number of convolutional layers n.
S12: Determine the total number of convolution kernels N_i in each layer, and perform the pruning calculation on N_i according to the compression ratio to obtain the number N_i2 of pruned convolution kernels per layer.
In step S12, performing the pruning calculation on the original number of convolution kernels N_i according to the compression ratio, to obtain the number N_i2 of pruned convolution kernels per layer, comprises:
Pre-prune the layers of the convolutional neural network in order 1 to n; each layer originally has N_i filters, of which N_i2 are cut after pruning. The set of N_i filters is {Filter_(i,1), …, Filter_(i,N_i)}, where K_i denotes the height and width of the convolution kernels.
In this embodiment, the variables are first declared. Take the VGG16 neural network Model_0 as an example, pruned with a compression rate of 0.3, using the 50,000 pictures of the cifar10 data set as the training set, each picture being 32×32 with 3 channels: Train = {Image_1, Image_2, …, Image_50000} ∈ R^{50000×3×32×32}.
The number of convolution kernels pruned per layer of VGG16 at a compression ratio of 0.3 is shown in Table 1:
TABLE 1

| Layer name | Original convolution kernel number | Number of pruned convolution kernels (N_i2) |
| --- | --- | --- |
| Block1_conv1 | 64 | 0 |
| Block1_conv2 | 64 | 19 |
| Block2_conv1 | 128 | 38 |
| Block2_conv2 | 128 | 38 |
| Block3_conv1 | 256 | 76 |
| Block3_conv2 | 256 | 76 |
| Block3_conv3 | 256 | 76 |
| Block4_conv1 | 512 | 153 |
| Block4_conv2 | 512 | 153 |
| Block4_conv3 | 512 | 153 |
| Block5_conv1 | 512 | 153 |
| Block5_conv2 | 512 | 153 |
| Block5_conv3 | 512 | 153 |
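The per-layer counts in Table 1 are consistent with simply truncating 0.3·N_i to an integer, with the first layer left unpruned — a sketch under that assumed rounding rule (the patent does not state the rule explicitly):

```python
vgg16_filters = {
    "Block1_conv1": 64, "Block1_conv2": 64,
    "Block2_conv1": 128, "Block2_conv2": 128,
    "Block3_conv1": 256, "Block3_conv2": 256, "Block3_conv3": 256,
    "Block4_conv1": 512, "Block4_conv2": 512, "Block4_conv3": 512,
    "Block5_conv1": 512, "Block5_conv2": 512, "Block5_conv3": 512,
}

def pruned_counts(filters, rate=0.3, skip_first=True):
    """N_i2 = floor(rate * N_i) per layer; the first layer is kept intact."""
    names = list(filters)
    return {name: 0 if (skip_first and name == names[0]) else int(rate * filters[name])
            for name in names}

counts = pruned_counts(vgg16_filters)
print(counts["Block1_conv2"], counts["Block2_conv1"], counts["Block4_conv1"])  # 19 38 153
```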
In step S2, inputting the pictures of the picture data set into the model pruned up to layer L_{i-1}, pruning layer L_i, quantizing the similarity by the SSIM (structural similarity index) method to obtain the similarity between feature maps (Feature_map), and determining Delete_i according to the quantized feature-map similarities comprises:
S21: When convolutional layer L_i is to be pruned, the pruned model Model_{i-1} is available. Input the k-th picture of the picture data set into Model_{i-1}; the input and output of convolutional layer L_i are I_(i,k) and O_(i,k) respectively.
S23: Starting from i = 1, judge whether i ≤ n. If not, the iteration is finished and the lightweight model is output. If yes, define the set Delete_i as empty, set k = 1, and input the M pictures of Train into the pruned model.
S24: Take out the M Feature_map sets generated at the i-th layer for the M pictures of Train. Judge whether k ≤ M; if yes, calculate the similarity of any two Feature_maps in the k-th Feature_map set and the rank of each Feature_map in the k-th set, then increment k.
S25: Judge whether k > M; if not, repeat S24-S25. If yes, compute the statistical average similarity and the average rank, and arrange them from high to low to form the set SSIM_i and the set Rank_i.
S27: Judge whether the number of elements in Delete_i is less than N_i2. If yes, apply the preset screening conditions: when, in a highly similar pair, the feature map of Filter_(i,m) has the lower average rank, put Filter_(i,m) into Delete_i; when the feature map of Filter_(i,n) has the lower average rank, put Filter_(i,n) into Delete_i.
If not, the set Delete_i is obtained.
In this example, pruning is performed in the order Block1_conv1 to Block5_conv3, according to Table 1.
For clarity, the example here starts with convolutional layer Block1_conv2, i.e. with the 2nd pruning. Set up Delete_2 for storing the cut filters, with N_22 = 19.
First calculate the statistical average of the structural similarity. The model at this point is Model_1, i.e. the model obtained after pruning convolutional layer Block1_conv1. Each time a picture Image_k is input into Model_1, the output of convolutional layer Block1_conv2 is O_(2,k); calculate the SSIM similarity of any two of its 64 Feature_maps x and y, with the calculation formula:

SSIM(x, y) = ((2·μ_x·μ_y + C_1)·(2·σ_xy + C_2)) / ((μ_x² + μ_y² + C_1)·(σ_x² + σ_y² + C_2))

where μ_x and μ_y are the means of the two feature maps, σ_x and σ_y are their standard deviations, σ_xy is the covariance between the two feature maps, and C_1, C_2 are small constants that stabilize the division. The statistical average of the structural similarity is then taken over the M input pictures for each pair of feature maps.
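A minimal whole-map version of this SSIM computation (a sketch: reference SSIM implementations usually work over sliding windows, and the constants C1, C2 used here are the conventional stabilizers, not values specified by the patent):

```python
import numpy as np

def ssim_global(x, y, c1=0.01 ** 2, c2=0.03 ** 2):
    """SSIM between two feature maps, computed from whole-map statistics."""
    x = np.asarray(x, dtype=float).ravel()
    y = np.asarray(y, dtype=float).ravel()
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()                 # sigma_x^2, sigma_y^2
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()       # sigma_xy
    num = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    den = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return num / den
```

Two identical feature maps give an SSIM of 1.0; feature maps with unrelated content score much lower, which is exactly the redundancy signal the pruning step relies on.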
Next calculate the statistical average of the rank: compute the rank of each feature map in O_(2,k); the statistical average rank of the m-th Feature_map_(2,m) in O_2 is then

Rank_(2,m) = (1/M) · Σ_{k=1}^{M} rank(Feature_map_(2,m,k)).
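The average-rank statistic can be sketched with `numpy.linalg.matrix_rank` as the rank estimator (an assumption — the patent does not name a specific routine):

```python
import numpy as np

def average_ranks(feature_maps):
    """feature_maps: array of shape (M, N, H, W) -- M pictures, N filters per layer.

    Returns the rank of each H x W feature map, averaged over the M pictures.
    """
    m_pics, n_filters = feature_maps.shape[:2]
    ranks = np.empty((m_pics, n_filters))
    for k in range(m_pics):
        for m in range(n_filters):
            ranks[k, m] = np.linalg.matrix_rank(feature_maps[k, m])
    return ranks.mean(axis=0)  # statistical average rank per filter
```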
Then prune. First arrange SSIM_2 from high to low, then combine it with Rank_2 to determine the filters to be pruned, with the conditions set as follows:
Condition 1: Filter_(2,m) and Filter_(2,n) both belong to Delete_2; conditions 2-4 compare the quantized similarity of the pair and the average ranks of the two feature maps.
If conditions 1, 2 and 3 are met simultaneously, put Filter_(2,n) into Delete_2; if conditions 1, 2 and 4 are met simultaneously, put Filter_(2,m) into Delete_2. Continue this operation until the number of filters in the Delete_2 set reaches 19, at which point Delete_2 is determined.
As shown in fig. 3, a schematic diagram of the visualization of the output of the second convolutional layer of the VGG16 convolutional neural network: when the input picture size is 32×32×3, the visualization of the 64 feature maps (after one pooling operation, the Feature_map dimension is 16×16) shows that the 31st and 51st Feature_maps are relatively similar, the 7th and 24th Feature_maps are relatively similar, the 27th and 45th Feature_maps are relatively similar, and so on.
As shown in fig. 4, the pruning process removes the convolution kernels corresponding to similar feature maps. Specifically, for the pruning of the second convolutional layer of VGG16, the layer has 64 convolution kernels before pruning and 45 convolution kernels remaining after pruning. The parameters of the pruned network model are reduced, and the amount of computation is reduced.
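The parameter saving for that single layer can be checked directly (a sketch; it assumes the standard count for a k×k convolution — C_in·k·k weights plus one bias per output kernel):

```python
def conv_params(out_channels, in_channels, k=3):
    """Weights + biases of a k x k convolutional layer."""
    return out_channels * (in_channels * k * k + 1)

before = conv_params(64, 64)   # Block1_conv2 before pruning: 36,928 parameters
after = conv_params(45, 64)    # 19 kernels removed, 45 remain: 25,965 parameters
print(before, after)
```

Note that pruning this layer also shrinks the input-channel count of the following layer, so the total saving across the whole network is larger than this per-layer figure.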
In the specific implementation, the similarity between feature maps (Feature_maps) used in the present invention may be calculated by SSIM, or by another method for quantizing similarity, such as PSNR (Peak Signal-to-Noise Ratio).
In step S3, pruning with the obtained Delete_i to obtain the pruned model Model_i and fine-tuning Model_i comprises:
S31: Prune away the filters corresponding to the Delete_i set, obtaining the model Model_i.
S32: Retrain and fine-tune the obtained Model_i, i.e. Model_i = FineTune(Model_{i-1} - Delete_i).
In this embodiment, the output model is fine-tuned. The model output after the 2nd pruning is Model_2, i.e.
Model_2 = FineTune(Model_1 - Delete_2).
After fine-tuning, increment i and repeat steps S2-S3, i.e. prune convolutional layer Block2_conv1: the third pruning.
Set up Delete_3 for storing the cut filters, with N_32 = 38.
Calculate the statistical average of the structural similarity. The model at this point is Model_2; each time a picture Image_k is input into Model_2, the output of convolutional layer Block2_conv1 is O_(3,k). Calculate the SSIM similarity of any two of the 128 Feature_maps, and take the statistical average of the structural similarity over the M pictures.
Calculate the statistical average of the rank: compute the rank of each feature map in O_(3,k); the statistical average rank of the m-th Feature_map_(3,m) in O_3 is obtained by averaging over the M pictures.
Then prune. First arrange SSIM_3 from high to low, then combine it with Rank_3 to determine the filters to be pruned, with the conditions set as follows:
Condition 1: Filter_(3,m) and Filter_(3,n) both belong to Delete_3;
If conditions 1, 2 and 3 are met simultaneously, put Filter_(3,n) into Delete_3; if conditions 1, 2 and 4 are met simultaneously, put Filter_(3,m) into Delete_3. Continue this operation until the number of filters in the Delete_3 set reaches 38, at which point Delete_3 is determined.
Fine-tune. The model output after the 3rd pruning is Model_3, i.e.
Model_3 = FineTune(Model_2 - Delete_3).
The above operations are repeated until Block5_conv3 pruning is completed, i.e. 13 prunings in all; the output model is the compressed model.
The accuracy and parameter count before and after pruning are shown in Table 2:

TABLE 2

| | Before pruning | After pruning |
| --- | --- | --- |
| Accuracy | 93.17% | 92.42% |
| Parameter count | 15,001,418 | 7,453,636 |
As can be seen from Table 2, the accuracy of the pruning model finally obtained by the proposed pruning method is essentially unchanged, while the parameter count after pruning is greatly reduced, so the model occupies less memory and requires less computing power; deployment on edge devices with limited computing resources can therefore be realized more optimally, and the deep neural network can be better applied in scenarios with limited computing resources, real-time online processing, and the like.
In this embodiment, for the task of classifying 3-channel color pictures of 32×32 pixels, the original model and the pruned model were deployed on a computer to predict picture categories. The parameters of the computer equipment and the software environment used are shown in Table 3, and the average execution time per picture is shown in Table 4; the results show that the method can be effectively applied to scenarios for accelerating convolutional neural network inference.
TABLE 3
TABLE 4
The method for accelerating the forward propagation of a neural network provided by the invention is applicable to application scenarios that accelerate convolutional neural network inference, in particular to the parts of technical processes such as image classification, target detection, and face recognition that involve inference acceleration of a convolutional neural network. The invention can reduce time delay while preserving the original effect, so that the results of image classification, target detection, face recognition, and the like are obtained more quickly.
The invention also provides a convolutional neural network pruning device for quantizing the similarity of the characteristic graphs, which is applied to the method and comprises the following steps:
a pruning pre-judging module for performing pre-pruning calculation on the image data set to be pruned according to the pruning compression ratio to obtain the number N of the pruned convolution kernels of each layer in the convolution neural networki2(ii) a The well-trained pruning Model is a Model0;
A similarity calculation module forBy inputting picture pairs in a picture dataset into the convolutional layer LiPruning is carried out; quantizing the similarity through an SSIM method to obtain the similarity between the quantized Feature maps Feature _ map; determining Delete according to similarity of quantized Feature map Feature _ mapi;
A pruning result calculation module for obtaining Delete through the obtained pruning resultiPruning is carried out to obtain a Model after pruningiAnd for ModeliFine tuning is carried out, i is increased automatically, and the pruning model is obtained by repeated calculation until the convolutional layer L is subjected to fine tuning1…LnPruning operation is completed; obtaining the final pruned convolutional neural network Modeln。
The similarity calculation module deletes redundant information in the convolutional neural network based on the similarity between Feature_maps.
The similarity between Feature_maps is determined by inputting the image data set into the convolutional network to obtain data and calculating the statistical average of the data.
The average rank calculation module prunes the Filter corresponding to a feature map with higher similarity and lower rank.
The preset conditions are as follows: Filter_(i,m) and Filter_(i,n) both belong to Delete_i.
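As a hedged sketch of this screening step — the exact preset condition formulas are omitted from this text, so the rule used below (in each highly similar pair, prune the filter whose average rank is lower) is an assumption standing in for them:

```python
import numpy as np

def select_delete_set(avg_ssim, ranks, n_prune):
    # avg_ssim[m][n]: average SSIM between feature maps m and n over the data set
    # ranks[m]: average rank of feature map m
    # Assumption: for each highly similar pair, the lower-rank (less informative)
    # filter is placed into Delete_i first, until N_i2 filters are selected.
    pairs = sorted(
        ((avg_ssim[m][n], m, n)
         for m in range(len(ranks)) for n in range(m + 1, len(ranks))),
        reverse=True,
    )
    delete = []
    for sim, m, n in pairs:
        if len(delete) >= n_prune:
            break
        victim = n if ranks[m] >= ranks[n] else m
        if victim not in delete:
            delete.append(victim)
    return sorted(delete)

sim = np.array([[1.0, 0.9, 0.2],
                [0.9, 1.0, 0.3],
                [0.2, 0.3, 1.0]])
print(select_delete_set(sim, ranks=[5, 2, 4], n_prune=1))  # -> [1]
```

Here feature maps 0 and 1 are the most similar pair, and map 1 has the lower average rank, so its Filter is the one selected for Delete_i.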
In a specific implementation, the similarity between the quantized feature maps (Feature_maps) used in the present invention may be calculated by SSIM, or by other similarity quantization methods such as Euclidean distance or PSNR (Peak Signal-to-Noise Ratio).
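For illustration, the three measures mentioned here — SSIM, Euclidean distance, and PSNR — can each be computed on a pair of feature maps. The sketch below uses a single-window (global) SSIM rather than the usual sliding-window form; that simplification, and the 8-bit data range, are assumptions made for brevity.

```python
import numpy as np

C1, C2 = (0.01 * 255) ** 2, (0.03 * 255) ** 2  # standard SSIM stabilizers for 8-bit range

def ssim_global(x, y):
    # single-window SSIM over the whole feature map (full SSIM slides a window)
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + C1) * (2 * cov + C2)) / \
           ((mx ** 2 + my ** 2 + C1) * (vx + vy + C2))

def psnr(x, y, peak=255.0):
    mse = np.mean((x - y) ** 2)
    return float("inf") if mse == 0 else 10 * np.log10(peak ** 2 / mse)

def euclidean(x, y):
    return np.linalg.norm(x - y)

a = np.random.rand(8, 8) * 255      # a toy "feature map"
b = a + np.random.randn(8, 8)       # a slightly perturbed copy
print(ssim_global(a, a))            # identical maps -> 1.0
```

Higher SSIM/PSNR or lower Euclidean distance all indicate a more redundant feature-map pair, so any of them can drive the Delete_i selection.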
By quantifying the information similarity between feature maps, the present invention prunes the convolution kernels corresponding to feature maps carrying similar information, then performs fine-tuning, and obtains a new model through layer-by-layer iteration, thereby reducing the number of parameters and the storage size of the model, so that the network can be applied to edge devices with limited computing resources.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, where the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.
Claims (10)
1. A convolutional neural network pruning method for quantizing feature map similarity is characterized by comprising the following steps:
S1: according to the pruning compression ratio, performing pre-pruning calculation on the neural network model to be pruned to obtain the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network;
S2: inputting the pictures in the picture data set into the model pruned through layer L_i-1, and pruning layer L_i; quantizing similarity by the SSIM method to obtain the similarity between quantized feature maps Feature_map; determining Delete_i according to the similarity of the quantized feature maps;
S3: pruning according to the obtained Delete_i to obtain the pruned Model_i; fine-tuning Model_i and incrementing i; repeating steps S2-S3 until the pruning of convolutional layers L_1…L_n is completed, obtaining the final pruned convolutional neural network Model_n.
2. The convolutional neural network pruning method for quantizing feature map similarity according to claim 1, wherein in step S1, performing pre-pruning calculation on the neural network model to be pruned according to the pruning compression ratio to obtain the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network, with the well-trained model to be pruned denoted Model_0, comprises:
S11: establishing a picture data set Train for model pruning, wherein Train comprises M pictures;
wherein R is the set of real numbers, N_0 represents the number of channels of a picture, and X_0, Y_0 represent the height and width of the picture, respectively; determining the total number of convolutional layers n;
S12: determining the total number N_i of convolution kernels in each layer, and performing pruning calculation on N_i according to the compression ratio to obtain the number N_i2 of pruned convolution kernels in each layer, wherein the initial value of i is 1.
3. The convolutional neural network pruning method for quantizing feature map similarity according to claim 2, wherein in step S12, performing pruning calculation on the original number N_i of convolution kernels according to the compression ratio to obtain the number N_i2 of pruned convolution kernels in each layer comprises:
pre-pruning the layers of the convolutional neural network in order from 1 to n, so that the original N_i Filters of each layer yield N_i2 pruned Filters after pruning; the N_i Filters form a set, wherein K_i represents the height and width of the convolution kernel.
4. The convolutional neural network pruning method for quantizing feature map similarity according to claim 1, wherein in step S2, inputting the pictures in the picture data set into the model pruned through layer L_i-1, pruning layer L_i, quantizing similarity by the SSIM method to obtain the similarity between quantized feature maps Feature_map, and determining Delete_i according to the similarity of the quantized feature maps comprises:
S21: when pruning the convolutional layer L_i, the pruned Model_i-1 is available; inputting the kth picture in the picture data set into the pruned Model_i-1, whereby the input and output corresponding to the convolutional layer L_i are obtained, respectively;
S23: with i initially equal to 1, judging whether i ≤ n; if not, ending the iteration and outputting the lightweight model; if yes, defining the Delete_i set as empty, defining k as 1, and inputting the M pictures in Train into the pruning model;
S24: taking out the M Feature_maps sets corresponding to the M pictures in Train generated by the ith layer; judging whether k ≤ M; if yes, calculating the similarity of any two Feature_maps in the kth Feature_maps set and the rank of each Feature_map in the kth Feature_maps set, and incrementing k, wherein m, n = 1, 2, …, N_i and m ≠ n;
S25: judging whether k > M; if not, repeating S24-S25; if yes, computing the average similarity and the average rank, sorted from high to low, to form the SSIM_i set and the Rank_i set;
S27: judging whether the number of elements in Delete_i is less than N_i2; if yes, screening by the preset conditions: when the first condition is satisfied, putting Filter_(i,n) into Delete_i; when the second condition is satisfied, putting Filter_(i,m) into Delete_i;
if not, the Delete_i set is obtained.
5. The convolutional neural network pruning method for quantizing feature map similarity according to claim 4, wherein in step S3, pruning according to the obtained Delete_i to obtain the pruned Model_i and fine-tuning Model_i comprises:
S31: pruning the Filters corresponding to the Delete_i set to obtain Model_i;
S32: for obtaining ModeliPerforming retraining trimming, i.e. ModeliFine tuning (Model)i-1-Deltei)。
6. A convolutional neural network pruning device for quantizing feature map similarity, applied to the method of any one of claims 1 to 5, comprising:
a pruning pre-judgment module, for performing pre-pruning calculation on the neural network model to be pruned according to the pruning compression ratio to obtain the number N_i2 of pruned convolution kernels in each layer of the convolutional neural network;
a similarity calculation module, for inputting the pictures in the picture data set into the model pruned through layer L_i-1 and pruning layer L_i; quantizing similarity by the SSIM method to obtain the similarity between quantized feature maps Feature_map; and determining Delete_i according to the similarity of the quantized feature maps;
a pruning result calculation module, for pruning according to the obtained Delete_i to obtain the pruned Model_i, fine-tuning Model_i and incrementing i, and repeating the calculation until the pruning of convolutional layers L_1…L_n is completed, obtaining the final pruned convolutional neural network Model_n.
7. The convolutional neural network pruning device for quantizing feature map similarity according to claim 6,
wherein the similarity calculation module deletes redundant information in the convolutional neural network based on the similarity between the Feature_maps.
8. The convolutional neural network pruning device for quantizing Feature map similarity according to claim 7, wherein between the Feature _ maps, the similarity is determined by inputting a picture data set into a convolutional network to obtain data and calculating a statistical average of the data.
9. The convolutional neural network pruning device for quantizing feature map similarity according to claim 6, wherein the average rank calculation module prunes the Filter corresponding to a feature map with higher similarity and lower rank.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110977310.9A CN113780550A (en) | 2021-08-24 | 2021-08-24 | Convolutional neural network pruning method and device for quantizing feature map similarity |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113780550A true CN113780550A (en) | 2021-12-10 |
Family
ID=78839044
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110977310.9A Withdrawn CN113780550A (en) | 2021-08-24 | 2021-08-24 | Convolutional neural network pruning method and device for quantizing feature map similarity |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113780550A (en) |
Non-Patent Citations (1)
Title |
---|
WANG Z et al.: "Model pruning based on quantified similarity of feature maps", arXiv:2105.06052, page 3 |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114154589A (en) * | 2021-12-13 | 2022-03-08 | 成都索贝数码科技股份有限公司 | Similarity-based module branch reduction method |
CN114154589B (en) * | 2021-12-13 | 2023-09-29 | 成都索贝数码科技股份有限公司 | Module branch reduction method based on similarity |
CN114677545A (en) * | 2022-03-29 | 2022-06-28 | 电子科技大学 | Lightweight image classification method based on similarity pruning and efficient module |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110516596B (en) | Octave convolution-based spatial spectrum attention hyperspectral image classification method | |
Singh et al. | Play and prune: Adaptive filter pruning for deep model compression | |
CN109614979B (en) | Data augmentation method and image classification method based on selection and generation | |
CN112308158A (en) | Multi-source field self-adaptive model and method based on partial feature alignment | |
CN110728224A (en) | Remote sensing image classification method based on attention mechanism depth Contourlet network | |
CN109272500B (en) | Fabric classification method based on adaptive convolutional neural network | |
CN111696101A (en) | Light-weight solanaceae disease identification method based on SE-Inception | |
CN111325165B (en) | Urban remote sensing image scene classification method considering spatial relationship information | |
CN110175628A (en) | A kind of compression algorithm based on automatic search with the neural networks pruning of knowledge distillation | |
CN106845529A (en) | Image feature recognition methods based on many visual field convolutional neural networks | |
CN110533022B (en) | Target detection method, system, device and storage medium | |
CN114118402A (en) | Self-adaptive pruning model compression algorithm based on grouping attention mechanism | |
CN113780550A (en) | Convolutional neural network pruning method and device for quantizing feature map similarity | |
CN110008853B (en) | Pedestrian detection network and model training method, detection method, medium and equipment | |
CN111833322B (en) | Garbage multi-target detection method based on improved YOLOv3 | |
CN112101364B (en) | Semantic segmentation method based on parameter importance increment learning | |
CN112233129A (en) | Deep learning-based parallel multi-scale attention mechanism semantic segmentation method and device | |
CN113420651B (en) | Light weight method, system and target detection method for deep convolutional neural network | |
CN114742997A (en) | Full convolution neural network density peak pruning method for image segmentation | |
Zhang et al. | A channel pruning algorithm based on depth-wise separable convolution unit | |
CN112967296B (en) | Point cloud dynamic region graph convolution method, classification method and segmentation method | |
CN114882234A (en) | Construction method of multi-scale lightweight dense connected target detection network | |
CN114882278A (en) | Tire pattern classification method and device based on attention mechanism and transfer learning | |
CN112263224B (en) | Medical information processing method based on FPGA edge calculation | |
Geng et al. | Pruning convolutional neural networks via filter similarity analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20211210 |