CN113963185A

CN113963185A - Visualization and quantitative analysis method and system for layer feature expression capability in neural network

Info

Publication number: CN113963185A
Application number: CN202111240906.7A
Authority: CN
Inventors: 张拳石
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2021-10-25
Filing date: 2021-10-25
Publication date: 2022-01-21
Also published as: WO2023072094A1

Abstract

The application relates to the technical field of machine learning, and discloses a method and a system for visualizing and quantitatively analyzing the expression capability of a layer feature in a neural network, which can realize automatic visualization and quantitative analysis of the expression capability of the layer feature in the neural network under an unsupervised condition. The method comprises the following steps: providing a neural network to be analyzed, wherein the neural network is a deep neural network pre-trained on a certain data set; providing a group of input samples, inputting the samples into the neural network, and extracting sample level features and area level features corresponding to the samples; respectively reducing the dimensions of the sample level features and the region level features to obtain a visualization result in a low-dimensional space; and quantitatively analyzing the quantity and quality of knowledge points in the features based on the visualization result of the region-level features.

Description

Visualization and quantitative analysis method and system for layer feature expression capability in neural network

Technical Field

The application relates to the technical field of machine learning, in particular to a method and a system for visualizing and quantitatively analyzing the expression capability of medial features in a neural network.

Background

At present, deep neural networks have exhibited powerful performance in various fields, but the black box nature of neural networks makes it difficult for people to understand their internal behaviors. In the prior art, the visualization method is the most widely applied method in the field of artificial intelligence interpretability, but the expression capability of the layer characteristics in the neural network cannot be quantitatively analyzed by using the existing visualization method.

Therefore, the combination of the visual interpretation of the neural network and the quantitative analysis of the expression capability of the middle-layer characteristics is a problem to be solved in the field of artificial intelligence interpretability.

Disclosure of Invention

The invention aims to provide a method and a system for visualizing and quantitatively analyzing the expression capability of a layer feature in a neural network, which can realize automatic visualization and quantitative analysis of the expression capability of the layer feature in the neural network under an unsupervised condition.

The application discloses a visualization and quantitative analysis method for the expression capability of medial features in a neural network, which comprises the following steps

(1) Selecting a feature interpretation object:

selecting a model to be analyzed, wherein the model comprises a middle layer expression and comprises the following steps: a neural network, a hierarchical graph model;

(2) extracting neural network features:

providing a group of input samples, inputting the samples into the neural network, and extracting characteristics of the samples, wherein the characteristics comprise: sample-wise feature (sample-wise feature) and region-wise feature (regional feature);

(3) and (3) reducing the dimension of the features to obtain a visual result:

firstly, carrying out dimension reduction on sample level characteristics to obtain a visual result of the sample level characteristics in a low-dimensional space; secondly, reducing the dimension of the region level features based on the low-dimensional representation and the region level features of the sample level features to obtain a visual result of the region level features in a low-dimensional space;

(4) and (3) carrying out quantitative analysis on the characteristics according to the visualization result:

and quantitatively analyzing the quantity and quality of knowledge points (knowledge points) in the features based on the visualization result.

In a preferred example, the step (1) further comprises the following steps:

based on a certain data set, a neural network is trained as the neural network to be analyzed. Optionally, the neural network is a classification neural network.

In a preferred embodiment, the step (2) further comprises the following sub-steps:

(a) extracting sample level features: inputting a given group of samples into a neural network to be analyzed, and extracting the output characteristics of a middle layer of the neural network for each sample so as to obtain the sample-level characteristics corresponding to each input sample, namely the sample-level characteristics corresponding to the group of input samples.

(b) Extracting region level features: inputting a given group of input samples into a neural network to be analyzed, and extracting the output features of a certain convolution layer of the neural network for each sample so as to obtain a feature map (feature map) corresponding to each input sample, wherein the high-dimensional vector corresponding to each position of the feature map is the region-level feature of the sample in the region. When the height and width of this feature map are H and W, respectively, and there are K channels in total, then this feature map contains HW region-level features, where each region-level feature is a K-dimensional vector.

In a preferred embodiment, the step (3) further comprises the following sub-steps:

(a) reducing the dimension of the sample level features, and visualizing the sample level features in a low-dimensional space;

(b) and reducing the dimension of the region-level features and visualizing the region-level features in a low-dimensional space.

In the above sub-step (a), for each sample x, a corresponding sample level characteristic is obtainedSign for

By a projection matrix

Mapping the low-dimensional feature into a low-dimensional space to obtain the low-dimensional characterization of the sample-level features

And, optimizing M such that the low-dimensional token should satisfy that the closeness of the low-dimensional token g and each class should be as consistent as possible with the closeness of the sample x and each class.

Optionally, in the calculation of the "proximity of the low-dimensional characterization to each class", the distribution of the low-dimensional characterization g of the sample-level features in the low-dimensional space is modeled by radial distribution (radial distribution), and the proximity of the low-dimensional characterization to each class is calculated based on the distribution.

Based on the radial distribution, the probability density function of g in the low-dimensional space can be calculated by the following formula.

Wherein y ∈ {1, 2.,. C } represents different classes in the classification task; pi_yRepresenting the prior probability of class y; l_g-g | | | represents the L2 norm of g, called the strength of g (strength); o_g＝g/l_gRepresents the direction of g (orientation); mu.s_yMean direction (mean direction) indicating the y-th class; k (·) is a monotonically increasing function. p (l)_gY) represents l on category y_gA priori probability of p_vMF(o_g|μ_y，κ(l_g) ) mean direction is μ_yThe aggregation parameter is κ (l)_g) vMF distribution (von Mises-Fisher distribution).

Alternatively, κ (·) may be any monotonically increasing function, and more preferably,the κ (·) function may be generated by: given a non-negative constant k_mAnd dimension d, from the mean direction, of

The aggregation parameter is k ═ k_mvMF distribution of the image signal is sampled to obtain N samples

Scaling these samples to length l without changing direction, where l is an arbitrary non-negative number, and noting the scaled samples as

Obtaining N Gaussian noise samples by sampling from standard normal distribution

Correspondingly adding the samples obtained by the scaling and the Gaussian noise samples to obtain

Define kappa (l) as

Wherein

N ranges from 1000 to 100000, and in one embodiment, N is 10000.

Based on the radial distribution, and assuming l_gIs independent of the class y, then the low-dimensional representation of the proximity Q of g and class y_M(y | x) can be calculated by the following formula.

Alternatively, the "closeness of the sample to each class" is calculated as the output probability of the sample, i.e. the closeness P (y | x) of the sample x and the y-th class is the output probability value of the corresponding y-th class in the neural network output.

Optionally, the optimization of the projection matrix M further comprises the low-dimensional characterization g and the closeness Q of each class_M(y | x), and the calculation of the proximity P (y | x) of the sample x to each class, the projection matrix M is optimized such that P (y | x) and Q_MThe KL divergence (Kullback-Leibler divergence) between (y | x) is minimized.

Optionally, the projection matrix M is optimized alternately with the parameters { pi, μ } ═ pi in the radial distribution in the process of obtaining a low-dimensional characterization of the sample-level features_y，μ_y}_y∈Y. Wherein, when optimizing the projection matrix M, the parameters { π, μ } in radial distribution are fixed, and M is updated such that KL [ P (Y | X) | Q_M(Y|X)]The value of (d) is minimized; when optimizing the parameters { pi, mu } in the radial distribution, the projection matrix M is fixed and { pi, mu } is updated such that the likelihood pi_gp(g)＝Π_g∑_y′π_y′·p_vMF(o_g|μ_y′，κ(l_g) ) has the largest value.

In sub-step (b) above, for each sample x, HW region-level features

By a projection matrix

Mapping them into a low-dimensional space to obtain the low-dimensional representation of HW region-level features

And, optimizing Λ such that the low-dimensional characterization should be satisfied, based on the low-dimensional characterization h ═ h { (h)⁽¹⁾，h⁽²⁾，...，h^(HW)The inferred inter-sample similarity is as consistent as possible with the inferred inter-sample similarity based on the network output, and further, the low-dimensional characterization of the region-level features needs to be aligned with the low-dimensional characterization of the sample-level features.

Optionally, in the calculation of "similarity between samples inferred based on low-dimensional features", the similarity between samples is split into weighted products of the similarities between low-dimensional features corresponding to each region based on the bag-of-words model; and based on vMF distribution, the similarity between the corresponding low-dimensional representations of each region is quantified.

Alternatively, in the above description, let x₁And x₂For any two samples, the low-dimensional characteristics of the corresponding region level features are respectively

And

based on bag of words model, let x₁And x₂Degree of similarity Q between_Λ(x₂|x₁) The segmentation is performed as a weighted product of the similarity between the corresponding low-dimensional features of each region, as shown below.

Wherein,

represents a sample x₂The degree of importance of the r-th region feature to the classification is, in a preferred embodiment,

is a non-negative number. And,

further quantified as follows.

Wherein,

is composed of

In the average direction of

An aggregation parameter of

vMF distribution of (1); as stated in claim 6, k (·) is a monotonically increasing function.

Alternatively, in the calculation of "similarity between samples inferred based on network output", let x be₁And x₂For any two of the samples, the sample is,

the network output probabilities corresponding to the two samples respectively are based on the similarity P (x) between the samples deduced by the network output₂|x₁) It can be further calculated as follows.

Wherein,

cos (·, ·) represents the cosine similarity between two vectors, κ_pIs a non-negative constant.

Alternatively, making "the inter-sample similarity inferred based on the low-dimensional characterization as consistent as possible with the inter-sample similarity inferred based on the network output" is equivalent to minimizing the loss function as follows.

Wherein, P (x)₂|x₁) And Q_Λ(x₂|x₁) As shown in claim 16 and claim 17, respectively.

Optionally, aligning the low-dimensional characterization of the region-level features with the low-dimensional characterization of the sample-level features is equivalent to optimizing the loss function as follows.

Wherein M/(. cndot.;) represents mutual information; g represents a low-dimensional characterization of the sample-level features of sample x; h is^(r)Low-dimensional characterization of the r-th region-level feature, w, representing the sample x^(r)Indicating how important the r-th region-level feature of the sample x is for classification.

Optionally, the optimization of the projection matrix Λ comprises the steps of: calculating the two loss functions respectively

And

further calculating the total loss function

Optimizing Λ based on the total loss function such that the total loss function

And (4) minimizing. Where α is a positive constant, in a preferred embodiment, α ranges from 0.01 to 100. In a fruitIn the examples, the value of α was taken to be 0.1.

In a preferred example, the step (4) further comprises the following steps:

(a) quantizing the knowledge points in the region-level features;

(b) reliable knowledge points are further quantified, as well as the proportion of reliable knowledge points.

Optionally, the knowledge point is defined as a set of region-level features such that the following equation is greater than some threshold.

Wherein h is^(r)Is a low-dimensional representation of the r-th region-level feature corresponding to a certain sample x. Therefore, the knowledge point representation is such that max_c p(y＝c|h^(r)) Features at the region level > τ, i.e. set h^(r)：max_c p(y＝c|h^(r)) τ where τ is a positive constant, and in a preferred embodiment, τ ranges from 0.3 to 0.8. In one embodiment, τ is 0.4.

Optionally, reliable knowledge points and unreliable knowledge points in the knowledge points can be further quantized, so that the proportion of reliable knowledge points to total knowledge points is quantized. The reliable knowledge points are a set of knowledge points that further satisfy the following equation.

Wherein h is^(r)A low-dimensional characterization of the r-th region-level feature corresponding to a certain sample x, c^truthA true category label representing the sample. That is, the reliable knowledge points are the set { h }^(r)：c^truth＝arg max_c p(y＝c|h^(r)) Area level features contained in (c) }.

Further, the ratio of reliable knowledge points to total knowledge points measures the quality of knowledge points, and can be calculated by the following equation.

In a second aspect of the present invention, there is provided a system for visualizing and quantitatively analyzing a layer feature expression capability in a neural network, comprising:

(1) the input module is configured to be a pre-trained classification neural network and input samples containing all possible classes;

(2) a feature extraction module configured to extract sample-level features and region-level features of the input sample;

(3) the visualization module is configured to reduce the dimension of the extracted sample-level features and the extracted region-level features to obtain low-dimensional representations and visualize the low-dimensional representations in a low-dimensional space;

(3) and the quantitative analysis module is configured to quantitatively analyze the quantity and quality of the knowledge points in the features based on the visualization result of the region-level features.

It is to be understood that within the scope of the present invention, the above-described features of the present invention and those specifically described below (e.g., in the examples) may be combined with each other to form new or preferred embodiments. Not to be reiterated herein, but to the extent of space.

Other features, objects and advantages of the present invention will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, with reference to the accompanying drawings.

Drawings

FIG. 1 is a flow chart of a method for visualizing and quantitatively analyzing the expression capability of layer features in a neural network according to a first embodiment of the present invention;

FIG. 2 is a schematic illustration of a visualization of sample-level features in a low-dimensional space, obtained in accordance with the present invention;

FIG. 3 is a schematic diagram of the visualization of region-level features in a low-dimensional space obtained according to the present invention at different training stages of a neural network;

FIG. 4 is a schematic diagram of the visualization of region-level features in a low-dimensional space obtained according to the present invention at different forward propagation stages of a neural network;

FIG. 5 is a graph of the variation of knowledge points and reliable knowledge points in different interlayer features of a neural network with different training stages of the neural network, obtained according to the quantization of the number and quality of the knowledge points in the present invention;

FIG. 6 is a schematic view of the visualization of knowledge points in different inter-layer features of a neural network, obtained by quantifying the knowledge points according to the present invention;

fig. 7 is a schematic structural diagram of a system for visualizing and quantitatively analyzing the expression capability of layer features in a neural network according to a second embodiment of the present invention.

Detailed Description

Through careful and intensive research, the inventor develops a method and a system for visualizing and quantitatively analyzing the layer characteristic expression capability in a neural network for the first time. By the method and the system, the visual interpretation of the neural network is closely related to the quantitative analysis of the layer feature expression capability in the neural network; by visualization, the process that the expression capacity of the layer characteristics in the neural network emerges along with time and space can be clearly shown; the method can quantitatively analyze the quantity and quality of the layer knowledge points in the neural network, and further analyze the reliability of the model to be explained; based on the method and the system, a brand-new angle interpretation framework can be provided for the existing deep learning algorithms, such as attack resistance and knowledge distillation.

Based on the method and the system, a brand-new angle interpretation framework can be provided for the existing deep learning algorithms, such as attack resistance and knowledge distillation.

General procedure

Typically, the present invention comprises the steps of:

(1) providing a neural network to be analyzed, wherein the neural network is a deep neural network pre-trained on a certain data set;

(2) providing a group of input samples, inputting the samples into the neural network, and extracting sample-wise features (sample-wise features) and regional-wise features (regional features) corresponding to the samples;

(3) respectively reducing the dimensions of the sample level features and the region level features to obtain a visualization result in a low-dimensional space;

(4) and quantitatively analyzing the quantity and quality of knowledge points (knowledge points) in the features based on the visualization result of the region-level features.

The main advantages of the invention are:

(1) a visualization and quantitative analysis method and system for the expression ability of the layer characteristics in the neural network;

(2) by the method and the system, the visual interpretation of the neural network is closely related to the quantitative analysis of the layer feature expression capability in the neural network;

(3) by visualization, the process that the expression capacity of the layer characteristics in the neural network emerges along with time and space can be clearly shown;

(4) the method can quantitatively analyze the quantity and quality of the layer knowledge points in the neural network, and further analyze the reliability of the model to be explained;

(5) based on the method and the system, a brand-new angle interpretation framework can be provided for the existing deep learning algorithms, such as attack resistance and knowledge distillation.

Examples

To make the objects, technical solutions and advantages of the present application more clear, embodiments of the present application will be described in further detail below with reference to the accompanying drawings.

The first embodiment of the present invention relates to a method for visualizing and quantitatively analyzing the expression capability of a layer feature in a neural network, the flow of which is shown in fig. 1, and the method comprises the following steps:

in step 101: based on a certain data set, a neural network is trained as the neural network to be analyzed. Optionally, the neural network is a classification neural network.

Then, step 102 is entered, which may be further divided into the following two sub-steps:

Then, step 103 is entered, which may be further divided into the following two sub-steps:

In the sub-step (a), for each sample x, corresponding sample-level features

By a projection matrix

Alternatively, κ (·) may be any monotonically increasing function, and more preferably, the κ (·) function may be generated by: given a non-negative constant k_mAnd dimension d, from the mean direction, of

From a standard normalSampling in the distribution to obtain N Gaussian noise samples

Define kappa (l) as

Wherein

N ranges from 1000 to 100000, and in one embodiment, N is 10000.

Optionally, the projection matrix M and the radial direction are optimized alternately in obtaining the low-dimensional characterization of the sample-level featuresParameter { pi, mu } - { pi in distribution_y，μ_y}_y∈Y. Wherein, when optimizing the projection matrix M, the parameters { π, μ } in radial distribution are fixed, and M is updated such that KL [ P (Y | X) | Q_M(Y|X)]The value of (d) is minimized; when optimizing the parameters { pi, mu } in the radial distribution, the projection matrix M is fixed and { pi, mu } is updated such that the likelihood pi_gp(g)＝Π_g∑_y′π_y′·p_vMF(o_g|μ_y′，κ(l_g) ) has the largest value.

As shown in FIG. 2, in one embodiment of the present invention, a VGG-16 network pre-trained on a Tiny ImageNet image classification data set is given, wherein, during training, the neural network only adopts ten categories of steel arch bridge, school bus, sports car, tabby cat, desk, golden retriever, tall, iPod, life, and orange, and the characteristics of the VGG-16 network after the last second fully connected layer are extracted as sample-level characteristics, and the aforementioned method is used to perform dimension reduction to obtain a low-dimensional characterization scatter diagram in a three-dimensional space. In the figure, different colors indicate different categories shown in the legend, and the arrows corresponding to the colors of the respective categories indicate the average directions of the respective categories.

In sub-step (b) above, for each sample x, HW region-level features

By a projection matrix

And, optimizing Λ so that this is lowThe dimension characterization should satisfy, based on the low dimension characterization h ═ h⁽¹⁾，h⁽²⁾，...，h^(HW)The inferred inter-sample similarity is as consistent as possible with the inferred inter-sample similarity based on the network output, and further, the low-dimensional characterization of the region-level features needs to be aligned with the low-dimensional characterization of the sample-level features.

And

Wherein,

is a non-negative number. And,

further quantified as follows.

Wherein,

is composed of

In the average direction of

An aggregation parameter of

Wherein,

Wherein MI (;) represents mutual information; g represents a low-dimensional characterization of the sample-level features of sample x; h is^(r)Low-dimensional characterization of the r-th region-level feature, w, representing the sample x^(r)Indicating how important the r-th region-level feature of the sample x is for classification.

And

further calculating the total loss function

And (4) minimizing. Where α is a positive constant, in a preferred embodiment, α ranges from 0.01 to 100. In one embodiment, the value of α is taken to be 0.1.

As shown in FIG. 3, given a VGG-16 network pre-trained on a Tiny ImageNet data set as the previous one, the output features of the conv _53 layer are extracted as region-level features, and dimension reduction is carried out to three dimensions by the method to obtain low-dimensional representations and distribution in three-dimensional space. As before, the scatter points of different colors represent the low-dimensional representations of the region-level features corresponding to the samples of different classes, and the ellipses in the diagram represent the approximate distribution of the low-dimensional representations of the region-level features corresponding to the samples of different classes.

As shown in fig. 4, given a VGG-16 network pre-trained on the same Tiny ImageNet data set as described above, the output features of the conv _12, conv _22, conv _33, conv _43, and conv _53 layers are extracted as region-level features, and the above-described method is used to perform dimension reduction to a three-dimensional space to obtain low-dimensional features. Fig. 4 shows a distribution variation of the low-dimensional characterization of the region-level features corresponding to different samples with increasing number of forward-propagating layers, wherein the vertical upward arrow represents the correct average direction of the sample, and the scatter points of different colors represent the distribution of the low-dimensional characterization of the region-level features corresponding to different layers in a three-dimensional space.

Then, step 104 is entered, which may be further divided into the following sub-steps:

(a) quantizing the knowledge points in the region-level features;

Wherein h is^(r)Is a low-dimensional representation of the r-th region-level feature corresponding to a certain sample x. Therefore, the knowledge point representation is such that max_c p(y＝c|h^(r)) Features at the region level > τ, i.e. set h^(r)：max_c p(y＝c|h^(r)) τ, where τ is a positive constant,in a preferred embodiment, τ is in the range of 0.3-0.8. In one embodiment, τ is 0.4.

As shown in fig. 5, given a VGG-16 network pre-trained on the Tiny ImageNet dataset as described above, the number of all knowledge points and the number of reliable knowledge points in the corresponding region-level features of the conv _33, conv _43 and conv _53 layers were calculated by the method described above. Fig. 5 shows the variation curve of the total amount of knowledge points and the number of reliable knowledge points of different layers of the neural network with the number of training iterations of the neural network.

As shown in fig. 6, given a VGG-16 network pre-trained on the Tiny ImageNet dataset as described above, all knowledge points in the corresponding region-level features of the conv _33, conv _43, and conv _53 layers were obtained by the method described above. Fig. 6 shows the areas of the knowledge points corresponding to different layers in the graph, as indicated by the highlighted parts of the graph.

A second embodiment of the present invention relates to a system for visualizing and quantitatively analyzing the expression capability of a layer feature in a neural network, which has a structure shown in fig. 7 and includes:

(2) the visualization module is configured to reduce the dimension of the extracted sample-level features and the extracted region-level features to obtain low-dimensional representations and visualize the low-dimensional representations in a low-dimensional space;

It should be noted that, as will be understood by those skilled in the art, the implementation functions of the modules shown in the embodiment of the system for visualizing and quantitatively analyzing the expression capability of the layer features in the neural network described above can be understood by referring to the foregoing description of the method for visualizing and quantitatively analyzing the expression capability of the layer features in the neural network. The functions of the modules shown in the embodiment of the system for visualizing and quantitatively analyzing layer feature expression capability in a neural network can be realized by a program (executable instructions) running on a processor, and can also be realized by specific logic circuits. The visualization and quantitative analysis system for the layer feature expression capability in the neural network according to the embodiment of the present invention may be implemented in the form of a software functional module and may be stored in a computer-readable storage medium when the system is sold or used as an independent product. Based on such understanding, the technical solutions of the embodiments of the present invention may be essentially implemented or a part contributing to the prior art may be embodied in the form of a software product stored in a storage medium, and including several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the method of the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read Only Memory (ROM), a magnetic disk, or an optical disk. Thus, embodiments of the invention are not limited to any specific combination of hardware and software.

Accordingly, the embodiment of the present invention also provides a computer-readable storage medium, wherein computer-executable instructions are stored, and when being executed by a processor, the computer-executable instructions realize the method embodiments of the present invention. Computer-readable storage media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable storage medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.

In addition, the embodiment of the invention also provides a system for visualizing and quantitatively analyzing the expression capability of the layer characteristics in the neural network, which comprises a memory for storing computer executable instructions and a processor; the processor is configured to implement the steps of the method embodiments described above when executing the computer-executable instructions in the memory. The Processor may be a Central Processing Unit (CPU), other general-purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), or the like. The aforementioned memory may be a read-only memory (ROM), a Random Access Memory (RAM), a Flash memory (Flash), a hard disk, or a solid state disk. The steps of the method disclosed in the embodiments of the present invention may be directly implemented by a hardware processor, or implemented by a combination of hardware and software modules in the processor.

It is noted that, in the patent specification, relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element. In the invention document of this patent, if it is mentioned that a certain action is performed according to a certain element, it means that the action is performed at least according to the element, and two cases are included: performing the action based only on the element, and performing the action based on the element and other elements. The expression of a plurality of, a plurality of and the like includes 2, 2 and more than 2, more than 2 and more than 2.

All documents referred to in this application are to be considered as being incorporated in their entirety into the disclosure of the present invention for the purpose of making available modifications as necessary. It should be understood that the above description is only a preferred embodiment of the present disclosure, and is not intended to limit the scope of the present disclosure. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of one or more embodiments of the present disclosure should be included in the scope of protection of one or more embodiments of the present disclosure.

Claims

1. A visualization and quantitative analysis method for the expression ability of the layer characteristics in the neural network is characterized by comprising the following steps

(1) Selecting a feature interpretation object:

(2) extracting neural network features:

(3) and (3) reducing the dimension of the features to obtain a visual result:

2. The method of claim 1, wherein the step (2) of extracting the sample-level features further comprises the steps of:

inputting a given group of samples into a neural network to be analyzed, and extracting the output characteristics of a middle layer of the neural network for each sample so as to obtain the sample-level characteristics corresponding to each input sample, namely the sample-level characteristics corresponding to the group of input samples.

3. The method of claim 1, wherein the extracting of the region-level features in step (2) further comprises the steps of:

inputting a given group of input samples into a neural network to be analyzed, and extracting output features of a certain convolution layer of the neural network for each sample so as to obtain a feature map (feature map) corresponding to each input sample, wherein a high-dimensional vector corresponding to each position of the feature map is the region-level feature of the sample in the region; when the height and width of this feature map are H and W, respectively, and there are K channels in total, then this feature map contains HW region-level features, where each region-level feature is a K-dimensional vector.

4. The method of claim 1, wherein the dimension reduction process for the sample-level features in step (3) comprises the steps of:

for each sample x corresponding sample level feature

By a projection matrix

And, optimizing M such that the low-dimensional token should satisfy, the low-dimensional token g and the classes.

5. The method of claim 4, wherein the computing of the closeness of the low-dimensional tokens to the respective classes comprises:

(a) modeling a distribution of low-dimensional tokens g of sample-level features in a low-dimensional space using radial distribution (radial distribution);

(b) the proximity of the low-dimensional tokens to the respective classes is calculated.

6. The method of claim 5, wherein step (a) comprises:

based on the radial distribution, the probability density function of g in a low dimensional space can be written as follows:

wherein y ∈ {1, 2.,. C } represents different classes in the classification task; pi_yRepresenting the prior probability of class y; l_g-g | | | represents the L2 norm of g, called the strength of g (strength); o_g＝g/l_gRepresents the direction of g (orientation); mu.s_yMean direction (mean direction) indicating the y-th class; kappa (. cndot.) is a monotonically increasing function, p (l)_gY) represents l on category y_gA priori probability of p_vMF(o_g|μ_y，κ(l_g) ) mean direction is μ_yThe aggregation parameter is κ (l)_g) vMF distribution (von Mises-Fisher distribution).

7. The method of claim 5, wherein step (b) comprises:

based on the radial distribution, and assuming l_gIs independent of the class y, then the low-dimensional representation of the proximity Q of g and class y_M(y | x) is expressed as follows:

8. the method of claim 1, wherein the dimension reduction process for the region-level features in step (3) further comprises the steps of:

HW region level features for each sample x

By a projection matrix

And, optimizing Λ such that the low-dimensional characterization should be satisfied, based on the low-dimensional characterization h ═ h { (h)⁽¹⁾，h⁽²⁾,...，h^(HW)The inferred inter-sample similarity is as consistent as possible with the inferred inter-sample similarity based on the network output, and further, the low-dimensional characterization of the region-level features needs to be aligned with the low-dimensional characterization of the sample-level features.

9. The method of claim 1, wherein the knowledge points in step (4) are a set of region-level features that make the following equation greater than a certain threshold:

wherein h is^(r)A low-dimensional representation of the r-th region level feature corresponding to a certain sample x; therefore, the knowledge point representation is such that max_cp(y＝c|h^(r)) Features at the region level > τ, i.e. set h^(r)：max_cp(y＝c|h^(r)) τ where τ is a positive constant, and in a preferred embodiment, τ ranges from 0.3 to 0.8.

10. A system for visualizing and quantitatively analyzing the expression capability of layer features in a neural network is characterized by comprising the following modules:

(4) and the quantitative analysis module is configured to quantitatively analyze the quantity and quality of the knowledge points in the features based on the visualization result of the region-level features.