CN109919241B - Hyperspectral unknown class target detection method based on probability model and deep learning - Google Patents

Hyperspectral unknown class target detection method based on probability model and deep learning Download PDF

Info

Publication number
CN109919241B
CN109919241B CN201910200211.2A CN201910200211A CN109919241B CN 109919241 B CN109919241 B CN 109919241B CN 201910200211 A CN201910200211 A CN 201910200211A CN 109919241 B CN109919241 B CN 109919241B
Authority
CN
China
Prior art keywords
class
vector
activation
activation vector
probability
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910200211.2A
Other languages
Chinese (zh)
Other versions
CN109919241A (en
Inventor
江天
彭元喜
张立雄
宋明辉
郝昊
刘煜
张俊
李春潮
余永涛
张龙龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN201910200211.2A priority Critical patent/CN109919241B/en
Publication of CN109919241A publication Critical patent/CN109919241A/en
Application granted granted Critical
Publication of CN109919241B publication Critical patent/CN109919241B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention belongs to the field of hyperspectral intelligent perception, and discloses a hyperspectral unknown class target detection method based on a probability model and deep learning, which comprises the steps of S1, inputting hyperspectral training data into a trained CNN classification model, and outputting activation vectors of all samples of each class; s2, accumulating and averaging all the activation vectors of the samples which belong to the same category and are correctly classified to obtain a mean activation vector, and representing the center of the category by the mean activation vector; s3, fitting a Weibull model belonging to each class based on the activation vectors of all samples in each class and the mean activation vector of the class; s4, inputting the hyperspectral test data into a CNN model and a probability model to form a network based on the Weibull fitting result of each category, and calculating the probability of the hyperspectral test data belonging to unknown categories; the method has a clear structure, is easy to realize, reduces the training requirement of the neural network learning model, and can obviously improve the effect of detecting the unknown class target.

Description

Hyperspectral unknown class target detection method based on probability model and deep learning
Technical Field
The invention mainly relates to the field of hyperspectral intelligent sensing, in particular to a hyperspectral unknown class target detection method based on a probability model and deep learning.
Background
The hyperspectral data represents two-dimensional information of ground object distribution and one-dimensional information describing spectral characteristics of the ground objects, the spectral resolution of the hyperspectral data reaches the nanometer level, the spectral information of dozens of to hundreds of wave bands is provided for each pixel, and a complete and continuous spectral curve can be generated. Compared with visible light and multiple spectrums, the high spectrums have richer surface feature spectrum information and can reflect the fine spectrum attributes of the surface features to be detected in detail. According to the difference of spectral characteristics between different ground objects to be detected, a computer or special equipment is used for operating a classification and identification program to allocate a label for a class mark to each pixel, and the ground objects to be detected are divided into a plurality of regions of different classes, which is called hyperspectral image classification and identification. The hyperspectral image classification and identification are classified into supervised classification identification and unsupervised classification identification (including semi-supervised classification).
The hyper-spectral image classification and identification method based on deep learning mostly belongs to supervised classification and identification. The deep neural network is trained through the labeled hyperspectral image data, and then the trained deep neural network is used for classifying and identifying the hyperspectral image. Due to the complexity and dynamics of the actual environment, categories which are not available in the training sample are necessarily encountered in the process of classifying the hyperspectral images by using the trained deep neural network. We define objects that are input during the classification and recognition process but do not belong to any one of the classes in the training sample as hyper-spectral unknown class objects. The hyperspectral unknown class target detection is to determine the position and the size of the hyperspectral unknown class target. The unknown target position and size are determined through unknown target detection, and the purpose of identifying the unknown type target is achieved.
At present, the detection of an unknown class target by utilizing a deep neural network is mostly realized by depending on a SoftMax layer of the neural network, and when the probability of belonging to each known class calculated by the SoftMax layer is less than a given threshold value, the unknown class target is judged. But the accuracy of identifying and detecting the unknown class of objects is poor only by depending on the calculation result of the SoftMax layer.
The detection of the hyperspectral unknown class targets can also use a hyperspectral abnormal target detection technology. Abnormal target detection of the hyperspectral image refers to searching sparse pixels with unknown spectrum shapes in a hyperspectral data cube. Anomalous target detection does not require any prior information of the spectrum. There are some achievements and patents in the field of hyperspectral anomalous target detection. Comparison document 1 (Chinese patent invention, application number: CN201510593935) proposes a sparse RX abnormal target detection algorithm based on sparse representation theory. According to the algorithm, firstly, a spatial preprocessing method is utilized to enable target information of the hyperspectral image to be more prominent than background information, and then a bilateral filtering method is utilized to filter out the influence of noise interference on the hyperspectral image; and then calculating a sparse difference index of the hyperspectral image by using a sparse representation theory, reconstructing a hyperspectral image data vector by using the sparse difference index, and finally performing abnormal target detection by using an RX method to obtain an abnormal target detection result. The detection precision, the false alarm rate, the robustness and the like of the algorithm are improved to a certain extent, and the performance of the RX anomaly detection algorithm is improved. But the computation is relatively complex and the whole algorithm runs long.
Generally, currently, few effective methods are available in the field of hyperspectral unknown class target detection. The deep neural network only uses a SoftMax layer to detect unknown class targets, and the detection and identification precision is not high. Although the hyperspectral abnormal target detection algorithm can also be used for hyperspectral unknown class target detection, the calculation is complex and the running time is long. The invention provides a hyperspectral unknown class target detection method based on a probability model and deep learning. In practical use, the activation vector of the CNN network is input into the existing probability model, the existing probability model is used for measuring the distance between the vector and the mean activation vector of all known classes, and whether the activation vector belongs to the target of the unknown class or not is judged. The method solves the problem that the detection precision of the deep neural network only using the SoftMax layer to detect the unknown class target is not high, and improves the efficiency and the precision of the detection of the hyperspectral unknown class target.
Disclosure of Invention
And the target which is input by the hyperspectral classification and recognition system in the classification and recognition process and does not belong to any one category in the training sample is called a hyperspectral unknown category target. Determining the position and the size of the hyperspectral unknown class target is called hyperspectral unknown class target detection. The hyperspectral abnormal target detection can also be used for detecting unknown targets, but many abnormal target detection algorithms are complex in calculation and too long in calculation time. The detection precision of the deep neural network for detecting the unknown class of targets only by using the SoftMax layer is not high, and the synthesized deception image cannot be effectively detected and identified.
The invention aims at the problems and the characteristics, and introduces a Weibull probability model calculation layer between a first last layer (SoftMax layer) and a second last layer (CNN classification model output activated layer) of the deep neural network. The probability of belonging to each known class and the probability of belonging to the unknown class are calculated for the activation vector output by the CNN by using a Weibull probability model established by classifying correct samples during training, and classification is performed according to the calculation result.
A hyperspectral unknown class target detection method based on a probability model and deep learning comprises the following steps:
s1, inputting hyperspectral training data into a trained CNN classification network for classification, wherein the classification output result is the activation vector of all samples of each class, and the activation vector output by the CNN classification model is expressed as v (x) -value (v)1(x),v2(x),...,vN(x) Wherein v) is1(x),v2(x)...vN(x) Is excitingAn element or component of a live vector;
s2, accumulating the activation vectors of all the samples which belong to the same category and are correctly classified, and averaging to obtain a mean activation vector, and using the mean activation vector to represent the center of the category, wherein the ith input sample is correctly classified into the jth category after passing through the CNN network, and the ith input sample is represented as xi,jSample xi,jThe activation vector output after the CNN model processing is represented as vj(xi,j) Taking si,j=vj(xi,j) The number of training samples correctly classified as the jth class among the training samples is N1The formula for solving the mean vector of the j-type activation vector is as follows:
Figure BDA0001996937930000021
∑ in the formula is a summation over N1Summing the activation vectors of class j, dividing by N1Obtaining an average activation vector, and calculating N times in this way to obtain average activation vectors of N classes;
s3, fitting a Weibull (Weibull) probability model belonging to each class based on the activation vectors of the samples and the mean activation vector of each class, wherein the fitting result is the position parameter tau of the Weibull (Weibull) probability model of each classjProportional parameter lambdajShape parameter κj
Wherein the probability density function of the Weibull distribution is
Figure BDA0001996937930000022
In the formula, λjIs a proportional parameter, κjIs the shape parameter, τjIs a position parameter, x is a random variable;
wherein, the j-th activation vector is represented by taujjjThe method can be calculated by using a FitHigh function in a python language libMR library, and the calculation method comprises the following steps:
first, the average activation vector μ calculated in S2 is usedjCalculate farthest distance ηj,ηjIs all activations of class jVector to average activation vector mujThe maximum value of the distances;
further, a normalized vector is calculated by averaging the activation vectors
Figure BDA0001996937930000031
Is an activation vector si,jSubtract the average activation vector μjThen dividing the variance of the j-th activation vector to obtain the variance of the j-th activation vector;
finally, using the calculated
Figure BDA0001996937930000038
The value calls a FitHigh function in a python language libMR library to calculate the tau of the j typejjjWherein j is 1.. multidot.n; tau isjjjIs calculated by
Figure BDA0001996937930000033
And calculating the obtained parameter taujjjStoring;
s4, inputting the new activation vector of the hyperspectral test data into the Weibull distribution model of each category based on the Weibull fitting result of each category, and calculating the probability of belonging to the unknown category, wherein the specific steps are as follows:
(4a) the activation vector output after the test data is processed by the CNN network is v (x) ═ v1(x),v2(x),...,vN(x) For activation vector v (x) v ═ v)1(x),v2(x),...,vN(x) Returns the position index of each component in the activation vector in descending order, wherein s (i) ═ argsort (v)j(x) Is expressed for the activation vector component vj(x) Is ordered from small to large and returns the component subscript, s (i) is vj(x) Subscript, preset adjustment factor ωj1, and j 1,2,3.. N;
(4b) adjusting the categories of alpha top levels, and calculating an adjustment coefficient in the following way:
Figure BDA0001996937930000034
wherein i is 1, 2.. multidot. α;. taus(i)、λs(i)、κs(i)Position parameter, scale parameter, shape parameter, ω, representing class s (i)s(i)(x) Is the calculated adjustment coefficients, and is calculated by α adjustment coefficients omegas(i)(x) S (i) ═ 1,2,. multidot. α and the remaining N- α omega with no adjustmentjWherein, ω isjN, N components make up the adjustment vector ω (x) ═ ω (ω) · α +1, α +21(x),ω2(x),...,ωα(x),1,...,1)TT denotes vector transposition;
(4c) and (4) adjusting the activation vector by using the adjustment vector omega (x) obtained by the calculation in the step (4b), wherein the adjustment mode is as follows:
Figure BDA0001996937930000035
in the formula (I), the compound is shown in the specification,
Figure BDA0001996937930000036
means multiplication of two vectors, multiplication of two vector-corresponding elements to obtain a new vector-corresponding element, v (x) ═ v1(x),v2(x),...,vN(x) Is an activation vector, ω (x) is an adjustment vector formed by adjustment coefficients, and v (x) and ω (x) are vector-multiplied to obtain an adjusted activation vector
Figure BDA0001996937930000037
(4d) Computing unknown class activation adjustment components
Figure BDA0001996937930000041
Figure BDA0001996937930000042
Wherein ∑ represents the summation, i ranges from 1 to α, vi(x) Is the ith element or component of the activation vector, ωi(x) Is the adjustment coefficient, the result of calculationIs that
Figure BDA0001996937930000043
(4e) Using a SoftMax function to
Figure BDA0001996937930000044
And
Figure BDA0001996937930000045
calculating the probability of the input test data belonging to an unknown class target as input, wherein the calculation formula of the probability belonging to the jth class is as follows:
Figure BDA0001996937930000046
wherein j ranges from 0 to N,
Figure BDA0001996937930000047
the method comprises the steps of solving an exponent with e as the base for the components of the adjusted activation vector, adding the exponents, and dividing the exponent with e as the base for the jth activation component of the adjusted activation vector by the input test sample with the probability of belonging to the jth class
Figure BDA0001996937930000048
Thus calculating N +1 probabilities;
the category number of the maximum probability among the N +1 probabilities is taken:
y*=argmaxj P(y=j|x),j=0,1,2,...,N
in the formula, argmaxj represents a class number that takes the maximum value among N +1 probabilities, and j ═ 0 is an unknown target class number; if the maximum probability class number is equal to 0, i.e., y ═ 0, or the maximum probability is less than the threshold, P (y ═ y | x) <isrepresented by a probability, then it is determined that the test input x belongs to the unknown class object.
Wherein, S1 to S3 are to establish a probability model of each known class by using training data, and may be calculated in an Offline (Offline) manner based on a training set, and store the calculation result. S4, detecting unknown class targets for input data by using the established probability model, and processing test data in an Online (Online) mode in real time.
Compared with the prior art, the invention has the advantages that:
1. according to the invention, statistical Weibull distribution (Weibull distribution) is introduced, and a Weibull probability model calculation layer is added in front of a SoftMax layer of a deep neural network, so that the neural network is more suitable for open set identification application; in addition, the invention classifies the targets and detects the unknown targets, thereby improving the efficiency and the precision of the detection of the unknown targets.
2. The invention reduces the training requirement of the neural network learning model; the invention applies deep learning to the field of hyperspectral unknown target recognition and widens the field of deep learning application.
Drawings
FIG. 1 is a flow chart of a hyperspectral unknown object detection framework of the invention;
FIG. 2 is a schematic diagram of hyperspectral unknown object detection algorithms S1 to S3 (part of establishing a probability model);
FIG. 3 is a schematic diagram of a hyperspectral unknown object detection algorithm S4 (detection of unknown object parts) according to the invention.
Detailed Description
The invention will be described in further detail below with reference to the drawings and specific examples.
Fig. 1 shows an overall flowchart of unknown class target detection, where hyperspectral training data is input into a trained CNN classification model, and the output of the last fully-connected layer of the CNN classification model is used as an activation vector of a sample of each class. And accumulating the activation vectors of the correctly classified samples in each category, calculating a mean value, and characterizing the category by using the mean activation vector. Fitting a Weibull (Weibull) probability model belonging to each class by using the activation vectors and the mean activation vector of each class, wherein the fitting result is a position parameter tau of the Weibull (Weibull) probability modeljProportional parameter lambdajShape parameter, etc.. kappaj. Inputting new activation vectors of hyperspectral test data into a Weibull distribution model of each class based on Weibull fitting results of each classType, the probability of belonging to an unknown class is calculated.
FIG. 2 shows the calculation flow of the training part when the probabilistic model is built in S1 to S3, and the process only depends on the training set, so that the process can be calculated in advance.
S1, inputting hyperspectral training data into a trained CNN classification network for classification, wherein the classification output result is the activation vector of all samples of each class, and the activation vector output by the CNN classification model is expressed as v (x) -value (v)1(x),v2(x),...,vN(x) Wherein v) is1(x),v2(x)...vN(x) Is an element or component of an activation vector;
s2, accumulating the activation vectors of all samples belonging to the same class and correctly classified, and averaging to obtain a mean activation vector, and using the mean activation vector to represent the center of the class, wherein the ith input sample is correctly classified into the jth class after passing through the CNN network, and the ith input sample is represented as xi,jSample xi,jThe activation vector output after the CNN model processing is represented as vj(xi,j) Taking si,j=vj(xi,j) The number of training samples correctly classified as the jth class among the training samples is N1The formula for solving the mean vector of the j-type activation vector is as follows:
Figure BDA0001996937930000051
∑ in the formula is a summation over N1Summing the activation vectors of class j, dividing by N1And obtaining an average activation vector, thus obtaining an average activation vector of N classes.
In order to realize the hyperspectral unknown class detection function, the probabilistic model uses the theory of Meta-Recognition (Meta-Recognition), namely, the output activation vector of the CNN is analyzed by means of Extreme Value theory (EVT for short). The theorem explains the statistical characteristics of an extreme value of probability distribution, and simultaneously defines three distribution functions to carry out mathematical modeling on the characteristics, wherein the Weibull (Weibull) distribution conforms to the application scene of unknown class target detection. When calculating a Weibull (Weibull) model of each category, model parameters can be obtained by using FitHigh functions in a python language libMR library to complete model fitting and calculate model parameters.
S3, fitting a Weibull (Weibull) probability model belonging to each class based on the activation vectors of the samples and the mean activation vector of each class, wherein the fitting result is the position parameter tau of the Weibull (Weibull) probability model of each classjProportional parameter lambdajShape parameter κj
Wherein the probability density function of the Weibull distribution is
Figure BDA0001996937930000052
In the formula, λjIs a proportional parameter, κjIs the shape parameter, τjIs a position parameter, x is a random variable;
wherein, the j-th activation vector is represented by taujjjThe method can be calculated by using a FitHigh function in a python language libMR library, and the calculation method comprises the following steps:
first, the average activation vector μ calculated in S2 is usedjCalculate farthest distance ηj,ηjIs all activation vectors of class j to the mean activation vector mujThe maximum value of the distances;
further, a normalized vector is calculated by averaging the activation vectors
Figure BDA0001996937930000061
Is an activation vector si,jSubtract the average activation vector μjThen dividing the variance of the j-th activation vector to obtain the variance of the j-th activation vector;
finally, using the calculated
Figure BDA00019969379300000610
The value calls a FitHigh function in a python language libMR library to calculate the tau of the j typejjjWherein j is 1.. multidot.n; tau isjjjIs calculated by
Figure BDA0001996937930000063
And calculating the obtained parameter taujjjStoring;
fig. 3 shows a calculation flow for detecting unknown target portions, which involves real data processing, so that the calculation needs to be completed online, and this step corresponds to an inference step in computer vision. The index of the unknown class object in this algorithm is 0. As shown, the inputs to the detection section are the output activation vector of CNN (i.e., the activation vector of the test data) and the Weibull model parameters for each class calculated in the previous section.
S4, inputting the new activation vector of the hyperspectral test data into the Weibull distribution model of each category based on the Weibull fitting result of each category, and calculating the probability of belonging to the unknown category, wherein the specific steps are as follows:
(4a) the activation vector output after the test data is processed by the CNN network is v (x) ═ v1(x),v2(x),...,vN(x) For activation vector v (x) v ═ v)1(x),v2(x),...,vN(x) Returns the position index of each component in the activation vector in descending order, wherein s (i) ═ argsort (v)j(x) Is expressed for the activation vector component vj(x) Is ordered from small to large and returns the component subscript, s (i) is vj(x) Subscript, preset adjustment factor ωj1, and j 1,2,3.. N;
(4b) adjusting the categories of alpha top levels, and calculating an adjustment coefficient in the following way:
Figure BDA0001996937930000064
wherein i is 1, 2.. multidot. α;. taus(i)、λs(i)、κs(i)Position parameter, scale parameter, shape parameter, ω, representing class s (i)s(i)(x) Is the calculated adjustment coefficients, and is calculated by α adjustment coefficients omegas(i)(x) S (i) ═ 1,2,. multidot. α and the remaining N- α did notRegulated omegajWherein, ω isjN, N components make up the adjustment vector ω (x) ═ ω (ω) · α +1, α +21(x),ω2(x),...,ωα(x) 1, 1.., 1) T, T represents a vector transpose;
(4c) and (4) adjusting the activation vector by using the adjustment vector omega (x) obtained by the calculation in the step (4b), wherein the adjustment mode is as follows:
Figure BDA0001996937930000065
in the formula (I), the compound is shown in the specification,
Figure BDA0001996937930000066
means multiplication of two vectors, multiplication of two vector-corresponding elements to obtain a new vector-corresponding element, v (x) ═ v1(x),v2(x),...,vN(x) Is an activation vector, ω (x) is an adjustment vector formed by adjustment coefficients, and v (x) and ω (x) are vector-multiplied to obtain an adjusted activation vector
Figure BDA0001996937930000067
(4d) Computing unknown class activation adjustment components
Figure BDA0001996937930000068
Figure BDA0001996937930000069
Wherein ∑ represents the summation, i ranges from 1 to α, vi(x) Is the ith element or component of the activation vector, ωi(x) Is an adjustment coefficient, the calculation result is
Figure BDA0001996937930000071
(4e) Using a SoftMax function to
Figure BDA0001996937930000072
And
Figure BDA0001996937930000073
calculating the probability of the input test data belonging to an unknown class target as input, wherein the calculation formula of the probability belonging to the jth class is as follows:
Figure BDA0001996937930000074
wherein j ranges from 0 to N,
Figure BDA0001996937930000075
the method comprises the steps of solving an exponent with e as the base for the components of the adjusted activation vector, adding the exponents, and dividing the exponent with e as the base for the jth activation component of the adjusted activation vector by the input test sample with the probability of belonging to the jth class
Figure BDA0001996937930000076
Thus calculating N +1 probabilities;
the category number of the maximum probability among the N +1 probabilities is taken:
y*=argmaxj P(y=j|x),j=0,1,2,...,N
in the formula, argmaxj represents a class number that takes the maximum value among N +1 probabilities, and j ═ 0 is an unknown target class number; if the maximum probability class number is equal to 0, i.e., y ═ 0, or the maximum probability is less than the threshold, P (y ═ y | x) <isrepresented by a probability, then it is determined that the test input x belongs to the unknown class object.
In the calculation process, the joblib library in python language is used to complete the storage of all intermediate data, including model parameters of Weibull (Weibull) distribution, average activation vectors (MAV) of all classes, and the like.
The above description is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above embodiments, and all technical solutions belonging to the idea of the present invention belong to the protection scope of the present invention. It should be noted that modifications and embellishments within the scope of the invention may occur to those skilled in the art without departing from the principle of the invention, and are considered to be within the scope of the invention.

Claims (1)

1. A hyperspectral unknown class target detection method based on a probability model and deep learning is characterized by comprising the following steps: the method comprises the following steps:
s1, inputting hyperspectral training data into a trained CNN classification network for classification, wherein the classification output result is the activation vector of all samples of each class, and the activation vector output by the CNN classification model is expressed as v (x) -value (v)1(x),v2(x),...,vN(x) Wherein v) is1(x),v2(x)...vN(x) Is an element or component of an activation vector;
s2, accumulating and averaging the activation vectors of all samples which belong to the same class and are correctly classified to obtain an average activation vector, and using the average activation vector to represent the center of the class, wherein the ith input sample is correctly classified into the jth class after passing through the CNN network, and the ith input sample is represented as xi,jSample xi,jThe activation vector output after the CNN model processing is represented as vj(xi,j) Taking si,j=vj(xi,j) The number of training samples correctly classified as the jth class among the training samples is N1The formula for calculating the average activation vector of the j-type activation vectors is as follows:
Figure FDA0002621816570000011
∑ in the formula is a summation over N1Summing the activation vectors of class j, dividing by N1Obtaining an average activation vector, and calculating N times in this way to obtain average activation vectors of N classes;
s3, fitting a Weibull probability model belonging to each class based on the activation vectors of the samples and the average activation vector of each class, wherein the fitting result is the position parameter tau of the Weibull probability model of each classjProportional parameter lambdajShape parameter κj
Wherein the probability density function of the Weibull distribution is
Figure FDA0002621816570000012
In the formula, λjIs a proportional parameter, κjIs the shape parameter, τjIs a position parameter, x is a random variable;
wherein, the j-th activation vector is represented by taujjjThe method is calculated by using a FitHigh function in a python language libMR library, and the calculation method comprises the following steps:
first, the average activation vector μ calculated in S2 is usedjCalculate farthest distance ηj,ηjIs all activation vectors of class j to the mean activation vector mujThe maximum value of the distances;
further, a normalized vector is calculated by averaging the activation vectors
Figure FDA0002621816570000013
Is an activation vector si,jSubtract the average activation vector μjThen dividing the variance of the j-th activation vector to obtain the variance of the j-th activation vector;
finally, the calculated μ is usedjj,
Figure FDA0002621816570000014
The value calls a FitHigh function in a python language libMR library to calculate the tau of the j typejjjWherein j is 1jjjThe calculation formula is
Figure FDA0002621816570000015
And calculating the obtained parameter taujjjStoring;
s4, inputting the new activation vector of the hyperspectral test data into the Weibull distribution model of each category based on the Weibull fitting result of each category, and calculating the probability of belonging to the unknown category, wherein the specific steps are as follows:
(4a) the activation vector output after the test data is processed by the CNN network is v (x) ═ v1(x),v2(x),...,vN(x) For activation vector v (x) v ═ v)1(x),v2(x),...,vN(x) Returns the position index of each component in the activation vector in descending order, wherein s (i) ═ arg sort (v)j(x) Is expressed for the activation vector component vj(x) Is ordered from small to large and returns the component subscript, s (i) is vj(x) Subscript, preset adjustment factor ωj1, and j 1,2,3.. N;
(4b) adjusting the categories of alpha top levels, and calculating an adjustment coefficient in the following way:
Figure FDA0002621816570000021
wherein i is 1, 2.. multidot. α;. taus(i)、λs(i)、κs(i)Position parameter, scale parameter, shape parameter, ω, representing class s (i)s(i)(x) Is the calculated adjustment coefficients, and is calculated by α adjustment coefficients omegas(i)(x) S (i) ═ 1,2,. multidot. α and the remaining N- α omega with no adjustmentjWherein, ω isjN, N components make up the adjustment vector ω (x) ═ ω (ω) · α +1, α +21(x),ω2(x),...,ωα(x),1,...,1)TT denotes vector transposition;
(4c) and (4) adjusting the activation vector by using the adjustment vector omega (x) obtained by the calculation in the step (4b), wherein the adjustment mode is as follows:
Figure FDA0002621816570000022
in the formula (I), the compound is shown in the specification,
Figure FDA0002621816570000023
means multiplication of two vectors, multiplication of two vector-corresponding elements to obtain a new vector-corresponding element, v (x) ═ v1(x),v2(x),...,vN(x) Is an activation vector, ω (x) is an adjustment vector formed by adjustment coefficients, and v (x) and ω (x) are vector-multiplied to obtain an adjusted activation vector
Figure FDA0002621816570000024
(4d) Computing unknown class activation adjustment components
Figure FDA0002621816570000025
Figure FDA0002621816570000026
Wherein ∑ represents the summation, i ranges from 1 to α, vi(x) Is the ith element or component of the activation vector, ωi(x) Is an adjustment coefficient, the calculation result is
Figure FDA0002621816570000027
(4e) Using a SoftMax function to
Figure FDA0002621816570000028
And
Figure FDA0002621816570000029
calculating the probability of the input test data belonging to an unknown class target as input, wherein the calculation formula of the probability belonging to the jth class is as follows:
Figure FDA00026218165700000210
wherein j ranges from 0 to N,
Figure FDA00026218165700000211
the method comprises the steps of solving an exponent with e as the base for the components of the adjusted activation vector, adding the exponents, and dividing the exponent with e as the base for the jth activation component of the adjusted activation vector by the input test sample with the probability of belonging to the jth class
Figure FDA00026218165700000212
Thus calculating N +1 probabilities;
the category number of the maximum probability among the N +1 probabilities is taken:
y*=argmaxj P(y=j|x),j=0,1,2,...,N
in the formula, argmaxj represents a class number that takes the maximum value among N +1 probabilities, and j ═ 0 is an unknown target class number; if the maximum probability class number is equal to 0, i.e., y ═ 0, or the maximum probability is less than the threshold, P (y ═ y | x) <isrepresented by a probability, then it is determined that the test input x belongs to the unknown class object.
CN201910200211.2A 2019-03-15 2019-03-15 Hyperspectral unknown class target detection method based on probability model and deep learning Active CN109919241B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910200211.2A CN109919241B (en) 2019-03-15 2019-03-15 Hyperspectral unknown class target detection method based on probability model and deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910200211.2A CN109919241B (en) 2019-03-15 2019-03-15 Hyperspectral unknown class target detection method based on probability model and deep learning

Publications (2)

Publication Number Publication Date
CN109919241A CN109919241A (en) 2019-06-21
CN109919241B true CN109919241B (en) 2020-09-29

Family

ID=66965454

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910200211.2A Active CN109919241B (en) 2019-03-15 2019-03-15 Hyperspectral unknown class target detection method based on probability model and deep learning

Country Status (1)

Country Link
CN (1) CN109919241B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110276189B (en) * 2019-06-27 2022-02-11 电子科技大学 User identity authentication method based on gait information
CN110909760B (en) * 2019-10-12 2021-03-16 中国人民解放军国防科技大学 Image open set identification method based on convolutional neural network
CN110826608A (en) * 2019-10-25 2020-02-21 广州思德医疗科技有限公司 Image classification method and device
CN111027600B (en) * 2019-11-25 2021-03-23 腾讯科技(深圳)有限公司 Image category prediction method and device
CN111105082A (en) * 2019-12-05 2020-05-05 山东浪潮人工智能研究院有限公司 Workpiece quality prediction model construction method and prediction method based on machine learning
CN111914919A (en) * 2020-07-24 2020-11-10 天津大学 Open set radiation source individual identification method based on deep learning
CN112200121B (en) * 2020-10-24 2022-06-07 中国人民解放军国防科技大学 Hyperspectral unknown target detection method based on EVM and deep learning
CN112149758B (en) * 2020-10-24 2022-03-04 中国人民解放军国防科技大学 Hyperspectral open set classification method based on Euclidean distance and deep learning
CN116628619B (en) * 2023-07-26 2023-10-20 西南交通大学 Unknown abnormal electrical phenomenon identification method based on vehicle network coupling

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106980875A (en) * 2017-03-13 2017-07-25 南京邮电大学 The zero sample image recognition methods based on attribute low-rank representation
CN108171122A (en) * 2017-12-11 2018-06-15 南京理工大学 The sorting technique of high-spectrum remote sensing based on full convolutional network
CN109344891A (en) * 2018-09-21 2019-02-15 北京航空航天大学 A kind of high-spectrum remote sensing data classification method based on deep neural network
CN109376786A (en) * 2018-10-31 2019-02-22 中国科学院深圳先进技术研究院 A kind of image classification method, device, terminal device and readable storage medium storing program for executing

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10657422B2 (en) * 2017-04-20 2020-05-19 The Boeing Company Methods and systems for hyper-spectral systems

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106980875A (en) * 2017-03-13 2017-07-25 南京邮电大学 The zero sample image recognition methods based on attribute low-rank representation
CN108171122A (en) * 2017-12-11 2018-06-15 南京理工大学 The sorting technique of high-spectrum remote sensing based on full convolutional network
CN109344891A (en) * 2018-09-21 2019-02-15 北京航空航天大学 A kind of high-spectrum remote sensing data classification method based on deep neural network
CN109376786A (en) * 2018-10-31 2019-02-22 中国科学院深圳先进技术研究院 A kind of image classification method, device, terminal device and readable storage medium storing program for executing

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
probability models for open set recognition;Walter J et al;《IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTERLLIGENCE》;20141130;第36卷(第11期);第2317-2323页 *
towards open set deep networks;bendale et al;《2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》;20161231;第1563-1572页 *
飞机可靠性分析中的混合威布尔分布参数估计方法;吴江;《工程设计学报》;20151231(第1期);第26-29页 *

Also Published As

Publication number Publication date
CN109919241A (en) 2019-06-21

Similar Documents

Publication Publication Date Title
CN109919241B (en) Hyperspectral unknown class target detection method based on probability model and deep learning
Ghaderizadeh et al. Hyperspectral image classification using a hybrid 3D-2D convolutional neural networks
CN107563355B (en) Hyperspectral anomaly detection method based on generation of countermeasure network
KR102005628B1 (en) Method and system for pre-processing machine learning data
US10713563B2 (en) Object recognition using a convolutional neural network trained by principal component analysis and repeated spectral clustering
CN105224872B (en) A kind of user&#39;s anomaly detection method based on neural network clustering
CN106295124B (en) The method of a variety of image detecting technique comprehensive analysis gene subgraph likelihood probability amounts
CN107633226B (en) Human body motion tracking feature processing method
CN109002755B (en) Age estimation model construction method and estimation method based on face image
CN110458192B (en) Hyperspectral remote sensing image classification method and system based on visual saliency
CN111950488A (en) Improved fast-RCNN remote sensing image target detection method
CN111709313A (en) Pedestrian re-identification method based on local and channel combination characteristics
Henila et al. Segmentation using fuzzy cluster‐based thresholding method for apple fruit sorting
CN111008652A (en) Hyper-spectral remote sensing image classification method based on GAN
Kumar et al. Multiclass support vector machine based plant leaf diseases identification from color, texture and shape features
Ju et al. A novel fully convolutional network based on marker-controlled watershed segmentation algorithm for industrial soot robot target segmentation
CN110837787B (en) Multispectral remote sensing image detection method and system for three-party generated countermeasure network
CN113191996A (en) Remote sensing image change detection method and device and electronic equipment thereof
Srivatsa et al. Application of least square denoising to improve admm based hyperspectral image classification
CN115393631A (en) Hyperspectral image classification method based on Bayesian layer graph convolution neural network
Rush et al. Feature Map Activation Analysis for Object Key-Point Detection
CN113469078B (en) Hyperspectral image classification method based on automatic design of long and short-term memory network
CN113469084B (en) Hyperspectral image classification method based on contrast generation countermeasure network
Cui et al. A Siamese Network Tracking Algorithm Based on DeblurGAN
Rashme et al. Self-Organizing Feature Map and K-Means Algorithm with Automatically Splitting and Merging Clusters based Image Segmentation.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant