CN110378384B - Image classification method combining privileged information and a ranking support vector machine - Google Patents

Image classification method combining privileged information and a ranking support vector machine

Info

Publication number
CN110378384B
CN110378384B
Authority
CN
China
Prior art keywords
image
images
negative
positive
prototype
Prior art date
Legal status
Active
Application number
CN201910532935.7A
Other languages
Chinese (zh)
Other versions
CN110378384A (en)
Inventor
刘倩
刘波
肖燕珊
李松松
刘芷菁
Current Assignee
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date
Filing date
Publication date
Application filed by Guangdong University of Technology
Priority to CN201910532935.7A
Publication of CN110378384A
Application granted
Publication of CN110378384B
Legal status: Active


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an image classification method combining privileged information and a ranking support vector machine. The method first constructs a labeled positive image set and an unlabeled image set, identifies reliable negative images from the unlabeled image set, and uses the reliable negative images to establish positive prototype images and negative prototype images; the reliable negative images are deleted from the unlabeled image set, the remaining unlabeled images, positive images and negative images are converted into vectors, and the vectors are differenced to form difference images; each difference image is represented by a similarity model; and a classification model is constructed from the difference images, the similarity weights and the privileged information and trained, the trained classification model being used to classify images. The method takes account of both the privileged information and the similarity of the difference images, and integrates them into the learning stage of the ranking support vector machine so as to classify images quickly.

Description

Image classification method combining privileged information and a ranking support vector machine
Technical Field
The invention relates to the fields of image processing and machine learning, and in particular to an image classification method combining privileged information and a ranking support vector machine.
Background
With the advent of the information age, the amount of information data has exploded. Images express information vividly and directly, and have gradually become one of the mainstream modes of information propagation. The traditional approach to image classification is hand-crafted feature description and detection. Such methods may be effective for some simple classification tasks, but they cannot keep up with the rapid growth in image data volume; a fast, efficient and reasonable method is needed for processing and analyzing images.
Image classification methods based on convolutional neural networks have become the mainstream algorithms for image classification; compared with traditional methods, they do not require manual description and extraction of features from the target image. The commonly used classical convolutional neural network models are of many kinds, each with limitations. The AlexNet model: its capability for describing and extracting image features is very limited. The GoogLeNet model: as the model hierarchy deepens, the gradient-vanishing problem becomes more serious, making the network difficult to train. The VGGNet model: the convolution kernels used by its convolutional layers are smaller, but the depth of the model is increased, the network uses too many parameters, and training is slow. With the development of positive and unlabeled (PU) learning, image classification methods based on PU learning have been proposed; they extend a limited set of training images with unlabeled images to improve the accuracy of the classifier. In PU learning, the training examples consist of some positive images and some unlabeled images, which may be positive or negative, and a classifier is constructed from the positive and unlabeled images. Many practical applications today can be viewed as PU learning tasks, e.g. information retrieval based on search-engine click data, where pages clicked by a user can be treated as the positive class, while pages not clicked are unlabeled.
Depending on how the unlabeled examples are handled, existing PU learning methods fall into three categories: methods based on a two-step strategy, cost-sensitive methods, and similarity-based methods.
However, in many practical applications it is common for labeled positive instances, a large number of unlabeled instances, and their privileged information to coexist, with the privileged information available only during the training phase and not during testing. For example, in image classification, annotations from experts are used as privileged information; such annotations are available only during training but not during testing. To handle all three at once, a method is needed that makes effective use of the unlabeled images while also making full use of the privileged information.
Disclosure of Invention
The aim of the invention is to provide an image classification method combining privileged information and a ranking support vector machine, which takes the presence of privileged information into account on top of a similarity-based PU model: the similarity of the difference images and the privileged information are both integrated into the learning stage of the ranking support vector machine so as to classify images quickly.
In order to accomplish these tasks, the invention adopts the following technical scheme:
An image classification method combining privileged information and a ranking support vector machine comprises the following steps:
step 1, randomly selecting images from an image data set to construct a positive class and a negative class, randomly selecting a portion of all the images in the positive class to label, recording them as labeled positive images, and forming a labeled positive image set; forming an unlabeled image set from the remaining images in the positive class and all the images in the negative class;
step 2, identifying reliable negative images from the unlabeled image set, and establishing positive prototype images and negative prototype images from the reliable negative images;
step 3, deleting the reliable negative images from the unlabeled image set, then converting the remaining unlabeled images, positive images and negative images into vectors, and differencing the vectors to form difference images;
representing each difference image by a similarity model;
and step 4, constructing a classification model from the difference images, the similarity weights and the privileged information and training it, the trained classification model being used to classify images.
Further, identifying reliable negative images from the unlabeled image set comprises:
identifying reliable negative images from the unlabeled image set with the Spy technique and the Rocchio technique respectively, taking the intersection of the reliable negative images preliminarily identified by the two techniques, and treating the images in the intersection as the final reliable negative images.
Further, establishing representative positive prototype images and negative prototype images from the reliable negative images comprises:
given a reliable negative images, with u' unlabeled images remaining, dividing the a reliable negative images into m micro-clusters by the k-means clustering method so as to establish the positive prototype images and negative prototype images, the calculation formulas being:
positive prototype image:
negative prototype image:
where the parameters α, β and t are tuning parameters; $p_v$ denotes the representative positive prototype image of the v-th micro-cluster and $n_v$ the representative negative prototype image of the v-th micro-cluster; PS denotes the labeled positive image set, US the unlabeled image set, and $NS_v$ the reliable negative image set of the v-th micro-cluster.
Further, the difference images are expressed as:
$X_e = x_i - x_j$, $X_q = x_i - x_k$, $X_n = x_k - x_j$
where $x_i \in PS$, $x_j \in NS$, $x_k \in US'$; $X_e$, $X_q$ and $X_n$ are the difference images formed, and $US'$ denotes the set of images remaining after the reliable negative images are deleted from the unlabeled image set.
Further, representing the difference image by the similarity model is specifically expressed as:
$\{X, (m^+(X), m^-(X))\}$
where $X$ denotes a difference image, and $m^+(X)$ and $m^-(X)$ are similarity weights representing the similarity of the difference image $X$ to the positive class and the negative class respectively; $d_1$ and $d_2$ are the Euclidean distances from the difference image to the representative positive prototype image and the representative negative prototype image. The representative positive and negative prototype images are selected as follows:
calculate the Euclidean distance from each positive prototype image to each negative prototype image, and select the pair of positive and negative prototype images that maximizes the Euclidean distance $d_{max}$ as the representative positive prototype image and representative negative prototype image.
Further, the classification model is expressed as formula 1 below, in which:
$X_e = x_i - x_j$, $X_q = x_i - x_k$, $X_n = x_k - x_j$; $S_1 = p \cdot a$, $S_2 = p \cdot u'$, $S_3 = u' \cdot a$, where $a$ denotes the number of reliable negative images extracted, $u'$ the number of remaining unlabeled images, and $p$ the number of labeled positive images; $x_i \in PS$, $x_j \in NS$, $x_k \in US'$; $x_i^*$, $x_j^*$ and $x_k^*$ denote the privileged information of the positive images, negative images and remaining unlabeled images respectively, and $X_e^*$, $X_q^*$ and $X_n^*$ the privileged information of the difference images; $w$ and $w^*$ denote the weight vectors and $\langle\cdot,\cdot\rangle$ the inner product; $C_1$, $C_2$ and $C_3$ are penalty factors controlling the trade-off between the hyperplane margin and the errors; $\xi_e$, $\xi_q$ and $\xi_n$ are error terms; $\gamma$ is a non-negative parameter; and the parameters $\theta_e$, $\theta_q$ and $\theta_n$ control the coefficients of the error terms.
The invention has the following technical characteristics:
1. In this scheme, an instance whose actual class is positive and whose prediction is positive is called a true positive (TP); actual positive but predicted negative, a false negative (FN); actual negative but predicted positive, a false positive (FP); and actual negative and predicted negative, a true negative (TN). When analyzing anomaly-classification models, the main concern is that anomalies are correctly detected, i.e. the true negative rate. The scheme improves the classification accuracy of the classifier by using unlabeled images and similarity weights, so that the number of TN instances increases and the number of FP instances decreases; the true negative rate thus rises, meaning the probability of correctly detecting an anomaly increases and the system can be repaired in time. Reducing the number of instances mispredicted as abnormal is of course also desirable, but such errors in fact have little impact on the system; in contrast to the false negative rate, the true negative rate captures the correct identification of abnormal instances.
2. The invention is proposed for the case where the PU problem and privileged information exist simultaneously. First, learning is considered with the privileged information added to the ordinary training images: a correcting function and slack variables are defined in the privileged feature space to evaluate the slack variables of the standard support vector machine. The resulting terms are then integrated into the learning stage of a ranking support vector machine, which trains the ranking effectively: compared with a standard support vector machine, it solves the same optimization problem faster. Since a ranking support vector machine is used, the similarity weights apply to the difference images.
Drawings
FIG. 1 is a schematic flow chart of the method of the present invention.
Detailed Description
When the method is used in image classification and recognition, a positive class and a negative class are constructed from the image data set. For all images (instances) in the positive class, a small portion is randomly selected as labeled positive images, while the other images in the positive class and all images in the negative class constitute the unlabeled image set. This yields a labeled positive image set $PS = \{x_i\}_{i=1}^{p}$, where $p$ is the number of labeled positive images and $x_i$ denotes a labeled positive image, and an unlabeled image set $US = \{x_k\}_{k=1}^{u}$, where $u$ is the number of unlabeled images and $x_k$ denotes an unlabeled image. The labeled positive images belong to the highly relevant class, while the unlabeled images belong to the less relevant class. All images are resized to 10×10 pixels, and the 100-dimensional raw pixel vector of each image is taken as its main feature vector; each training image additionally carries a description, which is converted into a 21-dimensional text feature vector and used as privileged information. The goal of this scheme is to construct a classifier from the labeled positive and unlabeled images and then classify images with that classifier.
The basic idea of the image classification method combining privileged information and a ranking support vector machine is as follows: extract the most reliable negative images NS from the unlabeled examples, then delete NS from the unlabeled images, i.e. $US' = US - NS$; cluster the images in NS into m micro-clusters with k-means clustering, denoted $NS_1, NS_2, \ldots, NS_m$, to build representative positive prototypes and representative negative prototypes; generate two similarity weights $m^+(X)$ and $m^-(X)$ for each difference image; and integrate the difference examples formed from the positive image set, the negative image set and the unlabeled image set, together with the similarity weights and the privileged information, into the learning model of the ranking support vector machine. The specific steps of the invention are as follows:
step 1, constructing a positive marked image set and an unmarked image set
Randomly select some images from the image data set to construct a positive class and a negative class; the images in the positive class are called positive images and the images in the negative class negative images. For all images in the positive class, randomly select a portion to label, record them as labeled positive images, and form the labeled positive image set; the remaining images in the positive class and all images in the negative class form the unlabeled image set.
For example, on the MNIST data set the scheme takes the images of the digit "0" as the positive class and the images of the remaining digits as the negative class. From all images in the positive class, the scheme randomly selects 30% as labeled positive images, i.e. labels these 30% of the positive-class images as +1, while the other images in the positive class and all negative images in the negative class constitute the unlabeled image set. A labeled positive image set and an unlabeled image set are thus obtained.
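A minimal Python sketch of this PU split, assuming an MNIST-like array already resized to 10×10 as described above; the function name and arguments are illustrative:

```python
import numpy as np

def build_pu_sets(images, labels, positive_digit=0, labeled_frac=0.30, seed=0):
    """Construct the labeled positive set PS and the unlabeled set US from an
    MNIST-like array of shape (n, 10, 10): a random 30% of the positive class
    is labeled +1, the remaining positives and all negatives go into US."""
    rng = np.random.default_rng(seed)
    pos = images[labels == positive_digit]
    neg = images[labels != positive_digit]
    idx = rng.permutation(len(pos))
    n_labeled = int(labeled_frac * len(pos))
    PS = pos[idx[:n_labeled]]                    # labeled positive images
    US = np.vstack([pos[idx[n_labeled:]], neg])  # unlabeled images
    # flatten to the 100-dimensional raw pixel vectors used as main features
    return PS.reshape(len(PS), -1), US.reshape(len(US), -1)
```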
Step 2, establishing representative positive prototype and negative prototype images
Reliable negative images are identified from the unlabeled image set using the Spy technique [Liu et al., 2002] and the Rocchio technique [Li and Liu, 2003], the results being denoted $N_1$ and $N_2$ respectively. Only images that both techniques regard as negative are treated as reliable negative images, namely $NS = N_1 \cap N_2$.
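The text fixes only the intersection rule $NS = N_1 \cap N_2$, not the classifier underlying the Spy step; the sketch below assumes a Gaussian naive-Bayes scorer and a 15% spy ratio, both illustrative:

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

def spy_negative_candidates(P, U, spy_ratio=0.15, seed=0):
    """Spy step sketch: hide a few labeled positives ("spies") inside the
    unlabeled set, train a scorer on positive vs. unlabeled, and keep as
    negative candidates the unlabeled images scored below every spy."""
    rng = np.random.default_rng(seed)
    n_spy = max(1, int(spy_ratio * len(P)))
    spy_mask = np.zeros(len(P), dtype=bool)
    spy_mask[rng.choice(len(P), size=n_spy, replace=False)] = True
    X = np.vstack([P[~spy_mask], U, P[spy_mask]])
    y = np.r_[np.ones((~spy_mask).sum()), np.zeros(len(U) + n_spy)]
    scores = GaussianNB().fit(X, y).predict_proba(np.vstack([U, P[spy_mask]]))[:, 1]
    threshold = scores[len(U):].min()        # least confident spy
    return U[scores[:len(U)] < threshold]    # candidate reliable negatives N1
```

The Rocchio candidates $N_2$ would be computed analogously from the Rocchio prototypes, and the final reliable negatives taken as the intersection of the two candidate sets.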
Suppose a reliable negative images are obtained from the unlabeled image set and u' unlabeled images remain. The a reliable negative images are divided into m micro-clusters by the k-means clustering method so as to establish the positive prototype images and negative prototype images, the calculation formulas being:
positive prototype image:
negative prototype image:
where $\|x\|$ denotes the norm of the image $x$; the tuning parameters α and β are set to 16 and 4, and t is a tuning parameter, generally set to 30; $p_v$ denotes the representative positive prototype image of the v-th micro-cluster and $n_v$ the representative negative prototype image of the v-th micro-cluster; PS denotes the labeled positive image set, US the unlabeled image set, and $NS_v$ the reliable negative image set of the v-th micro-cluster.
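The granted text renders the two prototype formulas as images. The sketch below therefore assumes a classical Rocchio-style combination (means of norm-normalized vectors weighted by α and β); the parameter t of the original formulas is omitted because its exact role is not recoverable from the text:

```python
import numpy as np
from sklearn.cluster import KMeans

def rocchio_prototypes(PS, NS, m=10, alpha=16.0, beta=4.0):
    """Split the reliable negatives NS into m micro-clusters with k-means and
    build one positive and one negative prototype per micro-cluster. The
    Rocchio-style combination below is an ASSUMPTION: the exact granted
    formulas (which also involve the parameter t) are not reproduced here."""
    unit = lambda V: V / np.linalg.norm(V, axis=1, keepdims=True)
    cluster_ids = KMeans(n_clusters=m, n_init=10, random_state=0).fit_predict(NS)
    p_mean = unit(PS).mean(axis=0)
    prototypes = []
    for v in range(m):
        n_mean = unit(NS[cluster_ids == v]).mean(axis=0)
        p_v = alpha * p_mean - beta * n_mean  # positive prototype of cluster v
        n_v = alpha * n_mean - beta * p_mean  # negative prototype of cluster v
        prototypes.append((p_v, n_v))
    return prototypes
```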
Step 3, constructing a difference image and calculating the similarity weight of the difference image
The reliable negative images are deleted from the unlabeled image set, and the set of remaining unlabeled images is denoted $US'$ ($US' = US - NS$). The positive images, negative images and remaining unlabeled images are converted into vectors with the HOG feature-extraction method, recorded as labeled positive vectors, negative-class vectors and unlabeled-class vectors.
Difference images, represented by difference vectors, are generated by differencing the labeled positive vectors, the negative-class vectors and the remaining unlabeled-class vectors, i.e. the labeled positive vectors against the negative-class vectors, the labeled positive vectors against the unlabeled-class vectors, and the unlabeled-class vectors against the negative-class vectors:
$X_e = x_i - x_j$, $X_q = x_i - x_k$, $X_n = x_k - x_j$
where $x_i \in PS$, $x_j \in NS$, $x_k \in US'$, and $X_e$, $X_q$ and $X_n$ are the difference images formed. Since it is not known which class a difference image belongs to, the scheme introduces the similarity model $\{X, (m^+(X), m^-(X))\}$ to represent a difference image $X$ (where $X$ stands for $X_e$, $X_q$ or $X_n$). To calculate the similarity weights of the difference images, the scheme provides a similarity-weight generation method based on a distance-similarity mechanism:
the scheme first calculates the Euclidean distance from each positive prototype image $p_v$ to each negative prototype image $n_v$ and selects the pair with the maximum Euclidean distance $d_{max}$, denoted $p_{max}$ and $n_{max}$; it then calculates the Euclidean distances $d_1$ and $d_2$ from the difference image to the representative positive prototype image $p_{max}$ and the representative negative prototype image $n_{max}$, from which the corresponding weights of the difference image are calculated,
where $m^+(X)$ and $m^-(X)$ are similarity weights representing the similarity of the difference image $X$ to the positive class and the negative class respectively, with $0 \le m^+(X) \le 1$ and $0 \le m^-(X) \le 1$. If the difference image belongs to the positive class, $m^+(X)$ equals 1 and the corresponding $m^-(X)$ equals 0, and vice versa: if it is negative, $m^+(X)$ equals 0 and $m^-(X)$ equals 1.
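The weight formulas are likewise rendered as images in the granted text. The distance-ratio form below is an assumption consistent with the stated boundary behaviour (m+ = 1 and m- = 0 for a positive-class difference image, and vice versa):

```python
import numpy as np

def similarity_weights(X, p_max, n_max):
    """Similarity weights of a difference image X. The distance-ratio form
    below is an ASSUMPTION consistent with the stated boundary cases."""
    d1 = np.linalg.norm(X - p_max)  # distance to representative positive prototype
    d2 = np.linalg.norm(X - n_max)  # distance to representative negative prototype
    return d2 / (d1 + d2), d1 / (d1 + d2)  # (m_plus, m_minus)

def representative_pair(prototypes):
    """Pick the pair (p_max, n_max) maximizing the Euclidean distance over all
    positive/negative prototype combinations."""
    pos = [p for p, _ in prototypes]
    neg = [n for _, n in prototypes]
    return max(((p, n) for p in pos for n in neg),
               key=lambda pn: np.linalg.norm(pn[0] - pn[1]))
```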
Step 4: construct the classification model from the difference images, the similarity weights and the privileged information, and carry out learning and classification of images:
the scheme combines the difference images with their similarity weights and privileged information in the learning stage of a ranking support vector machine so as to construct a more accurate classifier.
Let $\{(X_l, X_l^*, z_l)\}_{l=1}^{S}$ be the new image training set formed from the difference images, their labels and their privileged information. Formula 1 is trained on this new training set to obtain the optimal ranking function. Here $X_l$ denotes a difference image; the additional information $X_l^*$ associated with $X_l$ is the privileged information (for example, each training image in the MNIST data set carries an overall description, which an expert converts into a 21-dimensional feature vector used as privileged information); $z_l$ denotes the label of $X_l$; and $S$ ($S = p \cdot u$) denotes the number of images in the training set, where $p$ is the number of labeled positive images in the image data set and $u$ the number of unlabeled images in the original image data set. Both the ranking function and the correcting function are assumed linear, i.e. $f(X) = \langle w, X \rangle = w^T X$ ($T$ denotes transpose) and $\phi(X^*) = \langle w^*, X^* \rangle$, where $w$ and $w^*$ are weight vectors and $\langle\cdot,\cdot\rangle$ denotes the inner product. The best ranking function, the one guaranteeing the lowest misclassification rate, is sought from a set of admissible functions. The optimization problem can therefore be expressed as formula 1, in which:
$X_e = x_i - x_j$, $X_q = x_i - x_k$, $X_n = x_k - x_j$; $S_1 = p \cdot a$, $S_2 = p \cdot u'$, $S_3 = u' \cdot a$, where $a$ denotes the number of reliable negative images extracted, $u'$ the number of remaining unlabeled images, and $p$ the number of labeled positive images; $x_i \in PS$, $x_j \in NS$, $x_k \in US'$; $x_i^*$, $x_j^*$ and $x_k^*$ denote the privileged information of the positive images, negative images and remaining unlabeled images respectively, and $X_e^*$, $X_q^*$ and $X_n^*$ the privileged information of the difference images (the superscript $*$ indicates privileged information). $w$ and $w^*$ denote the weight vectors; $\langle w, w \rangle$ and $\langle w^*, w^* \rangle$ are the regularization terms of the difference image $X$ and of its privileged information $X^*$ respectively, to prevent overfitting; $\langle\cdot,\cdot\rangle$ denotes the inner product; $C_1$, $C_2$ and $C_3$ are penalty factors controlling the trade-off between the hyperplane margin and the errors; $\xi_e$, $\xi_q$ and $\xi_n$ are error terms; $\phi$ is the correcting function; $f(X_e) = \langle w, X_e \rangle$, $f(X_q) = \langle w, X_q \rangle$ and $f(X_n) = \langle w, X_n \rangle$ are the ranking functions of the difference images $X_e$, $X_q$ and $X_n$; $m^-(X_q)\,\xi_q$ and $m^-(X_n)\,\xi_n$ can be regarded as errors with different weights; $\gamma$ is a non-negative parameter reflecting the proportion of privileged information in the classifier model; and the parameters $\theta_e$, $\theta_q$ and $\theta_n$ control the coefficients of the error terms.
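Formula 1 itself is rendered as an image in the granted text. As a hedged reconstruction, the primal plausibly takes the standard ranking-SVM+ (LUPI) form, in which the correcting function supplies the slacks and the $m^-$ weights scale the error terms involving unlabeled images, matching the dual constraints quoted in claim 1; the exact granted formula may differ:

```latex
\begin{aligned}
\min_{w,\,w^{*}}\quad
 & \tfrac{1}{2}\langle w,w\rangle+\tfrac{\gamma}{2}\langle w^{*},w^{*}\rangle
   + C_{1}\sum_{e=1}^{S_{1}}\theta_{e}\,\xi_{e}
   + C_{2}\sum_{q=1}^{S_{2}}m^{-}(X_{q})\,\theta_{q}\,\xi_{q}
   + C_{3}\sum_{n=1}^{S_{3}}m^{-}(X_{n})\,\theta_{n}\,\xi_{n} \\
\text{s.t.}\quad
 & \langle w,X_{e}\rangle \ge 1-\xi_{e},\qquad
   \langle w,X_{q}\rangle \ge 1-\xi_{q},\qquad
   \langle w,X_{n}\rangle \ge 1-\xi_{n}, \\
 & \xi_{e}=\langle w^{*},X_{e}^{*}\rangle \ge 0,\qquad
   \xi_{q}=\langle w^{*},X_{q}^{*}\rangle \ge 0,\qquad
   \xi_{n}=\langle w^{*},X_{n}^{*}\rangle \ge 0 .
\end{aligned}
```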
Differentiating the Lagrangian function with respect to the primal variables $w$, $w^*$ and the error terms converts formula 1 into its dual, formula 2,
where $\alpha_e \ge 0$, $\alpha_q \ge 0$, $\alpha_n \ge 0$, $\beta_e \ge 0$, $\beta_q \ge 0$, $\beta_n \ge 0$, $\lambda_e \ge 0$, $\lambda_q \ge 0$ and $\lambda_n \ge 0$ are the Lagrange multipliers, and the parameters $\theta_e$, $\theta_q$ and $\theta_n$ respectively control the coefficients of the error terms.
Solving for the parameters $w$ and $w^*$:
the Lagrange multipliers $\alpha_e$, $\alpha_q$ and $\alpha_n$ of formula 2 can be computed with the SMO algorithm, after which the optimal solutions for $w$ and $w^*$ are obtained.
Once the optimal solution is obtained, the optimal ranking function (the optimal classifier) is obtained, and the images in the image data set can be ranked with it and thereby classified. The ranking-function value of a positive-class image is higher than that of a negative-class image, i.e. the relevance of positive-class images is higher than that of negative-class images.
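Instead of SMO on the dual, the sketched primal above can be handed to a generic convex solver; the following assumes the ranking-SVM+ form sketched earlier (not the exact granted formula), and all array names are illustrative:

```python
import cvxpy as cp
import numpy as np

def train_ranking_svm_plus(Xe, Xq, Xn, Xe_s, Xq_s, Xn_s, mq, mn,
                           C=(1.0, 1.0, 1.0), theta=(1.0, 1.0, 1.0), gamma=0.1):
    """Sketch: solve the assumed ranking-SVM+ primal with a convex solver.
    Xe/Xq/Xn are difference-image matrices, Xe_s/Xq_s/Xn_s their privileged
    counterparts, mq/mn the m^- similarity weights of the q- and n-pairs."""
    d, ds = Xe.shape[1], Xe_s.shape[1]
    w, ws = cp.Variable(d), cp.Variable(ds)
    slack = [Xe_s @ ws, Xq_s @ ws, Xn_s @ ws]  # correcting-function slacks
    obj = (0.5 * cp.sum_squares(w) + 0.5 * gamma * cp.sum_squares(ws)
           + C[0] * theta[0] * cp.sum(slack[0])
           + C[1] * theta[1] * cp.sum(cp.multiply(mq, slack[1]))
           + C[2] * theta[2] * cp.sum(cp.multiply(mn, slack[2])))
    cons = [Xe @ w >= 1 - slack[0], Xq @ w >= 1 - slack[1], Xn @ w >= 1 - slack[2],
            slack[0] >= 0, slack[1] >= 0, slack[2] >= 0]
    cp.Problem(cp.Minimize(obj), cons).solve()
    return w.value, ws.value
```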
For the MNIST data set, all images are first turned into feature vectors by the existing HOG feature-extraction method. The vectors of images in the data set are subtracted from each other to form difference vectors (each representing a difference image $X_{bc} = x_b - x_c$), and each difference image is then fed to the optimal ranking function; its predicted label follows from the sign of the ranking value:
if $w^T X_{bc} \ge 0$, then $x_b \succ x_c$, i.e. the relevance of image $x_b$ is higher than that of image $x_c$. Since positive-class images have higher relevance than negative-class images, $x_b$ is determined to be a positive image and $x_c$ a negative image. If the digit 0 is taken as the positive class in the experiment, image $x_b$ is an image of the digit 0; if the digit 1 is taken as the positive class, image $x_b$ is an image of the digit 1, and so on. Classification is thus completed.
For the PU learning problem, the method differs from two-step-strategy classification methods, which exclude the unlabeled images in the learning stage and train the classifier with positive and negative images only; many studies have shown that unlabeled images tend to lie near the decision boundary and play a critical role in constructing the classifier, so excluding them reduces classifier performance. The method instead considers the case where labeled positive images, a large number of unlabeled images and their privileged information exist simultaneously, and it integrates them into the learning stage of a ranking support vector machine rather than that of a standard support vector machine. For the PI (privileged information) learning problem, the method presented here differs from the classical LUPI paradigm in the same way: the training image set is integrated into the learning stage of a ranking support vector machine instead of a standard support vector machine.

Claims (5)

1. An image classification method combining privileged information and a ranking support vector machine, characterized by comprising the following steps:
step 1, randomly selecting images from an image data set to construct a positive class and a negative class, randomly selecting a portion of all the images in the positive class to label, recording them as labeled positive images, and forming a labeled positive image set; forming an unlabeled image set from the remaining images in the positive class and all the images in the negative class;
step 2, identifying reliable negative images from the unlabeled image set, and establishing positive prototype images and negative prototype images from the reliable negative images;
step 3, deleting the reliable negative images from the unlabeled image set, then converting the remaining unlabeled images, positive images and negative images into vectors, and differencing the vectors to form difference images;
representing each difference image by a similarity model;
step 4, constructing a classification model from the difference images, the similarity weights and the privileged information and training it, the trained classification model being used to classify images;
is provided withIs a new image training set formed by the difference image, the label and the privilege information thereof;
training 1 on the new image training set, and solving an optimal sorting function; wherein X is l Representing a difference image, and X l Related additional informationIs privilege information; z l X represents l (s=p·u represents the number of images in the training set, where p represents the number of marked positive images in the image dataset and u represents the number of unmarked images in the original image dataset;
assuming that both the ordering function and the correction function are linear, i.e. f (X) =<w,X>=w T X, T represents a transpose; phi (X) * )=<w * ,X * >Wherein w, w * The weight is represented by a weight that,<,>representing the inner product; the problem of optimization to be solved is expressed as:
in formula 1:
S 1 =p·a,S 2 =p·u',S 3 =u'·a,where a represents the number of reliable negative images extracted, u' represents the number of unlabeled images remaining, p represents the number of labeled positive images, x i ∈PS,x j ∈NS,x k ∈US',/>Privilege information respectively representing a positive image, a negative image, and the remaining unlabeled image,privilege information indicating a difference image, and superscript indicates privilege information; w and w * The weight vector is represented by a weight vector,<w,w>and<w * ,w * >privilege information X of difference image X and difference image X, respectively * Are the regular terms of (2) to prevent overfitting;<,>represent the inner product, C 1 、C 2 And C 3 Is a penalty factor controlling the trade-off between hyperplane edges and errors,is an error term->Is a correction function, f (X e )=<w,X e >,f(X q )=<w,X q >,f(X n )=<w,X n >Respectively difference image X e ,X q ,X n Is used as a ranking function of (a),and->Considered as error of different weights, X represents difference image, m + (X) and m - (X) is a similarity weight representing the similarity of the difference image X to the positive and negative classes, respectively; gamma is a non-negative parameter reflecting the proportion of privilege information in the classifier model, parameter θ e ,θ q And theta n Control error items +.>Coefficients of (2);
by combining Lagrangian functions with the original variables w, w *Differentiation is performed to convert equation 1 to dual form 2:
s.t.
C 1 θ eee =0
C 2 m - (X qqqq =0
C 3 m - (X nnnn =0
0≤α e ≤θ e C 1 ,0≤α q ≤θ q C 2 ,0≤α n ≤θ n C 1
… … type 2
wherein $\alpha_e \ge 0$, $\alpha_q \ge 0$, $\alpha_n \ge 0$, $\beta_e \ge 0$, $\beta_q \ge 0$, $\beta_n \ge 0$, $\lambda_e \ge 0$, $\lambda_q \ge 0$ and $\lambda_n \ge 0$ are Lagrange multipliers, and the parameters $\theta_e$, $\theta_q$ and $\theta_n$ respectively control the coefficients of the error terms;
solving parameters w, w *
The Lagrangian multiplier alpha can be calculated for the formula 2 by utilizing the SMO algorithm eqneqn W and w can be obtained by using the following formula * Is the optimal solution of (a):
and obtaining an optimal sorting function after obtaining the optimal solution, and sorting the images in the image data set by using the optimal sorting function so as to sort the images.
2. The image classification method combining privileged information and a ranking support vector machine of claim 1, wherein said identifying reliable negative images from the unlabeled image set comprises:
identifying reliable negative images from the unlabeled image set with the Spy technique and the Rocchio technique respectively, taking the intersection of the reliable negative images preliminarily identified by the two techniques, and treating the images in the intersection as the final reliable negative images.
3. The image classification method combining privileged information and a ranking support vector machine of claim 1, wherein said establishing representative positive and negative prototype images from the reliable negative images comprises:
given a reliable negative images, with u' unlabeled images remaining, dividing the a reliable negative images into m micro-clusters by the k-means clustering method so as to establish the positive prototype images and negative prototype images, the calculation formulas being:
positive prototype image:
negative prototype image:
where the parameters α, β and t are tuning parameters; $p_v$ denotes the representative positive prototype image of the v-th micro-cluster and $n_v$ the representative negative prototype image of the v-th micro-cluster; PS denotes the labeled positive image set, US the unlabeled image set, and $NS_v$ the reliable negative image set of the v-th micro-cluster.
4. The image classification method combining privileged information and a ranking support vector machine of claim 1, wherein the difference images are expressed as:
$X_e = x_i - x_j$, $X_q = x_i - x_k$, $X_n = x_k - x_j$
where $x_i \in PS$, $x_j \in NS$, $x_k \in US'$; $X_e$, $X_q$ and $X_n$ are the difference images formed, and $US'$ denotes the set of images remaining after the reliable negative images are deleted from the unlabeled image set.
5. The image classification method combining privileged information and a ranking support vector machine of claim 1, wherein representing the difference image by the similarity model is specifically:
$\{X, (m^+(X), m^-(X))\}$
where $X$ denotes a difference image, and $m^+(X)$ and $m^-(X)$ are similarity weights representing the similarity of the difference image $X$ to the positive class and the negative class respectively; $d_1$ and $d_2$ are the Euclidean distances from the difference image to the representative positive prototype image and the representative negative prototype image; the representative positive and negative prototype images are selected as follows:
calculating the Euclidean distance from each positive prototype image to each negative prototype image, and selecting the pair of positive and negative prototype images that maximizes the Euclidean distance $d_{max}$ as the representative positive prototype image and representative negative prototype image.
CN201910532935.7A 2019-06-19 2019-06-19 Image classification method combining privileged information and a ranking support vector machine Active CN110378384B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910532935.7A CN110378384B (en) 2019-06-19 2019-06-19 Image classification method combining privileged information and a ranking support vector machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910532935.7A CN110378384B (en) 2019-06-19 2019-06-19 Image classification method combining privileged information and a ranking support vector machine

Publications (2)

Publication Number Publication Date
CN110378384A CN110378384A (en) 2019-10-25
CN110378384B true CN110378384B (en) 2023-10-10

Family

ID=68249330

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910532935.7A Active CN110378384B (en) 2019-06-19 2019-06-19 Image classification method combining privileged information and a ranking support vector machine

Country Status (1)

Country Link
CN (1) CN110378384B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI743837B (en) * 2020-06-16 2021-10-21 緯創資通股份有限公司 Training data increment method, electronic apparatus and computer-readable medium
CN111967500B (en) * 2020-07-21 2023-10-13 广东工业大学 Classification method combining non-parallel dual support vector machine and sample privilege information

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750347A (en) * 2012-06-08 2012-10-24 天津大学 Method for reordering image or video search
CN109165673A (en) * 2018-07-18 2019-01-08 广东工业大学 Image classification method based on metric learning and more example support vector machines
CN109299162A (en) * 2018-11-08 2019-02-01 南京航空航天大学 A kind of Active Learning Method classified for positive class and data untagged

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Learning to Rank Using Privileged Information; Viktoriia Sharmanska et al.; IEEE International Conference on Computer Vision; 2013; pp. 825-832 *
Similarity-Based Approach for Positive and Unlabelled Learning; Yanshan Xiao et al.; International Joint Conference on Artificial Intelligence; 2011; pp. 1577-1581 *
Remote sensing image classification method based on positive samples and unlabeled samples; 裔阳 et al.; Computer Engineering and Applications; Feb. 2017; vol. 54, no. 4; pp. 160-166 *

Also Published As

Publication number Publication date
CN110378384A (en) 2019-10-25

Similar Documents

Publication Publication Date Title
Liu et al. Connectionist temporal classification with maximum entropy regularization
US7885466B2 (en) Bags of visual context-dependent words for generic visual categorization
Eigen et al. Nonparametric image parsing using adaptive neighbor sets
CN111126488B (en) Dual-attention-based image recognition method
CN107683469A (en) A kind of product classification method and device based on deep learning
CN105760888B (en) A kind of neighborhood rough set integrated learning approach based on hierarchical cluster attribute
CN103425996B (en) A kind of large-scale image recognition methods of parallel distributed
Tarawneh et al. Invoice classification using deep features and machine learning techniques
JP2003256441A (en) Document classification method and apparatus
CN110647907B (en) Multi-label image classification algorithm using multi-layer classification and dictionary learning
CN108427740B (en) Image emotion classification and retrieval algorithm based on depth metric learning
JP6897749B2 (en) Learning methods, learning systems, and learning programs
Tasci Voting combinations-based ensemble of fine-tuned convolutional neural networks for food image recognition
CN110008365B (en) Image processing method, device and equipment and readable storage medium
Freytag et al. Labeling examples that matter: Relevance-based active learning with gaussian processes
CN108446334A (en) Image retrieval method based on content for unsupervised countermeasure training
Schinas et al. CERTH@ MediaEval 2012 Social Event Detection Task.
CN104091038A (en) Method for weighting multiple example studying features based on master space classifying criterion
CN110378384B (en) Image classification method combining privilege information and ordering support vector machine
CN109299263A (en) File classification method, electronic equipment and computer program product
CN111062510A (en) Civil aviation engine blade crack prediction method based on transfer learning
CN116525075A (en) Thyroid nodule computer-aided diagnosis method and system based on few sample learning
CN113076490B (en) Case-related microblog object-level emotion classification method based on mixed node graph
Kwolek et al. Breast cancer classification on histopathological images affected by data imbalance using active learning and deep convolutional neural network
Liu et al. Few-Shot Object Detection in Remote Sensing Images via Label-Consistent Classifier and Gradual Regression

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant