CN110516092B

CN110516092B - Automatic image annotation method based on K nearest neighbor and random walk algorithm

Info

Publication number: CN110516092B
Application number: CN201910822291.5A
Authority: CN
Inventors: 王振武
Original assignee: China University of Mining and Technology Beijing CUMTB
Current assignee: China University of Mining and Technology Beijing CUMTB
Priority date: 2019-09-02
Filing date: 2019-09-02
Publication date: 2020-12-01
Anticipated expiration: 2039-09-02
Also published as: CN110516092A

Abstract

The invention provides an automatic image annotation method based on the K-nearest-neighbor and random walk algorithms, belonging to the field of image retrieval. The method constructs the vertex set of a probability graph model with the K-nearest-neighbor algorithm, constructs the edge set of the random walk graph from the correlations among the labels in the label sets of the training images, and then labels the image to be annotated automatically by means of the random walk algorithm. The invention has good adaptability, accuracy and universality.

Description

Automatic image annotation method based on K nearest neighbor and random walk algorithm

Technical Field

The invention belongs to the field of image retrieval. It addresses the problem that manual annotation cannot keep up with the massive volume of images to be annotated on the Internet, and realizes automatic image annotation using the K-nearest-neighbor and random walk algorithms.

Background

The invention realizes automatic annotation of massive numbers of Internet images and provides basic support for image retrieval. Traditional manual image annotation involves a heavy workload and inevitably introduces subjectivity and inaccuracy, so automatic image annotation by computer is imperative. Automatic image annotation means having a computer automatically add to an image semantic keywords that reflect its content; its use can effectively reduce the difficulty of current image retrieval.

The K-nearest-neighbor algorithm is a typical lazy learning method: it has no explicit training process, and it offers high classification accuracy, insensitivity to outliers, and no assumptions about the input data. The random walk algorithm was first described mathematically by Einstein in 1926. Since many entities in nature move in unpredictable ways, random walk algorithms are used to describe such irregular movement, in which a mobile node randomly selects a direction and speed to move from its current location to a new one. The random walk algorithm is a global optimization method and is not prone to falling into a local minimum.

Disclosure of Invention

The invention aims to improve the accuracy of the similarity calculation between images by treating the discrete features and the continuous features of an image separately, using Hamming loss for the former and a Gaussian kernel function for the latter; to reduce the time complexity and the space complexity of the algorithm by constructing a probability graph model based on the K-nearest-neighbor algorithm; and to realize automatic image annotation by performing a random walk on the probability graph model. The method has good adaptability, accuracy and universality.

The image automatic labeling method based on the K nearest neighbor and the random walk algorithm comprises the following steps:

Step (A1): extract the features of the images to form a training set χ_train (images i = 1, 2, …, m) and a test set (images i = m+1, m+2, …, m+n), where i is the index of an image; x_i is the feature vector of the i-th image, made up of D₁ continuous features and D − D₁ discrete features, D being the total number of features; y_i is the label set of x_i, represented by a label vector; L = {l₁, l₂, …, l_Q} is the total label set, l_q is the q-th label in L, q is the index of the label, and Q is the total number of labels;

Step (A2): compute the similarity Similarity(x_i, x_j) between x_i and x_j (i, j = 1, 2, …, m+n) based on Hamming loss and a Gaussian kernel function;

Step (A3): construct a probability graph model based on the K nearest neighbors, the model consisting of a vertex set and an edge set;

Step (A4): based on the random walk algorithm, compute for each image to be labeled the vector of prediction probabilities that it belongs to each label, whose q-th component is the prediction probability that x_i belongs to l_q (i = m+1, m+2, …, m+n; q = 1, 2, …, Q);

Step (A5): construct the predicted label set y'_i of each image from the prediction probabilities (i = m+1, m+2, …, m+n).

The method for calculating the similarity between the images based on the Hamming loss and the Gaussian kernel function comprises the following steps:

Step (B1): construct the similarity NomSimilarity(x_i, x_j) between x_i and x_j over their discrete features, based on Hamming loss (i, j = 1, 2, …, m+n, where i and j cannot both be greater than m);

Step (B2): construct the similarity NumSimilarity(x_i, x_j) between x_i and x_j over their continuous features, based on a Gaussian kernel function of the distance between the two continuous-feature vectors, scaled by an adjustment factor;

Step (B3): construct the similarity Similarity(x_i, x_j) between x_i and x_j as the weighted sum of NomSimilarity(x_i, x_j) and NumSimilarity(x_i, x_j).
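Steps (B1) to (B3) can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the formulas of the invention: the first D₁ entries of a feature vector are assumed to be the continuous features, the Gaussian kernel is assumed to take the form exp(−d²/(2σ²)) with `sigma` standing in for the adjustment factor, and the weights D₁/D and (D − D₁)/D of the weighted sum are assumed. The function names are hypothetical.

```python
import math

def nominal_similarity(a, b):
    """1 - Hamming loss: share of discrete features on which a and b agree."""
    matches = sum(1 for u, v in zip(a, b) if u == v)
    return matches / len(a)

def numeric_similarity(a, b, sigma=1.0):
    """Gaussian kernel of the Euclidean distance between continuous features.

    sigma plays the role of the adjustment factor (assumed form).
    """
    dist_sq = sum((u - v) ** 2 for u, v in zip(a, b))
    return math.exp(-dist_sq / (2.0 * sigma ** 2))

def similarity(x_i, x_j, d1, sigma=1.0):
    """Weighted sum of both parts; the weights D1/D and (D - D1)/D are assumed."""
    d = len(x_i)
    # Assumption: the first d1 entries are continuous, the rest are discrete.
    num = numeric_similarity(x_i[:d1], x_j[:d1], sigma) if d1 > 0 else 0.0
    nom = nominal_similarity(x_i[d1:], x_j[d1:]) if d1 < d else 0.0
    return (d1 / d) * num + ((d - d1) / d) * nom
```

With two continuous and two discrete features, two images that agree on everything except one discrete feature score 0.5 · 1 + 0.5 · 0.5 = 0.75 under these assumed weights.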

The method for constructing the probability map model based on the K nearest neighbor comprises the following steps:

Step (C1): construct the set of K nearest neighbors of x_i in the training set χ_train, where K is the number of elements in that set, and an image x belongs to the set if, when all images in χ_train are sorted by their similarity to x_i in descending order, x is among the first K;

Step (C2): construct the vertex set of the graph based on the K nearest neighbors of x_i, where r is the index of a vertex in the vertex set;

Step (C3): construct the edge set of the graph, where s is likewise the index of a vertex in the vertex set, and the edge between v_r and v_s is determined by the label sets of the images corresponding to v_r and v_s;

Step (C4): construct the probability graph model from the vertex set and the edge set.
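Steps (C1) to (C4) can be sketched as below. The text only states that edges depend on the correlation between label sets; the sketch assumes Jaccard overlap as that correlation, assumes that vertex 0 is the image to be labeled and that its edges are weighted by image similarity (it has no label set yet), and uses hypothetical function names throughout.

```python
def k_nearest_neighbors(sims_to_train, k):
    """Indices of the k training images most similar to the query, best first."""
    order = sorted(range(len(sims_to_train)), key=lambda j: -sims_to_train[j])
    return order[:k]

def jaccard(labels_a, labels_b):
    """Label-set overlap, used here as an assumed correlation measure."""
    a, b = set(labels_a), set(labels_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def build_graph(sims_to_train, train_label_sets, k):
    """Return (neighbor indices, (K+1) x (K+1) weight matrix).

    Vertex 0 is the query image; vertices 1..K are its nearest neighbors.
    """
    nbrs = k_nearest_neighbors(sims_to_train, k)
    size = len(nbrs) + 1
    W = [[0.0] * size for _ in range(size)]
    for r, idx_r in enumerate(nbrs, start=1):
        # Assumption: query-to-neighbor edges use the image similarity.
        W[0][r] = W[r][0] = sims_to_train[idx_r]
        for s, idx_s in enumerate(nbrs, start=1):
            if r != s:
                W[r][s] = jaccard(train_label_sets[idx_r], train_label_sets[idx_s])
    return nbrs, W
```

Restricting the graph to K + 1 vertices per query is what keeps the later random walk small: the matrix size depends on K, not on the size of the training set.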

The steps of constructing the prediction probability of each label of the image based on the random walk algorithm are as follows:

Step (D1): construct the transition matrix based on the probability graph model, in which the element for the vertex pair v_r, v_s is located in row r+1 and column s+1;

Step (D2): construct the jump probability vector based on the probability graph model, where 1_{K+1} is the (K+1)-dimensional column vector whose components are all 1;

Step (D3): construct the random walk formula based on the probability graph model, where k is the iteration number, α is the probability that a jump occurs during the random walk, the formula yields the probability distribution vector at the k-th step of the random walk, and the starting vector is the probability distribution vector for k = 0;

Step (D4): iterate until a stable probability distribution vector is obtained, that is, until the change between successive probability distribution vectors is within the iteration error;

Step (D5): compute the final probability distribution vector, using the prior probability of each label l_q;

Step (D6): compute the vector of prediction probabilities that the image belongs to each label, whose q-th component is the prediction probability that x_i belongs to l_q.
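Steps (D1) to (D6) can be sketched as follows, under loudly stated assumptions: the transition matrix is taken to be the row-normalized weight matrix, the jump vector is taken to be 1_{K+1}/(K + 1), the update is taken to be the usual random walk with restart p ← (1 − α)·Mᵀp + α·d, and a label's probability is taken to be the normalized stationary mass on the neighbors that carry it. The label priors of step (D5) are not modeled. None of these choices is confirmed by the text, and all names are hypothetical.

```python
def random_walk(W, alpha=0.15, eps=1e-8, max_iter=1000):
    """Iterate p <- (1 - alpha) * M^T p + alpha * d until the L1 change < eps."""
    size = len(W)  # K + 1 vertices
    M = []
    for row in W:
        total = sum(row)
        # Row-normalize; a row with no outgoing weight becomes uniform.
        M.append([w / total for w in row] if total > 0 else [1.0 / size] * size)
    d = [1.0 / size] * size  # assumed jump vector: 1_{K+1} / (K + 1)
    p = list(d)              # assumed starting distribution (k = 0)
    for _ in range(max_iter):
        p_next = [
            (1.0 - alpha) * sum(M[r][s] * p[r] for r in range(size)) + alpha * d[s]
            for s in range(size)
        ]
        if sum(abs(a - b) for a, b in zip(p_next, p)) < eps:
            return p_next
        p = p_next
    return p

def label_probabilities(p, neighbor_label_sets, all_labels):
    """Assumed rule: a label's score is the mass on neighbors that carry it."""
    scores = [
        sum(p[r] for r, labels in enumerate(neighbor_label_sets, start=1)
            if q in labels)
        for q in all_labels
    ]
    total = sum(scores)
    return [s / total for s in scores] if total > 0 else scores
```

Here `neighbor_label_sets` lists the label sets of vertices 1 to K in order; vertex 0, the image being labeled, contributes no label mass.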

The steps of constructing the image prediction label set based on the label prediction probabilities are as follows:

Step (E1): for each label l_q (q = 1, 2, …, Q), compute the average length of the label sets of the training images that carry l_q;

Step (E2): compute the predicted length length(y'_i) of the label set of each test image x_i (i = m+1, m+2, …, m+n);

Step (E3): sort the prediction probabilities of x_i in descending order;

Step (E4): from the sorted prediction probabilities, select the first length(y'_i) and take the set of labels corresponding to them as the predicted label set of the image, which completes the automatic annotation of the image.
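Steps (E1) to (E4) can be sketched as below. The formula for the predicted length is not recoverable from the text, so the sketch assumes it is the probability-weighted mean of the per-label average lengths, rounded and kept at 1 or more; the function names are hypothetical.

```python
def average_label_set_lengths(train_label_sets, all_labels):
    """For each label, mean size of the training label sets that contain it."""
    result = []
    for q in all_labels:
        sizes = [len(s) for s in train_label_sets if q in s]
        result.append(sum(sizes) / len(sizes) if sizes else 0.0)
    return result

def predict_label_set(probs, avg_lengths, all_labels):
    """Keep the top-`length` labels by predicted probability."""
    # Assumption: predicted length = sum_q probs[q] * avg_lengths[q], rounded.
    expected = sum(p * a for p, a in zip(probs, avg_lengths))
    length = max(1, int(round(expected)))
    order = sorted(range(len(probs)), key=lambda q: -probs[q])
    return [all_labels[q] for q in order[:length]]
```

Letting the length vary per image, rather than fixing a global top-N, means an image whose likely labels usually occur in long label sets receives more labels than one whose likely labels usually occur alone.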

The invention has the following advantages:

1. The invention can label all types of images and has strong universality.

2. The invention can process images containing continuous features and discrete features and has strong adaptability.

3. The invention annotates images automatically based on the K-nearest-neighbor and random walk algorithms, and offers high robustness and accuracy.

Drawings

FIG. 1 is a flow chart of the present invention for computing inter-image similarity based on Hamming loss and Gaussian kernel function

FIG. 2 is a flow chart of the present invention for constructing a probabilistic graphical model based on K-nearest neighbors

FIG. 3 is a flow chart of the present invention for constructing the prediction probability that an image belongs to each label based on the random walk algorithm

FIG. 4 is a flow chart of constructing an image prediction tag set based on tag prediction probability according to the present invention

FIG. 5 is a simplified flow chart of the present invention

Detailed Description

The invention constructs an image similarity measure based on Hamming loss and a Gaussian kernel function, constructs a probability graph model based on the K-nearest-neighbor algorithm, and realizes automatic annotation of images by a random walk on the probability graph model.

The process of calculating the similarity between images based on Hamming loss and Gaussian kernel function is as follows:

(1) as shown in FIG. 1, the similarity between images with respect to discrete features is calculated by Hamming loss, and the similarity between images with respect to continuous features is calculated by a Gaussian kernel function;

(2) the similarity over the discrete features and the similarity over the continuous features are combined by weighted averaging to obtain the similarity between the images.

The process of constructing the probability graph model based on the K nearest neighbors is as follows:

(1) as shown in FIG. 2, the vertex set of the probability graph model is constructed from the K nearest neighbors of the image, and the edge set of the probability graph model is constructed from the correlation between the label sets of the corresponding images;

(2) the vertex set and the edge set are combined into the probability graph model.

The process of constructing the prediction probability that the image belongs to each label based on the random walk algorithm is as follows:

(1) as shown in FIG. 3, the state transition matrix and the jump probability vector of the random walk are constructed from the probability graph model;

(2) the probability distribution vector of the image over the vertices of the probability graph model is obtained by random walk;

(3) the probability that the image belongs to each label is calculated from the probability distribution vector.
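The random walk described above can be written out as a worked equation. The form below is the standard random walk with restart and is an assumption, since the formula of the invention is not reproduced in this text; M (the row-stochastic state transition matrix), d (the jump probability vector), p (the probability distribution vector) and ε (the iteration error) are symbols chosen here for illustration.

```latex
\begin{aligned}
p^{(k+1)} &= (1-\alpha)\, M^{\top} p^{(k)} + \alpha\, d,
  \qquad d = \tfrac{1}{K+1}\,\mathbf{1}_{K+1}, \\
\bigl\lVert p^{(k+1)} - p^{(k)} \bigr\rVert_{1} &< \varepsilon
  \quad \text{(stopping rule)}, \\
p^{*} &= (1-\alpha)\, M^{\top} p^{*} + \alpha\, d
  \;\Longrightarrow\;
  p^{*} = \alpha \bigl( I - (1-\alpha)\, M^{\top} \bigr)^{-1} d .
\end{aligned}
```

For 0 < α ≤ 1 the matrix (1 − α)Mᵀ has spectral radius below 1, so under this assumed form the inverse exists and the iteration converges to the same p* from any starting distribution.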

The process of constructing the image prediction label set based on the label prediction probabilities is as follows:

(1) as shown in FIG. 4, the average length of the label sets of the images belonging to each label and the predicted length of the label set of the image are calculated;

(2) the predicted label set of the image is constructed.

Claims

1. An automatic image annotation method based on K nearest neighbors and random walk, characterized in that the similarity between images is calculated based on Hamming loss and a Gaussian kernel function, a probability model graph is constructed based on the K nearest neighbors, the prediction probabilities that an image belongs to each label are constructed based on random walk, and an image prediction label set is constructed based on the label prediction probabilities, the method comprising the following steps in sequence:

Step (A1): extract the features of the images to form a training set χ_train (images i = 1, 2, …, m) and a test set (images i = m+1, m+2, …, m+n), where i is the index of an image; x_i is the feature vector of the i-th image, made up of D₁ continuous features and D − D₁ discrete features, D being the total number of features; y_i is the label set of x_i, represented by a label vector; L = {l₁, l₂, …, l_Q} is the total label set, l_q is the q-th label in L, q is the index of the label, and Q is the total number of labels;

Step (A2): compute the similarity Similarity(x_i, x_j) between x_i and x_j (i, j = 1, 2, …, m+n) based on Hamming loss and a Gaussian kernel function;

Step (A3): construct a probability model graph based on the K nearest neighbors, the graph consisting of a vertex set and an edge set;

Step (A4): based on the random walk algorithm, compute for each image to be labeled the vector of prediction probabilities that it belongs to each label;

Step (A5): construct the predicted label set y'_i of each image from the prediction probabilities (i = m+1, m+2, …, m+n);

Step (B1): construct the similarity between x_i and x_j over their discrete features, based on Hamming loss (i, j = 1, 2, …, m+n, where i and j cannot both be greater than m);

Step (B2): construct the similarity between x_i and x_j over their continuous features, based on a Gaussian kernel function of the distance between the two continuous-feature vectors, scaled by an adjustment factor;

The method for constructing the probability model graph based on the K nearest neighbors comprises the following steps:

Step (C1): construct the set of K nearest neighbors of x_i in the training set χ_train, where K is the number of elements in that set, and x belonging to the K nearest neighbors of x_i in χ_train means that, when the images corresponding to all feature vectors in χ_train are sorted by their similarity to x_i in descending order, x is among the first K;

Step (C2): construct the vertex set of the graph, where r is the index of the first vertex;

Step (C3): construct the edge set of the graph, where s is the index of the second vertex, and the edge between v_r and v_s is determined by the label sets of the images corresponding to v_r and v_s;

Step (C4): construct the probability model graph from the vertex set and the edge set;

The method for constructing the prediction probabilities that the image belongs to each label based on random walk comprises the following steps:

Step (D1): construct the transition matrix based on the probability model graph, in which the element for the vertex pair v_r, v_s is located in row r+1 and column s+1;

Step (D2): construct the jump probability vector based on the probability model graph, where 1_{K+1} is the (K+1)-dimensional column vector whose components are all 1;

Step (D3): construct the random walk formula based on the probability model graph, where k is the iteration number, α is the probability that a jump occurs during the random walk, the formula yields the probability distribution vector at the k-th step of the random walk, and the starting vector is the probability distribution vector for k = 0;

Step (D4): iterate until a stable probability distribution vector is obtained, the stable vector being one for which the change between successive iterations is within the iteration error;

Step (D5): compute the final probability distribution vector, using the prior probability of each label l_q;

Step (D6): compute the vector of prediction probabilities that the image belongs to each label, whose q-th component is the prediction probability that x_i belongs to l_q;

The image prediction label set is constructed based on the label prediction probabilities by the following steps:

Step (E1): for each label l_q, compute the average length of the label sets of the training images that carry l_q;

Step (E2): compute the predicted length length(y'_i) of the label set of each test image x_i;

Step (E3): sort the prediction probabilities of x_i in descending order;

Step (E4): from the sorted prediction probabilities, select the first length(y'_i) and form the image prediction label set from the labels corresponding to those probabilities.