CN115205632B - Semi-supervised multi-view metric learning method in Riemann space - Google Patents

Semi-supervised multi-view metric learning method in Riemann space

Info

Publication number
CN115205632B
Authority
CN
China
Prior art keywords
matrix
view
semi-supervised
learning method
Prior art date
Legal status
Active
Application number
CN202210847014.1A
Other languages
Chinese (zh)
Other versions
CN115205632A (en)
Inventor
梁建青 (Liang Jianqing)
梁吉业 (Liang Jiye)
Current Assignee
Shanxi Jinxinan Technology Co ltd
Original Assignee
Shanxi University
Priority date
Filing date
Publication date
Application filed by Shanxi University filed Critical Shanxi University
Priority to CN202210847014.1A priority Critical patent/CN115205632B/en
Publication of CN115205632A publication Critical patent/CN115205632A/en
Application granted granted Critical
Publication of CN115205632B publication Critical patent/CN115205632B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774 Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/70 Labelling scene content, e.g. deriving syntactic or semantic representations
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02T CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00 Road transport of goods or passengers
    • Y02T10/10 Internal combustion engine [ICE] based vehicles
    • Y02T10/40 Engine management systems

Abstract

The invention discloses a semi-supervised multi-view metric learning method in Riemann space, which comprises the following steps: extracting multi-view features of images from the training set and generating sample pairs; constructing multi-view intra-class and inter-class divergence matrices, embedding semantic information into the feature subspace, and realizing the migration and fusion of data and knowledge; embedding the data and knowledge from Euclidean space into a Riemannian manifold subspace to complete the feature mapping; and carrying out multi-view fusion to obtain a unified representation of the features. The invention addresses the heavy dependence on strong supervision information and on Euclidean space found in the related art, provides a novel and efficient metric learning method suited to complex application scenarios and weakly supervised labeling environments, and improves performance on weakly supervised heterogeneous data mining and pattern recognition tasks.

Description

Semi-supervised multi-view metric learning method in Riemann space
Technical Field
The invention belongs to the technical field of machine learning, and particularly relates to a semi-supervised multi-view metric learning method in Riemann space.
Background
Distance metrics play a decisive role in the performance of most machine learning methods. Faced with complex and varied application scenarios, conventional metric functions can no longer capture the true structure of the data. How to learn flexible, task- and data-driven distance metrics is a research hotspot in machine learning. As one of the mainstream techniques in machine learning today, metric learning aims to automatically learn a suitable metric from data, and is widely used in face recognition, information retrieval, network link prediction, and other fields.
In the context of big data, data exhibit high dimensionality, multi-source heterogeneity, and extremely weak supervision. This makes it difficult to learn fast and effective distance metrics, and poses unprecedented challenges to intelligent information processing in traditional machine learning, pattern recognition, and related fields. Heavy dependence on strong supervision information and on Euclidean space is a common problem in current metric learning research, and it greatly limits the applicability of existing learning models and algorithms in practice.
Disclosure of Invention
The invention provides a semi-supervised multi-view metric learning method in Riemann space, which aims to overcome the heavy dependence on strong supervision information and on Euclidean space. The invention can accurately describe the manifold distribution of data in weakly supervised labeling environments and non-Euclidean spaces, and improves the performance of metric learning on weakly supervised heterogeneous data.
The technical scheme of the invention is as follows. A semi-supervised multi-view metric learning method in Riemann space comprises the following specific steps:
step 101: extracting multi-view features of the image from the training set and generating a sample pair;
step 102: constructing a multi-view intra-class and inter-class divergence matrix, embedding semantic information into a feature subspace, and realizing migration and fusion of data and knowledge;
step 103: embedding data and knowledge from Euclidean space into Riemann manifold subspace to complete feature mapping;
step 104: and carrying out multi-view fusion to obtain the unified representation of the features.
Optionally, step 101, extracting multi-view features of the image from the training set and generating sample pairs, further includes:
The training set is passed through local HOG features, SIFT feature descriptors, and a deep convolutional neural network; after a bag-of-words model and the final fully connected layer of the feature extraction network, a 500-dimensional bag-of-words representation and 1024-dimensional deep features of the image are obtained, respectively. The similar sample-pair set S, the dissimilar sample-pair set D, and the unlabeled sample set U are then obtained according to the sample labels.
Optionally, the loss function is L = L_dis + λ_1 L_reg1 + λ_2 L_reg2,
where L is the total metric learning loss, L_dis is the discrimination loss, λ_1 and λ_2 are balance parameters between the objectives, L_reg1 is the semi-supervised graph regularization loss, L_reg2 is the metric regularization loss, w_v is the weight of view v, A^(v) is the metric matrix of view v, S^(v) is the intra-class divergence of view v, D^(v) is the inter-class divergence of view v, X^(v) is the feature matrix of view v, L in the graph regularizer is the Laplacian matrix, D_sld(A^(v), A_0) is the symmetric LogDet divergence, and A_0 is a prior symmetric positive-definite matrix.
Optionally, the discrimination loss L_dis yields, under the metric matrix constructed for each view, a distance metric with strong discriminative power.
Optionally, the Laplacian matrix L = D - W, where D is the diagonal degree matrix with D_ii = Σ_j W_ij and W is the adjacency matrix of the sample graph.
Optionally, the semi-supervised graph regularization loss L_reg1 and the Laplacian matrix L are built on the manifold assumption that samples located within a local region of the low-dimensional manifold have similar classes.
Optionally, the metric regularization loss L_reg2 ensures that A^(v) has a solution even when the matrix S^(v) is nearly singular or non-invertible.
Optionally, for the discrimination loss term L_dis of the objective function, the solution of the metric matrix A^(v) is generalized to an objective defined over the Riemannian manifold of SPD matrices, where δ_R denotes the Riemannian distance between SPD matrices:
δ_R(X, Y) := ||log(Y^{-1/2} X Y^{-1/2})||_F, for symmetric positive-definite X and Y.
Optionally, the metric matrix A^(v) of each view is obtained first, and the view weights w are then solved.
The metric learning method solves the problem of heavy dependence on strong supervision information and on Euclidean space in the related art, provides a novel and efficient metric learning method suited to complex application scenarios and weakly supervised labeling environments, and improves the performance of metric learning on weakly supervised heterogeneous data.
Drawings
FIG. 1 is a flow chart of a semi-supervised multi-view metric learning method in Riemann space according to an embodiment of the present invention;
FIG. 2 is a specific technical scheme of an embodiment of the present invention.
Detailed Description
In order to enable those skilled in the art to better understand the present invention, the technical solutions in the embodiments of the present invention are described below clearly and completely with reference to the accompanying drawings. It is apparent that the described embodiments are only some, and not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments of the present invention without inventive effort shall fall within the scope of protection of the present invention.
Assume there are N samples drawn from m views. For each view, the invention obtains a distance metric with strong discriminative power under the metric matrix constructed for that view. To make effective use of the large number of unlabeled samples, the invention constructs a Laplacian matrix and a semi-supervised graph regularization loss that guide the data distribution based on the manifold assumption. Considering the case where the intra-class divergence matrix is nearly singular or non-invertible, the invention uses the symmetric LogDet divergence to construct a metric regularization loss, thereby ensuring that each view's metric matrix has a solution. Finally, the invention generalizes the solution of each metric matrix from Euclidean space to Riemannian space, so that the learned distance metric better meets the needs of real, complex application scenarios. In the solving process, the invention first obtains each view's metric matrix and then computes the view weights.
The steps of the invention are specifically described below with reference to fig. 1 and 2:
step 101: multi-view features of the image are extracted from the training set and pairs of samples are generated.
The training set is passed through local HOG features, SIFT feature descriptors, and a deep convolutional neural network; after a bag-of-words model and the final fully connected layer of the feature extraction network, a 500-dimensional bag-of-words representation and 1024-dimensional deep features of the image are obtained, respectively. The similar sample-pair set S, the dissimilar sample-pair set D, and the unlabeled sample set U are then obtained according to the sample labels.
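For illustration only, the following is a minimal Python sketch of the pair-generation part of step 101. It assumes the per-view features (e.g., the 500-dimensional bag-of-words representation and the 1024-dimensional deep features) have already been extracted by separate routines, and shows only how the sets S, D, and U might be built from partially labeled data, with unlabeled samples marked by -1.

import itertools

def build_pair_sets(labels):
    """Split sample indices into similar pairs S, dissimilar pairs D,
    and the unlabeled index set U. Unlabeled samples carry the label -1."""
    labeled = [i for i, y in enumerate(labels) if y != -1]
    U = [i for i, y in enumerate(labels) if y == -1]
    S, D = [], []
    for i, j in itertools.combinations(labeled, 2):
        if labels[i] == labels[j]:
            S.append((i, j))   # same class: similar pair
        else:
            D.append((i, j))   # different class: dissimilar pair
    return S, D, U

# Example: 6 samples, two of them unlabeled.
labels = [0, 0, 1, -1, 1, -1]
S, D, U = build_pair_sets(labels)
print(S)   # [(0, 1), (2, 4)]
print(D)   # [(0, 2), (0, 4), (1, 2), (1, 4)]
print(U)   # [3, 5]

The same index sets can then be reused across all views, since the views describe the same underlying samples.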
Step 102: and constructing a multi-view intra-class and inter-class divergence matrix, embedding semantic information into a feature subspace, and realizing migration and fusion of data and knowledge.
Using the large-margin idea, the discrimination loss L_dis yields, under the metric matrix constructed for each view, a distance metric with strong discriminative power.
Based on the manifold assumption that samples located in a local region of the low-dimensional manifold have similar classes, a semi-supervised graph regularization loss L_reg1 is constructed together with a Laplacian matrix L = D - W, where D is the diagonal degree matrix with D_ii = Σ_j W_ij and W is the adjacency matrix of the sample graph.
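As an illustrative sketch only (the patent's own definition of W is not legible in this text), a k-nearest-neighbor graph with heat-kernel weights is one common way to instantiate the adjacency matrix W and the resulting Laplacian L = D - W:

import numpy as np

def graph_laplacian(X, k=5, sigma=1.0):
    """X: (n, d) feature matrix, one sample per row.
    Returns the unnormalized graph Laplacian D - W for a k-NN,
    heat-kernel adjacency (an assumed, common construction)."""
    n = X.shape[0]
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2 * X @ X.T   # pairwise squared distances
    np.fill_diagonal(d2, np.inf)                   # exclude self-edges
    W = np.zeros((n, n))
    knn = np.argsort(d2, axis=1)[:, :k]
    for i in range(n):
        W[i, knn[i]] = np.exp(-d2[i, knn[i]] / (2 * sigma**2))
    W = np.maximum(W, W.T)            # symmetrize the adjacency
    Dg = np.diag(W.sum(axis=1))       # degree matrix, D_ii = sum_j W_ij
    return Dg - W

Labeled pairs from S and D can additionally be wired into W so that the regularizer propagates label information to the unlabeled samples.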
Considering the case where the intra-class divergence matrix is nearly singular or non-invertible, a metric regularization loss L_reg2 is constructed from the symmetric LogDet divergence, thereby ensuring that the metric matrix A^(v) of each view has a solution.
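The patent's expression for D_sld is not reproduced legibly here; the standard symmetrized LogDet (Burg) divergence between d x d symmetric positive-definite matrices, assumed here for illustration, is:

% Standard symmetrized LogDet (Burg) divergence; assumed for illustration,
% since the patent's own expression is not legible in this text.
D_{sld}\big(A^{(v)}, A_0\big)
  = \operatorname{tr}\big(A^{(v)} A_0^{-1}\big)
  + \operatorname{tr}\big(A_0 (A^{(v)})^{-1}\big) - 2d

This quantity is symmetric in its arguments, non-negative, and zero only when A^(v) = A_0, which is what makes it a natural regularizer pulling each view's metric toward the prior A_0 when S^(v) is nearly singular.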
The total loss function is defined as L = L_dis + λ_1 L_reg1 + λ_2 L_reg2,
where L is the total metric learning loss, L_dis is the discrimination loss, λ_1 and λ_2 are balance parameters between the objectives, L_reg1 is the semi-supervised graph regularization loss, L_reg2 is the metric regularization loss, w_v is the weight of view v, A^(v) is the metric matrix of view v, S^(v) is the intra-class divergence of view v, D^(v) is the inter-class divergence of view v, X^(v) is the feature matrix of view v, L in the graph regularizer is the Laplacian matrix, D_sld(A^(v), A_0) is the symmetric LogDet divergence, and A_0 is a prior symmetric positive-definite matrix.
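The per-term expressions are not reproduced legibly in this text. One formulation that is consistent with the symbols defined above and with the closed-form solution given in step 103 (an assumption for illustration, not a verbatim copy of the patent's equation; X^(v) is assumed to store the samples of view v as columns) is:

% Assumed formulation consistent with the symbols above and the
% weighted-geometric-mean closed form of step 103.
L = \sum_{v=1}^{m} w_v \Big[ \operatorname{tr}\big(A^{(v)} S^{(v)}\big)
      + \operatorname{tr}\big((A^{(v)})^{-1} D^{(v)}\big) \Big]          % L_{dis}
  + \lambda_1 \sum_{v=1}^{m} w_v\,
      \operatorname{tr}\big(X^{(v)} L\, (X^{(v)})^{\top} A^{(v)}\big)    % L_{reg1}
  + \lambda_2 \sum_{v=1}^{m} D_{sld}\big(A^{(v)}, A_0\big)               % L_{reg2}

Under this reading, the first term rewards metrics that shrink intra-class distances while expanding inter-class distances, the second term smooths the metric over the sample graph, and the third term keeps each metric close to the prior A_0.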
Step 103: and embedding the data and the knowledge from the Euclidean space into the Riemann manifold subspace to finish the feature mapping.
First, with w fixed, A is solved. For the discrimination loss term L_dis of the objective function, the solution of the metric matrix A^(v) is generalized to an objective defined over the Riemannian manifold of SPD matrices, where δ_R denotes the Riemannian distance between SPD matrices:
δ_R(X, Y) := ||log(Y^{-1/2} X Y^{-1/2})||_F, for symmetric positive-definite X and Y.
the above problem has a closed-form solution in the Riemann manifold subspace in the form of a weighted geometric mean
A (v) =(S (v) ) -1 # t D (v)
Further for the overall objective function, each view measures matrix A (v) Solution of (2)
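The following Python sketch, using SciPy, illustrates the Riemannian (affine-invariant) distance δ_R and the weighted geometric mean (S^(v))^{-1} #_t D^(v) named in the closed-form solution. The interpolation parameter t = 0.5 and the small ridge added to keep S^(v) invertible are illustrative assumptions, not values taken from the patent.

import numpy as np
from scipy.linalg import sqrtm, logm, fractional_matrix_power, inv

def riemannian_distance(X, Y):
    """delta_R(X, Y) = || log(Y^{-1/2} X Y^{-1/2}) ||_F for SPD X, Y."""
    Y_inv_sqrt = inv(sqrtm(Y))
    M = Y_inv_sqrt @ X @ Y_inv_sqrt
    return np.linalg.norm(logm(M), 'fro')

def weighted_geometric_mean(A, B, t=0.5):
    """A #_t B = A^{1/2} (A^{-1/2} B A^{-1/2})^t A^{1/2} for SPD A, B."""
    A_sqrt = sqrtm(A)
    A_inv_sqrt = inv(A_sqrt)
    middle = fractional_matrix_power(A_inv_sqrt @ B @ A_inv_sqrt, t)
    return A_sqrt @ middle @ A_sqrt

def solve_metric(S_v, D_v, t=0.5, eps=1e-6):
    """Closed-form metric for one view: (S_v)^{-1} #_t D_v.
    A small ridge keeps S_v invertible (assumption for numerical stability)."""
    d = S_v.shape[0]
    S_reg = S_v + eps * np.eye(d)
    return weighted_geometric_mean(inv(S_reg), D_v, t)

The parameter t moves the solution along the geodesic between (S^(v))^{-1} and D^(v), so it controls how strongly intra-class compactness is traded against inter-class separation.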
Step 104: and carrying out multi-view fusion to obtain the unified representation of the features.
After the metric matrix A^(v) of each view has been obtained via the alternating solving strategy, the constraint conditions are substituted into the objective function to construct a generalized Lagrangian, which is differentiated to solve for the view weights w.
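For illustration, a skeleton of the alternating strategy might look as follows, reusing solve_metric() from the sketch after step 103. The view-weight update shown here (an inverse-cost normalization) is only a placeholder for demonstration; the patent's Lagrangian-derived update is not reproduced legibly in this text.

import numpy as np

def alternate_optimize(S_list, D_list, n_iter=10, t=0.5):
    """S_list, D_list: per-view intra-class and inter-class divergence matrices.
    Assumes solve_metric() from the previous sketch is in scope."""
    m = len(S_list)
    w = np.full(m, 1.0 / m)          # start from uniform view weights
    A_list = [None] * m
    for _ in range(n_iter):
        # (1) fix w, solve each view's metric matrix in closed form
        A_list = [solve_metric(S_v, D_v, t) for S_v, D_v in zip(S_list, D_list)]
        # (2) fix the metrics, update view weights from per-view costs
        #     (placeholder inverse-cost rule, not the patent's formula)
        costs = np.array([np.trace(A @ S) + np.trace(np.linalg.inv(A) @ D)
                          for A, S, D in zip(A_list, S_list, D_list)])
        w = (1.0 / costs) / np.sum(1.0 / costs)
    return A_list, w

The returned weights w then fuse the per-view metrics into the unified representation of step 104.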
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solution of the present invention, not to limit it. Although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be replaced by equivalents, and that such modifications and substitutions do not depart from the spirit of the invention.

Claims (9)

1. A semi-supervised multi-view metric learning method in Riemann space, characterized by comprising the following steps:
step 101: extracting multi-view features of the image from the training set and generating a sample pair;
step 102: constructing a multi-view intra-class and inter-class divergence matrix, embedding semantic information into a feature subspace, and realizing migration and fusion of data and knowledge;
step 103: embedding data and knowledge from Euclidean space into Riemann manifold subspace to complete feature mapping;
step 104: and carrying out multi-view fusion to obtain the unified representation of the features.
2. The semi-supervised multi-view metric learning method in Riemann space according to claim 1, wherein step 101, extracting multi-view features of the image from the training set and generating sample pairs, further comprises:
passing the training set through local HOG features, SIFT feature descriptors, and a deep convolutional neural network; after a bag-of-words model and the final fully connected layer of the feature extraction network, obtaining a 500-dimensional bag-of-words representation and 1024-dimensional deep features of the image, respectively; and obtaining a similar sample-pair set S, a dissimilar sample-pair set D, and an unlabeled sample set U according to the sample labels.
3. The semi-supervised multi-view metric learning method in Riemann space according to claim 1, wherein the loss function is L = L_dis + λ_1 L_reg1 + λ_2 L_reg2,
where L is the total metric learning loss, L_dis is the discrimination loss, λ_1 and λ_2 are balance parameters between the objectives, L_reg1 is the semi-supervised graph regularization loss, L_reg2 is the metric regularization loss, w_v is the weight of view v, A^(v) is the metric matrix of view v, S^(v) is the intra-class divergence of view v, D^(v) is the inter-class divergence of view v, X^(v) is the feature matrix of view v, L in the graph regularizer is the Laplacian matrix, D_sld(A^(v), A_0) is the symmetric LogDet divergence, and A_0 is a prior symmetric positive-definite matrix.
4. The semi-supervised multi-view metric learning method in Riemann space according to claim 3, wherein the discrimination loss L_dis yields, under the metric matrix constructed for each view, a distance metric with strong discriminative power.
5. The semi-supervised multi-view metric learning method in Riemann space according to claim 3, wherein the Laplacian matrix L = D - W, where D is the diagonal degree matrix with D_ii = Σ_j W_ij and W is the adjacency matrix of the sample graph.
6. The semi-supervised multi-view metric learning method in Riemann space according to claim 3, wherein the semi-supervised graph regularization loss L_reg1 and the Laplacian matrix L are constructed according to the manifold assumption that samples located within a local region of the low-dimensional manifold have similar classes.
7. The semi-supervised multi-view metric learning method in Riemann space according to claim 3, wherein the metric regularization loss L_reg2 ensures that A^(v) has a solution even when the matrix S^(v) is nearly singular or non-invertible.
8. The semi-supervised multi-view metric learning method in Riemann space according to claim 3, wherein, for the discrimination loss term L_dis of the objective function, the solution of the metric matrix A^(v) is generalized to an objective defined over the Riemannian manifold of SPD matrices, where δ_R denotes the Riemannian distance between SPD matrices:
δ_R(X, Y) := ||log(Y^{-1/2} X Y^{-1/2})||_F, for symmetric positive-definite X and Y.
9. The semi-supervised multi-view metric learning method in Riemann space according to claim 3, wherein the metric matrix A^(v) of each view is obtained first, and the view weights w are then solved.
CN202210847014.1A 2022-07-07 2022-07-07 Semi-supervised multi-view metric learning method in Riemann space Active CN115205632B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210847014.1A CN115205632B (en) 2022-07-07 2022-07-07 Semi-supervised multi-view metric learning method in Riemann space

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210847014.1A CN115205632B (en) 2022-07-07 2022-07-07 Semi-supervised multi-view metric learning method in Riemann space

Publications (2)

Publication Number Publication Date
CN115205632A CN115205632A (en) 2022-10-18
CN115205632B (en) 2023-07-18

Family

ID=83581743

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210847014.1A Active CN115205632B (en) 2022-07-07 2022-07-07 Semi-supervised multi-view metric learning method in Riemann space

Country Status (1)

Country Link
CN (1) CN115205632B (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110414575A (en) * 2019-07-11 2019-11-05 东南大学 A kind of semi-supervised multiple labeling learning distance metric method merging Local Metric
CN110598733A (en) * 2019-08-05 2019-12-20 南京智谷人工智能研究院有限公司 Multi-label distance measurement learning method based on interactive modeling
CN111488951B (en) * 2020-05-22 2023-11-28 南京大学 Method for generating countermeasure metric learning model for RGB-D image classification

Also Published As

Publication number Publication date
CN115205632A (en) 2022-10-18


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20231206

Address after: Room 1806, Block B, Huanya Times Square, No. 7 Yari Street, Taiyuan Xuefu Park, Shanxi Comprehensive Reform Demonstration Zone, Taiyuan City, Shanxi Province, 030000

Patentee after: Shanxi Jinxinan Technology Co.,Ltd.

Address before: 030006 803, science and technology building, Shanxi University, No. 92, Wucheng Road, Xiaodian District, Taiyuan City, Shanxi Province

Patentee before: SHANXI University