CN111916162A

CN111916162A - Multi-modal neuroimaging feature selection method based on sample weight and low-rank constraint

Info

Publication number: CN111916162A
Application number: CN202010754793.1A
Authority: CN
Inventors: 陈伟斌; 张笑钦; 钱乐旦; 赵丽
Original assignee: Wenzhou University
Current assignee: Wenzhou University
Priority date: 2020-07-31
Filing date: 2020-07-31
Publication date: 2020-11-10

Abstract

The invention discloses a multi-modal neuroimaging feature selection method based on sample weight and low-rank constraint, which comprises the following steps of: (1) collecting multi-mode brain image data; (2) after data of a plurality of modes are obtained, feature selection is carried out in a multi-mode feature collaborative analysis mode, a regression model from each mode data to a classification class target is established, group sparse constraint is carried out on regression vectors, and therefore a public feature subset relevant to all tasks is obtained; (3) modeling multi-modal data features; (4) writing the target function into an augmented Lagrange form, wherein the target function is changed into a convex function; (5) obtaining a weight matrix of each mode and a weight of each feature of each sample; (6) and fusing the multi-modal characteristics of the sample by using a method of a multi-core support vector machine for classification. According to the technical scheme, the importance of the samples among the modes is considered, the correlation among the modes is also considered, more meaningful features are found out, and the accuracy of classification and prediction is improved.

Description

Multi-modal neuroimaging feature selection method based on sample weight and low-rank constraint

Technical Field

The invention relates to the technical field of neural image analysis and processing, in particular to a multi-modal neural image feature selection method based on sample weight and low-rank constraint.

Background

The rapid development of neuroimaging technology has made possible the measurement and visualization of structural information of the human brain and functional information of mental activities, thereby leading to an explosive growth of knowledge about brain cognition in recent years. However, whatever the imaging mode, it only provides an imaging means, how to effectively and quantitatively analyze and visualize the data obtained by neuroimaging, and how to integrate the scattered multi-modal multi-map data information into a panoramic image reflecting the physiological and pathological mechanisms of brain tissue structure and brain function, which is the key of the subsequent clinical application.

However, for the characteristic representation of multi-modal multi-map data isomerism, the traditional characteristic selection method, namely the multi-modal brain image data fusion diagnosis, adopts the simple splicing processing of characteristic vectors, and realizes the improvement of diagnosis precision in a simple information quantity mode. Multiple sets of features are taken from multiple atlases for each sample and then cascaded for subsequent classification tasks. In recent years, a multi-kernel learning (multi-kernel learning) method has been applied to multimodal data fusion analysis, and has achieved a favorable effect. However, a large amount of redundant features irrelevant to a diagnosis and prediction task exist in multi-modal data, so that some researchers expand on the basis of a single-modal or single-map data embedded sparse feature selection technology, and adopt a multi-task learning framework to realize multi-modal and multi-map group sparse feature selection, namely, a consistent brain image marker is selected as a diagnostic discriminant feature.

Therefore, the obvious disadvantages of the existing feature selection method are as follows: only single-mode data or simple cascade multi-mode data are used, complementary information among the multi-mode data cannot be fully utilized, the feature representation method is single, and the learning performance of the model is limited to a certain extent.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide a multi-modal neuroimaging feature selection method based on sample weight and low-rank constraint.

In order to achieve the purpose, the invention provides the following technical scheme: a multi-modal neuroimaging feature selection method based on sample weight and low-rank constraint comprises the following steps:

(1) collecting multi-mode brain image data;

(2) after data of a plurality of modes are obtained, feature selection is carried out in a multi-mode feature collaborative analysis mode, a regression model from each mode data to a classification class target is established, group sparse constraint is carried out on regression vectors, and therefore a public feature subset relevant to all tasks is obtained;

(3) modeling multi-modal data features;

(4) writing the target function into an augmented Lagrange form, wherein the target function is changed into a convex function;

(5) solving each variable of the formula in the step (4) by using an alternating direction multiplier algorithm by using a gradient descent method to obtain a weight matrix W of each mode of each sample and a weight beta of each characteristic^j。

(6) And fusing the multi-modal characteristics of the sample by using a method of a multi-core support vector machine for classification.

Preferably, step (1) is performed by performing multiple imaging in multiple modes or multiple imaging in the same mode on the same patient, and obtaining information from several images to perform comprehensive analysis.

Preferably, in step (3), the data of each modality is used

Wherein the superscript j denotes the jth mode and the subscript i denotes the ith modeThen, the process is carried out; the category to which each sample belongs is used as y_iIn this representation, the index i represents the ith sample; inputting the data of each sample and the belonged category into an objective function for feature selection, wherein the objective function for feature selection is as follows:

wherein n is the number of samples, m is the number of modes, beta^jA vector is selected for the features of the jth modality,

what weight is occupied by the j mode of the ith sample, W is the weight matrix of all modes of the training sample, wherein each row represents a different sample, each column represents a different mode, and lambda_LIs a regularization parameter that constrains the feature sparsity, λ_RIs a regularization parameter that constrains the sample multi-modal association.

Preferably, the characteristic selection target function is changed into a convex function form by constructing an augmented lagrange equation:

wherein c is a fixed step length parameter of gradient rise, and the step (3) is a constraint function

Is equivalent to

v_iIs an equivalent transform factor. λ is a penalty parameter.

Preferably, the augmented lagrange equation is optimized by the ADMM method, the optimization problem being decomposed into β^j、

Lambda three subproblems, and continuously updating Lagrange multipliers in an iterative mode in the following mode to obtain component values:

wherein argmin is the value of the variable when the following expression reaches the minimum value, and the parameter C is more than or equal to 0.

Preferably, in step (5), each variable in the formula is solved by using an alternating direction multiplier algorithm ADMM to obtain a weight matrix of each mode and a weight of each feature of each sample, and then the multi-mode features of the samples are fused by using a multi-core support vector machine method to classify.

The invention has the advantages that: compared with the prior art, the method constructs a learning model with high classification performance, improves the diagnosis accuracy of the brain diseases, assists in searching biomarkers which are possibly sensitive to the diseases, solves the problems that only single-mode data or only simple cascade multi-mode data is used and the multi-mode data cannot be fully utilized by fully mining and utilizing the prior information of the neuroimage data, mines complementary information among different-mode data, fully considers the difference generated in the acquisition process of the sample and the individual difference of the sample and even the outlier contained in the sample, and selects the characteristic with strong discriminability.

According to the invention, by considering the weight of the samples, the weight of the modes and the weight of the features, not only the importance of the samples among the modes but also the correlation among the modes are considered, more meaningful features are found out, and the accuracy of classification and prediction is improved.

The invention is further described with reference to the drawings and the specific embodiments in the following description.

Drawings

FIG. 1 is a flow chart of an embodiment of the present invention.

Detailed Description

Referring to fig. 1, the invention discloses a multi-modal neuroimaging feature selection method based on sample weight and low rank constraint, comprising the following steps:

(1) collecting multi-mode brain image data;

(3) modeling multi-modal data features;

It is often clinically desirable to perform multiple modalities or multiple imaging of the same modality on the same patient. I.e. information is obtained from several images at the same time for comprehensive analysis. The single mode imaging uses only one imaging device and can be used for observing lesion growth, comparing treatment effects before and after an operation, and the like. In the invention, multiple imaging modes (MRI, fMRI, PET and the like) are adopted to simultaneously express multiple information of human bodies, and multiple information integration is realized.

Preferably, in step (3), the data of each modality is used

Where the superscript j denotes the jth modality, and the subscript i denotes the ith sample; the category to which each sample belongs is used as y_iIn this representation, the index i represents the ith sample; inputting the data of each sample and the belonged category into an objective function for feature selection, wherein the objective function for feature selection is as follows:

Is equivalent to

v_iIs an equivalent transform factor. λ is a penalty parameter.

Aiming at the characteristics that typical brain diseases have multi-modal data such as biology, physiology and the like, the invention explores a method for efficiently processing and understanding the abundant, various and complex data. On the basis of meaningful measurement in a brain feeling region or a brain network through feature extraction, efficient feature selection is performed, irrelevant and redundant features are filtered out to reduce the dimension of the features so as to facilitate subsequent classification, biomarkers which are possibly sensitive to diseases are searched for, automatic diagnosis and classification of individuals are completed, pathology is better explained, and key support is performed for realizing typical intelligent learning.

The method solves the problem that individual differences and characteristics contain noise, considers the weight of the samples, the weight of the modes and the weight of the characteristics, considers the importance of the samples among the modes and the correlation among the modes, finds out more meaningful characteristics, and improves the accuracy of classification and prediction.

The above embodiments are described in detail for the purpose of further illustrating the present invention and should not be construed as limiting the scope of the present invention, and the skilled engineer can make insubstantial modifications and variations of the present invention based on the above disclosure.

Claims

1. A multi-modal neuroimaging feature selection method based on sample weight and low-rank constraint is characterized in that: the method comprises the following steps:

(1) collecting multi-mode brain image data;

(3) modeling multi-modal data features;

(5) solving each variable of the formula in the step (4) by using an alternating direction multiplier algorithm by using a gradient descent method to obtain a weight matrix of each mode and a weight of each characteristic of each sample;

2. The method of claim 1, wherein the method for multi-modal neuroimaging feature selection based on sample weights and low rank constraints comprises: step (1), multiple modes or multiple times of imaging of the same mode are carried out on the same patient, and information is obtained from several images at the same time for comprehensive analysis.

3. The method of claim 2, wherein the method for multi-modal neuroimaging feature selection based on sample weights and low rank constraints comprises: step (3) of using data of each modality

4. The method of claim 3, wherein the method for multi-modal neuroimaging feature selection based on sample weights and low rank constraints comprises: the characteristic selection target function is changed into a convex function form by constructing an augmented Lagrange equation:

Is equivalent to

v_iIs an equivalent transform factor. λ is a penalty parameter.

5. The method of claim 4, wherein the method for multi-modal neuroimaging feature selection based on sample weights and low rank constraints comprises: the augmented Lagrange equation is optimized by the ADMM method, and the optimization problem is decomposed into beta^j、

6. The method of claim 5, wherein the method for multi-modal neuroimaging feature selection based on sample weights and low rank constraints comprises: and (5) solving each variable in the formula by using an alternative direction multiplier Algorithm (ADMM) to obtain a weight matrix of each mode and a weight of each feature of each sample, and then fusing multi-mode features of the samples for classification by using a multi-core support vector machine method.