CN111046951A

CN111046951A - Medical image classification method

Info

Publication number: CN111046951A
Application number: CN201911271721.5A
Authority: CN
Inventors: 李泽琦
Original assignee: Anhui Weiaoman Robot Co ltd
Current assignee: Anhui Weiaoman Robot Co ltd
Priority date: 2019-12-12
Filing date: 2019-12-12
Publication date: 2020-04-21

Abstract

The invention relates to a medical image classification method, which comprises the following steps: 1) collecting a large number of medical images, extracting features to obtain a sample set, manually labeling and initializing; 2) randomly generating an input weight vector and an input bias of a hidden layer mapping function; 3) generating a hidden layer output function; 4) generating a hidden layer output matrix; 5) constructing a graph Laplace matrix; 6) and predicting by using the trained model. The invention has the following advantages: 1) only a small amount of labels are needed to be marked for the medical images by experts; 2) data distribution and prior connection information can be fully utilized to realize high-accuracy classification under a small amount of data; 3) the model training is efficient without the need to resort to large numbers of expensive high-speed computers.

Description

Medical image classification method

Technical Field

The invention belongs to the technical field of computers, and particularly relates to a medical image classification method.

Background

Medical images are increasingly used in clinical diagnosis and treatment, and how to use a large number of medical images to assist doctors in diagnosis and treatment of diseases is a problem of extensive research in the industry at present. An excellent medical image classification method must be carefully classified according to the types of diseases and the types of donors so as to perform efficient retrieval and information analysis and mining at any time. The traditional medical image mainly adopts the methods of manual identification and character classification. However, with the increase of medical images, especially the differences of race, gender, age, etc. involved therein, the difficulty of manual identification is getting greater and the workload is getting greater. Therefore, it is a future trend to introduce increasingly sophisticated computer image recognition technology to replace manual work to accomplish the above work.

Disclosure of Invention

The invention overcomes the defects of the prior art and provides a medical image classification method, which comprises the following steps:

step 1, collecting a large number of medical images

Using pairs of automatic encoders

Training to obtain

A set of feature vectors, i.e. a set of samples

To pair

Marking to obtain corresponding category label

Wherein the content of the first and second substances,

is a d-dimensional column vector and is,

in the case of a real number,

for the number of labeled samples and n for the number of all samples,

number of unlabeled samples;

initialization: the following parameters were manually set: model complexity coefficient gamma_A>0, smooth conformity coefficient γ_I>0, ligation fusionCoefficient η ∈ (0,1), loss coefficient C>0, number of hidden layer nodes N>0；

Step 2, randomly generating an input weight vector of a hidden layer mapping function

And is offset from the input

The following were used:

randomly generating N w to obtain w₁,...,w_N(ii) a Randomly generating N b to obtain b₁,...,b_N；

Step 3, generating a hidden layer output function:

h(x)＝[G(w₁,b₁,x),…,G(W_N,b_N,x)]^T

wherein G (w, b, x) is an activation function, and x represents a sample;

step 4, generating a hidden layer output matrix:

step 5, constructing a graph Laplace matrix:

step 501, constructing a Laplace matrix L of the feature similarity graph_G：

L_G＝D_G-W_G

Wherein, W_GIs a feature similarity matrix whose i row and j column elements [ W ]_G]_ijComprises the following steps:

wherein x is_iAnd x_jIn order to be a sample of the sample,

σ>0 is the Gaussian kernel width; d_GIs W_GA degree matrix of (c);

step 502,Constructing a Bilink Laplace matrix L_m：

L_m＝D_m-W_m

Wherein, W_mFor a must-join graph matrix, when x_iAnd x_jWhen it is a homogeneous sample, W_mIth row and jth column element [ W ]_m]_ijWhen x is 1_iAnd x_jIf the same type of sample is unknown, [ W ]_m]_ij＝0；D_mIs W_mA degree matrix of (c);

step 503, construct the Laplace matrix L of the must-break graph_c：

L_c＝D_c-W_c

Wherein, W_cTo break the graph matrix, when x_iAnd x_jWhen it is a heterogeneous sample, W_cIth row and jth column element [ W ]_c]_ijWhen x is 1_iAnd x_jIf the heterogeneous sample is unknown, [ W ]_c]_ij＝0；D_cIs W_cA degree matrix of (c);

and 6, predicting the category of the medical image by using the following models:

wherein, I is a unit array,

is a diagonal matrix in front of

Each diagonal element is 1, and the other diagonal elements are 0;

the number of marked samples, u the number of unmarked samples,

wherein, the activation function G (w, b, x) involved in step 3 is:

or:

wherein N is>d、

η∈(0.5,1)。

Wherein the auto-encoder comprises at least one convolutional layer and one pooling layer.

Compared with the prior art, the invention has the advantages that: 1) only a small amount of labels are needed to be marked for the medical images by experts; 2) data distribution and prior connection information can be fully utilized to realize high accuracy under a small amount of data; 3) the model training is efficient without the need to resort to large numbers of expensive high-speed computers.

Drawings

FIG. 1 is a flow chart of the method of the present invention;

Detailed Description

The invention is further described below with reference to examples, but the scope of the invention is not limited thereto.

As shown in fig. 1, the present invention is specifically implemented as follows:

step 1, collecting a large number of medical images

Using pairs of automatic encoders

Training to obtain

A set of feature vectors, i.e. a set of samples

To pair

Marking to obtain corresponding category label

Wherein the content of the first and second substances,

is a d-dimensional column vector and is,

in the case of a real number,

for the number of labeled samples and n for the number of all samples,

number of unlabeled samples;

initialization: the following parameters were manually set: model complexity coefficient gamma_A>0, smooth conformity coefficient γ_I>0, connectivity fusion coefficient η ∈ (0,1), loss coefficient C>0, number of hidden layer nodes N>0；

And is offset from the input

The following were used:

Step 3, generating a hidden layer output function:

h(x)＝[G(w₁,b₁,x),…,G(W_N,b_N,x)]^T

wherein G (w, b, x) is an activation function, and x represents a sample;

step 4, generating a hidden layer output matrix:

step 5, constructing a graph Laplace matrix:

step 501, constructing a Laplace matrix L of the feature similarity graph_G：

L_G＝D_G-W_G

wherein x is_iAnd x_jIn order to be a sample of the sample,

σ>0 is the Gaussian kernel width; d_GIs W_GA degree matrix of (c);

step 502, construct the Laplace matrix L of the must-link graph_m：

L_m＝D_m-W_m

step 503, construct the Laplace matrix L of the must-break graph_c：

L_c＝D_c-W_c

wherein, I is a unit array,

is a diagonal matrix in front of

Each diagonal element is 1, and the other diagonal elements are 0;

the number of marked samples, u the number of unmarked samples,

preferably, the activation function G (w, b, x) involved in step 3 is:

or:

preferably, N>d、

η∈(0.5,1)。

Preferably, the auto-encoder comprises at least one convolutional layer and one pooling layer.

The degree matrix D of the matrix W is calculated as follows, D is a diagonal matrix, the ith diagonal element D of D_ii＝∑_jW_ijWherein W is_ijIs the ith row and jth column element of W.

For an acquired medical image, some image preprocessing work is generally required, and the medical image to be recognized is subjected to correction, scaling, filtering and resolution adjustment.

The above examples are provided only for the purpose of describing the present invention, and are not intended to limit the scope of the present invention. The scope of the invention is defined by the appended claims. Various equivalent substitutions and modifications can be made without departing from the spirit and principles of the invention, and are intended to be within the scope of the invention.

Claims

1. A medical image classification method is characterized by comprising the following steps:

step 1, collecting a large number of medical images

Using pairs of automatic encoders

Training to obtain

A set of feature vectors, i.e. a set of samples

To pair

Marking to obtain corresponding category label

Wherein the content of the first and second substances,

is a d-dimensional column vector and is,

the number of the labeled samples is a real number, l is the number of the labeled samples, n is the number of all the samples, and u-n-l is the number of the unlabeled samples;

And is offset from the input

The following were used:

Step 3, generating a hidden layer output function:

h(x)＝[G(w₁,b₁,x),…,G(W_N,b_N,x)]^T

wherein G (w, b, x) is an activation function, and x represents a sample;

step 4, generating a hidden layer output matrix:

H＝[h(x₁),…,h(x_l+u)]^T

step 5, constructing a graph Laplace matrix:

step 501, constructing a Laplace matrix L of the feature similarity graph_G：

L_G＝D_G-W_G

wherein x is_iAnd x_jIs the sample, i, j ∈ {1, …, l + u }, σ>0 is the Gaussian kernel width; d_GIs W_GA degree matrix of (c);

step 502, construct the Laplace matrix L of the must-link graph_m：

L_m＝D_m-W_m

step 503, construct the Laplace matrix L of the must-break graph_c：

L_c＝D_c-W_c

wherein, I is a unit array,

is a diagonal matrix, the first one diagonal elements of which are 1, and the other diagonal elements are 0; l is the number of marked samples, u is the number of unmarked samples,

2. a method as claimed in claim 1, wherein the activation function G (w, b, x) in step 3 is:

。

3. a method as claimed in claim 1, wherein the activation function G (w, b, x) in step 3 is:

。

4. a method for classifying medical images according to any of claims 1, 2 and 3, wherein N > d.

5. A method for classifying medical images as claimed in any one of claims 1, 2 and 3, wherein l > N.

6. A method for classifying medical images as claimed in any one of claims 1, 2 and 3, wherein η e (0.5, 1).

7. A method for classifying medical images as claimed in claim 4, wherein η e (0.5, 1).

8. A method for classifying medical images as claimed in claim 5, wherein η e (0.5, 1).

9. A method for classifying medical images according to any one of claims 1, 2 and 3, wherein said automatic encoder comprises at least one convolutional layer and one pooling layer.

10. The method of classifying medical images of claim 6, wherein said automated encoder includes at least one convolutional layer and one pooling layer.