CN112784884A

CN112784884A - Medical image classification method, system, medium and electronic terminal

Info

Publication number: CN112784884A
Application number: CN202110018207.1A
Authority: CN
Inventors: 彭德光; 朱楚洪; 孙健; 唐贤伦
Original assignee: Chongqing Zhaokun Intelligent Medical Technology Co ltd
Current assignee: Chongqing Zhaokun Intelligent Medical Technology Co ltd
Priority date: 2021-01-07
Filing date: 2021-01-07
Publication date: 2021-05-11

Abstract

The invention provides a medical image classification method, a medical image classification system, a medical image classification medium and an electronic terminal, wherein the medical image classification method comprises the following steps: constructing a convolutional neural network based on deep learning; inputting the acquired medical image data set into a convolutional neural network for training to obtain a classification model, wherein the training process comprises the following steps: feature extraction and classification, and class distribution imbalance optimization; the step of optimizing the class distribution imbalance comprises the following steps: according to the number of samples in different categories, obtaining the weight parameters and/or the unbalance proportion of the samples of the different categories, and further carrying out class unbalance optimization; inputting medical images to be classified into a classification model, and classifying the medical images; the medical image classification method of the invention is characterized in that a medical image data set is input into a convolutional neural network based on deep learning for training, and the training process comprises the following steps: the feature extraction and classification and the unbalanced class distribution optimization are carried out, so that a better classification model is obtained, the problem of unbalanced distribution of image samples can be effectively avoided, and the classification accuracy is improved.

Description

Medical image classification method, system, medium and electronic terminal

Technical Field

The present invention relates to the field of image classification, and in particular, to a medical image classification method, system, medium, and electronic terminal.

Background

With the development of machine diagnosis, computer-aided diagnosis (CAD) systems have received more and more attention, and are widely applied to segmentation, classification, and retrieval of anatomical medical images to reduce the workload of doctors, however, at present, the method for retrieving anatomical medical images mainly adopts a text-based retrieval method, and retrieves based on the representation text of one or more images, and text manual text information needs to be summarized and recorded by experts and doctors with abundant experience and knowledge for a large amount of time, so that the implementation is difficult, the operation is complex, and when the distribution of the obtained image samples is unbalanced, accurate classification is difficult to achieve, and the classification efficiency is low.

Disclosure of Invention

The invention provides a medical image classification method, a medical image classification system, a medical image classification medium and an electronic terminal, and aims to solve the problems that in the prior art, a text-based method is adopted to search medical images, the operation is complex, and when image samples are not distributed evenly, the image classification accuracy is low.

The invention provides a medical image classification method, which comprises the following steps:

constructing a convolutional neural network based on deep learning;

inputting the acquired medical image data set into the convolutional neural network for training to obtain a classification model, wherein the training process comprises the following steps: feature extraction and classification, and class distribution imbalance optimization;

the step of class distribution imbalance optimization comprises: according to the number of samples in different categories, obtaining the weight parameters and/or the unbalance proportion of the samples of the different categories, and further carrying out class unbalance optimization;

and inputting the medical image to be classified into the classification model to classify the medical image.

Optionally, the step of feature extraction and classification includes:

inputting the medical image data set into the convolutional neural network, and performing feature extraction to obtain a feature matrix;

vectorizing the feature matrix to obtain a feature vector;

classifying the feature vectors through a full connection layer of the convolutional neural network to obtain a first classification result;

inputting the characteristic vector into a softmax classifier for classification, and obtaining a second classification result;

and comparing the second classification result with the first classification result, and training the convolutional neural network according to the comparison result.

Optionally, the step of obtaining the weight parameters of different categories according to the number of samples in different categories includes:

obtaining weight parameters of different classes according to the number of samples in the different classes, wherein the samples comprise: the feature vector and the category label corresponding to the feature vector;

optimizing the number imbalance of the samples of different classes according to the weight parameters;

according to the weight parameters, the mathematical expression for optimizing the number imbalance of the samples of different classes is as follows:

FL(p_t)＝-α_t(1-p_t)^γlog(p_t)

wherein p is_tFor the prediction value, α is a weight parameter, (1-p)_t)^γThe sample difficulty weighting adjustment factor is, and gamma is the inhibition parameter.

Optionally, the step of obtaining the imbalance ratio of the samples according to the number of the samples in different categories further includes:

presetting a sample quantity threshold;

obtaining a class to be optimized according to the number of samples in different classes and the sample number threshold, wherein the number of samples in the class to be optimized is smaller than the feature vector number threshold, and the samples comprise: the feature vector and the category label corresponding to the feature vector;

and acquiring a new synthesized sample according to the samples in the class to be optimized, and further optimizing the distribution imbalance of different classes.

Optionally, the step of obtaining a new synthesized sample according to the sample in the class to be optimized includes:

defining a sample in the class to be optimized as an initial sample;

obtaining Euclidean distances from the initial sample to other samples in the class to be optimized;

acquiring one or more neighbor samples according to a preset distance threshold and the Euclidean distance, wherein the neighbor samples correspond to the initial samples, and the Euclidean distance from the initial samples to the corresponding neighbor samples is smaller than the distance threshold;

determining the imbalance proportion of the samples according to the number of the samples in different categories and the sample number threshold;

and acquiring a new synthesized sample according to the unbalance proportion, thereby reducing the distribution unbalance of different classes.

Optionally, the step of obtaining a new synthesized sample according to the imbalance ratio includes:

acquiring the required number of the synthesized samples according to the unbalance proportion;

obtaining samples to be synthesized according to the required number, wherein the number of the samples to be synthesized is the same as the required number;

and acquiring a new synthesis sample according to the sample to be synthesized, wherein the mathematical expression of the new synthesis sample is as follows:

wherein,

for new synthetic samples, XⁱFor the initial sample, σ is uniformly distributed at [0,1 ]]Random number in the range, X^tIs a sample to be synthesized, k is a sample to be synthesizedThe number of samples.

Optionally, the training process further includes: feature encoding, the step of feature encoding comprising:

self-coding the obtained sample to obtain a coding result, wherein the sample comprises: the feature vector and the category label corresponding to the feature vector;

inputting the coding result into a decoder for decoding to obtain a decoding result;

and adjusting the encoding parameters in the convolutional neural network according to the decoding result and the sample corresponding to the decoding result.

The present invention also provides a medical image classification system, comprising:

the preprocessing module is used for constructing a convolutional neural network based on deep learning;

the processing module is used for inputting the acquired medical image data set into the convolutional neural network for training to obtain a classification model, wherein the training process comprises the following steps: feature extraction and classification, and class distribution imbalance optimization, wherein the class distribution imbalance optimization comprises the following steps: according to the number of samples in different categories, obtaining the weight parameters and/or the unbalance proportion of the samples of the different categories, and further carrying out class unbalance optimization;

and the image classification module is used for inputting the medical image to be classified into the classification model to classify the medical image, and the preprocessing module and the processing module are connected with the image classification module.

The invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the method as defined in any one of the above.

The present invention also provides an electronic terminal, comprising: a processor and a memory;

the memory is adapted to store a computer program and the processor is adapted to execute the computer program stored by the memory to cause the terminal to perform the method as defined in any one of the above.

The invention has the beneficial effects that: the medical image classification method of the invention is characterized in that a medical image data set is input into a convolutional neural network based on deep learning for training, and the training process comprises the following steps: the feature extraction and classification and the unbalanced class distribution are optimized, so that a better classification model is obtained, the medical image classification process is directly oriented to image content, the medical image is prevented from being retrieved and classified by adopting a text-based method, meanwhile, the problem of unbalanced distribution of image samples can be effectively avoided, and the classification accuracy is improved.

Drawings

FIG. 1 is a first flowchart illustrating a medical image classification method according to an embodiment of the present invention;

FIG. 2 is a second flowchart illustrating a medical image classification method according to an embodiment of the present invention;

FIG. 3 is a schematic structural diagram of a convolutional neural network of a medical image classification method according to an embodiment of the present invention;

FIG. 4 is a schematic structural diagram of an auto-encoder of the medical image classification method according to an embodiment of the present invention;

fig. 5 is a schematic structural diagram of a medical image classification system in an embodiment of the invention.

Detailed Description

The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It is to be noted that the features in the following embodiments and examples may be combined with each other without conflict.

It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention, and the components related to the present invention are only shown in the drawings rather than drawn according to the number, shape and size of the components in actual implementation, and the type, quantity and proportion of the components in actual implementation may be changed freely, and the layout of the components may be more complicated.

The inventor finds that with the development of machine diagnosis, a Computer Aided Diagnosis (CAD) system receives more and more attention, and is widely applied to segmentation, classification, and retrieval of anatomical medical images for reducing the workload of doctors, however, at present, the method for retrieving anatomical medical images mainly adopts a text-based retrieval method, and the retrieval is performed based on the representation text of one or more images, and text manual text information needs to be summarized and recorded by expert doctors with abundant experience and knowledge for a lot of time, so that the implementation is difficult, the operation is complex, errors are easily caused by human factors, and when the distribution of the obtained image samples is unbalanced, accurate classification is difficult to achieve, and the classification efficiency is low, therefore, the inventor proposes a medical image classification method, a system, a medium and an electronic terminal, which are trained by inputting medical image data into a convolutional neural network based on deep learning, the training process comprises the following steps: the method comprises the steps of feature extraction and classification, class distribution imbalance optimization and feature coding, so that a classification model is obtained, the medical image classification process is directly oriented to image content, the medical image is prevented from being retrieved and classified by a text-based method, meanwhile, the problem of unbalanced distribution of image samples can be effectively avoided, and the classification accuracy is improved.

As shown in fig. 1, the medical image classification method in the present embodiment includes:

s101: constructing a convolutional neural network based on deep learning;

s102: inputting the acquired medical image data set into the convolutional neural network for training to obtain a classification model, wherein the training process comprises the following steps: feature extraction and classification, and class distribution imbalance optimization, wherein the class distribution imbalance optimization comprises the following steps: according to the number of samples in different categories, obtaining the weight parameters and/or the unbalance proportion of the samples of the different categories, and further carrying out class unbalance optimization; for example: acquiring a medical image dataset, the medical image dataset comprising: different numbers of anatomical medical images of different parts of an organism can acquire deep features with strong identification capability by inputting medical image data sets into a convolutional neural network based on deep learning for feature extraction, simultaneously reduce the number of the features, retain more important information, reduce the complexity of data processing and improve the classification efficiency, because the classification of the anatomical medical images is a special technical field, the usually acquired medical image data sets are unbalanced, namely the number of samples of different classes is different, which easily reduces the accuracy of the classification of the medical images, therefore, the problem of unbalanced distribution of the features of the samples of different classes can be effectively solved by adding training of unbalanced distribution optimization in the training process, the classification accuracy is improved, the anti-interference capability is strong, and related personnel such as doctors carry out retrieval, in the training process, the feature coding is carried out, so that the practicability is strong and the cost is low;

s103: inputting the medical image to be classified into the classification model, and classifying the medical image; training by inputting medical image data sets into a deep learning based convolutional neural network, the training process comprising: the method comprises the steps of feature extraction and classification, unbalanced optimization of class distribution and feature coding, so that a classification model is obtained, image content is directly oriented in the classification process of the medical images, the medical images are prevented from being retrieved and classified by adopting a text-based method, classification errors caused by human factors are reduced, meanwhile, the problem of unbalanced distribution of image samples can be effectively solved, classification accuracy is improved, and practicability is high.

As shown in fig. 2, the medical image classification method in some embodiments includes:

s201: constructing a convolutional neural network based on deep learning; for example: as shown in fig. 3, a 13-layer linear Convolutional Neural Network (CNN) is constructed, which includes: 4 convolutional layers, 4 batch normalization layers, 3 pooling layers and 2 Full Connected Layers (FCL), the convolutional neural network based on deep learning is constructed, so that the feature extraction can be better performed on the medical image, the efficiency is improved, the accuracy is higher, the pooling layers can perform feature dimension reduction, the number of data and parameters is compressed, the overfitting is reduced, and meanwhile, the fault tolerance of the model is improved; the mathematical expression of the output of each layer of the convolutional neural network (CNN1, CNN2, CNN3, CNN4) is:

f(x)＝P_m×m(Lr(ω×I)+b)

wherein, P represents pooling, m is the size of the pool kernel, Lr () is a LeakyReLu activation function, ω is the weight of the layer, I is the input of the layer, and b is the offset of the layer;

s202: inputting the medical image data set into the convolutional neural network, and performing feature extraction to obtain a feature matrix; the medical image data set is input into the convolutional neural network for feature extraction, so that more important features in the medical image can be extracted, and the feature extraction efficiency is improved;

s203: vectorizing the feature matrix to obtain a feature vector;

s204: classifying the feature vectors through a full connection layer of the convolutional neural network to obtain a first classification result;

s205: inputting the characteristic vector into a softmax classifier for classification, and obtaining a second classification result;

s206: comparing the second classification result with the first classification result, and training the convolutional neural network according to the comparison result; for example: classifying the feature vectors through a full connection layer to obtain a first classification result, classifying through a softmax classifier to obtain a second classification result, performing error back propagation according to the second classification result and the second classification result, and training a convolutional neural network, so that the classification accuracy of medical images is effectively improved;

in some embodiments, further comprising:

normalizing the feature vector to obtain a normalized feature vector; because the distribution of the feature vectors of the medical image is unbalanced, the feature vectors are normalized, namely different channels of the same sample or the feature vectors are normalized, so that the influence on feature classification caused by the number of the samples is avoided, and because the depth of a deep learning neural network is uncertain, the calculation complexity of training can be reduced by performing normalization processing on different channels of the same feature vector;

classifying the normalized feature vectors through a full-link layer of the convolutional neural network to obtain a first classification result;

inputting the normalized feature vector into a softmax classifier for classification, and obtaining a second classification result;

comparing the second classification result with the first classification result, and training the convolutional neural network according to the comparison result;

the mathematical expression of the normalization process on the feature vector is as follows:

wherein μ is a mean value, σ is a variance, H is the number of hidden nodes in a layer, a is a normalized feature vector, and i is a feature coefficient.

S207: obtaining weight parameters of different classes according to the number of samples in the different classes, wherein the samples comprise: the feature vector and the category label corresponding to the feature vector;

s208: optimizing the number imbalance of the samples of different classes according to the weight parameters;

in some embodiments, the mathematical expression for optimizing the number imbalance of the samples of the different classes according to the weight parameter is:

FL(p_t)＝-α_t(1-p_t)^γlog(p_t)

wherein p is_tFor the prediction value, α is a weight parameter, (1-p)_t)^γThe sample difficulty weighting adjustment factor is, and gamma is the inhibition parameter. By optimizing the number imbalance of the samples of different categories according to the weight parameters of different categories, the loss of China caused by the imbalance of the categories in the classification process is reduced;

s209: obtaining a class to be optimized according to the number of samples in different classes and a preset sample number threshold, wherein the number of samples in the class to be optimized is smaller than the feature vector number threshold, and the samples comprise: the feature vector and the category label corresponding to the feature vector;

s210: obtaining a new synthesized sample according to the samples in the class to be optimized, and further optimizing the distribution imbalance of different classes;

in some embodiments, the step of obtaining a new synthesized sample from the samples in the class to be optimized comprises:

defining a sample in the class to be optimized as an initial sample;

In some embodiments, the step of obtaining a new synthesized sample according to the imbalance ratio comprises:

wherein,

for new synthetic samples, XⁱFor the initial sample, σ is uniformly distributed at [0,1 ]]Random number in the range, X^tK is the number of samples to be synthesized. By acquiring a new synthetic sample, the feature vectors of the medical image with uniform class distribution can be obtained, and the accuracy is higher;

s211: self-coding the obtained sample to obtain a coding result, wherein the sample comprises: the feature vector and the category label corresponding to the feature vector;

s212: inputting the coding result into a decoder for decoding to obtain a decoding result;

s213: adjusting encoding parameters in the convolutional neural network according to the decoding result and a sample corresponding to the decoding result so as to obtain a classification model; as shown in fig. 4, in the self-encoding process, firstly performing convolution operation, then performing maximum pooling operation on the result after convolution, where the size of a pooling window can be selected according to circumstances, data is transmitted to a full connection layer after passing through a convolution-pooling layer and a result is output, and simultaneously, the result output by the full connection layer can be decoded, a decoder finally outputs a decoding result through the full connection layer and a reverse sampling layer, and according to the decoding result and a sample corresponding to the decoding result, the encoding parameters in the convolutional neural network are adjusted to improve the encoding accuracy, which is beneficial for relevant personnel to search medical image classification, and by training the constructed convolutional neural network through the above steps, a better classification model can be obtained, the problem of imbalance among classes is solved, the accuracy is higher, and the anti-interference capability is stronger, the practicability is strong.

S214: and inputting the medical image to be classified into the classification model to classify the medical image.

As shown in fig. 5, the present embodiment further provides a medical image classification system, including:

and the image classification module is used for inputting the medical images to be classified into the classification model to classify the medical images, and the preprocessing module, the processing module and the image classification module are sequentially connected. Training by inputting medical image data sets into a deep learning based convolutional neural network, the training process comprising: the feature extraction and classification and the unbalanced class distribution are optimized, so that a classification model is obtained, the medical image classification process is directly oriented to image content, the medical image is prevented from being retrieved and classified by adopting a text-based method, meanwhile, the problem of unbalanced distribution of image samples can be effectively avoided, and the classification accuracy is improved.

In some embodiments, the step of the processing module performing feature extraction and classification comprises: vectorizing the feature matrix to obtain a feature vector;

comparing the second classification result with the first classification result, and training the convolutional neural network according to the comparison result; for example: classifying the feature vectors through a full connection layer to obtain a first classification result, classifying through a softmax classifier to obtain a second classification result, performing error back propagation according to the second classification result and the second classification result, and training a convolutional neural network, so that the classification accuracy of medical images is effectively improved;

in some embodiments, further comprising:

In some embodiments, the step of the processing module performing class distribution imbalance optimization comprises:

and optimizing the number imbalance of the samples of different classes according to the weight parameters.

FL(p_t)＝-α_t(1-p_t)^γlog(p_t)

obtaining a class to be optimized according to the number of samples in different classes and a preset sample number threshold, wherein the number of samples in the class to be optimized is smaller than the feature vector number threshold, and the samples comprise: the feature vector and the category label corresponding to the feature vector;

defining a sample in the class to be optimized as an initial sample;

wherein,

for new synthetic samples, XⁱFor the initial sample, σ is uniformly distributed at [0,1 ]]Random number in the range, X^tK is the number of samples to be synthesized. By obtaining a new synthesized sample, the feature vectors of the medical image with uniform class distribution can be obtained, and the accuracy is high.

In some embodiments, the training process of the processing module further comprises: feature encoding, the step of feature encoding comprising:

adjusting encoding parameters in the convolutional neural network according to the decoding result and a sample corresponding to the decoding result so as to obtain a classification model; as shown in fig. 4, in the self-encoding process, firstly performing convolution operation, then performing maximum pooling operation on the result after convolution, where the size of a pooling window can be selected according to circumstances, data is transmitted to a full connection layer after passing through a convolution-pooling layer and a result is output, and simultaneously, the result output by the full connection layer can be decoded, a decoder finally outputs a decoding result through the full connection layer and a reverse sampling layer, and according to the decoding result and a sample corresponding to the decoding result, the encoding parameters in the convolutional neural network are adjusted to improve the encoding accuracy, which is beneficial for relevant personnel to search medical image classification, and by training the constructed convolutional neural network through the above steps, a better classification model can be obtained, the problem of imbalance among classes is solved, the accuracy is higher, and the anti-interference capability is stronger, the practicability is strong.

The present embodiments also provide a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the method as defined in any one of the above.

The present embodiment further provides an electronic terminal, including: a processor and a memory;

the memory is used for storing computer programs, and the processor is used for executing the computer programs stored by the memory so as to enable the terminal to execute the method in the embodiment.

The computer-readable storage medium in the present embodiment can be understood by those skilled in the art as follows: all or part of the steps for implementing the above method embodiments may be performed by hardware associated with a computer program. The aforementioned computer program may be stored in a computer readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.

The electronic terminal provided by the embodiment comprises a processor, a memory, a transceiver and a communication interface, wherein the memory and the communication interface are connected with the processor and the transceiver and are used for completing mutual communication, the memory is used for storing a computer program, the communication interface is used for carrying out communication, and the processor and the transceiver are used for operating the computer program so that the electronic terminal can execute the steps of the method.

In this embodiment, the Memory may include a Random Access Memory (RAM), and may also include a non-volatile Memory (non-volatile Memory), such as at least one disk Memory.

The Processor may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the Integrated Circuit may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, or a discrete hardware component.

The foregoing embodiments are merely illustrative of the principles and utilities of the present invention and are not intended to limit the invention. Any person skilled in the art can modify or change the above-mentioned embodiments without departing from the spirit and scope of the present invention. Accordingly, it is intended that all equivalent modifications or changes which can be made by those skilled in the art without departing from the spirit and technical spirit of the present invention be covered by the claims of the present invention.

Claims

1. A method of medical image classification, comprising:

constructing a convolutional neural network based on deep learning;

2. The medical image classification method according to claim 1, characterized in that the step of feature extraction and classification comprises:

vectorizing the feature matrix to obtain a feature vector;

3. A medical image classification method according to claim 1, characterized in that the step of obtaining weight parameters for different classes depending on the number of samples in the different classes comprises:

FL(p_t)＝-α_t(1-p_t)γlog(p_t)

4. A medical image classification method according to claim 1, characterized in that the step of obtaining the imbalance ratio of the samples according to the number of samples in different classes comprises:

presetting a sample quantity threshold;

5. The medical image classification method according to claim 4, characterized in that the step of obtaining a new synthetic sample from the samples in the class to be optimized comprises:

defining a sample in the class to be optimized as an initial sample;

6. The medical image classification method according to claim 5, characterized in that the step of obtaining a new synthetic sample according to the imbalance ratio comprises:

wherein,

for new synthetic samples, XⁱFor the initial sample, σ is uniformly distributed at [0,1 ]]Random number in the range, X^tK is the number of samples to be synthesized.

7. The medical image classification method according to claim 1, characterized in that the training process further comprises: feature encoding, the step of feature encoding comprising:

8. A medical image classification system, comprising:

9. A computer-readable storage medium having stored thereon a computer program, characterized in that: the computer program, when executed by a processor, implements the method of any one of claims 1 to 7.

10. An electronic terminal, comprising: a processor and a memory;

the memory is for storing a computer program and the processor is for executing the computer program stored by the memory to cause the terminal to perform the method of any of claims 1 to 7.