CN113658110A

CN113658110A - Medical image identification method based on dynamic field adaptive learning

Info

Publication number: CN113658110A
Application number: CN202110832315.2A
Authority: CN
Inventors: 刘贵松; 解修蕊; 张鸿杰; 黄鹂; 占求港; 郑余
Original assignee: Southwestern University Of Finance And Economics
Current assignee: Southwestern University Of Finance And Economics
Priority date: 2021-07-22
Filing date: 2021-07-22
Publication date: 2021-11-16

Abstract

The invention discloses a medical image identification method based on dynamic field adaptive learning, relates to the technical field of transfer learning, and solves the problem of high acquisition cost of labeled data in near-infrared brain imaging. The invention comprises the following steps: preprocessing is carried out on the basis of the ImageNet data set to obtain a source domain data set, and meanwhile, a near-infrared brain imager is used for collecting a small amount of brain medical image data to obtain a target domain data set; defining a convolutional neural network model for image recognition based on a source domain data set, training the convolutional neural network by using a back propagation algorithm, updating parameters until the network converges, and finishing the training; and substituting the target domain data set into the trained convolutional neural network to obtain the recognition output result of the brain medical image data, and testing. The invention realizes the function of dynamically and automatically searching the optimal opportunity for transfer learning, and obviously improves the classification precision and the convergence rate of the network.

Description

Medical image identification method based on dynamic field adaptive learning

Technical Field

The invention belongs to the field of transfer learning, and relates to a medical image identification method based on dynamic field adaptive learning.

Background

With the continuous development and progress of medical imaging technology and computer technology, medical image analysis has become an indispensable tool and technological means in medical research, clinical disease diagnosis and treatment. In recent years, Deep Learning (DL), in particular, a Deep Convolutional Neural Network (CNN), has rapidly developed into a research hotspot of medical image analysis, which can automatically diagnose a disease hidden in a special area from medical image big data. In the field of medical imaging, a physician or researcher usually needs to know some detailed information about a certain internal tissue organ in order to make a correct treatment decision when performing quantitative analysis, real-time monitoring and treatment planning on such tissue organ. Biomedical imaging has become an integral part of, and increasingly important in, disease diagnosis and treatment. However, due to the particularity of medical image data, it is extremely difficult to acquire a large amount of labeled medical data, and the neural network is often trained only by a small amount of labeled data. It is therefore of great interest to study how to use small amounts of tagged medical data for medical image analysis.

Transfer learning (Transfer learning) refers to a learning process in which a model learned in an old field is applied to a new field by using similarities between data, tasks, and models. In the conventional classification learning, in order to ensure that the classification model obtained by training has accuracy and high reliability, there are two basic assumptions: (1) the training sample for learning and the new test sample satisfy independent same distribution; (2) there must be enough training samples available to learn a good classification model. However, in practical applications, we find that these two conditions are often not satisfied. How to use a small amount of labeled training samples or source domain data to establish a reliable model to predict target domains with different data distributions. This is the problem to be solved by the transfer learning.

Domain Adaptation (Domain Adaptation), which is a representative method in transfer learning, refers to using information-rich source Domain samples to improve the performance of a target Domain model. Two crucial concepts in the domain adaptation problem: a source domain (source domain) represents a different domain from the test sample, but has rich supervisory information; the target domain represents the domain where the test sample is located, with no or only a few labels. The source domain and the target domain tend to belong to the same class of tasks, but are distributed differently. According to different types of target domains and source domains, the domain adaptive problem has four different scenes: unsupervised, supervised, heterogeneous distribution and multiple source domain problems. By performing domain adaptation at different stages, researchers have proposed four different domain adaptation methods:

1) and (4) sample self-adaptation, namely performing weighted resampling on the source domain samples so as to approximate the distribution of the target domain.

2) Feature adaptation, projecting the source domain and the target domain into a common feature subspace.

3) And (4) model self-adaptation, namely modifying the error function of the source domain and considering the error of the target domain.

4) And (4) performing relation self-adaptation, mining and carrying out analogy migration by utilizing the relation between the source domain and the target domain.

The maximum Mean difference mmd (maximum Mean Di spread), which is the most frequently used metric in migration learning, measures the distance between two distributions in the regenerated hilbert space, and is a nuclear learning method. Many domain adaptive methods use maximum mean difference to measure the distribution difference between the source domain and the target domain and perform knowledge migration by reducing the distribution difference. The mathematical formula is as follows:

domain Adaptation (Domain Adaptation) is a representative method in migration learning, and refers to using information-rich source Domain samples to improve the performance of a target Domain model. Two crucial concepts in the domain adaptation problem: a source domain (source domain) represents a different domain from the test sample, but has rich supervisory information; the target domain represents the domain where the test sample is located, with no or only a few labels. The source domain and the target domain tend to belong to the same class of tasks, but are distributed differently.

Deep network Adaptation, many deep learning methods develop an Adaptation Layer (Adaptation Layer) to perform the Adaptation of the source domain and target domain data. The self-adaptation can make the data distribution of the source domain and the target domain closer, thereby making the effect of the network better. Most important in deep networks is the definition of network loss, and most deep migration learning methods adopt the following loss definition mode:

L＝L_class+λL_A

where L represents the ultimate loss of the network, L_classRepresents the regular classification loss of the network over the source domain, L_AWhich represents the loss of adaptation of the network, is usually represented using the maximum mean difference MMD. And λ is a weighting parameter that trades off the two components.

As a newly developed functional brain imaging device, compared with the existing brain imaging and detection devices, the functional near infrared spectrum imaging (fNIRS) has the advantages that the instrument is noiseless during operation, the requirements of users on the conventional imaging space and time resolution are met, the anti-interference performance is good, the manufacturing cost is low, and the instrument is portable, wearable and noninvasive, and is suitable for popularization and application in places such as communities, families, schools and the like. The types of devices for which fNIRS can be used for brain health monitoring include functional magnetic resonance imaging, in particular, fNIRS uses two different wavelengths of near infrared light to measure changes in the concentration of oxygenated hemoglobin (HbO or O2Hb), deoxygenated hemoglobin (HbR or HHb), total hemoglobin (tHbO) in the corresponding brain regions during brain activity, thereby monitoring metabolic changes in neural activity within the brain. fNIRS is a method of measuring the change in the concentration of HbO and HbR by absorbing near infrared light diffusely reflected back through tissue. Near infrared light is chosen because of its relatively low absorption by substances such as water, hemoglobin, proteins, collagen, and fat.

Disclosure of Invention

The invention aims to: the invention provides a medical image recognition method based on self-adaptive learning in the dynamic field, which solves the technical scheme adopted by the invention as follows:

1. a medical image recognition method based on dynamic field adaptive learning is characterized in that: the method comprises the following steps:

s101: preprocessing is carried out on the basis of the image data set to obtain a source domain data set, and meanwhile, a near-infrared brain imager is used for collecting a small amount of brain medical image data to obtain a target domain data set;

s102: defining a convolutional neural network model for image recognition based on a source domain dataset, initializing each level parameter and training round number epoch_maxAdding a domain self-adaptive layer before an output layer of a full connection layer in the convolutional neural network;

s103: calculating the domain-adaptive loss L of a convolutional neural network_MMDAnd a classification loss L_{classification}；

S104: defining an initial population lambda₁To lambda_iAnd searching the optimal position of the adaptive parameter lambda based on the differential evolution algorithm, and based on the optimal adaptive weight parameter lambda and the domain adaptive loss L_MMDAnd a classification loss L_{classification}Training the convolutional neural network by using a back propagation algorithm, updating parameters until the network converges, and finishing the training;

s105: and substituting the target domain data set into the trained convolutional neural network to obtain the recognition output result of the brain medical image data, and testing.

Further, in S102, defining a convolutional neural network model for image recognition based on the source domain data set specifically includes: and defining the total number of the neural network layers, including a convolutional layer and a fully-connected layer, defining the parameters of each layer of the convolutional layer and the parameters of the fully-connected layer, adding an adaptive layer after the last-but-one fully-connected layer, and calculating the maximum mean difference MMD between a source domain and a target domain through the output of the adaptive layer.

Further, in S104, an adaptive loss L is constructed based on the adaptive weight parameter λ_MMDAdjusting parameters of each layer of the convolutional neural network specifically comprises: round number epoch based on preset population and population evolution stage_sAcquiring an optimal adaptive weight parameter lambda by using a differential evolution algorithm;

the loss function is defined as: l ═ L_{classification}+λL_MMDWherein L represents the net ultimate loss L_{classification}Represents the regular classification loss of the network over the source domain, L_MMDRepresents the adaptive loss of the network, here represented by the maximum mean difference MMD;

round number epoch of training phase based on preset neural network_tAnd performing iterative training on the convolutional neural network by using a back propagation algorithm until the final loss L is converged, and stopping training.

Further, the method for obtaining the adaptive weight parameter λ in the evolution phase specifically includes: initializing number and position [ lambda ] of original population₁,λ₂,λ₃,…,λ_i]And copying each individual once to form an individual pair [ lambda ]₁,λ₁,λ₂,λ₂,λ₃,λ₃,…,λ_i,λ_i]；

Performing network iterative training by using each individual pair in turn to obtain the same individual lambda_iThe deceleration speed of the two adjacent iteration loss functions is used as an individual adaptive value to evaluate the current individual lambda_iPerformance of (d);

realizing individual variation through a differential evolution algorithm, taking a difference vector of two individuals randomly selected from a population as a random variation source of a third individual, weighting the difference vector, and carrying out vector synthesis with the third individual to generate a variant individual;

randomly selecting individuals through crossing, and carrying out parameter mixing on the variant individuals and a predetermined target individual to generate test individuals; if the adaptive value of the test individual is better than that of the target individual, replacing the target individual with the test individual in the next generation, otherwise, still storing the target individual;

and (4) performing iterative training, reserving good individuals in each generation, eliminating poor individuals, and guiding the search process to approach to the global optimal solution to obtain the optimal adaptive weight parameter lambda.

Further, in S104, the variation function of the differential evolution algorithm is defined as:

wherein r is₁，r₁And r₁Is three random numbers with interval of [1, i]F is a scaling factor which is a determined constant; g represents the g-th.

Further, in the step S104, L is lost through domain adaptation_MMDAnd a classification loss L_{classification}The value of (2) completes the training, and the method comprises the following steps:

calculating the current loss L, i.e. the classification loss L_{classification}Maximum mean difference MMD domain adaptive loss L of optimal adaptive weight parameter lambda_MMDSumming;

and when the loss is greater than the expected value, iterating by using a back propagation algorithm, and adjusting the neural network parameters until the loss value is stable and the network model converges.

Further, in S105, the target domain data set is a brain medical image data set, which specifically includes: storing the trained network model parameters; and (4) carrying out precision test on the network model by using the brain medical image data set, and outputting a test result.

In summary, due to the adoption of the technical scheme, the invention has the beneficial effects that:

1. the invention provides an effective dynamic field adaptation method of a medical image to solve the problem of 'when to transfer'. The method automatically searches for a proper transfer learning opportunity in the training process, and is not a traditional fixed-domain adaptive parameter lambda method.

2. The method uses the self-adaptive weight parameter lambda to optimize the loss function, and compared with the accuracy of the traditional transfer learning method, the method not only improves the convergence speed of the algorithm, but also improves the test precision.

3. The invention provides a dynamic field self-adaptive method applicable to medical image recognition, which can assist in training a small number of marked medical image domains by means of a trained domain and has great application value.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are required to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and that for those skilled in the art, other relevant drawings can be obtained according to the drawings without inventive effort, wherein:

FIG. 1 is a flow chart of a dynamic domain adaptation method proposed by the present invention;

FIG. 2 is a schematic diagram of an ImageNet image dataset;

FIG. 3 is a schematic diagram of a medical brain image dataset acquired by the FNI RS;

FIG. 4 is a flow chart for training a neural network using a back propagation algorithm.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.

Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.

It is noted that relational terms such as "first" and "second," and the like, may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.

The features and properties of the present invention are described in further detail below with reference to examples.

Example 1

Referring to fig. 1, the present embodiment provides a medical image recognition method based on dynamic domain adaptive learning, including the following steps:

s101: the data is preprocessed based on the ImagNet dataset, an example of an image in the dataset, as shown in fig. 2, is defined as the source domain dataset. Acquiring medical image data of human brain by using a near-infrared brain imager, as shown in fig. 3, and manually marking a small amount of data to define the data as a target domain data set;

s102: defining the structure of the convolutional neural network, defining how to obtain output from the input of the neural network, using a classical convolutional neural network model such as AlexNet or VGGNet, and adding a domain adaptive layer before the output layer of the full-connection layer in the convolutional neural network;

s103: domain adaptive loss L defining convolutional neural network_MMDAnd a classification loss L_{classification}；

Where the domain adaptation loses L_MMDIs defined as:

wherein X_SAnd X_TRepresenting the source domain data set and the target domain data set, respectively, phi (—) is a kernel function that can map the original distribution to a reproducible hilbert space.

Loss of classification L_{classification}Using cross-entropy representation, defined as:

wherein

And

the label value and the predicted value of the source domain.

S104: defining an initial population lambda₁To lambda₁And copying each individual in the population once to form an individual pair lambda₁,λ₁,λ₂,λ₂,λ₃,λ₃,…,λ_i,λ_i]Performing network iterative training using each individual pair in turn to obtain the same individual lambda_iThe deceleration speed of the two adjacent iteration loss functions is used as the fitness value of the individual to evaluate the current individual lambda_iPerformance of (d); searching the optimal position of the adaptive parameter lambda based on a differential evolution algorithm, and searching the optimal position of the adaptive parameter lambda based on the optimal adaptive weight parameter lambda and the domain adaptive loss L_MMDAnd a classification loss L_{classification}Defining a loss function as：

L＝L_{classification}+λL_MMD

Training the convolutional neural network by using a back propagation algorithm, updating parameters until the network converges, and finishing the training;

s105: and storing the trained network, and loading the target domain data set of the human brain medical image to a network model for testing.

According to the above process, after the algorithm is finished, the classification accuracy rate higher than that of the method using the static weight parameters is obtained, the classification and identification accuracy rate of the brain medical image is improved by about 5%, and the method is greatly improved in the field of transfer learning.

As mentioned above in S104, the back propagation algorithm training neural network is also an important process, and the back propagation algorithm training neural network is described in detail below.

Example 2

On the basis of embodiment 1, embodiment 2 of the present invention provides a method flow for training a neural network by using a back propagation algorithm. Referring to fig. 4, the method flow includes the following steps:

s201, initializing a variable of training times to 0;

s202, selecting a part of training data in a source domain data set, namely Batch processing (Batch);

s203, obtaining a predicted value of output through forward propagation;

s204, calculating loss, and updating various parameters of the neural network through a back propagation algorithm;

s205, judging whether the training expectation is reached, if so, jumping to S208, and if not, jumping to S201;

s206, judging whether the set training times are reached, if so, jumping to S208, and if not, jumping to S207;

s207, performing 1 adding operation on the training times, and jumping to S202;

and S208, finishing the training and finishing the iteration.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents and improvements made by those skilled in the art within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

2. The medical image recognition method based on dynamic domain adaptive learning according to claim 1, characterized in that: in S102, defining a convolutional neural network model for image recognition based on a source domain dataset includes:

and defining the total number of the neural network layers, including a convolutional layer and a fully-connected layer, defining the parameters of each layer of the convolutional layer and the parameters of the fully-connected layer, adding an adaptive layer after the last-but-one fully-connected layer, and calculating the maximum mean difference MMD between a source domain and a target domain through the output of the adaptive layer.

3. The medical image recognition method based on dynamic domain adaptive learning according to claim 1, characterized in that: in the S104, an adaptive loss L is constructed based on an adaptive weight parameter lambda_MMDAdjusting parameters of each layer of the convolutional neural network specifically comprises:

round number epoch based on preset population and population evolution stage_sAcquiring an optimal adaptive weight parameter lambda by using a differential evolution algorithm;

4. The medical image recognition method based on dynamic domain adaptive learning according to claim 3, characterized in that: the method for acquiring the self-adaptive weight parameter lambda in the evolution stage specifically comprises the following steps:

initializing number and position [ lambda ] of original population₁，λ₂，λ₃，…，λ_i]And copying each individual once to form an individual pair [ lambda ]₁，λ₁，λ₂，λ₂，λ₃，λ₃，…，λ_i，λ_i]；

5. The medical image recognition method based on dynamic domain adaptive learning according to claim 1, characterized in that: in S104, the variation function of the differential evolution algorithm is defined as:

6. The medical image recognition method based on dynamic domain adaptive learning according to claim 1, characterized in that: in S104, loss L is adaptively determined through the domain_MMDAnd a classification loss L_{classification}The value of (2) completes the training, and the method comprises the following steps:

7. The medical image recognition method based on dynamic domain adaptive learning according to claim 1, characterized in that: in S105, the target domain data set is a brain medical image data set, which specifically includes: storing the trained network model parameters; and (4) carrying out precision test on the network model by using the brain medical image data set, and outputting a test result.