CN114782760B - Stomach disease picture classification system based on multitask learning - Google Patents


Info

Publication number
CN114782760B
CN114782760B (application CN202210711643.1A)
Authority
CN
China
Prior art keywords
picture
disease
stomach
classification
auxiliary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210711643.1A
Other languages
Chinese (zh)
Other versions
CN114782760A (en)
Inventor
戴捷
赖春晓
张希钢
鹿伟民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zidong Information Technology Suzhou Co ltd
Original Assignee
Zidong Information Technology Suzhou Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zidong Information Technology Suzhou Co ltd filed Critical Zidong Information Technology Suzhou Co ltd
Priority to CN202210711643.1A priority Critical patent/CN114782760B/en
Publication of CN114782760A publication Critical patent/CN114782760A/en
Application granted granted Critical
Publication of CN114782760B publication Critical patent/CN114782760B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/21 - Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 - Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/24 - Classification techniques
    • G06F18/241 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415 - Classification techniques based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G06F18/243 - Classification techniques relating to the number of classes
    • G06F18/2431 - Multiple classes
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/045 - Combinations of networks
    • G06N3/08 - Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a stomach disease picture classification system based on multitask learning, comprising: a data acquisition module for acquiring pictures and building the respective data sets; a first feature processing module for obtaining first picture features; a second feature processing module for obtaining second picture features; an auxiliary disease classification module for training an auxiliary disease classifier with the first picture features and feeding the second picture features into the trained classifier to obtain the auxiliary-disease probability; a third feature processing module for fusing that probability with the second picture features through element-by-element multiplication to obtain third picture features; and a stomach disease classification module for training a stomach disease classifier with the third picture features and feeding the stomach disease picture test set into the trained classifier to obtain the stomach disease picture classification result. By exploiting the similarity among diseases, the multitask artificial intelligence model improves the multi-label classification recognition rate on stomach pictures.

Description

Stomach disease picture classification system based on multitask learning
Technical Field
The invention relates to the technical field of image recognition, in particular to a stomach disease image classification system based on multitask learning.
Background
Gastric diseases are organic or functional diseases of the stomach with very complex causes, including physical and chemical stimulation, infection, toxins, heredity, mental factors, developmental disturbance, surgical influence and the like. Gastritis, gastric polyps, gastric ulcers, gastric cancer, bile reflux and gastric bleeding are common in clinical practice. Early-stage gastric diseases have no obvious symptoms and are therefore easily overlooked. When the stomach is diseased, its secretory and motor functions are disturbed, which may impair digestion and absorption and sometimes cause anemia, emaciation or even life-threatening conditions. Digestive diseases are the most common diseases in clinical practice, and among them gastric diseases are the most frequent. Diagnosing gastric diseases by manually analyzing stomach pictures consumes doctors' time and effort. A stomach picture classification method based on artificial intelligence can assist doctors in making more accurate judgments and reduce the probability of missing suspicious early gastric cancer.
In the field of artificial intelligence, classification methods based on a single picture are commonly used to determine whether a stomach picture contains a gastric disease. Such methods extract meaning from a single image using computer vision and machine learning algorithms. The most widespread approach at present is to model the problem as taking one image as input and applying a conventional classification algorithm to output the labels of multiple diseases.
However, existing models usually handle only the case where the input is a single picture and ignore the case of multiple pictures as input. Secondly, because having professionals annotate pictures with gastric disease labels is costly, the number of available annotated pictures containing gastric diseases is small. In addition, existing models use only gastric disease pictures and cannot exploit other related resources. As a result, the recognition rate of existing multi-label classification techniques based on stomach pictures is not high enough.
Disclosure of Invention
The invention aims to provide a stomach disease image classification system based on multitask learning, which utilizes the similarity between diseases and improves the multi-label classification recognition rate of stomach images through a multitask artificial intelligence model.
In order to solve the above technical problem, the present invention provides a stomach disease picture classification system based on multitask learning, comprising:
the data acquisition module is used for acquiring pictures and respectively making an esophageal tumor picture data set, a duodenal ulcer picture data set and a stomach disease picture data set; the stomach disease picture data set comprises a stomach disease picture training set and a stomach disease picture testing set, and the pictures in the esophagus tumor picture data set, the duodenal ulcer picture data set and the stomach disease picture training set are all labeled with corresponding disease categories;
the first feature processing module is used for extracting the image features in the esophageal tumor image data set and the duodenal ulcer image data set to obtain a first image feature;
the second feature processing module is used for extracting picture features in the stomach disease picture training set to obtain second picture features;
the auxiliary disease classification module is used for training an auxiliary disease classifier by utilizing the first picture characteristics and inputting the second picture characteristics into the trained auxiliary disease classifier to obtain the classification probability of the stomach disease picture about the auxiliary disease;
the third feature processing module is used for fusing the classification probability and the second picture feature through element-by-element multiplication to obtain a third picture feature;
and the stomach disease classification module is used for training a stomach disease classifier by using the third picture characteristic and inputting the stomach disease picture test set into the trained stomach disease classifier to obtain a stomach disease picture classification result.
As a further improvement of the invention, the esophageal tumor picture data set, the duodenal ulcer picture data set and the stomach disease picture data set each comprise a plurality of samples, and each sample comprises n pictures.
As a further improvement of the present invention, the first feature processing module includes a first picture preprocessing module, and the first picture preprocessing module includes:
a first feature extractor for extracting features of the pictures in any sample of the esophageal tumor picture data set and the duodenal ulcer picture data set to obtain picture features F;
a first feature segmentation module for partitioning the picture features F to obtain a feature-segmented picture region set R;
a first feature serialization module for splicing all the regions in R in sequence to form a sequence S.
As a further improvement of the present invention, the second feature processing module includes a second picture preprocessing module, and the second picture preprocessing module includes:
a second feature extractor for extracting features of the pictures in any sample of the stomach disease picture training set to obtain picture features F′;
a second feature segmentation module for partitioning the picture features F′ to obtain a feature-segmented picture region set R′;
a second feature serialization module for splicing all the regions in R′ in sequence to form a sequence S′.
As a further improvement of the present invention, the auxiliary disease classifier comprises:
a first Encoder layer, which adopts a pre-trained Vision Transformer framework; the pixel values of each region plus the embedding of the region's position in the whole sequence pass through the Encoder layer, and the intermediate representation H of each region is obtained through the multi-head attention mechanism of the Vision Transformer framework;
a first feature pooling module for pooling the features output by the first Encoder layer to obtain the final representation h of the picture;
a first fully connected layer for classification via softmax:
p = softmax(W1·h + b1)
wherein W1 and b1 represent the learnable weights and biases in the first fully connected layer, and p represents the classification probability output by the auxiliary disease classifier.
As a further improvement of the present invention, the training of the auxiliary disease classifier optimizes the objective:
L_aux = −Σ_{j=1..C} y_j · log(p_j)
wherein L_aux represents the cross entropy loss between the true auxiliary disease labels of the pictures in the esophageal tumor picture data set and the duodenal ulcer picture data set and the prediction results output by the auxiliary disease classifier; y_j represents the j-th element of the true auxiliary disease label; p_j represents the j-th element of the prediction result output by the auxiliary disease classifier; and C represents the total number of auxiliary disease categories.
As a further improvement of the invention, the auxiliary disease classification module inputs the sequence S′ into the trained auxiliary disease classifier to obtain the classification probability p̂ of the stomach disease pictures with respect to the auxiliary diseases.
As a further improvement of the present invention, the third feature processing module includes:
a matrix conversion module for expanding the classification probability p̂ into the disease matrix M:
M_{i,j} = p̂_c
wherein M_{i,j} represents the element in row i and column j of the disease matrix M; p̂_c represents the probability of the predicted disease class c in the classification probability p̂, with c belonging to {1, …, C}; m is the number of all regions of the multiple pictures and d is the dimension of the first Encoder layer, i.e., M has a row dimension of m and a column dimension of d;
a feature fusion module for performing element-by-element multiplication of the sequence S′ with the disease matrix M of the picture while adding the sequence S′, thereby obtaining the final picture feature representation Z:
Z = λ·(S′ ⊙ M) + S′
wherein λ represents the correlation coefficient of the different disease classes, λ belonging to [0, 1]; · represents ordinary multiplication; and ⊙ represents the multiplication of the corresponding elements of two matrices.
As a further improvement of the present invention, the gastric disease classifier comprises:
a second Encoder layer, which adopts a pre-trained Vision Transformer framework; the pixel values of each region plus the embedding of the region's position in the whole sequence pass through the Encoder layer, and the intermediate representation H′ of each region is obtained through the multi-head attention mechanism of the Vision Transformer framework;
a second feature pooling module for pooling the features output by the second Encoder layer to obtain the final representation h′ of the picture;
a second fully connected layer for classification via sigmoid:
q = sigmoid(W2·h′ + b2)
wherein W2 and b2 represent the learnable weights and biases in the second fully connected layer, and q represents the set of classification probabilities for each label of gastric disease.
As a further improvement of the present invention, the training of the gastric disease classifier optimizes the objective:
L_main = −Σ_{j=1..C′} [ ŷ_j · log(q_j) + (1 − ŷ_j) · log(1 − q_j) ]
wherein L_main represents the binary cross entropy loss between the true gastric disease labels of the pictures in the stomach disease picture data set and the prediction results output by the gastric disease classifier; ŷ_j represents the j-th element of the true gastric disease label; q_j represents the j-th element of the prediction result output by the gastric disease classifier; and C′ represents the number of gastric disease categories.
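For illustration, the sigmoid classification head and the binary cross entropy objective above can be sketched in NumPy. This is a minimal sketch, not the patented implementation: the pooled-representation dimension, the weights W2 and b2, and the seven-label setup (matching the seven gastric categories named in the description) are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gastric_head(h, W2, b2):
    """Second fully connected layer + sigmoid: one independent probability per gastric label."""
    return sigmoid(W2 @ h + b2)

def binary_cross_entropy(y, q):
    """L_main = -sum_j [ y_j*log(q_j) + (1-y_j)*log(1-q_j) ] over the C' gastric labels."""
    eps = 1e-12  # guard against log(0)
    return -np.sum(y * np.log(q + eps) + (1 - y) * np.log(1 - q + eps))

# Toy example: pooled picture representation h' of dimension 4, C' = 7 gastric labels.
rng = np.random.default_rng(3)
h = rng.standard_normal(4)
W2, b2 = rng.standard_normal((7, 4)), np.zeros(7)
q = gastric_head(h, W2, b2)
y = np.array([1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0])  # multi-label ground truth
loss = binary_cross_entropy(y, q)
```

Sigmoid (rather than softmax) is used here because a stomach picture may carry several disease labels at once, so each label is scored independently.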
The invention has the beneficial effects that: it overcomes the defects of the prior art by exploiting the correlation among diseases, using the prediction results for esophageal tumors and duodenal ulcers to assist the prediction of gastric diseases; assisting gastric disease prediction with the prediction results of related tasks (non-gastric diseases such as esophageal diseases and duodenal ulcers) improves the recognition accuracy of multi-label classification based on stomach pictures. The invention can process multi-picture input for one case. Meanwhile, the invention adopts a multitask artificial intelligence model: the auxiliary disease classifier and the gastric disease classifier can also genuinely predict the categories of the auxiliary diseases, achieving a win-win effect. Because the auxiliary diseases adopted by the invention are similar in appearance to the pictures used for gastric diseases, assisting prediction through the correlation among diseases effectively expands the labeled data and thereby relieves the burden of data annotation.
Drawings
FIG. 1 is a schematic diagram of the system architecture of the present invention;
fig. 2 is a process schematic of an embodiment of the invention.
Detailed Description
The present invention is further described below in conjunction with the following figures and specific examples so that those skilled in the art may better understand the present invention and practice it, but the examples are not intended to limit the present invention.
As described in the background, the prior art proceeds in the following steps: (1) professionals annotate a large number of pictures with gastric disease labels, each picture serving as one sample, yielding an annotated corpus of labeled samples; (2) a classification model is trained on the annotated corpus with a deep learning network (generally a multilayer convolutional neural network); (3) the classification model predicts pictures with unknown labels to obtain their gastric disease labels. During prediction, the classification model receives a single picture per input. In gastric disease classification, the labels are the seven categories gastritis, gastric polyp, gastric ulcer, gastric cancer, bile reflux, gastric bleeding and normal.
The deep learning network of the second step generally adopts a convolutional neural network or an attention-based Transformer network and includes an Encoder layer and an FC layer (fully connected layer). The Encoder layer is responsible for extracting the features of the image and generally comprises a series of convolutional layers, activation layers, pooling layers, self-attention layers and the like; commonly used Encoder layers include Transformer, VGG, ResNet, DenseNet, EfficientNet and the like. The FC layer is responsible for mapping image features to picture classes. An input image is encoded by this series of convolutional layers, activation layers, pooling layers, self-attention layers and the like to obtain the image features, which are finally classified through the fully connected layer. It is worth noting that: (1) the input of the traditional disease diagnosis model is a single picture, and multiple pictures cannot be processed simultaneously; (2) the traditional disease diagnosis model can only use the limited gastric disease annotation samples and cannot use annotation samples of other diseases.
However, there are many similarities between esophageal and duodenal diseases and gastric diseases: for example, the shape of esophageal tumors resembles that of gastric tumors, pictures of duodenal ulcers and gastric ulcers look alike, and existing data sets for esophageal tumors and duodenal ulcers are relatively plentiful. In other words, if a model can effectively predict the symptoms presented by the duodenum, it is more likely to infer the disease state of the stomach accurately. The prior art neglects this similarity between diseases of different parts, which, without increasing labor cost, can assist the prediction of gastric diseases through the prediction results of related tasks (non-gastric diseases such as esophageal diseases and duodenal ulcers) and thereby improve the recognition accuracy of multi-label classification based on stomach pictures.
Thus, with reference to fig. 1, the present invention provides a picture classification system for gastric diseases based on multitask learning, comprising:
the data acquisition module is used for acquiring pictures and respectively making an esophageal tumor picture data set, a duodenal ulcer picture data set and a stomach disease picture data set; the stomach disease picture data set comprises a stomach disease picture training set and a stomach disease picture testing set, and pictures in the esophagus tumor picture data set, the duodenal ulcer picture data set and the stomach disease picture training set are all labeled with corresponding disease types;
the first feature processing module is used for extracting picture features in the esophageal tumor picture data set and the duodenal ulcer picture data set to obtain first picture features;
the second feature processing module is used for extracting picture features in the stomach disease picture training set to obtain second picture features;
the auxiliary disease classification module is used for training an auxiliary disease classifier by utilizing the first picture characteristics and inputting the second picture characteristics into the trained auxiliary disease classifier to obtain the classification probability of the stomach disease picture about the auxiliary disease;
the third feature processing module is used for fusing the classification probability and the second picture feature through element-by-element multiplication to obtain a third picture feature;
and the stomach disease classification module is used for training the stomach disease classifier by using the third picture characteristic and inputting the stomach disease picture test set into the trained stomach disease classifier to obtain a stomach disease picture classification result.
The deep learning model for gastric disease picture classification based on multitask learning is shown in fig. 1. First, a large number of annotated esophageal tumor and duodenal ulcer picture data sets are acquired, each sample in these data sets consisting of multiple pictures, and an auxiliary disease classifier consisting of an Encoder1 layer and an FC1 layer is constructed and pre-trained (esophageal tumor and duodenal ulcer pictures as input, disease categories as output). Secondly, a small number of stomach picture samples with disease categories are acquired, and the probability that the stomach pictures indicate a disease of the esophagus or the duodenum is obtained through the auxiliary disease classifier. Finally, taking the prevalence probability of the auxiliary diseases as a clue, it is fused with the stomach pictures, and the gastric disease classifier, a newly constructed main-task classifier consisting of an Encoder2 layer and an FC2 layer, is trained. Specifically, the auxiliary disease classifier is trained with all picture samples in the esophageal tumor picture data set and the duodenal ulcer picture data set (these two tasks may be called auxiliary tasks). Then, the multiple pictures of the same case in the stomach disease data set (this task being the main task) are processed by the auxiliary disease classifier to obtain the presumed prevalence probability of the stomach pictures with respect to the esophagus or the duodenum. Finally, the stomach picture features are fused with the prevalence probability of the auxiliary diseases, and the prediction of the gastric disease categories is obtained through the gastric disease classifier formed by the Encoder2 layer and the FC2 layer.
The invention can process multi-picture input for one case and adopts a multitask artificial intelligence model: the auxiliary disease classifier and the gastric disease classifier can also genuinely predict the categories of the auxiliary diseases, achieving a win-win effect. The auxiliary diseases adopted by the invention are similar in appearance to the pictures used for gastric diseases; assisting prediction through the correlation among diseases effectively expands the labeled data and relieves the burden of data annotation.
The invention comprises the training of the auxiliary disease classifier and the training of the gastric disease classifier. Specifically, the method comprises the following steps:
First, training phase of the auxiliary disease classifier
1. Feature extraction: for the n pictures p_1, p_2, …, p_n in one sample of the given esophageal tumor picture data set and duodenal ulcer picture data set, pass them in turn to the picture feature extractor to obtain the picture features F = (F_1, …, F_n). The feature extractor adopts the currently common Residual Network (ResNet):
F_i = ResNet(p_i)
wherein p_i denotes the i-th picture.
2. Feature segmentation: partition the picture features F into a number of non-overlapping regular pixel blocks: one picture is divided into k small regions, so the n pictures are divided into n×k small regions:
R = Split(F)
wherein R is the set of picture regions after feature segmentation and Split(·) is the blocking function.
3. Feature ordering: splice all the small regions in R in sequence to form a sequence S.
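Steps 2 and 3 above can be sketched in Python with NumPy. This is a rough illustration, not the patented implementation: the region size, the feature-map shapes, and the function names (`split_into_regions`, `serialize`) are assumptions made for the example, and random arrays stand in for the ResNet features of step 1.

```python
import numpy as np

def split_into_regions(feature_map, region):
    """Partition one picture's feature map (H, W, C) into non-overlapping
    region x region blocks, each flattened to one vector (step 2)."""
    h, w, c = feature_map.shape
    assert h % region == 0 and w % region == 0
    blocks = []
    for i in range(0, h, region):
        for j in range(0, w, region):
            blocks.append(feature_map[i:i + region, j:j + region, :].ravel())
    return np.stack(blocks)  # shape (k, region*region*C)

def serialize(feature_maps, region):
    """Splice the regions of all n pictures of one sample into one sequence S (step 3)."""
    return np.concatenate([split_into_regions(f, region) for f in feature_maps])

# Toy example: n = 2 pictures, 8x8 feature maps with 3 channels, 4x4 regions,
# so each picture yields k = (8/4)*(8/4) = 4 regions and S has n*k = 8 rows.
rng = np.random.default_rng(0)
sample = [rng.standard_normal((8, 8, 3)) for _ in range(2)]
S = serialize(sample, region=4)
```

Keeping the regions of all n pictures in a single sequence is what later lets the Encoder attend across every picture of the case at once.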
4. Sequence coding: the Encoder layer adopts a pre-trained Vision Transformer framework (ViT). The pixel values of each small region plus position embedding (its position in the whole sequence) pass through the Encoder, and the intermediate representation of each small region is obtained through the multi-head attention mechanism of the multi-layer Transformer:
H = Encoder(S)
5. Feature pooling: the features output by the Encoder are fused via pooling to obtain the final representation of the picture:
h = Pooling(H)
6. Classification: classify through a fully connected layer with softmax:
p = softmax(W1·h + b1)
wherein W1 and b1 represent the learnable weights and biases in the fully connected layer, and p represents the classification probability output by the auxiliary disease classifier.
7. Loss optimization: in summary, the optimization objective of the auxiliary disease classifier is:
L_aux = −Σ_{j=1..C} y_j · log(p_j)
wherein L_aux represents the cross entropy loss, to be minimized, between the true auxiliary disease labels of the pictures in the esophageal tumor picture data set and the duodenal ulcer picture data set and the prediction results output by the auxiliary disease classifier; y_j represents the j-th element of the true auxiliary disease label (category); p_j represents the j-th element of the prediction result output by the auxiliary disease classifier; and C represents the total number of auxiliary disease categories.
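The softmax classification head and the cross entropy objective of steps 6 and 7 can be sketched in NumPy. This is a minimal sketch under illustrative assumptions: the pooled-representation dimension, the weights W1 and b1, and the three-class setup are invented for the example and are not the patent's actual parameters.

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def aux_classifier_head(h, W1, b1):
    """Fully connected layer + softmax: p = softmax(W1 @ h + b1)."""
    return softmax(W1 @ h + b1)

def cross_entropy(y, p):
    """L_aux = -sum_j y_j * log(p_j) over the C auxiliary classes."""
    return -np.sum(y * np.log(p + 1e-12))

# Toy example: pooled picture representation h of dimension 4, C = 3 auxiliary classes.
rng = np.random.default_rng(1)
h = rng.standard_normal(4)
W1, b1 = rng.standard_normal((3, 4)), np.zeros(3)
p = aux_classifier_head(h, W1, b1)
y = np.array([0.0, 1.0, 0.0])  # one-hot true auxiliary-disease label
loss = cross_entropy(y, p)
```

Softmax is appropriate here because the auxiliary task assigns exactly one disease category per sample, in contrast to the multi-label sigmoid head of the main task.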
Second, training phase of the gastric disease classifier
Picture preprocessing: the multiple stomach pictures of the same case in the given stomach disease picture data set undergo feature extraction, segmentation and ordering to form a sequence, using the same procedure as for the auxiliary disease pictures. To distinguish it from the sequence generated from the auxiliary disease pictures, the sequence obtained from the gastric disease pictures is denoted S′.
1. Auxiliary disease incidence: the sequence S′ is input into the trained auxiliary disease classifier to obtain the classification probability p̂.
2. Generating a disease matrix: the classification probability P is expanded into a disease matrix M:
M_{i,j} = P_c, for i = 1, …, L and j = 1, …, d
wherein M_{i,j} denotes the element in row i and column j of the disease matrix M; P_c denotes the probability of the predicted disease class c in the auxiliary disease classification result P, with c in {1, …, N}; L is the total number of small regions over the multiple pictures, and d is the dimension of the Encoder. Therefore, M has row dimension L and column dimension d.
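A minimal numpy sketch of this expansion step follows. The patent states only that M is an L x d matrix whose entries equal a class probability P_c; broadcasting the probability of the arg-max class is one plausible reading and is an assumption of this sketch.

```python
import numpy as np

def disease_matrix(p, num_regions, dim):
    # Broadcast the probability of the arg-max class c over an L x d matrix,
    # so M[i, j] = P_c for every region i and feature dimension j.
    c = int(np.argmax(p))
    return np.full((num_regions, dim), p[c]), c

P = np.array([0.7, 0.2, 0.05, 0.05])             # auxiliary classifier output (illustrative)
M, c = disease_matrix(P, num_regions=8, dim=48)  # L = 8 regions, d = 48
```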
3. Feature processing: the picture feature sequence S_g and the disease matrix M corresponding to the pictures are multiplied element by element, and the original picture feature sequence S_g is then added, to obtain the final picture feature representation F:
F = α_c · (S_g ⊙ M) + S_g
wherein α_c is a correlation coefficient set for the different disease categories, with a preset value range; different classes c have different settings of α_c. The symbol · denotes ordinary multiplication by the coefficient, and ⊙ denotes multiplication of the corresponding elements of two matrices. In this way the disease information of the auxiliary task is fused into the feature sequence of the main task.
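The fusion step F = α_c · (S_g ⊙ M) + S_g can be sketched directly; α = 0.5 is an illustrative coefficient, not a value given in the patent.

```python
import numpy as np

def fuse(S_g, M, alpha):
    # F = alpha * (S_g ⊙ M) + S_g : element-wise product plus a residual
    # term that keeps the original main-task features.
    return alpha * (S_g * M) + S_g

rng = np.random.default_rng(0)
S_g = rng.random((8, 48))          # picture feature sequence (illustrative)
M = np.full((8, 48), 0.7)          # disease matrix from the auxiliary task
F = fuse(S_g, M, alpha=0.5)        # alpha: hypothetical correlation coefficient
```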
4. Sequence coding: a pre-trained Encoder layer is used. The final feature representation of each small region, plus its position embedding (its position in the whole sequence), passes through the Encoder, and the intermediate representation H of each small region is obtained through the multi-head attention mechanism of a multi-layer Transformer:
H = Encoder(F)
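To make the sequence-coding step concrete, here is a minimal single-head, single-layer self-attention sketch in numpy. The patent uses a pre-trained multi-layer, multi-head Vision Transformer; the dimensions and random weights here are illustrative stand-ins, and the final mean-pooling line also illustrates the feature-pooling step that follows.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def encoder_layer(X, Wq, Wk, Wv):
    # Single-head self-attention: each region attends to all regions.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    A = softmax(Q @ K.T / np.sqrt(K.shape[-1]))  # (L, L) attention weights
    return A @ V                                 # intermediate representations H

L, d = 8, 16                                     # regions and Encoder dimension
rng = np.random.default_rng(0)
F = rng.normal(size=(L, d))                      # fused region features
pos = 0.01 * rng.normal(size=(L, d))             # stand-in position embeddings
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
H = encoder_layer(F + pos, Wq, Wk, Wv)           # one representation per region
h = H.mean(axis=0)                               # mean pooling -> final picture repr.
```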
5. Feature pooling: the final representation h of the multivariate picture is obtained by feature fusion (pooling) of the Encoder output features:
h = Pooling(H)
6. Classification: classification is performed through a fully connected layer with a sigmoid function:
ŷ = sigmoid(W·h + b)
wherein W and b denote the learnable weights and bias in the fully connected layer, and ŷ denotes the set of classification probabilities for the gastric disease labels.
7. Loss optimization: in summary, the optimization objective is:
L_main = -Σ_{j=1}^{K} [ y_j · log(ŷ_j) + (1 - y_j) · log(1 - ŷ_j) ]
wherein L_main denotes the binary cross-entropy loss between the true gastric disease labels of the pictures in the gastric disease picture dataset and the prediction results output by the gastric disease classifier; y_j denotes the j-th element of the true gastric disease label; ŷ_j denotes the j-th element of the prediction probability output by the gastric disease classifier; K denotes the number of gastric disease categories.
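A sketch of the classification head and its binary cross-entropy objective; K = 7 matches the seven gastric labels of the embodiment, while the random weights and the example label vector are hypothetical stand-ins for learned parameters and real annotations.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce(y, y_hat, eps=1e-12):
    # Binary cross-entropy summed over the K gastric-disease labels.
    return -np.sum(y * np.log(y_hat + eps) + (1 - y) * np.log(1 - y_hat + eps))

K, d = 7, 16                       # 7 gastric-disease labels, pooled-feature dim
rng = np.random.default_rng(1)
h = rng.normal(size=d)             # pooled picture representation
W, b = rng.normal(size=(K, d)), np.zeros(K)
y_hat = sigmoid(W @ h + b)         # one independent probability per label
y = np.array([1, 0, 0, 0, 0, 0, 1], dtype=float)  # hypothetical multi-label target
loss = bce(y, y_hat)
```

Unlike the softmax of the auxiliary classifier, sigmoid treats each label independently, so one picture can carry several gastric disease labels at once.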
Examples
As shown in FIG. 2, the samples in the stomach disease picture test set are input into the trained artificial intelligence model of the multitask-learning-based stomach disease picture classification system provided by the present invention to obtain the gastric disease classification results. Meanwhile, two experienced endoscopists were invited to interpret and diagnose the stomach pictures of the test set. The overall accuracy, sensitivity and positive predictive value in disease diagnosis were then obtained for the model and for the two doctors.
The diagnostic performance indices are calculated as follows: overall accuracy = number of correctly identified cases / total number of actual disease cases in the test set × 100%; sensitivity = number of correctly identified cases of a given category / actual number of cases of that category × 100%; positive predictive value = number of correctly identified cases of a given category / number of cases identified as that category by the model or endoscopist × 100%.
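The three evaluation indices can be computed as follows; the five-case prediction lists are toy data for illustration only.

```python
def overall_accuracy(y_true, y_pred):
    # Correctly identified cases / total actual cases.
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def sensitivity(y_true, y_pred, cls):
    # Correctly identified cases of cls / actual cases of cls.
    actual = [i for i, t in enumerate(y_true) if t == cls]
    hit = sum(1 for i in actual if y_pred[i] == cls)
    return hit / len(actual)

def positive_predictive_value(y_true, y_pred, cls):
    # Correctly identified cases of cls / cases identified as cls.
    called = [i for i, p in enumerate(y_pred) if p == cls]
    hit = sum(1 for i in called if y_true[i] == cls)
    return hit / len(called)

y_true = ["ulcer", "ulcer", "polyp", "normal", "polyp"]  # toy ground truth
y_pred = ["ulcer", "polyp", "polyp", "normal", "polyp"]  # toy predictions
```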
Specifically, raw data collection: the invention collects gastroscopic pictures of patients undergoing gastroscopy at an endoscopy center (including painless gastroscopy, conscious-sedation gastroscopy and ordinary gastroscopy). The pictures are taken mainly with Olympus 240, 260 and 290 series and Fujinon 560 and 580 series endoscopes (Japan). All pictures are taken in white-light non-magnification mode; optical staining such as BLI, FICE and NBI, and chemical staining such as indigo carmine and magnified acetic acid staining, are not studied for the moment. Inclusion criteria: a diagnosis of gastritis, gastric polyp, gastric ulcer, gastric cancer, bile reflux or gastric bleeding, or a normal gastroscopic mucosa picture. Exclusion criteria: (1) patients under 16 or over 95 years of age; (2) pictures whose observation is affected by abnormal blurring, artifacts, abnormal distortion and the like; (3) pictures with severe interference such as a large amount of foam, mucus lake or food.
Construction of the datasets and picture preprocessing: according to the different models they serve, the data are divided into an esophageal tumor picture dataset, a duodenal ulcer picture dataset and a stomach disease picture dataset. The esophageal tumor and duodenal ulcer picture datasets contain 20,000 samples in total, and the stomach disease picture dataset contains 3,460 samples. In the esophageal tumor and duodenal ulcer picture datasets, the numbers of cases of esophageal tumor, normal esophagus, duodenal ulcer and normal duodenum are 4,872, 5,675, 4,325 and 5,128 respectively. In the stomach disease classification dataset, the numbers of cases of gastritis, gastric polyp, gastric ulcer, gastric cancer, bile reflux, gastric bleeding and normal stomach are 650, 610, 400, 500, 200, 450 and 650 respectively. The acquired case image data undergo a series of image operations such as format conversion, size scaling, image enhancement and normalization, to ensure that the artificial intelligence model can recognize the images.
To ensure that no case is shared between the training, validation and test sets, the original cases are randomly divided in a ratio of approximately 8:1:1 into a training set (520 cases of gastritis, 488 of gastric polyp, 320 of gastric ulcer, 400 of gastric cancer, 160 of bile reflux, 360 of gastric bleeding and 520 of normal stomach), a validation set and a test set (each containing 65 cases of gastritis, 61 of gastric polyp, 40 of gastric ulcer, 50 of gastric cancer, 20 of bile reflux, 45 of gastric bleeding and 65 of normal stomach). The model parameters are trained on the training set; the validation set is then used to verify the effectiveness of the model, improve its generalization ability and optimally tune the hyper-parameters, yielding the final artificial intelligence model; finally, the performance of the model is evaluated on the test set.
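The case-level 8:1:1 split can be sketched as follows; splitting at case level guarantees that no case appears in two subsets (the seed and case names are arbitrary).

```python
import random

def split_cases(cases, seed=42):
    # Shuffle one category's cases and cut them 8:1:1 into
    # train / validation / test with no case shared across splits.
    cases = list(cases)
    random.Random(seed).shuffle(cases)
    n = len(cases)
    n_train, n_val = round(0.8 * n), round(0.1 * n)
    return cases[:n_train], cases[n_train:n_train + n_val], cases[n_train + n_val:]

gastritis = [f"case_{i}" for i in range(650)]   # 650 gastritis cases in the dataset
train, val, test = split_cases(gastritis)       # 520 / 65 / 65 cases
```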
The specific results are as follows: the overall accuracy of the doctors was 86%, versus 91.5% for the invention; the sensitivity of the doctors was 87%, versus 90% for the invention; the positive predictive value of the doctors was 89%, versus 92% for the invention. The overall accuracy, sensitivity and positive predictive value of the invention for gastric disease identification are thus clearly superior to those of experienced endoscopists. Moreover, compared with training the stomach disease classifier alone (single-task), the auxiliary disease strategy (multi-task) adopted by the invention effectively reduces the annotation burden for stomach disease: to reach a sample prediction accuracy of 85%, the single-task method needs 2,000 labeled samples, whereas the multi-task method of the invention needs only 1,500.
The above embodiments are merely preferred embodiments given to fully illustrate the present invention, and the scope of the present invention is not limited thereto. Equivalent substitutions or modifications made by those skilled in the art on the basis of the present invention all fall within the protection scope of the present invention, which is defined by the claims.

Claims (10)

1. A stomach disease picture classification system based on multitask learning, characterized in that it comprises:
the data acquisition module is used for acquiring pictures and respectively making an esophageal tumor picture data set, a duodenal ulcer picture data set and a stomach disease picture data set; the stomach disease picture data set comprises a stomach disease picture training set and a stomach disease picture testing set, and the pictures in the esophagus tumor picture data set, the duodenal ulcer picture data set and the stomach disease picture training set are all labeled with corresponding disease categories;
the first feature processing module is used for extracting picture features in the esophageal tumor picture data set and the duodenal ulcer picture data set to obtain first picture features;
the second feature processing module is used for extracting picture features in the stomach disease picture training set to obtain second picture features;
the auxiliary disease classification module is used for training an auxiliary disease classifier by utilizing the first picture characteristics and inputting the second picture characteristics into the trained auxiliary disease classifier to obtain the classification probability of the stomach disease picture about the auxiliary disease;
the third feature processing module is used for fusing the classification probability and the second picture features through element-by-element multiplication to obtain third picture features;
and the stomach disease classification module is used for training the stomach disease classifier by using the third picture characteristic and inputting the stomach disease picture test set into the trained stomach disease classifier to obtain a stomach disease picture classification result.
2. The system of claim 1, characterized in that: the esophageal tumor picture data set, the duodenal ulcer picture data set and the stomach disease picture data set each comprise a plurality of samples, and each sample comprises n pictures.
3. The system of claim 2, characterized in that: the first feature processing module comprises a first picture preprocessing module, and the first picture preprocessing module comprises:
a first feature extractor for extracting the features of the pictures in any sample in the esophageal tumor picture data set or the duodenal ulcer picture data set to obtain picture features X;
a first feature segmentation module for partitioning the picture features X to obtain a feature-segmented picture region set R;
a first feature serialization module for splicing all the regions in the picture region set R in order to form a sequence S.
4. The system of claim 3, characterized in that: the second feature processing module comprises a second picture preprocessing module, and the second picture preprocessing module comprises:
a second feature extractor for extracting the features of the pictures in any sample in the stomach disease picture training set to obtain picture features X_g;
a second feature segmentation module for partitioning the picture features X_g to obtain a feature-segmented picture region set R_g;
a second feature serialization module for splicing all the regions in the picture region set R_g in order to form a sequence S_g.
5. The system of claim 4, characterized in that: the auxiliary disease classifier comprises:
a first Encoder layer adopting a pre-trained Vision Transformer framework, in which the pixel values of each region, plus the embedding of the region's position in the whole sequence, pass through the Encoder layer, and the intermediate representation H of each region is obtained through the multi-head attention mechanism of the Vision Transformer framework;
a first feature pooling module for pooling the features output by the first Encoder layer to obtain the final representation h of the picture;
a first fully connected layer through which classification is performed, using a softmax function:
ŷ = softmax(W₁·h + b₁)
wherein W₁ and b₁ denote the learnable weights and bias in the first fully connected layer, and ŷ denotes the classification probability output by the auxiliary disease classifier.
6. The system of claim 5, characterized in that the training optimization objective of the auxiliary disease classifier is as follows:
L_aux = -Σ_{j=1}^{N} y_j · log(ŷ_j)
wherein L_aux denotes the cross-entropy loss between the true auxiliary disease labels of the pictures in the esophageal tumor picture dataset and the duodenal ulcer picture dataset and the prediction results output by the auxiliary disease classifier; y_j denotes the j-th element of the true auxiliary disease label; ŷ_j denotes the j-th element of the prediction probability output by the auxiliary disease classifier; N denotes the total number of auxiliary disease categories.
7. The system of claim 6, characterized in that: the auxiliary disease classification module inputs the sequence S_g into the trained auxiliary disease classifier to obtain the classification probability P of the stomach disease pictures with respect to the auxiliary diseases.
8. The system of claim 7, characterized in that: the third feature processing module comprises:
a matrix conversion module for expanding the classification probability P into a disease matrix M:
M_{i,j} = P_c
wherein M_{i,j} denotes the element in row i and column j of the disease matrix M; P_c denotes the probability of the predicted disease class c in the classification probability P, with c in {1, …, N}; L is the total number of regions over the multiple pictures, and d is the dimension of the first Encoder layer; that is, M has row dimension L and column dimension d;
a feature fusion module for multiplying the sequence S_g and the disease matrix M corresponding to the pictures element by element, while adding the sequence S_g, to obtain the final picture feature representation F:
F = α_c · (S_g ⊙ M) + S_g
wherein α_c denotes the correlation coefficient of the different disease categories; · denotes ordinary multiplication, and ⊙ denotes multiplication of the corresponding elements of the two matrices.
9. The system of claim 8, characterized in that: the gastric disease classifier comprises:
a second Encoder layer adopting a pre-trained Vision Transformer framework, in which the pixel values of each region, plus the embedding of the region's position in the whole sequence, pass through the Encoder layer, and the intermediate representation H_g of each region is obtained through the multi-head attention mechanism of the Vision Transformer framework;
a second feature pooling module for pooling the features output by the second Encoder layer to obtain the final representation h_g of the picture;
a second fully connected layer through which classification is performed, using a sigmoid function:
ŷ_g = sigmoid(W₂·h_g + b₂)
wherein W₂ and b₂ denote the learnable weights and bias in the second fully connected layer; ŷ_g denotes the set of classification probabilities for the gastric disease labels.
10. The stomach disease picture classification system based on multitask learning according to any one of claims 1-9, characterized in that the training optimization objective of the gastric disease classifier is:
L_main = -Σ_{j=1}^{K} [ y_j · log(ŷ_j) + (1 - y_j) · log(1 - ŷ_j) ]
wherein L_main denotes the binary cross-entropy loss between the true gastric disease labels of the pictures in the gastric disease picture dataset and the prediction results output by the gastric disease classifier; y_j denotes the j-th element of the true gastric disease label; ŷ_j denotes the j-th element of the prediction probability output by the gastric disease classifier; K denotes the number of gastric disease categories.
CN202210711643.1A 2022-06-22 2022-06-22 Stomach disease picture classification system based on multitask learning Active CN114782760B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210711643.1A CN114782760B (en) 2022-06-22 2022-06-22 Stomach disease picture classification system based on multitask learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210711643.1A CN114782760B (en) 2022-06-22 2022-06-22 Stomach disease picture classification system based on multitask learning

Publications (2)

Publication Number Publication Date
CN114782760A CN114782760A (en) 2022-07-22
CN114782760B true CN114782760B (en) 2022-09-13

Family

ID=82422505

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210711643.1A Active CN114782760B (en) 2022-06-22 2022-06-22 Stomach disease picture classification system based on multitask learning

Country Status (1)

Country Link
CN (1) CN114782760B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115082747B (en) * 2022-08-23 2023-01-06 紫东信息科技(苏州)有限公司 Zero-sample gastric ulcer classification system based on block confrontation
CN115171889B (en) * 2022-09-09 2023-01-06 紫东信息科技(苏州)有限公司 Small sample gastric tumor diagnosis system
CN115240847B (en) * 2022-09-23 2023-01-06 紫东信息科技(苏州)有限公司 Gastric ulcer diagnosis device, gastric ulcer diagnosis equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017107905A1 (en) * 2014-12-26 2017-06-29 夏小叶 Tracking and early-warning method and system involving cooperation between multiple networks
CN111243042A (en) * 2020-02-28 2020-06-05 浙江德尚韵兴医疗科技有限公司 Ultrasonic thyroid nodule benign and malignant characteristic visualization method based on deep learning
CN114504298A (en) * 2022-01-21 2022-05-17 南京航空航天大学 Physiological feature distinguishing method and system based on multi-source health perception data fusion

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017107905A1 (en) * 2014-12-26 2017-06-29 夏小叶 Tracking and early-warning method and system involving cooperation between multiple networks
CN111243042A (en) * 2020-02-28 2020-06-05 浙江德尚韵兴医疗科技有限公司 Ultrasonic thyroid nodule benign and malignant characteristic visualization method based on deep learning
CN114504298A (en) * 2022-01-21 2022-05-17 南京航空航天大学 Physiological feature distinguishing method and system based on multi-source health perception data fusion

Also Published As

Publication number Publication date
CN114782760A (en) 2022-07-22

Similar Documents

Publication Publication Date Title
CN114782760B (en) Stomach disease picture classification system based on multitask learning
Igarashi et al. Anatomical classification of upper gastrointestinal organs under various image capture conditions using AlexNet
CN113496489B (en) Training method of endoscope image classification model, image classification method and device
CN114399465B (en) Benign and malignant ulcer identification method and system
CN115082747B (en) Zero-sample gastric ulcer classification system based on block confrontation
Hicks et al. Dissecting deep neural networks for better medical image classification and classification understanding
CN115115897B (en) Multi-modal pre-trained gastric tumor classification system
CN112466466B (en) Digestive tract auxiliary detection method and device based on deep learning and computing equipment
CN114882978A (en) Stomach image processing method and system introducing picture translation information
CN113781489B (en) Polyp image semantic segmentation method and device
CN113222957A (en) Multi-class focus high-speed detection method and system based on capsule lens image
Masmoudi et al. Optimal feature extraction and ulcer classification from WCE image data using deep learning
CN112651375A (en) Helicobacter pylori stomach image recognition and classification system based on deep learning model
CN116759068A (en) Medical visual question-answering method and system for digestive system
You et al. Vocal cord leukoplakia classification using deep learning models in white light and narrow band imaging endoscopy images
CN115171889B (en) Small sample gastric tumor diagnosis system
Ali et al. A shallow extraction of texture features for classification of abnormal video endoscopy frames
Yang et al. Identification of upper GI diseases during screening gastroscopy using a deep convolutional neural network algorithm
CN116612340B (en) Peptic ulcer part classification method, device and readable storage medium
Li Recognizing polyps in wireless endoscopy images using deep stacked auto encoder with constraint image model in flexible medical sensor platform
Malviya et al. Deep Learning Based Gastro Intestinal Disease Analysis Using Wireless Capsule Endoscopy Images
CN117671573B (en) Helicobacter pylori infection state identification method and device based on gastroscope image
Nguyen et al. Automatic classification of upper gastrointestinal tract diseases from endoscopic images
CN115240847B (en) Gastric ulcer diagnosis device, gastric ulcer diagnosis equipment and storage medium
Bravo et al. Automatic polyp detection and localization during colonoscopy using convolutional neural networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP02 Change in the address of a patent holder

Address after: 215000 e3-501, artificial intelligence Industrial Park, 88 Jinjihu Avenue, Suzhou Industrial Park, Suzhou area, China (Jiangsu) pilot Free Trade Zone, Suzhou City, Jiangsu Province

Patentee after: Zidong information technology (Suzhou) Co.,Ltd.

Address before: No. 188, Shihu West Road, Wuzhong District, Suzhou City, Jiangsu Province

Patentee before: Zidong information technology (Suzhou) Co.,Ltd.

CP02 Change in the address of a patent holder