CN113793345A - Medical image segmentation method and device based on improved attention module - Google Patents
- Publication number
- CN113793345A (application CN202111042489.5A)
- Authority
- CN
- China
- Prior art keywords
- image
- target
- feature matrix
- attention
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H30/00—ICT specially adapted for the handling or processing of medical images
- G16H30/20—ICT specially adapted for the handling or processing of medical images for handling medical images, e.g. DICOM, HL7 or PACS
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10072—Tomographic images
- G06T2207/10088—Magnetic resonance imaging [MRI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G06T2207/20132—Image cropping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30016—Brain
Abstract
The invention discloses an image processing method and apparatus, a computer device, and a storage medium. The method includes: acquiring target tensor data of a target image; and inputting the target tensor data into a target neural network model, where the target neural network model includes an encoder, a decoder, and an attention module. The encoder performs down-sampling on the target tensor data to obtain a first feature matrix. The attention module performs dilated (hole) convolution on the first feature matrix to obtain a second feature matrix; determines an original attention map from the second feature matrix; determines a hole attention map from the original attention map; determines a third feature matrix from the first feature matrix and the hole attention map; and outputs the third feature matrix to the decoder. The decoder performs up-sampling on the third feature matrix to obtain an image segmentation result. Segmentation feedback of the target image is then performed according to the output of the target neural network model. The method can improve the accuracy of image segmentation.
Description
Technical Field
Embodiments of the present invention relate to machine learning technology, and in particular to an image processing method and apparatus, a computer device, and a storage medium.
Background
Medical image segmentation is an important task in computer-aided diagnosis, but segmentation remains challenging because target objects often have irregular shapes, fuzzy boundaries, and the like. Accurate and reliable segmentation methods are therefore needed to identify these complex target objects at different scales.
With the development of deep convolutional neural networks (CNNs), U-Net-style networks and schemes that fuse U-Net with an attention mechanism have emerged. However, current U-Net variants combined with an attention mechanism cannot accurately segment targets of different shapes and scales in medical images, and their segmentation accuracy is low.
Disclosure of Invention
The invention provides an image processing method and apparatus, a computer device, and a storage medium, which are used to improve the accuracy of image segmentation.
In a first aspect, an embodiment of the present invention provides an image processing method, including:
acquiring target tensor data of a target image;
inputting the target tensor data into a target neural network model, wherein the target neural network model comprises an encoder, a decoder and an attention module;
the encoder is configured to perform down-sampling on the target tensor data to obtain a first feature matrix; the attention module is configured to perform dilated (hole) convolution on the first feature matrix to obtain a second feature matrix, determine an original attention map from the second feature matrix, determine a hole attention map from the original attention map, determine a third feature matrix from the first feature matrix and the hole attention map, and output the third feature matrix to the decoder; the decoder is configured to perform up-sampling on the third feature matrix to obtain an image segmentation result;
and performing segmentation feedback of the target image according to the output of the target neural network model.
In a second aspect, an embodiment of the present invention further provides an image processing apparatus, including:
the target image tensor acquisition module is used for acquiring target tensor data of a target image;
the segmentation module is used for inputting the target tensor data into a target neural network model, and the target neural network model comprises an encoder, a decoder and an attention module;
the encoder is configured to perform down-sampling on the target tensor data to obtain a first feature matrix; the attention module is configured to perform dilated (hole) convolution on the first feature matrix to obtain a second feature matrix, determine an original attention map from the second feature matrix, determine a hole attention map from the original attention map, determine a third feature matrix from the first feature matrix and the hole attention map, and output the third feature matrix to the decoder; the decoder is configured to perform up-sampling on the third feature matrix to obtain an image segmentation result;
and the output module is used for performing segmentation feedback on the target image according to the output of the target neural network model.
In a third aspect, an embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the image processing method according to the embodiment of the present invention when executing the computer program.
In a fourth aspect, embodiments of the present invention further provide a storage medium containing computer-executable instructions which, when executed by a computer processor, perform the image processing method according to the embodiments of the present invention.
The image processing method provided by the embodiments of the invention acquires target tensor data of a target image and inputs it into a target neural network model. The encoder of the model performs down-sampling on the target tensor data to obtain a first feature matrix; the attention module performs dilated (hole) convolution on the first feature matrix to obtain a second feature matrix, determines an original attention map from the second feature matrix, determines a hole attention map from the original attention map, determines a third feature matrix from the first feature matrix and the hole attention map, and outputs the third feature matrix to the decoder; the decoder performs up-sampling on the third feature matrix to obtain an image segmentation result; and segmentation feedback of the target image is performed according to the output of the model. Compared with current U-Net variants combined with an attention mechanism, whose segmentation accuracy is low, the attention module here processes the first feature matrix of the target image with dilated convolution. Dilated convolution enlarges the receptive field, so the region of the target image that interests the user can be determined more accurately and segmented accordingly, improving the accuracy of image segmentation.
Drawings
FIG. 1 is a flowchart of an image processing method according to a first embodiment of the present invention;
FIG. 2 is a schematic structural diagram of an image processing apparatus according to a third embodiment of the present invention;
fig. 3 is a schematic structural diagram of a computer device in the fourth embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It is to be further noted that, for the convenience of description, only a part of the structure relating to the present invention is shown in the drawings, not the whole structure.
Example one
Fig. 1 is a flowchart of an image processing method according to an embodiment of the present invention. The method is applicable to segmenting a user's region of interest in an image, in particular a region of interest such as a lesion in a medical image, and may be executed by a computer device that provides an image segmentation function, such as a personal computer, laptop, tablet, or server. The method specifically comprises the following steps:
and step 110, acquiring target tensor data of the target image.
The target image is the image to be segmented, and the target tensor data is the target image expressed in tensor form. For example, the target tensor data may have shape 1 × 128 × 128, representing a single color channel of the target image with 128 pixels in the horizontal direction and 128 pixels in the vertical direction.
On the basis of the above embodiment, acquiring target tensor data of a target image includes:
preprocessing the target image according to a preset preprocessing mode to obtain a preprocessed image, where the preprocessing mode includes image cropping and image normalization; and determining the target tensor data from the preprocessed image.
The cropping mode crops images of different sizes to a common size, and the normalization mode converts images in different formats into a unified file format, so that preprocessed images of uniform specification are obtained. Because every preprocessed image then contains the same number of pixels in the horizontal and vertical directions and the same number of color channels, the target tensor data derived from it also has the same form. Such preprocessed target tensor data suits the target neural network model and can improve its segmentation accuracy.
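A minimal sketch of such a preprocessing step, assuming a simple center crop to 128 × 128 and min-max intensity normalization (the patent does not fix these exact choices; the function name and parameters are ours):

```python
import numpy as np

def preprocess(image: np.ndarray, size: int = 128) -> np.ndarray:
    """Sketch of the preprocessing described above: center-crop the image to
    size x size pixels, min-max normalize intensities to [0, 1], and add a
    channel axis so the result has shape 1 x size x size. The crop and
    normalization details are assumptions, not the patent's exact spec."""
    h, w = image.shape[:2]
    top, left = (h - size) // 2, (w - size) // 2
    crop = image[top:top + size, left:left + size].astype(np.float32)
    lo, hi = crop.min(), crop.max()
    if hi > lo:
        crop = (crop - lo) / (hi - lo)  # min-max normalization to [0, 1]
    return crop[np.newaxis, ...]       # shape (1, size, size)

tensor = preprocess(np.random.rand(192, 160))
print(tensor.shape)  # (1, 128, 128)
```

The resulting 1 × 128 × 128 tensor matches the shape used for the target tensor data in this embodiment.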
On the basis of the above embodiment, the target image is a medical image, and the medical image includes: computed tomography (CT) images, nuclear magnetic resonance (MR) images, and images captured with a high-definition camera.
The high-definition camera can be a high-definition camera device for the medical field. For example, it may be the image acquisition equipment of the DRIVE eye disease (retinal) dataset: images are acquired with a Canon CR5 non-mydriatic 3CCD camera at a 45-degree field of view, captured at 8 bits per color plane at 768 × 584 pixels, with a circular field of view about 540 pixels in diameter in each image.
On the basis of the above embodiment, acquiring target tensor data of a target image includes:
desensitizing the target image to obtain a desensitized image;
target tensor data for the target image is determined from the desensitized image.
In this embodiment, the nuclear magnetic resonance scans of suspected clinical Parkinson's disease patients are desensitized data in nii.gz format: information such as the patient's name cannot be read, no important privacy is involved, and the information security of the user is ensured to the maximum extent.
And 120, inputting the target tensor data into the target neural network model.
Wherein the target neural network model includes an encoder, a decoder, and an attention module.
The encoder is configured to perform down-sampling on the target tensor data to obtain a first feature matrix; the attention module is configured to perform dilated (hole) convolution on the first feature matrix to obtain a second feature matrix, determine an original attention map from the second feature matrix, determine a hole attention map from the original attention map, determine a third feature matrix from the first feature matrix and the hole attention map, and output the third feature matrix to the decoder; and the decoder is configured to perform up-sampling on the third feature matrix to obtain an image segmentation result.
The encoder comprises a plurality of convolution blocks. The convolutional layer of each block is followed by a ReLU activation function and a batch normalization (BN) layer, after which a max-pooling operation down-samples with stride 2. After each down-sampling step, the number of feature channels doubles. The first feature matrix is obtained after passing through multiple convolution blocks.
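As a rough illustration of the shape bookkeeping this encoder implies (shape-preserving convolution blocks, stride-2 max-pooling, channel doubling), here is a small sketch; the initial channel count of 64 and the four blocks are illustrative assumptions, not numbers the patent states:

```python
def encoder_shapes(h=128, w=128, c0=64, blocks=4):
    """Shape progression through the encoder sketched above: each block applies
    conv + ReLU + BN (shape-preserving with padding) and then a stride-2
    max-pool, halving H and W while doubling the channel count."""
    shapes = []
    c = c0
    for _ in range(blocks):
        shapes.append((c, h, w))          # feature-map shape after the conv block
        h, w, c = h // 2, w // 2, c * 2   # stride-2 pooling; channels double next block
    return shapes

for s in encoder_shapes():
    print(s)
# (64, 128, 128), (128, 64, 64), (256, 32, 32), (512, 16, 16)
```

The last shape is what reaches the attention module at the bottom of the "U".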
In one implementation, a data-calling model may be constructed from the data characteristics of the clinical brain nuclear magnetic resonance images used for training. Specifically: an account and password with partial user rights are obtained from the hospital's PACS system, and the nii.gz computed tomography images of suspected patients in the PACS system, together with the corresponding label data, are obtained through a communication protocol and stored.
During storage, the brain nuclear magnetic resonance scans may have different numbers of slices depending on the scan slice thickness, and are stored per patient for the head. The encoder down-samples the brain nuclear magnetic resonance scan to obtain the first feature matrix.
After the first feature matrix is obtained, it is input to the attention module (also called the AHA module), which optimizes the first feature matrix and outputs the third feature matrix.
The decoder comprises convolution blocks symmetric to those of the encoder, and restores the original image resolution by up-sampling the third feature matrix.
In one implementation, the attention module performs dilated convolution on the first feature matrix to obtain second feature matrices as follows: several dilated convolutions are applied to the first feature matrix separately, yielding a plurality of second feature matrices, one per dilated convolution. Correspondingly, the original attention map is determined from the second feature matrices by multiplying them with one another and determining the original attention map from the matrix-multiplication result. Correspondingly, the hole attention map is determined from the original attention map by a transitive-closure operation. Correspondingly, the third feature matrix is determined by multiplying the first feature matrix by the hole attention map.
In one implementation, the hole high-order attention matrix is generated as follows. At the bottom of the encoder, the feature map (the first feature matrix) X ∈ R^{H×W×C} is obtained from the previous stage, where H, W, and C denote the height, width, and number of channels of the feature map. X is sent to four shared-weight dilated convolution layers with dilation rates r = 1, 2, 4, and 8, generating multi-scale feature maps (the second feature matrices) X_r, r = 1, 2, 4, 8. The multi-scale feature maps are scale-transformed by a 1 × 1 convolution and dot-multiplied with one another according to formula one to obtain the preliminary attention matrix (the original attention map) A.
According to the notion of transitive closure in graph theory, a high-order attention matrix A* can be obtained from the original attention map A. The calculation proceeds as follows.
First, the thresholded attention matrix (the hole attention map) Ā is obtained according to formula two, which sets an entry of Ā to 1 when the corresponding entry of A exceeds the threshold δ and to 0 otherwise, where δ is a hyper-parameter threshold that may be set to 0.5 in the embodiment of the present application.
Then, according to the transitive-closure theory, the transitive closure of the hole attention map is computed according to formula three to obtain the high-order attention matrix A*, where M denotes the order of the high-order attention. The first feature map is weighted by the high-order attention matrix to obtain the third feature matrix, thereby filtering out the influence of noise and strengthening useful information.
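The thresholding and transitive-closure steps above can be sketched in a few lines of NumPy. This is a minimal illustration under our own assumptions (positions flattened to N = H·W, a small order M, and a plain matrix weighting of the first feature matrix), not the patent's exact formulas one to three:

```python
import numpy as np

def high_order_attention(F, A, delta=0.5, M=3):
    """Hedged sketch of the hole high-order attention step.
    F: first feature matrix, shape (N, C), N = H*W flattened positions.
    A: preliminary (original) attention map, shape (N, N), entries in [0, 1].
    Thresholding at delta gives the hole attention map; the union of boolean
    matrix powers up to order M approximates its transitive closure
    (reachability in at most M hops); the closure then reweights F."""
    Abar = A > delta                 # thresholded (hole) attention map, boolean
    closure = Abar.copy()
    power = Abar.copy()
    for _ in range(M - 1):           # accumulate Abar, Abar^2, ..., Abar^M
        power = (power.astype(int) @ Abar.astype(int)) > 0  # one more hop
        closure = closure | power
    return closure.astype(F.dtype) @ F  # weight the first feature matrix

rng = np.random.default_rng(0)
F = rng.standard_normal((4, 2))
A = np.array([[0.9, 0.6, 0.1, 0.1],
              [0.1, 0.9, 0.6, 0.1],
              [0.1, 0.1, 0.9, 0.6],
              [0.1, 0.1, 0.1, 0.9]])
out = high_order_attention(F, A)
print(out.shape)  # (4, 2)
```

With this chain-structured toy A, position 0 attends only to positions 0 and 1 directly, but the order-3 closure also reaches positions 2 and 3, which is exactly the longer-range context the high-order mechanism is meant to capture.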
The above-mentioned attention module can be embedded into any U-shaped encoder-decoder neural network model for medical image segmentation. To enlarge the limited receptive fields of the convolution layers in the down-sampling stage and the deconvolution layers in the up-sampling stage, multi-scale dilated convolution is adopted to effectively capture more global information in high dimensions. In addition, the high-order attention mechanism constructs a high-order attention feature map through the transitive closure of the graph model, so as to obtain stronger relevant context features.
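To see why dilation enlarges the receptive field, consider a 1-D sketch: a 3-tap kernel with dilation rate r covers a window of 2r + 1 samples using the same three parameters. The helper below is purely illustrative (the patent works in 2-D, and the function is ours):

```python
import numpy as np

def dilated_conv1d(x, kernel, r):
    """1-D dilated (hole) convolution sketch: kernel taps are spaced r samples
    apart, so a 3-tap kernel spans 2*r + 1 input samples. Zero padding keeps
    the output the same length as the input."""
    k = len(kernel)
    pad = r * (k // 2)
    xp = np.pad(x, pad)
    return np.array([sum(kernel[j] * xp[i + j * r] for j in range(k))
                     for i in range(len(x))])

x = np.zeros(9)
x[4] = 1.0                                   # unit impulse
for r in (1, 2, 4):                          # the module uses r = 1, 2, 4, 8
    y = dilated_conv1d(x, [1.0, 1.0, 1.0], r)
    print(r, np.nonzero(y)[0])               # response spans 2*r + 1 samples
```

Stacking several such rates (r = 1, 2, 4, 8, as in the module) lets the same-sized kernels aggregate context at several scales at once.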
Optionally, the segmentation algorithm used in the embodiment of the present invention may be a threshold segmentation algorithm, which removes the influence of peripheral noise, improves the quality of the normalized image, and improves the robustness of the U-shaped neural network model based on the improved attention module.
And step 130, performing segmentation feedback of the target image according to the output of the target neural network model.
On the basis of the above embodiment, performing segmentation feedback of the target image according to the output of the target neural network model includes: outputting the target image, and determining a highlighted region of the target image according to the image segmentation result.
The user can import the target image through the computer device; the device segments the target image according to the method above and then displays the target image together with the segmentation result. The segmented regions of interest may be displayed with highlighting.
The image processing method provided by the embodiments of the invention acquires target tensor data of a target image and inputs it into a target neural network model. The encoder of the model performs down-sampling on the target tensor data to obtain a first feature matrix; the attention module performs dilated (hole) convolution on the first feature matrix to obtain a second feature matrix, determines an original attention map from the second feature matrix, determines a hole attention map from the original attention map, determines a third feature matrix from the first feature matrix and the hole attention map, and outputs the third feature matrix to the decoder; the decoder performs up-sampling on the third feature matrix to obtain an image segmentation result; and segmentation feedback of the target image is performed according to the output of the model. Compared with current U-Net variants combined with an attention mechanism, whose segmentation accuracy is low, the attention module here processes the first feature matrix of the target image with dilated convolution. Dilated convolution enlarges the receptive field, so the region of the target image that interests the user can be determined more accurately and segmented accordingly, improving the accuracy of image segmentation.
On the basis of the above embodiment, before inputting the tensor data into the target neural network model, the method further includes:
acquiring training tensor data of a training image of a training set; and training the target neural network model according to the training tensor data so as to optimize the network parameters of the target neural network model.
The training images can be obtained by interfacing with the hospital's Picture Archiving and Communication System (PACS), and the training set can be expanded from the images so obtained. The expansion modes include random flipping, random rotation, random translation, random cropping, and the addition of Gaussian noise. Training the target neural network model with the expanded training tensor data reduces the model's sensitivity to perturbations such as rotation, translation, and Gaussian noise, further improving the robustness of the U-shaped neural network model based on the improved attention module.
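A hedged NumPy sketch of such an expansion pipeline (the parameter ranges are illustrative assumptions, and the random-cropping mode is omitted for brevity):

```python
import numpy as np

def augment(image, rng):
    """Sketch of the expansion modes listed above: random flip, random
    90-degree rotation, random translation, and additive Gaussian noise.
    The probabilities, shift range, and noise scale are our own choices,
    not values from the patent."""
    if rng.random() < 0.5:
        image = np.fliplr(image)                        # random flip
    image = np.rot90(image, k=rng.integers(0, 4))       # random rotation
    dy, dx = rng.integers(-8, 9, size=2)
    image = np.roll(image, (dy, dx), axis=(0, 1))       # random translation
    image = image + rng.normal(0.0, 0.01, image.shape)  # Gaussian noise
    return image

rng = np.random.default_rng(42)
aug = augment(np.ones((128, 128)), rng)
print(aug.shape)  # (128, 128)
```

Each call produces a differently perturbed copy of the input, so one stored scan can contribute many training samples.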
Example two
An embodiment of the present invention provides an image processing method, which is used to further describe the above embodiment, and includes:
and step E1, storing medical image information related to the suspected Parkinson patient. The medical imaging information includes brain magnetic resonance imaging.
In clinical practice, head magnetic resonance images are standard-format imaging data that have been desensitized and stored in nii format.
And E2, preprocessing the target image according to a preset preprocessing mode to obtain a preprocessed image, and determining target tensor data according to the preprocessed image.
The preprocessing method comprises the following steps:
And step T1, acquiring the model of the scanner used for the brain nuclear magnetic resonance imaging, adjusting the nuclear magnetic resonance image accordingly to obtain a standard image to be detected, and normalizing it to obtain a normalized image to be detected.
In this example, all magnetic resonance examinations were performed on a clinical 3.0-T magnetic resonance scanner (Discovery™ MR750, GE Healthcare) equipped with an 8-channel head matrix coil.
In step T2, because the pixel proportion of the region to be segmented is small, in this embodiment the original image is center-cropped, the middle region is taken for further processing, and the brain nuclear magnetic resonance image to be measured is converted into Tensor data as the preprocessed data.
The shape of the Tensor data is 1 × 128 × 128.
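The center-cropping and tensor-shaping described above can be sketched as follows (NumPy stands in for the PyTorch Tensor API; the min-max normalisation inside `to_tensor_data` is an assumption for illustration):

```python
import numpy as np

def center_crop(img, size=128):
    # Keep the central size x size window of a 2-D slice.
    h, w = img.shape
    top, left = (h - size) // 2, (w - size) // 2
    return img[top:top + size, left:left + size]

def to_tensor_data(img):
    # Min-max normalise to [0, 1] and add a channel axis -> (1, H, W).
    img = img.astype(np.float32)
    img = (img - img.min()) / (img.max() - img.min() + 1e-8)
    return img[np.newaxis, ...]

raw = np.random.default_rng(0).random((256, 256))  # stand-in slice
tensor_data = to_tensor_data(center_crop(raw))     # shape (1, 128, 128)
```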
And E3, training the neural network model as a medical image segmentation model.
Training the U-shaped neural network model based on the improved attention module comprises the following steps:
And step S1, constructing a data-calling model according to the data characteristics of the clinical brain nuclear magnetic resonance images used for training. Specifically, the method comprises the following steps:
the method comprises the steps of firstly obtaining an account number and a password with partial user authority from a PACS system of a hospital, obtaining a Nii.gz computed tomography image of a suspected patient in the PACS system and corresponding label data through a communication protocol, and storing the image.
In addition, during storage, the brain nuclear magnetic resonance scan images may have different numbers of slices depending on the scan slice thickness, and the head scans are stored per patient.
Step S2, inputting the image into a down-sampling feature extraction network to obtain a first feature matrix;
And step S3, correcting the first feature matrix by using a hole high-rank attention mechanism to obtain a second feature matrix.
Step S3 includes the following sub-steps: step S3-1, performing hole convolution on the first feature matrix with a plurality of hole convolutions to obtain a plurality of second feature matrices, each hole convolution corresponding to one second feature matrix; step S3-2, performing mutual matrix multiplication on the second feature matrices to obtain a matrix multiplication result, and determining an original attention map from the matrix multiplication result; step S3-3, performing a transitive closure operation on the original attention map to obtain a hole attention map; and step S3-4, multiplying the first feature matrix by the hole attention map to obtain a third feature matrix.
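The four sub-steps above can be sketched with a toy NumPy stand-in. This is not the patent's exact construction: the outputs of the hole (dilated) convolutions are replaced by random 6 × 6 matrices, the original attention map is taken as a row-softmax of their product, the transitive closure is computed Warshall-style on a thresholded boolean copy of the map, and both the threshold and the element-wise recombination are illustrative assumptions:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def transitive_closure(adj):
    # Warshall-style boolean transitive closure of a thresholded attention map.
    reach = adj.copy()
    for k in range(reach.shape[0]):
        reach = reach | (reach[:, k:k + 1] & reach[k:k + 1, :])
    return reach

rng = np.random.default_rng(1)
f_a, f_b = rng.random((6, 6)), rng.random((6, 6))  # stand-ins for two second
                                                   # feature matrices (S3-1)
original_attn = softmax(f_a @ f_b.T)               # S3-2: matrix product -> map
adj = original_attn > original_attn.mean()         # threshold before closure
closed = transitive_closure(adj)                   # S3-3: transitive closure
hole_attn = original_attn * closed                 # keep weights on closed pairs
first_feat = rng.random((6, 6))
third_feat = first_feat * hole_attn                # S3-4: element-wise product
```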
And step S4, outputting the third feature matrix obtained in the previous step to an up-sampling segmentation network to obtain a segmentation result.
And step S5, training the U-shaped neural network model based on the improved attention module to obtain the trained U-shaped neural network model based on the improved attention module as the hole attention segmentation neural network model.
In step S5-1, the original brain nuclear magnetic resonance image is preprocessed and converted into Tensor data as training data. Specifically, the method comprises the following steps:
the method comprises the steps of extracting a brain nuclear magnetic resonance scanning image pixel value, and carrying out normalization and pixel value truncation on the nuclear magnetic resonance image pixel, so that the influence of peripheral noise is removed, and the model robustness is improved. The middle brain image with length and width of 128 is retained using the center cropping method. And the pixel matrix is converted into the sensor Tensor data by using the sensor API in the PyTorch framework, so that the GPU can be used for parallel calculation conveniently.
And step S5-2, expanding the normalized image by a preset data expansion method to obtain an expanded image.
The data expansion methods comprise random flipping, random rotation, random translation, random cropping, and adding Gaussian noise.
Step S5-3, the expanded image and the label data are input as the processing result into the U-shaped neural network model based on the improved attention module.
In this embodiment, the relevant parameters for training the hole attention segmentation neural network model are set as follows: the batch size is 48, the number of epochs is 100, the gradient descent method is stochastic gradient descent (SGD), the learning rate follows an Adam-style optimizer strategy with an initial value of 0.001, the momentum is 0.9, and the decay rate is 3e-5.
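The reported hyperparameters can be collected into a plain config for reference. The exact decay schedule is not given in the text, so the helper below assumes a hypothetical per-epoch exponential decay:

```python
# Training hyperparameters reported in this embodiment.
config = {
    "batch_size": 48,
    "epochs": 100,
    "optimizer": "SGD",     # stochastic gradient descent with momentum
    "initial_lr": 1e-3,     # Adam-style learning-rate strategy per the text
    "momentum": 0.9,
    "decay_rate": 3e-5,
}

def lr_at_epoch(epoch, initial_lr=1e-3, decay_rate=3e-5):
    # Hypothetical exponential decay; the embodiment only states the rate,
    # not the schedule formula.
    return initial_lr * (1.0 - decay_rate) ** epoch
```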
In step E4, the region of interest to the user is segmented from the target image (medical image) input by the user.
In this embodiment, the activation function is the commonly used ReLU; the user attention region is the key region that the model must learn to segment, and in this embodiment it is the substantia nigra pars compacta (SNpc).
And E5, displaying the nuclear magnetic resonance scanning image, displaying the model segmentation result in a red highlight area, and assisting a doctor in diagnosis.
All data in this example were obtained by scanning with a 3.0-T magnetic resonance scanner (model: Discovery™ MR750, GE Healthcare). There are 188 brain nuclear magnetic resonance screening samples in total, specifically 140 from Parkinson's disease patients and 48 from healthy subjects. The lesion area of Parkinson's disease is the substantia nigra pars compacta, so the segmentation task does not need to account for the distinction between Parkinson's patients and healthy subjects; all segmentation labels were independently annotated by professional radiologists with eight years of clinical experience.
The data are divided into two independent sets: a training set for the algorithm-development process and a validation set for algorithm verification. The training set includes head nuclear magnetic resonance scan images of 152 subjects, and the validation set includes those of 36 subjects.
As verified by the above method or device, on the validation data the U-shaped neural network model based on the improved attention module achieves a Dice similarity coefficient (DSC) of 0.8769, an accuracy (ACC) of 0.9992, a sensitivity (SE) of 0.8935, a specificity (SP) of 0.9995, and an area under the receiver operating characteristic (ROC) curve (AUC) of 0.9465.
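The reported pixel-wise metrics can be reproduced from confusion-matrix counts, as in the minimal NumPy sketch below on toy masks (AUC additionally needs the soft prediction scores and is omitted):

```python
import numpy as np

def seg_metrics(pred, gt):
    # Confusion-matrix counts over binary masks of identical shape.
    tp = np.sum((pred == 1) & (gt == 1))
    tn = np.sum((pred == 0) & (gt == 0))
    fp = np.sum((pred == 1) & (gt == 0))
    fn = np.sum((pred == 0) & (gt == 1))
    dsc = 2 * tp / (2 * tp + fp + fn)   # Dice similarity coefficient
    acc = (tp + tn) / pred.size         # accuracy
    se = tp / (tp + fn)                 # sensitivity (recall)
    sp = tn / (tn + fp)                 # specificity
    return dsc, acc, se, sp

gt = np.zeros((8, 8), dtype=int)
gt[2:6, 2:6] = 1                       # 4 x 4 ground-truth region
pred = np.zeros((8, 8), dtype=int)
pred[3:6, 2:6] = 1                     # prediction misses one row
dsc, acc, se, sp = seg_metrics(pred, gt)  # dsc = 6/7, se = 0.75, sp = 1.0
```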
Meanwhile, experiments show that testing a batch of 32 images takes about 3 seconds in total and testing all images of one patient takes about 30 seconds, so the device has high time efficiency.
In summary, compared with manual diagnosis, the medical image segmentation apparatus based on the improved attention module and the corresponding medical image segmentation method achieve high accuracy and high efficiency on medical image segmentation tasks and approach the diagnostic level of specialist doctors, so the invention is particularly suitable for settings with scarce primary medical resources and too few specialists. In addition, the apparatus can effectively assist doctors in making rapid diagnoses and helps improve the diagnostic efficiency of hospital imaging departments.
Example three
Fig. 2 is a schematic structural diagram of an image processing apparatus according to a third embodiment of the present invention. The apparatus is applicable to segmenting a region of interest to the user from an image, especially a region such as a lesion in a medical image. The method may be executed by a computer device that provides the image segmentation function for the user; the computer device may be a personal computer, a laptop, a tablet computer, or a server. The apparatus includes: a target image tensor acquisition module 210, a segmentation module 220, and an output module 230.
A target image tensor acquisition module 210, configured to acquire target tensor data of a target image;
a segmentation module 220 for inputting the target tensor data to a target neural network model, the target neural network model including an encoder, a decoder and an attention module;
the encoder is used for performing down-sampling according to the target tensor data to obtain a first feature matrix; the attention module is used for performing hole convolution according to the first feature matrix to obtain a second feature matrix; determining an original attention map according to the second feature matrix; determining a hole attention map according to the original attention map; determining a third feature matrix according to the first feature matrix and the hole attention map; and outputting the third feature matrix to the decoder; the decoder is used for performing up-sampling according to the third feature matrix to obtain an image segmentation result;
and an output module 230, configured to perform segmentation feedback on the target image according to the output of the target neural network model.
On the basis of the above embodiment, the segmentation module 220 is configured to:
respectively performing hole convolution on the first feature matrix with a plurality of hole convolutions to obtain a plurality of second feature matrices, wherein each hole convolution corresponds to one second feature matrix;
performing mutual matrix multiplication on the plurality of second feature matrices to obtain a matrix multiplication result, and determining an original attention map according to the matrix multiplication result;
performing a transitive closure operation on the original attention map to obtain a hole attention map;
and multiplying the first feature matrix by the hole attention map to obtain a third feature matrix.
On the basis of the foregoing embodiment, the target image tensor acquisition module 210 is configured to:
preprocessing a target image according to a preset preprocessing mode to obtain a preprocessed image, wherein the preprocessing mode comprises image cropping and image normalization;
target tensor data is determined from the preprocessed image.
In the above embodiment, the target image is a medical image, and the medical image includes: a computed tomography image, a nuclear magnetic resonance image, or an image captured by a high-definition camera.
On the basis of the above embodiment, the output module 230 is configured to:
and outputting the target image, and determining a highlight area of the target image according to the image segmentation result.
On the basis of the foregoing embodiment, the target image tensor acquisition module 210 is further configured to:
desensitizing the target image to obtain a desensitized image;
target tensor data for the target image is determined from the desensitized image.
On the basis of the above embodiment, the training device further comprises a training module. The training module is used for:
acquiring training tensor data of a training image of a training set;
and training the target neural network model according to the training tensor data so as to optimize the network parameters of the target neural network model.
In the image processing apparatus provided by the embodiment of the present invention, the target image tensor acquisition module 210 acquires target tensor data of a target image; the segmentation module 220 inputs the target tensor data into the target neural network model, whose encoder performs down-sampling according to the target tensor data to obtain a first feature matrix; the attention module of the target neural network model performs hole convolution according to the first feature matrix to obtain a second feature matrix, determines an original attention map according to the second feature matrix, determines a hole attention map according to the original attention map, determines a third feature matrix according to the first feature matrix and the hole attention map, and outputs the third feature matrix to the decoder; the decoder performs up-sampling according to the third feature matrix to obtain an image segmentation result; and the output module 230 performs segmentation feedback of the target image according to the output of the target neural network model. Compared with current U-Net segmentation combined with an attention mechanism, whose accuracy is limited, the attention module of the target neural network model processes the first feature matrix of the target image with hole convolution; hole convolution enlarges the receptive field, so the region of the target image that interests the user can be determined more accurately, and segmenting according to that region improves the accuracy of the image segmentation.
The image processing device provided by the embodiment of the invention can execute the image processing method provided by any embodiment of the invention, and has corresponding functional modules and beneficial effects of the execution method.
Example four
Fig. 3 is a schematic structural diagram of a computer apparatus according to a fourth embodiment of the present invention, as shown in fig. 3, the computer apparatus includes a processor 30, a memory 31, an input device 32, and an output device 33; the number of processors 30 in the computer device may be one or more, and one processor 30 is taken as an example in fig. 3; the processor 30, the memory 31, the input device 32 and the output device 33 in the computer apparatus may be connected by a bus or other means, and the connection by the bus is exemplified in fig. 3.
The memory 31 is used as a computer-readable storage medium for storing software programs, computer-executable programs, and modules, such as program instructions/modules corresponding to the image processing method in the embodiment of the present invention (for example, the target image tensor acquisition module 210, the segmentation module 220, and the output module 230 in the image processing apparatus). The processor 30 executes various functional applications of the computer device and data processing by executing software programs, instructions and modules stored in the memory 31, that is, implements the image processing method described above.
The memory 31 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal, and the like. Further, the memory 31 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some examples, the memory 31 may further include memory remotely located from the processor 30, which may be connected to a computer device over a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 32 may be used to receive input numeric or character information and to generate key signal inputs relating to user settings and function controls of the computer apparatus. The output device 33 may include a display device such as a display screen.
Example five
Embodiments of the present invention also provide a storage medium containing computer-executable instructions, which when executed by a computer processor, perform an image processing method, the method comprising:
acquiring target tensor data of a target image;
inputting the target tensor data into a target neural network model, wherein the target neural network model comprises an encoder, a decoder and an attention module;
the encoder is used for performing down-sampling according to the target tensor data to obtain a first feature matrix; the attention module is used for performing hole convolution according to the first feature matrix to obtain a second feature matrix; determining an original attention map according to the second feature matrix; determining a hole attention map according to the original attention map; determining a third feature matrix according to the first feature matrix and the hole attention map; and outputting the third feature matrix to the decoder; the decoder is used for performing up-sampling according to the third feature matrix to obtain an image segmentation result;
and performing segmentation feedback of the target image according to the output of the target neural network model.
On the basis of the foregoing embodiment, the attention module is configured to perform hole convolution according to the first feature matrix to obtain a second feature matrix, which includes:
respectively performing hole convolution on the first feature matrix with a plurality of hole convolutions to obtain a plurality of second feature matrices, wherein each hole convolution corresponds to one second feature matrix;
correspondingly, the original attention diagram is determined according to the second feature matrix, and the method comprises the following steps:
performing mutual matrix multiplication according to the plurality of second feature matrices to obtain a matrix multiplication result; determining an original attention diagram according to a matrix multiplication result;
correspondingly, the method for determining the hole attention diagram according to the original attention diagram comprises the following steps:
performing a transitive closure operation on the original attention map to obtain a hole attention map;
correspondingly, the third feature matrix is determined according to the first feature matrix and the hole attention map, and the third feature matrix comprises the following steps:
and multiplying the first feature matrix by the hole attention map to obtain a third feature matrix.
In addition to the above embodiments, the acquiring of the target tensor data of the target image includes:
preprocessing a target image according to a preset preprocessing mode to obtain a preprocessed image, wherein the preprocessing mode comprises image cropping and image normalization;
target tensor data is determined from the preprocessed image.
In the above embodiment, the target image is a medical image, and the medical image includes: a computed tomography image, a nuclear magnetic resonance image, or an image captured by a high-definition camera.
In addition to the above embodiments, the method for performing segmentation feedback of a target image according to an output of a target neural network model includes:
and outputting the target image, and determining a highlight area of the target image according to the image segmentation result.
In addition to the above embodiments, the acquiring of the target tensor data of the target image includes:
desensitizing the target image to obtain a desensitized image;
target tensor data for the target image is determined from the desensitized image.
In the above embodiment, before the tensor data is input to the target neural network model, the method further includes:
acquiring training tensor data of a training image of a training set;
and training the target neural network model according to the training tensor data so as to optimize the network parameters of the target neural network model.
Of course, the storage medium containing the computer-executable instructions provided by the embodiments of the present invention is not limited to the method operations described above, and may also perform related operations in the image processing method provided by any embodiment of the present invention.
From the above description of the embodiments, it will be clear to those skilled in the art that the present invention can be implemented by software plus the necessary general-purpose hardware, or of course by hardware alone, although the former is in many cases the better embodiment. Based on this understanding, the technical solutions of the present invention may be embodied in the form of a software product stored in a computer-readable storage medium, such as a floppy disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a flash memory (FLASH), a hard disk, or an optical disk, and including several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute the methods of the embodiments of the present invention.
It should be noted that, in the embodiment of the above apparatus, the included units and modules are divided only according to functional logic, but the division is not limited thereto as long as the corresponding functions can be implemented; in addition, the specific names of the functional units are only for convenience of distinguishing them from each other and are not intended to limit the protection scope of the present invention.
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions without departing from the scope of the invention. Therefore, although the present invention has been described in more detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.
Claims (10)
1. An image processing method, comprising:
acquiring target tensor data of a target image;
inputting the target tensor data to a target neural network model, the target neural network model comprising an encoder, a decoder, and an attention module;
the encoder is used for performing down-sampling according to the target tensor data to obtain a first feature matrix; the attention module is used for performing hole convolution according to the first feature matrix to obtain a second feature matrix; determining an original attention map according to the second feature matrix; determining a hole attention map from the original attention map; determining a third feature matrix according to the first feature matrix and the hole attention map; and outputting the third feature matrix to the decoder; the decoder is used for performing up-sampling according to the third feature matrix to obtain an image segmentation result;
and performing segmentation feedback of the target image according to the output of the target neural network model.
2. The method of claim 1, wherein the attention module is configured to perform a hole convolution according to the first feature matrix to obtain a second feature matrix, and the method comprises:
performing hole convolution on the first feature matrix according to a plurality of hole convolutions respectively to obtain a plurality of second feature matrices, wherein each hole convolution corresponds to one second feature matrix;
correspondingly, the determining an original attention diagram according to the second feature matrix includes:
performing mutual matrix multiplication according to the plurality of second feature matrices to obtain a matrix multiplication result; determining an original attention diagram according to the matrix multiplication result;
correspondingly, the determining a hole attention map according to the original attention map includes:
performing a transitive closure operation on the original attention map to obtain a hole attention map;
correspondingly, the determining a third feature matrix according to the first feature matrix and the hole attention map includes:
and multiplying the first feature matrix and the hole attention diagram to obtain a third feature matrix.
3. The method of claim 1, wherein obtaining target tensor data for a target image comprises:
preprocessing the target image according to a preset preprocessing mode to obtain a preprocessed image, wherein the preprocessing mode comprises image cropping and image normalization;
and determining target tensor data according to the preprocessed image.
4. The method of claim 1, wherein the target image is a medical image, the medical image comprising: a computed tomography image, a nuclear magnetic resonance image, or an image captured by a high-definition camera.
5. The method of claim 1, wherein the feedback of the segmentation of the target image from the output of the target neural network model comprises:
and outputting the target image, and determining a highlight area of the target image according to the image segmentation result.
6. The method of claim 1, wherein obtaining target tensor data for a target image comprises:
desensitizing the target image to obtain a desensitized image;
determining target tensor data for the target image from the desensitized image.
7. The method of claim 1, further comprising, prior to inputting the tensor data to a target neural network model:
acquiring training tensor data of a training image of a training set;
and training the target neural network model according to the training tensor data so as to optimize network parameters of the target neural network model.
8. An image processing apparatus characterized by comprising:
the target image tensor acquisition module is used for acquiring target tensor data of a target image;
a segmentation module for inputting the target tensor data to a target neural network model, the target neural network model including an encoder, a decoder, and an attention module;
the encoder is used for performing down-sampling according to the target tensor data to obtain a first feature matrix; the attention module is used for performing hole convolution according to the first feature matrix to obtain a second feature matrix; determining an original attention map according to the second feature matrix; determining a hole attention map from the original attention map; determining a third feature matrix according to the first feature matrix and the hole attention map; and outputting the third feature matrix to the decoder; the decoder is used for performing up-sampling according to the third feature matrix to obtain an image segmentation result;
and the output module is used for performing segmentation feedback on the target image according to the output of the target neural network model.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the image processing method according to any of claims 1-7 when executing the program.
10. A storage medium containing computer-executable instructions for performing the image processing method of any one of claims 1-7 when executed by a computer processor.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111042489.5A CN113793345B (en) | 2021-09-07 | 2021-09-07 | Medical image segmentation method and device based on improved attention module |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113793345A true CN113793345A (en) | 2021-12-14 |
CN113793345B CN113793345B (en) | 2023-10-31 |
Family
ID=78879628
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111042489.5A Active CN113793345B (en) | 2021-09-07 | 2021-09-07 | Medical image segmentation method and device based on improved attention module |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113793345B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111681273A (en) * | 2020-06-10 | 2020-09-18 | 创新奇智(青岛)科技有限公司 | Image segmentation method and device, electronic equipment and readable storage medium |
CN114387436A (en) * | 2021-12-28 | 2022-04-22 | 北京安德医智科技有限公司 | Wall coronary artery detection method and device, electronic device and storage medium |
CN114565628A (en) * | 2022-03-23 | 2022-05-31 | 中南大学 | Image segmentation method and system based on boundary perception attention |
CN115731243A (en) * | 2022-11-29 | 2023-03-03 | 北京长木谷医疗科技有限公司 | Spine image segmentation method and device based on artificial intelligence and attention mechanism |
Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080162036A1 (en) * | 1997-10-22 | 2008-07-03 | Intelligent Technologies International, Inc. | Vehicular Map Database Management Techniques |
WO2018036293A1 (en) * | 2016-08-26 | 2018-03-01 | 杭州海康威视数字技术股份有限公司 | Image segmentation method, apparatus, and fully convolutional network system |
2021-09-07: Application filed in China, CN202111042489.5A, granted as patent CN113793345B (Active)
Patent Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080162036A1 (en) * | 1997-10-22 | 2008-07-03 | Intelligent Technologies International, Inc. | Vehicular Map Database Management Techniques |
US20200184278A1 (en) * | 2014-03-18 | 2020-06-11 | Z Advanced Computing, Inc. | System and Method for Extremely Efficient Image and Pattern Recognition and Artificial Intelligence Platform |
WO2018036293A1 (en) * | 2016-08-26 | 2018-03-01 | 杭州海康威视数字技术股份有限公司 | Image segmentation method, apparatus, and fully convolutional network system |
WO2019153908A1 (en) * | 2018-02-11 | 2019-08-15 | 北京达佳互联信息技术有限公司 | Image recognition method and system based on attention model |
US20190355102A1 (en) * | 2018-05-15 | 2019-11-21 | Adobe Inc. | Digital Image Completion by Learning Generation and Patch Matching Jointly |
US20190362199A1 (en) * | 2018-05-25 | 2019-11-28 | Adobe Inc. | Joint blur map estimation and blur desirability classification from an image |
CN109191472A (en) * | 2018-08-28 | 2019-01-11 | 杭州电子科技大学 | Thymocyte image segmentation method based on an improved U-Net network |
CN109685813A (en) * | 2018-12-27 | 2019-04-26 | 江西理工大学 | U-shaped retinal blood vessel segmentation method with adaptive scale information |
WO2020177651A1 (en) * | 2019-03-01 | 2020-09-10 | 华为技术有限公司 | Image segmentation method and image processing device |
CN110059586A (en) * | 2019-03-29 | 2019-07-26 | 电子科技大学 | Iris localization and segmentation system based on a dilated residual attention structure |
WO2020215985A1 (en) * | 2019-04-22 | 2020-10-29 | 腾讯科技(深圳)有限公司 | Medical image segmentation method and device, electronic device and storage medium |
WO2020244108A1 (en) * | 2019-06-05 | 2020-12-10 | Boe Technology Group Co., Ltd. | Methods and apparatuses for semantically segmenting input image, and computer-program product |
WO2021031066A1 (en) * | 2019-08-19 | 2021-02-25 | 中国科学院深圳先进技术研究院 | Cartilage image segmentation method and apparatus, readable storage medium, and terminal device |
US20210089807A1 (en) * | 2019-09-25 | 2021-03-25 | Samsung Electronics Co., Ltd. | System and method for boundary aware semantic segmentation |
CN111612807A (en) * | 2020-05-15 | 2020-09-01 | 北京工业大学 | Small target image segmentation method based on scale and edge information |
CN112017191A (en) * | 2020-08-12 | 2020-12-01 | 西北大学 | Method for building and applying a liver pathology image segmentation model based on an attention mechanism |
CN112101363A (en) * | 2020-09-02 | 2020-12-18 | 河海大学 | Fully convolutional semantic segmentation system and method based on dilated residuals and an attention mechanism |
CN111898709A (en) * | 2020-09-30 | 2020-11-06 | 中国人民解放军国防科技大学 | Image classification method and device |
CN112017198A (en) * | 2020-10-16 | 2020-12-01 | 湖南师范大学 | Right ventricle segmentation method and device based on self-attention mechanism multi-scale features |
CN112541918A (en) * | 2020-12-23 | 2021-03-23 | 山东师范大学 | Three-dimensional medical image segmentation method based on self-attention mechanism neural network |
CN112750142A (en) * | 2021-01-26 | 2021-05-04 | 上海工程技术大学 | Ultrasonic image segmentation system and method based on side window attention mechanism |
GB202104506D0 (en) * | 2021-03-30 | 2021-05-12 | Ucl Business Plc | Medical Image Analysis Using Neural Networks |
Non-Patent Citations (3)
Title |
---|
P. Guo, X. Su, H. Zhang, M. Wang and F. Bao: "A Multi-Scaled Receptive Field Learning Approach for Medical Image Segmentation", ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) * |
X. Mao et al.: "Deep Learning with Skip Connection Attention for Choroid Layer Segmentation in OCT Images", 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) * |
Zhao Yin: "Research on Breast Tumor Segmentation Methods Based on Multi-Scale Dense Networks and Two-Level Residual Attention", CNKI Outstanding Master's Theses Full-text Database * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111681273A (en) * | 2020-06-10 | 2020-09-18 | 创新奇智(青岛)科技有限公司 | Image segmentation method and device, electronic equipment and readable storage medium |
CN111681273B (en) * | 2020-06-10 | 2023-02-03 | 创新奇智(青岛)科技有限公司 | Image segmentation method and device, electronic equipment and readable storage medium |
CN114387436A (en) * | 2021-12-28 | 2022-04-22 | 北京安德医智科技有限公司 | Mural coronary artery detection method and device, electronic device and storage medium |
CN114565628A (en) * | 2022-03-23 | 2022-05-31 | 中南大学 | Image segmentation method and system based on boundary perception attention |
CN114565628B (en) * | 2022-03-23 | 2022-09-13 | 中南大学 | Image segmentation method and system based on boundary perception attention |
CN115731243A (en) * | 2022-11-29 | 2023-03-03 | 北京长木谷医疗科技有限公司 | Spine image segmentation method and device based on artificial intelligence and attention mechanism |
CN115731243B (en) * | 2022-11-29 | 2024-02-09 | 北京长木谷医疗科技股份有限公司 | Spine image segmentation method and device based on artificial intelligence and attention mechanism |
Also Published As
Publication number | Publication date |
---|---|
CN113793345B (en) | 2023-10-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113793345B (en) | Medical image segmentation method and device based on improved attention module | |
KR101857624B1 (en) | Medical diagnosis method applied clinical information and apparatus using the same | |
CN110853111B (en) | Medical image processing system, model training method and training device | |
RU2667879C1 (en) | Processing and analysis of data on computer-assisted tomography images | |
CN110692107B (en) | Machine learning of raw medical imaging data for clinical decision support | |
EP4089640A1 (en) | Image reconstruction method, apparatus, device, system, and computer readable storage medium | |
Yao et al. | Pneumonia Detection Using an Improved Algorithm Based on Faster R‐CNN | |
Menaouer et al. | Diabetic retinopathy classification using hybrid deep learning approach | |
Heydarheydari et al. | Auto-segmentation of head and neck tumors in positron emission tomography images using non-local means and morphological frameworks | |
Leite et al. | Etiology-based classification of brain white matter hyperintensity on magnetic resonance imaging | |
CN116862877A (en) | Scanning image analysis system and method based on convolutional neural network | |
CN112508884A (en) | Comprehensive detection device and method for cancerous region | |
CN113744271A (en) | Neural network-based automatic optic nerve segmentation and compression degree measurement and calculation method | |
CN112633404A (en) | DenseNet-based CT image classification method and device for COVID-19 patient | |
CN113240655A (en) | Method, storage medium and device for automatically detecting type of fundus image | |
CN114066798B (en) | Brain tumor nuclear magnetic resonance image data synthesis method based on deep learning | |
CN113421274B (en) | Intelligent stomach cancer staging method based on CT (computed tomography) images | |
CN112766333B (en) | Medical image processing model training method, medical image processing method and device | |
WO2022226949A1 (en) | Artificial neural network-based identification method and system for tissue lesion identification | |
Nandakumar et al. | Detection of Diabetic Retinopathy from Retinal Images Using DenseNet Models. | |
CN114359194A (en) | Multi-mode stroke infarct area image processing method based on improved U-Net network | |
WO2021021430A1 (en) | System and method for region detection in tissue sections using image registration | |
Moustari et al. | Two-stage deep learning classification for diabetic retinopathy using gradient weighted class activation mapping | |
Kumar | Intracranial Hemorrhage Detection Using Deep Learning and Transfer Learning | |
US20240296553A1 (en) | Automatic Image Quantification from Physician-Generated Reports |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||