CN111340820B - Image segmentation method and device, electronic equipment and storage medium - Google Patents

Info

Publication number: CN111340820B (granted; earlier publication CN111340820A)
Application number: CN202010085655.9A
Authority: CN (China)
Original language: Chinese (zh)
Legal status: Active
Inventors: 张勇东 (Zhang Yongdong), 徐海 (Xu Hai), 谢洪涛 (Xie Hongtao), 毛震东 (Mao Zhendong)
Assignees: Beijing Zhongke Research Institute; University of Science and Technology of China (USTC)

Classifications

    • G06T7/11 — Region-based segmentation (under G06T7/10 Segmentation; Edge detection; G06T7/00 Image analysis)
    • G06T7/13 — Edge detection
    • G06F18/214 — Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06T5/70

Abstract

An image segmentation method, applied in the technical field of medical image processing, comprises the following steps: encoding an image to be segmented step by step to obtain the highest-level feature image of the image to be segmented; inputting the highest-level feature image into a compact feature learning model to obtain a compact feature image; decoding the compact feature image step by step in combination with the highest-level feature image to restore its spatial information, obtaining a target image to be segmented; predicting the to-be-segmented class of each pixel in the target image to obtain a segmentation prediction map of the target image; and extracting the prediction boundary of the target image according to the segmentation prediction map. The application also discloses an image segmentation device, an electronic device and a storage medium, which can improve image segmentation accuracy and reduce the influence of noise in image segmentation.

Description

Image segmentation method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of medical image processing technologies, and in particular, to an image segmentation method and apparatus, an electronic device, and a storage medium.
Background
Limited by medical imaging methods and equipment, medical images often have blurred edges, which makes doctors' labeling work difficult; in particular, the labeling of boundaries depends heavily on the experience and judgment standards of individual doctors. Labeling variance exists both between repeated labelings by the same doctor and between labelings by different doctors, yet the inherent noise caused by this variance is often ignored when training automatic segmentation models.
In existing research, boundary optimization usually assigns higher weights to boundary pixels during training. This enhances the model's sensitivity to the boundary to a certain extent, but it also amplifies the influence of noise during training.
Disclosure of Invention
The present application provides an image segmentation method, an image segmentation apparatus, an electronic device, and a storage medium, which can improve image segmentation accuracy and reduce noise influence in image segmentation.
In order to achieve the above object, a first aspect of embodiments of the present application provides an image segmentation method, including:
inputting an image to be segmented into a coding module of a semantic segmentation network for step-by-step coding to obtain multilayer feature images, wherein the spatial information of the multilayer feature images decreases level by level and the semantic information increases level by level, and acquiring the feature image with the least spatial information and the most semantic information among the multilayer feature images as the highest-level feature image;
sending the highest-level feature image to a compact feature learning module, calculating in that module the feature centers of the classes to be segmented from a rough segmentation prediction result, and then performing feature reconstruction on the highest-level feature image according to the similarity between each pixel in the highest-level feature image and each feature center, to obtain a compact feature image;
combining the obtained compact feature image with the highest-level feature image to obtain a first-level compact feature image; acquiring the feature image from the encoding process that has the same spatial information as the first-level compact feature image; performing feature fusion on the first-level compact feature image and that feature image to obtain a second-level compact feature image; continuing in this way up to an Nth-level compact feature image, whose spatial information is the same as that of the image to be segmented; and taking the Nth-level compact feature image as the target image to be segmented;
predicting the class to be segmented of each pixel in the target image to be segmented to obtain a segmentation prediction image of the target image to be segmented;
extracting a prediction boundary of the target image to be segmented according to the segmentation prediction map.
optionally, the inputting the highest-level feature image into a compact feature learning model to obtain a compact feature image includes:
predicting the feature center of each category to be segmented according to the highest-level feature image;
according to a self-adaptive attention mechanism, calculating the similarity between the feature of each pixel in the highest-level feature image and each feature center;
and reconstructing the highest-level feature image according to the similarity between the feature of each pixel in the highest-level feature image and each feature center to obtain the compact feature image.
Optionally, the extracting, according to the segmentation prediction map, the prediction boundary of the target image to be segmented includes:
extracting boundary pixels from the segmentation prediction map of the target image to be segmented, wherein the pixels within the preset neighborhood of a boundary pixel belong to at least two different classes to be segmented;
respectively acquiring the number of boundary pixels in the preset neighborhood of each pixel;
taking the number of boundary pixels in each pixel preset neighborhood as the weight of the corresponding pixel;
calculating the loss of a preset neighborhood of each pixel according to the weight of each pixel;
and obtaining the predicted boundary of the target image to be segmented according to the loss of the preset neighborhood of each pixel.
Optionally, the step-by-step decoding the compact feature image in combination with the highest-level feature image to restore the spatial information of the compact feature image, and obtaining the target image to be segmented includes:
performing feature fusion on the compact feature image and the highest-level feature image to obtain a first-level compact feature image;
acquiring a characteristic image in the encoding process which is the same as the first-stage compact characteristic image space information;
performing feature fusion on the first-level compact feature image and the feature image to obtain a second-level compact feature image, and accordingly obtaining an Nth-level compact feature image, wherein the spatial information of the Nth-level compact feature image is the same as that of the image to be segmented;
and obtaining a target image to be segmented by the Nth-level compact characteristic image through a prediction layer.
Optionally, the spatial information of the first to N-level compact feature images is sequentially increased, and the semantic information is sequentially increased.
Optionally, when the number of boundary pixels in the preset neighborhood of the pixel is 0, the weight of the pixel is assigned to be 1.
Optionally, the step-by-step encoding the image to be segmented to obtain the highest-level feature image of the image to be segmented includes:
the method comprises the steps of coding an image to be segmented step by step to obtain a multilayer characteristic image, wherein spatial information of the multilayer characteristic image is sequentially reduced, and semantic information is sequentially increased;
and acquiring the characteristic image with the minimum spatial information and the maximum semantic information in the multilayer characteristic images as the highest-level characteristic image of the image to be segmented.
A second aspect of the embodiments of the present application provides an image segmentation apparatus, including:
the encoding module is used for inputting an image to be segmented into a coding module of a semantic segmentation network for step-by-step coding, wherein the spatial information of the multilayer feature images decreases level by level and the semantic information increases level by level, and for acquiring the feature image with the least spatial information and the most semantic information among the multilayer feature images as the highest-level feature image;
the input module is used for sending the highest-level feature image to the compact feature learning module, calculating in that module the feature centers of the classes to be segmented from a rough segmentation prediction result, and then performing feature reconstruction on the highest-level feature image according to the similarity between each pixel in the highest-level feature image and each feature center, to obtain a compact feature image;
the decoding module is used for combining the obtained compact feature image with the highest-level feature image to obtain a first-level compact feature image; acquiring the feature image from the encoding process whose spatial information is the same as that of the first-level compact feature image; performing feature fusion on the first-level compact feature image and that feature image to obtain a second-level compact feature image; continuing in this way up to an Nth-level compact feature image, whose spatial information is the same as that of the image to be segmented; and taking the Nth-level compact feature image as the target image to be segmented;
and the prediction module is used for predicting the class to be segmented of each pixel in the target image to be segmented to obtain a segmentation prediction image of the target image to be segmented.
A third aspect of embodiments of the present application provides an electronic device, including:
a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the program, implements the image segmentation method provided by the first aspect of the embodiments of the present application.
A fourth aspect of the embodiments of the present application provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the image segmentation method provided in the first aspect of the embodiments of the present application.
As can be seen from the foregoing embodiments, the image segmentation method, apparatus, electronic device and storage medium provided in the present application encode an image to be segmented step by step to obtain its highest-level feature image; input the highest-level feature image into a compact feature learning model to obtain a compact feature image; decode the compact feature image step by step in combination with the highest-level feature image to restore its spatial information and obtain a target image to be segmented; predict the to-be-segmented class of each pixel in the target image to obtain a segmentation prediction map; and extract the prediction boundary of the target image according to the segmentation prediction map, thereby improving image segmentation accuracy and reducing the influence of noise in image segmentation.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present application, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a schematic flowchart of an image segmentation method according to an embodiment of the present application;
fig. 2 is a schematic diagram of a framework of an image segmentation method according to an embodiment of the present application;
FIG. 3 is a schematic diagram of a compact feature learning module provided by an embodiment of the present application;
fig. 4 is a schematic structural diagram of an image segmentation apparatus according to an embodiment of the present application;
fig. 5 shows a hardware structure diagram of an electronic device.
Detailed Description
In order to make the purpose, features and advantages of the present application more obvious and understandable, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Referring to fig. 1 and fig. 2, fig. 1 is a schematic flowchart of an image segmentation method according to an embodiment of the present application, and fig. 2 is a schematic diagram of the framework of the method. The method is applicable to an electronic device, including mobile electronic devices capable of data processing, such as mobile phones, tablet computers, portable computers, smart watches and smart glasses, as well as stationary electronic devices such as desktop computers, all-in-one machines and smart televisions. The method mainly comprises the following steps:
s101, coding an image to be segmented step by step to obtain a highest-level feature image of the image to be segmented;
s102, inputting the highest-level feature image to a compact feature learning model to obtain a compact feature image;
s103, combining the compact characteristic image with the highest-level characteristic image to decode step by step so as to restore the spatial information of the compact characteristic image and obtain a target image to be segmented;
s104, predicting the to-be-segmented category of each pixel in the to-be-segmented target image to obtain a segmentation prediction image of the to-be-segmented target image;
and S105, extracting the prediction boundary of the target image to be segmented according to the segmentation prediction graph.
In one embodiment of the present application, the step S101 includes:
the method comprises the steps of coding an image to be segmented step by step to obtain a multilayer characteristic image, wherein spatial information of the multilayer characteristic image is sequentially reduced, and semantic information is sequentially increased; and acquiring a characteristic image with minimum spatial information and maximum semantic information in the multilayer characteristic images as the highest-level characteristic image of the image to be segmented.
The spatial information of the image can be understood as the resolution of the image, and the semantic information can be understood as the feature information of the image.
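As a minimal sketch of the step-by-step encoding of S101 — using plain NumPy average pooling as a stand-in for the learned convolutions of a real semantic segmentation encoder (the function names and the pooling choice are illustrative assumptions, not the patent's implementation):

```python
import numpy as np

def avg_pool2x(x):
    """Halve the spatial resolution of a (C, H, W) feature map (H, W even)."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).mean(axis=(2, 4))

def encode(img, levels=4):
    """Step-by-step encoding: each level carries less spatial information.

    A real encoder would also apply learned convolutions that raise the
    channel count (semantic information); pooling alone illustrates only
    the spatial-information side.
    """
    feats = [img]
    for _ in range(levels):
        feats.append(avg_pool2x(feats[-1]))
    # feats[-1] plays the role of the highest-level feature image
    return feats
```

The intermediate maps in `feats` are the multilayer feature images later reused as skip connections during decoding.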
In one embodiment of the present application, step S102 includes:
predicting the feature center of each category to be segmented according to the highest-level feature image; calculating the similarity between the feature of each pixel in the highest-level feature image and each feature center according to a self-adaptive attention mechanism; and reconstructing the highest-level feature image according to the similarity between the feature of each pixel in the highest-level feature image and each feature center to obtain the compact feature image. The obtained compact feature image enables pixels of the same category to be more compactly expressed in feature space, and pixels of different categories are more obviously different in feature space.
Specifically, one characteristic of medical images is that there are few segmentation classes — often fewer than five — while the feature dimension learned by the network is very high, at least hundreds and often thousands of dimensions. Such a feature expression is highly redundant and very easy to over-fit to the inherent noise. To make the segmentation boundary in the resulting prediction map more robust to noise, see FIG. 3: for high-dimensional features $X \in R^{C \times H \times W}$ (where $C$ is the feature dimension and $H$ and $W$ are the height and width of the feature map), a prediction is first made using the idea of relay supervision to obtain a rough segmentation probability map $P_{inter} \in R^{K \times H \times W}$ ($K$ is the number of classes to be segmented), and the feature centers of the classes are then calculated from the rough segmentation probability map:

$$f_c = \frac{\sum_i p_i^c \, x_i}{\sum_i p_i^c}$$

where $x_i$ is the feature at position $i$, $p_i^c$ is the probability in the rough segmentation probability map that position $i$ belongs to class $c$, and $f_c$ is the feature center of class $c$.
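The feature-center calculation above can be sketched in a few lines of NumPy (a hedged illustration; the function name and the small stabilizer added to the denominator are assumptions, not part of the patent):

```python
import numpy as np

def class_feature_centers(X, P):
    """Probability-weighted class feature centers, f_c = sum_i p_i^c x_i / sum_i p_i^c.

    X: features, shape (C, H, W)        (C = feature dimension)
    P: coarse probabilities, (K, H, W)  (K = number of classes to be segmented)
    Returns: centers of shape (K, C).
    """
    C = X.shape[0]
    K = P.shape[0]
    Xf = X.reshape(C, -1)                 # (C, HW) per-pixel features
    Pf = P.reshape(K, -1)                 # (K, HW) per-pixel class probabilities
    centers = Pf @ Xf.T                   # (K, C) weighted feature sums
    centers /= Pf.sum(axis=1, keepdims=True) + 1e-8  # normalize by class mass
    return centers
```

With a one-hot probability map, each center reduces to the mean feature of the pixels predicted as that class.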
After the feature centers of all classes to be segmented are obtained, the features of the highest-level feature image are reconstructed according to the distance (similarity) from each feature to each class center, and the compact feature learning model learns the feature-reconstruction mapping by itself through an adaptive attention mechanism. The attention weight is:

$$x'_{ic} = \frac{\exp\bigl(\theta(x_i)^{\top}\phi(f_c)\bigr)}{\sum_{c'}\exp\bigl(\theta(x_i)^{\top}\phi(f_{c'})\bigr)}$$

This expresses the adaptively learned similarity between feature $x_i$ and the feature center $f_c$ of class $c$, where $\theta$ and $\phi$ are convolution operations with kernel size 1 × 1. The compact feature image obtained after reconstruction is:

$$x_i^{compact} = \sum_{c} x'_{ic}\,\gamma(f_c)$$

where $\gamma$ is also a 1 × 1 convolution. In the new feature expression $x^{compact}$, after iterative learning the feature of each pixel moves closer to the center of its own class and farther from the centers of the other classes. Features within the same class are thus expressed more compactly and the differences between classes become larger, which further enhances the robustness of the model to noise.
The classes to be segmented refer to the types of content in the picture to be segmented. For example, if the picture to be segmented is a chest image containing background, heart and lungs, the classes to be segmented are background, heart and lung; if the picture to be segmented is an eye image containing background and eyeball, the classes to be segmented are background and eyeball.
In one embodiment of the present application, the step S103 includes:
performing feature fusion on the compact feature image and the highest-level feature image to obtain a first-level compact feature image; acquiring a characteristic image in the encoding process which is the same as the first-stage compact characteristic image space information; performing feature fusion on the first-level compact feature image and the feature image to obtain a second-level compact feature image, and accordingly obtaining an Nth-level compact feature image, wherein the spatial information of the Nth-level compact feature image is the same as that of the image to be segmented; and taking the Nth-level compact characteristic image as a target image to be segmented.
In one embodiment of the application, the spatial information of the first-level to N-level compact feature images is sequentially increased, and the semantic information is sequentially increased.
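The step-by-step decoding above can be sketched as follows — nearest-neighbour upsampling stands in for the learned decoding, and channel concatenation stands in for feature fusion (a real network would follow each concatenation with a convolution; these simplifications are assumptions, not the patent's implementation):

```python
import numpy as np

def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a (C, H, W) feature map."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def decode(compact, top, encoder_feats):
    """Step-by-step decoding sketch.

    compact: compact feature image, same resolution as `top`.
    top: highest-level feature image from the encoder.
    encoder_feats: encoder feature maps ordered coarse-to-fine, each at
        double the resolution of the previous fusion result.
    """
    # first-level compact feature image: combine compact and top-level features
    x = np.concatenate([compact, top], axis=0)
    for feat in encoder_feats:
        x = upsample2x(x)                      # restore spatial information
        x = np.concatenate([x, feat], axis=0)  # feature fusion (conv omitted)
    return x
```

The loop pairs each upsampled result with the encoder feature map of matching spatial information, mirroring the first-level to Nth-level compact feature images of the method.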
In one embodiment of the present application, step S105 includes:
and extracting boundary pixels from the segmentation prediction graph of the target image to be segmented, wherein the types of the classes to be segmented of all the pixels in the preset neighborhood of the boundary pixels comprise at least two types. Respectively acquiring the number of boundary pixels in each pixel preset neighborhood; taking the number of boundary pixels in each pixel preset neighborhood as the weight of the corresponding pixel; calculating the loss of a preset neighborhood of each pixel according to the weight of each pixel; and obtaining the predicted boundary of the target image to be segmented according to the loss of the preset neighborhood of each pixel.
Specifically, because of the labeling noise caused by boundary blurring, the model training process is constantly affected by noise. This is especially obvious under the traditional cross-entropy objective, because the cross-entropy function is convex and its gradient is unbounded: the contribution of potentially noisy pixels to the loss is amplified, disturbing the correct update direction. Based on the prior knowledge that noise often lies at the boundary, the method gives higher weight during optimization to the pixels that carry the most information for the current model, adopting the idea of curriculum learning: samples that are too simple or too difficult carry little information for improving the current model, while samples near the decision boundary matter most in the current iteration. On this basis, the method proposes the concepts of "pseudo boundary" and "boundary pixel" to extract the region of interest at each update step. The specific operations are as follows:
firstly, in the t-th iteration process, obtaining a segmentation prediction graph P according to network output;
then, it is determined which pixels belong to the boundary pixels from the generated prediction map. The judgment rule is that if the prediction categories are inconsistent (i.e. more than two prediction categories are contained) in the K-neighborhood of the pixel, the pixel is a "boundary pixel". The formulation is expressed as follows:
Figure GDA0003380921570000091
where i denotes the ith pixel, Nb (i) denotes the pixel in the k neighborhood of the i pixel, piRepresenting a prediction class of an ith pixel
Figure GDA0003380921570000092
Is a binary determinant when p isiAnd pjThe same is 1, and the opposite is 0.
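The boundary-pixel rule can be sketched directly in NumPy (an illustration of the rule as described; the clipped window handling at image borders is an assumption):

```python
import numpy as np

def boundary_pixels(pred, k=1):
    """Mark boundary pixels: pixels whose (2k+1)x(2k+1) neighbourhood
    contains more than one predicted class.

    pred: (H, W) integer class map.  Returns F: (H, W) 0/1 indicator.
    """
    H, W = pred.shape
    F = np.zeros((H, W), dtype=int)
    for i in range(H):
        for j in range(W):
            nb = pred[max(0, i - k):i + k + 1, max(0, j - k):j + k + 1]
            # boundary pixel iff any neighbour disagrees with the centre pixel
            F[i, j] = int((nb != pred[i, j]).any())
    return F
```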
Then, the number of boundary pixels in the K-neighborhood of pixel $i$ is counted and used as the weight of pixel $i$; to avoid loss of information, pixels with no boundary pixel in their K-neighborhood are given weight 1:

$$W_t(i) = \max\Bigl(1,\ \sum_{j \in Nb(i)} F_t(j)\Bigr)$$

where $W_t(i)$ is the weight of pixel $i$ in iteration $t$, $Nb(i)$ denotes the pixels in the k-neighborhood of pixel $i$, and $F_t(j)$ indicates whether pixel $j$ within the k-neighborhood of pixel $i$ is a boundary pixel.
Then, the loss in iteration $t$ is calculated; this is called the neighborhood loss:

$$L_t = -\frac{1}{N}\sum_{i=1}^{N} W_t(i)\,\log \hat{p}_i^{\,y_i}$$

where $N$ is the number of pixels in the image, $W_t(i)$ is the weight of pixel $i$, $y_i$ is the label class of pixel $i$, and $\hat{p}_i^{\,y_i}$ is the predicted probability that pixel $i$ belongs to its label class $y_i$.
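The weighting and loss steps together can be sketched as a weighted cross-entropy in NumPy (a hedged illustration; the function names, the flooring at 1, and the log stabilizer are assumptions based on the description above):

```python
import numpy as np

def neighbourhood_weights(F, k=1):
    """W_t(i) = number of boundary pixels in the k-neighbourhood of i, floored at 1.

    F: (H, W) 0/1 boundary-pixel indicator from the previous step.
    """
    H, W = F.shape
    Wt = np.ones((H, W))
    for i in range(H):
        for j in range(W):
            cnt = F[max(0, i - k):i + k + 1, max(0, j - k):j + k + 1].sum()
            Wt[i, j] = max(1, cnt)  # weight 1 when no boundary pixel nearby
    return Wt

def neighbourhood_loss(prob, labels, Wt):
    """Weighted cross-entropy over all pixels.

    prob: (K, H, W) softmax output; labels: (H, W) int labels; Wt: (H, W) weights.
    """
    H, W = labels.shape
    # probability assigned to the labelled class of each pixel
    py = np.take_along_axis(prob, labels[None], axis=0)[0]
    return -(Wt * np.log(py + 1e-8)).sum() / (H * W)
```

Pixels near predicted class boundaries thus dominate each gradient step, which is the curriculum-learning effect described above.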
With this optimization strategy, model updating conforms to a curriculum-learning process: each update step focuses more on the boundary that the current model is already capable of judging, which reduces the influence of boundary noise early in training and gradually fits the actual boundary, thereby accelerating convergence and enhancing the noise robustness of the model.
Referring to fig. 4, fig. 4 is a schematic structural diagram of an image segmentation apparatus according to an embodiment of the present application, which may be embedded in an electronic device, and the apparatus mainly includes:
the encoding module 201 is configured to encode an image to be segmented step by step to obtain a highest-level feature image of the image to be segmented;
an input module 202, configured to input the highest-level feature image to a compact feature learning model to obtain a compact feature image;
the decoding module 203 is configured to decode the compact feature image in combination with the highest-level feature image step by step to restore spatial information of the compact feature image, so as to obtain a target image to be segmented;
the prediction module 204 is configured to predict a to-be-segmented category of each pixel in the to-be-segmented target image, so as to obtain a segmentation prediction map of the to-be-segmented target image;
and the extracting module 205 is configured to extract the prediction boundary of the target image to be segmented according to the segmentation prediction map.
In one embodiment of the present application, the encoding module 201 includes:
the coding sub-module is used for coding the image to be segmented step by step to obtain a multilayer characteristic image, the spatial information of the multilayer characteristic image is sequentially reduced, and the semantic information is sequentially increased;
and the first acquisition submodule is used for acquiring the characteristic image with the minimum spatial information and the maximum semantic information in the multilayer characteristic image as the highest-level characteristic image of the image to be segmented.
In one embodiment of the present application, the input module 202 includes:
the prediction submodule is used for predicting the feature center of each category to be segmented according to the highest-level feature image;
the calculation submodule calculates the similarity between the feature of each pixel in the highest-level feature image and each feature center according to a self-adaptive attention mechanism;
and the reconstruction submodule is used for reconstructing the highest-level feature image according to the similarity between the feature of each pixel in the highest-level feature image and each feature center to obtain the compact feature image.
In one embodiment of the present application, the decoding module 203 comprises:
the fusion submodule is used for carrying out feature fusion on the compact feature image and the highest-level feature image to obtain a first-level compact feature image;
the second acquisition submodule is used for acquiring the characteristic image in the coding process, which is the same as the space information of the first-stage compact characteristic image;
the fusion sub-module is further configured to perform feature fusion on the first-level compact feature image and the feature image to obtain a second-level compact feature image, and accordingly obtain an nth-level compact feature image, where spatial information of the nth-level compact feature image is the same as spatial information of the image to be segmented;
and the determining submodule is used for taking the Nth-level compact characteristic image as a target image to be segmented.
In one embodiment of the application, the spatial information of the first-level to N-level compact feature images is sequentially increased, and the semantic information is sequentially increased.
In one embodiment of the present application, the extraction module 205 comprises:
and the extraction submodule is used for extracting boundary pixels from the segmentation prediction image of the target image to be segmented, and the types of the classes to be segmented of all the pixels in the preset neighborhood of the boundary pixels comprise at least two types.
The acquisition module is used for respectively acquiring the number of boundary pixels in each pixel preset neighborhood;
the weighting module is used for taking the number of boundary pixels in each pixel preset neighborhood as the weight of the corresponding pixel;
the calculation module is used for calculating the preset neighborhood loss of each pixel in the current iteration process according to the weight of each pixel;
and the determining module is used for obtaining the prediction boundary of the target image to be segmented according to the loss of the preset neighborhood of each pixel.
Referring to fig. 5, fig. 5 is a hardware structure diagram of an electronic device.
The electronic device described in this embodiment includes:
a memory 31, a processor 32 and a computer program stored on the memory 31 and executable on the processor, the processor implementing the image segmentation method described in the foregoing embodiments shown in fig. 1 to 2 when executing the program.
Further, the electronic device further includes:
at least one input device 33; at least one output device 34.
The memory 31, processor 32, input device 33 and output device 34 are connected by a bus 35.
The input device 33 may be a camera, a touch panel, a physical button, or a mouse. The output device 34 may specifically be a display screen.
The memory 31 may be a high-speed random access memory (RAM) or a non-volatile memory, such as a disk memory. The memory 31 is used for storing a set of executable program code, and the processor 32 is coupled to the memory 31.
Further, an embodiment of the present invention also provides a computer-readable storage medium, which may be provided in the terminal of the foregoing embodiments and may be the memory in the foregoing embodiment shown in fig. 5. The computer-readable storage medium stores a computer program which, when executed by a processor, implements the image segmentation method described in the foregoing embodiments shown in fig. 1 to 2. Further, the computer-readable storage medium may be any medium that can store program code, such as a USB flash disk, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.
It should be noted that each functional module in each embodiment of the present invention may be integrated into one processing module, or each module may exist alone physically, or two or more modules are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode.
The integrated module, if implemented in the form of a software functional module and sold or used as a standalone product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention, in essence or in the part contributing to the prior art, may be embodied wholly or partly in the form of a software product.
It should be noted that, for the sake of simplicity, the foregoing method embodiments are described as a series of combinations of acts, but those skilled in the art will understand that the present invention is not limited by the described order of acts, as some steps may be performed in other orders or simultaneously according to the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are preferred embodiments, and that the acts and modules involved are not necessarily required by the invention.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
The image segmentation method, apparatus, electronic device and storage medium provided by the present invention have been described in detail above. Those skilled in the art may make changes to the specific embodiments and the scope of application according to the concept of the present invention; in view of this, the content of this specification should not be construed as limiting the present invention.

Claims (7)

1. An image segmentation method, comprising:
inputting an image to be segmented into an encoding module of a semantic segmentation network for step-by-step encoding to obtain multilayer feature images, wherein the spatial information of the multilayer feature images decreases sequentially and the semantic information increases sequentially, and acquiring, as the highest-level feature image, the feature image with the minimum spatial information and the maximum semantic information among the multilayer feature images;
sending the highest-level feature image to a compact feature learning module, calculating, in the compact feature learning module, the feature center of each class to be segmented from a coarse segmentation prediction result, and then performing feature reconstruction on the highest-level feature image according to the similarity between each pixel of the highest-level feature image and each feature center to obtain a compact feature image;
combining the obtained compact feature image with the highest-level feature image to obtain a first-level compact feature image, acquiring the feature image from the encoding process whose spatial information is the same as that of the first-level compact feature image, performing feature fusion on the first-level compact feature image and that feature image to obtain a second-level compact feature image, and so on until an Nth-level compact feature image is obtained, wherein the spatial information of the Nth-level compact feature image is the same as that of the image to be segmented, and taking the Nth-level compact feature image as a target image to be segmented;
predicting the class to be segmented of each pixel in the target image to be segmented to obtain a segmentation prediction image of the target image to be segmented;
and extracting the prediction boundary of the target image to be segmented according to the segmentation prediction graph.
2. The image segmentation method according to claim 1, wherein extracting the prediction boundary of the target image to be segmented according to the segmentation prediction map comprises:
extracting boundary pixels from the segmentation prediction image of the target image to be segmented, a boundary pixel being a pixel whose preset neighborhood contains pixels of at least two different classes to be segmented;
acquiring, for each pixel, the number of boundary pixels in its preset neighborhood;
taking the number of boundary pixels in the preset neighborhood of each pixel as the weight of that pixel;
calculating the preset-neighborhood loss of each pixel according to the weight of each pixel;
and iteratively updating the model according to the preset-neighborhood loss of each pixel to finally obtain the prediction result of the target image to be segmented.
3. The image segmentation method according to claim 2, wherein the weight of a pixel is assigned a value of 1 when the number of boundary pixels in the preset neighborhood of that pixel is 0.
4. The image segmentation method according to any one of claims 1 to 3, wherein the step-by-step encoding of the image to be segmented to obtain the highest-level feature image of the image to be segmented comprises:
encoding the image to be segmented step by step to obtain multilayer feature images, wherein the spatial information of the multilayer feature images decreases sequentially and the semantic information increases sequentially;
and acquiring, as the highest-level feature image of the image to be segmented, the feature image with the minimum spatial information and the maximum semantic information among the multilayer feature images.
5. An image segmentation apparatus, comprising:
the encoding module is configured to input an image to be segmented into an encoding module of a semantic segmentation network for step-by-step encoding to obtain multilayer feature images, wherein the spatial information of the multilayer feature images decreases sequentially and the semantic information increases sequentially, and to acquire, as the highest-level feature image, the feature image with the minimum spatial information and the maximum semantic information among the multilayer feature images;
the input module is configured to send the highest-level feature image to a compact feature learning module, calculate, in the compact feature learning module, the feature center of each class to be segmented from a coarse segmentation prediction result, and then perform feature reconstruction on the highest-level feature image according to the similarity between each pixel of the highest-level feature image and each feature center to obtain a compact feature image;
the decoding module is configured to combine the obtained compact feature image with the highest-level feature image to obtain a first-level compact feature image, acquire the feature image from the encoding process whose spatial information is the same as that of the first-level compact feature image, perform feature fusion on the first-level compact feature image and that feature image to obtain a second-level compact feature image, and so on until an Nth-level compact feature image is obtained, wherein the spatial information of the Nth-level compact feature image is the same as that of the image to be segmented, the Nth-level compact feature image being taken as a target image to be segmented;
the prediction module is configured to predict the class to be segmented of each pixel in the target image to be segmented to obtain a segmentation prediction image of the target image to be segmented;
and the extraction module is configured to extract the prediction boundary of the target image to be segmented according to the segmentation prediction image.
6. An electronic device, comprising: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the image segmentation method according to any one of claims 1 to 4 when executing the computer program.
7. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the image segmentation method according to any one of claims 1 to 4.
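The compact feature computation recited in claim 1 (class feature centers from a coarse segmentation prediction, then reconstruction of each pixel by its similarity to the centers) can be sketched in NumPy. The probability-weighted center pooling and the dot-product-with-softmax similarity below are illustrative assumptions; the claim does not fix the pooling or similarity measure.

```python
import numpy as np

def compact_features(feat, coarse_prob):
    """Illustrative sketch of compact feature learning.

    feat:        (C, H, W) highest-level feature image
    coarse_prob: (K, H, W) coarse segmentation probabilities (softmax output)
    Returns a (C, H, W) compact feature image.
    """
    c, h, w = feat.shape
    k = coarse_prob.shape[0]
    f = feat.reshape(c, -1)         # (C, HW) pixels as columns
    p = coarse_prob.reshape(k, -1)  # (K, HW)
    # feature center of each class: probability-weighted average of features
    centers = (p @ f.T) / p.sum(axis=1, keepdims=True)  # (K, C)
    # similarity of each pixel to each center: dot product + softmax over K
    sim = centers @ f                                   # (K, HW)
    sim = np.exp(sim - sim.max(axis=0, keepdims=True))
    sim /= sim.sum(axis=0, keepdims=True)
    # reconstruct each pixel as a similarity-weighted sum of the centers
    return (centers.T @ sim).reshape(c, h, w)

rng = np.random.default_rng(1)
feat = rng.standard_normal((6, 4, 4))    # toy highest-level feature image
logits = rng.standard_normal((3, 4, 4))  # toy coarse prediction, 3 classes
prob = np.exp(logits) / np.exp(logits).sum(axis=0, keepdims=True)
compact = compact_features(feat, prob)
print(compact.shape)  # (6, 4, 4): same shape as the input feature image
```

Because every reconstructed pixel lies in the span of the class feature centers, features of pixels in the same class are pulled together, which is the "compact" property the claims rely on.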
CN202010085655.9A 2020-02-10 2020-02-10 Image segmentation method and device, electronic equipment and storage medium Active CN111340820B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010085655.9A CN111340820B (en) 2020-02-10 2020-02-10 Image segmentation method and device, electronic equipment and storage medium


Publications (2)

Publication Number Publication Date
CN111340820A CN111340820A (en) 2020-06-26
CN111340820B true CN111340820B (en) 2022-05-17

Family

ID=71181405

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010085655.9A Active CN111340820B (en) 2020-02-10 2020-02-10 Image segmentation method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111340820B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111915627B (en) * 2020-08-20 2021-04-16 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) Semantic segmentation method, network, device and computer storage medium
CN113284155B (en) * 2021-06-08 2023-11-07 京东科技信息技术有限公司 Video object segmentation method and device, storage medium and electronic equipment
CN113822901B (en) * 2021-07-21 2023-12-12 南京旭锐软件科技有限公司 Image segmentation method and device, storage medium and electronic equipment
CN114565759A (en) * 2022-02-22 2022-05-31 北京百度网讯科技有限公司 Image semantic segmentation model optimization method and device, electronic equipment and storage medium

Citations (3)

Publication number Priority date Publication date Assignee Title
CN109872306A (en) * 2019-01-28 2019-06-11 腾讯科技(深圳)有限公司 Medical image cutting method, device and storage medium
CN110148192A (en) * 2019-04-18 2019-08-20 上海联影智能医疗科技有限公司 Medical image imaging method, device, computer equipment and storage medium
CN110490203A (en) * 2019-07-05 2019-11-22 平安科技(深圳)有限公司 Image partition method and device, electronic equipment and computer readable storage medium

Family Cites Families (2)

Publication number Priority date Publication date Assignee Title
CN110838124B (en) * 2017-09-12 2021-06-18 深圳科亚医疗科技有限公司 Method, system, and medium for segmenting images of objects having sparse distribution
CN108335313A (en) * 2018-02-26 2018-07-27 阿博茨德(北京)科技有限公司 Image partition method and device

Patent Citations (3)

Publication number Priority date Publication date Assignee Title
CN109872306A (en) * 2019-01-28 2019-06-11 腾讯科技(深圳)有限公司 Medical image cutting method, device and storage medium
CN110148192A (en) * 2019-04-18 2019-08-20 上海联影智能医疗科技有限公司 Medical image imaging method, device, computer equipment and storage medium
CN110490203A (en) * 2019-07-05 2019-11-22 平安科技(深圳)有限公司 Image partition method and device, electronic equipment and computer readable storage medium



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant