CN114693926A

CN114693926A - Image semantic segmentation method based on deep learning

Info

Publication number: CN114693926A
Application number: CN202210292438.6A
Authority: CN
Inventors: 匡芳君; 张思扬; 邢军; 沈良忠; 郝慧君; 李博然
Original assignee: Wenzhou Business College
Current assignee: Wenzhou Business College
Priority date: 2022-03-23
Filing date: 2022-03-23
Publication date: 2022-07-01

Abstract

The invention provides an image semantic segmentation method based on deep learning, which comprises four steps of data acquisition, model construction, model training and semantic segmentation, wherein the original image data and the image data for training are pre-processed to reduce the resolution of the original image data and the image data for training and avoid the influence of overhigh resolution on the operation speed of image semantic segmentation, a residual connecting module and a dense connecting module are introduced into the image semantic segmentation network model based on deep learning, the residual modules are used for carrying out feature fusion, so that the information quantity for describing the image features and details is increased, the dense connecting module is used for enhancing the transmission among the features, thereby avoiding the loss of feature information in the feature extraction process, improving the feature extraction effect, and testing the trained model by introducing a test data set, and optimizing the trained model continuously according to the test result.

Description

Image semantic segmentation method based on deep learning

Technical Field

The invention relates to the technical field of image semantic segmentation, in particular to an image semantic segmentation method based on deep learning.

Background

The digital image processing technology is a interdisciplinary field, with the continuous development of computer science and technology, image processing and analysis gradually form a scientific system, and new processing methods are developed endlessly, and although the development history is not long, the method attracts the wide attention of people in all aspects. Firstly, vision is the most important perception means of human beings, and images are the basis of vision, so that digital images become effective tools for researchers in various fields such as psychology, physiology, computer science and the like to research visual perception;

in the field of computer vision, the application of the current neural network mainly comprises image recognition, target positioning and detection and semantic segmentation, wherein the image recognition is used for telling you what the image is, the target positioning and detection are used for telling you where the target is in the image, and the semantic segmentation is used for answering the two questions from the pixel level;

image semantic segmentation (semantic segmentation), which is understood literally, a computer is segmented according to image semantics, and a scene understanding task in the computer vision field is mainly realized by an image segmentation technology, but traditional image segmentation methods including a region segmentation method, an edge detection segmentation method and an image segmentation method cannot really recognize semantics of objects in a picture, have less information describing image features and details, and cannot achieve the purpose of understanding scene information, so that the invention provides an image semantic segmentation method based on deep learning to solve the problems in the prior art.

Disclosure of Invention

In view of the above problems, the present invention aims to provide an image semantic segmentation method based on deep learning, which solves the problem of less information describing image features and details in the prior art.

In order to realize the purpose of the invention, the invention is realized by the following technical scheme: an image semantic segmentation method based on deep learning comprises the following steps:

step one, data acquisition

The method comprises the steps of obtaining original image data to be processed and image data for training, and preprocessing the original image data and the image data for training to obtain preprocessed original image data and preprocessed image data for training;

step two: model construction

Constructing an image semantic segmentation network model based on deep learning, wherein the image semantic segmentation network model based on deep learning comprises a down-sampling encoder layer, an up-sampling decoder layer, a residual error connection module and a dense connection module;

step three: model training

Inputting the training image data obtained in the first step into the deep learning-based image semantic segmentation network model constructed in the second step, and performing model training to obtain a trained image semantic segmentation network model;

step four: semantic segmentation

And (4) inputting the preprocessed original image data obtained in the first step into the trained image semantic segmentation network model in the third step, and outputting a result.

The further improvement lies in that: in the first step, the preprocessing is to perform downsampling on the original image data and the training image data, reduce the resolution of the original image data and the training image data, and avoid the influence of too high image resolution on the operation speed of image semantic segmentation.

The further improvement lies in that: in the first step, a plurality of groups of image data for training are required to be obtained to form an image data set for model training.

The further improvement lies in that: in the first step, the resolution of the preprocessed original image data is consistent with that of the image data for training.

The further improvement lies in that: in the second step, a plurality of groups of encoder layers for down sampling and decoder layers for up sampling are provided.

The further improvement lies in that: in the second step, the residual connecting module is used for connecting with the encoder layers and introducing a residual item in the model training process, and the residual connecting module is not connected with all the encoder layers.

The further improvement lies in that: in the second step, the image semantic segmentation network model is an improvement on the existing full convolution network model.

The further improvement is that: and in the third step, a test image data set is obtained and input into the trained model for testing.

The invention has the beneficial effects that: according to the image semantic segmentation method based on the deep learning, the residual connection module and the dense connection module are introduced into the image semantic segmentation network model based on the deep learning, the residual module is used for carrying out feature fusion, so that the information quantity for describing image features and details is increased, the dense connection module is used for enhancing the transmission among the features, therefore, the loss of feature information in the feature extraction process is avoided, the feature extraction effect is improved, the original image data and the image data for training can be preprocessed, the resolution of the original image data and the image data for training are reduced, and the problem that the running speed of image semantic segmentation is influenced due to the overhigh resolution of the images is avoided.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.

FIG. 1 is a flowchart illustrating a first step of the present invention.

FIG. 2 is a schematic diagram of an encoder layer connection structure according to a second embodiment of the present invention

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Example one

According to fig. 1, the embodiment proposes an image semantic segmentation method based on deep learning, which includes the following steps:

step one, data acquisition

The method comprises the steps of obtaining original image data to be processed and image data for training, and preprocessing the original image data and the image data for training to obtain preprocessed original image data and preprocessed image data for training, wherein in the first step, preprocessing is to perform down-sampling on the original image data and the image data for training, the resolution ratios of the original image data and the image data for training are reduced, and the problem that the operating speed of semantic segmentation of images is affected due to overhigh resolution ratios of the images is avoided;

step two: model construction

Constructing an image semantic segmentation network model based on deep learning, wherein the image semantic segmentation network model based on deep learning comprises a down-sampling encoder layer, an up-sampling decoder layer, a residual error connection module and a dense connection module, in the second step, a plurality of groups of encoder layers for down sampling and decoder layers for up sampling are arranged, in the second step, the residual error connecting module is used for connecting with the encoder layers, and residual error items are introduced in the process of model training, and the residual error connecting module is not connected with all the encoder layers, in the second step, the image semantic segmentation network model is an improvement of the existing full convolution network model, a plurality of groups of encoder layers form an encoder, the front part of the coder layers in the coder are connected by adopting a residual error connecting module, the rear part of the coder layers are connected by adopting a dense connecting module, and a plurality of groups of decoder layers form a decoder;

step three: model training

step four: semantic segmentation

Example two

The embodiment provides an image semantic segmentation method based on deep learning, which includes the following steps:

step one, data acquisition

step two: model construction

Constructing an image semantic segmentation network model based on deep learning, wherein the image semantic segmentation network model based on deep learning comprises a down-sampling encoder layer, an up-sampling decoder layer, a residual error connection module and an intensive connection module, in the second step, a plurality of groups of encoder layers and up-sampling decoder layers are arranged, in the second step, the residual error connection module is used for being connected with the encoder layers and introducing a residual error item in the model training process, the residual error connection module is not connected with all the encoder layers, in the second step, the image semantic segmentation network model is formed by improving the existing full convolution network model, a plurality of groups of encoder layers form an encoder, and a plurality of groups of decoder layers form a decoder;

wherein, the residual connecting module mainly forms residual connection between the encoder layers, and in the constructed image semantic segmentation network model, as shown in FIG. 2, is a distribution schematic diagram of the encoder layers, through residual connection, the condition that the parameters of the identity mapping of the layer are learned in the model construction process can be avoided, thereby simplifying the difficulty of model learning, avoiding the influence of a redundant layer on the network effect, effectively relieving the degradation phenomenon caused by the increase of the network depth, meanwhile, the residual error connection module also adopts an element addition method for fusion, so that the information quantity describing the image characteristics and details is increased after the subsequent characteristic fusion is carried out, the encoder layer also comprises a convolution of 3 multiplied by 3, when the picture is input, the feature extraction is carried out through the convolution of 3 multiplied by 3, and the generated feature graph enters a subsequent network model on one hand and is directly fused with a new feature graph on the other hand;

the dense connection module leads the output of the previous encoder layer into the following encoder layer, namely, all layers are directly connected on the premise of ensuring the maximum information transmission between the layers in the network, and the dense connection module is fused in a channel number merging mode through an element splicing method, the feature graph fused by the method not only contains the features after convolution extraction, but also directly receives the enhancement of the initial features, and the information contained in different feature graphs is more effectively utilized

Step three: model training

step four: semantic segmentation

Inputting the preprocessed original image data obtained in the first step into a trained image semantic segmentation network model in the third step, outputting a result, wherein the result output is output through a decoder layer, the decoder layer performs up-sampling through deconvolution initialized by a bilinear interpolation filter, and simultaneously, a splicing fusion method is adopted to directly fuse a feature map output by each encoder layer with the feature map sampled on the decoder layer, so that the segmentation result is refined by using spatial information of different resolutions at different stages in an encoder, and accurate and detailed segmentation is generated, and the image semantic separation operation of the original image data is completed.

EXAMPLE III

step one, data acquisition

The method comprises the steps of acquiring original image data to be processed and image data for training, and preprocessing the original image data and the image data for training to obtain preprocessed original image data and preprocessed image data for training, wherein in the first step, preprocessing is to downsample the original image data and the image data for training, the resolutions of the original image data and the image data for training are reduced, and the operating speed of semantic segmentation of an image is prevented from being influenced by overhigh image resolution;

step two: model construction

step three: model training

Inputting the training image data obtained in the first step into the deep learning-based image semantic segmentation network model constructed in the second step, performing model training to obtain a trained image semantic segmentation network model, acquiring a test image data set in the third step, inputting the test image data set into the trained model for testing, and testing according to a test result;

step four: semantic segmentation

The difference between this embodiment and the first and second embodiments is that, in the trained model, a test image data set is introduced to perform corresponding tests on the trained model, and parameters of the image semantic segmentation network model are continuously adjusted according to a test result, that is, the image semantic segmentation network model is continuously optimized according to the test result, which is beneficial to improving the image semantic segmentation effect.

According to the method, the residual connecting module and the dense connecting module are introduced into the construction of the image semantic segmentation network model based on deep learning, the residual module is used for carrying out feature fusion, so that the information quantity for describing image features and details is increased, the dense connecting module is used for enhancing the transmission among the features, the loss of feature information in the feature extraction process is avoided, the feature extraction effect is improved, the original image data and the image data for training can be preprocessed, the resolution of the original image data and the image data for training are reduced, and the problem that the operating speed of image semantic segmentation is influenced due to overhigh image resolution is avoided.

The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims

1. An image semantic segmentation method based on deep learning is characterized in that: the method comprises the following steps:

step one, data acquisition

step two: model construction

step three: model training

step four: semantic segmentation

2. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: in the first step, the preprocessing is to perform downsampling on the original image data and the training image data, reduce the resolution of the original image data and the training image data, and avoid the influence of too high image resolution on the operation speed of image semantic segmentation.

3. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: in the first step, a plurality of groups of image data for training are required to be obtained to form an image data set for model training.

4. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: in the first step, the resolution of the preprocessed original image data is consistent with that of the preprocessed image data for training.

5. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: in the second step, a plurality of groups of encoder layers for down sampling and decoder layers for up sampling are provided.

6. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: in the second step, the residual connecting module is used for connecting with the encoder layers and introducing a residual item in the model training process, and the residual connecting module is not connected with all the encoder layers.

7. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: in the second step, the image semantic segmentation network model is an improvement on the existing full convolution network model.

8. The image semantic segmentation method based on deep learning according to claim 1, characterized in that: and in the third step, a test image data set is obtained and input into the trained model for testing.