CN109101975A - Image, semantic dividing method based on full convolutional neural networks - Google Patents

Image, semantic dividing method based on full convolutional neural networks Download PDF

Info

Publication number
CN109101975A
CN109101975A CN201810947884.XA CN201810947884A CN109101975A CN 109101975 A CN109101975 A CN 109101975A CN 201810947884 A CN201810947884 A CN 201810947884A CN 109101975 A CN109101975 A CN 109101975A
Authority
CN
China
Prior art keywords
image
semantic
layer
semantic segmentation
details
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810947884.XA
Other languages
Chinese (zh)
Other versions
CN109101975B (en
Inventor
程建
苏炎洲
林莉
高银星
李恩泽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201810947884.XA priority Critical patent/CN109101975B/en
Publication of CN109101975A publication Critical patent/CN109101975A/en
Application granted granted Critical
Publication of CN109101975B publication Critical patent/CN109101975B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The present invention discloses a kind of image, semantic dividing method based on full convolutional neural networks, is related to image, semantic segmentation and deep learning field, includes the following steps: to select training dataset;It constructs and trains by the disaggregated model of image to class label, and as semantic segmentation model front network;The characteristic pattern of each piece of front network output passes through details respectively, and to retain pond layer down-sampled at unified size, then this four output characteristic patterns is connected, and by the heavy correction module of feature, after re-calibrating characteristic pattern, by the incoming back-end network of obtained characteristic pattern;Back-end network is to be mainly responsible for picture up-sampling, by up-sampling and then carrying out error back propagation finally with the semantic tagger image of training dataset calculating cross entropy by the global pool of a variable weight.The present invention solves the problems, such as that image segmentation accuracy rate in the prior art is lower.

Description

Image, semantic dividing method based on full convolutional neural networks
Technical field
The present invention relates to image, semantic segmentation and deep learning fields, more particularly to the image based on full convolutional neural networks Semantic segmentation method.
Background technique
Semantic segmentation is an important problem in computer vision field.Image, semantic segmentation is to give each pixel A different label (classification) is assigned, therefore is considered an intensive classification problem.
In recent years, most current optimal image, semantic dividing methods are all based on full convolutional neural networks.Allusion quotation The semantic segmentation network structure of type is coder-decoder structure, and encoder is an image drop sampling process, is responsible for extracting figure As coarse semantic feature, it is followed by a decoder, decoder is a picture up-sampling process, is responsible for down-sampled Obtained characteristics of image carries out up-sampling and is restored to the original dimension of input picture.
Although pond is in the process a crucial component part in the down-sampled of convolutional neural networks, can be used to reduce The scale of parameter enhances the invariance to certain distortions, while increasing receptive field.But because of pond, itself is a damage The process of consumption makes semantic point so it will lead to the loss of image, semantic information during the image drop sampling of semantic segmentation The precision for cutting result is relatively low.
In depth convolutional neural networks, pond layer is replaced commonly using the convolution that strides (strided convolutions) Have the function that down-sampled, the convolution that strides only considers a node of the fixation position of each local neighborhood, activates without considering Importance.From the angle of image drop sampling, such down-sampled mode equally also results in the distortion of feature.Full convolutional Neural Network is the state-of-the-art image, semantic partitioning algorithm of a large amount of application programming, and wherein the innovation of network structure is mainly concentrated It encodes or is connected to the network to promote gradient current in room for improvement.
Summary of the invention
It is existing to solve it is an object of the invention to design a kind of image, semantic dividing method based on full convolutional neural networks The problem for having the image segmentation accuracy rate in technology lower.
Technical scheme is as follows:
Image, semantic dividing method based on full convolutional neural networks, includes the following steps:
Step 1: selection training dataset.
Step 2: constructing and train the disaggregated model by image to class label, and as semantic segmentation model front end Network;
The structure of semantic segmentation model front network includes Conv1, Conv2_x, Conv3_x and Conv4_x, Conv1, Conv2_x, Conv3_x and Conv4_x include multiple convolutional layers, Conv4_x, Conv1, Conv2_x, Conv3_x and It is all connected with a details behind Conv4_x and retains pond layer one.
Step 3: based on trained semantic segmentation model front network, constructing semantic segmentation model back-end network.
The structure of back-end network include details retain pond layer two, feature weigh correction module, convolutional layer, Conv5_x, Conv6_x, Conv7_x, convolutional layer, variable weight global pool layer and up-sampling layer;The output of Conv1, Conv2_x, Conv3_x point Tong Guo not be after three details retain and are connected in series after pond layers two with Conv4_x, common input feature vector weighs correction module;Conv5_ X, be all connected with a up-sampling layer before Conv6_x and Conv7_x, Conv5_x, Conv6_x and Conv7_x include convolutional layer, Batch normalization layer and line rectification unit, Conv5_x, Conv6_x, Conv7_x by jump structure respectively successively with Conv3_ X, the output characteristic pattern series connection of Conv2_x and Conv1.
Step 4: whole image semantic segmentation model is trained.
Step 5: inputting new image, a propagated forward is carried out in trained deep neural network model, hold The semantic segmentation result of prediction is exported to end.
It specifically, include 33 residual error structures, each residual error structure packet in the semantic segmentation front end model front network Containing 11 × 1 convolution, 13 × 3 convolution, 11 × 1 convolution sum 1 quick connection.
Specifically, the details after the Conv1 retains down-sampled 8 times of output characteristic pattern of pond layer a pair of Conv1, Details after Conv2_x retains down-sampled 4 times of output characteristic pattern of pond layer a pair of Conv2_x, and the details after Conv3_x retains Down-sampled 2 times of the output characteristic pattern of pond layer a pair of Conv3_x.
Specifically, the details retains pond layer one and details retains the detailed process of pond layer two are as follows:
The output P of each position is calculated according to the characteristic pattern I of input:
Wherein,[P] indicates that input feature vector figure retains the value of pond Hua Cenghou output position P by details;
Neighborhood spaceThe space average weights omega of input nodeα,β[p, q] is
Wherein α is biased exponent, and β is reward index.ρβ(·)It is anti-bilateral filtering function, is used in neighborhood space ΩpMeter The weight of input point is calculated, β reduces the dynamic range of reward function, and β → 0 is exactly that simple field is average.
It is linear-scale reduction factor, specifically:
Wherein F is in neighborhoodOn one can learn, the 2D filter of nonstandardized technique, thisSize be 3 × 3。
Specifically, the feature weight correction module is to combine space characteristics to correct the network corrected again with channel characteristics again Module.
Specifically, the process of training whole image semantic segmentation model are as follows:
Step 4.1: the image concentrated to training data pre-processes, and is fixed dimension by image cutting-out.
Step 4.2: whole image semantic segmentation model is initialized.
Step 4.3: the data concentrated to training data are expanded by way of overturning, scaling and rotation.
Step 4.4: using the sum of the intersection entropy loss of each pixel as loss function, reusing random tonsure descent algorithm Error back propagation is carried out, model parameter is updated, obtains trained semantic segmentation model.
After adopting the above scheme, beneficial effects of the present invention are as follows:
(1) image, semantic parted pattern of the invention introduces details reservation pond layer, during down-sampled, Neng Goubao Stay more image detail informations.It is a kind of adaptive pond method that details, which retains pond layer, and this method can amplify sky Between change and retain important CONSTRUCTED SPECIFICATION, it is also important that, its parameter can learn jointly with the rest part of network.
(2) introduced feature weight correction module in the present invention, is carrying out re-graduation just to feature, space characteristics re-graduation just can be more preferable The importance by same position pixels all in space re-calibrated, and be assigned to corresponding weight, improve semantic segmentation Accuracy rate, important channel can just be assigned to high weight by channel characteristics re-graduation, prominent importance;In short, feature re-graduation is just Module can efficiently solve image, semantic segmentation, and accuracy rate is low, during pond the problem of detailed information loss, finally obtains Preferable semantic segmentation result.
(3) the variable weight global pool described in, since traditional global draw pondization operates, to the same of all feature channels Position is carried out same operation, i.e. 1 × 1 convolution, cannot protrude the correct class categories of each pixel in semantic segmentation, give 1 × 1 convolution in the average pond of the overall situation adds 1 weight vector, is distributed by standard gaussian and carries out parameter initialization, in error In back-propagation process, the weight of pixel is constantly updated, can preferably be classified pixel-by-pixel, moreover it is possible to it is convergent to play quickening Effect.
Detailed description of the invention
Fig. 1 is flow chart of the invention;
Fig. 2 is image, semantic parted pattern structure chart of the invention;
Fig. 3 is residual error structure chart of the invention;
Fig. 4 is the positive function structure chart of feature re-graduation of the invention;
Fig. 5 is the positive function structure chart of channel characteristics re-graduation of the invention;
Fig. 6 is the positive function structure chart of space characteristics re-graduation of the invention.
Specific embodiment
Understand to make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with specific implementation with and it is attached Figure, invention is further described in detail.
It is proposed by the present invention a kind of based on full convolution to solve the problems, such as that image segmentation accuracy rate in the prior art is lower The image, semantic dividing method of neural network can be widely applied for the field of General Two-Dimensional image, semantic segmentation.
As shown in Figure 1, the image, semantic dividing method based on full convolutional neural networks, the present invention include the following steps:
Step 1: selection training dataset;Using 21 classes of 2012 data set of VOC in the present embodiment (wherein 1 class is background) On the basis of scene type, acquires the image in COCO data set comprising above-mentioned 20 class classification target and data set is added, finally obtain Trained and test data set.
Step 2: constructing and train the disaggregated model by image to class label, and as semantic segmentation model front end Network.
As shown in Fig. 2, the structure of semantic segmentation model front network includes Conv1, Conv2_x, Conv3_x and Conv4_ X, Conv1, Conv2_x, Conv3_x and Conv4_x include multiple convolutional layers, Conv4_x, Conv1, Conv2_x, Conv3_ It is all connected with a details behind x and Conv4_x and retains pond layer one;After each piece of Conv2_x, Conv3_x and Conv4_x Face adds a details to retain pond layer, this is a kind of adaptive pool method, can amplify spatial variations and retain important knot Structure details.
As shown in figure 3, including 33 residual error structures in the model front network of semantic segmentation front end, each residual error structure includes 11 × 1 convolution, 13 × 3 convolution, 11 × 1 convolution sum 1 quick connection (shortcut connection).
For the convenience of description, by the characteristic pattern (having a size of 112 × 112) of Conv1 output, the characteristic pattern of Conv2_x output (having a size of 56 × 56), the characteristic pattern (having a size of 28 × 28) of Conv3_x output, the characteristic pattern of Conv4_x output is (having a size of 14 × 14) characteristic pattern Res_1, characteristic pattern Res_2, characteristic pattern Res_3 and characteristic pattern Res_4 are denoted as.
Details after Conv1 retains down-sampled 8 times of pond layer a pair of characteristic pattern Res_1, and the details after Conv2_x retains pond Change down-sampled 4 times of layer a pair of characteristic pattern Res_2, the details after Conv3_x retains pond layer a pair of characteristic pattern Res_3 down-sampled 2 Times.
Step 3: based on trained semantic segmentation model front network, constructing semantic segmentation model back-end network.
As shown in Fig. 2, the structure of back-end network include details retain pond layer two, feature weigh correction module, convolutional layer, Conv5_x, Conv6_x, Conv7_x, convolutional layer, variable weight global pool layer and up-sampling layer;Characteristic pattern Res_1, characteristic pattern Res_2, characteristic pattern Res_3 pass through respectively after three details retain and be connected in series after pond layers two with Conv4_x, common input spy Levy weight correction module;Be all connected with a up-sampling layer before Conv5_x, Conv6_x and Conv7_x, Conv5_x, Conv6_x and Conv7_x includes that convolutional layer, batch normalization layer and line rectification unit, Conv5_x, Conv6_x, Conv7_x pass through jump Structure is successively connected with the output characteristic pattern of Conv3_x, Conv2_x and Conv1 respectively.
For in step 2 and step 3, the details retains pond layer one and details retains the detailed process of pond layer two Are as follows:
The output P of each position is calculated according to the characteristic pattern I of input:
Wherein,[P] indicates that input feature vector figure retains the value of pond Hua Cenghou output position P by details;Neighborhood is empty BetweenThe space weighted average ω of input nodeα,β[p, q] is
Wherein α is biased exponent, and β is reward index.ρβ() is anti-bilateral filtering function, is used in neighborhood space ΩpMeter The weight of input point is calculated, β reduces the dynamic range of reward function, and β → 0 is exactly that simple field is average.
It is linear-scale reduction factor, specifically:
Wherein F is in neighborhoodOn one can learn, the 2D filter of nonstandardized technique, thisSize be 3 × 3。
Specifically, feature weight correction module it is (as shown in Figure 4) be in conjunction with space characteristics re-graduation just with channel characteristics re-graduation just Network module.
It will separately be illustrated below:
As shown in figure 5, process in space characteristics weight correction module are as follows:
(1) by primitive character figureIt is 1 × 1 by a convolution kernel size, port number is c (each logical The weight in road is not shared, it is allowed to obtain from study) convolution, obtain a characteristic pattern
(2) one sigmoid layers are passed it through again, by McEach spatial position M ' (i, j), i ∈ { 1,2 ..., H }, j The importance of ∈ { 1,2 ..., W } re-calibrates, and is assigned to the weight p (i, j) of each spatial position one, obtained p (i, j) with Primitive character figure McCarry out dot product.
Finally, McThe characteristic pattern just obtained by space characteristics re-graduation are as follows:
Space characteristics re-graduation just can preferably be re-calibrated the importance of same position pixels all in space, And it is assigned to corresponding weight, improve the accuracy rate of semantic segmentation.
As shown in fig. 6, process in channel characteristics weight correction module are as follows:
(1) by primitive character figureIt is averaged pond by an overall situation, obtains a characteristic patternAgain by M ' and primitive character figure McIt is connected entirely, carries out the integration of characteristic pattern.
(2) characteristic pattern after integrating is modified feature using a linear amending unit.
It (3) is finally H × W using a convolution kernel size to revised characteristic pattern, port number is that the convolution of c obtains One feature vector
(4) the activation range of feature vector z is limited between [0,1], obtains using one sigmoid layers by characteristic pattern To a channel weight vectorMcThe characteristic pattern just obtained by channel characteristics re-graduation:
Just by channel characteristics re-graduation, important channel can be assigned to high weight, prominent importance.
Step 4: whole image semantic segmentation model is trained;The process of training whole image semantic segmentation model For.
Step 4.1: the image concentrated to training data pre-processes, and is fixed dimension 513 × 513 by image cutting-out.
Step 4.2: whole image semantic segmentation model being initialized, i.e., mould is divided with the good image, semantic of pre-training The parameter value of type is initial value.
Step 4.3: the data concentrated to training data are expanded by way of overturning, scaling and rotation;Specifically, Overturning is random overturning;In the random zoomed image between 0.5 to 2 times of original image;In original image between -10 to 10 degree, Random-Rotation image.
Step 4.4: using the sum of the intersection entropy loss of each pixel as loss function, reusing random tonsure descent algorithm Error back propagation is carried out, with multinomial learning strategy, model parameter is updated, obtains trained semantic segmentation model.It is multinomial In formula learning strategy, learning rate lr setting are as follows:
Wherein, baselr is initial learning rate, is set as 0.001, power settingization 0.9 here.
Step 5: inputting new image, a propagated forward is carried out in trained deep neural network model, hold The semantic segmentation result of prediction is exported to end.
The principle of the present invention and process are as follows: using the spy of Conv1 output in image, semantic parted pattern of the invention The feature of characteristic pattern Res_3, the Conv4_x output of characteristic pattern Res_2, the Conv3_x output of sign figure Res_1, Conv2_x output Scheme Res_4, the respectively first layer, the second layer of front network (i.e. feature extraction network), third layer and the 4th layer.It then will be special Sign figure Res_1, which retains pond layer one by details, to carry out retaining down-sampled 8 times of details convolution, and Res_2 passes through to be retained by details Pond layer one carries out retaining down-sampled 4 times of details pondization, and Res_3 passes through to carry out retaining details volume by details reservation pond layer one Down-sampled 2 times and Res_1 of product is together in series, and is input to feature weight correction module, just by channel characteristics re-graduation, space characteristics Re-graduation just can preferably be re-calibrated the importance of same position pixels all in space, and be assigned to weigh accordingly Value, improves the accuracy rate of semantic segmentation, and important channel can be just assigned to high weight, prominent importance by channel characteristics re-graduation. Then the characteristic pattern of feature weight correction module output is passed through to 11 × 1 convolutional layer, obtained characteristic pattern De_1, by characteristic pattern De_1 is up-sampled to 28 × 28, and obtained characteristic pattern passes through 23 × 3 convolution, batch normalization layer and line rectification units, most It connects afterwards with characteristic pattern Res_3, obtained characteristic pattern De_2;De_1De_2 is up-sampled to 56 × 56, using 23 × 3 Convolution, batch normalization layer and line rectification unit, connect with characteristic pattern Res_2, obtained characteristic pattern De_3, by characteristic pattern De_3 Up-sampling is to 112 × 112, using 23 × 3 convolution, batch normalization layer and line rectification unit, obtained characteristic pattern De_4, Characteristic pattern De_4 is finally passed through into 1 variable weight global pool, is finally up-sampled to original image size, and marks and calculates with semantic segmentation Cross entropy is propagated using direction of error, obtains the network model of semantic segmentation.Variable weight global pool, since traditional overall situation is flat Office's pondization operation, is carried out same operation, i.e. 1 × 1 convolution to the same position in all feature channels, cannot protrude semantic segmentation In the correct class categories of each pixel pass through mark to 1 × 1 convolution in global average pond plus 1 weight vector Quasi- Gaussian Profile carries out parameter initialization, in training process, according to backpropagation, is assigned to Gao Quan to the pixel for belonging to target category Value, can preferably be classified pixel-by-pixel, moreover it is possible to play the role of accelerating convergent.The present invention is in VOC2012 semantic segmentation number According to achieved on collection mIoU be 76.33% result.
All technology deformations made according to the technique and scheme of the present invention, fall within the scope of protection of the present invention.

Claims (6)

1. the image, semantic dividing method based on full convolutional neural networks, which comprises the steps of:
Step 1: selection training dataset;
Step 2: constructing and train the semantic segmentation model front network by image to class label;
The structure of semantic segmentation model front network includes Conv1, Conv2_x, Conv3_x and Conv4_x, Conv1, Conv2_ X, Conv3_x and Conv4_x includes multiple convolutional layers, Conv4_x, after Conv1, Conv2_x, Conv3_x and Conv4_x Face is all connected with a details and retains pond layer;
Step 3: based on trained semantic segmentation model front network, constructing semantic segmentation model back-end network;
The structure of back-end network include details retain pond layer two, feature weigh correction module, convolutional layer, Conv5_x, Conv6_x, Conv7_x, convolutional layer, variable weight global pool layer and up-sampling layer;The output of Conv1, Conv2_x, Conv3_x pass through three respectively After being connected in series after a details reservation pond layer two with Conv4_x, common input feature vector weight correction module;Conv5_x,Conv6_ A up-sampling layer is all connected with before x and Conv7_x, Conv5_x, Conv6_x and Conv7_x include convolutional layer, batch normalization Layer and line rectification unit, Conv5_x, Conv6_x, Conv7_x by jump structure respectively successively with Conv3_x, Conv2_x It connects with the output characteristic pattern of Conv1;
Step 4: whole image semantic segmentation model is trained;
Step 5: new image is inputted, a propagated forward is carried out in trained deep neural network model, it is end-to-end The semantic segmentation result of ground output prediction.
2. the image, semantic dividing method according to claim 1 based on full convolutional neural networks, which is characterized in that described It include 33 residual error structures in semantic segmentation front network, each residual error structure includes 11 × 1 convolution, 13 × 3 volume Product, 11 × 1 convolution sum 1 quick connection.
3. the image, semantic dividing method according to claim 1 based on full convolutional neural networks, which is characterized in that described Details after Conv1 retains down-sampled 8 times of output characteristic pattern of pond layer a pair of Conv1, and the details after Conv2_x retains pond Down-sampled 4 times of output characteristic pattern of layer a pair of Conv2_x, the details after Conv3_x retain the output of pond layer a pair of Conv3_x Down-sampled 2 times of characteristic pattern.
4. the image, semantic dividing method according to claim 1 based on full convolutional neural networks, which is characterized in that
The details retains pond layer one and details retains the detailed process of pond layer two are as follows:
The output P of each position is calculated according to the characteristic pattern I of input:
Wherein,Indicate that input feature vector figure retains the value of pond Hua Cenghou output position P by details;
Neighborhood spaceThe space weighted average ω of input nodeα,β[p, q] is
Wherein α is biased exponent, and β is reward index;ρβ() is anti-bilateral filtering function, is used in neighborhood space ΩpIt calculates defeated The weight of access point, β reduce the dynamic range of reward function, and β → 0 is exactly simple neighborhood averaging.
It is linear-scale reduction factor, specifically:
Wherein F is in neighborhoodOn one can learn, the 2D filter of nonstandardized technique, thisSize be 3 × 3.
5. the image, semantic dividing method according to claim 1 based on full convolutional neural networks, which is characterized in that described Feature weight correction module is to combine the space characteristics re-graduation just positive network module with channel characteristics re-graduation.
6. the image, semantic dividing method according to claim 1 based on full convolutional neural networks, which is characterized in that training The process of whole image semantic segmentation model are as follows:
Step 4.1: the image concentrated to training data pre-processes, and is fixed dimension by image cutting-out;
Step 4.2: whole image semantic segmentation model is initialized;
Step 4.3: the data concentrated to training data are expanded by way of overturning, scaling and rotation;
Step 4.4: using the sum of the intersection entropy loss of each pixel as loss function, reusing random tonsure descent algorithm and carry out Error back propagation updates model parameter, obtains trained semantic segmentation model.
CN201810947884.XA 2018-08-20 2018-08-20 Image semantic segmentation method based on full convolution neural network Expired - Fee Related CN109101975B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810947884.XA CN109101975B (en) 2018-08-20 2018-08-20 Image semantic segmentation method based on full convolution neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810947884.XA CN109101975B (en) 2018-08-20 2018-08-20 Image semantic segmentation method based on full convolution neural network

Publications (2)

Publication Number Publication Date
CN109101975A true CN109101975A (en) 2018-12-28
CN109101975B CN109101975B (en) 2022-01-25

Family

ID=64850450

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810947884.XA Expired - Fee Related CN109101975B (en) 2018-08-20 2018-08-20 Image semantic segmentation method based on full convolution neural network

Country Status (1)

Country Link
CN (1) CN109101975B (en)

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109886273A (en) * 2019-02-26 2019-06-14 四川大学华西医院 A kind of CMR classification of image segmentation system
CN109886082A (en) * 2019-01-03 2019-06-14 南京理工大学 A kind of Small object enhancing prediction module detection method based on SSD
CN109919080A (en) * 2019-03-05 2019-06-21 南京航空航天大学 More full convolutional neural networks of decoder and its corresponding microscopical structure recognition methods
CN110084319A (en) * 2019-05-07 2019-08-02 上海宝尊电子商务有限公司 Fashion images clothes collar recognition methods and system based on deep neural network
CN110147763A (en) * 2019-05-20 2019-08-20 哈尔滨工业大学 Video semanteme dividing method based on convolutional neural networks
CN110189334A (en) * 2019-05-28 2019-08-30 南京邮电大学 The medical image cutting method of the full convolutional neural networks of residual error type based on attention mechanism
CN110225342A (en) * 2019-04-10 2019-09-10 中国科学技术大学 The bit distribution system and method for Video coding based on semantic distortion metrics
CN110232693A (en) * 2019-06-12 2019-09-13 桂林电子科技大学 A kind of combination thermodynamic chart channel and the image partition method for improving U-Net
CN110263706A (en) * 2019-06-19 2019-09-20 南京邮电大学 A kind of haze weather Vehicular video Detection dynamic target and know method for distinguishing
CN110276267A (en) * 2019-05-28 2019-09-24 江苏金海星导航科技有限公司 Method for detecting lane lines based on Spatial-LargeFOV deep learning network
CN110363134A (en) * 2019-07-10 2019-10-22 电子科技大学 A kind of face blocked area localization method based on semantic segmentation
CN110428009A (en) * 2019-08-02 2019-11-08 南京航空航天大学 A kind of full convolutional neural networks and corresponding microscopical structure recognition methods
CN110517272A (en) * 2019-08-29 2019-11-29 电子科技大学 Blood cell segmentation method based on deep learning
CN110689020A (en) * 2019-10-10 2020-01-14 湖南师范大学 Segmentation method of mineral flotation froth image and electronic equipment
CN110782023A (en) * 2019-11-04 2020-02-11 华南理工大学 Reduction residual module porous convolution architecture network and rapid semantic segmentation method
CN110930409A (en) * 2019-10-18 2020-03-27 电子科技大学 Salt body semantic segmentation method based on deep learning and semantic segmentation model
CN111091550A (en) * 2019-12-12 2020-05-01 创新奇智(北京)科技有限公司 Multi-size self-adaptive PCB solder paste area detection system and detection method
CN111192248A (en) * 2019-12-30 2020-05-22 山东大学 Multi-task relation learning method for positioning, identifying and segmenting vertebral body in nuclear magnetic resonance imaging
CN111209972A (en) * 2020-01-09 2020-05-29 中国科学院计算技术研究所 Image classification method and system based on hybrid connectivity deep convolution neural network
CN111259906A (en) * 2020-01-17 2020-06-09 陕西师范大学 Method for generating and resisting remote sensing image target segmentation under condition containing multilevel channel attention
CN111275712A (en) * 2020-01-15 2020-06-12 浙江工业大学 Residual semantic network training method oriented to large-scale image data
CN111489364A (en) * 2020-04-08 2020-08-04 重庆邮电大学 Medical image segmentation method based on lightweight full convolution neural network
CN111612740A (en) * 2020-04-16 2020-09-01 深圳大学 Pathological image processing method and device
CN111862190A (en) * 2020-07-10 2020-10-30 北京农业生物技术研究中心 Method and device for automatically measuring area of isolated plant soft rot disease spot
CN111860386A (en) * 2020-07-27 2020-10-30 山东大学 Video semantic segmentation method based on ConvLSTM convolutional neural network
WO2020238734A1 (en) * 2019-05-27 2020-12-03 腾讯科技(深圳)有限公司 Image segmentation model training method and apparatus, computer device, and storage medium
CN112364878A (en) * 2020-09-25 2021-02-12 江苏师范大学 Power line classification method based on deep learning under complex background
CN112766279A (en) * 2020-12-31 2021-05-07 中国船舶重工集团公司第七0九研究所 Image feature extraction method based on combined attention mechanism
CN113487524A (en) * 2021-04-07 2021-10-08 北京百度网讯科技有限公司 Image format conversion method, device, equipment, storage medium and program product

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107480726A (en) * 2017-08-25 2017-12-15 电子科技大学 A kind of Scene Semantics dividing method based on full convolution and shot and long term mnemon
CN107784654A (en) * 2016-08-26 2018-03-09 杭州海康威视数字技术股份有限公司 Image partition method, device and full convolutional network system
US20180137338A1 (en) * 2016-11-16 2018-05-17 The Governing Council Of The University Of Toronto System and method for classifying and segmenting microscopy images with deep multiple instance learning
CN108062756A (en) * 2018-01-29 2018-05-22 重庆理工大学 Image, semantic dividing method based on the full convolutional network of depth and condition random field

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107784654A (en) * 2016-08-26 2018-03-09 杭州海康威视数字技术股份有限公司 Image partition method, device and full convolutional network system
US20180137338A1 (en) * 2016-11-16 2018-05-17 The Governing Council Of The University Of Toronto System and method for classifying and segmenting microscopy images with deep multiple instance learning
CN107480726A (en) * 2017-08-25 2017-12-15 电子科技大学 A kind of Scene Semantics dividing method based on full convolution and shot and long term mnemon
CN108062756A (en) * 2018-01-29 2018-05-22 重庆理工大学 Image, semantic dividing method based on the full convolutional network of depth and condition random field

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ABHIJIT GUHA ROY等: "Concurrent Spatial and Channel ‘Squeeze & Excitation" in Fully Convolutional Networks", 《ARXIV:1803.02579V2 [CS.CV]》 *
FARAZ SAEEDAN等: "Detail-Preserving Pooling in Deep Networks", 《ARXIV:1804.04076V1 [》 *
OZAN OKTAY等: "Attention U-Net:Learning Where to Look for the Pancreas", 《ARXIV:1804.03999V1 [CS.CV]》 *
辛鹏等: "全卷积网络多层特征融合的飞机快速检测", 《光学学报》 *

Cited By (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109886082A (en) * 2019-01-03 2019-06-14 南京理工大学 A kind of Small object enhancing prediction module detection method based on SSD
CN109886273A (en) * 2019-02-26 2019-06-14 四川大学华西医院 A kind of CMR classification of image segmentation system
CN109919080A (en) * 2019-03-05 2019-06-21 南京航空航天大学 More full convolutional neural networks of decoder and its corresponding microscopical structure recognition methods
CN109919080B (en) * 2019-03-05 2019-10-11 南京航空航天大学 More full convolutional neural networks of decoder and its corresponding microscopical structure recognition methods
CN110225342A (en) * 2019-04-10 2019-09-10 中国科学技术大学 The bit distribution system and method for Video coding based on semantic distortion metrics
CN110225342B (en) * 2019-04-10 2021-03-09 中国科学技术大学 Video coding bit distribution system and method based on semantic distortion measurement
CN110084319A (en) * 2019-05-07 2019-08-02 上海宝尊电子商务有限公司 Fashion images clothes collar recognition methods and system based on deep neural network
CN110084319B (en) * 2019-05-07 2023-06-30 上海宝尊电子商务有限公司 Fashion image clothing collar type recognition method and system based on deep neural network
CN110147763A (en) * 2019-05-20 2019-08-20 哈尔滨工业大学 Video semanteme dividing method based on convolutional neural networks
CN110147763B (en) * 2019-05-20 2023-02-24 哈尔滨工业大学 Video semantic segmentation method based on convolutional neural network
US11961233B2 (en) 2019-05-27 2024-04-16 Tencent Technology (Shenzhen) Company Limited Method and apparatus for training image segmentation model, computer device, and storage medium
WO2020238734A1 (en) * 2019-05-27 2020-12-03 腾讯科技(深圳)有限公司 Image segmentation model training method and apparatus, computer device, and storage medium
CN110189334A (en) * 2019-05-28 2019-08-30 南京邮电大学 The medical image cutting method of the full convolutional neural networks of residual error type based on attention mechanism
CN110276267A (en) * 2019-05-28 2019-09-24 江苏金海星导航科技有限公司 Method for detecting lane lines based on Spatial-LargeFOV deep learning network
CN110189334B (en) * 2019-05-28 2022-08-09 南京邮电大学 Medical image segmentation method of residual error type full convolution neural network based on attention mechanism
CN110232693A (en) * 2019-06-12 2019-09-13 桂林电子科技大学 A kind of combination thermodynamic chart channel and the image partition method for improving U-Net
CN110263706B (en) * 2019-06-19 2021-07-27 南京邮电大学 Method for detecting and identifying dynamic target of vehicle-mounted video in haze weather
CN110263706A (en) * 2019-06-19 2019-09-20 南京邮电大学 A kind of haze weather Vehicular video Detection dynamic target and know method for distinguishing
CN110363134A (en) * 2019-07-10 2019-10-22 电子科技大学 A kind of face blocked area localization method based on semantic segmentation
CN110363134B (en) * 2019-07-10 2021-06-08 电子科技大学 Human face shielding area positioning method based on semantic segmentation
CN110428009B (en) * 2019-08-02 2020-06-16 南京航空航天大学 Full convolution neural network and corresponding mesoscopic structure identification method
CN110428009A (en) * 2019-08-02 2019-11-08 南京航空航天大学 A kind of full convolutional neural networks and corresponding microscopical structure recognition methods
CN110517272A (en) * 2019-08-29 2019-11-29 电子科技大学 Blood cell segmentation method based on deep learning
CN110517272B (en) * 2019-08-29 2022-03-25 电子科技大学 Deep learning-based blood cell segmentation method
CN110689020A (en) * 2019-10-10 2020-01-14 湖南师范大学 Segmentation method of mineral flotation froth image and electronic equipment
CN110930409A (en) * 2019-10-18 2020-03-27 电子科技大学 Salt body semantic segmentation method based on deep learning and semantic segmentation model
CN110782023B (en) * 2019-11-04 2023-04-07 华南理工大学 Reduction residual module porous convolution architecture network and rapid semantic segmentation method
CN110782023A (en) * 2019-11-04 2020-02-11 华南理工大学 Reduction residual module porous convolution architecture network and rapid semantic segmentation method
CN111091550A (en) * 2019-12-12 2020-05-01 创新奇智(北京)科技有限公司 Multi-size self-adaptive PCB solder paste area detection system and detection method
CN111192248B (en) * 2019-12-30 2023-05-05 山东大学 Multi-task relation learning method for positioning, identifying and segmenting vertebral body in nuclear magnetic resonance imaging
CN111192248A (en) * 2019-12-30 2020-05-22 山东大学 Multi-task relation learning method for positioning, identifying and segmenting vertebral body in nuclear magnetic resonance imaging
CN111209972A (en) * 2020-01-09 2020-05-29 中国科学院计算技术研究所 Image classification method and system based on hybrid connectivity deep convolution neural network
CN111275712A (en) * 2020-01-15 2020-06-12 浙江工业大学 Residual semantic network training method oriented to large-scale image data
CN111259906A (en) * 2020-01-17 2020-06-09 陕西师范大学 Method for generating and resisting remote sensing image target segmentation under condition containing multilevel channel attention
CN111489364B (en) * 2020-04-08 2022-05-03 重庆邮电大学 Medical image segmentation method based on lightweight full convolution neural network
CN111489364A (en) * 2020-04-08 2020-08-04 重庆邮电大学 Medical image segmentation method based on lightweight full convolution neural network
CN111612740A (en) * 2020-04-16 2020-09-01 深圳大学 Pathological image processing method and device
CN111612740B (en) * 2020-04-16 2023-07-25 深圳大学 Pathological image processing method and device
CN111862190B (en) * 2020-07-10 2024-04-05 北京农业生物技术研究中心 Method and device for automatically measuring area of soft rot disease spots of isolated plants
CN111862190A (en) * 2020-07-10 2020-10-30 北京农业生物技术研究中心 Method and device for automatically measuring area of isolated plant soft rot disease spot
CN111860386B (en) * 2020-07-27 2022-04-08 山东大学 Video semantic segmentation method based on ConvLSTM convolutional neural network
CN111860386A (en) * 2020-07-27 2020-10-30 山东大学 Video semantic segmentation method based on ConvLSTM convolutional neural network
CN112364878A (en) * 2020-09-25 2021-02-12 江苏师范大学 Power line classification method based on deep learning under complex background
CN112766279A (en) * 2020-12-31 2021-05-07 中国船舶重工集团公司第七0九研究所 Image feature extraction method based on combined attention mechanism
CN113487524A (en) * 2021-04-07 2021-10-08 北京百度网讯科技有限公司 Image format conversion method, device, equipment, storage medium and program product

Also Published As

Publication number Publication date
CN109101975B (en) 2022-01-25

Similar Documents

Publication Publication Date Title
CN109101975A (en) Image, semantic dividing method based on full convolutional neural networks
CN111259905B (en) Feature fusion remote sensing image semantic segmentation method based on downsampling
CN110458165B (en) Natural scene text detection method introducing attention mechanism
CN104599275B (en) The RGB-D scene understanding methods of imparametrization based on probability graph model
CN110111335A (en) A kind of the urban transportation Scene Semantics dividing method and system of adaptive confrontation study
CN108985269A (en) Converged network driving environment sensor model based on convolution sum cavity convolutional coding structure
CN110427937A (en) A kind of correction of inclination license plate and random length licence plate recognition method based on deep learning
CN110111366A (en) A kind of end-to-end light stream estimation method based on multistage loss amount
CN108288035A (en) The human motion recognition method of multichannel image Fusion Features based on deep learning
CN110414387A (en) A kind of lane line multi-task learning detection method based on lane segmentation
CN113627228B (en) Lane line detection method based on key point regression and multi-scale feature fusion
CN105631415A (en) Video pedestrian recognition method based on convolution neural network
CN108021889A (en) A kind of binary channels infrared behavior recognition methods based on posture shape and movable information
CN112232351B (en) License plate recognition system based on deep neural network
CN113160062B (en) Infrared image target detection method, device, equipment and storage medium
CN109711401A (en) A kind of Method for text detection in natural scene image based on Faster Rcnn
CN111882620A (en) Road drivable area segmentation method based on multi-scale information
CN111161244B (en) Industrial product surface defect detection method based on FCN + FC-WXGboost
CN116051953A (en) Small target detection method based on selectable convolution kernel network and weighted bidirectional feature pyramid
CN110517272A (en) Blood cell segmentation method based on deep learning
CN110555461A (en) scene classification method and system based on multi-structure convolutional neural network feature fusion
CN109886082A (en) A kind of Small object enhancing prediction module detection method based on SSD
CN113743417A (en) Semantic segmentation method and semantic segmentation device
CN113205103A (en) Lightweight tattoo detection method
Zhaosheng et al. Rapid detection of wheat ears in orthophotos from unmanned aerial vehicles in fields based on YOLOX

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220125