CN111259904A - Semantic image segmentation method and system based on deep learning and clustering - Google Patents

Semantic image segmentation method and system based on deep learning and clustering Download PDF

Info

Publication number
CN111259904A
CN111259904A CN202010047315.7A CN202010047315A CN111259904A CN 111259904 A CN111259904 A CN 111259904A CN 202010047315 A CN202010047315 A CN 202010047315A CN 111259904 A CN111259904 A CN 111259904A
Authority
CN
China
Prior art keywords
clustering
convolution
matrix
neural network
pooling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010047315.7A
Other languages
Chinese (zh)
Other versions
CN111259904B (en
Inventor
郭丽
刘知贵
张小乾
白克强
薛旭倩
刘道广
李理
张活力
吴均
付聪
喻琼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southwest University of Science and Technology
Original Assignee
Southwest University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southwest University of Science and Technology filed Critical Southwest University of Science and Technology
Priority to CN202010047315.7A priority Critical patent/CN111259904B/en
Publication of CN111259904A publication Critical patent/CN111259904A/en
Application granted granted Critical
Publication of CN111259904B publication Critical patent/CN111259904B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a semantic image segmentation method and a semantic image segmentation system based on deep learning and clustering, which comprise the following steps: s1, performing convolution and pooling on the original image through a convolution neural network to obtain a linear characteristic matrix of the original image; s2, performing subspace clustering on the linear feature matrix to obtain clustered feature data; and S3, processing the clustered feature data to the same pixels as the original image through deconvolution and upsampling to obtain a segmented image. The invention combines the Convolutional Neural Network (CNN) in the deep neural network with subspace clustering, and replaces the full connection layer in the CNN with sparse subspace, thereby solving the problems of complicated semantic image segmentation calculation, large data volume and poor information in the prior art. A subspace clustering method is introduced into the neural network, so that a large amount of marking data required by the CNN during working is reduced, and unsupervised learning of the CNN neural network is realized.

Description

Semantic image segmentation method and system based on deep learning and clustering
Technical Field
The invention relates to the field of machine vision, in particular to a semantic image segmentation method and a semantic image segmentation system based on deep learning and clustering.
Background
The purpose of semantic image segmentation is to classify the semantics of each region, i.e. what objects the region is, i.e. to indicate all objects from the image to their respective classes, and to segment them. At present, the image segmentation and the sub-space clustering are realized by adopting a deep learning neural network tool as a common segmentation mode. They have advantages and disadvantages, respectively.
In the deep learning neural network mode, the characteristic data is extracted in an automatic learning mode of a computer, and the characteristic learning is integrated into the model establishing process, so that the imperfection caused by artificial design characteristics is reduced, and the characteristic of better classification performance is achieved.
Subspace clustering is an effective way to realize high-dimensional data clustering, dimensionality reduction is performed in advance, a graph beneficial to segmentation is constructed by often using a subspace clustering method to solve the problem of complex image segmentation, the feature data dimensionality of a complex image region is high, a large number of irrelevant attributes exist, clusters are difficult to form, and the problems of abundant data and poor information exist. However, subspace clustering methods require linear representation between data, realistic image data generally cannot be represented linearly in an input space, and subspace segmentation becomes difficult at this time, kernel-based methods are generally adopted at present, but kernel methods are similar to template-based methods, performance of kernel functions depends on selection of kernel functions to a great extent, kernel function selection is empirical, and nonlinear transformation is ambiguous, and it cannot be determined whether a predefined kernel can generate an implicit feature space suitable for subspace clustering, which brings difficulty in processing a large-scale data set.
Disclosure of Invention
The invention aims to solve the technical problems that in the existing semantic image segmentation technology, the system data volume is large, the computation is complex, the feature data are difficult to form clusters, and the feature data are inconvenient to extract.
The invention is realized by the following technical scheme:
a semantic image segmentation method based on deep learning and clustering is characterized by comprising the following steps:
s1, performing convolution and pooling on the original image through a convolution neural network to obtain a linear characteristic matrix of the original image;
s2, performing subspace clustering on the linear feature matrix to obtain clustered feature data;
and S3, processing the clustered feature data to the same pixels as the original image through deconvolution and upsampling to obtain a segmented image.
Firstly, a deep learning method is adopted to carry out multilayer convolution and pooling on the characteristic data of an original image through a convolution neural network, namely a layered nonlinear transformation process, a linear characteristic matrix of the original image is obtained and is transmitted to a subspace clustering layer through a highest pooling layer, and since the image characteristics after convolution and pooling have linearity, subspace segmentation becomes easy. And obtaining an affinity matrix after reconstructing and thinning the subspace, and clustering by using a spectral clustering mode. The invention replaces the full-connection layer of the convolutional neural network with the subspace clustering layer, thereby reducing the long calculation time and the large calculation difficulty brought by the classification mode of the full-connection layer. And finally, in order to improve the segmentation precision, the invention adopts a deconvolution and upsampling mode in a deep neural network to restore the segmented image to the original pixel and output the original pixel.
Further, the convolutional neural network comprises a plurality of layers of networks from low to high, each layer of network comprises convolution and pooling, and the step S1 comprises the following sub-steps:
in a lower layer network of the convolutional neural network, performing convolutional operation by using an M convolutional kernel, extracting simple common characteristic data through pooling, performing convolutional operation by using an N convolutional kernel in a higher layer network of the convolutional neural network, and extracting complex common characteristic data through pooling, wherein the number of the M convolutional kernels is less than that of the N convolutional kernels;
the object of each layer of network convolution and pooling of the convolutional neural network is common characteristic data obtained by the previous layer of network, and the object of the lowest layer of network convolution and pooling of the convolutional neural network is an original image;
and forming a linear matrix by using the common characteristic data obtained by the convolution and pooling of the highest layer network of the convolutional neural network, wherein the linear matrix is the linear characteristic matrix of the original image.
The invention is based on deep learning and clustering semantic image segmentation, and a definite mapping is learned to make a subspace more separable. The method comprises the steps of firstly adopting a deep learning method to realize layered nonlinear transformation extraction of feature data through a convolutional neural network, namely sequentially extracting simple common features from a lower layer network through convolution operation with less convolutional kernels, then adopting more convolutional kernels to realize extraction of complex features from a higher layer network, then realizing feature data clustering through a sparse subspace clustering method by using the features extracted from the highest pooling layer, and finally recovering original pixels of the segmented image through a decoding layer formed by the deep neural network and then outputting the image.
Further, the subspace clustering is sparse subspace clustering, and the step S2 includes the following sub-steps: and performing sparsification processing on the linear characteristic matrix through a subspace to obtain a sparse coefficient matrix, calculating the similarity between all joints of the image according to the sparse coefficient matrix to obtain an affinity matrix, and performing spectral clustering on the affinity matrix to obtain clustered characteristic data. Preferably, the linear feature matrix is partitioned into different subspaces by sparse coefficients. Preferably, the spectral clustering adopts a K-means clustering algorithm.
Further, the number of convolution layers for the deconvolution in step S3 is the same as the number of convolution layers for the convolution in step S1. When the image feature data mapping is restored to be the same as the original input image resolution, the convolution operation can be stopped, and a segmented image is obtained.
In addition, a semantic image segmentation system based on deep learning and clustering comprises:
an input unit: used for inputting the original image;
an encoding unit: the system comprises a convolution neural network, a characteristic matrix and a characteristic matrix, wherein the convolution neural network is used for performing convolution and pooling on an original image to obtain a linear characteristic matrix of the original image;
a clustering unit: the linear feature matrix is used for performing subspace clustering on the linear feature matrix to obtain clustered feature data;
a decoding unit: the characteristic data after clustering is processed to the same pixel as the original image through deconvolution and upsampling to obtain a segmented image;
an output unit: for outputting the segmented image.
Compared with the prior art, the invention has the following advantages and beneficial effects:
the convolutional neural network has the advantages that the image data features are extracted, the learning is supervised, meanwhile, the subspace clustering method is introduced into the neural network, the feature learning modules of all layers of the convolutional coding layer can be supervised through the output of spectral clustering, a large amount of labeled data needed by the convolutional neural network during working is reduced, the unsupervised learning of the convolutional neural network is realized, and the operation of a full connection layer of the convolutional neural network on a large amount of data is avoided. A subspace clustering method is implemented by learning a set of explicit nonlinear mapping functions through a convolutional neural network to map the input to another space, and using the sample representations in the new space to compute an affinity matrix. After the neural network training, the noise of the feature data is reduced, and the robustness of the subspace clustering model to the noise is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention. In the drawings:
FIG. 1 is a schematic view of the present invention.
Fig. 2 is a design block diagram of embodiment 2.
FIG. 3 is a fully-connected layer neural network.
Fig. 4 is a schematic diagram of a subspace clustering layer instead of a fully connected layer design.
Fig. 5 shows the feature classification results of example 4.
FIG. 6 is a schematic diagram of a fully connected layer classification.
FIG. 7 is a block diagram of the decoding layer design of the convolutional neural network of embodiment 5.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to examples and accompanying drawings, and the exemplary embodiments and descriptions thereof are only used for explaining the present invention and are not meant to limit the present invention.
Example 1
The embodiment is a semantic image segmentation method based on deep learning and clustering, and the design principle is shown in fig. 1. Firstly, a deep learning method is adopted to realize the hierarchical nonlinear transformation of the feature data of an original image through a coding layer of a Convolutional Neural Network (CNN), the feature data are transmitted to a subspace clustering layer through a highest pooling layer, the feature data clustering is realized through a sparse subspace clustering method, and finally, in order to improve the segmentation precision, a decoding layer formed by the deep neural network is adopted to restore the original pixel of the segmented image and output the restored image.
Example 2
The present embodiment is based on embodiment 1, and elaborates the principle of the deep neural network coding layer. The deep neural network of the coding layer can select neural network architectures such as BP neural network, RNN neural network, CNN and the like. The invention utilizes a powerful function of extracting data characteristics from image information by a Convolutional Neural Network (CNN), adopts a Convolutional Neural Network (CNN) architecture at a deep neural network encoder layer, and consists of convolutional layers 1-a pooling layer 1-a convolutional layer 2-a pooling layer 2-a convolutional layer 3-a pooling layer 3-a convolutional layer 4-a pooling layer 4 …. Compared with the common method that the original data is directly used as input, the design method has the advantages that data are layered and changed in a multi-linear mode, subspace representation errors are reduced, and the robustness of a subspace clustering model to noise is improved. The block diagram of the convolutional neural network coding layer design is shown in fig. 2.
Example 3
In this embodiment, on the basis of embodiment 1, a subspace clustering layer design method is elaborated, and a sparse subspace clustering method is used to replace a full connection layer of the CNN deep neural network. The number of net parameters of the fully connected layer is very large, even much larger than the parameter amount of the plurality of convolutional layers, and taking fig. 3 as an example, if the image is pooled into 20 images with 12 × 12 outputs, after the action of the fully connected first layer of 100 neurons, the whole image has 100 × 20 × 12 — 288000 parameters, which increases the calculation time and difficulty. The invention avoids the generation of a classification mode by full connection, adopts a sparse subspace clustering method introduced into a deep neural network structure, and realizes the classification of characteristic data by utilizing the sparse subspace clustering method on the characteristic matrix output by the front-end top layer convolution layerClass, make the subspace more separable, e.g., divide the image features of the entire picture into k subspaces in this design
Figure BDA0002369892620000041
In addition, the sparse subspace clustering method defines the training target of the front-end neural network: the learning to reconstruct the input data error is minimal. And the weights among all nodes in the full-connection layer are the similarity among the corresponding nodes, all the weights form a coefficient matrix with X being XC, the coefficient matrix further constructs an affinity matrix among data points, and the data clustering is realized by utilizing a spectral clustering method. The subspace clustering layer instead of the fully connected layer design schematic is shown in fig. 4. Subspace representation coefficient calculation method:
subspace representation model: min | | C | luminance1s.t.X=XC,Cii=0。
Using the alternating bi-directional multiplier method f (c) ═ X-XC,
Figure BDA0002369892620000042
a calculation step: 1.
Figure BDA0002369892620000043
2、λk+1=λkkh(Ck+1)
as a result: calculating by subspace representation coefficient to obtain coefficient matrix C
The similarity matrix calculation method comprises the following steps: w ═ C | + | CT|)/2
The design method of replacing the full connection layer by the subspace clustering layer has the advantages of reducing a large amount of marking data required by the CNN during working and realizing unsupervised learning of the neural network.
Example 4
In this embodiment, based on embodiment 3, the process of replacing the full connection layer with the subspace clustering layer is described by taking an example of dividing the 2-class data into 6 groups of feature data matrices. Using a sparse matrix calculation formula: min | | C | luminance1s.t.X=XC,Cii0; whereinX represents a 6 characteristic data matrix output by the highest pooling layer, C represents a coefficient matrix and is a sparse matrix, and formula (1) is shown as follows:
Figure BDA0002369892620000051
pairing the coefficient matrix in equation (1) by using the alternating bi-directional multiplier method
Figure BDA0002369892620000052
The coefficient matrix obtained by the construction has a large number of 0 vectors, such as the matrix C/in the formula (2).
Figure BDA0002369892620000053
The matrix has more coefficients with 0, i.e. the number in the feature can be represented by a small amount of data, and X11 in formula (1) is represented by the corresponding other data in the 6 groups of data as shown in formula (3):
Figure BDA0002369892620000061
X11=X12*0+X13*1/2+X14*1/2+X15*0+X16*0
X12=X11*0+X13*0+X14*0+X15*1/4+X16*3/4
………………………..
………………………..
………………………..
X16=X11*0+X12*1/2+X13*0+X14*0+X15*1/2
i.e. by calculation with a small amount of data
Figure BDA0002369892620000062
Figure BDA0002369892620000063
And
Figure BDA0002369892620000064
these 5 sets of feature vectors are classified intoShown in FIG. 5
The A and B classes are not required to be known before clustering the characteristic values, the classification can be completed only by knowing the number of the classes, and the classes of each class are not required to be known exactly before the classification like neural network learning.
Example 5
In this embodiment, under the calculation condition of embodiment 4, a full-connection manner is adopted to implement two types of data classification processes of A, B on 6 groups of feature data matrices:
the first step is as follows: it is necessary to give A, B label
The second step is that: establishing a full connection layer and constructing a hidden layer, wherein all vectors in the 6 groups of characteristic values are connected with each neuron of each layer in the hidden layer by using a weight value, and the connection weight value is adjusted repeatedly. As shown in fig. 6.
Example 6
The present embodiment is based on embodiment 1, and details the principle of the deep neural network decoding layer. Data output after an input image is subjected to convolution and pooling operations of a deep neural network encoder layer is compressed into an image with lower pixels, if the input image is directly segmented by using image features of the highest layer, edge details of a segmented target image are fuzzy, because the features of the higher layer of the CNN are more complex and semantic information is richer, but the bottom layer features of the CNN usually contain many common features and contain more information such as edges, positions and the like. Therefore, in order to improve the image segmentation precision, a clear segmented image is output at an output layer, a deep neural network decoding layer is designed at the moment, and the feature map is processed into the size of an original image through an up-sampling and deconvolution operation. The decoding layer is specifically composed of an up-sampling layer 1-a convolution layer n + 1-an up-sampling layer 2-a convolution layer n + 2-an up-sampling layer 3-a convolution layer n +3 …. In order to prevent the over-fitting phenomenon, the number of the convolutional layers in the decoder can be the same as that of the convolutional layers in the coding layer, the convolution operation can be stopped when the feature mapping is restored to be the same as the resolution of the original input image, and finally the segmented image is output at the output layer. The design block diagram of the decoding layer of the deep neural network is shown in FIG. 7.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are merely exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. A semantic image segmentation method based on deep learning and clustering is characterized by comprising the following steps:
s1, performing convolution and pooling on the original image through a convolution neural network to obtain a linear characteristic matrix of the original image;
s2, performing subspace clustering on the linear feature matrix to obtain clustered feature data;
and S3, processing the clustered feature data to the same pixels as the original image through deconvolution and upsampling to obtain a segmented image.
2. The deep learning and clustering based semantic image segmentation method according to claim 1, wherein the convolutional neural network comprises a plurality of layers from low to high, each layer of the layers comprising convolution and pooling, and the step S1 comprises the following sub-steps:
in a lower layer network of the convolutional neural network, performing convolutional operation by using an M convolutional kernel, extracting simple common characteristic data through pooling, performing convolutional operation by using an N convolutional kernel in a higher layer network of the convolutional neural network, and extracting complex common characteristic data through pooling, wherein the number of the M convolutional kernels is less than that of the N convolutional kernels;
the object of each layer of network convolution and pooling of the convolutional neural network is common characteristic data obtained by the previous layer of network, and the object of the lowest layer of network convolution and pooling of the convolutional neural network is an original image;
and forming a linear matrix by using the common characteristic data obtained by the convolution and pooling of the highest layer network of the convolutional neural network, wherein the linear matrix is the linear characteristic matrix of the original image.
3. The deep learning and clustering based semantic image segmentation method according to claim 1, wherein the subspace clustering is sparse subspace clustering, and the step S2 comprises the following substeps: and performing sparsification processing on the linear characteristic matrix through a subspace to obtain a sparse coefficient matrix, calculating the similarity between all joints of the image according to the sparse coefficient matrix to obtain an affinity matrix, and performing spectral clustering on the affinity matrix to obtain clustered characteristic data.
4. The deep learning and clustering based semantic image segmentation method according to claim 3, wherein the linear feature matrix is segmented into different subspaces by sparse coefficients.
5. The deep learning and clustering based semantic image segmentation method according to claim 3, wherein the spectral clustering adopts a K-means clustering algorithm.
6. The method for semantic image segmentation based on deep learning and clustering according to claim 1, wherein the number of convolution layers of the deconvolution in step S3 is the same as the number of convolution layers of the convolution in step S1.
7. A semantic image segmentation system based on deep learning and clustering, comprising:
an input unit: used for inputting the original image;
an encoding unit: the system comprises a convolution neural network, a characteristic matrix and a characteristic matrix, wherein the convolution neural network is used for performing convolution and pooling on an original image to obtain a linear characteristic matrix of the original image;
a clustering unit: the linear feature matrix is used for performing subspace clustering on the linear feature matrix to obtain clustered feature data;
a decoding unit: the characteristic data after clustering is processed to the same pixel as the original image through deconvolution and upsampling to obtain a segmented image;
an output unit: for outputting the segmented image.
8. The deep learning and clustering based semantic image segmentation system according to claim 7 wherein the coding unit comprises a plurality of layers of networks from low to high, each layer of networks comprising convolution and pooling, comprising:
the lower layer network of the coding unit uses M convolution kernels to perform convolution operation and extracts simple common characteristic data through pooling, the upper layer network of the coding unit uses N convolution kernels to perform convolution operation and extracts complex common characteristic data through pooling, and the number of the M convolution kernels is less than that of the N convolution kernels;
the object of each layer of network convolution and pooling of the coding unit is common characteristic data obtained by a network of the previous layer, and the object of the lowest layer of network convolution and pooling of the coding unit is an original image;
and forming a linear matrix by the common characteristic data obtained by the convolution and pooling of the highest layer network of the coding unit, wherein the linear matrix is the linear characteristic matrix of the original image.
9. The deep learning and clustering based semantic image segmentation system of claim 7 wherein the clustering unit is a sparse subspace clustering unit configured to: and carrying out sparsification treatment on the characteristic matrix through a subspace to obtain an affinity matrix, and carrying out spectral clustering on the affinity matrix to obtain clustered characteristic data.
10. The deep learning and clustering based semantic image segmentation system according to claim 7, wherein the number of convolution layers that the decoding unit deconvolves is the same as the number of convolution layers that the encoding unit convolves.
CN202010047315.7A 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering Active CN111259904B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010047315.7A CN111259904B (en) 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010047315.7A CN111259904B (en) 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering

Publications (2)

Publication Number Publication Date
CN111259904A true CN111259904A (en) 2020-06-09
CN111259904B CN111259904B (en) 2022-12-27

Family

ID=70954191

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010047315.7A Active CN111259904B (en) 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering

Country Status (1)

Country Link
CN (1) CN111259904B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112164067A (en) * 2020-10-12 2021-01-01 西南科技大学 Medical image segmentation method and device based on multi-mode subspace clustering
CN112784929A (en) * 2021-03-14 2021-05-11 西北工业大学 Small sample image classification method and device based on double-element group expansion
CN113450351A (en) * 2021-08-06 2021-09-28 推想医疗科技股份有限公司 Segmentation model training method, image segmentation method, device, equipment and medium
CN113793340A (en) * 2021-08-31 2021-12-14 南京邮电大学 Image segmentation neural network and remote biological imaging method and system
CN113821661A (en) * 2021-08-30 2021-12-21 上海明略人工智能(集团)有限公司 Image retrieval method, system, storage medium and electronic device
WO2023098688A1 (en) * 2021-12-03 2023-06-08 华为技术有限公司 Image encoding and decoding method and device
WO2024036758A1 (en) * 2022-08-15 2024-02-22 广州广电运通金融电子股份有限公司 Automatic image annotation method and apparatus, device, and medium

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107169415A (en) * 2017-04-13 2017-09-15 西安电子科技大学 Human motion recognition method based on convolutional neural networks feature coding
CN107341517A (en) * 2017-07-07 2017-11-10 哈尔滨工业大学 The multiple dimensioned wisp detection method of Fusion Features between a kind of level based on deep learning
CN108010049A (en) * 2017-11-09 2018-05-08 华南理工大学 Split the method in human hand region in stop-motion animation using full convolutional neural networks
US20180137630A1 (en) * 2016-11-15 2018-05-17 Samsung Electronics Co., Ltd. Image processing apparatus and method
CN108665463A (en) * 2018-03-30 2018-10-16 哈尔滨理工大学 A kind of cervical cell image partition method generating network based on confrontation type
CN108717568A (en) * 2018-05-16 2018-10-30 陕西师范大学 A kind of image characteristics extraction and training method based on Three dimensional convolution neural network
CN108734719A (en) * 2017-04-14 2018-11-02 浙江工商大学 Background automatic division method before a kind of lepidopterous insects image based on full convolutional neural networks
CN109145920A (en) * 2018-08-21 2019-01-04 电子科技大学 A kind of image, semantic dividing method based on deep neural network
CN109784424A (en) * 2019-03-26 2019-05-21 腾讯科技(深圳)有限公司 A kind of method of image classification model training, the method and device of image procossing
US20190205758A1 (en) * 2016-12-30 2019-07-04 Konica Minolta Laboratory U.S.A., Inc. Gland segmentation with deeply-supervised multi-level deconvolution networks
CN110399840A (en) * 2019-05-22 2019-11-01 西南科技大学 A kind of quick lawn semantic segmentation and boundary detection method
CN110599461A (en) * 2019-08-21 2019-12-20 东南大学 Thalamus function partitioning method based on subspace feature learning

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180137630A1 (en) * 2016-11-15 2018-05-17 Samsung Electronics Co., Ltd. Image processing apparatus and method
US20190205758A1 (en) * 2016-12-30 2019-07-04 Konica Minolta Laboratory U.S.A., Inc. Gland segmentation with deeply-supervised multi-level deconvolution networks
CN107169415A (en) * 2017-04-13 2017-09-15 西安电子科技大学 Human motion recognition method based on convolutional neural networks feature coding
CN108734719A (en) * 2017-04-14 2018-11-02 浙江工商大学 Background automatic division method before a kind of lepidopterous insects image based on full convolutional neural networks
CN107341517A (en) * 2017-07-07 2017-11-10 哈尔滨工业大学 The multiple dimensioned wisp detection method of Fusion Features between a kind of level based on deep learning
CN108010049A (en) * 2017-11-09 2018-05-08 华南理工大学 Split the method in human hand region in stop-motion animation using full convolutional neural networks
CN108665463A (en) * 2018-03-30 2018-10-16 哈尔滨理工大学 A kind of cervical cell image partition method generating network based on confrontation type
CN108717568A (en) * 2018-05-16 2018-10-30 陕西师范大学 A kind of image characteristics extraction and training method based on Three dimensional convolution neural network
CN109145920A (en) * 2018-08-21 2019-01-04 电子科技大学 A kind of image, semantic dividing method based on deep neural network
CN109784424A (en) * 2019-03-26 2019-05-21 腾讯科技(深圳)有限公司 A kind of method of image classification model training, the method and device of image procossing
CN110399840A (en) * 2019-05-22 2019-11-01 西南科技大学 A kind of quick lawn semantic segmentation and boundary detection method
CN110599461A (en) * 2019-08-21 2019-12-20 东南大学 Thalamus function partitioning method based on subspace feature learning

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
EHSAN ELHAMIFAR等: "Sparse Subspace Clustering: Algorithm, Theory, and Applications", 《IEEE TRANSACTIONS ON PATTERN ANALYSIS & MACHINE INTELLIGENCE》 *
QILINLI等: "Affinity learning via a diffusion process for subspace clustering", 《PATTERN RECOGNITION》 *
SHUAI YANG等: "Residual Encoder-Decoder Network for Deep Subspace Clustering", 《COMPUTER SCIENCE-MACHINE LEARNING》 *
刘杰平等: "基于视觉词典的深度图生成算法", 《光学学报》 *
王振等: "基于改进全卷积神经网络的玉米叶片病斑分割", 《计算机工程与应用》 *
舒大松: "基于LVQ神经网络的条形码识别研究", 《邵阳学院学报(自然科学版)》 *
赵欣欣等: "基于卷积神经网络的铁路桥梁高强螺栓缺失图像识别方法", 《中国铁道科学》 *
马力等: "基于稀疏化双线性卷积神经网络的细粒度图像分类", 《模式识别与人工智能》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112164067A (en) * 2020-10-12 2021-01-01 西南科技大学 Medical image segmentation method and device based on multi-mode subspace clustering
CN112784929A (en) * 2021-03-14 2021-05-11 西北工业大学 Small sample image classification method and device based on double-element group expansion
CN112784929B (en) * 2021-03-14 2023-03-28 西北工业大学 Small sample image classification method and device based on double-element group expansion
CN113450351A (en) * 2021-08-06 2021-09-28 推想医疗科技股份有限公司 Segmentation model training method, image segmentation method, device, equipment and medium
CN113450351B (en) * 2021-08-06 2024-01-30 推想医疗科技股份有限公司 Segmentation model training method, image segmentation method, device, equipment and medium
CN113821661A (en) * 2021-08-30 2021-12-21 上海明略人工智能(集团)有限公司 Image retrieval method, system, storage medium and electronic device
CN113821661B (en) * 2021-08-30 2024-04-02 上海明略人工智能(集团)有限公司 Image retrieval method, system, storage medium and electronic device
CN113793340A (en) * 2021-08-31 2021-12-14 南京邮电大学 Image segmentation neural network and remote biological imaging method and system
CN113793340B (en) * 2021-08-31 2023-10-13 南京邮电大学 Image segmentation neural network and remote biological imaging method and system
WO2023098688A1 (en) * 2021-12-03 2023-06-08 华为技术有限公司 Image encoding and decoding method and device
WO2024036758A1 (en) * 2022-08-15 2024-02-22 广州广电运通金融电子股份有限公司 Automatic image annotation method and apparatus, device, and medium

Also Published As

Publication number Publication date
CN111259904B (en) 2022-12-27

Similar Documents

Publication Publication Date Title
CN111259904B (en) Semantic image segmentation method and system based on deep learning and clustering
CN110826596A (en) Semantic segmentation method based on multi-scale deformable convolution
CN113240683B (en) Attention mechanism-based lightweight semantic segmentation model construction method
CN109389667B (en) High-efficiency global illumination drawing method based on deep learning
Zhang et al. Learning 3D faces from 2D images via stacked contractive autoencoder
CN110363068A (en) A kind of high-resolution pedestrian image generation method based on multiple dimensioned circulation production confrontation network
CN111080591A (en) Medical image segmentation method based on combination of coding and decoding structure and residual error module
CN116168197A (en) Image segmentation method based on Transformer segmentation network and regularization training
CN116863194A (en) Foot ulcer image classification method, system, equipment and medium
CN115936992A (en) Garbage image super-resolution method and system of lightweight transform
Dong et al. Research on image classification based on capsnet
CN114780767A (en) Large-scale image retrieval method and system based on deep convolutional neural network
CN111860668B (en) Point cloud identification method for depth convolution network of original 3D point cloud processing
Dong et al. Occlusion expression recognition based on non-convex low-rank double dictionaries and occlusion error model
CN113269235B (en) Assembly body change detection method and device based on unsupervised learning
CN112990336B (en) Deep three-dimensional point cloud classification network construction method based on competitive attention fusion
CN115272696A (en) Point cloud semantic segmentation method based on self-adaptive convolution and local geometric information
Pandey et al. A conspectus of deep learning techniques for single-image super-resolution
CN114972851A (en) Remote sensing image-based ship target intelligent detection method
Islam et al. Class aware auto encoders for better feature extraction
TWI722383B (en) Pre feature extraction method applied on deep learning
Pu et al. Convolutional Channel Attentional Facial Expression Recognition Network and Its Application in Human–Computer Interaction
CN113111848A (en) Human body image analysis method based on multi-scale features
CN115496910B (en) Point cloud semantic segmentation method based on full-connected graph coding and double-expansion residual error
CN112116062A (en) Multilayer perceptron nonlinear compression method based on tensor string decomposition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant