CN111259904B - Semantic image segmentation method and system based on deep learning and clustering - Google Patents

Semantic image segmentation method and system based on deep learning and clustering Download PDF

Info

Publication number
CN111259904B
CN111259904B CN202010047315.7A CN202010047315A CN111259904B CN 111259904 B CN111259904 B CN 111259904B CN 202010047315 A CN202010047315 A CN 202010047315A CN 111259904 B CN111259904 B CN 111259904B
Authority
CN
China
Prior art keywords
convolution
clustering
layer
neural network
matrix
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010047315.7A
Other languages
Chinese (zh)
Other versions
CN111259904A (en
Inventor
郭丽
刘知贵
张小乾
白克强
薛旭倩
刘道广
李理
张活力
吴均
付聪
喻琼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southwest University of Science and Technology
Original Assignee
Southwest University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southwest University of Science and Technology filed Critical Southwest University of Science and Technology
Priority to CN202010047315.7A priority Critical patent/CN111259904B/en
Publication of CN111259904A publication Critical patent/CN111259904A/en
Application granted granted Critical
Publication of CN111259904B publication Critical patent/CN111259904B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning

Abstract

The invention discloses a semantic image segmentation method and a semantic image segmentation system based on deep learning and clustering, which comprise the following steps: s1, performing convolution and pooling on an original image through a convolution neural network to obtain a linear characteristic matrix of the original image; s2, performing subspace clustering on the linear feature matrix to obtain clustered feature data; and S3, processing the clustered feature data to the same pixels as the original image through deconvolution and upsampling to obtain a segmented image. The invention combines the Convolutional Neural Network (CNN) in the deep neural network with subspace clustering, and replaces the full connection layer in the CNN with sparse subspace, thereby solving the problems of complicated semantic image segmentation calculation, large data volume and poor information in the prior art. A subspace clustering method is introduced into the neural network, so that a large amount of marking data required by the CNN during working is reduced, and unsupervised learning of the CNN neural network is realized.

Description

Semantic image segmentation method and system based on deep learning and clustering
Technical Field
The invention relates to the field of machine vision, in particular to a semantic image segmentation method and a semantic image segmentation system based on deep learning and clustering.
Background
The purpose of semantic image segmentation is to classify the semantics of each region, i.e. what objects the region is, i.e. to indicate all objects from the image to their respective classes, and to segment them. At present, the image segmentation and the sub-space clustering are realized by adopting a deep learning neural network tool as a common segmentation mode. They have advantages and disadvantages, respectively.
In the deep learning neural network mode, the characteristic data is extracted in an automatic learning mode of a computer, and the characteristic learning is integrated into the model establishing process, so that the imperfection caused by artificial design characteristics is reduced, and the characteristic of better classification performance is achieved.
Subspace clustering is an effective way to realize high-dimensional data clustering, dimensionality reduction is performed in advance, a graph beneficial to segmentation is constructed by often using a subspace clustering method to solve the problem of complex image segmentation, the feature data dimensionality of a complex image region is high, a large number of irrelevant attributes exist, clusters are difficult to form, and the problems of abundant data and poor information exist. However, subspace clustering methods require linear representation between data, realistic image data generally cannot be represented linearly in an input space, and subspace segmentation becomes difficult at this time, kernel-based methods are generally adopted at present, but kernel methods are similar to template-based methods, performance of kernel functions depends on selection of kernel functions to a great extent, kernel function selection is empirical, and nonlinear transformation is ambiguous, and it cannot be determined whether a predefined kernel can generate an implicit feature space suitable for subspace clustering, which brings difficulty in processing a large-scale data set.
Disclosure of Invention
The invention aims to solve the technical problems that in the existing semantic image segmentation technology, the system data volume is large, the computation is complex, the feature data are difficult to form clusters, and the feature data are inconvenient to extract.
The invention is realized by the following technical scheme:
a semantic image segmentation method based on deep learning and clustering is characterized by comprising the following steps:
s1, performing convolution and pooling on an original image through a convolution neural network to obtain a linear characteristic matrix of the original image;
s2, performing subspace clustering on the linear feature matrix to obtain clustered feature data;
and S3, processing the clustered feature data to the same pixels as the original image through deconvolution and upsampling to obtain a segmented image.
Firstly, a deep learning method is adopted to carry out multilayer convolution and pooling on the characteristic data of an original image through a convolution neural network, namely a layered nonlinear transformation process, a linear characteristic matrix of the original image is obtained and is transmitted to a subspace clustering layer through a highest pooling layer, and since the image characteristics after convolution and pooling have linearity, subspace segmentation becomes easy. And obtaining an affinity matrix after reconstructing and thinning the subspace, and clustering by using a spectral clustering mode. The invention replaces the full-connection layer of the convolutional neural network with the subspace clustering layer, thereby reducing the long calculation time and the large calculation difficulty brought by the classification mode of the full-connection layer. And finally, in order to improve the segmentation precision, the invention adopts a deconvolution and upsampling mode in a deep neural network to restore the segmented image to the original pixel and output the original pixel.
Further, the convolutional neural network comprises a plurality of layers of networks from low to high, each layer of network comprises convolution and pooling, and the step S1 comprises the following substeps:
in a lower layer network of the convolutional neural network, performing convolutional operation by using an M convolutional kernel, extracting simple common characteristic data through pooling, performing convolutional operation by using an N convolutional kernel in a higher layer network of the convolutional neural network, and extracting complex common characteristic data through pooling, wherein the number of the M convolutional kernels is less than that of the N convolutional kernels;
the object of each layer of network convolution and pooling of the convolutional neural network is common characteristic data obtained by the previous layer of network, and the object of the lowest layer of network convolution and pooling of the convolutional neural network is an original image;
and forming a linear matrix by using the common characteristic data obtained by the convolution and pooling of the highest layer network of the convolutional neural network, wherein the linear matrix is the linear characteristic matrix of the original image.
The invention is based on deep learning and clustering semantic image segmentation, and a definite mapping is learned to make a subspace more separable. The method comprises the steps of firstly adopting a deep learning method to realize layered nonlinear transformation extraction of feature data through a convolutional neural network, namely sequentially extracting simple common features from a lower layer network through convolution operation with less convolutional kernels, then adopting more convolutional kernels to realize extraction of complex features from a higher layer network, then realizing feature data clustering through a sparse subspace clustering method by using the features extracted from the highest pooling layer, and finally recovering original pixels of the segmented image through a decoding layer formed by the deep neural network and then outputting the image.
Further, the subspace clustering is sparse subspace clustering, and the step S2 includes the following substeps: and performing sparsification processing on the linear characteristic matrix through a subspace to obtain a sparse coefficient matrix, calculating the similarity between all joints of the image according to the sparse coefficient matrix to obtain an affinity matrix, and performing spectral clustering on the affinity matrix to obtain clustered characteristic data. Preferably, the linear feature matrix is partitioned into different subspaces by sparse coefficients. Preferably, the spectral clustering adopts a K-means clustering algorithm.
Further, the number of convolution layers of the deconvolution in step S3 is the same as the number of convolution layers of the convolution in step S1. When the image feature data mapping is restored to be the same as the original input image resolution, the convolution operation can be stopped, and a segmented image is obtained.
In addition, a semantic image segmentation system based on deep learning and clustering comprises:
an input unit: used for inputting the original image;
an encoding unit: the system comprises a convolution neural network, a data processing unit and a data processing unit, wherein the convolution neural network is used for performing convolution and pooling on an original image to obtain a linear characteristic matrix of the original image;
a clustering unit: the linear feature matrix is used for performing subspace clustering on the linear feature matrix to obtain clustered feature data;
a decoding unit: the characteristic data after clustering is processed to the same pixel as the original image through deconvolution and upsampling to obtain a segmented image;
an output unit: for outputting the segmented image.
Compared with the prior art, the invention has the following advantages and beneficial effects:
the convolutional neural network has the advantages that the image data features are extracted, the learning is supervised, meanwhile, the subspace clustering method is introduced into the neural network, the feature learning modules of all layers of the convolutional coding layer can be supervised through the output of spectral clustering, a large amount of marked data required by the convolutional neural network during working is reduced, the unsupervised learning of the convolutional neural network is realized, and the operation of a full connection layer of the convolutional neural network on a large amount of data is avoided. A subspace clustering method is implemented by learning a set of explicit nonlinear mapping functions through a convolutional neural network to map the input to another space, and using the sample representations in the new space to compute an affinity matrix. After the neural network training, the noise of the feature data is reduced, and the robustness of the subspace clustering model to the noise is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention. In the drawings:
FIG. 1 is a schematic view of the present invention.
FIG. 2 is a block diagram of the design of embodiment 2.
FIG. 3 is a fully-connected layer neural network.
Fig. 4 is a schematic diagram of a subspace clustering layer instead of a full connection layer design.
Fig. 5 shows the feature classification results of example 4.
FIG. 6 is a schematic diagram of a fully connected layer classification.
FIG. 7 is a block diagram of the decoding layer design of the convolutional neural network of embodiment 5.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to examples and the accompanying drawings, and the exemplary embodiments and descriptions thereof are only used for explaining the present invention and are not used as limiting the present invention.
Example 1
The embodiment is a semantic image segmentation method based on deep learning and clustering, and the design principle is shown in fig. 1. Firstly, a deep learning method is adopted to realize the hierarchical nonlinear transformation of the feature data of an original image through a coding layer of a Convolutional Neural Network (CNN), the feature data are transmitted to a subspace clustering layer through a highest pooling layer, the feature data clustering is realized through a sparse subspace clustering method, and finally, in order to improve the segmentation precision, a decoding layer formed by the deep neural network is adopted to restore the original pixel of the segmented image and output the restored image.
Example 2
The present embodiment is based on embodiment 1, and elaborates the principle of the deep neural network coding layer. The deep neural network of the coding layer can select neural network architectures such as BP neural network, RNN neural network, CNN and the like. The method utilizes a powerful function of extracting data features from image information by a Convolutional Neural Network (CNN), adopts a Convolutional Neural Network (CNN) architecture at a deep neural network encoder layer, and comprises layers such as a convolutional layer 1-a pooling layer 1-a convolutional layer 2-a pooling layer 2-a convolutional layer 3-a pooling layer 3-a convolutional layer 4-a pooling layer 4 \8230, a convolutional layer n-a pooling layer n and the like, and realizes data feature extraction and information filtering on an input picture through a plurality of convolutional kernels and the like until cleaner data with higher information value is obtained, so that the convolutional operation can be stopped. Compared with the common method that the original data is directly used as input, the design method has the advantages that data are layered and changed in a multi-linear mode, subspace representation errors are reduced, and the robustness of a subspace clustering model to noise is improved. The block diagram of the convolutional neural network coding layer design is shown in fig. 2.
Example 3
In this embodiment, on the basis of embodiment 1, a subspace clustering layer design method is elaborated in detail, and a sparse subspace clustering method is used to replace a full connection layer of a CNN deep neural network. The number of net parameters of the fully-connected layer is very large, even much larger than the parameter quantity of the plurality of convolutional layers, and taking fig. 3 as an example, if the image is pooled into 20 images with 12 × 12 outputs, after the action of the fully-connected first layer of 100 neurons, the whole image has 100 × 20 × 12=288000 parameters, which undoubtedly increases the calculation time and difficulty. The invention avoids the generation of a classification mode by full connection, adopts a sparse subspace clustering method introduced into a deep neural network structure, realizes the classification of characteristic data by utilizing the sparse subspace clustering method on a characteristic matrix output by a front-end top layer convolution layer, and enables the subspace to be more separable, for example, in the design, the image characteristics of the whole picture are divided into k subspaces
Figure BDA0002369892620000041
In addition, the sparse subspace clustering method defines the training target of the front-end neural network: the learning to reconstruct the input data error is minimal. The weight between each node in the full connection layer is the similarity between corresponding nodes, and all the weights form X = XCC is a coefficient matrix, which further constructs an affinity matrix between data points and utilizes a spectral clustering method to realize data clustering. The subspace clustering layer instead of the fully connected layer design schematic is shown in fig. 4. Subspace representation coefficient calculation method:
subspace representation model: min | | C | luminance 1 s.t.X=XC,C ii =0。
Using the alternating bi-directional multiplier method, f (c) = X-XC,
Figure BDA0002369892620000042
a calculation step: 1.
Figure BDA0002369892620000043
2、λ k+1 =λ kk h(C k+1 )
as a result: calculating by subspace representation coefficient to obtain coefficient matrix C
The similarity matrix calculation method comprises the following steps: w = (| C | + | C) T |)/2
The design method of using the subspace clustering layer to replace the full connection layer has the advantages of reducing a large amount of marking data required by CNN during working and realizing unsupervised learning of a neural network.
Example 4
In this embodiment, based on embodiment 3, the process of replacing the full connection layer with the subspace clustering layer is described by taking the example of dividing the class 2 data into 6 sets of feature data matrices. Using a sparse matrix calculation formula: min | | C | non-calculation 1 s.t.X=XC,C ii =0; wherein, X represents the 6 characteristic data matrixes output by the highest pooling layer, C represents the coefficient matrix and is a sparse matrix, as shown in formula (1):
Figure BDA0002369892620000051
pairing the coefficient matrix in equation (1) by using the alternating bi-directional multiplier method
Figure BDA0002369892620000052
The coefficient matrix obtained by the construction has a large number of 0 vectors, such as the matrix C/in the formula (2).
Figure BDA0002369892620000053
The matrix has more coefficients with 0, i.e. the number in the feature can be represented by a small amount of data, and X11 in the formula (1) is represented by the corresponding other data in the 6 groups of data as shown in the formula (3):
Figure BDA0002369892620000061
X11=X12*0+X13*1/2+X14*1/2+X15*0+X16*0
X12=X11*0+X13*0+X14*0+X15*1/4+X16*3/4
………………………
………………………
………………………
X16=X11*0+X12*1/2+X13*0+X14*0+X15*1/2
i.e. by calculation with a small amount of data
Figure BDA0002369892620000062
Figure BDA0002369892620000063
And
Figure BDA0002369892620000064
these 5 sets of feature vectors are classified as shown in FIG. 5
The A and B classes are not required to be known before clustering the characteristic values, the classification can be completed only by knowing the number of the classes, and the classes of each class are not required to be known exactly before the classification like neural network learning.
Example 5
In this embodiment, under the calculation conditions of embodiment 4, a full-connection manner is adopted to implement classification processes on two types of data, namely, a type a and a type B, for 6 groups of characteristic data matrices:
the first step is as follows: need to give A and B labels
The second step is that: establishing a full connection layer and constructing a hidden layer, wherein all vectors in the 6 groups of characteristic values are connected with each neuron of each layer in the hidden layer by using a weight value, and the connection weight value is adjusted repeatedly. As shown in fig. 6.
Example 6
The present embodiment is based on embodiment 1, and details the principle of the deep neural network decoding layer. Data output after an input image is subjected to convolution and pooling operations of a deep neural network encoder layer is compressed into an image with lower pixels, if the input image is directly segmented by using image features of the highest layer, edge details of a segmented target image are fuzzy, because the features of the higher layer of the CNN are more complex and semantic information is richer, but the bottom layer features of the CNN usually contain many common features and contain more information such as edges, positions and the like. Therefore, in order to improve the image segmentation precision, a clear segmented image is output at an output layer, a deep neural network decoding layer is designed at the moment, and the feature map is processed into the size of an original image through an up-sampling and deconvolution operation. The decoding layer is composed of an upper sampling layer 1-a convolution layer n + 1-an upper sampling layer 2-a convolution layer n + 2-an upper sampling layer 3-a convolution layer n +3 \8230andthe like. In order to prevent the over-fitting phenomenon, the number of the convolutional layers in the decoder can be the same as that of the convolutional layers in the coding layer, the convolution operation can be stopped when the feature mapping is restored to be the same as the resolution of the original input image, and finally the segmented image is output at the output layer. The design block diagram of the deep neural network decoding layer is shown in fig. 7.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are merely exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (8)

1. A semantic image segmentation method based on deep learning and clustering is characterized by comprising the following steps:
s1: performing convolution and pooling on the original image through a convolution neural network to obtain a linear characteristic matrix of the original image;
s2: performing subspace clustering on the linear feature matrix, and replacing a full connection layer of the convolutional neural network with a subspace clustering layer to obtain clustered feature data;
s3: processing the clustered feature data to pixels which are the same as the original image through deconvolution and upsampling to obtain a segmented image;
the convolutional neural network comprises a plurality of layers of networks from low to high, each layer of network comprises convolution and pooling, and the step S1 comprises the following substeps:
in a lower-layer network of the convolutional neural network, performing convolution operation by using M convolution kernels, extracting simple common characteristic data through pooling, in a higher-layer network of the convolutional neural network, performing convolution operation by using N convolution kernels, and extracting complex common characteristic data through pooling, wherein the number of the M convolution kernels is less than that of the N convolution kernels;
the object of each layer of network convolution and pooling of the convolutional neural network is common characteristic data obtained by the previous layer of network, and the object of the lowest layer of network convolution and pooling of the convolutional neural network is an original image;
and forming a linear matrix by using the common characteristic data obtained by convolution and pooling of the highest layer network of the convolution neural network, wherein the linear matrix is the linear characteristic matrix of the original image.
2. The deep learning and clustering based semantic image segmentation method according to claim 1, wherein the subspace clustering is a sparse subspace clustering, and the step S2 comprises the following substeps: and performing sparsification processing on the linear characteristic matrix through a subspace to obtain a sparse coefficient matrix, calculating the similarity between all joints of the image according to the sparse coefficient matrix to obtain an affinity matrix, and performing spectral clustering on the affinity matrix to obtain clustered characteristic data.
3. The deep learning and clustering based semantic image segmentation method according to claim 1, characterized in that the linear feature matrix is segmented into different subspaces by sparse coefficients.
4. The deep learning and clustering based semantic image segmentation method according to claim 2, wherein the spectral clustering adopts a K-means clustering algorithm.
5. The semantic image segmentation method based on deep learning and clustering according to claim 1, wherein the number of convolution layers of the deconvolution in step S3 is the same as the number of convolution layers of the convolution in step S1.
6. A semantic image segmentation system based on deep learning and clustering, comprising:
an input unit: used for inputting the original image;
an encoding unit: the system comprises a convolution neural network, a data processing unit and a data processing unit, wherein the convolution neural network is used for performing convolution and pooling on an original image to obtain a linear characteristic matrix of the original image;
a clustering unit: the linear feature matrix is used for carrying out subspace clustering, and a subspace clustering layer is used for replacing a full connection layer of the convolutional neural network to obtain clustered feature data;
a decoding unit: the characteristic data after clustering is processed to the same pixels as the original image through deconvolution and upsampling to obtain a segmented image;
an output unit: for outputting the segmented image;
wherein, the coding unit comprises a plurality of layers of networks from low to high, each layer of network comprises convolution and pooling, and the coding unit comprises:
the lower layer network of the coding unit uses M convolution kernels to perform convolution operation and extracts simple common characteristic data through pooling, the upper layer network of the coding unit uses N convolution kernels to perform convolution operation and extracts complex common characteristic data through pooling, and the number of the M convolution kernels is less than that of the N convolution kernels;
the object of each layer of network convolution and pooling of the coding unit is common characteristic data obtained by a network of the previous layer, and the object of the lowest layer of network convolution and pooling of the coding unit is an original image;
and forming a linear matrix by using the common characteristic data obtained by the convolution and pooling of the highest layer network of the coding unit, wherein the linear matrix is the linear characteristic matrix of the original image.
7. The deep learning and clustering based semantic image segmentation system of claim 6 wherein the clustering unit is a sparse subspace clustering unit configured to: and carrying out sparsification processing on the characteristic matrix through a subspace to obtain an affinity matrix, and carrying out spectral clustering on the affinity matrix to obtain clustered characteristic data.
8. The deep learning and clustering based semantic image segmentation system according to claim 6, wherein the number of convolution layers that the decoding unit deconvolves is the same as the number of convolution layers that the encoding unit convolves.
CN202010047315.7A 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering Active CN111259904B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010047315.7A CN111259904B (en) 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010047315.7A CN111259904B (en) 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering

Publications (2)

Publication Number Publication Date
CN111259904A CN111259904A (en) 2020-06-09
CN111259904B true CN111259904B (en) 2022-12-27

Family

ID=70954191

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010047315.7A Active CN111259904B (en) 2020-01-16 2020-01-16 Semantic image segmentation method and system based on deep learning and clustering

Country Status (1)

Country Link
CN (1) CN111259904B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112164067A (en) * 2020-10-12 2021-01-01 西南科技大学 Medical image segmentation method and device based on multi-mode subspace clustering
CN112784929B (en) * 2021-03-14 2023-03-28 西北工业大学 Small sample image classification method and device based on double-element group expansion
CN113450351B (en) * 2021-08-06 2024-01-30 推想医疗科技股份有限公司 Segmentation model training method, image segmentation method, device, equipment and medium
CN113821661B (en) * 2021-08-30 2024-04-02 上海明略人工智能(集团)有限公司 Image retrieval method, system, storage medium and electronic device
CN113793340B (en) * 2021-08-31 2023-10-13 南京邮电大学 Image segmentation neural network and remote biological imaging method and system
CN116260983A (en) * 2021-12-03 2023-06-13 华为技术有限公司 Image coding and decoding method and device
CN115331039A (en) * 2022-08-15 2022-11-11 广州广电运通金融电子股份有限公司 Automatic image labeling method, device, equipment and medium

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10565713B2 (en) * 2016-11-15 2020-02-18 Samsung Electronics Co., Ltd. Image processing apparatus and method
WO2018125580A1 (en) * 2016-12-30 2018-07-05 Konica Minolta Laboratory U.S.A., Inc. Gland segmentation with deeply-supervised multi-level deconvolution networks
CN107169415B (en) * 2017-04-13 2019-10-11 西安电子科技大学 Human motion recognition method based on convolutional neural networks feature coding
CN108734719A (en) * 2017-04-14 2018-11-02 浙江工商大学 Background automatic division method before a kind of lepidopterous insects image based on full convolutional neural networks
CN107341517B (en) * 2017-07-07 2020-08-11 哈尔滨工业大学 Multi-scale small object detection method based on deep learning inter-level feature fusion
CN108010049A (en) * 2017-11-09 2018-05-08 华南理工大学 Split the method in human hand region in stop-motion animation using full convolutional neural networks
CN108665463A (en) * 2018-03-30 2018-10-16 哈尔滨理工大学 A kind of cervical cell image partition method generating network based on confrontation type
CN108717568B (en) * 2018-05-16 2019-10-22 陕西师范大学 A kind of image characteristics extraction and training method based on Three dimensional convolution neural network
CN109145920A (en) * 2018-08-21 2019-01-04 电子科技大学 A kind of image, semantic dividing method based on deep neural network
CN109784424B (en) * 2019-03-26 2021-02-09 腾讯科技(深圳)有限公司 Image classification model training method, image processing method and device
CN110399840B (en) * 2019-05-22 2024-04-02 西南科技大学 Rapid lawn semantic segmentation and boundary detection method
CN110599461B (en) * 2019-08-21 2023-04-07 东南大学 Thalamus function partitioning method based on subspace feature learning

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Sparse Subspace Clustering: Algorithm, Theory, and Applications;Ehsan Elhamifar等;《IEEE Transactions on Pattern Analysis & Machine Intelligence》;20130920;第35卷(第11期);正文第2767-2768页 *
基于改进全卷积神经网络的玉米叶片病斑分割;王振等;《计算机工程与应用》;20190529;第55卷(第22期);正文第129-130页、图2-3 *
基于稀疏化双线性卷积神经网络的细粒度图像分类;马力等;《模式识别与人工智能》;20190415(第04期);正文第337页、图1 *

Also Published As

Publication number Publication date
CN111259904A (en) 2020-06-09

Similar Documents

Publication Publication Date Title
CN111259904B (en) Semantic image segmentation method and system based on deep learning and clustering
CN107798381B (en) Image identification method based on convolutional neural network
CN110136062B (en) Super-resolution reconstruction method combining semantic segmentation
CN109711426B (en) Pathological image classification device and method based on GAN and transfer learning
CN110826596A (en) Semantic segmentation method based on multi-scale deformable convolution
CN110544297A (en) Three-dimensional model reconstruction method for single image
Zhang et al. Efficient feature learning and multi-size image steganalysis based on CNN
CN113240683B (en) Attention mechanism-based lightweight semantic segmentation model construction method
CN109389667B (en) High-efficiency global illumination drawing method based on deep learning
CN110363068B (en) High-resolution pedestrian image generation method based on multiscale circulation generation type countermeasure network
CN112329801B (en) Convolutional neural network non-local information construction method
Ahmad et al. 3D capsule networks for object classification from 3D model data
CN116863194A (en) Foot ulcer image classification method, system, equipment and medium
CN113222124A (en) SAUNet + + network for image semantic segmentation and image semantic segmentation method
CN115936992A (en) Garbage image super-resolution method and system of lightweight transform
CN117115563A (en) Remote sensing land coverage classification method and system based on regional semantic perception
CN116188435B (en) Medical image depth segmentation method based on fuzzy logic
CN113269235B (en) Assembly body change detection method and device based on unsupervised learning
CN116797456A (en) Image super-resolution reconstruction method, system, device and storage medium
CN112990336B (en) Deep three-dimensional point cloud classification network construction method based on competitive attention fusion
Pandey et al. A conspectus of deep learning techniques for single-image super-resolution
CN113793267B (en) Self-supervision single remote sensing image super-resolution method based on cross-dimension attention mechanism
CN115272696A (en) Point cloud semantic segmentation method based on self-adaptive convolution and local geometric information
CN114972851A (en) Remote sensing image-based ship target intelligent detection method
CN114170460A (en) Multi-mode fusion-based artwork classification method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant