CN106709568A - RGB-D image object detection and semantic segmentation method based on deep convolution network - Google Patents

RGB-D image object detection and semantic segmentation method based on deep convolution network Download PDF

Info

Publication number
CN106709568A
CN106709568A CN201611168200.3A CN201611168200A CN106709568A CN 106709568 A CN106709568 A CN 106709568A CN 201611168200 A CN201611168200 A CN 201611168200A CN 106709568 A CN106709568 A CN 106709568A
Authority
CN
China
Prior art keywords
frame
network
training
rcnn
fcn
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611168200.3A
Other languages
Chinese (zh)
Other versions
CN106709568B (en
Inventor
刘波
邓广晖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Xiaofeng Technology Co ltd
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN201611168200.3A priority Critical patent/CN106709568B/en
Publication of CN106709568A publication Critical patent/CN106709568A/en
Application granted granted Critical
Publication of CN106709568B publication Critical patent/CN106709568B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses an RGB-D image object detection and semantic segmentation method based on a deep convolution network, which belongs to the field of depth learning and machine vision. According to the method provided by the technical scheme of the invention, Faster-RCNN is used to replace the original slow RCNN; Faster-RCNN uses GPU, which is fast in the aspect of feature extracting, and at the same time generates a regional scheme in the network; the whole training process is training from end to end; FCN is used to carry out RGB-D image semantic segmentation; FCN uses a GPU and the deep convolution network to rapidly extract the deep features of an image; deconvolution is used to fuse deep features and shallow features of the image convolution; and the local semantic information of the image is integrated into the global semantic information.

Description

The object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network
Technical field
The invention belongs to deep learning and field of machine vision, more particularly to a kind of object detection comprising RGB-D images With semantic segmentation method, this has application widely in reality scene, for example, pedestrian is examined in monitor video Survey and tracking, Navigation of Pilotless Aircraft, automatic Pilot etc..
Background technology
Object detection and semantic segmentation are two important research fields of computer vision, object detection mainly for detection of The position of objects in images and the classification of object, the main of object detection have two tasks, and one is the region side for finding out object Case (Region Proposals), zone scheme is a kind of pre-selection frame, represents object approximate location in the picture; Two is that the object in pre-selection frame is classified.The problem that semantic segmentation is solved is that each pixel of image is assigned to just True label, semantic segmentation is mainly used in scene and understands and there are many potential applications.With deep layer convolutional Neural net The rise of network, the object detection based on deep layer convolutional neural networks has turned into detection algorithm presently the most prevailing, is equally based on The semantic segmentation of (Region Proposals) has turned into semantic segmentation algorithm presently the most prevailing.
First, the method for producing Region Proposals relatively more prevailing is as follows.
The method of traditional generation Region Proposals has many kinds, such as selective search (Selective Search different colours feature (such as hsv color space, Lab color spaces etc.)), according to image merges super-pixel (has phase Like the adjacent pixel blocks of feature), image under cpu model using selective search produce Region Proposals when Between be 2s.Multiscale combination is grouped (MCG), it is necessary to merge super-pixel generating region according to contour feature under various zoom scale The two dimensional characters such as candidate scheme, area, girth, boundary intensity then according to zone scheme carry out ranking.Edge frame (EdgeBoxes), using the method generating region scheme of sliding window, using marginal information (profile number in frame and with The profile number that frame edge is overlapped), Region Proposals are ranked up.The above method is all in cpu model Under carry out.Zone scheme network (Region Proposals Network, abbreviation RPN) can utilize deep layer under GPU patterns Convolutional neural networks extract the Region Proposals produced while characteristics of image.This causes that the speed of object detection is obtained Large increase.
2nd, the region deep layer convolutional neural networks for being quickly used for object detection are as follows.
With quick region deep layer convolutional neural networks significantly improving in the speed and accuracy rate of object detection, with After occur in that many deep layer convolutional neural networks faster, such as Faster-RCNN is made up of two networks, and one is RPN, is used In Region Proposals are produced, one is quick region deep layer convolutional neural networks, for object identification;YOLO is thing The selection of body frame is combined with identification, is synchronously completed by primary network, but the area that YOLO is produced to piece image Domain scheme only has 98, causes the accuracy of object frame than relatively low;SSD is that acquiescence side is produced on each layer of characteristic image Frame, advantage is the input picture for low resolution, can also produce the frame of pinpoint accuracy, has the disadvantage detecting system to frame Size is very sensitive, for wisp, detects poor-performing;R-FCN is a kind of object detection network based on FCN, network house Grader layer is abandoned, full articulamentum has been changed into convolutional layer, core network selection ResNet-101, and propose a kind of to thing The sensitive mapping method in body position solves the translation changeability of object.
3rd, semantic segmentation network is as follows.
From convolutional neural networks, it replaces full articulamentum using convolutional layer for full convolutional network reorganization.In order to realize image Semantic segmentation, the method that FCN is used is that one or more deconvolution operation is performed to further feature image so that further feature figure As size as original image size, each pixel is classified using Softmax graders then, it is realized Can not consider to be lost during down-sampling for the semantic segmentation of the pixel to pixel end to end of whole pictures, but deconvolution behaviour True information.SegNet does not take deconvolution to operate, but successively up-sampling operation is performed to further feature image, again such that deep Then the size of layer characteristic image is classified using Softmax graders as original image size to each pixel, It considers the distortion information that image loses in convolution process because of down-sampling, but can so bring very big memory consumption. After DeepLab models add a condition random field (Conditional Random Field, abbreviation CRF) after FCN Treatment operation, optimizes to the image after segmentation in terms of edge details, but this processing procedure is not to locate end to end Reason process, in order to solve this problem, CRFasRNN is combined CRF and deep learning technology so that whole network structure It is a trainable network end to end.
The above research work is concentrated mainly in RGB color image, with the popularization of depth image sensor, for example Intel RealSense 3D Camera, Asus Xtion PRO LIVE, Microsoft Kinect, increasing research Person is transferred to research center of gravity on RGB-D images, for example object detection, three-dimensional reconstruction, robot vision, virtual reality, figure As segmentation etc..Image segmentation is concentrated mainly on the fields such as semantic segmentation, example segmentation, scene label.
Research on RGB-D images, wherein most typically Gupta et al. is fully used on the basis of RCNN RGB-D image studies object detections, and the semantic segmentation based on super-pixel feature.They propose one kind during object detection The novel method of converting for depth image being changed into triple channel image, and this triple channel is named as HHA, first by many chis The method generating region scheme of degree combination packet, is then respectively trained the RCNN of RGB and HHA, merges the two network extractions Feature, is finally classified using SVMs to each zone scheme.During semantic segmentation, the depth based on super-pixel Feature (geocentrical attitude) and geometric properties (size, shape) carry out classification mark using SVMs to super-pixel The prophesy of label, but the method is slowly, and the method generating region scheme being grouped using Multiscale combination is a kind of very slow Slow process, operating speed is slow and RCNN of redundancy, and training is divided into multiple flow line stages, and calculating super-pixel is characterized in One complexity and slow process.
The content of the invention
In order to solve the problems, such as the above, replace original slow using Faster-RCNN in the technical scheme that this method is used Slow RCNN, Faster-RCNN not only possess speed quickly using GPU in extraction characteristic aspect, and in a network can be simultaneously Generating region scheme, can realize that whole training process is to train end to end, while performing the language of RGB-D images using FCN Justice segmentation, FCN uses the further feature of GPU and deep layer convolutional network rapid extraction image, using deconvolution operation handlebar image volume Long-pending further feature and shallow-layer feature is merged, and the local semantic information of image is dissolved into global semantic information.
To achieve these goals, the technical solution adopted by the present invention is the RGB-D images based on deep layer convolutional network Object detection and semantic segmentation method, on object detection and semantic segmentation task, the content of this method is:
S1, by RGB image calculate gray level image, HHG images are merged into by gray level image and HHA images.Such as Fig. 2 institutes Show in tri- optical imagerys of the discrete Fourier transform of passage of HHA, there is the discrete fourier clearest differences is that A channel Conversion, the intensity that it is embodied in the i.e. transverse and longitudinal coordinate axle of DC component is very faint, therefore casts out this passage.Due to tri- passages of RGB Discrete Fourier transform optical imagery it is all similar and intensity of DC component is also strong, use the gray-scale map of RGB image As replacing the A channel image in HHA images, so using the triple channel image of fusion RGB image and depth image for HHG schemes Picture.
S2, using Faster-RCNN as HHG images object detecting system.Using HHG images as network input Data, Region Proposals are produced by the RPN in Faster-RCNN, and Region is extracted by Fast-RCNN The feature of Proposals, then classifies to each Region Proposals, and the testing result of this method is, in HHG Position and the scope of object are marked in image with a rectangle frame, and marks the classification of object in the rectangle frame, thing in such as Fig. 1 Shown in physical examination mapping.
S3, the mechanism for changing non-maxima suppression (Non-Maximum Suppression, abbreviation NMS) reservation frame, Frame quantity around frame is used as factor of evaluation.As shown in Figure 3.Specific step is as follows:
Each frame is 5 tuple (x1, y1, x2, y2, score), wherein (x1, y1) is the seat in the frame upper left corner Mark, (x2, y2) is the coordinate of lower right bezel corner, and score is the confidence level comprising object in frame.Frame is first according to score Value carries out ascending sort to each tuple.It is calculated as follows the double ratio of frame simultaneously (Intersection-over- Union) Duplication.
Wherein, O(i,j)Represent the double ratio and Duplication, inter of frame i and frame j(i,j)Represent the weight of frame i and frame j Folded area, area(i)Represent the area of frame i, area(j)Represent the area of frame j.For frame i, statistics's Quantity SumiIf, Sumi>=δ, casts out frame i, otherwise retains, and n represents frame total quantity, and δ is represented and accepted or rejected threshold value.
S4, the semantic segmentation task that RGB-D images are completed using HHG images and FCN.Using HHG images as the defeated of FCN Enter data, after FCN extracts semantic feature and classifies, the class label of each pixel, uses label in output HHG images It is worth as the pixel value of the pixel.The segmentation result of this method shows phase to belong to same category of pixel in HHG images Same color is as shown in semantic segmentation figure in Fig. 1.
The object detection of this paper and the structural framing of semantic segmentation are as shown in Figure 1.
Brief description of the drawings
The object detection of Fig. 1 RGB-D images and the flow chart and design sketch of semantic segmentation
Fig. 2 HHG images and RGB image, HHA image comparison figures
Nms ' and top2000 comparison diagrams when Fig. 3 reduces frame
Specific embodiment
The present invention is described in further detail below with reference to drawings and Examples.
The present invention will be illustrated from the following aspects:The fusion of RGB image and depth image, the NMS for changing, The training of model and experimental result.
The object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network comprise the following steps:
Firstth, RGB image and depth image are fused into HHG images according to the method described above;
Secondth, object detecting system model is trained;
The training method of Faster-RCNN has three kinds:One is alternately training (Alternating Training), and two is near Like joint training (Approximate Joint Training), three is non-approximated joint training (Non-approximate Joint Training).This method uses alternately training program, and alternately the thinking of training program is to make zone scheme network With the shared convolutional layer parameters of Fast-RCNN, the parameter for belonging to each automatic network is finely tuned, this scheme trains zone scheme net first Network, secondly trains Fast-RCNN models according to the zone scheme that zone scheme network is produced, and then uses Fast-RCNN models Initialization area scheme network.This process can be repeated.
This method uses 4- to walk alternately training program:The first step, uses the pre-training on ImageNet data sets Model initialization network, trains zone scheme network;Second step, the zone scheme produced using the zone scheme network of the first step As the pre-detection frame of Fast-RCNN, the model initialization network of the pre-training on ImageNet data sets, training are used Fast-RCNN.3rd step, the netinit zone scheme network and training network trained using second step, because using Shared convolutional layer parameter, so only fine setting belongs to the network layer parameter of zone scheme network here.4th step, equally shares convolution Layer parameter, the netinit Fast-RCNN trained using second step simultaneously finely tunes the network layer parameter for being pertaining only to the network.We The first step and second step are referred to as the first stage, the 3rd step and the 4th step are referred to as second stage.
The computational methods of this paper loss functions are referred to multitask loss (Multi-task loss) of Faster-RCNN Formula, formula expression is as follows:
Wherein, piProphesy probability of i-th anchor point frame (Anchors box) comprising object is represented,Represent that ground is true Value (Ground-Truth) label, if anchor point frame is positive example,If anchor point frame is counter-example,tiRepresent The coordinate (4 parameters) of prophesy frame i,Represent and the related ground truth frame of positive example anchor point frame.LclsRepresent Softmax Classification Loss,Represent that frame returns loss,NclsTable Show block size, N in experimentcls=256.NregRepresent the quantity of anchor point frame, λ represents an equalizing coefficient, taken in experiment λ= 10,Computational methods be referred to Fast-RCNN, formula is as follows:
3rd, semantic segmentation system model is trained
During the training of full convolution, backpropagation equally uses stochastic gradient descent (Stochastic gradient Descent, abbreviation SGD) backpropagation is carried out, loss is that the Softmax losses of each pixel are sued for peace.Full convolution Training network (herein only use Vgg-16 networks) be divided into three kinds.The first be after conv7 convolutional layers perform one across Step (Stride) is 32 deconvolution operation (FCN-32s);Be for second conv7 convolutional layers are performed one stride for 2 it is anti- Convolution results perform one and merges (average) with pool4 ponds layers result, then this fusion results execution one is striden for 16 deconvolution operation (FCN-16s);The third is that three kinds of results are merged, and these three results are respectively to be held to conv7 Row one stride for 4 deconvolution result, to pool4 ponds layer perform one stride for 2 deconvolution result, pool3 ponds Layer.Once striden again using this fusion results as 8 deconvolution operates (FCN-8s);Wherein second and the third side Formula is referred to as the jump framework of full convolutional network.
Full convolutional network training process typically uses a model fine setting FCN-32s network for training, then FCN- 16s is based on the model of FCN-32s models fine setting jump framework, finally finely tunes the model of FCN-8s using the model of FCN-16s.This Text continues to use this training mode, and the model trained using FCN-8s is tested, and as last experimental result.
4th, experimental result;
Using RGB color image and the fused images of depth image --- HHG images are tested, and are realized using HHG images The average accuracy of object detection is 37.6% (row of table the 1, the 6th), and the result than Gupta et al. improves 5.1%.
Table 1:1,2,3 row are the experimental results of Gupta et al., and 4-8 row are the experimental results of this method, and wherein nms ' is represented Be to use the experiment after the non-maxima suppression changed.Experimental result is the percentage of Average Accuracy.
The frame quantity produced by zone scheme network there are about 17000, then process overlap by non-maxima suppression Frame, be left 2000 to 3000 frames, this process need average time be 0.71s, then once changed again Non-maxima suppression afterwards is reduced to 2,000 (± 50) by the quantity of frame, and the average time that this process needs is 0.133s. Wherein the value on δ refer to table 2.When frame quantity is in different zones (between 2050 to 3000) δ value (δ ∈ [8, 13] it is) different, when quantity is less than 2050, the non-maxima suppression changed is not performed, (this when quantity is more than 3000 The situation of kind seldom occurs), take 2000 frames of score values highest.The average accuracy of experimental result is on the basis of HHG images Improve 1.6% (row of table the 1, the 7th).
Finally use VGG-16 network models as final object detection experimental result, average accuracy is 43.7% (row of table the 1, the 8th), the experimental result than Gupta et al. improves 11.2%.
Table 2:The first row is represented and processes remaining frame quantity interval by first time non-maxima suppression, and the second row is represented δ is in different interval values.
It is as shown in table 3 on segmentation result.This method obtains best segmentation using HHG images under FCN-8s networks As a result, average double ratio and 30.9% is brought up to from the 28.6% of Gupta et al..
Table 3:40 kinds of IU of semantic segmentation label (%), the first row is the semantic segmentation result of Gupta et al., and the second row is arrived Fourth line is that we use HHG images semantic segmentation result respectively under FCN-32s, FCN-16s, FCN-8s network.

Claims (4)

1. the object detection and semantic segmentation method of the RGB-D images of deep layer convolutional network are based on, it is characterised in that:
S1, by RGB image calculate gray level image, HHG images are merged into by gray level image and HHA images;Tri- passages of HHA Discrete Fourier transform optical imagery in, have the discrete Fourier transform clearest differences is that A channel, it is embodied in directly Flow component is that the intensity of transverse and longitudinal coordinate axle is very faint, therefore casts out this passage;Due to tri- discrete Fourier transforms of passage of RGB Optical imagery it is all similar and intensity of DC component is also strong, in replacing HHA images using the gray level image of RGB image A channel image, so using fusion RGB image and depth image triple channel image be HHG images;
S2, using Faster-RCNN as HHG images object detecting system;Using HHG images as network input number According to, Region Proposals are produced by the RPN in Faster-RCNN, Region is extracted by Fast-RCNN The feature of Proposals, then classifies to each Region Proposals, and the testing result of this method is, in HHG Position and the scope of object are marked in image with a rectangle frame, and marks the classification of object in the rectangle frame;
S3, change non-maxima suppression are the mechanism that NMS retains frame, using the frame quantity around frame as factor of evaluation; Specific step is as follows:
Each frame is 5 tuple (x1, y1, x2, y2, score), wherein (x1, y1) is the coordinate in the frame upper left corner, (x2, y2) is the coordinate of lower right bezel corner, and score is the confidence level comprising object in frame;It is right that frame is first according to score values Each tuple carries out ascending sort;It is calculated as follows the double ratio and Duplication of frame;
O ( i , j ) i = [ 1 , n - δ ] j = [ i + 1 , n ] = inter ( i , j ) area ( i ) + area ( j ) - inter ( i , j )
Wherein, O(i,j)Represent the double ratio and Duplication, inter of frame i and frame j(i,j)Represent the faying surface of frame i and frame j Product, area(i)Represent the area of frame i, area(j)Represent the area of frame j;For frame i, statisticsQuantity SumiIf, Sumi>=δ, casts out frame i, otherwise retains, and n represents frame total quantity, and δ is represented and accepted or rejected threshold value;
S4, the semantic segmentation task that RGB-D images are completed using HHG images and FCN;Using HHG images as FCN input number According to after FCN extracts semantic feature and classifies, the class label of each pixel, is made with label value in output HHG images It is the pixel value of the pixel.
2. the object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network according to claim 1, It is characterized in that:
The training method of Faster-RCNN has three kinds:One is alternately training, and two is approximate joint training, and three is non-approximated joint Training;This method uses alternately training program, and alternately the thinking of training program is to make zone scheme network and Fast-RCNN Shared convolutional layer parameter, fine setting belongs to the parameter of each automatic network, and this scheme trains zone scheme network first, secondly according to area The zone scheme training Fast-RCNN models that domain scheme network is produced, then use Fast-RCNN model initialization zone schemes Network;This process can be repeated.
3. the object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network according to claim 2, It is characterized in that:
This method uses 4- to walk alternately training program:The first step, uses the model of the pre-training on ImageNet data sets Initialization network, trains zone scheme network;Second step, using the first step zone scheme network produce zone scheme as The pre-detection frame of Fast-RCNN, using the model initialization network of the pre-training on ImageNet data sets, trains Fast- RCNN;3rd step, the netinit zone scheme network and training network trained using second step, because having used shared volume Lamination parameter, so only fine setting belongs to the network layer parameter of zone scheme network here;4th step, equally shared convolutional layer ginseng Number, the netinit Fast-RCNN trained using second step simultaneously finely tunes the network layer parameter for being pertaining only to the network;We are One step and second step are referred to as the first stage, and the 3rd step and the 4th step are referred to as second stage;
The computational methods of this paper loss functions are referred to the multitask loss formula of Faster-RCNN, and formula expression is as follows:
L ( { p i } , { t i } ) = 1 N c l s Σ i L c l s ( p i , p i * ) + λ 1 N r e g Σ i p i * L r e g ( t i , t i * )
Wherein, piI-th prophesy probability of the anchor point frame comprising object is represented,Ground truth label is represented, if anchor point side Frame is positive example,If anchor point frame is counter-example,tiThe coordinate (4 parameters) of prophesy frame i is represented,Represent With the related ground truth frame of positive example anchor point frame;LclsSoftmax Classification Loss is represented,Represent that frame is returned Loss,NclsRepresent block size, N in experimentcls=256;NregRepresent anchor The quantity of frame is put, λ represents an equalizing coefficient, λ=10 are taken in experiment,Computational methods be referred to Fast- RCNN, formula is as follows:
4. the object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network according to claim 1, It is characterized in that:
During the training of full convolution, backpropagation equally carries out backpropagation using stochastic gradient descent, and loss is to each picture The Softmax losses of vegetarian refreshments are sued for peace;The training network of full convolution is divided into three kinds;The first is held after conv7 convolutional layers Row one strides as 32 deconvolution operates FCN-32s;Be for second conv7 convolutional layers are performed one stride for 2 warp Product result is merged for one with the execution of pool4 ponds layers result, then this fusion results execution one is striden for 16 warp Product operation FCN-16s;The third is that three kinds of results are merged, these three results be respectively to conv7 perform one stride Be 4 deconvolution result, to pool4 ponds layer perform one stride for 2 deconvolution result, pool3 ponds layer;Using this Fusion results are once striden as 8 deconvolution operates FCN-8s again;It is referred to as full volume with the third mode wherein second The jump framework of product network;
Full convolutional network training process typically uses a model fine setting FCN-32s network for training, then FCN-16s bases In the model of FCN-32s models fine setting jump framework, finally the model of FCN-8s is finely tuned using the model of FCN-16s;This paper edges This training mode is used, the model trained using FCN-8s is tested, and as last experimental result.
CN201611168200.3A 2016-12-16 2016-12-16 The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network Active CN106709568B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611168200.3A CN106709568B (en) 2016-12-16 2016-12-16 The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611168200.3A CN106709568B (en) 2016-12-16 2016-12-16 The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network

Publications (2)

Publication Number Publication Date
CN106709568A true CN106709568A (en) 2017-05-24
CN106709568B CN106709568B (en) 2019-03-22

Family

ID=58938969

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611168200.3A Active CN106709568B (en) 2016-12-16 2016-12-16 The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network

Country Status (1)

Country Link
CN (1) CN106709568B (en)

Cited By (105)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103613A (en) * 2017-03-28 2017-08-29 深圳市未来媒体技术研究院 A kind of three-dimension gesture Attitude estimation method
CN107392189A (en) * 2017-09-05 2017-11-24 百度在线网络技术(北京)有限公司 For the method and apparatus for the driving behavior for determining unmanned vehicle
CN107437099A (en) * 2017-08-03 2017-12-05 哈尔滨工业大学 A kind of specific dress ornament image recognition and detection method based on machine learning
CN107507126A (en) * 2017-07-27 2017-12-22 大连和创懒人科技有限公司 A kind of method that 3D scenes are reduced using RGB image
CN107545263A (en) * 2017-08-02 2018-01-05 清华大学 A kind of object detecting method and device
CN107564025A (en) * 2017-08-09 2018-01-09 浙江大学 A kind of power equipment infrared image semantic segmentation method based on deep neural network
CN107563290A (en) * 2017-08-01 2018-01-09 中国农业大学 A kind of pedestrian detection method and device based on image
CN107563381A (en) * 2017-09-12 2018-01-09 国家新闻出版广电总局广播科学研究院 The object detection method of multiple features fusion based on full convolutional network
CN107563284A (en) * 2017-07-26 2018-01-09 中国农业大学 Pedestrian's method for tracing and device
CN107563405A (en) * 2017-07-19 2018-01-09 同济大学 Garage automatic Pilot semantic objects recognition methods based on multiresolution neutral net
CN107563372A (en) * 2017-07-20 2018-01-09 济南中维世纪科技有限公司 A kind of license plate locating method based on deep learning SSD frameworks
CN107578436A (en) * 2017-08-02 2018-01-12 南京邮电大学 A kind of monocular image depth estimation method based on full convolutional neural networks FCN
CN107657257A (en) * 2017-08-14 2018-02-02 中国矿业大学 A kind of semantic image dividing method based on multichannel convolutive neutral net
CN107680113A (en) * 2017-10-27 2018-02-09 武汉大学 The image partition method of multi-layer segmentation network based on Bayesian frame edge prior
CN107680109A (en) * 2017-09-15 2018-02-09 盐城禅图智能科技有限公司 It is a kind of to quote inverse notice and the image, semantic dividing method of pixel similarity study
CN107688784A (en) * 2017-08-23 2018-02-13 福建六壬网安股份有限公司 A kind of character identifying method and storage medium based on further feature and shallow-layer Fusion Features
CN107742311A (en) * 2017-09-29 2018-02-27 北京易达图灵科技有限公司 A kind of method and device of vision positioning
CN107808131A (en) * 2017-10-23 2018-03-16 华南理工大学 Dynamic gesture identification method based on binary channel depth convolutional neural networks
CN107862674A (en) * 2017-11-08 2018-03-30 杭州测度科技有限公司 Depth image fusion method and system
CN107886477A (en) * 2017-09-20 2018-04-06 武汉环宇智行科技有限公司 Unmanned neutral body vision merges antidote with low line beam laser radar
CN107886117A (en) * 2017-10-30 2018-04-06 国家新闻出版广电总局广播科学研究院 The algorithm of target detection merged based on multi-feature extraction and multitask
CN107908635A (en) * 2017-09-26 2018-04-13 百度在线网络技术(北京)有限公司 Establish textual classification model and the method, apparatus of text classification
CN107944457A (en) * 2017-11-23 2018-04-20 浙江清华长三角研究院 Drawing object identification and extracting method under a kind of complex scene
CN108009481A (en) * 2017-11-22 2018-05-08 浙江大华技术股份有限公司 A kind of training method and device of CNN models, face identification method and device
CN108090442A (en) * 2017-12-15 2018-05-29 四川大学 A kind of airport scene monitoring method based on convolutional neural networks
CN108171141A (en) * 2017-12-25 2018-06-15 淮阴工学院 The video target tracking method of cascade multi-pattern Fusion based on attention model
CN108182428A (en) * 2018-01-31 2018-06-19 福州大学 The method that front truck state recognition and vehicle follow
CN108320286A (en) * 2018-02-28 2018-07-24 苏州大学 Image significance detection method, system, equipment and computer readable storage medium
CN108334955A (en) * 2018-03-01 2018-07-27 福州大学 Copy of ID Card detection method based on Faster-RCNN
CN108345887A (en) * 2018-01-29 2018-07-31 清华大学深圳研究生院 The training method and image, semantic dividing method of image, semantic parted pattern
CN108399361A (en) * 2018-01-23 2018-08-14 南京邮电大学 A kind of pedestrian detection method based on convolutional neural networks CNN and semantic segmentation
CN108446662A (en) * 2018-04-02 2018-08-24 电子科技大学 A kind of pedestrian detection method based on semantic segmentation information
CN108491803A (en) * 2018-03-27 2018-09-04 北京中硕众联智能电子科技有限公司 A kind of device and the corresponding recognition methods of identification objects in images and temperature
CN108520219A (en) * 2018-03-30 2018-09-11 台州智必安科技有限责任公司 A kind of multiple dimensioned fast face detecting method of convolutional neural networks Fusion Features
CN108537292A (en) * 2018-04-10 2018-09-14 上海白泽网络科技有限公司 Semantic segmentation network training method, image, semantic dividing method and device
CN108596102A (en) * 2018-04-26 2018-09-28 北京航空航天大学青岛研究院 Indoor scene object segmentation grader building method based on RGB-D
CN108596240A (en) * 2018-04-20 2018-09-28 华中科技大学 A kind of image, semantic dividing method based on differentiation character network
CN108647562A (en) * 2018-03-27 2018-10-12 北京中硕众联智能电子科技有限公司 Identify device and the corresponding method of objects in images and temperature
CN108664974A (en) * 2018-04-03 2018-10-16 华南理工大学 A kind of semantic segmentation method based on RGBD images Yu Complete Disability difference network
CN108710863A (en) * 2018-05-24 2018-10-26 东北大学 Unmanned plane Scene Semantics dividing method based on deep learning and system
CN108734694A (en) * 2018-04-09 2018-11-02 华南农业大学 Thyroid tumors ultrasonoscopy automatic identifying method based on faster r-cnn
CN108876793A (en) * 2018-04-13 2018-11-23 北京迈格威科技有限公司 Semantic segmentation methods, devices and systems and storage medium
CN108875588A (en) * 2018-05-25 2018-11-23 武汉大学 Across camera pedestrian detection tracking based on deep learning
CN108876796A (en) * 2018-06-08 2018-11-23 长安大学 A kind of lane segmentation system and method based on full convolutional neural networks and condition random field
CN108895981A (en) * 2018-05-29 2018-11-27 南京怀萃智能科技有限公司 A kind of method for three-dimensional measurement, device, server and storage medium
CN108985194A (en) * 2018-06-29 2018-12-11 华南理工大学 A kind of intelligent vehicle based on image, semantic segmentation can travel the recognition methods in region
WO2018232592A1 (en) * 2017-06-20 2018-12-27 Microsoft Technology Licensing, Llc. Fully convolutional instance-aware semantic segmentation
CN109102457A (en) * 2018-06-12 2018-12-28 杭州米绘科技有限公司 A kind of intelligent color change system and method based on convolutional neural networks
CN109101914A (en) * 2018-08-01 2018-12-28 北京飞搜科技有限公司 It is a kind of based on multiple dimensioned pedestrian detection method and device
CN109271990A (en) * 2018-09-03 2019-01-25 北京邮电大学 A kind of semantic segmentation method and device for RGB-D image
CN109325505A (en) * 2018-09-11 2019-02-12 北京陌上花科技有限公司 Example dividing method and device, mobile phone terminal for embedded device
CN109325385A (en) * 2017-07-31 2019-02-12 株式会社理光 Target detection and region segmentation method, device and computer readable storage medium
CN109377479A (en) * 2018-09-27 2019-02-22 中国电子科技集团公司第五十四研究所 Satellite dish object detection method based on remote sensing image
CN109564684A (en) * 2018-01-15 2019-04-02 深圳鲲云信息科技有限公司 Image, semantic dividing method, programmable logic circuit, system and electronic equipment
CN109598268A (en) * 2018-11-23 2019-04-09 安徽大学 A kind of RGB-D well-marked target detection method based on single flow depth degree network
CN109598728A (en) * 2018-11-30 2019-04-09 腾讯科技(深圳)有限公司 Image partition method, device, diagnostic system and storage medium
CN109655019A (en) * 2018-10-29 2019-04-19 北方工业大学 Cargo volume measurement method based on deep learning and three-dimensional reconstruction
CN109685762A (en) * 2018-11-09 2019-04-26 五邑大学 A kind of Downtilt measurement method based on multiple dimensioned deep semantic segmentation network
CN109711413A (en) * 2018-12-30 2019-05-03 陕西师范大学 Image, semantic dividing method based on deep learning
CN109711367A (en) * 2018-12-29 2019-05-03 北京中科寒武纪科技有限公司 Operation method, device and Related product
CN109753903A (en) * 2019-02-27 2019-05-14 北航(四川)西部国际创新港科技有限公司 A kind of unmanned plane detection method based on deep learning
CN109782771A (en) * 2019-02-26 2019-05-21 西安交通大学 A kind of orchard mobile robot and edge of a field forward method
CN109801337A (en) * 2019-01-21 2019-05-24 同济大学 A kind of 6D position and orientation estimation method of Case-based Reasoning segmentation network and iteration optimization
CN109870983A (en) * 2017-12-04 2019-06-11 北京京东尚科信息技术有限公司 Handle the method, apparatus of pallet stacking image and the system for picking of storing in a warehouse
CN109872357A (en) * 2019-01-16 2019-06-11 创新奇智(广州)科技有限公司 A kind of article arrangement face accounting calculation method, system and electronic equipment
CN109903331A (en) * 2019-01-08 2019-06-18 杭州电子科技大学 A kind of convolutional neural networks object detection method based on RGB-D camera
CN109902572A (en) * 2019-01-24 2019-06-18 哈尔滨理工大学 A kind of vehicle checking method based on deep learning
CN109934342A (en) * 2018-12-28 2019-06-25 深圳奥比中光科技有限公司 Neural network model training method, depth image restorative procedure and system
CN110009648A (en) * 2019-03-04 2019-07-12 东南大学 Trackside image Method of Vehicle Segmentation based on depth Fusion Features convolutional neural networks
CN110008848A (en) * 2019-03-13 2019-07-12 华南理工大学 A kind of travelable area recognizing method of the road based on binocular stereo vision
CN110008808A (en) * 2018-12-29 2019-07-12 北京迈格威科技有限公司 Panorama dividing method, device and system and storage medium
CN110070124A (en) * 2019-04-15 2019-07-30 广州小鹏汽车科技有限公司 A kind of image amplification method and system based on production confrontation network
CN110084257A (en) * 2018-01-26 2019-08-02 北京京东尚科信息技术有限公司 Method and apparatus for detecting target
CN110188780A (en) * 2019-06-03 2019-08-30 电子科技大学中山学院 Method and device for constructing deep learning model for positioning multi-target feature points
CN110263731A (en) * 2019-06-24 2019-09-20 电子科技大学 A kind of single step face detection system
CN110287777A (en) * 2019-05-16 2019-09-27 西北大学 A kind of golden monkey body partitioning algorithm under natural scene
CN110298330A (en) * 2019-07-05 2019-10-01 东北大学 A kind of detection of transmission line polling robot monocular and localization method
CN110348342A (en) * 2019-06-27 2019-10-18 广东技术师范大学天河学院 A kind of piping disease image partition method based on full convolutional network
CN110363201A (en) * 2019-07-10 2019-10-22 上海交通大学 Weakly supervised semantic segmentation method and system based on Cooperative Study
CN110388931A (en) * 2018-04-17 2019-10-29 百度(美国)有限责任公司 The two-dimentional bounding box of object is converted into the method for the three-dimensional position of automatic driving vehicle
CN110399884A (en) * 2019-07-10 2019-11-01 浙江理工大学 A kind of adaptive anchor frame model vehicle checking method of Fusion Features
CN110473288A (en) * 2019-08-23 2019-11-19 清华四川能源互联网研究院 Dam model method for reconstructing, device and electronic equipment
CN110599538A (en) * 2019-09-30 2019-12-20 山东信通电子股份有限公司 Method and device for identifying icing thickness of transmission line conductor
CN110738132A (en) * 2019-09-23 2020-01-31 中国海洋大学 target detection quality blind evaluation method with discriminant perception capability
CN110766096A (en) * 2019-10-31 2020-02-07 北京金山云网络技术有限公司 Video classification method and device and electronic equipment
US10600167B2 (en) 2017-01-18 2020-03-24 Nvidia Corporation Performing spatiotemporal filtering
CN110941995A (en) * 2019-11-01 2020-03-31 中山大学 Real-time target detection and semantic segmentation multi-task learning method based on lightweight network
CN111027493A (en) * 2019-12-13 2020-04-17 电子科技大学 Pedestrian detection method based on deep learning multi-network soft fusion
CN111104840A (en) * 2018-10-26 2020-05-05 斯特拉德视觉公司 Learning and testing method and device based on regression loss
CN111222468A (en) * 2020-01-08 2020-06-02 浙江光珀智能科技有限公司 People stream detection method and system based on deep learning
CN111368829A (en) * 2020-02-28 2020-07-03 北京理工大学 Visual semantic relation detection method based on RGB-D image
CN111507985A (en) * 2020-03-19 2020-08-07 北京市威富安防科技有限公司 Image instance segmentation optimization processing method and device and computer equipment
CN111553925A (en) * 2020-04-27 2020-08-18 南通智能感知研究院 End-to-end crop image segmentation method and system based on FCN
TWI702536B (en) * 2019-12-31 2020-08-21 財團法人工業技術研究院 Training method and system of object detection model based on adaptive annotation design
CN111712853A (en) * 2018-02-16 2020-09-25 松下知识产权经营株式会社 Processing method and processing device using the same
CN111783784A (en) * 2020-06-30 2020-10-16 创新奇智(合肥)科技有限公司 Method and device for detecting building cavity, electronic equipment and storage medium
WO2020237693A1 (en) * 2019-05-31 2020-12-03 华南理工大学 Multi-source sensing method and system for water surface unmanned equipment
CN112528705A (en) * 2019-09-17 2021-03-19 同方威视技术股份有限公司 Image recognition system and method based on semantics
CN112667832A (en) * 2020-12-31 2021-04-16 哈尔滨工业大学 Vision-based mutual positioning method in unknown indoor environment
CN113033398A (en) * 2021-03-25 2021-06-25 深圳市康冠商用科技有限公司 Gesture recognition method and device, computer equipment and storage medium
CN113763422A (en) * 2021-07-30 2021-12-07 北京交通大学 RGB-D image saliency target detection method
CN114343579A (en) * 2022-01-05 2022-04-15 浙江大学 Infant iatrogenic skin injury automatic evaluation handheld device
CN114373218A (en) * 2022-03-21 2022-04-19 北京万里红科技有限公司 Method for generating convolution network for detecting living body object
TWI794414B (en) * 2018-02-21 2023-03-01 德商羅伯特博斯奇股份有限公司 Systems and methods for real-time object detection using depth sensors
US11651229B2 (en) 2017-11-22 2023-05-16 Zhejiang Dahua Technology Co., Ltd. Methods and systems for face recognition

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101478709B1 (en) * 2012-06-27 2015-01-05 한국과학기술원 Method and apparatus for extracting and generating feature point and feature descriptor rgb-d image
WO2016045711A1 (en) * 2014-09-23 2016-03-31 Keylemon Sa A face pose rectification method and apparatus
CN106204165A (en) * 2016-08-11 2016-12-07 广州出益信息科技有限公司 A kind of advertisement placement method and device
CN106203506A (en) * 2016-07-11 2016-12-07 上海凌科智能科技有限公司 A kind of pedestrian detection method based on degree of depth learning art

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101478709B1 (en) * 2012-06-27 2015-01-05 한국과학기술원 Method and apparatus for extracting and generating feature point and feature descriptor rgb-d image
WO2016045711A1 (en) * 2014-09-23 2016-03-31 Keylemon Sa A face pose rectification method and apparatus
CN106203506A (en) * 2016-07-11 2016-12-07 上海凌科智能科技有限公司 A kind of pedestrian detection method based on degree of depth learning art
CN106204165A (en) * 2016-08-11 2016-12-07 广州出益信息科技有限公司 A kind of advertisement placement method and device

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
JONATHAN LONG,EVAN SHELHAMER,TREVOR DARRELL: ""Fully Convolutional Networks for Semantic Segmentation"", 《2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *
SAURABH GUPTA等: ""Aligning 3D Models to RGB-D Images of Cluttered Scenes"", 《2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *
SAURABH GUPTA等: ""Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images"", 《2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 *
蔡强等: ""基于ANNet网络的RGB-D图像的目标检测"", 《系统仿真学报》 *

Cited By (151)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11113800B2 (en) 2017-01-18 2021-09-07 Nvidia Corporation Filtering image data using a neural network
US10600167B2 (en) 2017-01-18 2020-03-24 Nvidia Corporation Performing spatiotemporal filtering
CN107103613A (en) * 2017-03-28 2017-08-29 深圳市未来媒体技术研究院 A kind of three-dimension gesture Attitude estimation method
CN107103613B (en) * 2017-03-28 2019-11-15 深圳市未来媒体技术研究院 A kind of three-dimension gesture Attitude estimation method
WO2018232592A1 (en) * 2017-06-20 2018-12-27 Microsoft Technology Licensing, Llc. Fully convolutional instance-aware semantic segmentation
CN107563405A (en) * 2017-07-19 2018-01-09 同济大学 Garage automatic Pilot semantic objects recognition methods based on multiresolution neutral net
CN107563372B (en) * 2017-07-20 2021-01-29 济南中维世纪科技有限公司 License plate positioning method based on deep learning SSD frame
CN107563372A (en) * 2017-07-20 2018-01-09 济南中维世纪科技有限公司 A kind of license plate locating method based on deep learning SSD frameworks
CN107563284A (en) * 2017-07-26 2018-01-09 中国农业大学 Pedestrian's method for tracing and device
CN107507126B (en) * 2017-07-27 2020-09-18 和创懒人(大连)科技有限公司 Method for restoring 3D scene by using RGB image
CN107507126A (en) * 2017-07-27 2017-12-22 大连和创懒人科技有限公司 A kind of method that 3D scenes are reduced using RGB image
CN109325385A (en) * 2017-07-31 2019-02-12 株式会社理光 Target detection and region segmentation method, device and computer readable storage medium
CN107563290A (en) * 2017-08-01 2018-01-09 中国农业大学 A kind of pedestrian detection method and device based on image
CN107578436A (en) * 2017-08-02 2018-01-12 南京邮电大学 A kind of monocular image depth estimation method based on full convolutional neural networks FCN
CN107545263B (en) * 2017-08-02 2020-12-15 清华大学 Object detection method and device
CN107545263A (en) * 2017-08-02 2018-01-05 清华大学 A kind of object detecting method and device
CN107437099A (en) * 2017-08-03 2017-12-05 哈尔滨工业大学 A kind of specific dress ornament image recognition and detection method based on machine learning
CN107564025A (en) * 2017-08-09 2018-01-09 浙江大学 A kind of power equipment infrared image semantic segmentation method based on deep neural network
CN107564025B (en) * 2017-08-09 2020-05-29 浙江大学 Electric power equipment infrared image semantic segmentation method based on deep neural network
CN107657257A (en) * 2017-08-14 2018-02-02 中国矿业大学 A kind of semantic image dividing method based on multichannel convolutive neutral net
CN107688784A (en) * 2017-08-23 2018-02-13 福建六壬网安股份有限公司 A kind of character identifying method and storage medium based on further feature and shallow-layer Fusion Features
WO2019047655A1 (en) * 2017-09-05 2019-03-14 百度在线网络技术(北京)有限公司 Method and apparatus for use in determining driving behavior of driverless vehicle
CN107392189A (en) * 2017-09-05 2017-11-24 百度在线网络技术(北京)有限公司 For the method and apparatus for the driving behavior for determining unmanned vehicle
CN107563381A (en) * 2017-09-12 2018-01-09 国家新闻出版广电总局广播科学研究院 The object detection method of multiple features fusion based on full convolutional network
CN107563381B (en) * 2017-09-12 2020-10-23 国家新闻出版广电总局广播科学研究院 Multi-feature fusion target detection method based on full convolution network
CN107680109A (en) * 2017-09-15 2018-02-09 盐城禅图智能科技有限公司 It is a kind of to quote inverse notice and the image, semantic dividing method of pixel similarity study
CN107886477A (en) * 2017-09-20 2018-04-06 武汉环宇智行科技有限公司 Unmanned neutral body vision merges antidote with low line beam laser radar
US10783331B2 (en) 2017-09-26 2020-09-22 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for building text classification model, and text classification method and apparatus
CN107908635B (en) * 2017-09-26 2021-04-16 百度在线网络技术(北京)有限公司 Method and device for establishing text classification model and text classification
CN107908635A (en) * 2017-09-26 2018-04-13 百度在线网络技术(北京)有限公司 Establish textual classification model and the method, apparatus of text classification
CN107742311A (en) * 2017-09-29 2018-02-27 北京易达图灵科技有限公司 A kind of method and device of vision positioning
CN107742311B (en) * 2017-09-29 2020-02-18 北京易达图灵科技有限公司 Visual positioning method and device
CN107808131A (en) * 2017-10-23 2018-03-16 华南理工大学 Dynamic gesture identification method based on binary channel depth convolutional neural networks
CN107808131B (en) * 2017-10-23 2019-12-10 华南理工大学 Dynamic gesture recognition method based on dual-channel deep convolutional neural network
CN107680113A (en) * 2017-10-27 2018-02-09 武汉大学 The image partition method of multi-layer segmentation network based on Bayesian frame edge prior
CN107886117A (en) * 2017-10-30 2018-04-06 国家新闻出版广电总局广播科学研究院 The algorithm of target detection merged based on multi-feature extraction and multitask
CN107862674B (en) * 2017-11-08 2020-07-03 杭州测度科技有限公司 Depth image fusion method and system
CN107862674A (en) * 2017-11-08 2018-03-30 杭州测度科技有限公司 Depth image fusion method and system
CN108009481A (en) * 2017-11-22 2018-05-08 浙江大华技术股份有限公司 A kind of training method and device of CNN models, face identification method and device
US11651229B2 (en) 2017-11-22 2023-05-16 Zhejiang Dahua Technology Co., Ltd. Methods and systems for face recognition
CN107944457A (en) * 2017-11-23 2018-04-20 浙江清华长三角研究院 Drawing object identification and extracting method under a kind of complex scene
CN109870983B (en) * 2017-12-04 2022-01-04 北京京东尚科信息技术有限公司 Method and device for processing tray stack image and system for warehousing goods picking
CN109870983A (en) * 2017-12-04 2019-06-11 北京京东尚科信息技术有限公司 Handle the method, apparatus of pallet stacking image and the system for picking of storing in a warehouse
CN108090442A (en) * 2017-12-15 2018-05-29 四川大学 A kind of airport scene monitoring method based on convolutional neural networks
CN108171141B (en) * 2017-12-25 2020-07-14 淮阴工学院 Attention model-based cascaded multi-mode fusion video target tracking method
CN108171141A (en) * 2017-12-25 2018-06-15 淮阴工学院 The video target tracking method of cascade multi-pattern Fusion based on attention model
WO2019136760A1 (en) * 2018-01-15 2019-07-18 深圳鲲云信息科技有限公司 Image semantic segmentation method, programmable logic circuit, system and electronic device
US11636665B2 (en) 2018-01-15 2023-04-25 Shenzhen Corerain Technologies Co., Ltd. Streaming image semantic segmentation method, logical integrated circuit system and electronic device
CN109564684B (en) * 2018-01-15 2023-05-02 深圳鲲云信息科技有限公司 Image semantic segmentation method, programmable logic circuit, system and electronic equipment
CN109564684A (en) * 2018-01-15 2019-04-02 深圳鲲云信息科技有限公司 Image, semantic dividing method, programmable logic circuit, system and electronic equipment
CN108399361A (en) * 2018-01-23 2018-08-14 南京邮电大学 A kind of pedestrian detection method based on convolutional neural networks CNN and semantic segmentation
CN110084257A (en) * 2018-01-26 2019-08-02 北京京东尚科信息技术有限公司 Method and apparatus for detecting target
CN108345887A (en) * 2018-01-29 2018-07-31 清华大学深圳研究生院 The training method and image, semantic dividing method of image, semantic parted pattern
CN108345887B (en) * 2018-01-29 2020-10-02 清华大学深圳研究生院 Training method of image semantic segmentation model and image semantic segmentation method
CN108182428A (en) * 2018-01-31 2018-06-19 福州大学 The method that front truck state recognition and vehicle follow
CN111712853A (en) * 2018-02-16 2020-09-25 松下知识产权经营株式会社 Processing method and processing device using the same
CN111712853B (en) * 2018-02-16 2023-11-07 松下知识产权经营株式会社 Processing method and processing device using same
TWI794414B (en) * 2018-02-21 2023-03-01 德商羅伯特博斯奇股份有限公司 Systems and methods for real-time object detection using depth sensors
CN108320286A (en) * 2018-02-28 2018-07-24 苏州大学 Image significance detection method, system, equipment and computer readable storage medium
CN108334955A (en) * 2018-03-01 2018-07-27 福州大学 Copy of ID Card detection method based on Faster-RCNN
CN108647562A (en) * 2018-03-27 2018-10-12 北京中硕众联智能电子科技有限公司 Identify device and the corresponding method of objects in images and temperature
CN108491803A (en) * 2018-03-27 2018-09-04 北京中硕众联智能电子科技有限公司 A kind of device and the corresponding recognition methods of identification objects in images and temperature
CN108520219A (en) * 2018-03-30 2018-09-11 台州智必安科技有限责任公司 A kind of multiple dimensioned fast face detecting method of convolutional neural networks Fusion Features
CN108446662A (en) * 2018-04-02 2018-08-24 电子科技大学 A kind of pedestrian detection method based on semantic segmentation information
CN108664974A (en) * 2018-04-03 2018-10-16 华南理工大学 A kind of semantic segmentation method based on RGBD images Yu Complete Disability difference network
CN108734694A (en) * 2018-04-09 2018-11-02 华南农业大学 Thyroid tumors ultrasonoscopy automatic identifying method based on faster r-cnn
CN108537292A (en) * 2018-04-10 2018-09-14 上海白泽网络科技有限公司 Semantic segmentation network training method, image, semantic dividing method and device
CN108537292B (en) * 2018-04-10 2020-07-31 上海白泽网络科技有限公司 Semantic segmentation network training method, image semantic segmentation method and device
CN108876793A (en) * 2018-04-13 2018-11-23 北京迈格威科技有限公司 Semantic segmentation methods, devices and systems and storage medium
CN110388931B (en) * 2018-04-17 2023-11-10 百度(美国)有限责任公司 Method for converting a two-dimensional bounding box of an object into a three-dimensional position of an autonomous vehicle
CN110388931A (en) * 2018-04-17 2019-10-29 百度(美国)有限责任公司 The two-dimentional bounding box of object is converted into the method for the three-dimensional position of automatic driving vehicle
CN108596240B (en) * 2018-04-20 2020-05-19 华中科技大学 Image semantic segmentation method based on discriminant feature network
CN108596240A (en) * 2018-04-20 2018-09-28 华中科技大学 A kind of image, semantic dividing method based on differentiation character network
CN108596102A (en) * 2018-04-26 2018-09-28 北京航空航天大学青岛研究院 Indoor scene object segmentation grader building method based on RGB-D
CN108596102B (en) * 2018-04-26 2022-04-05 北京航空航天大学青岛研究院 RGB-D-based indoor scene object segmentation classifier construction method
CN108710863A (en) * 2018-05-24 2018-10-26 东北大学 Unmanned plane Scene Semantics dividing method based on deep learning and system
CN108875588A (en) * 2018-05-25 2018-11-23 武汉大学 Across camera pedestrian detection tracking based on deep learning
CN108875588B (en) * 2018-05-25 2022-04-15 武汉大学 Cross-camera pedestrian detection tracking method based on deep learning
CN108895981A (en) * 2018-05-29 2018-11-27 南京怀萃智能科技有限公司 A kind of method for three-dimensional measurement, device, server and storage medium
CN108876796A (en) * 2018-06-08 2018-11-23 长安大学 A kind of lane segmentation system and method based on full convolutional neural networks and condition random field
CN109102457B (en) * 2018-06-12 2023-01-17 杭州米绘科技有限公司 Intelligent color changing system and method based on convolutional neural network
CN109102457A (en) * 2018-06-12 2018-12-28 杭州米绘科技有限公司 A kind of intelligent color change system and method based on convolutional neural networks
CN108985194A (en) * 2018-06-29 2018-12-11 华南理工大学 A kind of intelligent vehicle based on image, semantic segmentation can travel the recognition methods in region
CN109101914A (en) * 2018-08-01 2018-12-28 北京飞搜科技有限公司 It is a kind of based on multiple dimensioned pedestrian detection method and device
CN109101914B (en) * 2018-08-01 2021-08-20 苏州飞搜科技有限公司 Multi-scale-based pedestrian detection method and device
CN109271990A (en) * 2018-09-03 2019-01-25 北京邮电大学 A kind of semantic segmentation method and device for RGB-D image
CN109325505A (en) * 2018-09-11 2019-02-12 北京陌上花科技有限公司 Example dividing method and device, mobile phone terminal for embedded device
CN109377479A (en) * 2018-09-27 2019-02-22 中国电子科技集团公司第五十四研究所 Satellite dish object detection method based on remote sensing image
CN111104840A (en) * 2018-10-26 2020-05-05 斯特拉德视觉公司 Learning and testing method and device based on regression loss
CN111104840B (en) * 2018-10-26 2024-01-09 斯特拉德视觉公司 Regression loss-based learning and testing method and device
CN109655019A (en) * 2018-10-29 2019-04-19 北方工业大学 Cargo volume measurement method based on deep learning and three-dimensional reconstruction
CN109685762A (en) * 2018-11-09 2019-04-26 五邑大学 A kind of Downtilt measurement method based on multiple dimensioned deep semantic segmentation network
CN109598268A (en) * 2018-11-23 2019-04-09 安徽大学 A kind of RGB-D well-marked target detection method based on single flow depth degree network
CN109598268B (en) * 2018-11-23 2021-08-17 安徽大学 RGB-D (Red Green blue-D) significant target detection method based on single-stream deep network
CN109598728A (en) * 2018-11-30 2019-04-09 腾讯科技(深圳)有限公司 Image partition method, device, diagnostic system and storage medium
CN109934342A (en) * 2018-12-28 2019-06-25 深圳奥比中光科技有限公司 Neural network model training method, depth image restorative procedure and system
CN109934342B (en) * 2018-12-28 2022-12-09 奥比中光科技集团股份有限公司 Neural network model training method, depth image restoration method and system
CN110008808A (en) * 2018-12-29 2019-07-12 北京迈格威科技有限公司 Panorama dividing method, device and system and storage medium
CN109711367A (en) * 2018-12-29 2019-05-03 北京中科寒武纪科技有限公司 Operation method, device and Related product
CN109711413A (en) * 2018-12-30 2019-05-03 陕西师范大学 Image, semantic dividing method based on deep learning
CN109903331B (en) * 2019-01-08 2020-12-22 杭州电子科技大学 Convolutional neural network target detection method based on RGB-D camera
CN109903331A (en) * 2019-01-08 2019-06-18 杭州电子科技大学 A kind of convolutional neural networks object detection method based on RGB-D camera
CN109872357A (en) * 2019-01-16 2019-06-11 创新奇智(广州)科技有限公司 A kind of article arrangement face accounting calculation method, system and electronic equipment
CN109801337B (en) * 2019-01-21 2020-10-02 同济大学 6D pose estimation method based on instance segmentation network and iterative optimization
CN109801337A (en) * 2019-01-21 2019-05-24 同济大学 A kind of 6D position and orientation estimation method of Case-based Reasoning segmentation network and iteration optimization
CN109902572A (en) * 2019-01-24 2019-06-18 哈尔滨理工大学 A kind of vehicle checking method based on deep learning
CN109782771A (en) * 2019-02-26 2019-05-21 西安交通大学 A kind of orchard mobile robot and edge of a field forward method
CN109782771B (en) * 2019-02-26 2021-01-19 西安交通大学 Orchard mobile robot and ground steering method
CN109753903B (en) * 2019-02-27 2020-09-15 北航(四川)西部国际创新港科技有限公司 Unmanned aerial vehicle detection method based on deep learning
CN109753903A (en) * 2019-02-27 2019-05-14 北航(四川)西部国际创新港科技有限公司 A kind of unmanned plane detection method based on deep learning
CN110009648B (en) * 2019-03-04 2023-02-24 东南大学 Roadside image vehicle segmentation method based on depth feature fusion convolutional neural network
CN110009648A (en) * 2019-03-04 2019-07-12 东南大学 Trackside image Method of Vehicle Segmentation based on depth Fusion Features convolutional neural networks
CN110008848A (en) * 2019-03-13 2019-07-12 华南理工大学 A kind of travelable area recognizing method of the road based on binocular stereo vision
CN110070124A (en) * 2019-04-15 2019-07-30 广州小鹏汽车科技有限公司 A kind of image amplification method and system based on production confrontation network
CN110287777A (en) * 2019-05-16 2019-09-27 西北大学 A kind of golden monkey body partitioning algorithm under natural scene
WO2020237693A1 (en) * 2019-05-31 2020-12-03 华南理工大学 Multi-source sensing method and system for water surface unmanned equipment
CN110188780A (en) * 2019-06-03 2019-08-30 电子科技大学中山学院 Method and device for constructing deep learning model for positioning multi-target feature points
CN110263731B (en) * 2019-06-24 2021-03-16 电子科技大学 Single step human face detection system
CN110263731A (en) * 2019-06-24 2019-09-20 电子科技大学 A kind of single step face detection system
CN110348342A (en) * 2019-06-27 2019-10-18 广东技术师范大学天河学院 A kind of piping disease image partition method based on full convolutional network
CN110298330A (en) * 2019-07-05 2019-10-01 东北大学 A kind of detection of transmission line polling robot monocular and localization method
CN110363201A (en) * 2019-07-10 2019-10-22 上海交通大学 Weakly supervised semantic segmentation method and system based on Cooperative Study
CN110363201B (en) * 2019-07-10 2021-06-01 上海交通大学 Weak supervision semantic segmentation method and system based on collaborative learning
CN110399884B (en) * 2019-07-10 2021-08-20 浙江理工大学 Feature fusion self-adaptive anchor frame model vehicle detection method
CN110399884A (en) * 2019-07-10 2019-11-01 浙江理工大学 A kind of adaptive anchor frame model vehicle checking method of Fusion Features
CN110473288B (en) * 2019-08-23 2022-08-05 清华四川能源互联网研究院 Dam model reconstruction method and device and electronic equipment
CN110473288A (en) * 2019-08-23 2019-11-19 清华四川能源互联网研究院 Dam model method for reconstructing, device and electronic equipment
CN112528705A (en) * 2019-09-17 2021-03-19 同方威视技术股份有限公司 Image recognition system and method based on semantics
CN110738132A (en) * 2019-09-23 2020-01-31 中国海洋大学 target detection quality blind evaluation method with discriminant perception capability
CN110738132B (en) * 2019-09-23 2022-06-03 中国海洋大学 Target detection quality blind evaluation method with discriminant perception capability
CN110599538A (en) * 2019-09-30 2019-12-20 山东信通电子股份有限公司 Method and device for identifying icing thickness of transmission line conductor
CN110766096A (en) * 2019-10-31 2020-02-07 北京金山云网络技术有限公司 Video classification method and device and electronic equipment
CN110941995A (en) * 2019-11-01 2020-03-31 中山大学 Real-time target detection and semantic segmentation multi-task learning method based on lightweight network
CN111027493B (en) * 2019-12-13 2022-05-20 电子科技大学 Pedestrian detection method based on deep learning multi-network soft fusion
CN111027493A (en) * 2019-12-13 2020-04-17 电子科技大学 Pedestrian detection method based on deep learning multi-network soft fusion
TWI702536B (en) * 2019-12-31 2020-08-21 財團法人工業技術研究院 Training method and system of object detection model based on adaptive annotation design
US11321590B2 (en) 2019-12-31 2022-05-03 Industrial Technology Research Institute Training method and system of objects detection model based on adaptive annotation design
CN111222468A (en) * 2020-01-08 2020-06-02 浙江光珀智能科技有限公司 People stream detection method and system based on deep learning
CN111368829B (en) * 2020-02-28 2023-06-30 北京理工大学 Visual semantic relation detection method based on RGB-D image
CN111368829A (en) * 2020-02-28 2020-07-03 北京理工大学 Visual semantic relation detection method based on RGB-D image
CN111507985A (en) * 2020-03-19 2020-08-07 北京市威富安防科技有限公司 Image instance segmentation optimization processing method and device and computer equipment
CN111553925A (en) * 2020-04-27 2020-08-18 南通智能感知研究院 End-to-end crop image segmentation method and system based on FCN
CN111783784A (en) * 2020-06-30 2020-10-16 创新奇智(合肥)科技有限公司 Method and device for detecting building cavity, electronic equipment and storage medium
CN112667832B (en) * 2020-12-31 2022-05-13 哈尔滨工业大学 Vision-based mutual positioning method in unknown indoor environment
CN112667832A (en) * 2020-12-31 2021-04-16 哈尔滨工业大学 Vision-based mutual positioning method in unknown indoor environment
CN113033398A (en) * 2021-03-25 2021-06-25 深圳市康冠商用科技有限公司 Gesture recognition method and device, computer equipment and storage medium
CN113763422A (en) * 2021-07-30 2021-12-07 北京交通大学 RGB-D image saliency target detection method
CN113763422B (en) * 2021-07-30 2023-10-03 北京交通大学 RGB-D image saliency target detection method
CN114343579A (en) * 2022-01-05 2022-04-15 浙江大学 Infant iatrogenic skin injury automatic evaluation handheld device
CN114373218B (en) * 2022-03-21 2022-06-14 北京万里红科技有限公司 Method for generating convolution network for detecting living body object
CN114373218A (en) * 2022-03-21 2022-04-19 北京万里红科技有限公司 Method for generating convolution network for detecting living body object

Also Published As

Publication number Publication date
CN106709568B (en) 2019-03-22

Similar Documents

Publication Publication Date Title
CN106709568A (en) RGB-D image object detection and semantic segmentation method based on deep convolution network
CN111291739B (en) Face detection and image detection neural network training method, device and equipment
CN106778835A (en) The airport target by using remote sensing image recognition methods of fusion scene information and depth characteristic
Yang et al. Layered object models for image segmentation
CN110287960A (en) The detection recognition method of curve text in natural scene image
CN110287826B (en) Video target detection method based on attention mechanism
CN108009509A (en) Vehicle target detection method
CN107169974A (en) It is a kind of based on the image partition method for supervising full convolutional neural networks more
CN104156693B (en) A kind of action identification method based on the fusion of multi-modal sequence
CN112528976B (en) Text detection model generation method and text detection method
CN105160310A (en) 3D (three-dimensional) convolutional neural network based human body behavior recognition method
CN109086668A (en) Based on the multiple dimensioned unmanned aerial vehicle remote sensing images road information extracting method for generating confrontation network
CN106688011A (en) Method and system for multi-class object detection
CN111612807A (en) Small target image segmentation method based on scale and edge information
CN107451607A (en) A kind of personal identification method of the typical character based on deep learning
CN105956560A (en) Vehicle model identification method based on pooling multi-scale depth convolution characteristics
CN107784291A (en) target detection tracking method and device based on infrared video
CN107066916A (en) Scene Semantics dividing method based on deconvolution neutral net
CN110008900A (en) A kind of visible remote sensing image candidate target extracting method by region to target
CN107220971A (en) A kind of Lung neoplasm feature extracting method based on convolutional neural networks and PCA
CN107818299A (en) Face recognition algorithms based on fusion HOG features and depth belief network
CN108985145A (en) The Opposite direction connection deep neural network model method of small size road traffic sign detection identification
JP4567660B2 (en) A method for determining a segment of an object in an electronic image.
CN109657634A (en) A kind of 3D gesture identification method and system based on depth convolutional neural networks
CN107564007A (en) The scene cut modification method and system of amalgamation of global information

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20211123

Address after: 518052 Room 201, building A, 1 front Bay Road, Shenzhen Qianhai cooperation zone, Shenzhen, Guangdong

Patentee after: Shenzhen Xiaofeng Technology Co.,Ltd.

Address before: 100124 No. 100 Chaoyang District Ping Tian Park, Beijing

Patentee before: Beijing University of Technology