CN106709568A - RGB-D image object detection and semantic segmentation method based on deep convolution network - Google Patents
RGB-D image object detection and semantic segmentation method based on deep convolution network Download PDFInfo
- Publication number
- CN106709568A CN106709568A CN201611168200.3A CN201611168200A CN106709568A CN 106709568 A CN106709568 A CN 106709568A CN 201611168200 A CN201611168200 A CN 201611168200A CN 106709568 A CN106709568 A CN 106709568A
- Authority
- CN
- China
- Prior art keywords
- frame
- network
- training
- rcnn
- fcn
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses an RGB-D image object detection and semantic segmentation method based on a deep convolution network, which belongs to the field of depth learning and machine vision. According to the method provided by the technical scheme of the invention, Faster-RCNN is used to replace the original slow RCNN; Faster-RCNN uses GPU, which is fast in the aspect of feature extracting, and at the same time generates a regional scheme in the network; the whole training process is training from end to end; FCN is used to carry out RGB-D image semantic segmentation; FCN uses a GPU and the deep convolution network to rapidly extract the deep features of an image; deconvolution is used to fuse deep features and shallow features of the image convolution; and the local semantic information of the image is integrated into the global semantic information.
Description
Technical field
The invention belongs to deep learning and field of machine vision, more particularly to a kind of object detection comprising RGB-D images
With semantic segmentation method, this has application widely in reality scene, for example, pedestrian is examined in monitor video
Survey and tracking, Navigation of Pilotless Aircraft, automatic Pilot etc..
Background technology
Object detection and semantic segmentation are two important research fields of computer vision, object detection mainly for detection of
The position of objects in images and the classification of object, the main of object detection have two tasks, and one is the region side for finding out object
Case (Region Proposals), zone scheme is a kind of pre-selection frame, represents object approximate location in the picture;
Two is that the object in pre-selection frame is classified.The problem that semantic segmentation is solved is that each pixel of image is assigned to just
True label, semantic segmentation is mainly used in scene and understands and there are many potential applications.With deep layer convolutional Neural net
The rise of network, the object detection based on deep layer convolutional neural networks has turned into detection algorithm presently the most prevailing, is equally based on
The semantic segmentation of (Region Proposals) has turned into semantic segmentation algorithm presently the most prevailing.
First, the method for producing Region Proposals relatively more prevailing is as follows.
The method of traditional generation Region Proposals has many kinds, such as selective search (Selective
Search different colours feature (such as hsv color space, Lab color spaces etc.)), according to image merges super-pixel (has phase
Like the adjacent pixel blocks of feature), image under cpu model using selective search produce Region Proposals when
Between be 2s.Multiscale combination is grouped (MCG), it is necessary to merge super-pixel generating region according to contour feature under various zoom scale
The two dimensional characters such as candidate scheme, area, girth, boundary intensity then according to zone scheme carry out ranking.Edge frame
(EdgeBoxes), using the method generating region scheme of sliding window, using marginal information (profile number in frame and with
The profile number that frame edge is overlapped), Region Proposals are ranked up.The above method is all in cpu model
Under carry out.Zone scheme network (Region Proposals Network, abbreviation RPN) can utilize deep layer under GPU patterns
Convolutional neural networks extract the Region Proposals produced while characteristics of image.This causes that the speed of object detection is obtained
Large increase.
2nd, the region deep layer convolutional neural networks for being quickly used for object detection are as follows.
With quick region deep layer convolutional neural networks significantly improving in the speed and accuracy rate of object detection, with
After occur in that many deep layer convolutional neural networks faster, such as Faster-RCNN is made up of two networks, and one is RPN, is used
In Region Proposals are produced, one is quick region deep layer convolutional neural networks, for object identification;YOLO is thing
The selection of body frame is combined with identification, is synchronously completed by primary network, but the area that YOLO is produced to piece image
Domain scheme only has 98, causes the accuracy of object frame than relatively low;SSD is that acquiescence side is produced on each layer of characteristic image
Frame, advantage is the input picture for low resolution, can also produce the frame of pinpoint accuracy, has the disadvantage detecting system to frame
Size is very sensitive, for wisp, detects poor-performing;R-FCN is a kind of object detection network based on FCN, network house
Grader layer is abandoned, full articulamentum has been changed into convolutional layer, core network selection ResNet-101, and propose a kind of to thing
The sensitive mapping method in body position solves the translation changeability of object.
3rd, semantic segmentation network is as follows.
From convolutional neural networks, it replaces full articulamentum using convolutional layer for full convolutional network reorganization.In order to realize image
Semantic segmentation, the method that FCN is used is that one or more deconvolution operation is performed to further feature image so that further feature figure
As size as original image size, each pixel is classified using Softmax graders then, it is realized
Can not consider to be lost during down-sampling for the semantic segmentation of the pixel to pixel end to end of whole pictures, but deconvolution behaviour
True information.SegNet does not take deconvolution to operate, but successively up-sampling operation is performed to further feature image, again such that deep
Then the size of layer characteristic image is classified using Softmax graders as original image size to each pixel,
It considers the distortion information that image loses in convolution process because of down-sampling, but can so bring very big memory consumption.
After DeepLab models add a condition random field (Conditional Random Field, abbreviation CRF) after FCN
Treatment operation, optimizes to the image after segmentation in terms of edge details, but this processing procedure is not to locate end to end
Reason process, in order to solve this problem, CRFasRNN is combined CRF and deep learning technology so that whole network structure
It is a trainable network end to end.
The above research work is concentrated mainly in RGB color image, with the popularization of depth image sensor, for example
Intel RealSense 3D Camera, Asus Xtion PRO LIVE, Microsoft Kinect, increasing research
Person is transferred to research center of gravity on RGB-D images, for example object detection, three-dimensional reconstruction, robot vision, virtual reality, figure
As segmentation etc..Image segmentation is concentrated mainly on the fields such as semantic segmentation, example segmentation, scene label.
Research on RGB-D images, wherein most typically Gupta et al. is fully used on the basis of RCNN
RGB-D image studies object detections, and the semantic segmentation based on super-pixel feature.They propose one kind during object detection
The novel method of converting for depth image being changed into triple channel image, and this triple channel is named as HHA, first by many chis
The method generating region scheme of degree combination packet, is then respectively trained the RCNN of RGB and HHA, merges the two network extractions
Feature, is finally classified using SVMs to each zone scheme.During semantic segmentation, the depth based on super-pixel
Feature (geocentrical attitude) and geometric properties (size, shape) carry out classification mark using SVMs to super-pixel
The prophesy of label, but the method is slowly, and the method generating region scheme being grouped using Multiscale combination is a kind of very slow
Slow process, operating speed is slow and RCNN of redundancy, and training is divided into multiple flow line stages, and calculating super-pixel is characterized in
One complexity and slow process.
The content of the invention
In order to solve the problems, such as the above, replace original slow using Faster-RCNN in the technical scheme that this method is used
Slow RCNN, Faster-RCNN not only possess speed quickly using GPU in extraction characteristic aspect, and in a network can be simultaneously
Generating region scheme, can realize that whole training process is to train end to end, while performing the language of RGB-D images using FCN
Justice segmentation, FCN uses the further feature of GPU and deep layer convolutional network rapid extraction image, using deconvolution operation handlebar image volume
Long-pending further feature and shallow-layer feature is merged, and the local semantic information of image is dissolved into global semantic information.
To achieve these goals, the technical solution adopted by the present invention is the RGB-D images based on deep layer convolutional network
Object detection and semantic segmentation method, on object detection and semantic segmentation task, the content of this method is:
S1, by RGB image calculate gray level image, HHG images are merged into by gray level image and HHA images.Such as Fig. 2 institutes
Show in tri- optical imagerys of the discrete Fourier transform of passage of HHA, there is the discrete fourier clearest differences is that A channel
Conversion, the intensity that it is embodied in the i.e. transverse and longitudinal coordinate axle of DC component is very faint, therefore casts out this passage.Due to tri- passages of RGB
Discrete Fourier transform optical imagery it is all similar and intensity of DC component is also strong, use the gray-scale map of RGB image
As replacing the A channel image in HHA images, so using the triple channel image of fusion RGB image and depth image for HHG schemes
Picture.
S2, using Faster-RCNN as HHG images object detecting system.Using HHG images as network input
Data, Region Proposals are produced by the RPN in Faster-RCNN, and Region is extracted by Fast-RCNN
The feature of Proposals, then classifies to each Region Proposals, and the testing result of this method is, in HHG
Position and the scope of object are marked in image with a rectangle frame, and marks the classification of object in the rectangle frame, thing in such as Fig. 1
Shown in physical examination mapping.
S3, the mechanism for changing non-maxima suppression (Non-Maximum Suppression, abbreviation NMS) reservation frame,
Frame quantity around frame is used as factor of evaluation.As shown in Figure 3.Specific step is as follows:
Each frame is 5 tuple (x1, y1, x2, y2, score), wherein (x1, y1) is the seat in the frame upper left corner
Mark, (x2, y2) is the coordinate of lower right bezel corner, and score is the confidence level comprising object in frame.Frame is first according to score
Value carries out ascending sort to each tuple.It is calculated as follows the double ratio of frame simultaneously (Intersection-over-
Union) Duplication.
Wherein, O(i,j)Represent the double ratio and Duplication, inter of frame i and frame j(i,j)Represent the weight of frame i and frame j
Folded area, area(i)Represent the area of frame i, area(j)Represent the area of frame j.For frame i, statistics's
Quantity SumiIf, Sumi>=δ, casts out frame i, otherwise retains, and n represents frame total quantity, and δ is represented and accepted or rejected threshold value.
S4, the semantic segmentation task that RGB-D images are completed using HHG images and FCN.Using HHG images as the defeated of FCN
Enter data, after FCN extracts semantic feature and classifies, the class label of each pixel, uses label in output HHG images
It is worth as the pixel value of the pixel.The segmentation result of this method shows phase to belong to same category of pixel in HHG images
Same color is as shown in semantic segmentation figure in Fig. 1.
The object detection of this paper and the structural framing of semantic segmentation are as shown in Figure 1.
Brief description of the drawings
The object detection of Fig. 1 RGB-D images and the flow chart and design sketch of semantic segmentation
Fig. 2 HHG images and RGB image, HHA image comparison figures
Nms ' and top2000 comparison diagrams when Fig. 3 reduces frame
Specific embodiment
The present invention is described in further detail below with reference to drawings and Examples.
The present invention will be illustrated from the following aspects:The fusion of RGB image and depth image, the NMS for changing,
The training of model and experimental result.
The object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network comprise the following steps:
Firstth, RGB image and depth image are fused into HHG images according to the method described above;
Secondth, object detecting system model is trained;
The training method of Faster-RCNN has three kinds:One is alternately training (Alternating Training), and two is near
Like joint training (Approximate Joint Training), three is non-approximated joint training (Non-approximate
Joint Training).This method uses alternately training program, and alternately the thinking of training program is to make zone scheme network
With the shared convolutional layer parameters of Fast-RCNN, the parameter for belonging to each automatic network is finely tuned, this scheme trains zone scheme net first
Network, secondly trains Fast-RCNN models according to the zone scheme that zone scheme network is produced, and then uses Fast-RCNN models
Initialization area scheme network.This process can be repeated.
This method uses 4- to walk alternately training program:The first step, uses the pre-training on ImageNet data sets
Model initialization network, trains zone scheme network;Second step, the zone scheme produced using the zone scheme network of the first step
As the pre-detection frame of Fast-RCNN, the model initialization network of the pre-training on ImageNet data sets, training are used
Fast-RCNN.3rd step, the netinit zone scheme network and training network trained using second step, because using
Shared convolutional layer parameter, so only fine setting belongs to the network layer parameter of zone scheme network here.4th step, equally shares convolution
Layer parameter, the netinit Fast-RCNN trained using second step simultaneously finely tunes the network layer parameter for being pertaining only to the network.We
The first step and second step are referred to as the first stage, the 3rd step and the 4th step are referred to as second stage.
The computational methods of this paper loss functions are referred to multitask loss (Multi-task loss) of Faster-RCNN
Formula, formula expression is as follows:
Wherein, piProphesy probability of i-th anchor point frame (Anchors box) comprising object is represented,Represent that ground is true
Value (Ground-Truth) label, if anchor point frame is positive example,If anchor point frame is counter-example,tiRepresent
The coordinate (4 parameters) of prophesy frame i,Represent and the related ground truth frame of positive example anchor point frame.LclsRepresent
Softmax Classification Loss,Represent that frame returns loss,NclsTable
Show block size, N in experimentcls=256.NregRepresent the quantity of anchor point frame, λ represents an equalizing coefficient, taken in experiment λ=
10,Computational methods be referred to Fast-RCNN, formula is as follows:
3rd, semantic segmentation system model is trained
During the training of full convolution, backpropagation equally uses stochastic gradient descent (Stochastic gradient
Descent, abbreviation SGD) backpropagation is carried out, loss is that the Softmax losses of each pixel are sued for peace.Full convolution
Training network (herein only use Vgg-16 networks) be divided into three kinds.The first be after conv7 convolutional layers perform one across
Step (Stride) is 32 deconvolution operation (FCN-32s);Be for second conv7 convolutional layers are performed one stride for 2 it is anti-
Convolution results perform one and merges (average) with pool4 ponds layers result, then this fusion results execution one is striden for
16 deconvolution operation (FCN-16s);The third is that three kinds of results are merged, and these three results are respectively to be held to conv7
Row one stride for 4 deconvolution result, to pool4 ponds layer perform one stride for 2 deconvolution result, pool3 ponds
Layer.Once striden again using this fusion results as 8 deconvolution operates (FCN-8s);Wherein second and the third side
Formula is referred to as the jump framework of full convolutional network.
Full convolutional network training process typically uses a model fine setting FCN-32s network for training, then FCN-
16s is based on the model of FCN-32s models fine setting jump framework, finally finely tunes the model of FCN-8s using the model of FCN-16s.This
Text continues to use this training mode, and the model trained using FCN-8s is tested, and as last experimental result.
4th, experimental result;
Using RGB color image and the fused images of depth image --- HHG images are tested, and are realized using HHG images
The average accuracy of object detection is 37.6% (row of table the 1, the 6th), and the result than Gupta et al. improves 5.1%.
Table 1:1,2,3 row are the experimental results of Gupta et al., and 4-8 row are the experimental results of this method, and wherein nms ' is represented
Be to use the experiment after the non-maxima suppression changed.Experimental result is the percentage of Average Accuracy.
The frame quantity produced by zone scheme network there are about 17000, then process overlap by non-maxima suppression
Frame, be left 2000 to 3000 frames, this process need average time be 0.71s, then once changed again
Non-maxima suppression afterwards is reduced to 2,000 (± 50) by the quantity of frame, and the average time that this process needs is 0.133s.
Wherein the value on δ refer to table 2.When frame quantity is in different zones (between 2050 to 3000) δ value (δ ∈ [8,
13] it is) different, when quantity is less than 2050, the non-maxima suppression changed is not performed, (this when quantity is more than 3000
The situation of kind seldom occurs), take 2000 frames of score values highest.The average accuracy of experimental result is on the basis of HHG images
Improve 1.6% (row of table the 1, the 7th).
Finally use VGG-16 network models as final object detection experimental result, average accuracy is 43.7%
(row of table the 1, the 8th), the experimental result than Gupta et al. improves 11.2%.
Table 2:The first row is represented and processes remaining frame quantity interval by first time non-maxima suppression, and the second row is represented
δ is in different interval values.
It is as shown in table 3 on segmentation result.This method obtains best segmentation using HHG images under FCN-8s networks
As a result, average double ratio and 30.9% is brought up to from the 28.6% of Gupta et al..
Table 3:40 kinds of IU of semantic segmentation label (%), the first row is the semantic segmentation result of Gupta et al., and the second row is arrived
Fourth line is that we use HHG images semantic segmentation result respectively under FCN-32s, FCN-16s, FCN-8s network.
Claims (4)
1. the object detection and semantic segmentation method of the RGB-D images of deep layer convolutional network are based on, it is characterised in that:
S1, by RGB image calculate gray level image, HHG images are merged into by gray level image and HHA images;Tri- passages of HHA
Discrete Fourier transform optical imagery in, have the discrete Fourier transform clearest differences is that A channel, it is embodied in directly
Flow component is that the intensity of transverse and longitudinal coordinate axle is very faint, therefore casts out this passage;Due to tri- discrete Fourier transforms of passage of RGB
Optical imagery it is all similar and intensity of DC component is also strong, in replacing HHA images using the gray level image of RGB image
A channel image, so using fusion RGB image and depth image triple channel image be HHG images;
S2, using Faster-RCNN as HHG images object detecting system;Using HHG images as network input number
According to, Region Proposals are produced by the RPN in Faster-RCNN, Region is extracted by Fast-RCNN
The feature of Proposals, then classifies to each Region Proposals, and the testing result of this method is, in HHG
Position and the scope of object are marked in image with a rectangle frame, and marks the classification of object in the rectangle frame;
S3, change non-maxima suppression are the mechanism that NMS retains frame, using the frame quantity around frame as factor of evaluation;
Specific step is as follows:
Each frame is 5 tuple (x1, y1, x2, y2, score), wherein (x1, y1) is the coordinate in the frame upper left corner,
(x2, y2) is the coordinate of lower right bezel corner, and score is the confidence level comprising object in frame;It is right that frame is first according to score values
Each tuple carries out ascending sort;It is calculated as follows the double ratio and Duplication of frame;
Wherein, O(i,j)Represent the double ratio and Duplication, inter of frame i and frame j(i,j)Represent the faying surface of frame i and frame j
Product, area(i)Represent the area of frame i, area(j)Represent the area of frame j;For frame i, statisticsQuantity
SumiIf, Sumi>=δ, casts out frame i, otherwise retains, and n represents frame total quantity, and δ is represented and accepted or rejected threshold value;
S4, the semantic segmentation task that RGB-D images are completed using HHG images and FCN;Using HHG images as FCN input number
According to after FCN extracts semantic feature and classifies, the class label of each pixel, is made with label value in output HHG images
It is the pixel value of the pixel.
2. the object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network according to claim 1,
It is characterized in that:
The training method of Faster-RCNN has three kinds:One is alternately training, and two is approximate joint training, and three is non-approximated joint
Training;This method uses alternately training program, and alternately the thinking of training program is to make zone scheme network and Fast-RCNN
Shared convolutional layer parameter, fine setting belongs to the parameter of each automatic network, and this scheme trains zone scheme network first, secondly according to area
The zone scheme training Fast-RCNN models that domain scheme network is produced, then use Fast-RCNN model initialization zone schemes
Network;This process can be repeated.
3. the object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network according to claim 2,
It is characterized in that:
This method uses 4- to walk alternately training program:The first step, uses the model of the pre-training on ImageNet data sets
Initialization network, trains zone scheme network;Second step, using the first step zone scheme network produce zone scheme as
The pre-detection frame of Fast-RCNN, using the model initialization network of the pre-training on ImageNet data sets, trains Fast-
RCNN;3rd step, the netinit zone scheme network and training network trained using second step, because having used shared volume
Lamination parameter, so only fine setting belongs to the network layer parameter of zone scheme network here;4th step, equally shared convolutional layer ginseng
Number, the netinit Fast-RCNN trained using second step simultaneously finely tunes the network layer parameter for being pertaining only to the network;We are
One step and second step are referred to as the first stage, and the 3rd step and the 4th step are referred to as second stage;
The computational methods of this paper loss functions are referred to the multitask loss formula of Faster-RCNN, and formula expression is as follows:
Wherein, piI-th prophesy probability of the anchor point frame comprising object is represented,Ground truth label is represented, if anchor point side
Frame is positive example,If anchor point frame is counter-example,tiThe coordinate (4 parameters) of prophesy frame i is represented,Represent
With the related ground truth frame of positive example anchor point frame;LclsSoftmax Classification Loss is represented,Represent that frame is returned
Loss,NclsRepresent block size, N in experimentcls=256;NregRepresent anchor
The quantity of frame is put, λ represents an equalizing coefficient, λ=10 are taken in experiment,Computational methods be referred to Fast-
RCNN, formula is as follows:
4. the object detection and semantic segmentation method of the RGB-D images based on deep layer convolutional network according to claim 1,
It is characterized in that:
During the training of full convolution, backpropagation equally carries out backpropagation using stochastic gradient descent, and loss is to each picture
The Softmax losses of vegetarian refreshments are sued for peace;The training network of full convolution is divided into three kinds;The first is held after conv7 convolutional layers
Row one strides as 32 deconvolution operates FCN-32s;Be for second conv7 convolutional layers are performed one stride for 2 warp
Product result is merged for one with the execution of pool4 ponds layers result, then this fusion results execution one is striden for 16 warp
Product operation FCN-16s;The third is that three kinds of results are merged, these three results be respectively to conv7 perform one stride
Be 4 deconvolution result, to pool4 ponds layer perform one stride for 2 deconvolution result, pool3 ponds layer;Using this
Fusion results are once striden as 8 deconvolution operates FCN-8s again;It is referred to as full volume with the third mode wherein second
The jump framework of product network;
Full convolutional network training process typically uses a model fine setting FCN-32s network for training, then FCN-16s bases
In the model of FCN-32s models fine setting jump framework, finally the model of FCN-8s is finely tuned using the model of FCN-16s;This paper edges
This training mode is used, the model trained using FCN-8s is tested, and as last experimental result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611168200.3A CN106709568B (en) | 2016-12-16 | 2016-12-16 | The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611168200.3A CN106709568B (en) | 2016-12-16 | 2016-12-16 | The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106709568A true CN106709568A (en) | 2017-05-24 |
CN106709568B CN106709568B (en) | 2019-03-22 |
Family
ID=58938969
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611168200.3A Active CN106709568B (en) | 2016-12-16 | 2016-12-16 | The object detection and semantic segmentation method of RGB-D image based on deep layer convolutional network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106709568B (en) |
Cited By (105)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103613A (en) * | 2017-03-28 | 2017-08-29 | 深圳市未来媒体技术研究院 | A kind of three-dimension gesture Attitude estimation method |
CN107392189A (en) * | 2017-09-05 | 2017-11-24 | 百度在线网络技术(北京)有限公司 | For the method and apparatus for the driving behavior for determining unmanned vehicle |
CN107437099A (en) * | 2017-08-03 | 2017-12-05 | 哈尔滨工业大学 | A kind of specific dress ornament image recognition and detection method based on machine learning |
CN107507126A (en) * | 2017-07-27 | 2017-12-22 | 大连和创懒人科技有限公司 | A kind of method that 3D scenes are reduced using RGB image |
CN107545263A (en) * | 2017-08-02 | 2018-01-05 | 清华大学 | A kind of object detecting method and device |
CN107564025A (en) * | 2017-08-09 | 2018-01-09 | 浙江大学 | A kind of power equipment infrared image semantic segmentation method based on deep neural network |
CN107563290A (en) * | 2017-08-01 | 2018-01-09 | 中国农业大学 | A kind of pedestrian detection method and device based on image |
CN107563381A (en) * | 2017-09-12 | 2018-01-09 | 国家新闻出版广电总局广播科学研究院 | The object detection method of multiple features fusion based on full convolutional network |
CN107563284A (en) * | 2017-07-26 | 2018-01-09 | 中国农业大学 | Pedestrian's method for tracing and device |
CN107563405A (en) * | 2017-07-19 | 2018-01-09 | 同济大学 | Garage automatic Pilot semantic objects recognition methods based on multiresolution neutral net |
CN107563372A (en) * | 2017-07-20 | 2018-01-09 | 济南中维世纪科技有限公司 | A kind of license plate locating method based on deep learning SSD frameworks |
CN107578436A (en) * | 2017-08-02 | 2018-01-12 | 南京邮电大学 | A kind of monocular image depth estimation method based on full convolutional neural networks FCN |
CN107657257A (en) * | 2017-08-14 | 2018-02-02 | 中国矿业大学 | A kind of semantic image dividing method based on multichannel convolutive neutral net |
CN107680113A (en) * | 2017-10-27 | 2018-02-09 | 武汉大学 | The image partition method of multi-layer segmentation network based on Bayesian frame edge prior |
CN107680109A (en) * | 2017-09-15 | 2018-02-09 | 盐城禅图智能科技有限公司 | It is a kind of to quote inverse notice and the image, semantic dividing method of pixel similarity study |
CN107688784A (en) * | 2017-08-23 | 2018-02-13 | 福建六壬网安股份有限公司 | A kind of character identifying method and storage medium based on further feature and shallow-layer Fusion Features |
CN107742311A (en) * | 2017-09-29 | 2018-02-27 | 北京易达图灵科技有限公司 | A kind of method and device of vision positioning |
CN107808131A (en) * | 2017-10-23 | 2018-03-16 | 华南理工大学 | Dynamic gesture identification method based on binary channel depth convolutional neural networks |
CN107862674A (en) * | 2017-11-08 | 2018-03-30 | 杭州测度科技有限公司 | Depth image fusion method and system |
CN107886477A (en) * | 2017-09-20 | 2018-04-06 | 武汉环宇智行科技有限公司 | Unmanned neutral body vision merges antidote with low line beam laser radar |
CN107886117A (en) * | 2017-10-30 | 2018-04-06 | 国家新闻出版广电总局广播科学研究院 | The algorithm of target detection merged based on multi-feature extraction and multitask |
CN107908635A (en) * | 2017-09-26 | 2018-04-13 | 百度在线网络技术(北京)有限公司 | Establish textual classification model and the method, apparatus of text classification |
CN107944457A (en) * | 2017-11-23 | 2018-04-20 | 浙江清华长三角研究院 | Drawing object identification and extracting method under a kind of complex scene |
CN108009481A (en) * | 2017-11-22 | 2018-05-08 | 浙江大华技术股份有限公司 | A kind of training method and device of CNN models, face identification method and device |
CN108090442A (en) * | 2017-12-15 | 2018-05-29 | 四川大学 | A kind of airport scene monitoring method based on convolutional neural networks |
CN108171141A (en) * | 2017-12-25 | 2018-06-15 | 淮阴工学院 | The video target tracking method of cascade multi-pattern Fusion based on attention model |
CN108182428A (en) * | 2018-01-31 | 2018-06-19 | 福州大学 | The method that front truck state recognition and vehicle follow |
CN108320286A (en) * | 2018-02-28 | 2018-07-24 | 苏州大学 | Image significance detection method, system, equipment and computer readable storage medium |
CN108334955A (en) * | 2018-03-01 | 2018-07-27 | 福州大学 | Copy of ID Card detection method based on Faster-RCNN |
CN108345887A (en) * | 2018-01-29 | 2018-07-31 | 清华大学深圳研究生院 | The training method and image, semantic dividing method of image, semantic parted pattern |
CN108399361A (en) * | 2018-01-23 | 2018-08-14 | 南京邮电大学 | A kind of pedestrian detection method based on convolutional neural networks CNN and semantic segmentation |
CN108446662A (en) * | 2018-04-02 | 2018-08-24 | 电子科技大学 | A kind of pedestrian detection method based on semantic segmentation information |
CN108491803A (en) * | 2018-03-27 | 2018-09-04 | 北京中硕众联智能电子科技有限公司 | A kind of device and the corresponding recognition methods of identification objects in images and temperature |
CN108520219A (en) * | 2018-03-30 | 2018-09-11 | 台州智必安科技有限责任公司 | A kind of multiple dimensioned fast face detecting method of convolutional neural networks Fusion Features |
CN108537292A (en) * | 2018-04-10 | 2018-09-14 | 上海白泽网络科技有限公司 | Semantic segmentation network training method, image, semantic dividing method and device |
CN108596102A (en) * | 2018-04-26 | 2018-09-28 | 北京航空航天大学青岛研究院 | Indoor scene object segmentation grader building method based on RGB-D |
CN108596240A (en) * | 2018-04-20 | 2018-09-28 | 华中科技大学 | A kind of image, semantic dividing method based on differentiation character network |
CN108647562A (en) * | 2018-03-27 | 2018-10-12 | 北京中硕众联智能电子科技有限公司 | Identify device and the corresponding method of objects in images and temperature |
CN108664974A (en) * | 2018-04-03 | 2018-10-16 | 华南理工大学 | A kind of semantic segmentation method based on RGBD images Yu Complete Disability difference network |
CN108710863A (en) * | 2018-05-24 | 2018-10-26 | 东北大学 | Unmanned plane Scene Semantics dividing method based on deep learning and system |
CN108734694A (en) * | 2018-04-09 | 2018-11-02 | 华南农业大学 | Thyroid tumors ultrasonoscopy automatic identifying method based on faster r-cnn |
CN108876793A (en) * | 2018-04-13 | 2018-11-23 | 北京迈格威科技有限公司 | Semantic segmentation methods, devices and systems and storage medium |
CN108875588A (en) * | 2018-05-25 | 2018-11-23 | 武汉大学 | Across camera pedestrian detection tracking based on deep learning |
CN108876796A (en) * | 2018-06-08 | 2018-11-23 | 长安大学 | A kind of lane segmentation system and method based on full convolutional neural networks and condition random field |
CN108895981A (en) * | 2018-05-29 | 2018-11-27 | 南京怀萃智能科技有限公司 | A kind of method for three-dimensional measurement, device, server and storage medium |
CN108985194A (en) * | 2018-06-29 | 2018-12-11 | 华南理工大学 | A kind of intelligent vehicle based on image, semantic segmentation can travel the recognition methods in region |
WO2018232592A1 (en) * | 2017-06-20 | 2018-12-27 | Microsoft Technology Licensing, Llc. | Fully convolutional instance-aware semantic segmentation |
CN109102457A (en) * | 2018-06-12 | 2018-12-28 | 杭州米绘科技有限公司 | A kind of intelligent color change system and method based on convolutional neural networks |
CN109101914A (en) * | 2018-08-01 | 2018-12-28 | 北京飞搜科技有限公司 | It is a kind of based on multiple dimensioned pedestrian detection method and device |
CN109271990A (en) * | 2018-09-03 | 2019-01-25 | 北京邮电大学 | A kind of semantic segmentation method and device for RGB-D image |
CN109325505A (en) * | 2018-09-11 | 2019-02-12 | 北京陌上花科技有限公司 | Example dividing method and device, mobile phone terminal for embedded device |
CN109325385A (en) * | 2017-07-31 | 2019-02-12 | 株式会社理光 | Target detection and region segmentation method, device and computer readable storage medium |
CN109377479A (en) * | 2018-09-27 | 2019-02-22 | 中国电子科技集团公司第五十四研究所 | Satellite dish object detection method based on remote sensing image |
CN109564684A (en) * | 2018-01-15 | 2019-04-02 | 深圳鲲云信息科技有限公司 | Image, semantic dividing method, programmable logic circuit, system and electronic equipment |
CN109598268A (en) * | 2018-11-23 | 2019-04-09 | 安徽大学 | A kind of RGB-D well-marked target detection method based on single flow depth degree network |
CN109598728A (en) * | 2018-11-30 | 2019-04-09 | 腾讯科技(深圳)有限公司 | Image partition method, device, diagnostic system and storage medium |
CN109655019A (en) * | 2018-10-29 | 2019-04-19 | 北方工业大学 | Cargo volume measurement method based on deep learning and three-dimensional reconstruction |
CN109685762A (en) * | 2018-11-09 | 2019-04-26 | 五邑大学 | A kind of Downtilt measurement method based on multiple dimensioned deep semantic segmentation network |
CN109711413A (en) * | 2018-12-30 | 2019-05-03 | 陕西师范大学 | Image, semantic dividing method based on deep learning |
CN109711367A (en) * | 2018-12-29 | 2019-05-03 | 北京中科寒武纪科技有限公司 | Operation method, device and Related product |
CN109753903A (en) * | 2019-02-27 | 2019-05-14 | 北航(四川)西部国际创新港科技有限公司 | A kind of unmanned plane detection method based on deep learning |
CN109782771A (en) * | 2019-02-26 | 2019-05-21 | 西安交通大学 | A kind of orchard mobile robot and edge of a field forward method |
CN109801337A (en) * | 2019-01-21 | 2019-05-24 | 同济大学 | A kind of 6D position and orientation estimation method of Case-based Reasoning segmentation network and iteration optimization |
CN109870983A (en) * | 2017-12-04 | 2019-06-11 | 北京京东尚科信息技术有限公司 | Handle the method, apparatus of pallet stacking image and the system for picking of storing in a warehouse |
CN109872357A (en) * | 2019-01-16 | 2019-06-11 | 创新奇智(广州)科技有限公司 | A kind of article arrangement face accounting calculation method, system and electronic equipment |
CN109903331A (en) * | 2019-01-08 | 2019-06-18 | 杭州电子科技大学 | A kind of convolutional neural networks object detection method based on RGB-D camera |
CN109902572A (en) * | 2019-01-24 | 2019-06-18 | 哈尔滨理工大学 | A kind of vehicle checking method based on deep learning |
CN109934342A (en) * | 2018-12-28 | 2019-06-25 | 深圳奥比中光科技有限公司 | Neural network model training method, depth image restorative procedure and system |
CN110009648A (en) * | 2019-03-04 | 2019-07-12 | 东南大学 | Trackside image Method of Vehicle Segmentation based on depth Fusion Features convolutional neural networks |
CN110008848A (en) * | 2019-03-13 | 2019-07-12 | 华南理工大学 | A kind of travelable area recognizing method of the road based on binocular stereo vision |
CN110008808A (en) * | 2018-12-29 | 2019-07-12 | 北京迈格威科技有限公司 | Panorama dividing method, device and system and storage medium |
CN110070124A (en) * | 2019-04-15 | 2019-07-30 | 广州小鹏汽车科技有限公司 | A kind of image amplification method and system based on production confrontation network |
CN110084257A (en) * | 2018-01-26 | 2019-08-02 | 北京京东尚科信息技术有限公司 | Method and apparatus for detecting target |
CN110188780A (en) * | 2019-06-03 | 2019-08-30 | 电子科技大学中山学院 | Method and device for constructing deep learning model for positioning multi-target feature points |
CN110263731A (en) * | 2019-06-24 | 2019-09-20 | 电子科技大学 | A kind of single step face detection system |
CN110287777A (en) * | 2019-05-16 | 2019-09-27 | 西北大学 | A kind of golden monkey body partitioning algorithm under natural scene |
CN110298330A (en) * | 2019-07-05 | 2019-10-01 | 东北大学 | A kind of detection of transmission line polling robot monocular and localization method |
CN110348342A (en) * | 2019-06-27 | 2019-10-18 | 广东技术师范大学天河学院 | A kind of piping disease image partition method based on full convolutional network |
CN110363201A (en) * | 2019-07-10 | 2019-10-22 | 上海交通大学 | Weakly supervised semantic segmentation method and system based on Cooperative Study |
CN110388931A (en) * | 2018-04-17 | 2019-10-29 | 百度(美国)有限责任公司 | The two-dimentional bounding box of object is converted into the method for the three-dimensional position of automatic driving vehicle |
CN110399884A (en) * | 2019-07-10 | 2019-11-01 | 浙江理工大学 | A kind of adaptive anchor frame model vehicle checking method of Fusion Features |
CN110473288A (en) * | 2019-08-23 | 2019-11-19 | 清华四川能源互联网研究院 | Dam model method for reconstructing, device and electronic equipment |
CN110599538A (en) * | 2019-09-30 | 2019-12-20 | 山东信通电子股份有限公司 | Method and device for identifying icing thickness of transmission line conductor |
CN110738132A (en) * | 2019-09-23 | 2020-01-31 | 中国海洋大学 | target detection quality blind evaluation method with discriminant perception capability |
CN110766096A (en) * | 2019-10-31 | 2020-02-07 | 北京金山云网络技术有限公司 | Video classification method and device and electronic equipment |
US10600167B2 (en) | 2017-01-18 | 2020-03-24 | Nvidia Corporation | Performing spatiotemporal filtering |
CN110941995A (en) * | 2019-11-01 | 2020-03-31 | 中山大学 | Real-time target detection and semantic segmentation multi-task learning method based on lightweight network |
CN111027493A (en) * | 2019-12-13 | 2020-04-17 | 电子科技大学 | Pedestrian detection method based on deep learning multi-network soft fusion |
CN111104840A (en) * | 2018-10-26 | 2020-05-05 | 斯特拉德视觉公司 | Learning and testing method and device based on regression loss |
CN111222468A (en) * | 2020-01-08 | 2020-06-02 | 浙江光珀智能科技有限公司 | People stream detection method and system based on deep learning |
CN111368829A (en) * | 2020-02-28 | 2020-07-03 | 北京理工大学 | Visual semantic relation detection method based on RGB-D image |
CN111507985A (en) * | 2020-03-19 | 2020-08-07 | 北京市威富安防科技有限公司 | Image instance segmentation optimization processing method and device and computer equipment |
CN111553925A (en) * | 2020-04-27 | 2020-08-18 | 南通智能感知研究院 | End-to-end crop image segmentation method and system based on FCN |
TWI702536B (en) * | 2019-12-31 | 2020-08-21 | 財團法人工業技術研究院 | Training method and system of object detection model based on adaptive annotation design |
CN111712853A (en) * | 2018-02-16 | 2020-09-25 | 松下知识产权经营株式会社 | Processing method and processing device using the same |
CN111783784A (en) * | 2020-06-30 | 2020-10-16 | 创新奇智(合肥)科技有限公司 | Method and device for detecting building cavity, electronic equipment and storage medium |
WO2020237693A1 (en) * | 2019-05-31 | 2020-12-03 | 华南理工大学 | Multi-source sensing method and system for water surface unmanned equipment |
CN112528705A (en) * | 2019-09-17 | 2021-03-19 | 同方威视技术股份有限公司 | Image recognition system and method based on semantics |
CN112667832A (en) * | 2020-12-31 | 2021-04-16 | 哈尔滨工业大学 | Vision-based mutual positioning method in unknown indoor environment |
CN113033398A (en) * | 2021-03-25 | 2021-06-25 | 深圳市康冠商用科技有限公司 | Gesture recognition method and device, computer equipment and storage medium |
CN113763422A (en) * | 2021-07-30 | 2021-12-07 | 北京交通大学 | RGB-D image saliency target detection method |
CN114343579A (en) * | 2022-01-05 | 2022-04-15 | 浙江大学 | Infant iatrogenic skin injury automatic evaluation handheld device |
CN114373218A (en) * | 2022-03-21 | 2022-04-19 | 北京万里红科技有限公司 | Method for generating convolution network for detecting living body object |
TWI794414B (en) * | 2018-02-21 | 2023-03-01 | 德商羅伯特博斯奇股份有限公司 | Systems and methods for real-time object detection using depth sensors |
US11651229B2 (en) | 2017-11-22 | 2023-05-16 | Zhejiang Dahua Technology Co., Ltd. | Methods and systems for face recognition |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101478709B1 (en) * | 2012-06-27 | 2015-01-05 | 한국과학기술원 | Method and apparatus for extracting and generating feature point and feature descriptor rgb-d image |
WO2016045711A1 (en) * | 2014-09-23 | 2016-03-31 | Keylemon Sa | A face pose rectification method and apparatus |
CN106204165A (en) * | 2016-08-11 | 2016-12-07 | 广州出益信息科技有限公司 | A kind of advertisement placement method and device |
CN106203506A (en) * | 2016-07-11 | 2016-12-07 | 上海凌科智能科技有限公司 | A kind of pedestrian detection method based on degree of depth learning art |
-
2016
- 2016-12-16 CN CN201611168200.3A patent/CN106709568B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101478709B1 (en) * | 2012-06-27 | 2015-01-05 | 한국과학기술원 | Method and apparatus for extracting and generating feature point and feature descriptor rgb-d image |
WO2016045711A1 (en) * | 2014-09-23 | 2016-03-31 | Keylemon Sa | A face pose rectification method and apparatus |
CN106203506A (en) * | 2016-07-11 | 2016-12-07 | 上海凌科智能科技有限公司 | A kind of pedestrian detection method based on degree of depth learning art |
CN106204165A (en) * | 2016-08-11 | 2016-12-07 | 广州出益信息科技有限公司 | A kind of advertisement placement method and device |
Non-Patent Citations (4)
Title |
---|
JONATHAN LONG,EVAN SHELHAMER,TREVOR DARRELL: ""Fully Convolutional Networks for Semantic Segmentation"", 《2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 * |
SAURABH GUPTA等: ""Aligning 3D Models to RGB-D Images of Cluttered Scenes"", 《2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 * |
SAURABH GUPTA等: ""Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images"", 《2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 * |
蔡强等: ""基于ANNet网络的RGB-D图像的目标检测"", 《系统仿真学报》 * |
Cited By (151)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11113800B2 (en) | 2017-01-18 | 2021-09-07 | Nvidia Corporation | Filtering image data using a neural network |
US10600167B2 (en) | 2017-01-18 | 2020-03-24 | Nvidia Corporation | Performing spatiotemporal filtering |
CN107103613A (en) * | 2017-03-28 | 2017-08-29 | 深圳市未来媒体技术研究院 | A kind of three-dimension gesture Attitude estimation method |
CN107103613B (en) * | 2017-03-28 | 2019-11-15 | 深圳市未来媒体技术研究院 | A kind of three-dimension gesture Attitude estimation method |
WO2018232592A1 (en) * | 2017-06-20 | 2018-12-27 | Microsoft Technology Licensing, Llc. | Fully convolutional instance-aware semantic segmentation |
CN107563405A (en) * | 2017-07-19 | 2018-01-09 | 同济大学 | Garage automatic Pilot semantic objects recognition methods based on multiresolution neutral net |
CN107563372B (en) * | 2017-07-20 | 2021-01-29 | 济南中维世纪科技有限公司 | License plate positioning method based on deep learning SSD frame |
CN107563372A (en) * | 2017-07-20 | 2018-01-09 | 济南中维世纪科技有限公司 | A kind of license plate locating method based on deep learning SSD frameworks |
CN107563284A (en) * | 2017-07-26 | 2018-01-09 | 中国农业大学 | Pedestrian's method for tracing and device |
CN107507126B (en) * | 2017-07-27 | 2020-09-18 | 和创懒人(大连)科技有限公司 | Method for restoring 3D scene by using RGB image |
CN107507126A (en) * | 2017-07-27 | 2017-12-22 | 大连和创懒人科技有限公司 | A kind of method that 3D scenes are reduced using RGB image |
CN109325385A (en) * | 2017-07-31 | 2019-02-12 | 株式会社理光 | Target detection and region segmentation method, device and computer readable storage medium |
CN107563290A (en) * | 2017-08-01 | 2018-01-09 | 中国农业大学 | A kind of pedestrian detection method and device based on image |
CN107578436A (en) * | 2017-08-02 | 2018-01-12 | 南京邮电大学 | A kind of monocular image depth estimation method based on full convolutional neural networks FCN |
CN107545263B (en) * | 2017-08-02 | 2020-12-15 | 清华大学 | Object detection method and device |
CN107545263A (en) * | 2017-08-02 | 2018-01-05 | 清华大学 | A kind of object detecting method and device |
CN107437099A (en) * | 2017-08-03 | 2017-12-05 | 哈尔滨工业大学 | A kind of specific dress ornament image recognition and detection method based on machine learning |
CN107564025A (en) * | 2017-08-09 | 2018-01-09 | 浙江大学 | A kind of power equipment infrared image semantic segmentation method based on deep neural network |
CN107564025B (en) * | 2017-08-09 | 2020-05-29 | 浙江大学 | Electric power equipment infrared image semantic segmentation method based on deep neural network |
CN107657257A (en) * | 2017-08-14 | 2018-02-02 | 中国矿业大学 | A kind of semantic image dividing method based on multichannel convolutive neutral net |
CN107688784A (en) * | 2017-08-23 | 2018-02-13 | 福建六壬网安股份有限公司 | A kind of character identifying method and storage medium based on further feature and shallow-layer Fusion Features |
WO2019047655A1 (en) * | 2017-09-05 | 2019-03-14 | 百度在线网络技术(北京)有限公司 | Method and apparatus for use in determining driving behavior of driverless vehicle |
CN107392189A (en) * | 2017-09-05 | 2017-11-24 | 百度在线网络技术(北京)有限公司 | For the method and apparatus for the driving behavior for determining unmanned vehicle |
CN107563381A (en) * | 2017-09-12 | 2018-01-09 | 国家新闻出版广电总局广播科学研究院 | The object detection method of multiple features fusion based on full convolutional network |
CN107563381B (en) * | 2017-09-12 | 2020-10-23 | 国家新闻出版广电总局广播科学研究院 | Multi-feature fusion target detection method based on full convolution network |
CN107680109A (en) * | 2017-09-15 | 2018-02-09 | 盐城禅图智能科技有限公司 | It is a kind of to quote inverse notice and the image, semantic dividing method of pixel similarity study |
CN107886477A (en) * | 2017-09-20 | 2018-04-06 | 武汉环宇智行科技有限公司 | Unmanned neutral body vision merges antidote with low line beam laser radar |
US10783331B2 (en) | 2017-09-26 | 2020-09-22 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and apparatus for building text classification model, and text classification method and apparatus |
CN107908635B (en) * | 2017-09-26 | 2021-04-16 | 百度在线网络技术(北京)有限公司 | Method and device for establishing text classification model and text classification |
CN107908635A (en) * | 2017-09-26 | 2018-04-13 | 百度在线网络技术(北京)有限公司 | Establish textual classification model and the method, apparatus of text classification |
CN107742311A (en) * | 2017-09-29 | 2018-02-27 | 北京易达图灵科技有限公司 | A kind of method and device of vision positioning |
CN107742311B (en) * | 2017-09-29 | 2020-02-18 | 北京易达图灵科技有限公司 | Visual positioning method and device |
CN107808131A (en) * | 2017-10-23 | 2018-03-16 | 华南理工大学 | Dynamic gesture identification method based on binary channel depth convolutional neural networks |
CN107808131B (en) * | 2017-10-23 | 2019-12-10 | 华南理工大学 | Dynamic gesture recognition method based on dual-channel deep convolutional neural network |
CN107680113A (en) * | 2017-10-27 | 2018-02-09 | 武汉大学 | The image partition method of multi-layer segmentation network based on Bayesian frame edge prior |
CN107886117A (en) * | 2017-10-30 | 2018-04-06 | 国家新闻出版广电总局广播科学研究院 | The algorithm of target detection merged based on multi-feature extraction and multitask |
CN107862674B (en) * | 2017-11-08 | 2020-07-03 | 杭州测度科技有限公司 | Depth image fusion method and system |
CN107862674A (en) * | 2017-11-08 | 2018-03-30 | 杭州测度科技有限公司 | Depth image fusion method and system |
CN108009481A (en) * | 2017-11-22 | 2018-05-08 | 浙江大华技术股份有限公司 | A kind of training method and device of CNN models, face identification method and device |
US11651229B2 (en) | 2017-11-22 | 2023-05-16 | Zhejiang Dahua Technology Co., Ltd. | Methods and systems for face recognition |
CN107944457A (en) * | 2017-11-23 | 2018-04-20 | 浙江清华长三角研究院 | Drawing object identification and extracting method under a kind of complex scene |
CN109870983B (en) * | 2017-12-04 | 2022-01-04 | 北京京东尚科信息技术有限公司 | Method and device for processing tray stack image and system for warehousing goods picking |
CN109870983A (en) * | 2017-12-04 | 2019-06-11 | 北京京东尚科信息技术有限公司 | Handle the method, apparatus of pallet stacking image and the system for picking of storing in a warehouse |
CN108090442A (en) * | 2017-12-15 | 2018-05-29 | 四川大学 | A kind of airport scene monitoring method based on convolutional neural networks |
CN108171141B (en) * | 2017-12-25 | 2020-07-14 | 淮阴工学院 | Attention model-based cascaded multi-mode fusion video target tracking method |
CN108171141A (en) * | 2017-12-25 | 2018-06-15 | 淮阴工学院 | The video target tracking method of cascade multi-pattern Fusion based on attention model |
WO2019136760A1 (en) * | 2018-01-15 | 2019-07-18 | 深圳鲲云信息科技有限公司 | Image semantic segmentation method, programmable logic circuit, system and electronic device |
US11636665B2 (en) | 2018-01-15 | 2023-04-25 | Shenzhen Corerain Technologies Co., Ltd. | Streaming image semantic segmentation method, logical integrated circuit system and electronic device |
CN109564684B (en) * | 2018-01-15 | 2023-05-02 | 深圳鲲云信息科技有限公司 | Image semantic segmentation method, programmable logic circuit, system and electronic equipment |
CN109564684A (en) * | 2018-01-15 | 2019-04-02 | 深圳鲲云信息科技有限公司 | Image, semantic dividing method, programmable logic circuit, system and electronic equipment |
CN108399361A (en) * | 2018-01-23 | 2018-08-14 | 南京邮电大学 | A kind of pedestrian detection method based on convolutional neural networks CNN and semantic segmentation |
CN110084257A (en) * | 2018-01-26 | 2019-08-02 | 北京京东尚科信息技术有限公司 | Method and apparatus for detecting target |
CN108345887A (en) * | 2018-01-29 | 2018-07-31 | 清华大学深圳研究生院 | The training method and image, semantic dividing method of image, semantic parted pattern |
CN108345887B (en) * | 2018-01-29 | 2020-10-02 | 清华大学深圳研究生院 | Training method of image semantic segmentation model and image semantic segmentation method |
CN108182428A (en) * | 2018-01-31 | 2018-06-19 | 福州大学 | The method that front truck state recognition and vehicle follow |
CN111712853A (en) * | 2018-02-16 | 2020-09-25 | 松下知识产权经营株式会社 | Processing method and processing device using the same |
CN111712853B (en) * | 2018-02-16 | 2023-11-07 | 松下知识产权经营株式会社 | Processing method and processing device using same |
TWI794414B (en) * | 2018-02-21 | 2023-03-01 | 德商羅伯特博斯奇股份有限公司 | Systems and methods for real-time object detection using depth sensors |
CN108320286A (en) * | 2018-02-28 | 2018-07-24 | 苏州大学 | Image significance detection method, system, equipment and computer readable storage medium |
CN108334955A (en) * | 2018-03-01 | 2018-07-27 | 福州大学 | Copy of ID Card detection method based on Faster-RCNN |
CN108647562A (en) * | 2018-03-27 | 2018-10-12 | 北京中硕众联智能电子科技有限公司 | Identify device and the corresponding method of objects in images and temperature |
CN108491803A (en) * | 2018-03-27 | 2018-09-04 | 北京中硕众联智能电子科技有限公司 | A kind of device and the corresponding recognition methods of identification objects in images and temperature |
CN108520219A (en) * | 2018-03-30 | 2018-09-11 | 台州智必安科技有限责任公司 | A kind of multiple dimensioned fast face detecting method of convolutional neural networks Fusion Features |
CN108446662A (en) * | 2018-04-02 | 2018-08-24 | 电子科技大学 | A kind of pedestrian detection method based on semantic segmentation information |
CN108664974A (en) * | 2018-04-03 | 2018-10-16 | 华南理工大学 | A kind of semantic segmentation method based on RGBD images Yu Complete Disability difference network |
CN108734694A (en) * | 2018-04-09 | 2018-11-02 | 华南农业大学 | Thyroid tumors ultrasonoscopy automatic identifying method based on faster r-cnn |
CN108537292A (en) * | 2018-04-10 | 2018-09-14 | 上海白泽网络科技有限公司 | Semantic segmentation network training method, image, semantic dividing method and device |
CN108537292B (en) * | 2018-04-10 | 2020-07-31 | 上海白泽网络科技有限公司 | Semantic segmentation network training method, image semantic segmentation method and device |
CN108876793A (en) * | 2018-04-13 | 2018-11-23 | 北京迈格威科技有限公司 | Semantic segmentation methods, devices and systems and storage medium |
CN110388931B (en) * | 2018-04-17 | 2023-11-10 | 百度(美国)有限责任公司 | Method for converting a two-dimensional bounding box of an object into a three-dimensional position of an autonomous vehicle |
CN110388931A (en) * | 2018-04-17 | 2019-10-29 | 百度(美国)有限责任公司 | The two-dimentional bounding box of object is converted into the method for the three-dimensional position of automatic driving vehicle |
CN108596240B (en) * | 2018-04-20 | 2020-05-19 | 华中科技大学 | Image semantic segmentation method based on discriminant feature network |
CN108596240A (en) * | 2018-04-20 | 2018-09-28 | 华中科技大学 | A kind of image, semantic dividing method based on differentiation character network |
CN108596102A (en) * | 2018-04-26 | 2018-09-28 | 北京航空航天大学青岛研究院 | Indoor scene object segmentation grader building method based on RGB-D |
CN108596102B (en) * | 2018-04-26 | 2022-04-05 | 北京航空航天大学青岛研究院 | RGB-D-based indoor scene object segmentation classifier construction method |
CN108710863A (en) * | 2018-05-24 | 2018-10-26 | 东北大学 | Unmanned plane Scene Semantics dividing method based on deep learning and system |
CN108875588A (en) * | 2018-05-25 | 2018-11-23 | 武汉大学 | Across camera pedestrian detection tracking based on deep learning |
CN108875588B (en) * | 2018-05-25 | 2022-04-15 | 武汉大学 | Cross-camera pedestrian detection tracking method based on deep learning |
CN108895981A (en) * | 2018-05-29 | 2018-11-27 | 南京怀萃智能科技有限公司 | A kind of method for three-dimensional measurement, device, server and storage medium |
CN108876796A (en) * | 2018-06-08 | 2018-11-23 | 长安大学 | A kind of lane segmentation system and method based on full convolutional neural networks and condition random field |
CN109102457B (en) * | 2018-06-12 | 2023-01-17 | 杭州米绘科技有限公司 | Intelligent color changing system and method based on convolutional neural network |
CN109102457A (en) * | 2018-06-12 | 2018-12-28 | 杭州米绘科技有限公司 | A kind of intelligent color change system and method based on convolutional neural networks |
CN108985194A (en) * | 2018-06-29 | 2018-12-11 | 华南理工大学 | A kind of intelligent vehicle based on image, semantic segmentation can travel the recognition methods in region |
CN109101914A (en) * | 2018-08-01 | 2018-12-28 | 北京飞搜科技有限公司 | It is a kind of based on multiple dimensioned pedestrian detection method and device |
CN109101914B (en) * | 2018-08-01 | 2021-08-20 | 苏州飞搜科技有限公司 | Multi-scale-based pedestrian detection method and device |
CN109271990A (en) * | 2018-09-03 | 2019-01-25 | 北京邮电大学 | A kind of semantic segmentation method and device for RGB-D image |
CN109325505A (en) * | 2018-09-11 | 2019-02-12 | 北京陌上花科技有限公司 | Example dividing method and device, mobile phone terminal for embedded device |
CN109377479A (en) * | 2018-09-27 | 2019-02-22 | 中国电子科技集团公司第五十四研究所 | Satellite dish object detection method based on remote sensing image |
CN111104840A (en) * | 2018-10-26 | 2020-05-05 | 斯特拉德视觉公司 | Learning and testing method and device based on regression loss |
CN111104840B (en) * | 2018-10-26 | 2024-01-09 | 斯特拉德视觉公司 | Regression loss-based learning and testing method and device |
CN109655019A (en) * | 2018-10-29 | 2019-04-19 | 北方工业大学 | Cargo volume measurement method based on deep learning and three-dimensional reconstruction |
CN109685762A (en) * | 2018-11-09 | 2019-04-26 | 五邑大学 | A kind of Downtilt measurement method based on multiple dimensioned deep semantic segmentation network |
CN109598268A (en) * | 2018-11-23 | 2019-04-09 | 安徽大学 | A kind of RGB-D well-marked target detection method based on single flow depth degree network |
CN109598268B (en) * | 2018-11-23 | 2021-08-17 | 安徽大学 | RGB-D (Red Green blue-D) significant target detection method based on single-stream deep network |
CN109598728A (en) * | 2018-11-30 | 2019-04-09 | 腾讯科技(深圳)有限公司 | Image partition method, device, diagnostic system and storage medium |
CN109934342A (en) * | 2018-12-28 | 2019-06-25 | 深圳奥比中光科技有限公司 | Neural network model training method, depth image restorative procedure and system |
CN109934342B (en) * | 2018-12-28 | 2022-12-09 | 奥比中光科技集团股份有限公司 | Neural network model training method, depth image restoration method and system |
CN110008808A (en) * | 2018-12-29 | 2019-07-12 | 北京迈格威科技有限公司 | Panorama dividing method, device and system and storage medium |
CN109711367A (en) * | 2018-12-29 | 2019-05-03 | 北京中科寒武纪科技有限公司 | Operation method, device and Related product |
CN109711413A (en) * | 2018-12-30 | 2019-05-03 | 陕西师范大学 | Image, semantic dividing method based on deep learning |
CN109903331B (en) * | 2019-01-08 | 2020-12-22 | 杭州电子科技大学 | Convolutional neural network target detection method based on RGB-D camera |
CN109903331A (en) * | 2019-01-08 | 2019-06-18 | 杭州电子科技大学 | A kind of convolutional neural networks object detection method based on RGB-D camera |
CN109872357A (en) * | 2019-01-16 | 2019-06-11 | 创新奇智(广州)科技有限公司 | A kind of article arrangement face accounting calculation method, system and electronic equipment |
CN109801337B (en) * | 2019-01-21 | 2020-10-02 | 同济大学 | 6D pose estimation method based on instance segmentation network and iterative optimization |
CN109801337A (en) * | 2019-01-21 | 2019-05-24 | 同济大学 | A kind of 6D position and orientation estimation method of Case-based Reasoning segmentation network and iteration optimization |
CN109902572A (en) * | 2019-01-24 | 2019-06-18 | 哈尔滨理工大学 | A kind of vehicle checking method based on deep learning |
CN109782771A (en) * | 2019-02-26 | 2019-05-21 | 西安交通大学 | A kind of orchard mobile robot and edge of a field forward method |
CN109782771B (en) * | 2019-02-26 | 2021-01-19 | 西安交通大学 | Orchard mobile robot and ground steering method |
CN109753903B (en) * | 2019-02-27 | 2020-09-15 | 北航(四川)西部国际创新港科技有限公司 | Unmanned aerial vehicle detection method based on deep learning |
CN109753903A (en) * | 2019-02-27 | 2019-05-14 | 北航(四川)西部国际创新港科技有限公司 | A kind of unmanned plane detection method based on deep learning |
CN110009648B (en) * | 2019-03-04 | 2023-02-24 | 东南大学 | Roadside image vehicle segmentation method based on depth feature fusion convolutional neural network |
CN110009648A (en) * | 2019-03-04 | 2019-07-12 | 东南大学 | Trackside image Method of Vehicle Segmentation based on depth Fusion Features convolutional neural networks |
CN110008848A (en) * | 2019-03-13 | 2019-07-12 | 华南理工大学 | A kind of travelable area recognizing method of the road based on binocular stereo vision |
CN110070124A (en) * | 2019-04-15 | 2019-07-30 | 广州小鹏汽车科技有限公司 | A kind of image amplification method and system based on production confrontation network |
CN110287777A (en) * | 2019-05-16 | 2019-09-27 | 西北大学 | A kind of golden monkey body partitioning algorithm under natural scene |
WO2020237693A1 (en) * | 2019-05-31 | 2020-12-03 | 华南理工大学 | Multi-source sensing method and system for water surface unmanned equipment |
CN110188780A (en) * | 2019-06-03 | 2019-08-30 | 电子科技大学中山学院 | Method and device for constructing deep learning model for positioning multi-target feature points |
CN110263731B (en) * | 2019-06-24 | 2021-03-16 | 电子科技大学 | Single step human face detection system |
CN110263731A (en) * | 2019-06-24 | 2019-09-20 | 电子科技大学 | A kind of single step face detection system |
CN110348342A (en) * | 2019-06-27 | 2019-10-18 | 广东技术师范大学天河学院 | A kind of piping disease image partition method based on full convolutional network |
CN110298330A (en) * | 2019-07-05 | 2019-10-01 | 东北大学 | A kind of detection of transmission line polling robot monocular and localization method |
CN110363201A (en) * | 2019-07-10 | 2019-10-22 | 上海交通大学 | Weakly supervised semantic segmentation method and system based on Cooperative Study |
CN110363201B (en) * | 2019-07-10 | 2021-06-01 | 上海交通大学 | Weak supervision semantic segmentation method and system based on collaborative learning |
CN110399884B (en) * | 2019-07-10 | 2021-08-20 | 浙江理工大学 | Feature fusion self-adaptive anchor frame model vehicle detection method |
CN110399884A (en) * | 2019-07-10 | 2019-11-01 | 浙江理工大学 | A kind of adaptive anchor frame model vehicle checking method of Fusion Features |
CN110473288B (en) * | 2019-08-23 | 2022-08-05 | 清华四川能源互联网研究院 | Dam model reconstruction method and device and electronic equipment |
CN110473288A (en) * | 2019-08-23 | 2019-11-19 | 清华四川能源互联网研究院 | Dam model method for reconstructing, device and electronic equipment |
CN112528705A (en) * | 2019-09-17 | 2021-03-19 | 同方威视技术股份有限公司 | Image recognition system and method based on semantics |
CN110738132A (en) * | 2019-09-23 | 2020-01-31 | 中国海洋大学 | target detection quality blind evaluation method with discriminant perception capability |
CN110738132B (en) * | 2019-09-23 | 2022-06-03 | 中国海洋大学 | Target detection quality blind evaluation method with discriminant perception capability |
CN110599538A (en) * | 2019-09-30 | 2019-12-20 | 山东信通电子股份有限公司 | Method and device for identifying icing thickness of transmission line conductor |
CN110766096A (en) * | 2019-10-31 | 2020-02-07 | 北京金山云网络技术有限公司 | Video classification method and device and electronic equipment |
CN110941995A (en) * | 2019-11-01 | 2020-03-31 | 中山大学 | Real-time target detection and semantic segmentation multi-task learning method based on lightweight network |
CN111027493B (en) * | 2019-12-13 | 2022-05-20 | 电子科技大学 | Pedestrian detection method based on deep learning multi-network soft fusion |
CN111027493A (en) * | 2019-12-13 | 2020-04-17 | 电子科技大学 | Pedestrian detection method based on deep learning multi-network soft fusion |
TWI702536B (en) * | 2019-12-31 | 2020-08-21 | 財團法人工業技術研究院 | Training method and system of object detection model based on adaptive annotation design |
US11321590B2 (en) | 2019-12-31 | 2022-05-03 | Industrial Technology Research Institute | Training method and system of objects detection model based on adaptive annotation design |
CN111222468A (en) * | 2020-01-08 | 2020-06-02 | 浙江光珀智能科技有限公司 | People stream detection method and system based on deep learning |
CN111368829B (en) * | 2020-02-28 | 2023-06-30 | 北京理工大学 | Visual semantic relation detection method based on RGB-D image |
CN111368829A (en) * | 2020-02-28 | 2020-07-03 | 北京理工大学 | Visual semantic relation detection method based on RGB-D image |
CN111507985A (en) * | 2020-03-19 | 2020-08-07 | 北京市威富安防科技有限公司 | Image instance segmentation optimization processing method and device and computer equipment |
CN111553925A (en) * | 2020-04-27 | 2020-08-18 | 南通智能感知研究院 | End-to-end crop image segmentation method and system based on FCN |
CN111783784A (en) * | 2020-06-30 | 2020-10-16 | 创新奇智(合肥)科技有限公司 | Method and device for detecting building cavity, electronic equipment and storage medium |
CN112667832B (en) * | 2020-12-31 | 2022-05-13 | 哈尔滨工业大学 | Vision-based mutual positioning method in unknown indoor environment |
CN112667832A (en) * | 2020-12-31 | 2021-04-16 | 哈尔滨工业大学 | Vision-based mutual positioning method in unknown indoor environment |
CN113033398A (en) * | 2021-03-25 | 2021-06-25 | 深圳市康冠商用科技有限公司 | Gesture recognition method and device, computer equipment and storage medium |
CN113763422A (en) * | 2021-07-30 | 2021-12-07 | 北京交通大学 | RGB-D image saliency target detection method |
CN113763422B (en) * | 2021-07-30 | 2023-10-03 | 北京交通大学 | RGB-D image saliency target detection method |
CN114343579A (en) * | 2022-01-05 | 2022-04-15 | 浙江大学 | Infant iatrogenic skin injury automatic evaluation handheld device |
CN114373218B (en) * | 2022-03-21 | 2022-06-14 | 北京万里红科技有限公司 | Method for generating convolution network for detecting living body object |
CN114373218A (en) * | 2022-03-21 | 2022-04-19 | 北京万里红科技有限公司 | Method for generating convolution network for detecting living body object |
Also Published As
Publication number | Publication date |
---|---|
CN106709568B (en) | 2019-03-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106709568A (en) | RGB-D image object detection and semantic segmentation method based on deep convolution network | |
CN111291739B (en) | Face detection and image detection neural network training method, device and equipment | |
CN106778835A (en) | The airport target by using remote sensing image recognition methods of fusion scene information and depth characteristic | |
Yang et al. | Layered object models for image segmentation | |
CN110287960A (en) | The detection recognition method of curve text in natural scene image | |
CN110287826B (en) | Video target detection method based on attention mechanism | |
CN108009509A (en) | Vehicle target detection method | |
CN107169974A (en) | It is a kind of based on the image partition method for supervising full convolutional neural networks more | |
CN104156693B (en) | A kind of action identification method based on the fusion of multi-modal sequence | |
CN112528976B (en) | Text detection model generation method and text detection method | |
CN105160310A (en) | 3D (three-dimensional) convolutional neural network based human body behavior recognition method | |
CN109086668A (en) | Based on the multiple dimensioned unmanned aerial vehicle remote sensing images road information extracting method for generating confrontation network | |
CN106688011A (en) | Method and system for multi-class object detection | |
CN111612807A (en) | Small target image segmentation method based on scale and edge information | |
CN107451607A (en) | A kind of personal identification method of the typical character based on deep learning | |
CN105956560A (en) | Vehicle model identification method based on pooling multi-scale depth convolution characteristics | |
CN107784291A (en) | target detection tracking method and device based on infrared video | |
CN107066916A (en) | Scene Semantics dividing method based on deconvolution neutral net | |
CN110008900A (en) | A kind of visible remote sensing image candidate target extracting method by region to target | |
CN107220971A (en) | A kind of Lung neoplasm feature extracting method based on convolutional neural networks and PCA | |
CN107818299A (en) | Face recognition algorithms based on fusion HOG features and depth belief network | |
CN108985145A (en) | The Opposite direction connection deep neural network model method of small size road traffic sign detection identification | |
JP4567660B2 (en) | A method for determining a segment of an object in an electronic image. | |
CN109657634A (en) | A kind of 3D gesture identification method and system based on depth convolutional neural networks | |
CN107564007A (en) | The scene cut modification method and system of amalgamation of global information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20211123 Address after: 518052 Room 201, building A, 1 front Bay Road, Shenzhen Qianhai cooperation zone, Shenzhen, Guangdong Patentee after: Shenzhen Xiaofeng Technology Co.,Ltd. Address before: 100124 No. 100 Chaoyang District Ping Tian Park, Beijing Patentee before: Beijing University of Technology |