CN108509839A - One kind being based on the efficient gestures detection recognition methods of region convolutional neural networks - Google Patents
One kind being based on the efficient gestures detection recognition methods of region convolutional neural networks Download PDFInfo
- Publication number
- CN108509839A CN108509839A CN201810105589.XA CN201810105589A CN108509839A CN 108509839 A CN108509839 A CN 108509839A CN 201810105589 A CN201810105589 A CN 201810105589A CN 108509839 A CN108509839 A CN 108509839A
- Authority
- CN
- China
- Prior art keywords
- network
- networks
- convolutional neural
- neural networks
- region
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/107—Static hand or arm
- G06V40/113—Recognition of static hand signs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
The present invention relates to one kind being based on the efficient gestures detection recognition methods of region convolutional neural networks, includes the following steps:Chinese character gesture letter sample image is pre-processed;It builds and strengthens images of gestures data set;Gestures detection identification is carried out using based on region convolutional neural networks Faster R CNN networks, gesture feature is first extracted by feature extraction network, and the characteristic pattern of extraction is divided into two parts, first part is directly entered Fast R CNN networks and does profound convolution, second part, which enters after RPN networks generating region is suggested, inputs Fast R CNN networks, and the characteristic pattern obtained with first part enters the ponds RoI layer jointly, position is obtained after full context layer again to return in gesture classification score, it is final to realize gestures detection identification;Training network model realizes the detection identification of Chinese character gesture letter.The present invention can promote recognition speed and accuracy rate.
Description
Technical field
The present invention relates to gestures detection identification technology fields, efficient based on region convolutional neural networks more particularly to one kind
Gestures detection recognition methods.
Background technology
Gesture identification application field is extensive in recent years, such as the robot control that the translation of deaf-mute's gesture, gesture identification are taken pictures
The Intelligent housing etc. of system, door and window household electrical appliances etc..Classify by gesture acquisition mode, there are two types of mode classifications for gesture identification:It is a kind of
It is based on wearing technology, one is based on machine vision.Although Gesture Recognition based on wearable device has gesture
The advantages such as accurate positioning, data are relatively easy, response processing speed is very fast, but can not compensating cost is high, inconvenient, study
Disadvantages, these disadvantages such as of high cost, manipulation distance is limited, usage scenario limitation lead to the gesture identification method based on wearing technology
It is difficult to be widelyd popularize, so the necessarily ideal gesture identification of the gesture technology based on machine vision.Based on machine vision
Gesture identification method core be computer gesture target detection recognizer.
Traditional gesture target detection recognizer generally comprises Hand Gesture Segmentation, feature extraction, identifies these three steps.It is logical
Hand Gesture Segmentation often can be done with the methods of the model based on movable information, Motion mask, Skin Color Information, then again to segmentation after
Gesture carries out doing feature extraction with HOG, LBP, Fourier converter technique scheduling algorithm, finally recycles SVM, Adaboost, MLP etc.
Algorithm carries out Classification and Identification.Traditional gesture target detection recognizer can not evade engineer's gesture feature defect, therefore algorithm
Obtained model plasticity is poor.
Convolutional neural networks (Convolutional Neural Network, CNN) are deep learning (Deep
Learning, DL) it is theoretical in a very important algorithm, it solves traditional artificial definition description and selection target feature
Drawback can automatically extract the target of input picture deeper time feature by powerful self-learning ability and classify.
2014, Girshick R. proposed the convolutional neural networks model R-CNN based on region, according to Selective
Search Edge boxes generate candidate region, then carry out feature to the candidate region of generation with convolutional neural networks and carry
It takes, although there is precision deficiency and input image size limitation, is established in target detection for the thinking of RPN+CNN
Basis.Then Fast R-CNN models are proposed in Girshick R. in 2015, it is proposed that Region ofInterest
Pooling layers, the shortcomings that R-CNN, is improved, but due to its network to clarification of objective still by hand-designed, and
And evaluation work is only completed on CPU, the accuracy of such model is low and candidate region calculates time length still becomes the network
The drawbacks of.It is further boosting algorithm recognition efficiency after R-CNN and Fast R-CNN, Microsoft in 2015
Shaoqing.Ren etc. proposes Faster R-CNN models.Suggestion section is generated with region proposed way, is substituted
The methods of Selective Search, Edge boxes, and and detection network share convolution feature, so that region suggest meter
Evaluation time greatly shortens.
Invention content
Technical problem to be solved by the invention is to provide one kind being based on the efficient gestures detection of region convolutional neural networks
Recognition methods can promote recognition speed and accuracy rate.
The technical solution adopted by the present invention to solve the technical problems is:It provides a kind of high based on region convolutional neural networks
The gestures detection recognition methods of effect, includes the following steps:
(1) Chinese character gesture letter sample image is pre-processed;
(2) it builds and strengthens images of gestures data set, be divided into training set and test set;
(3) gestures detection identification, the network packet are carried out using based on region convolutional neural networks Faster R-CNN networks
It includes:Feature extraction network, the regions RPN suggest network and Fast R-CNN networks, and the feature extraction network is for extracting gesture
Feature, and the characteristic pattern of extraction is divided into first part and second part, the first part is directly entered Fast R-CNN nets
Network does profound convolution, and the second part, which enters after RPN networks generating region is suggested, inputs Fast R-CNN networks, and with the
The characteristic pattern that a part obtains enters the ponds RoI layer jointly, then obtains position after full context layer and return in gesture classification score,
It is final to realize gestures detection identification;
(4) training network model:This network is trained using Chinese character manual alphabet training set, obtains network parameter;Finally use
Test set or in real time the acquisition gesture video input trained network realize the detection identification of Chinese character gesture letter.
The step (1) is specially:Chinese character gesture letter video is recorded, and it is image that video, which is taken out frame, removal smear is tight
The image of weight and serious shielding, and enhancing processing is carried out using the method for high-pass filtering to image.
The images of gestures data set built in the step (2) includes original sample image and is carried out to original sample image
Label image after mark by hand, wherein the image tagged frame of markup information record is corresponded with original image;Using to original
The mode that beginning image does minute surface symmetrical treatment re-flags correspondence image, to achieve the purpose that strengthen static sign language data set.
Feature extraction network in the step (3) is 13 layers of VGG16 networks for removing 3 layers of full articulamentum.
The regions RPN in the step (3) suggest that network is suggested using the direct generating region of CNN convolutional neural networks,
The region that one time obtains multiple dimensioned more length-width ratios is slided by sliding window on last convolutional layer to suggest to extract detection zone
Domain, the regions RPN suggest that network also carries out end-to-end training by backpropagation and stochastic gradient descent.
Suggest that network does sliding sash on the characteristic pattern that last layer of convolution obtains using a convolution kernel and sweeps in the regions RPN
It retouches, which connect with the window on characteristic pattern entirely every time, obtains a low-dimensional vector, this low-dimensional vector is sent to
Two full articulamentums, i.e. bezel locations return layer and target classification layer, and the bezel locations return layer for predicting Suggestion box
The corresponding coordinate of anchor, the target classification layer is for judging that Suggestion box is target or background.
Suggest that the loss function of network is in the regions RPN
Wherein, piIt is the probability that i-th of anchor rectangle frame is target,It is sample label;tiIt is used to indicate that the parametrization that prediction obtains
Frame coordinate,It is the parametrization coordinate of positive sample;NclsIndicate most small quantities of amount of images in input network, NregIndicate anchor coordinate
Sum;LclsFor the loss function for classification;LregTo return loss function.
The regions RPN suggest that network and Fast R-CNN networks use feature shared mechanism in the step (3), using alternately
Training stage convolutional layer feature is shared.
Advantageous effect
Due to the adoption of the above technical solution, compared with prior art, the present invention having the following advantages that and actively imitating
Fruit:
The present invention realizes that the detection of static Chinese character gesture letter identifies using region convolutional neural networks Faster R-CNN,
Feature extraction is done with the VGG16 networks of the network, region proposed mechanism (RPN) comes formation zone and suggests (Region
Proposals), the region of generation suggests that entering back into Fast R-CNN networks does gesture target detection and classification;Due to directly defeated
It is gesture picture to enter, and output is also gesture picture after identification, so the frame has an advantage end to end, the above characteristic, and no
The speed for improving only gestures detection identification, more greatly improves recognition accuracy.
Description of the drawings
Fig. 1 is the schematic diagram of the gestures detection identification based on region convolutional neural networks of the present invention;
Fig. 2 is that network RPN structural schematic diagrams are suggested in region;
Fig. 3 is inventive network training flow chart;
Fig. 4 is the experimental result picture of gestures detection identification of the present invention.
Specific implementation mode
Present invention will be further explained below with reference to specific examples.It should be understood that these embodiments are merely to illustrate the present invention
Rather than it limits the scope of the invention.In addition, it should also be understood that, after reading the content taught by the present invention, people in the art
Member can make various changes or modifications the present invention, and such equivalent forms equally fall within the application the appended claims and limited
Range.
Embodiments of the present invention are related to a kind of static sign language based on modified single multi-target detection device and identify in real time
Method, as shown in Figure 1, including the following steps:Chinese character gesture letter sample image is pre-processed;It builds and strengthens gesture figure
As data set, it is divided into training set and test set;Gesture is carried out using based on region convolutional neural networks Faster R-CNN networks
Detection identification, the network are divided into three parts:Feature extraction network, the regions RPN suggest network and Fast R-CNN networks, described
Feature extraction network extracts gesture feature, and the characteristic pattern of extraction is divided into two parts, and first part is directly entered Fast R-CNN nets
Network does profound convolution again, and second part, which enters after RPN networks generating region is suggested, inputs Fast R-CNN networks, with first
The characteristic pattern got enters the ponds RoI layer jointly, then position recurrence and gesture classification score are obtained after full context layer, finally
Realize gestures detection identification;Training network model:This network is trained using Chinese character manual alphabet training set, obtains network parameter;
It finally can be used test set or camera to acquire the gesture video input trained network in real time, realize the detection of Chinese character gesture letter
Identification.It is specific as follows:
Step 1:Chinese character gesture letter sample image is pre-processed.This experimental data is adopted by high definition monocular cam
Collection is completed.It is representative to carry out choosing 5 letters in experiment in 26 Chinese letters gestures of static Sign Language Recognition, respectively A, B,
C、D、E.Experimental data is completed by 8 people, everyone is complete to each letter difference recorded video, then by Matlab videos pumping frame program
It at frame is taken out, removes that smear is serious, image of serious shielding manually, high-pass filtering is used for the poor image of certain display effects
Method enhancing processing is done to image, be convenient for target identification, obtained preliminary data collection, picture size is 640*480.
Step 2:Images of gestures data set is built and strengthened, training set and test set are divided into.The Chinese character gesture letter of structure
Image containing original sample and the label image after manual mark, the image tagged of markup information record are carried out to original sample image
Frame is corresponded with original image;By the way of doing minute surface symmetrical treatment to original image, and correspondence image is re-flagged, reached
To the purpose for strengthening static sign language data set.Final data collection is as shown in table 1, wherein each letter training set picture is 2500
Opening and closing meter 15000 is opened, and test set is that 500 opening and closing meters 2500 are opened.Handmarking is carried out with LabelImg programs to obtain really
Target labels file.
The static sign language data set table of table 1
Step 3:Gestures detection identification is carried out using based on region convolutional neural networks Faster R-CNN networks.The network
Core is divided into three parts:Network and Fast R-CNN networks are suggested in feature extraction network, the regions RPN.The Principles of Network are summarized
For:Gesture feature is first extracted by feature extraction network VGG16, this feature figure is divided into two parts, and first part is directly entered Fast
R-CNN networks do profound convolution again, and second part, which enters after RPN networks generating region is suggested, inputs Fast R-CNN networks,
The characteristic pattern obtained with first part enters the ponds RoI layer jointly, then obtains position after full context layer and return in gesture classification
Score, it is final to realize gestures detection identification.
The it is proposed of network (Region Proposal Networks, RPN) is suggested in region, for solving in Fast R-CNN
The generating mode of candidate region is the method based on selective search (Selective Search), due to this method calculation amount
Greatly, the speed of strong influence algorithm.Suggest network as shown in Fig. 2, RPN detailed processes are as follows in region:Use a small convolution
Core (be usually 3*3 sizes) does sliding sash scanning on the characteristic pattern that last layer of convolution obtains, the sliding convolution kernel every time with spy
The window of n*n on sign figure connect (VGG16 of the present invention using 228 pixels) entirely, and then obtaining a low-dimensional vector, (VGG16 is
512d), this low-dimensional vector is finally sent to two full articulamentums, i.e. bezel locations return layer (reg layer) and target point
Class layer (clslayer), bezel locations return layer for predicting the corresponding coordinate of the anchor of Suggestion box, and target classification layer is for judging
Suggestion box is target or background.
Loss function by RPN networks consists of two parts:1) it is used for the loss function L of classificationcls, to describe certain figure
As whether region is target;2) loss function L is returnedreg, to describe between the regions RP and real goal (Ground Truth)
Gap.The part total losses function representation is:
Wherein, piIt is the probability that i-th of anchor (Anchor) rectangle frame is target,It is sample label (1 corresponding anchor matrix
It is target, 0 on the contrary);tiIt is used to indicate that parametrization frame coordinate that prediction obtains, is a four-dimensional coordinate,It is positive sample
Parametrization coordinate specifically as shown in formula (1-4).NclsIndicate most small quantities of amount of images in input network, NregIndicate anchor coordinate
Sum, the two are all normalized weight parameter.λ is adjusting the two-part balance of formula.For frame Classification Loss function
Lcls, indicated using log loss functions;Loss function L is returned for framereg, computational methods are as follows:
Known by formula (1-1), when sample be timing, i.e.,Shi Caihui activates frame to return loss function.Frame, which returns, to be made
With being the coordinate for correcting anchor rectangle frame and true frame, keep the two closer, it is calculated using the coordinate of parametrization:
In formula, x, y, w, h indicate the center point coordinate of prediction frame, the width and height of frame respectively;xa,ya,wa,haRespectively
Indicate the coordinate of candidate frame key store, the width and height of frame;x*,y*,w*,h*The center point coordinate of practical frame is indicated respectively,
The width and height of frame;tx, tw,Loss is returned for calculating, i.e. returning from suggestion areas frame to neighbouring true frame
Return.
According to the multitask loss function of definition, the optimization algorithm that the present invention uses is SGD, is joined in the hope of optimal weight
Number.
When training RPN networks, pass through backpropagation (Back-Propagation, BP) and stochastic gradient descent
(Stochastic Gradient Descent, SGD) carries out end-to-end (end-to-end) training.
In the present invention, RPN mechanism uses feature shared mechanism with Fast R-CNN, that is, shares the convolutional layer of 13 layers of VGG,
It is shared using alternately training (Alternating training) stage convolutional layer feature, it avoids in Faster R-CNN networks
Learn two networks.
Step 4:The region convolutional Neural of training method training step 3 stage by stage is used using the gesture training set of step 2
Network, the iterations that setting four-stage is arranged are respectively 40k, 20k, 40k and 40k times, and each stage is learned using fixed
The mode of habit rate, learning rate are fixed as 0.001, using stochastic gradient descent method optimum results.Fig. 3 is network training flow
Figure.Through being repeatedly finely adjusted to network, selects one group of preferable model parameter of effect as final mask, be used for experiment test.
Fig. 4 is the experimental result picture of gestures detection identification of the present invention.Randomly select part of test results, gesture in every figure
Recognition result includes gesture class label and probability size.It can be seen that using present embodiment based on region convolutional Neural net
The method of network identifies gestures detection highly effective.
It is not difficult to find that the present invention need not describe Chinese character hand gesture feature, the convolution of use using hand-designed language
Neural network can obtain deeper characteristic information so that the plasticity of model is good;Using RPN mechanism do region suggest it can and
The convolution feature of entire detection network share full figure so that region suggests that the time used is less, is conducive to algorithm speed and improves;With
The detection and identification of the final gesture target of Fast R-CNN real-time performances;All of above characteristic so that the present invention program have compared with
Good recognition speed, particularly improves a lot on gestures detection recognition accuracy.
Claims (8)
1. one kind being based on the efficient gestures detection recognition methods of region convolutional neural networks, which is characterized in that include the following steps:
(1) Chinese character gesture letter sample image is pre-processed;
(2) it builds and strengthens images of gestures data set, be divided into training set and test set;
(3) gestures detection identification is carried out using based on region convolutional neural networks Faster R-CNN networks, which includes:It is special
Sign extraction network, the regions RPN suggest that network and Fast R-CNN networks, the feature extraction network are used to extract gesture feature,
And the characteristic pattern of extraction is divided into first part and second part, the first part is directly entered Fast R-CNN networks and does depth
Level convolution, the second part, which enters after RPN networks generating region is suggested, inputs Fast R-CNN networks, and and first part
Obtained characteristic pattern enters the ponds RoI layer jointly, then obtains position after full context layer and return in gesture classification score, final real
Existing gestures detection identification;
(4) training network model:This network is trained using Chinese character manual alphabet training set, obtains network parameter;Finally with test
Collection or in real time the acquisition gesture video input trained network realize the detection identification of Chinese character gesture letter.
2. according to claim 1 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the step (1) is specially:Chinese character gesture letter video is recorded, and it is image that video, which is taken out frame, removal smear is serious and hides
Serious image is kept off, and enhancing processing is carried out using the method for high-pass filtering to image.
3. according to claim 1 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the images of gestures data set built in the step (2) includes original sample image and carried out by hand to original sample image
Label image after mark, wherein the image tagged frame of markup information record is corresponded with original image;Using to original graph
Mode as doing minute surface symmetrical treatment re-flags correspondence image, to achieve the purpose that strengthen static sign language data set.
4. according to claim 1 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the feature extraction network in the step (3) is 13 layers of VGG16 networks for removing 3 layers of full articulamentum.
5. according to claim 1 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the regions RPN in the step (3) suggest that network is suggested using the direct generating region of CNN convolutional neural networks, pass through cunning
Dynamic window slides the region that one time obtains multiple dimensioned more length-width ratios on last convolutional layer suggests to extract detection zone, described
Suggest that network also carries out end-to-end training by backpropagation and stochastic gradient descent in the regions RPN.
6. according to claim 5 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the regions RPN suggest that network does sliding sash scanning using a convolution kernel on the characteristic pattern that last layer of convolution obtains, should
Sliding convolution kernel is connect with the window on characteristic pattern entirely every time, obtains a low-dimensional vector, this low-dimensional vector is sent to two
Full articulamentum, i.e. bezel locations return layer and target classification layer, and the bezel locations return the anchor pair that layer is used to predict Suggestion box
The coordinate answered, the target classification layer is for judging that Suggestion box is target or background.
7. according to claim 5 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the regions RPN suggest that the loss function of network isIts
In, piIt is the probability that i-th of anchor rectangle frame is target,It is sample label;tiIt is used to indicate that the parametrization frame that prediction obtains
Coordinate,It is the parametrization coordinate of positive sample;NclsIndicate most small quantities of amount of images in input network, NregIndicate the total of anchor coordinate
Number;LclsFor the loss function for classification;LregTo return loss function.
8. according to claim 1 be based on the efficient gestures detection recognition methods of region convolutional neural networks, feature exists
In the regions RPN suggest that network and FastR-CNN networks use feature shared mechanism in the step (3), using alternately training rank
Section convolutional layer feature is shared.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810105589.XA CN108509839A (en) | 2018-02-02 | 2018-02-02 | One kind being based on the efficient gestures detection recognition methods of region convolutional neural networks |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810105589.XA CN108509839A (en) | 2018-02-02 | 2018-02-02 | One kind being based on the efficient gestures detection recognition methods of region convolutional neural networks |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108509839A true CN108509839A (en) | 2018-09-07 |
Family
ID=63374571
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810105589.XA Pending CN108509839A (en) | 2018-02-02 | 2018-02-02 | One kind being based on the efficient gestures detection recognition methods of region convolutional neural networks |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108509839A (en) |
Cited By (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109325505A (en) * | 2018-09-11 | 2019-02-12 | 北京陌上花科技有限公司 | Example dividing method and device, mobile phone terminal for embedded device |
CN109657537A (en) * | 2018-11-05 | 2019-04-19 | 北京达佳互联信息技术有限公司 | Image-recognizing method, system and electronic equipment based on target detection |
CN109726690A (en) * | 2018-12-30 | 2019-05-07 | 陕西师范大学 | Learner behavior image multizone based on DenseCap network describes method |
CN109815802A (en) * | 2018-12-18 | 2019-05-28 | 中国海洋大学 | A kind of monitor video vehicle detection and recognition method based on convolutional neural networks |
CN109886245A (en) * | 2019-03-02 | 2019-06-14 | 山东大学 | A kind of pedestrian detection recognition methods based on deep learning cascade neural network |
CN109949297A (en) * | 2019-03-20 | 2019-06-28 | 天津工业大学 | Pulmonary nodule detection method based on Reception and Faster R-CNN |
CN109977777A (en) * | 2019-02-26 | 2019-07-05 | 南京邮电大学 | Gesture identification method based on novel RF-Net model |
CN110008910A (en) * | 2019-04-10 | 2019-07-12 | 江西理工大学 | A kind of movie and television play is exposed the false the detection method of camera lens |
CN110084852A (en) * | 2019-04-22 | 2019-08-02 | 西南交通大学 | A kind of accurate positioning method of high iron catenary support device sleeve puller bolt |
CN110211041A (en) * | 2019-05-08 | 2019-09-06 | 云南大学 | A kind of optimization method of the neural network image classifier integrated based on receptive field |
CN110232687A (en) * | 2019-06-21 | 2019-09-13 | 华北电力大学(保定) | The detection method of cotter bolt defect in a kind of electric inspection process image |
CN110232311A (en) * | 2019-04-26 | 2019-09-13 | 平安科技(深圳)有限公司 | Dividing method, device and the computer equipment of hand images |
CN110264457A (en) * | 2019-06-20 | 2019-09-20 | 浙江大学 | Weld seam autonomous classification method based on rotary area candidate network |
CN110298821A (en) * | 2019-05-28 | 2019-10-01 | 昆明理工大学 | A kind of reinforcing bar detection method based on Faster R-CNN |
CN110414330A (en) * | 2019-06-20 | 2019-11-05 | 平安科技(深圳)有限公司 | A kind of detection method and device of palm image |
CN110490892A (en) * | 2019-07-03 | 2019-11-22 | 中山大学 | A kind of Thyroid ultrasound image tubercle automatic positioning recognition methods based on USFaster R-CNN |
CN110633663A (en) * | 2019-09-05 | 2019-12-31 | 东南大学 | Method for automatically cutting multi-mode data in sign language video |
CN110751155A (en) * | 2019-10-14 | 2020-02-04 | 西北工业大学 | Novel target detection method based on Faster R-CNN |
CN110991374A (en) * | 2019-12-10 | 2020-04-10 | 电子科技大学 | Fingerprint singular point detection method based on RCNN |
CN110992349A (en) * | 2019-12-11 | 2020-04-10 | 南京航空航天大学 | Underground pipeline abnormity automatic positioning and identification method based on deep learning |
CN111160479A (en) * | 2019-12-31 | 2020-05-15 | 华南理工大学 | Welding spot identification method |
CN111325179A (en) * | 2020-03-09 | 2020-06-23 | 厦门美图之家科技有限公司 | Gesture tracking method and device, electronic equipment and storage medium |
CN111382766A (en) * | 2018-12-29 | 2020-07-07 | 中国科学院沈阳计算技术研究所有限公司 | Equipment fault detection method based on fast R-CNN |
CN111401293A (en) * | 2020-03-25 | 2020-07-10 | 东华大学 | Gesture recognition method based on Head lightweight Mask scanning R-CNN |
CN111611925A (en) * | 2020-05-21 | 2020-09-01 | 重庆现代建筑产业发展研究院 | Building detection and identification method and device |
CN111783772A (en) * | 2020-06-12 | 2020-10-16 | 青岛理工大学 | Grabbing detection method based on RP-ResNet network |
CN111881907A (en) * | 2020-06-22 | 2020-11-03 | 浙江大华技术股份有限公司 | Frame regression positioning method and device and electronic equipment |
CN112036261A (en) * | 2020-08-11 | 2020-12-04 | 海尔优家智能科技(北京)有限公司 | Gesture recognition method and device, storage medium and electronic device |
CN112052724A (en) * | 2020-07-23 | 2020-12-08 | 深圳市玩瞳科技有限公司 | Finger tip positioning method and device based on deep convolutional neural network |
CN112148128A (en) * | 2020-10-16 | 2020-12-29 | 哈尔滨工业大学 | Real-time gesture recognition method and device and man-machine interaction system |
CN112163447A (en) * | 2020-08-18 | 2021-01-01 | 桂林电子科技大学 | Multi-task real-time gesture detection and recognition method based on Attention and Squeezenet |
CN112257513A (en) * | 2020-09-27 | 2021-01-22 | 南京工业大学 | Training method, translation method and system for sign language video translation model |
CN112308822A (en) * | 2020-10-10 | 2021-02-02 | 杭州电子科技大学 | Intervertebral disc CT image detection method based on deep convolutional neural network |
CN112364687A (en) * | 2020-09-29 | 2021-02-12 | 上善智城(苏州)信息科技有限公司 | Improved Faster R-CNN gas station electrostatic sign identification method and system |
CN112861849A (en) * | 2020-12-29 | 2021-05-28 | 北京航空航天大学 | Tissue identification method in spinal deformity correction surgery |
CN113076972A (en) * | 2021-03-04 | 2021-07-06 | 山东师范大学 | Two-stage Logo image detection method and system based on deep learning |
CN113096126A (en) * | 2021-06-03 | 2021-07-09 | 四川九通智路科技有限公司 | Road disease detection system and method based on image recognition deep learning |
CN113792573A (en) * | 2021-07-13 | 2021-12-14 | 浙江理工大学 | Static gesture recognition method for wavelet transformation low-frequency information and Xception network |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140105453A1 (en) * | 2012-10-08 | 2014-04-17 | Pixart Imaging Inc. | Gesture identification with natural images |
CN106599939A (en) * | 2016-12-30 | 2017-04-26 | 深圳市唯特视科技有限公司 | Real-time target detection method based on region convolutional neural network |
CN107102727A (en) * | 2017-03-17 | 2017-08-29 | 武汉理工大学 | Dynamic gesture study and recognition methods based on ELM neutral nets |
CN107239731A (en) * | 2017-04-17 | 2017-10-10 | 浙江工业大学 | A kind of gestures detection and recognition methods based on Faster R CNN |
CN107392166A (en) * | 2017-07-31 | 2017-11-24 | 北京小米移动软件有限公司 | Skin color detection method, device and computer-readable recording medium |
CN107451602A (en) * | 2017-07-06 | 2017-12-08 | 浙江工业大学 | A kind of fruits and vegetables detection method based on deep learning |
-
2018
- 2018-02-02 CN CN201810105589.XA patent/CN108509839A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140105453A1 (en) * | 2012-10-08 | 2014-04-17 | Pixart Imaging Inc. | Gesture identification with natural images |
CN106599939A (en) * | 2016-12-30 | 2017-04-26 | 深圳市唯特视科技有限公司 | Real-time target detection method based on region convolutional neural network |
CN107102727A (en) * | 2017-03-17 | 2017-08-29 | 武汉理工大学 | Dynamic gesture study and recognition methods based on ELM neutral nets |
CN107239731A (en) * | 2017-04-17 | 2017-10-10 | 浙江工业大学 | A kind of gestures detection and recognition methods based on Faster R CNN |
CN107451602A (en) * | 2017-07-06 | 2017-12-08 | 浙江工业大学 | A kind of fruits and vegetables detection method based on deep learning |
CN107392166A (en) * | 2017-07-31 | 2017-11-24 | 北京小米移动软件有限公司 | Skin color detection method, device and computer-readable recording medium |
Cited By (52)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109325505A (en) * | 2018-09-11 | 2019-02-12 | 北京陌上花科技有限公司 | Example dividing method and device, mobile phone terminal for embedded device |
CN109657537A (en) * | 2018-11-05 | 2019-04-19 | 北京达佳互联信息技术有限公司 | Image-recognizing method, system and electronic equipment based on target detection |
CN109815802A (en) * | 2018-12-18 | 2019-05-28 | 中国海洋大学 | A kind of monitor video vehicle detection and recognition method based on convolutional neural networks |
CN111382766A (en) * | 2018-12-29 | 2020-07-07 | 中国科学院沈阳计算技术研究所有限公司 | Equipment fault detection method based on fast R-CNN |
CN109726690B (en) * | 2018-12-30 | 2023-04-18 | 陕西师范大学 | Multi-region description method for learner behavior image based on DenseCap network |
CN109726690A (en) * | 2018-12-30 | 2019-05-07 | 陕西师范大学 | Learner behavior image multizone based on DenseCap network describes method |
CN109977777A (en) * | 2019-02-26 | 2019-07-05 | 南京邮电大学 | Gesture identification method based on novel RF-Net model |
CN109977777B (en) * | 2019-02-26 | 2022-08-30 | 南京邮电大学 | Novel RF-Net model-based gesture recognition method |
CN109886245A (en) * | 2019-03-02 | 2019-06-14 | 山东大学 | A kind of pedestrian detection recognition methods based on deep learning cascade neural network |
CN109949297A (en) * | 2019-03-20 | 2019-06-28 | 天津工业大学 | Pulmonary nodule detection method based on Reception and Faster R-CNN |
CN110008910B (en) * | 2019-04-10 | 2022-01-28 | 江西理工大学 | Detection method for movie and television series cut-to-help lens |
CN110008910A (en) * | 2019-04-10 | 2019-07-12 | 江西理工大学 | A kind of movie and television play is exposed the false the detection method of camera lens |
CN110084852A (en) * | 2019-04-22 | 2019-08-02 | 西南交通大学 | A kind of accurate positioning method of high iron catenary support device sleeve puller bolt |
WO2020215565A1 (en) * | 2019-04-26 | 2020-10-29 | 平安科技(深圳)有限公司 | Hand image segmentation method and apparatus, and computer device |
CN110232311A (en) * | 2019-04-26 | 2019-09-13 | 平安科技(深圳)有限公司 | Dividing method, device and the computer equipment of hand images |
CN110232311B (en) * | 2019-04-26 | 2023-11-14 | 平安科技(深圳)有限公司 | Method and device for segmenting hand image and computer equipment |
CN110211041A (en) * | 2019-05-08 | 2019-09-06 | 云南大学 | A kind of optimization method of the neural network image classifier integrated based on receptive field |
CN110211041B (en) * | 2019-05-08 | 2023-06-23 | 云南大学 | Optimization method of neural network image classifier based on receptive field integration |
CN110298821A (en) * | 2019-05-28 | 2019-10-01 | 昆明理工大学 | A kind of reinforcing bar detection method based on Faster R-CNN |
CN110414330A (en) * | 2019-06-20 | 2019-11-05 | 平安科技(深圳)有限公司 | A kind of detection method and device of palm image |
CN110264457A (en) * | 2019-06-20 | 2019-09-20 | 浙江大学 | Weld seam autonomous classification method based on rotary area candidate network |
CN110232687A (en) * | 2019-06-21 | 2019-09-13 | 华北电力大学(保定) | The detection method of cotter bolt defect in a kind of electric inspection process image |
CN110490892A (en) * | 2019-07-03 | 2019-11-22 | 中山大学 | A kind of Thyroid ultrasound image tubercle automatic positioning recognition methods based on USFaster R-CNN |
CN110633663A (en) * | 2019-09-05 | 2019-12-31 | 东南大学 | Method for automatically cutting multi-mode data in sign language video |
CN110633663B (en) * | 2019-09-05 | 2022-04-08 | 东南大学 | Method for automatically cutting multi-mode data in sign language video |
CN110751155A (en) * | 2019-10-14 | 2020-02-04 | 西北工业大学 | Novel target detection method based on Faster R-CNN |
CN110991374A (en) * | 2019-12-10 | 2020-04-10 | 电子科技大学 | Fingerprint singular point detection method based on RCNN |
CN110992349A (en) * | 2019-12-11 | 2020-04-10 | 南京航空航天大学 | Underground pipeline abnormity automatic positioning and identification method based on deep learning |
CN111160479A (en) * | 2019-12-31 | 2020-05-15 | 华南理工大学 | Welding spot identification method |
CN111325179A (en) * | 2020-03-09 | 2020-06-23 | 厦门美图之家科技有限公司 | Gesture tracking method and device, electronic equipment and storage medium |
CN111325179B (en) * | 2020-03-09 | 2023-05-02 | 厦门美图之家科技有限公司 | Gesture tracking method, gesture tracking device, electronic equipment and storage medium |
CN111401293A (en) * | 2020-03-25 | 2020-07-10 | 东华大学 | Gesture recognition method based on Head lightweight Mask scanning R-CNN |
CN111401293B (en) * | 2020-03-25 | 2022-06-24 | 东华大学 | Gesture recognition method based on Head lightweight Mask scanning R-CNN |
CN111611925A (en) * | 2020-05-21 | 2020-09-01 | 重庆现代建筑产业发展研究院 | Building detection and identification method and device |
CN111783772A (en) * | 2020-06-12 | 2020-10-16 | 青岛理工大学 | Grabbing detection method based on RP-ResNet network |
CN111881907A (en) * | 2020-06-22 | 2020-11-03 | 浙江大华技术股份有限公司 | Frame regression positioning method and device and electronic equipment |
CN111881907B (en) * | 2020-06-22 | 2021-07-27 | 浙江大华技术股份有限公司 | Frame regression positioning method and device and electronic equipment |
CN112052724A (en) * | 2020-07-23 | 2020-12-08 | 深圳市玩瞳科技有限公司 | Finger tip positioning method and device based on deep convolutional neural network |
CN112036261A (en) * | 2020-08-11 | 2020-12-04 | 海尔优家智能科技(北京)有限公司 | Gesture recognition method and device, storage medium and electronic device |
CN112163447A (en) * | 2020-08-18 | 2021-01-01 | 桂林电子科技大学 | Multi-task real-time gesture detection and recognition method based on Attention and Squeezenet |
CN112163447B (en) * | 2020-08-18 | 2022-04-08 | 桂林电子科技大学 | Multi-task real-time gesture detection and recognition method based on Attention and Squeezenet |
CN112257513B (en) * | 2020-09-27 | 2023-07-25 | 南京工业大学 | Training method, translation method and system for sign language video translation model |
CN112257513A (en) * | 2020-09-27 | 2021-01-22 | 南京工业大学 | Training method, translation method and system for sign language video translation model |
CN112364687A (en) * | 2020-09-29 | 2021-02-12 | 上善智城(苏州)信息科技有限公司 | Improved Faster R-CNN gas station electrostatic sign identification method and system |
CN112308822A (en) * | 2020-10-10 | 2021-02-02 | 杭州电子科技大学 | Intervertebral disc CT image detection method based on deep convolutional neural network |
CN112148128A (en) * | 2020-10-16 | 2020-12-29 | 哈尔滨工业大学 | Real-time gesture recognition method and device and man-machine interaction system |
CN112148128B (en) * | 2020-10-16 | 2022-11-25 | 哈尔滨工业大学 | Real-time gesture recognition method and device and man-machine interaction system |
CN112861849A (en) * | 2020-12-29 | 2021-05-28 | 北京航空航天大学 | Tissue identification method in spinal deformity correction surgery |
CN113076972A (en) * | 2021-03-04 | 2021-07-06 | 山东师范大学 | Two-stage Logo image detection method and system based on deep learning |
CN113096126B (en) * | 2021-06-03 | 2021-09-24 | 四川九通智路科技有限公司 | Road disease detection system and method based on image recognition deep learning |
CN113096126A (en) * | 2021-06-03 | 2021-07-09 | 四川九通智路科技有限公司 | Road disease detection system and method based on image recognition deep learning |
CN113792573A (en) * | 2021-07-13 | 2021-12-14 | 浙江理工大学 | Static gesture recognition method for wavelet transformation low-frequency information and Xception network |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108509839A (en) | One kind being based on the efficient gestures detection recognition methods of region convolutional neural networks | |
Sincan et al. | Autsl: A large scale multi-modal turkish sign language dataset and baseline methods | |
CN109961034B (en) | Video target detection method based on convolution gating cyclic neural unit | |
CN107273800B (en) | Attention mechanism-based motion recognition method for convolutional recurrent neural network | |
Shan et al. | Automatic facial expression recognition based on a deep convolutional-neural-network structure | |
CN106599789B (en) | The recognition methods of video classification and device, data processing equipment and electronic equipment | |
Wang et al. | Actionness estimation using hybrid fully convolutional networks | |
CN106557774B (en) | The method for real time tracking of multichannel core correlation filtering | |
CN109816689A (en) | A kind of motion target tracking method that multilayer convolution feature adaptively merges | |
CN112507777A (en) | Optical remote sensing image ship detection and segmentation method based on deep learning | |
Xu et al. | Aligning correlation information for domain adaptation in action recognition | |
CN109902646A (en) | A kind of gait recognition method based on long memory network in short-term | |
CN112883896B (en) | Micro-expression detection method based on BERT network | |
CN113963032A (en) | Twin network structure target tracking method fusing target re-identification | |
CN109034012A (en) | First person gesture identification method based on dynamic image and video sequence | |
Mo et al. | Background noise filtering and distribution dividing for crowd counting | |
CN110334656A (en) | Multi-source Remote Sensing Images Clean water withdraw method and device based on information source probability weight | |
CN110276784A (en) | Correlation filtering motion target tracking method based on memory mechanism Yu convolution feature | |
Fan | Research and realization of video target detection system based on deep learning | |
Pang et al. | Dance video motion recognition based on computer vision and image processing | |
Yaseen et al. | A novel approach based on multi-level bottleneck attention modules using self-guided dropblock for person re-identification | |
CN116993760A (en) | Gesture segmentation method, system, device and medium based on graph convolution and attention mechanism | |
Le et al. | Smart Elevator Cotrol System Based on Human Hand Gesture Recognition | |
Hassan et al. | Enhanced dynamic sign language recognition using slowfast networks | |
Zhang et al. | Weakly-supervised butterfly detection based on saliency map |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180907 |