CN107527031A - A kind of indoor objects detection method based on SSD - Google Patents

A kind of indoor objects detection method based on SSD Download PDF

Info

Publication number
CN107527031A
CN107527031A CN201710724937.7A CN201710724937A CN107527031A CN 107527031 A CN107527031 A CN 107527031A CN 201710724937 A CN201710724937 A CN 201710724937A CN 107527031 A CN107527031 A CN 107527031A
Authority
CN
China
Prior art keywords
module
convolution
dimension
input
branch road
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710724937.7A
Other languages
Chinese (zh)
Other versions
CN107527031B (en
Inventor
李宏亮
姚晓宇
杨燕平
陈雅丽
方清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201710724937.7A priority Critical patent/CN107527031B/en
Publication of CN107527031A publication Critical patent/CN107527031A/en
Application granted granted Critical
Publication of CN107527031B publication Critical patent/CN107527031B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/35Categorising the entire scene, e.g. birthday party or wedding scene
    • G06V20/36Indoor scenes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Image Analysis (AREA)

Abstract

A kind of indoor objects detection method based on SSD of the disclosure of the invention, belongs to field of target recognition, is a kind of new application method of the convolutional neural networks for indoor objects detection.What the present invention solved is indoor objects detection, but currently without the database for meeting this requirement of experiment marked, a database related to indoor objects is constructed for this, all images of the database are all with the openr angular samples in the visual field, meet the normal viewing angle of intelligent robot, there is otherness again in background, illumination and picture size etc..Artificial mark has been carried out to indoor frequent goal refrigerator, TV, bed, dining table, chair, sofa, tea table, closestool, Wash Units, bathing pool, cup etc.;Network and detector are extracted using the image training characteristics of acquisition, finally target to be identified is detected using the feature extraction network and detector that train.

Description

A kind of indoor objects detection method based on SSD
Technical field
Field of target recognition of the present invention, it is a kind of new application side of the convolutional neural networks for indoor objects detection Method.
Background technology
Indoor objects identification is the process for a kind of target being separated and being identified from other targets under environment indoors, The application field such as design, the retrieval of interior design plan of research and development, household monitoring system for household service robot has Important meaning.But traditional technique in measuring accuracy and speed can not all reach requirement, more to increase so there is an urgent need for one kind Imitate accurate detection method.
In computer vision field, there are all multi-methods to can be used for indoor objects detection, traditional object detection method base In features such as SIFT, HOG, using sorting techniques such as SVM, Adaboot.But the feature of these hand-designeds is often to image Sign it is limited in one's ability, cause target identification precision and positioning accuracy often relatively low, it is difficult to meet application request.And , there is good improvement in a Ge Xin branch of the deep learning as machine learning in real-time and accuracy.In recent years, depth Study is developed rapidly, and good performance is all presented in fields such as automatic speech recognition, natural language processing and image procossings. In image processing field, deep learning is widely used to image classification, target detection and semantic segmentation.Convolutional neural networks are The core of deep learning, the superperformance that convolutional neural networks obtain for deep learning are performed meritorious deeds never to be obliterated.In object detection field, The models such as Faster R-CNN, YOLO and SSD have good Detection results, and convolutional neural networks play not wherein Alternative effect.At the same time, Recognition with Recurrent Neural Network also has good development in recent years.Cyclic convolution network is for the time The processing of sequence has its unique advantage compared to convolutional neural networks.And the model such as LSTM has greatly improved cyclic convolution The gradient disappearance problem of network.
For the detection of indoor objects, there are some relatively good schemes, such as:Target inspection based on Faster R-CNN Survey, RPN (Region Proposal Network) is formed using shared convolution net, Suggestion box is directly predicted with RPN, and RPN overwhelming majority prediction is completed in GPU, convolution net and Fast R-CNN partial sharings, therefore greatly improves target inspection The speed of survey, but can not still meet to require in real time., can be with based on YOLO (You Only Look Once) target detection The position of target is directly returned out, uses preset frame mechanism due to no, YOLO accuracy of detection is not very high.In addition, also There is the detection method based on SSD (Single Shot Multibox Detector), this method is also direct regressive object Position and classification, accuracy of detection are improved with respect to YOLO, but still the mission requirements not reached.
Neutral net has millions of parameters, it is easy to the problem of over-fitting occurs.In order to solve asking for over-fitting Topic, it is general using first pre-training goes out a model on large-scale dataset, then again with specific small-scale data set to this Model is finely adjusted.Simultaneously the node of some hidden layers can be allowed not worked in training by dropout method, to reach Prevent the purpose of over-fitting.
The content of the invention
The present invention proposes a kind of improved indoor objects detection based on SSD, for family's monitoring, service robot Application scenarios.On the database of foundation, the experiment enriched, indoor common furniture is detected.
What the present invention solved is indoor objects detection, but currently without the data for meeting this requirement of experiment marked Storehouse, a database related to indoor objects is constructed for this, all images of the database are all openr with the visual field Angular samples, meet the normal viewing angle of intelligent robot, there is difference again in background, illumination and picture size etc. Property.Indoor frequent goal refrigerator, TV, bed, dining table, chair, sofa, tea table, closestool, Wash Units, bathing pool, cup etc. are carried out Artificial mark;Network and detector are extracted using the image training characteristics of acquisition, finally using the feature extraction net trained Network and detector detect to target to be identified.Thus technical solution of the present invention is a kind of indoor objects detection based on SSD Method, this method include:
Step 1:Obtain indoor target image to be detected;
Step 2:Feature extraction network is established, the global characteristics of network extraction target image are extracted using this feature;
Step 3:The global characteristics that step 2 is obtained input SSD detectors, testing result corresponding to acquisition;
It is characterized in that the feature extraction network of the step 2 includes:Three input modules, the first to the tenth a roll of product module Block, the first to the 5th pond module, two contextual information extraction modules, a normalization module;Three described input moulds Block is respectively an image to be detected and first and second zone bit information input module, and described image to be detected is as the first convolution The input of module;First convolution module, the first pond module, the second convolution module, the second pond module, the 3rd convolution module, 3rd pond module, Volume Four volume module, the 4th pond module, the 5th convolution module, the 5th pond module, the 6th convolution mould Block, the 7th convolution module, the 8th convolution module, the 9th convolution module, the tenth convolution module, the 11st convolution module level successively Connection;Extra, the output of Volume Four volume module will also be input to normalizing together with the output of the first zone bit information input module Change module, the output for then normalizing module is input to the first contextual information extraction module;Extra, the 7th convolution module Output will also be input to the second contextual information extraction module together with the output of the second zone bit information input module;Finally will The output of first and second contextual information extraction module, the 8th to the 11st convolution module is as the global characteristics extracted.
Further, described contextual information extraction module includes two convolution modules, a cascade module, a horizontal stroke To feature extraction branch road and a longitudinal feature extraction branch road;The output of first convolution module is separately input to transverse features and carried Branch road and longitudinal feature extraction branch road are taken, the output of transverse features the extraction branch road and longitudinal feature extraction branch road inputs jointly To cascade module, the then output of cascade module is input to second convolution module;Wherein cascade module is a cascading layers.Should Scheme extracts contextual information from horizontal and vertical two directions, is effectively extracted the Global Information of image.
Further, two convolution modules in the contextual information extraction module are all that convolution kernel size is 1*1, step A length of 1, it is filled with 0 convolutional layer.The program can effectively extract characteristics of image, and help to prevent network from dissipating.
Further, described transverse features branch road and longitudinal feature branch road be all by a dimension conversion module, one Dimension merging module, a cyclic convolution module, a dimension recovery module cascade composition successively;The dimension transformation module Act as each dimensional information of input picture is exchanged into position, the number of dimensions for acting as reducing image of dimension merging module The image dimension number of degrees after reduction are reverted to its original number of dimensions and order by amount, the acting as of the dimension recovery module;Follow Ring convolution module is LSTM layers (shot and long term memory network), and its direction handled is controlled by the dimension conversion module of corresponding branch road. The program takes full advantage of disposal ability of the cyclic convolution network to time series, and flexible Application to the feature of single image with carrying Take, and LSTM has good inhibition to gradient disperse.
Further, the dimension of the input picture is n*c*w*h, and wherein n represents image number, and c represents port number, w Represent that picture is wide, h represents that image is high;Dimension order is changed into h*n*w*c by the dimension conversion module in the transverse features branch road, Dimension order is changed into w*n*h*c by the dimension conversion module in longitudinal feature branch road;The dimension of two branch roads merges Module is all that second and third dimension are merged into a dimension.Program main function is in control cyclic convolution module LSTM extracts the direction of characteristics of image, and transverse features extraction branch road is to follow image as a time series input per a line Ring convolution module, longitudinal feature extraction branch road are that image is inputted into cyclic convolution module as a time series per a line.
Further, it by two convolution kernel sizes is 3*3 that the first and second convolution modules, which are all, step-length 1, is filled with 1 Convolutional layer composition;It by three convolution kernel sizes is 3*3 that 3rd to the 5th convolution module, which is, step-length 1, is filled with 1 convolution Layer composition;It by a convolution kernel size is 3*3 that 6th convolution module, which is, step-length 1, is filled with 6 convolutional layer composition;Volume seven It by a convolution kernel size is 1*1 that volume module, which is, step-length 1, is filled with 1 convolutional layer composition;8th and the 9th convolution module It by a convolution kernel size is 1*1 to be, step-length 1, is filled with 0 convolutional layer and a convolution kernel size is 3*3, step-length 2, It is filled with 1 convolutional layer composition;It by a convolution kernel size is 1*1 that 8th and the 9th convolution module, which is, step-length 1, filling Convolutional layer and a convolution kernel size for 0 are 3*3, step-length 1, are filled with 0 convolutional layer composition.The program is fully extracted The feature of image, is easy to the detector below effectively to detect target.
The present invention is improved on the basis of the method for existing detection to extraction characteristic, flexible Application circulation volume Contextual information is added in characteristic spectrum by product network.In existing method, accuracy and speed is considered, select SSD as inspection Survey scheme.SSD considers multiple features spectrum, selects the module of suitable characteristic spectrum addition, in the case where ensureing speed, will examine Survey precision to improve, the actual demand preferably met.
Brief description of the drawings
Fig. 1 is the overall network structure of the present invention;
Fig. 2 is the contextual information extraction module of the present invention;
Fig. 3 is partial test result figure.
Embodiment
Groundwork of the present invention is divided into two parts of training and test, and all working is divided into six steps:
Step 1, structure database:For it is to be studied the problem of, build house data storehouse.Image selects from indoor design website Take, visual angle is wider, and indoor frequent goal relativeness more can significantly embody.After some inappropriate images are excluded, 6000 multiple images are collected altogether, wherein 2/3rds are used as training sample, 1/3rd are used as test sample.Because sample is less, To avoid over-fitting, having carried out random stripping and slicing to image in training, mirror image etc. operates, to increase sample size.
Step 2, the detection to sample object:Image all in database is manually marked, by the mesh in image Its ground truth are marked out, that is, mark out position and the classification of target, classification is demarcated as 0,1,2....9, totally ten Classification.
Step 3, improve SSD models:For room objects, have between thing and thing, thing and environment and closely contact, they Be according to the mankind share custom place, so for room objects one of feature, improve SSD models, by global information It is added among the detection to specific objective.Using feature extraction network extraction characteristics of image, using LSTM to time series Disposal ability, model is built, the characteristic spectrum of image information is split in rows and columns respectively, be then input to cyclic convolution network and work as In handled, then carry out that processing is in parallel and convolution operation to information, the characteristic spectrum of row processing and column processing be processed into one Individual characteristic spectrum, it is then input in SSD detection network and is detected.Because high-level characteristic global information is lost seriously, and too The characteristic amount of low layer is excessive, and lengthy and jumbled information is a lot, so the present invention adds contextual information extraction module the 4th In the characteristic spectrum of the 7th two convolution module output.
Step 4, pre-training model:Because network model is larger, parameter is more, and sample is less, to prevent over-fitting, first will In ImageNet, this larger database is trained model, obtains pre-training model.
Step 5, the improved model of training:On the basis of step 4, continue to carry out model using the database of oneself Training.Data are pre-processed before training, expanding data sample.Training obtains final network model.
Step 6, test model:To training pattern carried out respectively on PASCAL VOC207 and the database of oneself Test, obtains position and the classification of detection image.The precision that the test result present invention is detected compared to SSD for indoor objects has Significantly improve, and remain to complete to detect with fast speed, the use demand that meets well.
6 steps more than, are improved the indoor objects detection method based on SSD, more fully make use of The contextual information of image, target detection capabilities of the networking for indoor scene are effectively increased, and ensure that the detection of network Speed.Partial test result is as shown in Figure 3.

Claims (6)

1. a kind of indoor objects detection method based on SSD, this method include:
Step 1:Obtain indoor target image to be detected;
Step 2:Feature extraction network is established, the global characteristics of network extraction target image are extracted using this feature;
Step 3:The global characteristics that step 2 is obtained input SSD detectors, testing result corresponding to acquisition;
It is characterized in that the feature extraction network of the step 2 includes:Three input modules, the first to the 11st convolution module, First to the 5th pond module, two contextual information extraction modules, a normalization module;Three described input modules point Not Wei an image to be detected and first and second zone bit information input module, described image to be detected is as the first convolution module Input;First convolution module, the first pond module, the second convolution module, the second pond module, the 3rd convolution module, the 3rd Pond module, Volume Four volume module, the 4th pond module, the 5th convolution module, the 5th pond module, the 6th convolution module, Seven convolution modules, the 8th convolution module, the 9th convolution module, the tenth convolution module, the 11st convolution module cascade successively;Additionally , the output of Volume Four volume module will also be input to normalization module together with the output of the first zone bit information input module, Then the output for normalizing module is input to the first contextual information extraction module;Extra, the output of the 7th convolution module is also The second contextual information extraction module is input to together with the output of the second zone bit information input module;Finally by first, The output of two contextual information extraction modules, the 8th to the 11st convolution module is as the global characteristics extracted.
A kind of 2. indoor objects detection method based on SSD as claimed in claim 1, it is characterised in that the contextual information Extraction module includes two convolution modules, a cascade module, a transverse features extraction branch road and a longitudinal feature extraction Branch road;The output of first convolution module is separately input to transverse features extraction branch road and longitudinal feature extraction branch road, the horizontal stroke Output to feature extraction branch road and longitudinal feature extraction branch road is input to cascade module jointly, and then the output of cascade module is defeated Enter to second convolution module;Wherein cascade module is a cascading layers.
A kind of 3. indoor objects detection method based on SSD as claimed in claim 1 or 2, it is characterised in that first He It by two convolution kernel sizes is 3*3 that second convolution module, which is all, step-length 1, is filled with 1 convolutional layer composition;3rd to the 5th It by three convolution kernel sizes is 3*3 that convolution module, which is, step-length 1, is filled with 1 convolutional layer composition;6th convolution module be by One convolution kernel size is 3*3, step-length 1, is filled with 6 convolutional layer composition;7th convolution module is big by a convolution kernel Small is 1*1, step-length 1, is filled with 1 convolutional layer composition;It by a convolution kernel size is 1* that 8th and the 9th convolution module, which is, 1, step-length 1, is filled with 0 convolutional layer and a convolution kernel size is 3*3, step-length 2, is filled with 1 convolutional layer composition; It by a convolution kernel size is 1*1 that 8th and the 9th convolution module, which is, step-length 1, is filled with 0 convolutional layer and a convolution kernel Size is 3*3, step-length 1, is filled with 0 convolutional layer composition.
A kind of 4. indoor objects detection method based on SSD as claimed in claim 2, it is characterised in that the contextual information Two convolution modules in extraction module are all that convolution kernel size is 1*1, step-length 1, are filled with 0 convolutional layer.
A kind of 5. indoor objects detection method based on SSD as claimed in claim 2, it is characterised in that described transverse features Branch road and longitudinal feature branch road are all by a dimension conversion module, a dimension merging module, cyclic convolution module, one Individual dimension recovery module cascades composition successively;The dimension transformation module is act as each dimensional information friendship of input picture Change place, the number of dimensions for acting as reducing image of dimension merging module, the dimension recovery module act as reducing The image dimension number of degrees afterwards revert to its original number of dimensions and order;Cyclic convolution module is LSTM layers, its handle direction by The dimension conversion module control of corresponding branch road.
6. a kind of indoor objects detection method based on SSD as claimed in claim 5, it is characterised in that the input picture Dimension is n*c*w*h, and wherein n represents image number, and c represents port number, and w represents that picture is wide, and h represents that image is high;The transverse direction Dimension order is changed into h*n*w*c by dimension conversion module in feature branch road, the dimension conversion mould in longitudinal feature branch road Dimension order is changed into w*n*h*c by block;The dimension merging module of two branch roads is all to be merged into second and third dimension In one dimension.
CN201710724937.7A 2017-08-22 2017-08-22 SSD-based indoor target detection method Active CN107527031B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710724937.7A CN107527031B (en) 2017-08-22 2017-08-22 SSD-based indoor target detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710724937.7A CN107527031B (en) 2017-08-22 2017-08-22 SSD-based indoor target detection method

Publications (2)

Publication Number Publication Date
CN107527031A true CN107527031A (en) 2017-12-29
CN107527031B CN107527031B (en) 2021-02-05

Family

ID=60681878

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710724937.7A Active CN107527031B (en) 2017-08-22 2017-08-22 SSD-based indoor target detection method

Country Status (1)

Country Link
CN (1) CN107527031B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108256567A (en) * 2018-01-12 2018-07-06 环球大数据科技有限公司 A kind of target identification method and system based on deep learning
CN108537117A (en) * 2018-03-06 2018-09-14 哈尔滨思派科技有限公司 A kind of occupant detection method and system based on deep learning
CN108710897A (en) * 2018-04-24 2018-10-26 江苏科海智能系统有限公司 A kind of online general target detecting system in distal end based on SSD-T
CN109241872A (en) * 2018-08-20 2019-01-18 电子科技大学 Image, semantic fast partition method based on multistage network
CN109284760A (en) * 2018-08-10 2019-01-29 杭州群核信息技术有限公司 A kind of furniture detection method and device based on depth convolutional neural networks
CN109308458A (en) * 2018-08-31 2019-02-05 电子科技大学 A method of small target deteection precision is promoted based on characteristic spectrum change of scale
CN109472315A (en) * 2018-11-15 2019-03-15 江苏木盟智能科技有限公司 A kind of object detection method and system separating convolution based on depth
CN109741318A (en) * 2018-12-30 2019-05-10 北京工业大学 The real-time detection method of single phase multiple dimensioned specific objective based on effective receptive field
CN109840502A (en) * 2019-01-31 2019-06-04 深兰科技(上海)有限公司 A kind of method and apparatus carrying out target detection based on SSD model
CN110222593A (en) * 2019-05-18 2019-09-10 四川弘和通讯有限公司 A kind of vehicle real-time detection method based on small-scale neural network
CN111460919A (en) * 2020-03-13 2020-07-28 华南理工大学 Monocular vision road target detection and distance estimation method based on improved YO L Ov3
CN112446372A (en) * 2020-12-08 2021-03-05 电子科技大学 Text detection method based on channel grouping attention mechanism
TWI738009B (en) * 2019-06-20 2021-09-01 和碩聯合科技股份有限公司 Object detection system and object detection method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103198489A (en) * 2013-04-27 2013-07-10 哈尔滨工业大学 Automatic detection method of salient object based on salience density and edge response
CN106845549A (en) * 2017-01-22 2017-06-13 珠海习悦信息技术有限公司 A kind of method and device of the scene based on multi-task learning and target identification
US10216949B1 (en) * 2013-09-20 2019-02-26 Amazon Technologies, Inc. Dynamic quorum membership changes

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103198489A (en) * 2013-04-27 2013-07-10 哈尔滨工业大学 Automatic detection method of salient object based on salience density and edge response
US10216949B1 (en) * 2013-09-20 2019-02-26 Amazon Technologies, Inc. Dynamic quorum membership changes
CN106845549A (en) * 2017-01-22 2017-06-13 珠海习悦信息技术有限公司 A kind of method and device of the scene based on multi-task learning and target identification

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108256567B (en) * 2018-01-12 2020-08-25 环球大数据科技有限公司 Target identification method and system based on deep learning
CN108256567A (en) * 2018-01-12 2018-07-06 环球大数据科技有限公司 A kind of target identification method and system based on deep learning
CN108537117A (en) * 2018-03-06 2018-09-14 哈尔滨思派科技有限公司 A kind of occupant detection method and system based on deep learning
CN108710897A (en) * 2018-04-24 2018-10-26 江苏科海智能系统有限公司 A kind of online general target detecting system in distal end based on SSD-T
CN109284760A (en) * 2018-08-10 2019-01-29 杭州群核信息技术有限公司 A kind of furniture detection method and device based on depth convolutional neural networks
CN109241872A (en) * 2018-08-20 2019-01-18 电子科技大学 Image, semantic fast partition method based on multistage network
CN109241872B (en) * 2018-08-20 2022-03-18 电子科技大学 Image semantic fast segmentation method based on multistage network
CN109308458A (en) * 2018-08-31 2019-02-05 电子科技大学 A method of small target deteection precision is promoted based on characteristic spectrum change of scale
CN109308458B (en) * 2018-08-31 2022-03-15 电子科技大学 Method for improving small target detection precision based on characteristic spectrum scale transformation
CN109472315B (en) * 2018-11-15 2021-09-24 江苏木盟智能科技有限公司 Target detection method and system based on depth separable convolution
CN109472315A (en) * 2018-11-15 2019-03-15 江苏木盟智能科技有限公司 A kind of object detection method and system separating convolution based on depth
CN109741318A (en) * 2018-12-30 2019-05-10 北京工业大学 The real-time detection method of single phase multiple dimensioned specific objective based on effective receptive field
CN109741318B (en) * 2018-12-30 2022-03-29 北京工业大学 Real-time detection method of single-stage multi-scale specific target based on effective receptive field
CN109840502B (en) * 2019-01-31 2021-06-15 深兰科技(上海)有限公司 Method and device for target detection based on SSD model
CN109840502A (en) * 2019-01-31 2019-06-04 深兰科技(上海)有限公司 A kind of method and apparatus carrying out target detection based on SSD model
CN110222593A (en) * 2019-05-18 2019-09-10 四川弘和通讯有限公司 A kind of vehicle real-time detection method based on small-scale neural network
TWI738009B (en) * 2019-06-20 2021-09-01 和碩聯合科技股份有限公司 Object detection system and object detection method
CN111460919A (en) * 2020-03-13 2020-07-28 华南理工大学 Monocular vision road target detection and distance estimation method based on improved YO L Ov3
CN111460919B (en) * 2020-03-13 2023-05-26 华南理工大学 Monocular vision road target detection and distance estimation method based on improved YOLOv3
CN112446372A (en) * 2020-12-08 2021-03-05 电子科技大学 Text detection method based on channel grouping attention mechanism

Also Published As

Publication number Publication date
CN107527031B (en) 2021-02-05

Similar Documents

Publication Publication Date Title
CN107527031A (en) A kind of indoor objects detection method based on SSD
CN111126202B (en) Optical remote sensing image target detection method based on void feature pyramid network
Li et al. Building-a-nets: Robust building extraction from high-resolution remote sensing images with adversarial networks
CN107316058A (en) Improve the method for target detection performance by improving target classification and positional accuracy
CN107358257B (en) Under a kind of big data scene can incremental learning image classification training method
CN109410242A (en) Method for tracking target, system, equipment and medium based on double-current convolutional neural networks
CN103871100B (en) Tree modelling method for reconstructing based on a cloud Yu data-driven
CN109446970A (en) A kind of Intelligent Mobile Robot road scene recognition methods based on deep learning
CN105631413A (en) Cross-scene pedestrian searching method based on depth learning
CN107154043A (en) A kind of Lung neoplasm false positive sample suppressing method based on 3DCNN
Han et al. A new method in wheel hub surface defect detection: Object detection algorithm based on deep learning
CN105740773A (en) Deep learning and multi-scale information based behavior identification method
CN109614488A (en) Distribution network live line work condition distinguishing method based on text classification and image recognition
CN109410190B (en) Tower pole reverse-breaking detection model training method based on high-resolution remote sensing satellite image
CN105868706A (en) Method for identifying 3D model based on sparse coding
CN107958067A (en) It is a kind of based on without mark Automatic Feature Extraction extensive electric business picture retrieval system
CN112818777B (en) Remote sensing image target detection method based on dense connection and feature enhancement
CN107527065A (en) A kind of flower variety identification model method for building up based on convolutional neural networks
Ning et al. Point-voxel and bird-eye-view representation aggregation network for single stage 3D object detection
Ouyang et al. Aerial target detection based on the improved YOLOv3 algorithm
Li et al. A hierarchical deformable deep neural network and an aerial image benchmark dataset for surface multiview stereo reconstruction
Zhang et al. Recognition and detection of wolfberry in the natural background based on improved YOLOv5 network
CN107578445A (en) Image discriminant region extracting method based on convolution characteristic spectrum
Li et al. Detecting Plant Leaves Based on Vision Transformer Enhanced YOLOv5
Li et al. Ctmu-net: an improved u-net for semantic segmentation of remote-sensing images based on the combined attention mechanism

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant