CN107506792A - A kind of semi-supervised notable method for checking object - Google Patents
A kind of semi-supervised notable method for checking object Download PDFInfo
- Publication number
- CN107506792A CN107506792A CN201710702593.XA CN201710702593A CN107506792A CN 107506792 A CN107506792 A CN 107506792A CN 201710702593 A CN201710702593 A CN 201710702593A CN 107506792 A CN107506792 A CN 107506792A
- Authority
- CN
- China
- Prior art keywords
- notable
- prediction
- mrow
- spectrum
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 238000001228 spectrum Methods 0.000 claims abstract description 60
- 238000012549 training Methods 0.000 claims abstract description 33
- 238000001514 detection method Methods 0.000 claims abstract description 30
- 230000011218 segmentation Effects 0.000 claims abstract description 22
- 238000012360 testing method Methods 0.000 claims description 16
- 230000017105 transposition Effects 0.000 claims description 12
- 230000006870 function Effects 0.000 claims description 9
- 238000005070 sampling Methods 0.000 claims description 4
- 238000003475 lamination Methods 0.000 claims description 3
- 239000011159 matrix material Substances 0.000 claims description 3
- 239000000284 extract Substances 0.000 claims description 2
- 238000005457 optimization Methods 0.000 abstract 1
- 238000013528 artificial neural network Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2155—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (4)
- A kind of 1. semi-supervised notable method for checking object, based on improved faster rcnn RPN network models, its feature It is:Described RPN network models include obj ect detection module and notable prediction module;Described obj ect detection module and significantly One shared convolutional layer of inputs sharing of prediction module;Described notable prediction module includes first convolutional layer, a sigmoid layer and three transposition convolutional layers, shared volume The output end of lamination is composed by the first convolutional layer output characteristic, and described characteristic spectrum obtains one by sigmoid layers and initially shown Write prediction spectrum s, described initial significantly prediction spectrum s and a significantly prediction spectrum is exported after the up-sampling of three transposition convolutional layers sal;In the training stage, described notable prediction module also includes the Euclidean loss layers one after transposition convolutional layer;Described obj ect detection module includes second convolutional layer, a ReLu layer and a full articulamentum, shares convolutional layer Output end be sequentially connected with the second convolutional layer and ReLu layers, ReLu layers output characteristic spectrum F={ f1,f2,...,fnAnd it is initial Significantly prediction spectrum s obtains notable feature spectrum FS={ fs after carrying out func function operations1,fs2,...,fsn, described func letters Number represents that two matrix corresponding elements are multiplied;Full articulamentum extracts prediction block feature from notable feature spectrum, is predicted frame Position prediction and class prediction;Described method comprises the following steps:S1:Using object box information, initial segmentation is carried out to object box region in picture, and segmentation result is preserved, is made It is normative reference for the initial groundtruth of object;S2:The detection classification number of obj ect detection module and the initial learning rate of network are set;S3:The described network of training, optimize network losses L using stochastic gradient descent algorithm and update network, wherein learning rate It is updated per iteration n times;S4:The network trained to step S3, shared convolutional layer using notable prediction module and above are right as test network Picture is trained to carry out notable object prediction;S5:Super-pixel segmentation is carried out to training picture, sal is composed in the notable prediction obtained using super-pixel segmentation result to step S4 The smooth of super-pixel level is carried out, and binarization operation is carried out to the spectrum after smooth, finally gives the object prospect spectrum of binaryzation;S6:The object prospect spectrum of the binaryzation obtained by the use of step S5 is used as groundtruth, using splitting in alternative steps S1 The foreground object region that method obtains, and learning rate is arranged to the value of learning rate at the end of network training in step S3;S7:Repeat step S3~S6, until network training reaches promising result.
- A kind of 2. semi-supervised notable method for checking object according to claim 1, it is characterised in that:Described method is also Including:S8:The model completed to training is tested:Test network is obtained by the way of step S4, test image is input to In test network, test network exports notable object prediction spectrum, and super-pixel is carried out to significantly prediction spectrum using step S5 mode Level is smooth.
- A kind of 3. semi-supervised notable method for checking object according to claim 1, it is characterised in that:Two-value in step S5 Changing operation is:The pixel that will be above threshold value σ is set to 1, and the pixel less than threshold value σ is set to 0;The object prospect spectrum of obtained binaryzation In, subject area pixel is 1, background area pixels 0.
- A kind of 4. semi-supervised notable method for checking object according to claim 1, it is characterised in that:Described in step S3 Network losses L formula be:<mrow> <mi>L</mi> <mo>=</mo> <msub> <mi>L</mi> <mrow> <mi>c</mi> <mi>l</mi> <mi>s</mi> </mrow> </msub> <mrow> <mo>(</mo> <msub> <mi>p</mi> <mi>i</mi> </msub> <mo>,</mo> <msubsup> <mi>p</mi> <mi>i</mi> <mo>*</mo> </msubsup> <mo>)</mo> </mrow> <mo>+</mo> <msub> <mi>&alpha;L</mi> <mrow> <mi>l</mi> <mi>o</mi> <mi>c</mi> </mrow> </msub> <mrow> <mo>(</mo> <msub> <mi>t</mi> <mi>i</mi> </msub> <mo>,</mo> <msubsup> <mi>t</mi> <mi>i</mi> <mo>*</mo> </msubsup> <mo>)</mo> </mrow> <mo>+</mo> <mi>&beta;</mi> <mi>L</mi> <mrow> <mo>(</mo> <mi>s</mi> <mi>a</mi> <mi>l</mi> <mo>,</mo> <mi>g</mi> <mi>t</mi> <mo>)</mo> </mrow> </mrow>In formula, first two two losses for representing RPN networks in faster rcnn,Represent prediction block confidence level damage Lose, piRepresent that the confidence level that i-th of prediction block includes object is given a mark,True classification is represented,Represent prediction block with it is true Position loss between real frame, tiThe position coordinates of i-th of prediction block is represented,The position coordinates of true frame is represented, α is weight system Number;Section 3 is lost for significantly prediction, is provided by Euclidean loss layers, β is weight coefficient;Wherein:<mrow> <mi>L</mi> <mrow> <mo>(</mo> <mi>s</mi> <mi>a</mi> <mi>l</mi> <mo>,</mo> <mi>g</mi> <mi>t</mi> <mo>)</mo> </mrow> <mo>=</mo> <mfrac> <mn>1</mn> <mrow> <mn>2</mn> <mi>N</mi> </mrow> </mfrac> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>N</mi> </msubsup> <msup> <mrow> <mo>(</mo> <msub> <mi>sal</mi> <mi>i</mi> </msub> <mo>-</mo> <msub> <mi>gt</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mn>2</mn> </msup> </mrow>In formula, gt is a two-value spectrum, represents the groundtruth of image object;In the starting stage, gt is to image object frame The two-value spectrum that region segmentation obtains, N are the number of gt pixels.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710702593.XA CN107506792B (en) | 2017-08-16 | 2017-08-16 | Semi-supervised salient object detection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710702593.XA CN107506792B (en) | 2017-08-16 | 2017-08-16 | Semi-supervised salient object detection method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107506792A true CN107506792A (en) | 2017-12-22 |
CN107506792B CN107506792B (en) | 2020-09-29 |
Family
ID=60691130
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710702593.XA Active CN107506792B (en) | 2017-08-16 | 2017-08-16 | Semi-supervised salient object detection method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107506792B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108428238A (en) * | 2018-03-02 | 2018-08-21 | 南开大学 | A kind of detection method general based on the polymorphic type task of depth network |
CN108629279A (en) * | 2018-03-27 | 2018-10-09 | 哈尔滨理工大学 | A method of the vehicle target detection based on convolutional neural networks |
CN109145902A (en) * | 2018-08-21 | 2019-01-04 | 武汉大学 | A method of geometry is identified and positioned using extensive feature |
CN110674807A (en) * | 2019-08-06 | 2020-01-10 | 中国科学院信息工程研究所 | Curved scene character detection method based on semi-supervised and weakly supervised learning |
CN111160180A (en) * | 2019-12-16 | 2020-05-15 | 浙江工业大学 | Night green apple identification method of apple picking robot |
CN111837229A (en) * | 2018-03-19 | 2020-10-27 | 科磊股份有限公司 | Semi-supervised anomaly detection in scanning electron microscope images |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104537676A (en) * | 2015-01-12 | 2015-04-22 | 南京大学 | Gradual image segmentation method based on online learning |
CN105608690A (en) * | 2015-12-05 | 2016-05-25 | 陕西师范大学 | Graph theory and semi supervised learning combination-based image segmentation method |
CN106157319A (en) * | 2016-07-28 | 2016-11-23 | 哈尔滨工业大学 | The significance detection method that region based on convolutional neural networks and Pixel-level merge |
CN106296638A (en) * | 2015-06-04 | 2017-01-04 | 欧姆龙株式会社 | Significance information acquisition device and significance information acquisition method |
CN106529419A (en) * | 2016-10-20 | 2017-03-22 | 北京航空航天大学 | Automatic detection method for significant stack type polymerization object in video |
-
2017
- 2017-08-16 CN CN201710702593.XA patent/CN107506792B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104537676A (en) * | 2015-01-12 | 2015-04-22 | 南京大学 | Gradual image segmentation method based on online learning |
CN106296638A (en) * | 2015-06-04 | 2017-01-04 | 欧姆龙株式会社 | Significance information acquisition device and significance information acquisition method |
CN105608690A (en) * | 2015-12-05 | 2016-05-25 | 陕西师范大学 | Graph theory and semi supervised learning combination-based image segmentation method |
CN106157319A (en) * | 2016-07-28 | 2016-11-23 | 哈尔滨工业大学 | The significance detection method that region based on convolutional neural networks and Pixel-level merge |
CN106529419A (en) * | 2016-10-20 | 2017-03-22 | 北京航空航天大学 | Automatic detection method for significant stack type polymerization object in video |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108428238A (en) * | 2018-03-02 | 2018-08-21 | 南开大学 | A kind of detection method general based on the polymorphic type task of depth network |
CN108428238B (en) * | 2018-03-02 | 2022-02-15 | 南开大学 | Multi-type task general detection method based on deep network |
CN111837229A (en) * | 2018-03-19 | 2020-10-27 | 科磊股份有限公司 | Semi-supervised anomaly detection in scanning electron microscope images |
TWI803595B (en) * | 2018-03-19 | 2023-06-01 | 美商克萊譚克公司 | Systems and methods of wafer inspection, and related non-transitory computer-readable storage medium |
CN108629279A (en) * | 2018-03-27 | 2018-10-09 | 哈尔滨理工大学 | A method of the vehicle target detection based on convolutional neural networks |
CN109145902A (en) * | 2018-08-21 | 2019-01-04 | 武汉大学 | A method of geometry is identified and positioned using extensive feature |
CN109145902B (en) * | 2018-08-21 | 2021-09-03 | 武汉大学 | Method for recognizing and positioning geometric identification by using generalized characteristics |
CN110674807A (en) * | 2019-08-06 | 2020-01-10 | 中国科学院信息工程研究所 | Curved scene character detection method based on semi-supervised and weakly supervised learning |
CN111160180A (en) * | 2019-12-16 | 2020-05-15 | 浙江工业大学 | Night green apple identification method of apple picking robot |
Also Published As
Publication number | Publication date |
---|---|
CN107506792B (en) | 2020-09-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112966684B (en) | Cooperative learning character recognition method under attention mechanism | |
CN107506792A (en) | A kind of semi-supervised notable method for checking object | |
Yue et al. | A deep learning framework for hyperspectral image classification using spatial pyramid pooling | |
CN105512289B (en) | Image search method based on deep learning and Hash | |
CN106980858B (en) | Language text detection and positioning system and language text detection and positioning method using same | |
Liu et al. | ARC-Net: An efficient network for building extraction from high-resolution aerial images | |
CN110210551A (en) | A kind of visual target tracking method based on adaptive main body sensitivity | |
CN104462494B (en) | A kind of remote sensing image retrieval method and system based on unsupervised feature learning | |
CN107909015A (en) | Hyperspectral image classification method based on convolutional neural networks and empty spectrum information fusion | |
CN109614985A (en) | A kind of object detection method based on intensive connection features pyramid network | |
CN108549893A (en) | A kind of end-to-end recognition methods of the scene text of arbitrary shape | |
CN108898145A (en) | A kind of image well-marked target detection method of combination deep learning | |
CN106372648A (en) | Multi-feature-fusion-convolutional-neural-network-based plankton image classification method | |
CN105825502B (en) | A kind of Weakly supervised method for analyzing image of the dictionary study based on conspicuousness guidance | |
CN102651128B (en) | Image set partitioning method based on sampling | |
CN108921198A (en) | commodity image classification method, server and system based on deep learning | |
CN106295613A (en) | A kind of unmanned plane target localization method and system | |
CN109785344A (en) | The remote sensing image segmentation method of binary channel residual error network based on feature recalibration | |
CN109948593A (en) | Based on the MCNN people counting method for combining global density feature | |
Zhu et al. | ED-Net: Automatic building extraction from high-resolution aerial images with boundary information | |
CN111353544B (en) | Improved Mixed Pooling-YOLOV 3-based target detection method | |
CN106408030A (en) | SAR image classification method based on middle lamella semantic attribute and convolution neural network | |
CN110334718A (en) | A kind of two-dimensional video conspicuousness detection method based on shot and long term memory | |
Zhang et al. | Research on mine vehicle tracking and detection technology based on YOLOv5 | |
CN104537684A (en) | Real-time moving object extraction method in static scene |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200901 Address after: Room d1-35, block C, 7 / F, building 1, electronic industrial park, 8 Gaoke Road, high tech Zone, Nanning City, Guangxi Zhuang Autonomous Region Applicant after: Guangxi Hefu Intelligent Technology Co.,Ltd. Address before: Room 101, Building 11, No. 1158 Central Road, Songjiang District, Songjiang District, Shanghai 201600 Applicant before: SHANGHAI HEFU ARTIFICIAL INTELLIGENCE TECHNOLOGY (Group) Co.,Ltd. Applicant before: CHENGDU JISHENG INTELLIGENTIZE ENGINEERING Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240326 Address after: 201600 Room 101, building 11, 1158 Zhongxin Road, Songjiang District, Shanghai Patentee after: SHANGHAI HEFU ARTIFICIAL INTELLIGENCE TECHNOLOGY (Group) Co.,Ltd. Country or region after: China Address before: Building 1, Electronics Industry Park, No. 8 Gaoke Road, High tech Zone, Nanning City, Guangxi Zhuang Autonomous Region, 530000, North of Building C, Building 7, D1-35 Patentee before: Guangxi Hefu Intelligent Technology Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right |