CN110533041A - Multiple dimensioned scene text detection method based on recurrence - Google Patents
Multiple dimensioned scene text detection method based on recurrence Download PDFInfo
- Publication number
- CN110533041A CN110533041A CN201910838235.0A CN201910838235A CN110533041A CN 110533041 A CN110533041 A CN 110533041A CN 201910838235 A CN201910838235 A CN 201910838235A CN 110533041 A CN110533041 A CN 110533041A
- Authority
- CN
- China
- Prior art keywords
- convolution
- module
- filled
- length
- convolution kernel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 46
- 238000000034 method Methods 0.000 claims abstract description 23
- 238000012549 training Methods 0.000 claims abstract description 10
- 230000004927 fusion Effects 0.000 claims abstract description 7
- 230000008569 process Effects 0.000 claims abstract description 6
- 239000000284 extract Substances 0.000 claims abstract description 4
- 238000010276 construction Methods 0.000 claims abstract description 3
- 238000013528 artificial neural network Methods 0.000 claims description 15
- 230000000306 recurrent effect Effects 0.000 claims description 11
- 238000001228 spectrum Methods 0.000 claims description 8
- 238000000605 extraction Methods 0.000 claims description 4
- 230000006403 short-term memory Effects 0.000 claims description 4
- 238000010606 normalization Methods 0.000 claims description 3
- 238000012545 processing Methods 0.000 abstract description 2
- 238000013461 design Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000000155 melt Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910838235.0A CN110533041B (en) | 2019-09-05 | 2019-09-05 | Regression-based multi-scale scene text detection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910838235.0A CN110533041B (en) | 2019-09-05 | 2019-09-05 | Regression-based multi-scale scene text detection method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110533041A true CN110533041A (en) | 2019-12-03 |
CN110533041B CN110533041B (en) | 2022-07-01 |
Family
ID=68667081
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910838235.0A Active CN110533041B (en) | 2019-09-05 | 2019-09-05 | Regression-based multi-scale scene text detection method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110533041B (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200005141A1 (en) * | 2018-06-29 | 2020-01-02 | Utechzone Co., Ltd. | Automated optical inspection and classification apparatus based on a deep learning system and training apparatus thereof |
CN111259764A (en) * | 2020-01-10 | 2020-06-09 | 中国科学技术大学 | Text detection method and device, electronic equipment and storage device |
CN111881943A (en) * | 2020-07-08 | 2020-11-03 | 泰康保险集团股份有限公司 | Method, device, equipment and computer readable medium for image classification |
CN112287962A (en) * | 2020-08-10 | 2021-01-29 | 南京行者易智能交通科技有限公司 | Training method, detection method and device of multi-scale target detection model, and terminal equipment |
CN113159079A (en) * | 2020-01-07 | 2021-07-23 | 顺丰科技有限公司 | Target detection method, target detection device, computer equipment and storage medium |
CN113408525A (en) * | 2021-06-17 | 2021-09-17 | 成都崇瑚信息技术有限公司 | Multilayer ternary pivot and bidirectional long-short term memory fused text recognition method |
CN115393868A (en) * | 2022-08-18 | 2022-11-25 | 中化现代农业有限公司 | Text detection method and device, electronic equipment and storage medium |
CN116704248A (en) * | 2023-06-07 | 2023-09-05 | 南京大学 | Serum sample image classification method based on multi-semantic unbalanced learning |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105631426A (en) * | 2015-12-29 | 2016-06-01 | 中国科学院深圳先进技术研究院 | Image text detection method and device |
CN107578060A (en) * | 2017-08-14 | 2018-01-12 | 电子科技大学 | A kind of deep neural network based on discriminant region is used for the method for vegetable image classification |
CN107688808A (en) * | 2017-08-07 | 2018-02-13 | 电子科技大学 | A kind of quickly natural scene Method for text detection |
CN108549893A (en) * | 2018-04-04 | 2018-09-18 | 华中科技大学 | A kind of end-to-end recognition methods of the scene text of arbitrary shape |
CN108734169A (en) * | 2018-05-21 | 2018-11-02 | 南京邮电大学 | One kind being based on the improved scene text extracting method of full convolutional network |
CN109086663A (en) * | 2018-06-27 | 2018-12-25 | 大连理工大学 | The natural scene Method for text detection of dimension self-adaption based on convolutional neural networks |
CN109271967A (en) * | 2018-10-16 | 2019-01-25 | 腾讯科技(深圳)有限公司 | The recognition methods of text and device, electronic equipment, storage medium in image |
CN109299274A (en) * | 2018-11-07 | 2019-02-01 | 南京大学 | A kind of natural scene Method for text detection based on full convolutional neural networks |
US20190180154A1 (en) * | 2017-12-13 | 2019-06-13 | Abbyy Development Llc | Text recognition using artificial intelligence |
EP3534298A1 (en) * | 2018-02-26 | 2019-09-04 | Capital One Services, LLC | Dual stage neural network pipeline systems and methods |
-
2019
- 2019-09-05 CN CN201910838235.0A patent/CN110533041B/en active Active
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105631426A (en) * | 2015-12-29 | 2016-06-01 | 中国科学院深圳先进技术研究院 | Image text detection method and device |
CN107688808A (en) * | 2017-08-07 | 2018-02-13 | 电子科技大学 | A kind of quickly natural scene Method for text detection |
CN107578060A (en) * | 2017-08-14 | 2018-01-12 | 电子科技大学 | A kind of deep neural network based on discriminant region is used for the method for vegetable image classification |
US20190180154A1 (en) * | 2017-12-13 | 2019-06-13 | Abbyy Development Llc | Text recognition using artificial intelligence |
EP3534298A1 (en) * | 2018-02-26 | 2019-09-04 | Capital One Services, LLC | Dual stage neural network pipeline systems and methods |
CN108549893A (en) * | 2018-04-04 | 2018-09-18 | 华中科技大学 | A kind of end-to-end recognition methods of the scene text of arbitrary shape |
CN108734169A (en) * | 2018-05-21 | 2018-11-02 | 南京邮电大学 | One kind being based on the improved scene text extracting method of full convolutional network |
CN109086663A (en) * | 2018-06-27 | 2018-12-25 | 大连理工大学 | The natural scene Method for text detection of dimension self-adaption based on convolutional neural networks |
CN109271967A (en) * | 2018-10-16 | 2019-01-25 | 腾讯科技(深圳)有限公司 | The recognition methods of text and device, electronic equipment, storage medium in image |
CN109299274A (en) * | 2018-11-07 | 2019-02-01 | 南京大学 | A kind of natural scene Method for text detection based on full convolutional neural networks |
Non-Patent Citations (4)
Title |
---|
WENHAO HE等: "Deep Direct Regression for Multi-oriented Scene Text Detection", 《2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION》 * |
方清: "基于深度学习的自然场景文本检测与识别", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
杨小栋: "基于深度特征的多方向场景文字检测", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
雷绮仑: "多方向自然场景文本提取方法研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200005141A1 (en) * | 2018-06-29 | 2020-01-02 | Utechzone Co., Ltd. | Automated optical inspection and classification apparatus based on a deep learning system and training apparatus thereof |
US11455528B2 (en) * | 2018-06-29 | 2022-09-27 | Utechzone Co., Ltd. | Automated optical inspection and classification apparatus based on a deep learning system and training apparatus thereof |
CN113159079A (en) * | 2020-01-07 | 2021-07-23 | 顺丰科技有限公司 | Target detection method, target detection device, computer equipment and storage medium |
CN111259764A (en) * | 2020-01-10 | 2020-06-09 | 中国科学技术大学 | Text detection method and device, electronic equipment and storage device |
CN111881943A (en) * | 2020-07-08 | 2020-11-03 | 泰康保险集团股份有限公司 | Method, device, equipment and computer readable medium for image classification |
CN112287962A (en) * | 2020-08-10 | 2021-01-29 | 南京行者易智能交通科技有限公司 | Training method, detection method and device of multi-scale target detection model, and terminal equipment |
CN112287962B (en) * | 2020-08-10 | 2023-06-09 | 南京行者易智能交通科技有限公司 | Training method, detection method and device for multi-scale target detection model, and terminal equipment |
CN113408525A (en) * | 2021-06-17 | 2021-09-17 | 成都崇瑚信息技术有限公司 | Multilayer ternary pivot and bidirectional long-short term memory fused text recognition method |
CN115393868A (en) * | 2022-08-18 | 2022-11-25 | 中化现代农业有限公司 | Text detection method and device, electronic equipment and storage medium |
CN116704248A (en) * | 2023-06-07 | 2023-09-05 | 南京大学 | Serum sample image classification method based on multi-semantic unbalanced learning |
Also Published As
Publication number | Publication date |
---|---|
CN110533041B (en) | 2022-07-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110533041A (en) | Multiple dimensioned scene text detection method based on recurrence | |
CN110334705B (en) | Language identification method of scene text image combining global and local information | |
Yuan et al. | Gated CNN: Integrating multi-scale feature layers for object detection | |
CN111639544B (en) | Expression recognition method based on multi-branch cross-connection convolutional neural network | |
CN110083700A (en) | A kind of enterprise's public sentiment sensibility classification method and system based on convolutional neural networks | |
CN108537269B (en) | Weak interactive object detection deep learning method and system thereof | |
CN110287960A (en) | The detection recognition method of curve text in natural scene image | |
CN109858488A (en) | A kind of handwriting samples recognition methods and system based on sample enhancing | |
CN108830334A (en) | A kind of fine granularity target-recognition method based on confrontation type transfer learning | |
CN110866542B (en) | Depth representation learning method based on feature controllable fusion | |
CN109886141A (en) | A kind of pedestrian based on uncertainty optimization discrimination method again | |
CN106919920A (en) | Scene recognition method based on convolution feature and spatial vision bag of words | |
CN110414344A (en) | A kind of human classification method, intelligent terminal and storage medium based on video | |
CN108427740B (en) | Image emotion classification and retrieval algorithm based on depth metric learning | |
CN111598183A (en) | Multi-feature fusion image description method | |
CN106919710A (en) | A kind of dialect sorting technique based on convolutional neural networks | |
CN112507904B (en) | Real-time classroom human body posture detection method based on multi-scale features | |
CN109344898A (en) | Convolutional neural networks image classification method based on sparse coding pre-training | |
CN110070106A (en) | Smog detection method, device and electronic equipment | |
CN106874929A (en) | A kind of pearl sorting technique based on deep learning | |
CN111723667A (en) | Human body joint point coordinate-based intelligent lamp pole crowd behavior identification method and device | |
CN109726671A (en) | The action identification method and system of expression study from the overall situation to category feature | |
Agrawal et al. | Image caption generator using attention mechanism | |
CN116150747A (en) | Intrusion detection method and device based on CNN and SLTM | |
CN114398485B (en) | Expert portrait construction method and device based on multi-view fusion |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240623 Address after: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province Patentee after: Shenzhen Hongyue Enterprise Management Consulting Co.,Ltd. Country or region after: China Address before: 400065 Chongqing Nan'an District huangjuezhen pass Chongwen Road No. 2 Patentee before: CHONGQING University OF POSTS AND TELECOMMUNICATIONS Country or region before: China |
|
TR01 | Transfer of patent right |
Effective date of registration: 20240625 Address after: 200030, Room 901-1606, Building 4, No. 2377 Shenkun Road, Minhang District, Shanghai Patentee after: Shanghai Jinming Information Technology Co.,Ltd. Country or region after: China Address before: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province Patentee before: Shenzhen Hongyue Enterprise Management Consulting Co.,Ltd. Country or region before: China |