CN109670495A - A kind of method and system of the length text detection based on deep neural network - Google Patents

A kind of method and system of the length text detection based on deep neural network Download PDF

Info

Publication number
CN109670495A
CN109670495A CN201811528135.XA CN201811528135A CN109670495A CN 109670495 A CN109670495 A CN 109670495A CN 201811528135 A CN201811528135 A CN 201811528135A CN 109670495 A CN109670495 A CN 109670495A
Authority
CN
China
Prior art keywords
text
frame
class
region
length
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811528135.XA
Other languages
Chinese (zh)
Inventor
夏路遥
黄贤俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yuan Heng Technology Co Ltd
Original Assignee
Shenzhen Yuan Heng Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Yuan Heng Technology Co Ltd filed Critical Shenzhen Yuan Heng Technology Co Ltd
Priority to CN201811528135.XA priority Critical patent/CN109670495A/en
Publication of CN109670495A publication Critical patent/CN109670495A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The method and system for the length text detection based on deep neural network that the invention discloses a kind of, packet are included: extracting characteristic pattern to original image;The first kind rectangular area of several default length-width ratios is enumerated from characteristic pattern based on Faster RCNN, the prospect or background, the real estate for predicting first kind rectangular area of first kind rectangular area is predicted, obtains all first kind text boxes;Based on CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle region of indefinite length, if the second class rectangle region be it is text filed, access RNN recirculating network and obtain the second class text frame;Merge first kind text box and the second class text frame based on non-maxima suppression.The advantages of present invention incorporates Faster RCNN and CTPN, and merged the detection content of the two based on non-maxima suppression logic by certain rule, so that the recall rate and accuracy rate of text detection are all improved.

Description

A kind of method and system of the length text detection based on deep neural network
Technical field
The present invention relates to text detection techniques fields, and in particular to a kind of length text detection based on deep neural network Method and system.
Background technique
Currently, there is the demand for the Pictures Electronics with text for largely providing user in the market, such demand needs Text detection in picture is come out, and is identified as digitized text.The work of this part needs a large amount of manpower in the past, and present It is then this technology that electronic data is converted picture into using OCR technique.Current OCR technique is divided into detection and identification two Module, this technology are mainly used to improve the performance of text detection module, the accuracy rate that the boundary for promoting text is returned.
Text detection techniques, what is be based primarily upon is to develop preferable depth convolutional neural networks at present to be detected, packet Include Faster RCNN frame and CTPN detection framework;Wherein:
The basic version of Faster RCNN frame has biggish object relatively high accuracy rate.The process of frame Are as follows: I. extracts feature to picture, and II. enumerates a large amount of rectangle to attempt to return out corresponding object, the square that III. will be enumerated Shape is divided into 2 classes: comprising target and the biggish positive sample of intersection and other negative samples, IV. cuts out positive sample from characteristic pattern Come, the boundary of regressive object is then gone to according to characteristic pattern;
Occur for the CTPN testing stand of text class inspection optimization for text is always horizontal, and length is unfixed existing As use following process: I. extracts feature to picture, and II. enumerates a large amount of small rectangle, is different from Faster RCNN, American series Small rectangle can fixed width, and height will use different scales, such as: (11 pixels to 273 pixels, every time multiplied by fixation Ratio, totally 10 different scales), III is attached using the small scale text that RNN recirculating network will test, obtains text Row.IV. using CNN+RNN, training method, support are multiple dimensioned end to end.
Above-mentioned two detection model there are the shortcomings that are as follows:
Faster RCNN detection model is in context of detection, because text has that gap different in size is larger, it is difficult to It determines that the height for enumerating anchor is wide, it is poor to there is a problem of that boundary returns to the great text of length-width ratio gap.
CTPN detection model returns preferably boundary in context of detection, but for there are in the case where text overlapping, There can be the case where text box loss, it is difficult to accomplish to give all texts for change.
Summary of the invention
Aiming at the shortcomings existing in the above problems, the present invention provides a kind of long short text based on deep neural network The method and system of detection.
The method for the length text detection based on deep neural network that the invention discloses a kind of, comprising:
Original image is selected, characteristic pattern is extracted to original image;
The first kind rectangular area of several default length-width ratios, prediction first are enumerated from characteristic pattern based on Faster RCNN The prospect or background in class rectangle region, the real estate for predicting first kind rectangular area, obtain all first kind text boxes;
Based on CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle region of indefinite length, the second class is judged Rectangular area is text or non-textual region;If the second class rectangle region be it is text filed, access RNN recirculating network and obtain the Two class text frames;
Merge the first kind text box and the second class text frame based on non-maxima suppression.
As a further improvement of the present invention, described that several default length and width are enumerated from characteristic pattern based on Faster RCNN The first kind rectangular area of ratio, comprising:
Original image abstract characteristics are extracted based on depth convolutional network;
The candidate region of using area candidate network recommendation original image;
The accurate region of text is returned from candidate region.
As a further improvement of the present invention, the CTPN that is based on is from several predetermined widths of feature enumeration of graph, indefinite length The second class rectangle region, comprising:
Characteristic image is generated using network;
Candidate the second class rectangle region enumerated is generated using fixed width and indefinite length;
The non-textual prediction of text is carried out to the second class rectangle region.
As a further improvement of the present invention, described that the first kind text box and second are merged based on non-maxima suppression Class text frame, comprising:
Framed score is sorted, best result and its corresponding frame is chosen, is set as valid frame;
Remaining frame is traversed, if the overlapping area with current best result frame is greater than certain threshold value, deletes valid frame;
The frame for continuing to select a highest scoring from untreated frame, repeats the above process.
The system for the length text detection based on deep neural network that the present invention also provides a kind of, comprising:
Extraction module extracts characteristic pattern to original image for selecting original image;
Faster RCNN processing module, for enumerating several default length-width ratios from characteristic pattern based on Faster RCNN First kind rectangular area is predicted the prospect or background, the real estate for predicting first kind rectangular area of first kind rectangular area, is obtained To all first kind text boxes;
CTPN processing module, for based on CTPN from several predetermined widths of feature enumeration of graph, the second class square of indefinite length Shape region judges the second class rectangle region for text or non-textual region;If the second class rectangle region be it is text filed, access RNN recirculating network obtains the second class text frame;
Merging module, for merging the first kind text box and the second class text frame based on non-maxima suppression.
As a further improvement of the present invention, described that several default length and width are enumerated from characteristic pattern based on Faster RCNN The first kind rectangular area of ratio, comprising:
Original image abstract characteristics are extracted based on depth convolutional network;
The candidate region of using area candidate network recommendation original image;
The accurate region of text is returned from candidate region.
As a further improvement of the present invention, the CTPN that is based on is from several predetermined widths of feature enumeration of graph, indefinite length The second class rectangle region, comprising:
Characteristic image is generated using network;
Candidate the second class rectangle region enumerated is generated using fixed width and indefinite length;
The non-textual prediction of text is carried out to the second class rectangle region.
As a further improvement of the present invention, described that the first kind text box and second are merged based on non-maxima suppression Class text frame, comprising:
Framed score is sorted, best result and its corresponding frame is chosen, is set as valid frame;
Remaining frame is traversed, if the overlapping area with current best result frame is greater than certain threshold value, deletes valid frame;
The frame for continuing to select a highest scoring from untreated frame, repeats the above process.
Compared with prior art, the invention has the benefit that
The Faster RCNN and CTPN that the present invention uses are all based on deep learning algorithm, are the spies in original image The text detection done on sign figure can use under various complex environments;
The advantages of present invention incorporates Faster RCNN: still there is higher recall rate for overlay target.And CTPN Advantage: still having preferable boundary accurate rate for longer text, has better compatibility simultaneously for the text of different scale Property.The detection content of the two is merged based on non-maxima suppression logic by certain rule, so that text detection is recalled Rate and accuracy rate are all improved.
Detailed description of the invention
Fig. 1 is the process of the method for the length text detection based on deep neural network disclosed in an embodiment of the present invention Figure;
Fig. 2 is the frame of the system of the length text detection based on deep neural network disclosed in an embodiment of the present invention Figure;
Fig. 3 is the schematic diagram of first kind text box disclosed in an embodiment of the present invention;
Fig. 4 is that RNN recirculating network disclosed in an embodiment of the present invention obtains the schematic diagram of the second class text frame.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiments of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, ordinary skill people Member's every other embodiment obtained without making creative work, shall fall within the protection scope of the present invention.
The present invention is described in further detail with reference to the accompanying drawing:
As shown in Figure 1, the method for the invention discloses a kind of length text detection based on deep neural network, comprising:
Step 1, selection original image, carry out feature extraction to original image, obtain characteristic pattern;Wherein original image can be Bill picture;
Step 2, the first kind rectangular area for enumerating several default length-width ratios from characteristic pattern based on Faster RCNN, in advance The prospect or background, the real estate for predicting first kind rectangular area for surveying first kind rectangular area, obtain all first kind texts This frame, first kind text box are as shown in Figure 3;Wherein:
The principle of Faster RCNN are as follows:
Original image abstract characteristics are extracted based on depth convolutional network;
The candidate region of using area candidate network recommendation original image;
The accurate region of text is returned from candidate region;
Step 3, based on CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle region of indefinite length, judgement Second class rectangle region is text or non-textual region;If the second class rectangle region be it is text filed, access RNN recirculating network The second class text frame is obtained, the second class text frame is as shown in Figure 4;Wherein:
The principle of CTPN are as follows:
Characteristic image is generated using network;
Candidate the second class rectangle region enumerated is generated using fixed width and indefinite length;
The non-textual prediction of text is carried out to the second class rectangle region.
Step 4 merges first kind text box and the second class text frame based on non-maxima suppression (NMS);Wherein:
Merge first kind text box and the second class text frame based on non-maxima suppression, comprising:
Framed score is sorted, best result and its corresponding frame is chosen, is set as valid frame;
Remaining frame is traversed, if the overlapping area with current best result frame is greater than certain threshold value, deletes valid frame;
The frame for continuing to select a highest scoring from untreated frame, repeats the above process.
Such as:
Assuming that object detects that 4 Boxes, each Box respectively correspond a classification Score, according to Score from it is small to Longer spread is followed successively by, (B1, S1), (B2, S2), (B3, S3), (B4, S4) .S4 > S3 > S2 > S1;
Step 1. is according to Score size, since Box B4 frame;
Step 2. calculates separately the overlapping degree IoU of B1, B2, B3 and B4, judges whether to be greater than presetting threshold value;Such as Fruit is greater than given threshold, then gives up the Box;The Box. of reservation is marked to assume that the threshold value of B3 and B4 is more than given threshold simultaneously, then Give up B3, label B4 is the Box to be retained;
The maximum B2 of Score is chosen in Step 3. B1 from remaining Boxes, B2, then calculates B2's and remaining B1 Overlapping degree IoU;If it is greater than given threshold, the Box is equally abandoned;The Box of reservation is marked simultaneously;
Above procedure is repeated, until finding whole reservation Boxes.
As shown in Fig. 2, the present invention provides a kind of system of length text detection based on deep neural network, comprising:
Extraction module carries out feature extraction to original image, obtains characteristic pattern for selecting original image;It is wherein original Picture can be bill picture;
Faster RCNN processing module, for enumerating several default length-width ratios from characteristic pattern based on Faster RCNN First kind rectangular area is predicted the prospect or background, the real estate for predicting first kind rectangular area of first kind rectangular area, is obtained To all first kind text boxes, first kind text box is as shown in Figure 3;Wherein:
The principle of Faster RCNN are as follows:
Original image abstract characteristics are extracted based on depth convolutional network;
The candidate region of using area candidate network recommendation original image;
The accurate region of text is returned from candidate region;
CTPN processing module, for based on CTPN from several predetermined widths of feature enumeration of graph, the second class square of indefinite length Shape region judges the second class rectangle region for text or non-textual region;If the second class rectangle region be it is text filed, access RNN recirculating network obtains the second class text frame, and the second class text frame is as shown in Figure 4;Wherein:
The principle of CTPN are as follows:
Characteristic image is generated using network;
Candidate the second class rectangle region enumerated is generated using fixed width and indefinite length;
The non-textual prediction of text is carried out to the second class rectangle region;
Merging module, for merging first kind text box and the second class text frame based on non-maxima suppression (NMS);Its In:
Merge first kind text box and the second class text frame based on non-maxima suppression, comprising:
Framed score is sorted, best result and its corresponding frame is chosen, is set as valid frame;
Remaining frame is traversed, if the overlapping area with current best result frame is greater than certain threshold value, deletes valid frame;
The frame for continuing to select a highest scoring from untreated frame, repeats the above process.
Such as:
Assuming that object detects that 4 Boxes, each Box respectively correspond a classification Score, according to Score from it is small to Longer spread is followed successively by, (B1, S1), (B2, S2), (B3, S3), (B4, S4) .S4 > S3 > S2 > S1;
Step 1. is according to Score size, since Box B4 frame;
Step 2. calculates separately the overlapping degree IoU of B1, B2, B3 and B4, judges whether to be greater than presetting threshold value;Such as Fruit is greater than given threshold, then gives up the Box;The Box. of reservation is marked to assume that the threshold value of B3 and B4 is more than given threshold simultaneously, then Give up B3, label B4 is the Box to be retained;
The maximum B2 of Score is chosen in Step 3. B1 from remaining Boxes, B2, then calculates B2's and remaining B1 Overlapping degree IoU;If it is greater than given threshold, the Box is equally abandoned;The Box of reservation is marked simultaneously;
Above procedure is repeated, until finding whole reservation Boxes.
Further, the merging of text filed frame can be NMS (non-maxima suppression) method there are many mode, It can be region and merging etc. be completely covered.
The invention has the benefit that
The Faster RCNN and CTPN that the present invention uses are all based on deep learning algorithm, are the spies in original image The text detection done on sign figure can use under various complex environments;The advantages of present invention incorporates Faster RCNN: right Still there is higher recall rate in overlay target.And the advantages of CTPN: still there is preferable boundary accurate for longer text Rate has better compatibility simultaneously for the text of different scale.By certain rule by the detection content of the two based on non- Maximum inhibits logic to merge, so that the recall rate and accuracy rate of text detection are all improved.
These are only the preferred embodiment of the present invention, is not intended to restrict the invention, for those skilled in the art For member, the invention may be variously modified and varied.All within the spirits and principles of the present invention, it is made it is any modification, Equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.

Claims (8)

1. a kind of method of the length text detection based on deep neural network characterized by comprising
Original image is selected, characteristic pattern is extracted to original image;
The first kind rectangular area of several default length-width ratios is enumerated from characteristic pattern based on Faster RCNN, predicts first kind square The prospect or background in shape region, the real estate for predicting first kind rectangular area, obtain all first kind text boxes;
Based on CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle region of indefinite length, the second class rectangle is judged Region is text or non-textual region;If the second class rectangle region be it is text filed, access RNN recirculating network and obtain the second class Text box;
Merge the first kind text box and the second class text frame based on non-maxima suppression.
2. the method for the length text detection based on deep neural network as described in claim 1, which is characterized in that the base The first kind rectangular area of several default length-width ratios is enumerated from characteristic pattern in Faster RCNN, comprising:
Original image abstract characteristics are extracted based on depth convolutional network;
The candidate region of using area candidate network recommendation original image;
The accurate region of text is returned from candidate region.
3. the method for the length text detection based on deep neural network as described in claim 1, which is characterized in that the base In CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle region of indefinite length, comprising:
Characteristic image is generated using network;
Candidate the second class rectangle region enumerated is generated using fixed width and indefinite length;
The non-textual prediction of text is carried out to the second class rectangle region.
4. the method for the length text detection based on deep neural network as described in claim 1, which is characterized in that the base Merge the first kind text box and the second class text frame in non-maxima suppression, comprising:
Framed score is sorted, best result and its corresponding frame is chosen, is set as valid frame;
Remaining frame is traversed, if the overlapping area with current best result frame is greater than certain threshold value, deletes valid frame;
The frame for continuing to select a highest scoring from untreated frame, repeats the above process.
5. a kind of system of the length text detection based on deep neural network characterized by comprising
Extraction module extracts characteristic pattern to original image for selecting original image;
Faster RCNN processing module, for enumerating the first of several default length-width ratios from characteristic pattern based on Faster RCNN Class rectangle region predicts the prospect or background, the real estate for predicting first kind rectangular area of first kind rectangular area, obtains institute Some first kind text boxes;
CTPN processing module, for based on CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle area of indefinite length Domain judges the second class rectangle region for text or non-textual region;If the second class rectangle region be it is text filed, access RNN Recirculating network obtains the second class text frame;
Merging module, for merging the first kind text box and the second class text frame based on non-maxima suppression.
6. the system of the length text detection based on deep neural network as claimed in claim 5, which is characterized in that the base The first kind rectangular area of several default length-width ratios is enumerated from characteristic pattern in Faster RCNN, comprising:
Original image abstract characteristics are extracted based on depth convolutional network;
The candidate region of using area candidate network recommendation original image;
The accurate region of text is returned from candidate region.
7. the system of the length text detection based on deep neural network as claimed in claim 5, which is characterized in that the base In CTPN from several predetermined widths of feature enumeration of graph, the second class rectangle region of indefinite length, comprising:
Characteristic image is generated using network;
Candidate the second class rectangle region enumerated is generated using fixed width and indefinite length;
The non-textual prediction of text is carried out to the second class rectangle region.
8. the system of the length text detection based on deep neural network as claimed in claim 5, which is characterized in that the base Merge the first kind text box and the second class text frame in non-maxima suppression, comprising:
Framed score is sorted, best result and its corresponding frame is chosen, is set as valid frame;
Remaining frame is traversed, if the overlapping area with current best result frame is greater than certain threshold value, deletes valid frame;
The frame for continuing to select a highest scoring from untreated frame, repeats the above process.
CN201811528135.XA 2018-12-13 2018-12-13 A kind of method and system of the length text detection based on deep neural network Pending CN109670495A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811528135.XA CN109670495A (en) 2018-12-13 2018-12-13 A kind of method and system of the length text detection based on deep neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811528135.XA CN109670495A (en) 2018-12-13 2018-12-13 A kind of method and system of the length text detection based on deep neural network

Publications (1)

Publication Number Publication Date
CN109670495A true CN109670495A (en) 2019-04-23

Family

ID=66143857

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811528135.XA Pending CN109670495A (en) 2018-12-13 2018-12-13 A kind of method and system of the length text detection based on deep neural network

Country Status (1)

Country Link
CN (1) CN109670495A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110738238A (en) * 2019-09-18 2020-01-31 平安科技(深圳)有限公司 certificate information classification positioning method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107346420A (en) * 2017-06-19 2017-11-14 中国科学院信息工程研究所 Text detection localization method under a kind of natural scene based on deep learning
CN108108731A (en) * 2016-11-25 2018-06-01 中移(杭州)信息技术有限公司 Method for text detection and device based on generated data
CN108520229A (en) * 2018-04-04 2018-09-11 北京旷视科技有限公司 Image detecting method, device, electronic equipment and computer-readable medium
CN108921166A (en) * 2018-06-22 2018-11-30 深源恒际科技有限公司 Medical bill class text detection recognition method and system based on deep neural network
CN108960229A (en) * 2018-04-23 2018-12-07 中国科学院信息工程研究所 One kind is towards multidirectional character detecting method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108108731A (en) * 2016-11-25 2018-06-01 中移(杭州)信息技术有限公司 Method for text detection and device based on generated data
CN107346420A (en) * 2017-06-19 2017-11-14 中国科学院信息工程研究所 Text detection localization method under a kind of natural scene based on deep learning
CN108520229A (en) * 2018-04-04 2018-09-11 北京旷视科技有限公司 Image detecting method, device, electronic equipment and computer-readable medium
CN108960229A (en) * 2018-04-23 2018-12-07 中国科学院信息工程研究所 One kind is towards multidirectional character detecting method and device
CN108921166A (en) * 2018-06-22 2018-11-30 深源恒际科技有限公司 Medical bill class text detection recognition method and system based on deep neural network

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
AIHGF: "目标检测-非极大值抑制(Non Maximum Suppression)", 《HTTPS://BLOG.CSDN.NET/ZZIAHGF/ARTICLE/DETAILS/78525279》 *
AIHGF: "目标检测-非极大值抑制(Non Maximum Suppression)", 《HTTPS://BLOG.CSDN.NET/ZZIAHGF/ARTICLE/DETAILS/78525279》, 13 November 2017 (2017-11-13), pages 1 *
JIANG YINGYING 等: "R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection", 《ARXIV》 *
JIANG YINGYING 等: "R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection", 《ARXIV》, 30 June 2017 (2017-06-30), pages 1 - 8 *
TIAN ZHI 等: "Detecting Text in Natural Image with Connectionist Text Proposal Network", 《ECCV 2016》, 12 November 2016 (2016-11-12), pages 1 - 16 *
TIAN ZHI 等: "Detecting Text in Natural Image with Connectionist Text Proposal Network", 《ECCV 2016》, pages 1 - 16 *
郑阳: "自然场景图像中文本检测技术研究", 《中国博士学位论文全文数据库 信息科技辑》 *
郑阳: "自然场景图像中文本检测技术研究", 《中国博士学位论文全文数据库 信息科技辑》, no. 02, 15 February 2018 (2018-02-15) *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110738238A (en) * 2019-09-18 2020-01-31 平安科技(深圳)有限公司 certificate information classification positioning method and device
WO2021051553A1 (en) * 2019-09-18 2021-03-25 平安科技(深圳)有限公司 Certificate information classification and positioning method and apparatus
CN110738238B (en) * 2019-09-18 2023-05-26 平安科技(深圳)有限公司 Classification positioning method and device for certificate information

Similar Documents

Publication Publication Date Title
CN107093172B (en) Character detection method and system
CN109117836B (en) Method and device for detecting and positioning characters in natural scene based on focus loss function
CN105608456B (en) A kind of multi-direction Method for text detection based on full convolutional network
CN108805116B (en) Image text detection method and system
US8315465B1 (en) Effective feature classification in images
JPH0737103A (en) Inclination angle detection device
CN101833664A (en) Video image character detecting method based on sparse expression
CN110414538A (en) Defect classification method, defect classification based training method and device thereof
KR20070045445A (en) Method for discriminating a obscene video using visual features and apparatus thereof
CN110503054A (en) The processing method and processing device of text image
KR101177626B1 (en) Object checking apparatus and method
CN103093185B (en) Character recognition device, image processing apparatus and method thereof
CN109741551A (en) A kind of commodity identification settlement method, apparatus and system
CN109035256A (en) User interface image cutting method, device, server and storage medium
CN106778717A (en) A kind of test and appraisal table recognition methods based on image recognition and k nearest neighbor
CN111382766A (en) Equipment fault detection method based on fast R-CNN
CN114821620A (en) Text content extraction and identification method based on longitudinal combination of line text boxes
CN108647703A (en) A kind of type judgement method of the classification image library based on conspicuousness
Hu et al. Generalized image recognition algorithm for sign inventory
JP3471578B2 (en) Line direction determining device, image tilt detecting device, and image tilt correcting device
Giri Text information extraction and analysis from images using digital image processing techniques
Fleuret et al. Fast face detection with precise pose estimation
CN114359917A (en) Handwritten Chinese character detection and recognition and font evaluation method
CN109670495A (en) A kind of method and system of the length text detection based on deep neural network
Ovodov Optical braille recognition using object detection neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination