CN107748888B - 一种图像文本行检测方法及装置 - Google Patents
一种图像文本行检测方法及装置 Download PDFInfo
- Publication number
- CN107748888B CN107748888B CN201710953107.1A CN201710953107A CN107748888B CN 107748888 B CN107748888 B CN 107748888B CN 201710953107 A CN201710953107 A CN 201710953107A CN 107748888 B CN107748888 B CN 107748888B
- Authority
- CN
- China
- Prior art keywords
- connected domain
- rectangle frame
- image
- text
- standard
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 47
- 238000001914 filtration Methods 0.000 claims abstract description 72
- 238000006116 polymerization reaction Methods 0.000 claims abstract description 60
- 238000012545 processing Methods 0.000 claims abstract description 51
- 238000000034 method Methods 0.000 claims abstract description 37
- 230000002159 abnormal effect Effects 0.000 claims description 17
- 230000008569 process Effects 0.000 claims description 9
- 238000004458 analytical method Methods 0.000 claims description 7
- 238000004891 communication Methods 0.000 claims description 6
- 239000006185 dispersion Substances 0.000 claims description 6
- 238000007781 pre-processing Methods 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000012360 testing method Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- 241000208340 Araliaceae Species 0.000 description 1
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 235000008434 ginseng Nutrition 0.000 description 1
- 238000010191 image analysis Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/158—Segmentation of character regions using character size, text spacings or pitch estimation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/457—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by analysing connectivity, e.g. edge linking, connected component analysis or slices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/762—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/153—Segmentation of character regions using recognition of characters or words
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- General Engineering & Computer Science (AREA)
- Image Analysis (AREA)
- Character Input (AREA)
Abstract
Description
Claims (12)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710953107.1A CN107748888B (zh) | 2017-10-13 | 2017-10-13 | 一种图像文本行检测方法及装置 |
CN201880002337.2A CN109874313A (zh) | 2017-10-13 | 2018-10-12 | 文本行检测方法及文本行检测装置 |
PCT/CN2018/110004 WO2019072233A1 (zh) | 2017-10-13 | 2018-10-12 | 文本行检测方法及文本行检测装置 |
US16/513,883 US20190340460A1 (en) | 2017-10-13 | 2019-07-17 | Text line detecting method and text line detecting device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710953107.1A CN107748888B (zh) | 2017-10-13 | 2017-10-13 | 一种图像文本行检测方法及装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107748888A CN107748888A (zh) | 2018-03-02 |
CN107748888B true CN107748888B (zh) | 2019-11-08 |
Family
ID=61253742
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710953107.1A Active CN107748888B (zh) | 2017-10-13 | 2017-10-13 | 一种图像文本行检测方法及装置 |
CN201880002337.2A Pending CN109874313A (zh) | 2017-10-13 | 2018-10-12 | 文本行检测方法及文本行检测装置 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880002337.2A Pending CN109874313A (zh) | 2017-10-13 | 2018-10-12 | 文本行检测方法及文本行检测装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20190340460A1 (zh) |
CN (2) | CN107748888B (zh) |
WO (1) | WO2019072233A1 (zh) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107748888B (zh) * | 2017-10-13 | 2019-11-08 | 众安信息技术服务有限公司 | 一种图像文本行检测方法及装置 |
JP2019159633A (ja) * | 2018-03-12 | 2019-09-19 | セイコーエプソン株式会社 | 画像処理装置、画像処理方法および画像処理プログラム |
CN110660067A (zh) * | 2018-06-28 | 2020-01-07 | 杭州海康威视数字技术股份有限公司 | 一种目标检测方法及其装置 |
CN109325169A (zh) * | 2018-07-25 | 2019-02-12 | 北京奔流网络信息技术有限公司 | 一种版权图片过滤方法和装置 |
CN109697414B (zh) * | 2018-12-13 | 2021-06-18 | 北京金山数字娱乐科技有限公司 | 一种文本定位方法及装置 |
CN109657629B (zh) * | 2018-12-24 | 2021-12-07 | 科大讯飞股份有限公司 | 一种文本行提取方法及装置 |
CN109871743B (zh) * | 2018-12-29 | 2021-01-12 | 口碑(上海)信息技术有限公司 | 文本数据的定位方法及装置、存储介质、终端 |
CN109993161B (zh) * | 2019-02-25 | 2021-08-03 | 众安信息技术服务有限公司 | 一种文本图像旋转矫正方法及系统 |
CN110414529A (zh) * | 2019-06-26 | 2019-11-05 | 深圳中兴网信科技有限公司 | 试卷信息提取方法、系统及计算机可读存储介质 |
CN110414505A (zh) * | 2019-06-27 | 2019-11-05 | 深圳中兴网信科技有限公司 | 图像的处理方法、处理系统及计算机可读存储介质 |
CN110598566A (zh) * | 2019-08-16 | 2019-12-20 | 深圳中兴网信科技有限公司 | 图像处理方法、装置、终端和计算机可读存储介质 |
CN110826561A (zh) * | 2019-11-11 | 2020-02-21 | 上海眼控科技股份有限公司 | 车辆文本识别方法、装置和计算机设备 |
CN111126266B (zh) * | 2019-12-24 | 2023-05-05 | 上海智臻智能网络科技股份有限公司 | 文本处理方法、文本处理系统、设备及介质 |
CN111144342B (zh) * | 2019-12-30 | 2023-04-18 | 福建天晴数码有限公司 | 页面内容识别系统 |
CN111259764A (zh) * | 2020-01-10 | 2020-06-09 | 中国科学技术大学 | 文本检测方法、装置、电子设备及存储装置 |
JP2021149439A (ja) * | 2020-03-18 | 2021-09-27 | 富士フイルムビジネスイノベーション株式会社 | 情報処理装置及び情報処理プログラム |
CN111444904A (zh) * | 2020-03-23 | 2020-07-24 | Oppo广东移动通信有限公司 | 内容识别方法、装置以及电子设备 |
CN113538450B (zh) * | 2020-04-21 | 2023-07-21 | 百度在线网络技术(北京)有限公司 | 用于生成图像的方法及装置 |
CN111738326B (zh) * | 2020-06-16 | 2023-07-11 | 中国工商银行股份有限公司 | 句粒度标注训练样本生成方法及装置 |
CN112183307A (zh) * | 2020-09-25 | 2021-01-05 | 上海眼控科技股份有限公司 | 文本识别方法、计算机设备和存储介质 |
CN117409428B (zh) * | 2023-12-13 | 2024-03-01 | 南昌理工学院 | 一种试卷信息处理方法、系统、计算机设备及存储介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8036461B2 (en) * | 2003-06-24 | 2011-10-11 | Abbyy Software Limited | Method of graphical objects recognition using the integrity principle |
CN102930262A (zh) * | 2012-09-19 | 2013-02-13 | 北京百度网讯科技有限公司 | 一种从图像中提取文字行的方法及装置 |
CN104182750A (zh) * | 2014-07-14 | 2014-12-03 | 上海交通大学 | 一种在自然场景图像中基于极值连通域的中文检测方法 |
CN105095890A (zh) * | 2014-04-25 | 2015-11-25 | 广州市动景计算机科技有限公司 | 图像中字符分割方法及装置 |
CN107180239A (zh) * | 2017-06-09 | 2017-09-19 | 科大讯飞股份有限公司 | 文本行识别方法及系统 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8224114B2 (en) * | 2008-09-05 | 2012-07-17 | The Neat Company, Inc. | Method and apparatus for despeckling an image |
US8144986B2 (en) * | 2008-09-05 | 2012-03-27 | The Neat Company, Inc. | Method and apparatus for binarization threshold calculation |
CN104751142B (zh) * | 2015-04-01 | 2018-04-27 | 电子科技大学 | 一种基于笔划特征的自然场景文本检测方法 |
CN107145883A (zh) * | 2016-03-01 | 2017-09-08 | 夏普株式会社 | 文本检测方法和设备 |
CN107229932B (zh) * | 2016-03-25 | 2021-05-28 | 阿里巴巴集团控股有限公司 | 一种图像文本的识别方法和装置 |
CN107748888B (zh) * | 2017-10-13 | 2019-11-08 | 众安信息技术服务有限公司 | 一种图像文本行检测方法及装置 |
-
2017
- 2017-10-13 CN CN201710953107.1A patent/CN107748888B/zh active Active
-
2018
- 2018-10-12 WO PCT/CN2018/110004 patent/WO2019072233A1/zh active Application Filing
- 2018-10-12 CN CN201880002337.2A patent/CN109874313A/zh active Pending
-
2019
- 2019-07-17 US US16/513,883 patent/US20190340460A1/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8036461B2 (en) * | 2003-06-24 | 2011-10-11 | Abbyy Software Limited | Method of graphical objects recognition using the integrity principle |
CN102930262A (zh) * | 2012-09-19 | 2013-02-13 | 北京百度网讯科技有限公司 | 一种从图像中提取文字行的方法及装置 |
CN105095890A (zh) * | 2014-04-25 | 2015-11-25 | 广州市动景计算机科技有限公司 | 图像中字符分割方法及装置 |
CN104182750A (zh) * | 2014-07-14 | 2014-12-03 | 上海交通大学 | 一种在自然场景图像中基于极值连通域的中文检测方法 |
CN107180239A (zh) * | 2017-06-09 | 2017-09-19 | 科大讯飞股份有限公司 | 文本行识别方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
US20190340460A1 (en) | 2019-11-07 |
WO2019072233A1 (zh) | 2019-04-18 |
CN109874313A (zh) | 2019-06-11 |
CN107748888A (zh) | 2018-03-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107748888B (zh) | 一种图像文本行检测方法及装置 | |
CN104182750B (zh) | 一种在自然场景图像中基于极值连通域的中文检测方法 | |
CN101777124A (zh) | 一种提取视频文本信息的方法及装置 | |
CN101510258B (zh) | 一种证件验证方法、系统及一种证件验证终端 | |
CN104778470B (zh) | 基于组件树和霍夫森林的文字检测和识别方法 | |
CN104820986B (zh) | 一种基于机器视觉的线缆在线检测方法 | |
Sulaiman et al. | Development of automatic vehicle plate detection system | |
CN113083804A (zh) | 激光智能除锈方法、系统及可存读介质 | |
CN103310211A (zh) | 一种基于图像处理的填注标记识别方法 | |
CN109086772A (zh) | 一种扭曲粘连字符图片验证码的识别方法及系统 | |
CN103295009A (zh) | 基于笔画分解的车牌字符识别方法 | |
CN110942063B (zh) | 证件文字信息获取方法、装置以及电子设备 | |
Yingthawornsuk et al. | Automatic Thai Coin Calculation System by Using SIFT | |
CN104834891A (zh) | 一种中文图像型垃圾邮件过滤方法及系统 | |
CN106650696A (zh) | 一种基于奇异值分解的手写电气元件符号识别方法 | |
Wu et al. | Contour restoration of text components for recognition in video/scene images | |
CN111767909B (zh) | 一种字符识别方法、设备及计算机可读存储介质 | |
Xue | Optical character recognition | |
Karanje et al. | Survey on text detection, segmentation and recognition from a natural scene images | |
CN105069455A (zh) | 一种发票公章过滤的方法及装置 | |
Romic et al. | Character recognition based on region pixel concentration for license plate identification | |
Deb et al. | Statistical characteristics in HSI color model and position histogram based vehicle license plate detection | |
CN114926635A (zh) | 与深度学习方法相结合的多焦图像中目标分割方法 | |
CN114332983A (zh) | 人脸图像清晰度检测方法、装置、电子设备、及介质 | |
Gopalan et al. | Statistical modeling for the detection, localization and extraction of text from heterogeneous textual images using combined feature scheme |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240306 Address after: Room 1179, W Zone, 11th Floor, Building 1, No. 158 Shuanglian Road, Qingpu District, Shanghai, 201702 Patentee after: Shanghai Zhongan Information Technology Service Co.,Ltd. Country or region after: China Address before: 518000 Room 201, building A, No. 1, Qian Wan Road, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong (Shenzhen Qianhai business secretary Co., Ltd.) Patentee before: ZHONGAN INFORMATION TECHNOLOGY SERVICE Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240415 Address after: Room 1179, W Zone, 11th Floor, Building 1, No. 158 Shuanglian Road, Qingpu District, Shanghai, 201702 Patentee after: Shanghai Zhongan Information Technology Service Co.,Ltd. Country or region after: China Address before: 518000 Room 201, building A, No. 1, Qian Wan Road, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong (Shenzhen Qianhai business secretary Co., Ltd.) Patentee before: ZHONGAN INFORMATION TECHNOLOGY SERVICE Co.,Ltd. Country or region before: China |