JP6000899B2 - Method for automatically detecting text - Google Patents
Method for automatically detecting text
- Publication number
- JP6000899B2 (application JP2013111587A)
- Authority
- JP
- Japan
- Prior art keywords
- text
- closed
- closed curve
- edge
- curves
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/13—Edge detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/181—Segmentation; Edge detection involving edge growing; involving edge linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
- G06V20/63—Scene text, e.g. street names
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/1801—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
- G06V30/18076—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10004—Still image; Photographic image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
- Character Input (AREA)
Description
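where C1 and C2 denote any two closed curves/characters, and G1,1, G1,2 and G2,1, G2,2 are the estimated background mode and text mode, respectively, for these two characters. A link between two characters is retained if this distance is shorter than the threshold Tc, which was set to 2 by trial and error.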
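The color test described above can be sketched as follows. The distance formula itself is not reproduced in this excerpt, so the metric used here (the larger of the Euclidean distances between the two background modes and between the two text modes) is an assumption made purely for illustration, as are the function names and the representation of a mode as a tuple of color components. Only the threshold Tc = 2 comes from the text; the color scale on which that value is meaningful is not stated.

```python
import math

# Threshold from the description (chosen by trial and error in the source).
T_C = 2.0


def mode_distance(bg1, txt1, bg2, txt2):
    """Distance between the color modes of two closed curves.

    ASSUMPTION: the source formula is not shown in this excerpt. This
    hypothetical metric takes the larger of the background-to-background
    and text-to-text Euclidean distances.
    """
    return max(math.dist(bg1, bg2), math.dist(txt1, txt2))


def keep_link(bg1, txt1, bg2, txt2, threshold=T_C):
    """Keep the link only if the mode distance is below the threshold."""
    return mode_distance(bg1, txt1, bg2, txt2) < threshold
```

With this metric a link survives only when both the background colors and the text colors of the two characters agree, which matches the intent stated in the color constraint of the claims.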
Claims (5)
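- A computer-implemented method for automatically detecting text in an electronic image of a natural scene, the method comprising:
receiving an electronic image for analysis;
performing an edge detection algorithm on the electronic image;
identifying closed curves in the electronic image according to the detected edges;
establishing links between closed elements;
identifying candidate text lines according to the identified closed curves;
classifying the candidate text lines as text regions or non-text regions; and
outputting the text regions in the electronic image to a user via a graphical user interface,
wherein identifying the candidate text lines further comprises:
a first step of selecting a link for consideration;
a second step of fitting a straight line connecting the respective centers of a first closed curve and a second closed curve connected by the link;
a third step of identifying, for each of the first closed curve and the second closed curve, all associated links other than the selected link, and selecting a third closed curve attached to one of the associated links;
a fourth step of refitting, by including the third closed curve, the fitted line connecting the first closed curve, the second closed curve, and the third closed curve; and
a fifth step of repeating the first through fourth steps until all closed curves whose centers lie at a distance shorter than a predetermined threshold Tf have been added to the candidate text line.
- The method for automatically detecting text according to claim 1, wherein identifying the closed curves further comprises:
identifying open tips of potential closed curves;
connecting every pair of open tips separated by a distance shorter than a threshold Topen to form an edge;
eroding all remaining open tips until no open tips remain in the potential closed curves; and
outputting one or more closed curves.
- The method for automatically detecting text according to claim 2, further comprising:
detecting one or more erroneously connected closed curves; and
disconnecting the one or more erroneously connected closed curves.
- The method for automatically detecting text according to claim 3, wherein disconnecting the closed curves further comprises:
running a connected-component algorithm to distinguish edge pixels adjacent to a closed element from edge pixels separating two background pixel regions; and
removing edge pixels that separate the two background pixel regions and are not adjacent to the closed element.
- The method for automatically detecting text according to claim 1, wherein establishing links between the closed elements further comprises:
determining whether the distance between the centers of two closed curves is shorter than a first threshold Td;
applying a second threshold to the height ratio between the two closed curves; and
applying a constraint on pixel color such that the background pixels of adjacent closed curves are similar to each other and the text pixels are similar to each other.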
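The five-step candidate text line procedure of claim 1 can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the patented implementation: the claim does not say how the line is fitted, so a total-least-squares fit is assumed; the threshold Tf is assumed to apply to the perpendicular distance from a curve's center to the fitted line; and the sketch follows links from any curve already on the line rather than only from the first two. All names (`fit_line`, `grow_text_line`, `centers`, `links`) are hypothetical.

```python
import math


def fit_line(points):
    """Total-least-squares line: returns (centroid, unit direction vector)."""
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    sxx = sum((p[0] - cx) ** 2 for p in points)
    syy = sum((p[1] - cy) ** 2 for p in points)
    sxy = sum((p[0] - cx) * (p[1] - cy) for p in points)
    # Angle of the principal axis of the 2x2 scatter matrix.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    return (cx, cy), (math.cos(theta), math.sin(theta))


def point_line_distance(p, line):
    """Perpendicular distance from point p to a (centroid, direction) line."""
    (cx, cy), (dx, dy) = line
    return abs((p[0] - cx) * dy - (p[1] - cy) * dx)


def grow_text_line(centers, links, seed, t_f):
    """Grow a candidate text line from a seed link.

    centers: dict mapping curve id -> (x, y) center
    links:   list of (id_a, id_b) pairs between closed curves
    seed:    the link selected for consideration (step 1)
    t_f:     ASSUMED to bound the center-to-line distance
    """
    members = list(seed)
    line = fit_line([centers[m] for m in members])      # step 2
    changed = True
    while changed:                                      # step 5
        changed = False
        for a, b in links:                              # step 3
            for inside, outside in ((a, b), (b, a)):
                if inside in members and outside not in members:
                    if point_line_distance(centers[outside], line) < t_f:
                        members.append(outside)
                        # step 4: refit with the new curve included
                        line = fit_line([centers[m] for m in members])
                        changed = True
    return members, line
```

For example, with centers lying roughly on a horizontal line plus one linked outlier far above it, the outlier is left out because its center is farther than `t_f` from the fitted line. The result can depend on the order in which links are visited, since the line is refitted after every addition.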
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/494,173 US8837830B2 (en) | 2012-06-12 | 2012-06-12 | Finding text in natural scenes |
US13/494,173 | 2012-06-12 |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2013257866A (ja) | 2013-12-26 |
JP2013257866A5 (ja) | 2016-07-07 |
JP6000899B2 (ja) | 2016-10-05 |
Family ID: 49626053
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013111587A (Expired - Fee Related; published as JP6000899B2 (ja)) | 2012-06-12 | 2013-05-28 | Method for automatically detecting text |
Country Status (3)
Country | Link |
---|---|
US (1) | US8837830B2 (ja) |
JP (1) | JP6000899B2 (ja) |
DE (1) | DE102013210375A1 (ja) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104794479B (zh) * | 2014-01-20 | 2018-06-29 | Peking University | Method for detecting text in natural scene images based on local stroke width transform |
US9754171B1 (en) | 2014-06-27 | 2017-09-05 | Blinker, Inc. | Method and apparatus for receiving vehicle information from an image and posting the vehicle information to a website |
US9558419B1 (en) | 2014-06-27 | 2017-01-31 | Blinker, Inc. | Method and apparatus for receiving a location of a vehicle service center from an image |
US10733471B1 (en) | 2014-06-27 | 2020-08-04 | Blinker, Inc. | Method and apparatus for receiving recall information from an image |
US9779318B1 (en) | 2014-06-27 | 2017-10-03 | Blinker, Inc. | Method and apparatus for verifying vehicle ownership from an image |
US10540564B2 (en) | 2014-06-27 | 2020-01-21 | Blinker, Inc. | Method and apparatus for identifying vehicle information from an image |
US9563814B1 (en) | 2014-06-27 | 2017-02-07 | Blinker, Inc. | Method and apparatus for recovering a vehicle identification number from an image |
US9600733B1 (en) | 2014-06-27 | 2017-03-21 | Blinker, Inc. | Method and apparatus for receiving car parts data from an image |
US9607236B1 (en) | 2014-06-27 | 2017-03-28 | Blinker, Inc. | Method and apparatus for providing loan verification from an image |
US9594971B1 (en) | 2014-06-27 | 2017-03-14 | Blinker, Inc. | Method and apparatus for receiving listings of similar vehicles from an image |
US10579892B1 (en) | 2014-06-27 | 2020-03-03 | Blinker, Inc. | Method and apparatus for recovering license plate information from an image |
US9892337B1 (en) | 2014-06-27 | 2018-02-13 | Blinker, Inc. | Method and apparatus for receiving a refinancing offer from an image |
US9589202B1 (en) | 2014-06-27 | 2017-03-07 | Blinker, Inc. | Method and apparatus for receiving an insurance quote from an image |
US10572758B1 (en) | 2014-06-27 | 2020-02-25 | Blinker, Inc. | Method and apparatus for receiving a financing offer from an image |
US10867327B1 (en) | 2014-06-27 | 2020-12-15 | Blinker, Inc. | System and method for electronic processing of vehicle transactions based on image detection of vehicle license plate |
US9773184B1 (en) | 2014-06-27 | 2017-09-26 | Blinker, Inc. | Method and apparatus for receiving a broadcast radio service offer from an image |
US9818154B1 (en) | 2014-06-27 | 2017-11-14 | Blinker, Inc. | System and method for electronic processing of vehicle transactions based on image detection of vehicle license plate |
US9760776B1 (en) | 2014-06-27 | 2017-09-12 | Blinker, Inc. | Method and apparatus for obtaining a vehicle history report from an image |
US9589201B1 (en) | 2014-06-27 | 2017-03-07 | Blinker, Inc. | Method and apparatus for recovering a vehicle value from an image |
US10515285B2 (en) | 2014-06-27 | 2019-12-24 | Blinker, Inc. | Method and apparatus for blocking information from an image |
US9811754B2 (en) * | 2014-12-10 | 2017-11-07 | Ricoh Co., Ltd. | Realogram scene analysis of images: shelf and label finding |
CN106156766B (zh) * | 2015-03-25 | 2020-02-18 | Alibaba Group Holding Ltd | Method and device for generating a text line classifier |
US9464914B1 (en) | 2015-09-01 | 2016-10-11 | International Business Machines Corporation | Landmark navigation |
CN108242059B (zh) * | 2016-12-26 | 2021-03-12 | Shenzhen Yihua Computer Co., Ltd. | Image boundary finding method and device |
US10685225B2 (en) | 2017-12-29 | 2020-06-16 | Wipro Limited | Method and system for detecting text in digital engineering drawings |
EP3738098A4 (en) * | 2018-08-21 | 2021-05-12 | Huawei Technologies Co., Ltd. | INPAINTING BASED ON BINARIZATION AND STANDARDIZATION TO REMOVE TEXT |
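CN109344824B (zh) * | 2018-09-21 | 2022-06-10 | Taikang Insurance Group Co., Ltd. | Text line region detection method, device, medium, and electronic equipment |
CN111914830A (zh) * | 2019-05-07 | 2020-11-10 | Alibaba Group Holding Ltd | Method, device, equipment, and system for locating text lines in an image |
CN111027560B (zh) * | 2019-11-07 | 2023-09-29 | Zhejiang Dahua Technology Co., Ltd. | Text detection method and related device |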
US11721119B2 (en) * | 2020-12-18 | 2023-08-08 | Konica Minolta Business Solutions U.S.A., Inc. | Finding natural images in document pages |
CN113516114B (zh) * | 2021-05-19 | 2023-09-29 | Xi'an University of Architecture and Technology | Natural scene text detection method, device, and medium |
US20230409469A1 (en) * | 2022-06-17 | 2023-12-21 | Verizon Patent And Licensing Inc. | System and method for user interface testing |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4791675A (en) * | 1985-12-31 | 1988-12-13 | Schlumberger Systems And Services, Inc. | VSP Connectivity pattern recognition system |
EP0385009A1 (en) * | 1989-03-03 | 1990-09-05 | Hewlett-Packard Limited | Apparatus and method for use in image processing |
US5350303A (en) * | 1991-10-24 | 1994-09-27 | At&T Bell Laboratories | Method for accessing information in a computer |
JP3278471B2 (ja) * | 1991-11-29 | 2002-04-30 | Ricoh Co Ltd | Region segmentation method |
JPH0728951A (ja) * | 1993-07-13 | 1995-01-31 | Ricoh Co Ltd | Online character and figure recognition device |
JPH10261047A (ja) * | 1997-03-19 | 1998-09-29 | Fujitsu Ltd | Character recognition device |
US6233364B1 (en) * | 1998-09-18 | 2001-05-15 | Dainippon Screen Engineering Of America Incorporated | Method and system for detecting and tagging dust and scratches in a digital image |
JP3913985B2 (ja) * | 1999-04-14 | 2007-05-09 | Fujitsu Ltd | Apparatus and method for extracting character strings based on basic components in a document image |
JP2000298725A (ja) * | 1999-04-15 | 2000-10-24 | Nec Corp | Text data detection device and method |
US6909805B2 (en) * | 2001-01-31 | 2005-06-21 | Matsushita Electric Industrial Co., Ltd. | Detecting and utilizing add-on information from a scanned document image |
US20030095113A1 (en) * | 2001-11-21 | 2003-05-22 | Yue Ma | Index and retrieval system and method for scanned notes from whiteboard |
US20030198386A1 (en) * | 2002-04-19 | 2003-10-23 | Huitao Luo | System and method for identifying and extracting character strings from captured image data |
JP4583218B2 (ja) * | 2004-07-05 | 2010-11-17 | International Business Machines Corporation | Method, computer program, and system for evaluating target content |
WO2007028166A2 (en) * | 2005-09-02 | 2007-03-08 | Blindsight, Inc. | A system and method for detecting text in real-world color images |
US8031940B2 (en) * | 2006-06-29 | 2011-10-04 | Google Inc. | Recognizing text in images using ranging data |
US8155437B2 (en) * | 2007-09-07 | 2012-04-10 | CVISION Technologies, Inc. | Perceptually lossless color compression |
CN101436248B (zh) * | 2007-11-14 | 2012-10-24 | Canon Kabushiki Kaisha | Method and apparatus for generating text character strings from an image |
US8098891B2 (en) * | 2007-11-29 | 2012-01-17 | Nec Laboratories America, Inc. | Efficient multi-hypothesis multi-human 3D tracking in crowded scenes |
CN101470806B (zh) * | 2007-12-27 | 2012-06-27 | Neusoft Group Co., Ltd. | Vehicle light detection method and device, and region-of-interest segmentation method and device |
US8917935B2 (en) * | 2008-05-19 | 2014-12-23 | Microsoft Corporation | Detecting text using stroke width based text detection |
US8351691B2 (en) * | 2008-12-18 | 2013-01-08 | Canon Kabushiki Kaisha | Object extraction in colour compound documents |
AU2009201252B2 (en) * | 2009-03-31 | 2011-06-02 | Canon Kabushiki Kaisha | Colour correcting foreground colours for visual quality improvement |
US8244070B2 (en) * | 2009-06-01 | 2012-08-14 | Xerox Corporation | Real-time image personalization |
US8175617B2 (en) * | 2009-10-28 | 2012-05-08 | Digimarc Corporation | Sensor-based mobile search, related methods and systems |
US8233668B2 (en) * | 2010-08-09 | 2012-07-31 | John Bean Technologies Corporation | Distinguishing abutting food product |
US20120045132A1 (en) * | 2010-08-23 | 2012-02-23 | Sony Corporation | Method and apparatus for localizing an object within an image |
US8610712B2 (en) * | 2011-03-01 | 2013-12-17 | Adobe Systems Incorporated | Object selection in stereo image pairs |
- 2012-06-12: US application US13/494,173, patent US8837830B2 (en), status: Active
- 2013-05-28: JP application JP2013111587A, patent JP6000899B2 (ja), status: Expired - Fee Related
- 2013-06-04: DE application DE102013210375A, patent DE102013210375A1 (de), status: Pending
Also Published As
Publication number | Publication date |
---|---|
JP2013257866A (ja) | 2013-12-26 |
US20130330004A1 (en) | 2013-12-12 |
US8837830B2 (en) | 2014-09-16 |
DE102013210375A1 (de) | 2013-12-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6000899B2 (ja) | Method for automatically detecting text | |
US10872239B2 (en) | Entrance detection from street-level imagery | |
US11302109B2 (en) | Range and/or polarity-based thresholding for improved data extraction | |
AU2020319589B2 (en) | Region proposal networks for automated bounding box detection and text segmentation | |
CN109196514B (zh) | Image classification and labeling | |
US7236632B2 (en) | Automated techniques for comparing contents of images | |
CN104298982B (zh) | Character recognition method and device | |
US20150161465A1 (en) | Text recognition for textually sparse images | |
CA3129608C (en) | Region proposal networks for automated bounding box detection and text segmentation | |
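CN104182722B (zh) | Text detection method and device, and text information extraction method and system | |
JP6882362B2 (ja) | System and method for identifying images containing identity documents | |
JP2009169518A (ja) | Region identification device and content identification device | |
JP5679229B2 (ja) | Image processing device, image processing method, and program | |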
US20200089817A1 (en) | Composition Engine for Analytical Models | |
KR20230030907A (ko) | Fake video detection method and apparatus for performing the same | |
Viitaniemi et al. | Detecting hand-head occlusions in sign language video | |
JP2016151978A (ja) | Image processing device and image processing program | |
Lakshminarasimha et al. | Data augmentation based face anti-spoofing (FAS) scheme using deep learning techniques | |
Antunes | OMECO: Generating personalized business card designs from images | |
Nasim et al. | Dark Channel Prior (DCP) based Bangla car plate detection and recognition in foggy weather | |
Kang et al. | Head pose estimation using random forest and texture analysis | |
Patel et al. | Emotions reflecting chat application | |
Malhotra et al. | Automated Puzzle Solver Using Image Processing | |
CN112270377A (zh) | Target image extraction method, neural network training method and device | |
Rajalingam | Text Segmentation and Recognition for Enhanced Image Spam Detection |
Legal Events
Code | Title | Description |
---|---|---|
A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20160524 |
A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20160524 |
A871 | Explanation of circumstances concerning accelerated examination | Free format text: JAPANESE INTERMEDIATE CODE: A871; Effective date: 20160524 |
A975 | Report on accelerated examination | Free format text: JAPANESE INTERMEDIATE CODE: A971005; Effective date: 20160817 |
A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007; Effective date: 20160818 |
TRDD | Decision of grant or rejection written | |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 20160823 |
A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 20160831 |
R150 | Certificate of patent or registration of utility model | Ref document number: 6000899; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |
S111 | Request for change of ownership or part of ownership | Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
R350 | Written notification of registration of transfer | Free format text: JAPANESE INTERMEDIATE CODE: R350 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
LAPS | Cancellation because of no payment of annual fees | |