WO2012075817A1 - 一种文本图像的切边方法 - Google Patents
一种文本图像的切边方法 Download PDFInfo
- Publication number
- WO2012075817A1 WO2012075817A1 PCT/CN2011/077914 CN2011077914W WO2012075817A1 WO 2012075817 A1 WO2012075817 A1 WO 2012075817A1 CN 2011077914 W CN2011077914 W CN 2011077914W WO 2012075817 A1 WO2012075817 A1 WO 2012075817A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- text
- quadrilateral
- trimming
- image data
- image
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 48
- 238000009966 trimming Methods 0.000 title claims abstract description 43
- 238000001514 detection method Methods 0.000 claims abstract description 15
- 230000000694 effects Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000003909 pattern recognition Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 231100000241 scar Toxicity 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/16—Image preprocessing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/24—Aligning, centring, orientation detection or correction of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
Definitions
- the invention belongs to the technical field of image processing, and relates to an image processing method, and in particular to a method for trimming a text image. Background technique
- digital imaging devices such as digital cameras have become more and more widely used.
- digital cameras have become standard devices for mobile devices such as mobile phones.
- These digital imaging devices are often used to capture various text images, which are images containing one or more lines of printed text, such as business cards, menus, tickets, and the like.
- Camera shooting is different from the image scanned by the scanner. The difference is that the image captured by the camera is not as clear as the image scanned by the scanner.
- the image captured by the camera contains the projection deformation of the captured object such as the business card, so that the business card in the image is viewed. It is not as good as the one that was scanned, and the image taken by the camera contains the background of the outside of the object being photographed. Due to differences in shooting habits, framing needs, etc., the text image captured by the user generally contains the background, and the image desired by the user usually removes the image of the background area, just like the image scanned by the scanner.
- the business card processing software mobiReader of Apple's app s tore when shooting a business card, requires the user to shoot the business card just in the preview frame by specifying a preview box. Therefore, the image obtained by the shooting has only the area of the business card, and the effect of cutting the business card is achieved. This is a method of manual trimming. Because the shape of the image of the text, such as business cards, menus, and tickets, is generally a quadrilateral with obvious edges, many software will detect the text area by the line detection method in image processing, such as Apple's app s tore software genius Scar.
- this detection method often treats the line inside the text as the edge of the text. For such a picture containing text, it is still possible to assist in finding the area of the text image according to some specific clues, improve the accuracy of detecting the text area, and ensure the effect of trimming.
- the technical problem to be solved by the present invention is: providing a trimming method for a text image, which can automatically detect the edge of the text region, and use the detected text region to verify and correct the text edge by using the detected text region, The accuracy of detecting the text area is improved, thereby removing irrelevant redundant information, so that the image that the user sees when viewing the image data is a part that has a text area and is valuable to the user.
- the present invention uses the following technical solutions:
- Step 110 Acquire text image data
- Step 120 detecting a straight line of the text image by using a line detection method to obtain an edge of the trimmed quadrilateral;
- Step 130 Perform text detection on the image data to obtain coordinates of a boundary point of the text area.
- Step 140 Obtain a final trimming result according to the results of step 120 and step 130; Step 150, end.
- the step 140 specifically includes the following steps:
- Step 210 Form all the trimmed quads obtained in step 120 into a candidate quadrilateral set; Step 220, if the candidate quadrilateral set is empty, proceed to step 260, otherwise proceed to step 230; Step 230, remove the next candidate from the quadrilateral candidate set Quadrilateral
- Step 240 determining whether all boundary points of the text area are in the candidate quadrilateral, if yes, the text area is a valid trimming quadrilateral to step 250, if no, go to step 220;
- Step 250 returning a valid trimming quadrilateral
- Step 260 returning the entire image to the user for manually determining the trimming quadrilateral.
- the method for acquiring image data in the step 110 includes one of the following three types:
- One method is to take a picture with a digital camera
- the second method is image data obtained by a scanner;
- the third method is to open a pre-existing file containing image data, read the data in the file and decompress the image data according to a standard algorithm.
- the device requires general arithmetic and storage devices, including a CPU of a certain frequency (central processing unit), a memory for computing, and a system software for storage, application software. And storage space for various data, etc. If you need to take image data through digital camera shooting or scanner scanning, you also need a digital camera or scanner.
- the invention automatically detects the edge of the text area for the text image data that the user needs to save or browse, and uses the detected method to verify and correct the edge of the text by using the detected text area, thereby greatly improving the detected text area.
- the accuracy The beneficial effects of the present invention are as follows:
- the method for trimming a text image proposed by the present invention uses a technique in the field of image processing, pattern recognition, etc., to determine whether a trimmed quadrilateral contains a text region when trimming a text image. Thereby obtaining an accurate text area.
- the method can obtain an accurate trimming area from the candidate quadrilateral of the trimming edge, and reduce the trouble of the user manually cutting the area and losing the data due to the trimming.
- FIG. 1 is a flow chart of a text image trimming method according to the present invention.
- FIG. 2 is a flow chart showing the steps of determining whether a trimmed quadrilateral is effective
- Figure 3 is a schematic diagram of the effect of the original text image
- FIG. 4 is a schematic diagram showing the effect of a text image processed by the text image trimming method of the present invention. detailed description
- the present invention discloses a method for trimming a text image, and the method includes the following steps: [Step 110] Acquire text image data.
- the method of obtaining image data includes one of the following three types:
- One method is to take a picture with a digital camera
- the second method is image data obtained by a scanner
- the third method is to open a pre-existing file containing image data, read the data in the file and decompress the image data according to a standard algorithm.
- Step 120 A straight line of the text image is detected by a line detection method to obtain an edge of the trimmed quadrilateral.
- Step 130 Perform text detection on the image data to obtain the coordinates of the boundary point of the text area;
- Step 140 obtain the final trimming result according to the results of Step 120 and Step 130; Referring to FIG. 2, Step 140 specifically includes The following steps:
- Step 210 Form all the trimmed quads obtained in step 120 into a candidate quadrilateral set; Step 220, if the candidate quadrilateral set is empty, proceed to step 260, otherwise proceed to step 230; Step 230, remove the next candidate from the quadrilateral candidate set Quadrilateral
- Step 240 determining whether all boundary points of the text area are in the candidate quadrilateral, if yes, the text area is a valid trimming quadrilateral to step 250, if no, go to step 220;
- Step 250 returning a valid trimming quadrilateral
- Step 260 returning the entire image to the user for manually determining the trimming quadrilateral.
- the method for trimming a text image proposed by the present invention uses a technique in the field of image processing, pattern recognition, etc., when trimming a text image, by determining whether the trimmed quadrilateral contains a text region, thereby obtaining The exact text area.
- the method can obtain an accurate trimming area from the candidate quadrilateral of the trimming edge, and reduce the trouble of the user manually cutting the area and losing the data due to the trimming.
- the recorded ticket list is as shown in FIG. 3, and the image data therein is read by the method of the present invention, and the correction algorithm of automatic trimming and text detection is obtained.
- the image shown is shown in Figure 4.
- the mobile phone comes with a 3 megapixel camera, and uses the method described in the present invention to obtain a rectangular image of the captured business card image through automatic trimming and text detection correction algorithms.
- the description and application of the present invention are intended to be illustrative, and not intended to limit the scope of the invention. Variations and modifications of the embodiments disclosed herein are possible, and various alternative and equivalent components of the embodiments are well known to those of ordinary skill in the art. It will be apparent to those skilled in the art that the present invention may be embodied in other forms, structures, arrangements, ratios, and other components, materials and components without departing from the spirit or essential characteristics of the invention. Other variations and modifications of the embodiments disclosed herein may be made without departing from the scope and spirit of the invention.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Character Input (AREA)
- Editing Of Facsimile Originals (AREA)
- Studio Devices (AREA)
- Image Analysis (AREA)
- Processing Or Creating Images (AREA)
- Facsimile Image Signal Circuits (AREA)
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020137014846A KR101784919B1 (ko) | 2010-12-10 | 2011-08-02 | 텍스트 이미지의 트리밍 방법 |
JP2013542346A JP5896245B2 (ja) | 2010-12-10 | 2011-08-02 | テキスト画像のトリミング方法 |
EP11846097.1A EP2650821B1 (en) | 2010-12-10 | 2011-08-02 | Text image trimming method |
ES11846097.1T ES2609953T3 (es) | 2010-12-10 | 2011-08-02 | Procedimiento de recorte de imágenes de texto |
US13/992,383 US9082192B2 (en) | 2010-12-10 | 2011-08-02 | Text image trimming method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010105839734A CN102201053B (zh) | 2010-12-10 | 2010-12-10 | 一种文本图像的切边方法 |
CN201010583973.4 | 2010-12-10 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2012075817A1 true WO2012075817A1 (zh) | 2012-06-14 |
Family
ID=44661718
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2011/077914 WO2012075817A1 (zh) | 2010-12-10 | 2011-08-02 | 一种文本图像的切边方法 |
Country Status (8)
Country | Link |
---|---|
US (1) | US9082192B2 (zh) |
EP (1) | EP2650821B1 (zh) |
JP (1) | JP5896245B2 (zh) |
KR (1) | KR101784919B1 (zh) |
CN (1) | CN102201053B (zh) |
ES (1) | ES2609953T3 (zh) |
PT (1) | PT2650821T (zh) |
WO (1) | WO2012075817A1 (zh) |
Families Citing this family (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102201053B (zh) * | 2010-12-10 | 2013-07-24 | 上海合合信息科技发展有限公司 | 一种文本图像的切边方法 |
CN103678300B (zh) * | 2012-08-30 | 2020-02-07 | 深圳市世纪光速信息技术有限公司 | 一种图片转换方法及装置 |
CN103020619B (zh) * | 2012-12-05 | 2016-04-20 | 上海合合信息科技发展有限公司 | 一种自动切分电子化笔记本中手写条目的方法 |
CN102938063B (zh) * | 2012-12-05 | 2016-02-10 | 上海合合信息科技发展有限公司 | 一种方便电子化的专业笔记本及其电子化方法 |
CN103500337B (zh) * | 2013-09-30 | 2018-10-09 | 上海合合信息科技发展有限公司 | 用于智能无线通信终端的识别四边形边框的方法及装置 |
CN106796647B (zh) * | 2014-09-05 | 2018-09-14 | 北京市商汤科技开发有限公司 | 场景文本检测系统和方法 |
CN104408427A (zh) * | 2014-12-01 | 2015-03-11 | 上海合合信息科技发展有限公司 | 图像四边形识别的方法和装置 |
CN104636742B (zh) * | 2015-02-16 | 2017-12-15 | 珠海市追梦网络科技有限公司 | 一种通过摄像自动锁定目标题目并传输的方法 |
CN105046200B (zh) * | 2015-06-19 | 2020-06-09 | 成都理想境界科技有限公司 | 基于直线检测的电子阅卷方法 |
CN105092608B (zh) * | 2015-09-24 | 2017-11-03 | 哈尔滨工业大学 | 终端光学元件损伤在线检测中孪生像的剔除方法 |
CN105631396A (zh) * | 2015-11-13 | 2016-06-01 | 东莞酷派软件技术有限公司 | 一种获取文件电子档的方法及装置 |
CN106491039B (zh) * | 2016-11-14 | 2019-01-08 | 浙江工业大学 | 一种摆动式擦窗机器人玻璃边缘检测方法 |
CN106491042B (zh) * | 2016-11-14 | 2019-01-08 | 浙江工业大学 | 一种摆动式擦窗机器人玻璃边缘处向下移动处理方法 |
CN106491040B (zh) * | 2016-11-14 | 2019-04-09 | 浙江工业大学 | 一种摆动式擦窗机器人玻璃边缘触碰检测处理方法 |
CN106981070B (zh) * | 2017-02-13 | 2018-05-11 | 北京仁光科技有限公司 | 一种大屏系统交互行为状态的判断方法 |
CN110363196B (zh) * | 2019-06-20 | 2022-02-08 | 吴晓东 | 一种倾斜文本的文字精准识别的方法 |
CN111027560B (zh) * | 2019-11-07 | 2023-09-29 | 浙江大华技术股份有限公司 | 文本检测方法以及相关装置 |
CN112016551B (zh) * | 2020-10-23 | 2021-04-09 | 北京易真学思教育科技有限公司 | 文本检测方法、装置、电子设备及计算机存储介质 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1937698A (zh) * | 2006-10-19 | 2007-03-28 | 上海交通大学 | 图像畸变自动校正的图像处理方法 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS56140473A (en) * | 1980-04-03 | 1981-11-02 | Nec Corp | Character element cutting device |
US4764789A (en) * | 1986-07-28 | 1988-08-16 | Sanyo Electric Co., Ltd. | Apparatus for inputting image forming condition |
JPH09297814A (ja) * | 1996-04-30 | 1997-11-18 | Nippon Telegr & Teleph Corp <Ntt> | フィールド抽出方法 |
US7171056B2 (en) | 2003-02-22 | 2007-01-30 | Microsoft Corp. | System and method for converting whiteboard content into an electronic document |
JP4363151B2 (ja) | 2003-10-14 | 2009-11-11 | カシオ計算機株式会社 | 撮影装置、その画像処理方法及びプログラム |
JP4143566B2 (ja) * | 2004-04-16 | 2008-09-03 | キヤノン株式会社 | 文書処理装置及びその制御方法、並びにコンピュータプログラム |
EP1947605B1 (en) * | 2005-02-25 | 2019-03-06 | Psion Systems Inc. | Automatic perspective distortion detection and correction for document imaging |
JP4344888B2 (ja) | 2005-12-09 | 2009-10-14 | 株式会社カシオ日立モバイルコミュニケーションズ | 撮像装置、撮像画像処理方法及びプログラム |
JP4424314B2 (ja) * | 2006-02-03 | 2010-03-03 | カシオ計算機株式会社 | 書画撮影装置、書画静止検出方法及びプログラム |
JP4902568B2 (ja) * | 2008-02-19 | 2012-03-21 | キヤノン株式会社 | 電子文書生成装置、電子文書生成方法、コンピュータプログラム、および記憶媒体 |
CN101593277A (zh) * | 2008-05-30 | 2009-12-02 | 电子科技大学 | 一种复杂彩色图像中文本区域自动定位方法及装置 |
CN101425142B (zh) * | 2008-09-17 | 2011-05-11 | 北大方正集团有限公司 | 页面倾斜角度的确定方法和装置 |
US9165388B2 (en) * | 2008-09-22 | 2015-10-20 | International Business Machines Corporation | Method of automatic cropping |
CN102201053B (zh) * | 2010-12-10 | 2013-07-24 | 上海合合信息科技发展有限公司 | 一种文本图像的切边方法 |
-
2010
- 2010-12-10 CN CN2010105839734A patent/CN102201053B/zh active Active
-
2011
- 2011-08-02 US US13/992,383 patent/US9082192B2/en active Active
- 2011-08-02 WO PCT/CN2011/077914 patent/WO2012075817A1/zh active Application Filing
- 2011-08-02 KR KR1020137014846A patent/KR101784919B1/ko active IP Right Grant
- 2011-08-02 PT PT118460971T patent/PT2650821T/pt unknown
- 2011-08-02 EP EP11846097.1A patent/EP2650821B1/en active Active
- 2011-08-02 ES ES11846097.1T patent/ES2609953T3/es active Active
- 2011-08-02 JP JP2013542346A patent/JP5896245B2/ja active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1937698A (zh) * | 2006-10-19 | 2007-03-28 | 上海交通大学 | 图像畸变自动校正的图像处理方法 |
Non-Patent Citations (3)
Title |
---|
BU, FEIYU ET AL.: "Edge detection based image segmentation method for multi sheet of papers in one image", COMPUTER ENGINEERING AND DESIGN, vol. 28, no. 8, 30 April 2007 (2007-04-30), pages 1861 - 1863 * |
HE, ZHIMING: "Rectification of Perspective Document Image Captured by Digital Camera", JOURNAL OF SHANGHAI UNIVERSITY OF ENGINEERING SCIENCE, vol. 21, no. 3, 30 September 2007 (2007-09-30), pages 260 - 264 * |
See also references of EP2650821A4 * |
Also Published As
Publication number | Publication date |
---|---|
JP2014504400A (ja) | 2014-02-20 |
JP5896245B2 (ja) | 2016-03-30 |
US9082192B2 (en) | 2015-07-14 |
CN102201053A (zh) | 2011-09-28 |
PT2650821T (pt) | 2017-01-13 |
CN102201053B (zh) | 2013-07-24 |
ES2609953T3 (es) | 2017-04-25 |
EP2650821A4 (en) | 2014-03-12 |
US20130251262A1 (en) | 2013-09-26 |
EP2650821A1 (en) | 2013-10-16 |
KR101784919B1 (ko) | 2017-10-12 |
EP2650821B1 (en) | 2016-10-05 |
KR20130126929A (ko) | 2013-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2012075817A1 (zh) | 一种文本图像的切边方法 | |
JP4772839B2 (ja) | 画像識別方法および撮像装置 | |
RU2631765C1 (ru) | Способ и система исправления перспективных искажений в изображениях, занимающих двухстраничный разворот | |
WO2012068902A1 (zh) | 提高文本图像清晰度的方法及系统 | |
TW201230789A (en) | Image processing system and object of image capturing used therewith | |
US20170070664A1 (en) | Method for processing image and electronic apparatus therefor | |
CN109241345B (zh) | 基于人脸识别的视频定位方法和装置 | |
WO2021008205A1 (zh) | 图像处理 | |
JP2019087945A (ja) | 情報処理装置、プログラム及び情報処理方法 | |
US8520950B2 (en) | Image processing device, image processing method, program, and integrated circuit | |
CN113158773B (zh) | 一种活体检测模型的训练方法及训练装置 | |
WO2022111461A1 (zh) | 识别方法、装置及电子设备 | |
JP6564136B2 (ja) | 画像処理装置、画像処理方法、および、プログラム | |
WO2018107574A1 (zh) | 一种对印防伪特征的检测方法及装置 | |
WO2022042669A1 (zh) | 一种图片处理方法、装置、设备及存储介质 | |
CN113379713B (zh) | 证件图像的检测方法及其装置 | |
US20120133610A1 (en) | Method for adjusting region of interest and related optical touch module | |
KR20230017774A (ko) | 정보 처리 장치, 정보 처리 방법, 및 프로그램 | |
JP6669390B2 (ja) | 情報処理装置、情報処理方法、及びプログラム | |
CN110971820B (zh) | 拍照方法、拍照装置、移动终端及计算机可读存储介质 | |
CN111885371A (zh) | 图像遮挡检测方法、装置、电子设备和计算机可读介质 | |
CN105100616B (zh) | 一种图像处理方法及电子设备 | |
JP2009159525A (ja) | 撮像装置及び画像合成プログラム | |
JP2002207963A (ja) | 画像処理装置 | |
JP6217225B2 (ja) | 画像照合装置、画像照合方法及びプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11846097 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13992383 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 2013542346 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20137014846 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REEP | Request for entry into the european phase |
Ref document number: 2011846097 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011846097 Country of ref document: EP |