WO2014086277A1 - Ordinateur portable professionnel commode pour une électronisation et procédé pour identifier automatiquement un numéro de page de celui-ci - Google Patents

Ordinateur portable professionnel commode pour une électronisation et procédé pour identifier automatiquement un numéro de page de celui-ci Download PDF

Info

Publication number
WO2014086277A1
WO2014086277A1 PCT/CN2013/088425 CN2013088425W WO2014086277A1 WO 2014086277 A1 WO2014086277 A1 WO 2014086277A1 CN 2013088425 W CN2013088425 W CN 2013088425W WO 2014086277 A1 WO2014086277 A1 WO 2014086277A1
Authority
WO
WIPO (PCT)
Prior art keywords
page
paper
page number
area
type
Prior art date
Application number
PCT/CN2013/088425
Other languages
English (en)
Chinese (zh)
Inventor
曹璐
镇立新
罗希平
Original Assignee
上海合合信息科技发展有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海合合信息科技发展有限公司 filed Critical 上海合合信息科技发展有限公司
Publication of WO2014086277A1 publication Critical patent/WO2014086277A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems

Definitions

  • the invention belongs to the technical field of electronic computers, and relates to a page number identification method for an electronic document, in particular to a professional notebook which is convenient for electronic use and a method for automatically identifying the page number thereof. Background technique
  • Smartphones are one of the commonly used tools for electronically documenting paper documents. Since the camera is usually equipped with a camera on the smartphone, the camera on the mobile phone can take a paper document, and the captured electronic document can be processed into a certain image, and then converted into a JPEG format photo, or a PDF document can be generated. Applications with the above features have also become more popular, such as apps in the Apple App Store and the Google Store.
  • CamS Canne r These applications can automatically monitor the four sides of the captured document from the captured image, use this as a reference to cut off the background outside the document area in the image, and perform correction and image enhancement on the document area to obtain a scanner similar to the one used.
  • the effect of scanning a clean and clean electronic document is saved and managed in a user-specified format.
  • the common paper documents that need to be electronically are paper notebook pages.
  • Some notebook pages have a page number printed on them, but the general notebook page does not print the page number, the bottom of the bottom line printed on the notebook page, until the bottom edge of the page is blank, just for the sake of beauty, no actual effect.
  • Electronically copy notebook pages with application software such as CamScanner and save them to electronic devices such as smartphones in JPEG image format or PDF document format. After the number of electronic pages increases, management and retrieval will be performed.
  • an object of the present invention is to provide a professional notebook that is convenient for electronic use and a method for automatically identifying the page number thereof, which is used to solve the disorder of the electronic document of the paper page in the prior art, which is difficult to be Implement the management and query of content.
  • the present invention provides a professional notebook that is convenient for electronic use and an automatic identification method for the page number thereof.
  • a professional notebook that is convenient for electronic printing, and a page marked area is printed on a paper page of the electronic notebook that is convenient for electronic use.
  • the page number labeling area is fixedly disposed at a designated position on the paper page.
  • a method for automatically recognizing a page number of a professional notebook that is convenient for electronic use, and the method for automatically recognizing a page number of the electronic notebook for convenient electronicization includes:
  • Identifying the page number content in the page number labeling area is added to the electronic document of the paper page.
  • the type of the paper page is determined by the size and format of the paper page; the format of the paper page includes the number of branch lines printed on the paper page, or / and whether there is a page number labeling area, or / and page number mark the size and location of the area.
  • the page number labeling area is fixedly disposed at a designated position on the paper page.
  • a specific implementation manner of determining the type of the paper page according to the paper page image is: manually specifying the type of the paper page.
  • a specific implementation manner of determining the type of the paper page according to the paper page image is: a fixed position on the paper page Printing a type of mark; detecting a type mark on the paper page image, comparing the detected type mark to a previously known type mark, and finding the type to which the paper page belongs.
  • the specific implementation manner of determining the type of the paper page according to the paper page image is: creating a new type of paper page, inputting the type The size and format of an unknown paper page.
  • the page number labeling area is a printed font area or a handwritten font area.
  • the page number labeling area is a handwritten font area
  • the image block is binarized to detect the number of pixels of the front sights representing the handwriting of the user. If the proportion of the number in the entire page number labeling area exceeds a preset threshold, the page labeling area is not blank. Otherwise it is blank.
  • the electronic notebook and the automatic identification method for the page number thereof have the following beneficial effects:
  • the invention solves the electronic document of the paper page by printing the page number labeling area on the notebook paper page and automatically identifying the page number content of the page labeling area when the notebook paper page is electronicized by the application software such as CamScanner.
  • the order of confusion is conducive to electronic document management and query.
  • FIG. 1 is a schematic view showing the structure of a professional notebook which is convenient for electronic use according to the present invention.
  • FIG. 2 is a schematic view showing another structure of a professional notebook which is convenient for electronic use according to the present invention.
  • FIG. 3 is a flow chart showing the automatic page number identification method of the electronic notebook which is convenient for electronic use according to the present invention.
  • the embodiment provides a professional notebook that is convenient for electronic use.
  • the page number labeling area 101 is printed on the paper page 100 of the electronic notebook that is convenient for electronic printing.
  • the page number labeling area 101 is fixedly disposed at a specified position on the paper page 100.
  • the page number labeling area 101 can be accurately scanned, and the page number content in the extracted page number labeling area 101 is added to the electronic document of the page number.
  • the above specified position can be anywhere on the paper page, such as the header position of the paper page, or the position of the footer, see Figures 1 and 2.
  • the embodiment further provides a method for automatically recognizing a page of a professional notebook which is convenient for electronic use, wherein the professional notebook which is convenient for electronic use is a convenient professional electronic notebook provided by the embodiment, as shown in the figure.
  • the automatic page number identification method of the electronic notebook is convenient for:
  • a page number labeling area is printed on the paper page of the electronic notebook that is convenient for electronic printing.
  • the page number labeling area is fixedly disposed at a specified position on the paper page. In this way, when the paper page is electronically printed, the page number labeling area can be accurately scanned, and the page number content in the page number labeling area is extracted.
  • the page number labeling area is used for the user to write or print the page number. For example, under the last branch line of the notebook page, a rectangular area is printed with a dotted line to prompt the user to write the page number therein.
  • the page number can be in the form of a number, or a letter, or any form that distinguishes the order.
  • the type of the paper page is determined based on the paper page image, thereby obtaining a position of a page number labeling area printed on a paper page of the professional notebook in the paper page.
  • the type of the paper page is determined by the size and format of the paper page; the format of the paper page includes the number of branch lines printed on the paper page, or/and whether there is a page number labeling area. , or / and page number to mark the size and location of the area. That is, the format of the paper page may be any kind of situation, for example, only the branch line is printed on the paper page, or only the page mark area and the size and position of the page mark area are printed, or both The printed line has a page mark area printed on it.
  • Determining four edge lines of the paper page image by a line detection method in the image and four
  • the page area defined by the edge line is corrected to a square area, and the exact position of the page number labeling area in the square area is determined.
  • four straight lines representing the outer edge of the page in the page image are acquired by the line detection in the image, and the four background areas outside the range defined by the outer edge of the page are cut off, and the four outer pages are represented by the four lines.
  • the captured image is corrected based on the straight line, and the four page areas defined by the straight lines representing the outer edge of the page are corrected into a square area, which may be a rectangular area or a square area.
  • the exact position of the page number labeling area in the paper page can be determined according to the type of the paper page and the corrected page area, thereby accurately obtaining the page number of the paper page.
  • Identifying the page number content in the page number labeling area is added to the electronic document of the paper page.
  • the page number labeling area is extracted from the corrected paper page image, and the page number content printed or handwritten in the page labeling area is identified, and the page number content is added to the electronic document of the paper page. , for later management and query.
  • the page number labeling area is a printed font area or a handwritten font area.
  • the specific determining method is: performing image block of the page number labeling area Binary processing, detecting the number of pixels of the front sights in which the user's handwriting is represented. If the proportion of the number in the entire area exceeds a preset threshold, the page number labeling area is not blank, otherwise it is blank. .
  • the accuracy of the recognition is greatly improved. There are two situations at this time. One is that the page number is printed. This type of page number is generally Arabic numerals, which is easy to identify and has high accuracy.
  • the other is that the page number is filled by the user.
  • First it is judged whether the user has handwritten the page number in this area, and there are many methods for judging.
  • a judgment example is given to illustrate the technical feasibility, such as performing binary value on the image block representing the extracted page code labeling area. Processing, checking the number of pixels of the front sights representing the handwriting of the user, if the proportion of the number in the entire page number labeling area exceeds a predetermined threshold, the page labeling area is considered to be blank.
  • the page number writing area is blank; if it is detected that the page number labeling area is blank, no identification of the page number is performed; if it is detected that the page number labeling area is not blank, handwriting is performed in the area Identification of page numbers.
  • the invention automatically prints the page number content of the page labeling area by printing the page number labeling area on the notebook paper page, and automatically digitizes the notebook paper page when using the application software such as CamScanner, thereby facilitating electronic document management and query.
  • the method of increasing the page number labeling area determines the order of the electronic document, which not only facilitates the management and query of the electronic document, but also makes the reading of the electronic document faster and clearer.
  • the embodiment provides a method for automatically recognizing the page number of the professional notebook which is convenient for electronicization, and the difference from the automatic page number identification method of the electronic notebook for facilitating electronicization according to the first embodiment is: the paper page is known in advance.
  • the specific implementation manner of determining the type of the paper page according to the paper page image is: manually specifying the type of the paper page; that is, manually specifying the image before the image is taken, or after the image is processed after the image is taken.
  • the type of notebook paper page to which it belongs such as one of a series of notebook page types that are pre-stored in applications such as camScanner.
  • the embodiment provides a method for automatically recognizing the page number of the professional notebook which is convenient for electronicization, and the difference from the automatic page number identification method of the electronic notebook for convenient electronicization according to the first and second embodiments is: the paper is known in advance
  • the type of the page, the specific implementation manner of determining the type of the paper page according to the paper page image is:
  • a type of indicia is printed at a fixed position on the paper page; the type of indicia can be a text, a symbol, a graphic, or a combination of any two or three.
  • a type mark on the paper page image is detected, and the detected type mark is compared with a previously known type mark to find out the type to which the paper page belongs.
  • the four outer edges of the paper page of the notebook are detected in the image, and the approximate position of the mark is determined in the image of the paper page with reference to the four outer edges, thereby realizing the mark
  • the detection in the image then compares the detected mark with the pre-stored mark of the paper page representing a plurality of different types of notebooks to find out the type of the paper page of the photographed notebook.
  • the detected mark is compared with the pre-stored mark representing a plurality of different types of notebook paper pages to find out the type of the paper page of the photographed notebook, which involves handwriting recognition, text recognition, Mature techniques in the art such as image matching are not described herein.
  • Embodiment 4
  • the embodiment provides a method for automatically recognizing a page of a professional notebook which is convenient for electronic use, and the implementation thereof
  • the difference in the automatic page number identification method of the electronic notebook for convenience in the first embodiment is: the type of the paper page is not known in advance, and in this case, the paper page is determined according to the paper page image.
  • the specific implementation of the type is:
  • the paper page of the notebook being photographed does not belong to a type of paper page printed with bold or/and lengthened branch lines, or/and a line of division, or/and a title area, which is known in advance by an application such as CamScanner. Then, in the subsequent steps, the type of the unknown paper page is first added to the type of the newly created paper page, and then the subsequent processing is performed.
  • the invention needs to know in advance the printing type of the page number or the page number labeling area in the application software such as CamScanner, and the position of the corresponding page number or page number labeling area in the notebook page, so as to realize the notebook page (ie the above paper quality) Pages are automatically added to the page number in the electronic document. If the notebook page has a printed or user-written page number before the electronic version, the method of the present invention can electronically identify the page number and save it, which is very advantageous for the subsequent retrieval of the electronic document.
  • the invention facilitates the electronic document management method by printing a page number labeling area on a notebook page and automatically identifying the page number when the notebook page is electronically used by an application software such as CamScanner.
  • the present invention effectively overcomes various shortcomings in the prior art and has high industrial utilization value.
  • the above-described embodiments are merely illustrative of the principles of the invention and its advantages, and are not intended to limit the invention. Modifications or variations of the above-described embodiments may be made by those skilled in the art without departing from the spirit and scope of the invention. Therefore, all equivalent modifications or changes made by those skilled in the art without departing from the spirit and scope of the invention are still covered by the appended claims.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Processing Or Creating Images (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

La présente invention propose un ordinateur portable professionnel commode pour une électronisation et un procédé pour identifier automatiquement un numéro de page de celui-ci. Le procédé comprend : la photographie d'une image de page papier de l'ordinateur portable professionnel commode pour une électronisation ; la détermination d'un type d'une page papier conformément à l'image de page papier, de manière à obtenir une position, dans la page papier, d'une zone de marquage de numéro de page imprimée sur la page papier de l'ordinateur portable professionnel ; la détermination, au moyen d'un procédé de détection de droite dans l'image, de quatre lignes de bord de l'image de page papier, la correction d'une zone de page limitée par les quatre lignes de bord en tant que zone carrée, et la détermination d'une position précise de la zone de marquage de numéro de page dans la zone carrée ; et l'ajout du contenu de numéro de page dans la zone de marquage de numéro de page dans un fichier électronique de la page papier. Dans la présente invention, une zone de marquage de numéro de page est imprimée sur une page papier d'un ordinateur portable, et lorsqu'un logiciel d'application tel que CamScanner est utilisé pour effectuer une électronisation, le contenu de numéro de page de la zone de marquage de numéro de page est identifié automatiquement, ce qui est commode pour la gestion et l'interrogation du fichier électronique.
PCT/CN2013/088425 2012-12-05 2013-12-03 Ordinateur portable professionnel commode pour une électronisation et procédé pour identifier automatiquement un numéro de page de celui-ci WO2014086277A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2012105171661A CN102938061A (zh) 2012-12-05 2012-12-05 方便电子化的专业笔记本及其页码自动识别方法
CN201210517166.1 2012-12-05

Publications (1)

Publication Number Publication Date
WO2014086277A1 true WO2014086277A1 (fr) 2014-06-12

Family

ID=47696956

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/088425 WO2014086277A1 (fr) 2012-12-05 2013-12-03 Ordinateur portable professionnel commode pour une électronisation et procédé pour identifier automatiquement un numéro de page de celui-ci

Country Status (2)

Country Link
CN (1) CN102938061A (fr)
WO (1) WO2014086277A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105812667A (zh) * 2016-04-15 2016-07-27 张磊 一种快速拍摄ppt的系统和方法

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102938061A (zh) * 2012-12-05 2013-02-20 上海合合信息科技发展有限公司 方便电子化的专业笔记本及其页码自动识别方法
CN103034842A (zh) * 2012-12-05 2013-04-10 上海合合信息科技发展有限公司 一种方便电子化的专业笔记本及其电子缩略图显示方法
CN102938063B (zh) * 2012-12-05 2016-02-10 上海合合信息科技发展有限公司 一种方便电子化的专业笔记本及其电子化方法
CN104714711B (zh) * 2013-12-11 2017-12-15 汉王科技股份有限公司 电磁式记事本以及自动换页方法
CN104484397A (zh) * 2014-12-16 2015-04-01 上海合合信息科技发展有限公司 图像文档自动排序方法及装置
CN106515258B (zh) * 2016-11-10 2017-12-19 深圳市科迈爱康科技有限公司 笔记本、智能终端及笔记本内容索引创建方法
CN107393356A (zh) * 2017-04-07 2017-11-24 深圳市友悦机器人科技有限公司 控制方法、控制装置和早教机
CN107766854B (zh) * 2017-09-28 2021-07-06 电子科技大学 一种基于模板匹配实现快速页码识别的方法
CN109977934A (zh) * 2017-12-27 2019-07-05 田雪松 一种信息记录方法及装置
CN108810307B (zh) * 2018-06-15 2020-09-04 深圳市成者云科技有限公司 一种边框页码扫描系统
CN110175550B (zh) * 2019-05-20 2021-04-09 青岛罗博智慧教育技术有限公司 页码识别容错编码的方法
CN110232364A (zh) * 2019-06-18 2019-09-13 华中师范大学 一种答题卡页码识别方法及装置
CN110532938B (zh) * 2019-08-27 2022-05-24 海南阿凡题科技有限公司 基于Faster-RCNN的纸质作业页码识别方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101807192A (zh) * 2009-12-31 2010-08-18 优视科技有限公司 一种用于移动通讯设备终端的网页页面光学字符识别处理方法
CN102081732A (zh) * 2010-12-29 2011-06-01 方正国际软件有限公司 一种版式识别模板方法及系统
CN102084378A (zh) * 2008-05-06 2011-06-01 计算机连接管理中心公司 基于照相机的文档成像
CN102938061A (zh) * 2012-12-05 2013-02-20 上海合合信息科技发展有限公司 方便电子化的专业笔记本及其页码自动识别方法

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1603072A1 (fr) * 2004-06-02 2005-12-07 CCS Content Conversion Specialists GmbH Procédé et dispositif pour analyser la structure d'un document
CN101976114B (zh) * 2010-09-29 2012-07-04 长安大学 一种基于摄像头的计算机与纸笔信息交互系统及方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102084378A (zh) * 2008-05-06 2011-06-01 计算机连接管理中心公司 基于照相机的文档成像
CN101807192A (zh) * 2009-12-31 2010-08-18 优视科技有限公司 一种用于移动通讯设备终端的网页页面光学字符识别处理方法
CN102081732A (zh) * 2010-12-29 2011-06-01 方正国际软件有限公司 一种版式识别模板方法及系统
CN102938061A (zh) * 2012-12-05 2013-02-20 上海合合信息科技发展有限公司 方便电子化的专业笔记本及其页码自动识别方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105812667A (zh) * 2016-04-15 2016-07-27 张磊 一种快速拍摄ppt的系统和方法

Also Published As

Publication number Publication date
CN102938061A (zh) 2013-02-20

Similar Documents

Publication Publication Date Title
WO2014086277A1 (fr) Ordinateur portable professionnel commode pour une électronisation et procédé pour identifier automatiquement un numéro de page de celui-ci
US10339378B2 (en) Method and apparatus for finding differences in documents
CN103020619B (zh) 一种自动切分电子化笔记本中手写条目的方法
WO2014086279A1 (fr) Ordinateur portable professionnel adapté à l'électronisation et procédé de classement automatique de fichier électronique associé
US9311531B2 (en) Systems and methods for classifying objects in digital images captured using mobile devices
WO2022057707A1 (fr) Procédé de reconnaissance de texte, procédé de classification de reconnaissance d'image et procédé de traitement de reconnaissance de document
WO2014086272A1 (fr) Bloc-notes professionnel adapté à l'électronisation et procédé permettant d'inclure ce bloc-notes dans un agenda électronique
US20220222284A1 (en) System and method for automated information extraction from scanned documents
RU2656573C2 (ru) Методы обнаружения введенных пользователем контрольных меток
CN110909740A (zh) 信息处理装置以及存储介质
US11881043B2 (en) Image processing system, image processing method, and program
WO2014082551A1 (fr) Procédé et dispositif pour obtenir des contenus dans un cahier en papier
CN112784220B (zh) 一种纸质合同防篡改校验方法及系统
US20210209393A1 (en) Image processing system, image processing method, and program
US9818028B2 (en) Information processing apparatus for obtaining a degree of similarity between elements
WO2014086266A1 (fr) Bloc-notes professionnel adapté à l'électronisation et procédé associé destiné à afficher une vignette électronique
JP2018042067A (ja) 画像処理システム、画像処理方法、情報処理装置
US10579653B2 (en) Apparatus, method, and computer-readable medium for recognition of a digital document
WO2014086265A1 (fr) Bloc-notes spécial pouvant être électronisé et son procédé d'électronisation
JP2008282094A (ja) 文字認識処理装置
US7920742B2 (en) Image processing apparatus, program and recording medium for document registration
JP6540597B2 (ja) 情報処理装置、情報処理方法及びプログラム
CN113887484B (zh) 一种卡片式文件图像识别方法和装置
JP2007173938A5 (fr)
JP2014010674A (ja) 情報管理システム、画像処理装置、制御方法、及び、制御プログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13861171

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13861171

Country of ref document: EP

Kind code of ref document: A1