WO2021260852A1 - Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données - Google Patents

Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données Download PDF

Info

Publication number
WO2021260852A1
WO2021260852A1 PCT/JP2020/024831 JP2020024831W WO2021260852A1 WO 2021260852 A1 WO2021260852 A1 WO 2021260852A1 JP 2020024831 W JP2020024831 W JP 2020024831W WO 2021260852 A1 WO2021260852 A1 WO 2021260852A1
Authority
WO
WIPO (PCT)
Prior art keywords
image data
data
entry field
machine
data processing
Prior art date
Application number
PCT/JP2020/024831
Other languages
English (en)
Japanese (ja)
Inventor
長年 名和
志朗 小笠原
史拓 横瀬
郁子 高木
Original Assignee
日本電信電話株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電信電話株式会社 filed Critical 日本電信電話株式会社
Priority to PCT/JP2020/024831 priority Critical patent/WO2021260852A1/fr
Publication of WO2021260852A1 publication Critical patent/WO2021260852A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • the present invention relates to a data processing apparatus, a data processing method, and a data processing program.
  • Non-Patent Document 1 There is a technology to convert paper forms into machine-readable data by OCR (Optical Character Recognition) processing in order to efficiently input information to the system (see Non-Patent Document 1). According to the above technology, it is possible to efficiently input information to the system, but since there may be an error in the conversion result by OCR processing, it is necessary for a human to visually confirm the conversion result in the end. there were.
  • OCR Optical Character Recognition
  • the conversion result by OCR processing can be pasted as transparent text on a bitmap image. Therefore, according to such a technique, the result of conversion by the OCR process can be visually confirmed.
  • the above PDF document only sets the text that is the result of OCR processing of the image while maintaining the position of the image that can be recognized as characters. Therefore, even if you convert forms that have mutual relationships between data such as data repetition structure and nesting structure into PDF documents and set the text that is the result of OCR processing, these relationships should be retained. Can't. It is also not possible to retrieve the results of OCR processing (machine-readable data) across multiple forms of the same format and perform aggregation processing. Also, in the case of the above PDF document, there is a problem that it is necessary for a human to visually confirm the presence or absence of a conversion error in the OCR process one by one.
  • the present invention has an input unit that accepts input of image data obtained by scanning a paper form, and an extraction unit that extracts image data of an entry field in the paper form from the image data.
  • the OCR processing unit that converts the image data in the entry field into machine-readable data by performing OCR (Optical Character Recognition) processing on the image data in the entry field, and the extracted location in the entry field. It is characterized by having a table calculation software control unit that holds the image data of the above and the machine-readable data obtained by converting the image data of the entry field in the same cell in the spreadsheet used by the table calculation software. do.
  • FIG. 1 is a diagram showing an example of a paper form.
  • FIG. 2 is a diagram showing an example of data created by a data processing device.
  • FIG. 3 is a diagram showing a configuration example of a data processing device.
  • FIG. 4 is a flowchart showing an example of a processing procedure of the data processing apparatus.
  • FIG. 5 is a diagram showing an example of an Excel (registered trademark) form.
  • FIG. 6 is a diagram showing an example of a state in which the outer frame of the image data of the form is enlarged or reduced so as to match the outer frame of the Excel form.
  • FIG. 7 is a diagram showing an example of a state in which an area corresponding to a cell in an entry field of an Excel form is cut out from the image data of the paper form shown in FIG.
  • FIG. 1 is a diagram showing an example of a paper form.
  • FIG. 2 is a diagram showing an example of data created by a data processing device.
  • FIG. 3 is a diagram showing a configuration
  • FIG. 8 is a diagram showing an example of a state in which the image data of the cut-out entry field is pasted in the cell corresponding to the image data on the Excel form.
  • FIG. 9 is a diagram showing an example of a schema representation of data created by a data processing device.
  • FIG. 10 is a diagram for explaining the effect of the data processing device.
  • FIG. 11 is a diagram showing an example of a computer that executes a data processing program.
  • the data processing device of the present embodiment extracts the image data of the entry column from the image data of the paper form, and converts the image data of the extracted entry column into machine-readable data by OCR processing. Then, the data processing device holds the extracted image data and the machine-readable data converted from the image data in the same cell in the spreadsheet used by the spreadsheet software.
  • the data processing device uses the image data of the entry field extracted from the paper form (see FIG. 1) and the machine-readable data converted from the image data in the spreadsheet. Hold it on top of the cell corresponding to the entry field.
  • the data processing device can create a spreadsheet in which the machine-readable data obtained from the paper form and the image data based on the machine-readable data can be easily visually confirmed by the user.
  • the data processing device 10 includes, for example, electronic circuits such as a CPU (Central Processing Unit) and MPU (Micro Processing Unit), and integrated circuits such as an ASIC (Application Specific Integrated Circuit) and an FPGA (Field Programmable Gate Array). .. Further, the data processing device 10 has an internal memory for storing programs and control data that specify various processing procedures, and executes each process using the internal memory. Further, the data processing device 10 realizes the functions of the following parts by executing various programs.
  • a CPU Central Processing Unit
  • MPU Micro Processing Unit
  • ASIC Application Specific Integrated Circuit
  • FPGA Field Programmable Gate Array
  • the data processing device 10 includes a spreadsheet software 11, a spreadsheet software control unit 12, an image data acquisition unit (extraction unit) 13, a coordinate conversion unit 14, and an image cutting unit 15. And an OCR processing unit 16.
  • Spreadsheet software 11 is software that uses spreadsheets to perform spreadsheets.
  • Excel registered trademark
  • the above spreadsheet has multiple cells and can hold various characters, symbols and values in the cells.
  • the colors of characters, symbols, and values in each cell, the color tone of the cell itself, and the like can be changed by setting.
  • image data can be set as the background of each cell in the spreadsheet.
  • the spreadsheet software control unit 12 controls the spreadsheet software 11.
  • the spreadsheet software control unit 12 uses the spreadsheet software 11 to convert the image data of the form entry field cut out by the image cutting unit 15 and the machine-readable data converted from the image data by the OCR processing unit 16. Keep it in the same cell in the spreadsheet used.
  • the spreadsheet software control unit 12 includes, for example, a cell information acquisition unit 121, an image pasting unit 122, and a cell information setting unit 123.
  • the cell information acquisition unit 121 acquires the cell information of the spreadsheet. For example, the cell information acquisition unit 121 acquires the coordinates of the cell corresponding to the entry field of the spreadsheet (for example, an Excel form).
  • the image pasting unit 122 pastes the image data cut out by the image cutting unit 15 (for example, the image data in the entry field of the form) into the cell corresponding to the image data on the spreadsheet.
  • the cell information setting unit 123 sets the cell information of the spreadsheet. For example, the cell information setting unit 123 holds the result of conversion (machine readable data) of the image data in the entry field output from the OCR processing unit 16 by the OCR processing as the value of the cell corresponding to the entry field.
  • the image data in the entry field is pasted in the cell by the image pasting unit 122. As a result, the image data in the entry field and the machine-readable data converted from the image data are superposed and held in the same cell in the spreadsheet.
  • the cell information setting unit 123 sets the character color, background color, etc. of each cell in the spreadsheet. For example, the cell information setting unit 123 determines the character color or background of the cell holding the machine-readable data based on the matching rate between the image data and the machine-readable data when the image data is converted by OCR processing. Select and set the color.
  • the cell information setting unit 123 sets the character color or background color of a cell whose matching rate between the image data and the machine-readable data is equal to or more than a predetermined value when the image data is converted by OCR processing. Change to. As a result, the user can intuitively grasp the cell with high reliability of conversion. Further, the cell information setting unit 123 may change the character color or the background color to a conspicuous color for the cells whose matching degree is less than a predetermined value. As a result, the user can intuitively grasp the cell that requires attention at the time of visual check.
  • the cell information setting unit 123 may increase the transparency of the image data (that is, the image data of the conversion source of the machine-readable data) attached to the cell having the above-mentioned matching degree of a predetermined value or more. As a result, the user can intuitively grasp the cells that do not have to worry about the image data at the time of visual check.
  • the image data acquisition unit 13 acquires the image data of the form. This image data can be obtained, for example, by scanning a paper form.
  • the coordinate conversion unit 14 converts the cell coordinates in the form entry field acquired by the cell information acquisition unit 121 into the coordinates on the image data of the form acquired by the image data acquisition unit 13. That is, the coordinate conversion unit 14 determines which area on the image data of the form acquired by the image data acquisition unit 13 corresponds to the cell area of the form entry field acquired by the cell information acquisition unit 121. do.
  • the image cutting unit 15 cuts out the image data.
  • the image cutting unit 15 cuts out an area of image data surrounded by the coordinates converted by the coordinate conversion unit 14 from the image data acquired by the image data acquisition unit 13.
  • the OCR processing unit 16 outputs the result of converting the image data into machine-readable data by OCR processing on the image data.
  • FIG. 5 An example of processing procedure of the data processing apparatus 10 will be described with reference to FIG.
  • this Excel form includes cells having a position and a size corresponding to each entry field of the paper form.
  • This Excel form may be stored inside the data processing device 10, or may be acquired by the data processing device 10 stored in an external storage device.
  • the image data acquisition unit 13 of the data processing device 10 scans a paper form (see FIG. 1) to generate image data (S1), and performs tilt correction and noise removal on the generated image data. (S2).
  • the data processing device 10 selects an Excel form corresponding to the paper form scanned in S1 (S3). Then, the data processing device 10 enlarges or reduces the image data so that the outer frame of the image data after tilt correction and noise removal in S2 matches the outer frame of the Excel form selected in S3 (S4). ). For example, as shown in FIG. 6, the data processing device 10 enlarges or reduces the image data so that the outer frame of the image data of the paper form matches the outer frame of the Excel form, and the image data is converted into the Excel form. Set as the background of.
  • the data processing device 10 selects a cell with a colored background (cell corresponding to the entry field of the paper form) among the cells of the Excel form selected in S3 (S5). For example, the data processing device 10 selects a cell having a colored background from the cells of the Excel form shown in FIG.
  • the cell information acquisition unit 121 acquires the coordinates of the four vertices of the cell selected in S5 (S6). Then, the coordinate conversion unit 14 calculates the coordinates on the image data of the paper form corresponding to the coordinates of the four vertices of the cell acquired by the cell information acquisition unit 121 in S6 (S7: Corresponding on the image data). Calculate the coordinates).
  • the image cutting unit 15 cuts out an area of image data surrounded by the coordinates calculated in S7 (S8). For example, as shown in FIG. 7, the image cutting unit 15 cuts out an area corresponding to a cell in an entry field of an Excel form from the image data of the form. Further, the image pasting unit 122 pastes the image data cut out in S8 into the cell corresponding to the image data on the Excel form (S9). For example, as shown in FIG. 8, the image pasting unit 122 pastes the image data in the entry field into the cell corresponding to the image data on the Excel form.
  • the OCR processing unit 16 performs OCR processing on the image data cut out in S8 (S10). Then, the cell information setting unit 123 sets the result of the above OCR processing as the value of the cell selected in S5 (S11). Further, the cell information setting unit 123 sets the background color tone corresponding to the matching rate of the result of the OCR processing in S10 as the background color tone of the cell selected in S5 (S12).
  • the cell information setting unit 123 has the background color tone of the cell as shown in FIG. To a predetermined background color tone (match rate: background color tone corresponding to high).
  • the Excel form shown in FIG. 2 includes cells that are not subject to OCR processing, and the cell information setting unit 123 sets a background color tone corresponding to cells that are not subject to OCR processing for the cells. ing.
  • the cell information setting unit 123 determines whether or not all the cells whose background is colored have been processed (S13), and when it is determined that there is a cell that has not been processed yet (No in S13). ), Return to S6. On the other hand, when the cell information setting unit 123 determines that all the cells whose background is colored have been processed (Yes in S13), the processing is terminated.
  • the image data of the entry field of the paper form and the machine-readable data converted from the image data are held in the cell corresponding to the entry field in the Excel form. It is possible to create the data (see FIG. 2).
  • the data in which the image data in the entry field of the above paper form and the machine-readable data converted from the image data are held in the same cell in the spreadsheet may be stored in a database or the like. ..
  • the schema representation of the data will be, for example, as shown in FIG.
  • the data includes information indicating the cell position of the spreadsheet, the conversion result by the OCR process held in the cell at the cell position, the match rate in the OCR process, and the OCR process. It is represented by the image data (original image data) that is the target of.
  • the data processing device 10 holds the result of OCR processing (machine readable data) for the image data of an item such as a paper form and the image data of the item in the same cell in the spreadsheet.
  • OCR processing machine readable data
  • the user when the paper form is converted into machine-readable data and distributed, the user only has to visually check only the necessary parts of the form (reference numeral 101), so that the machine-readable data of the paper form can be distributed by humans. The cost can be reduced.
  • the data processing device 10 holds the result of OCR processing (machine readable data) for the image data of an item such as a paper form and the image data of the item in the same cell in the spreadsheet. This makes it possible to keep the spreadsheet including the relationships between each item of the form. As a result, for example, it becomes easy to take out the result of OCR processing (machine-readable data) across a plurality of paper forms of the same format and perform aggregation processing.
  • the user can select automatic processing or manual processing according to the high accuracy required for machine-readable data such as paper forms.
  • machine-readable data such as paper forms.
  • the user can efficiently perform the process by selecting the automatic process (that is, the user does not visually check the machine-readable data created by the data processing device 10).
  • the machine-readable data requires strict accuracy
  • the user performs manual processing (that is, the user visually checks the machine-readable data created by the data processing device 10 using image data. By selecting (Make necessary corrections above), the accuracy of machine-readable data can be ensured.
  • the data processing device 10 holds the machine-readable data of the image data of the item such as a paper form and the image data of the item in the spreadsheet, thereby converting the image data of the item into the machine-readable data.
  • the above conversion work is performed as a part of the data input work. Therefore, the work place of the conversion work is limited to the place where the terminal for data input is located.
  • the spreadsheet processed by the data processing device 10 as a container, there are no restrictions on the work place. As a result, for example, it is possible to send the spreadsheet to the contractor and receive the result of the conversion work.
  • the data processing device 10 performs OCR processing on image data of items such as paper forms
  • the character color of the cell in which the result of the OCR processing (machine readable data) is held based on the match rate of the OCR processing. , Background color, transparency of image data, etc. This makes it easier for the user to intuitively distinguish between cells that require attention and cells that do not require attention when visually checking the result of OCR processing (conversion result) on the spreadsheet.
  • program Further, it can be implemented by installing a program that realizes the function of the data processing device 10 described in the above embodiment on a desired information processing device (computer).
  • the computer can function as the data processing device 10 by causing the computer to execute the above program provided as package software or online software.
  • the computer referred to here includes a desktop type or notebook type personal computer, a rack-mounted server computer, and the like.
  • computers include smartphones, mobile phones, mobile communication terminals such as PHS (Personal Handyphone System), and PDA (Personal Digital Assistants).
  • the function of the data processing device 10 may be implemented in the cloud server.
  • the computer 1000 has, for example, a memory 1010, a CPU 1020, a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. Each of these parts is connected by a bus 1080.
  • the memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM 1012.
  • the ROM 1011 stores, for example, a boot program such as a BIOS (Basic Input Output System).
  • BIOS Basic Input Output System
  • the hard disk drive interface 1030 is connected to the hard disk drive 1090.
  • the disk drive interface 1040 is connected to the disk drive 1100.
  • a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100.
  • a mouse 1110 and a keyboard 1120 are connected to the serial port interface 1050.
  • a display 1130 is connected to the video adapter 1060.
  • the hard disk drive 1090 stores, for example, the OS 1091, the application program 1092, the program module 1093, and the program data 1094.
  • the various data described in the above-described embodiment are stored in, for example, the hard disk drive 1090 or the memory 1010.
  • the CPU 1020 reads the program module 1093 and the program data 1094 stored in the hard disk drive 1090 into the RAM 1012 as needed, and executes each of the above-mentioned procedures.
  • the program module 1093 and program data 1094 related to the above program are not limited to the case where they are stored in the hard disk drive 1090, for example, they are stored in a removable storage medium and read by the CPU 1020 via the disk drive 1100 or the like. It may be issued. Alternatively, the program module 1093 and the program data 1094 related to the above program are stored in another computer connected via a network such as a LAN or WAN (Wide Area Network), and read by the CPU 1020 via the network interface 1070. May be done.
  • a network such as a LAN or WAN (Wide Area Network)

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Character Input (AREA)

Abstract

Ce dispositif de traitement de données (10) reçoit l'entrée de données d'image obtenues par numérisation d'un formulaire papier, et extrait, à partir des données d'image, des données d'image à l'emplacement d'un champ d'entrée dans le formulaire papier. Le dispositif de traitement de données (10) effectue un processus de reconnaissance optique de caractères (OCR) sur les données d'image extraites à l'emplacement du champ d'entrée pour convertir les données d'image à l'emplacement du champ d'entrée en données lisibles par machine. Ensuite, le dispositif de traitement de données (10) stocke les données d'image extraites à l'emplacement du champ d'entrée et les données lisibles par machine obtenues par conversion des données d'image à l'emplacement du champ d'entrée dans la même cellule dans la feuille de calcul utilisée dans un tableur.
PCT/JP2020/024831 2020-06-24 2020-06-24 Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données WO2021260852A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/024831 WO2021260852A1 (fr) 2020-06-24 2020-06-24 Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/024831 WO2021260852A1 (fr) 2020-06-24 2020-06-24 Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données

Publications (1)

Publication Number Publication Date
WO2021260852A1 true WO2021260852A1 (fr) 2021-12-30

Family

ID=79282102

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/024831 WO2021260852A1 (fr) 2020-06-24 2020-06-24 Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données

Country Status (1)

Country Link
WO (1) WO2021260852A1 (fr)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001325562A (ja) * 2000-05-17 2001-11-22 Minolta Co Ltd 画像認識装置、画像形成装置、画像認識方法および画像認識プログラムを記憶したコンピュータ読取り可能な記録媒体
JP2003208569A (ja) * 2002-01-10 2003-07-25 Ricoh Co Ltd 手書き情報処理装置、手書き情報処理方法、手書き情報処理プログラム、そのプログラムが記録された記録媒体、及び電子黒板
JP2012198597A (ja) * 2011-03-18 2012-10-18 Brother Ind Ltd 制御装置及びコンピュータプログラム

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001325562A (ja) * 2000-05-17 2001-11-22 Minolta Co Ltd 画像認識装置、画像形成装置、画像認識方法および画像認識プログラムを記憶したコンピュータ読取り可能な記録媒体
JP2003208569A (ja) * 2002-01-10 2003-07-25 Ricoh Co Ltd 手書き情報処理装置、手書き情報処理方法、手書き情報処理プログラム、そのプログラムが記録された記録媒体、及び電子黒板
JP2012198597A (ja) * 2011-03-18 2012-10-18 Brother Ind Ltd 制御装置及びコンピュータプログラム

Similar Documents

Publication Publication Date Title
CN107133199B (zh) 使用压缩机制来获取字体部分
CN108256523B (zh) 基于移动终端的识别方法、装置及计算机可读存储介质
JP2021166070A (ja) 文書比較方法、装置、電子機器、コンピュータ読取可能な記憶媒体及びコンピュータプログラム
US20140344669A1 (en) Document conversion apparatus
US10643022B2 (en) PDF extraction with text-based key
CN113408251B (zh) 版式文档的处理方法、装置、电子设备及可读存储介质
US10366142B2 (en) Identifier based glyph search
JP2023541527A (ja) テキスト検出に用いる深層学習モデルトレーニング方法及びテキスト検出方法
US9886426B1 (en) Methods and apparatus for generating an efficient SVG file
CN113407745B (zh) 数据标注方法、装置、电子设备及计算机可读存储介质
CN109992759B (zh) 表格对象编辑方法、装置、电子设备及存储介质
WO2021260852A1 (fr) Dispositif de traitement de données, procédé de traitement de données et programme de traitement de données
WO2006101287A1 (fr) Systeme et procede de fourniture de donnees d'images a polices de caracteres traduites utilisant des serveurs de polices de caracteres multi-langues
US20230046344A1 (en) Systems, Methods, and Devices for a Form Converter
CN115909449A (zh) 文件处理方法、装置、电子设备、存储介质及程序产品
JP6914491B1 (ja) 情報処理装置、プログラム及び情報処理方法
JP7302175B2 (ja) 情報処理装置、及び情報処理プログラム
US11074392B2 (en) Information processing apparatus and non-transitory computer readable medium for switching between an attribute information mode and an image information mode
JP4999014B2 (ja) 拡大縮小可能フォントをサポートするテキストエディタを用いたオブジェクトの挿入
JP2020119152A (ja) 情報処理装置、及び情報処理プログラム
JP7279986B1 (ja) 情報処理方法、情報処理システム及び情報処理プログラム
JP7496715B2 (ja) 仕様情報生成装置、仕様情報生成方法およびプログラム
JP2015011420A (ja) 記事画像再構成装置
CN113268959B (zh) 文档处理方法、装置和电子设备
JP5569367B2 (ja) 画像処理装置、画像処理方法及びプログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20942333

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20942333

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP