JP7393960B2

JP7393960B2 - Information processing device, information processing method, and program

Info

Publication number: JP7393960B2
Application number: JP2020009944A
Authority: JP
Inventors: 道和美濃部
Original assignee: Japan Research Institute Ltd
Current assignee: Japan Research Institute Ltd
Priority date: 2020-01-24
Filing date: 2020-01-24
Publication date: 2023-12-07
Anticipated expiration: 2040-01-24
Also published as: JP2021117661A

Description

The present invention relates to an information processing device and the like that reads images of sheets such as bills, performs OCR on the read images, and acquires information such as the amounts written on the sheets.

At a financial institution such as a bank, for example, each store prepares a form for each bundle of sheets, such as bills and checks, received from customers. The form records the number of sheets and the total of the amounts written on them, and the bundle is brought to the office center together with the form. At the office center, the number of sheets and the total amount are recalculated for the bundle and checked against the number and total recorded on the form.

To save labor in this clerical work, the office center may use an information processing device that reads an image of each of two or more sheets with a scanner or the like, performs OCR on the read images, and acquires the information written on the sheets.

In this type of information processing device, the information acquired by OCR from a sheet image does not necessarily match the information written on the sheet, because OCR accuracy is limited, so checking by a person is required.

To reduce this effort, there has been an OCR system that performs OCR recognition on one image with two or more types of OCR engines, obtains two or more recognition results, and narrows down the sheets to be checked by a person based on whether the recognition results match (see, for example, Patent Document 1).

However, even when two or more types of OCR engines are used, if the accuracy of each engine is insufficient, the recognition results disagree often, and it is difficult to reduce the checking effort effectively. For this reason, the above OCR system corrects the read image in the first OCR engine and has both the first and second OCR engines recognize the corrected image, thereby raising the accuracy of each engine.

Patent Document 1: Japanese Patent No. 5464474

However, there is a limit to how far such correction can raise OCR accuracy.

Therefore, an information processing device is desired that can accurately acquire the information written on sheets by compensating for insufficient OCR accuracy.

The information processing device of the first invention comprises: a statistical information storage unit that stores statistical information, which is the result of statistical processing of the information written on two or more sheets; a paper image reception unit that receives paper images, which are images read from each of the two or more sheets; a first OCR unit that performs first OCR processing on each of the two or more paper images and acquires two or more pieces of first OCR information, each being information in a predetermined area; a second OCR unit that performs second OCR processing on each of the two or more paper images and acquires two or more pieces of second OCR information, each being information in a predetermined area; a determination unit that uses the statistical information, the two or more pieces of first OCR information, and the two or more pieces of second OCR information to determine the correct OCR information for each of the two or more sheets; and an output unit that outputs inspection information, which is information on the correct OCR information determined by the determination unit for each of the two or more sheets.

With this configuration, the information written on the sheets can be acquired accurately by compensating for insufficient OCR accuracy.

In the information processing device of the second invention, in addition to the first invention, the statistical information is the sum of the numerical values written on the two or more sheets.

With this configuration, insufficient OCR accuracy can be compensated for simply and effectively.

In the information processing device of the third invention, in addition to the first or second invention, the paper image reception unit receives two or more different paper images for each of the two or more sheets; the first OCR unit performs the first OCR processing, for each sheet, on each of the two or more different paper images and acquires two or more pieces of first OCR information, each being information in a predetermined area; the second OCR unit performs the second OCR processing, for each sheet, on each of the two or more different paper images and acquires two or more pieces of second OCR information, each being information in a predetermined area; and the determination unit uses the statistical information, four or more pieces of first OCR information, and four or more pieces of second OCR information to determine the correct OCR information for each of the two or more sheets.

With this configuration, insufficient OCR accuracy can be compensated for more effectively.

In the information processing device of the fourth invention, in addition to any one of the first to third inventions, an identifier appears on each of the two or more sheets, and the first OCR unit or the second OCR unit acquires the identifier from a predetermined area by OCR processing on each of the two or more paper images. The device further comprises: a special identifier storage unit that stores one or more special identifiers, which are identifiers for which predetermined special processing is performed; a judgment unit that judges whether an identifier acquired by the first OCR unit or the second OCR unit is a special identifier; and a special processing unit that performs the special processing on a sheet bearing an identifier that the judgment unit has judged to be a special identifier, the special processing being predetermined processing for raising the accuracy of the determination processing in which the determination unit determines the correct OCR information.

With this configuration, special processing is performed only on sheets on which a special identifier appears, so insufficient OCR accuracy can be compensated for while suppressing an increase in processing load.

According to the present invention, an information processing device is realized that can accurately acquire the information written on sheets by compensating for insufficient OCR accuracy.

Block diagram of the information processing system in the embodiment
Flowchart explaining the operation of the information processing device
Flowchart explaining the determination processing
Flowchart explaining the operation of the terminal device
Diagram showing an example of a group of sheets
Diagram showing an example of the form
Data structure diagram of the OCR-related information, also showing an example of the OCR-related information
Data structure diagram of the judgment information
Structure diagram of the inspection result, also showing an example of the inspection result
Diagram showing an example of the selection screen
Diagram showing another example of the OCR-related information group
Diagram showing another example of the judgment information group
Diagram showing another example of the inspection result
Diagram showing another example of the selection screen
Diagram showing a further example of the OCR-related information group
Diagram showing a further example of the judgment information group
Data structure diagram of the special identifier related information
Diagram showing yet another example of the OCR-related information group
Diagram showing an example of the confirmation screen
External view of the computer system
Diagram showing an example of the internal configuration of the computer system

An embodiment of an information system including an information processing device and the like is described below with reference to the drawings. Components given the same reference numerals in the embodiment operate in the same way, so repeated description may be omitted.

FIG. 1 is a block diagram of the information processing system in this embodiment. The information processing system includes an information processing device 1 and one or more terminal devices 2. The information processing device 1 is communicably connected to each of the one or more terminal devices 2 via, for example, a network such as a LAN or the Internet, or a wireless or wired communication line.

The information processing device 1 is, for example, a server of a financial institution such as a bank. The server is, for example, located within the financial institution, but it may be a cloud server, an ASP server, or the like; its location and type do not matter. The terminal device 2 is, for example, a PC in the financial institution, but it may be a mobile terminal; its location and type do not matter. A mobile terminal is a terminal that a user can carry, such as a tablet, smartphone, notebook PC, or mobile phone, but its type does not matter.

However, the information processing device 1 may be standalone. That is, the information processing method of the present invention can be implemented on, for example, a single PC, and does not depend on the hardware configuration.

In the financial institution of this embodiment as well, each store prepares, for a bundle of sheets such as bills received from customers, a form recording the number of sheets and the total of the amounts written on them, and the bundle is brought to the office center together with the form. At the office center, the number of sheets and the total amount are recalculated for the bundle and checked against the number and total recorded on the form.

The information system is used to save labor in this clerical work at the office center. As its basic operation, the information processing device 1 reads an image of each of two or more sheets via the terminal device 2, performs OCR on the read images, and acquires OCR information that includes information such as the amounts written on the sheets.

As its characteristic operation, a statistical processing result (for example, a total amount) for the information written on the two or more sheets is stored in a recording medium. For each of the two or more sheets, the information processing device 1 acquires first OCR information by first OCR processing and second OCR information by second OCR processing, and then determines the correct OCR information using the stored statistical information together with the acquired two or more pieces of first OCR information and two or more pieces of second OCR information. This effectively reduces the effort of having a person compare the information written on the sheets with the OCR information acquired from the paper images, even without raising OCR accuracy.
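The idea of using a stored total to choose between two OCR readings can be sketched as follows. This is only an illustrative sketch under stated assumptions: the function name `determine_amounts` and the rule "accept a combination only when exactly one combination of candidates sums to the stored total" are hypothetical and are not taken from the patent, whose actual determination processing is described separately.

```python
from itertools import product

def determine_amounts(total, first_ocr, second_ocr):
    """Return the per-sheet amounts whose sum equals the stored total, or None.

    Hypothetical rule: for each sheet the candidates are its first and
    second OCR amounts. If exactly one combination of candidates adds up
    to the stored total, that combination is taken as correct; otherwise
    the bundle is left for checking by a person (None).
    """
    # Deduplicate per sheet so that agreeing engines give a single candidate.
    candidates = [sorted({a, b}) for a, b in zip(first_ocr, second_ocr)]
    matches = [combo for combo in product(*candidates) if sum(combo) == total]
    return list(matches[0]) if len(matches) == 1 else None

# Sheet 2 is read as 2000 by one engine and 2600 by the other;
# only 1000 + 2000 matches the total recorded on the form.
print(determine_amounts(3000, [1000, 2000], [1000, 2600]))
```

In this sketch, the engines agreeing on sheet 1 narrows the search, and the stored total resolves the disagreement on sheet 2 without human checking.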

The information processing device 1 includes a storage unit 11, a reception unit 12, a processing unit 13, and an output unit 14. The storage unit 11 includes a statistical information storage unit 111 and a special identifier storage unit 112. The reception unit 12 includes a paper image reception unit 121. The processing unit 13 includes a first OCR unit 131, a second OCR unit 132, a determination unit 133, a judgment unit 134, and a special processing unit 135.

The terminal device 2 includes a terminal storage unit 21, a terminal acceptance unit 22, a terminal processing unit 23, a terminal transmission unit 24, a terminal receiving unit 25, and a terminal output unit 26.

The storage unit 11 of the information processing device 1 can store various types of information, for example, the statistical information and the special identifiers described later. Other information is explained as it arises.

The statistical information storage unit 111 stores one or more pieces of statistical information. Statistical information is the result of statistical processing of the information written on two or more sheets.

A sheet is, for example, a bill or a check, but its type does not matter. A bill is, for example, a promissory note or a bill of exchange, but its type does not matter. A sheet may also be a slip, for example a deposit slip, a withdrawal slip, or a payout slip, but its type does not matter.

The information written on a sheet is, for example, a numerical value, which is usually a monetary amount. A sheet usually carries information other than numerical values as well, such as a drawer name, a payee name, and type information. The drawer name is the name of the drawer who issued the sheet, such as a bill. The drawer is usually a company, but may be an officer of a company or the like. The payee name is the name of the payee who receives the amount written on the sheet. The payee is also usually a company, but may be an individual. Type information indicates the type of the sheet, for example "promissory note", "check", or "slip", but its format does not matter.

A statistical processing result is the result of statistical processing, that is, processing related to statistics. Statistical processing is usually processing that obtains the sum of the numerical values written on the two or more sheets. The sum is the total of two or more numerical values, for example the total of two or more amounts, but it may be a total of numerical values other than amounts.

However, the statistical processing may instead be, for example, processing that obtains a representative value of the numerical values written on the two or more sheets; any processing related to statistics may be used. A representative value is a value that represents two or more numerical values. It is, for example, the mean, but may be the maximum, the median, or the like, and its type does not matter.
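The kinds of statistical processing named above (sum, mean, maximum, median) can be illustrated with a short sketch. The sample amounts and the dictionary keys are hypothetical values chosen for illustration only.

```python
from statistics import mean, median

# Hypothetical amounts written on three sheets of one bundle.
amounts = [1000, 2000, 3000]

# The sum is the usual statistic; the others are alternative representative values.
stats = {
    "sum": sum(amounts),
    "mean": mean(amounts),
    "max": max(amounts),
    "median": median(amounts),
}
print(stats)
```

The sum is the natural choice for reconciliation because the form prepared at the store already records the total amount.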

Statistical information is usually associated with a store identifier. A store identifier is information that identifies a store. A store is, for example, the head office or one of the branches of a financial institution, but its type does not matter. The store identifier is, for example, a store number or a branch name, but may be any information that can identify the store. Statistical information is usually also associated with sheet count information, which indicates the number of sheets. The store identifier and the sheet count information are normally stored in the statistical information storage unit 111, but their location does not matter.
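One way to picture a piece of statistical information together with its store identifier and sheet count is a small record type. This is a sketch only: the class name `StatisticalRecord`, its field names, and the helper `sheet_count_matches` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class StatisticalRecord:
    """Hypothetical record: one bundle's statistics as recorded on the form."""
    store_id: str       # store identifier, e.g. a store number or branch name
    sheet_count: int    # sheet count information
    total_amount: int   # statistical information (here, the sum of amounts)

def sheet_count_matches(record: StatisticalRecord, paper_images: list) -> bool:
    """Check that the number of received paper images equals the recorded count."""
    return record.sheet_count == len(paper_images)

record = StatisticalRecord(store_id="001", sheet_count=2, total_amount=3000)
print(sheet_count_matches(record, ["paper image 1", "paper image 2"]))
```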

Statistical information is, for example, information acquired automatically, by the OCR processing described later, from a form on which a statistical processing result such as the total amount is recorded, but it may be information that a user entered based on the form.

An identifier appears on the surface of each of the two or more sheets. The identifier here is information used to judge whether the sheet is subject to the special processing described later. The identifier is usually information that identifies the drawer, for example the drawer name mentioned above. However, it may be an account number, a telephone number, or any other information that can identify the drawer.

An identifier may appear on a sheet by being preprinted on it, by being added with a printer or the like, or by being handwritten.

It is preferable that an identifier such as the drawer name also be acquired automatically from each sheet by OCR processing.

The special identifier storage unit 112 stores one or more special identifiers. A special identifier is information indicating that special processing is to be performed, and is usually a drawer name. Special processing is predetermined processing performed only on sheets on which a special identifier appears. It is, for example, processing for raising the accuracy of the determination processing by the determination unit 133. The details of the special processing and the determination processing are described later.
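The judgment of whether an acquired identifier is a special identifier can be sketched as a set lookup. The stored name, the function name `is_special`, and the use of exact string matching after trimming whitespace are all assumptions made for illustration; the patent does not specify the matching rule here.

```python
# Hypothetical contents of the special identifier storage unit:
# drawer names for which special processing is to be performed.
SPECIAL_IDENTIFIERS = {"AA Corporation"}

def is_special(identifier: str, special_identifiers=SPECIAL_IDENTIFIERS) -> bool:
    """Return True if the identifier acquired by OCR is a stored special identifier.

    Assumption: exact match after stripping surrounding whitespace.
    """
    return identifier.strip() in special_identifiers

for drawer in ["BB Holdings", "AA Corporation"]:
    action = "special processing" if is_special(drawer) else "normal processing"
    print(drawer, "->", action)
```

Restricting the extra work to sheets that pass this check is what keeps the processing load from growing for the whole bundle.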

The reception unit 12 receives various types of information, for example the paper images described later. The reception unit 12 can also receive various instructions, which are explained in the specific examples.

Reception by the reception unit 12 is usually the receipt of information transmitted from the terminal device 2. It may also be, for example, acceptance of information entered from an input device such as a keyboard or touch panel, acceptance of information read through an optical reading device such as a scanner or camera, or acceptance of information read from a recording medium such as a disk or semiconductor memory.

The paper image reception unit 121 receives paper images. This reception is usually the receipt of a paper image transmitted from the terminal device 2, but it may be, for example, acceptance of a paper image read through an optical reading device or acceptance of a paper image read from a recording medium.

A paper image is an image read from each of the two or more sheets. As a specific example, suppose each of the two or more sheets is a bill. The first sheet (hereinafter, paper 1) carries information such as the amount "¥1,000※" and the drawer name "BB Holdings", and the second sheet (hereinafter, paper 2) carries information such as the amount "¥2,000※" and the drawer name "AA Corporation". In this case, the paper image reception unit 121 first receives an image obtained by reading paper 1 with a scanner or the like (hereinafter, paper image 1), and then receives an image obtained by reading paper 2 (hereinafter, paper image 2).

The paper image reception unit 121 may, for example, receive two or more different paper images for each of the two or more sheets.

Different paper images are usually images obtained by reading one sheet in two or more different formats. The formats are, for example, JPEG, TIF, GIF, and BMP, but their types do not matter.

Specifically, for example, paper 1 is read in two formats, JPEG and TIF, and the paper image reception unit 121 may receive two images: the image read in JPEG format (hereinafter, paper image 1J) and the image read in TIF format (hereinafter, paper image 1T). Similarly, paper 2 is read in the two formats, and the paper image reception unit 121 may receive the image read in JPEG format (hereinafter, paper image 2J) and the image read in TIF format (hereinafter, paper image 2T).

Different paper images may also be, for example, images obtained by double reading of one sheet. Double reading means reading one sheet two or more times in the same format. Even when the same sheet is read in the same format, the information acquired from each paper image is not always the same (for example, the OCR results may fluctuate).

Specifically, for example, paper 1 is read twice in the same format (for example, JPEG), and the paper image reception unit 121 may receive two images: the image from the first reading (hereinafter, paper image 11) and the image from the second reading (hereinafter, paper image 12). Similarly, paper 2 is read twice in the same format, and the paper image reception unit 121 may receive the image from the first reading (hereinafter, paper image 21) and the image from the second reading (hereinafter, paper image 22).

When one paper image is received for each of m sheets, the number of match/mismatch combinations described later is 2^m. When n paper images are received for each of the m sheets, the number of match/mismatch combinations is (2^m) × n.
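The 2^m count for the one-image-per-sheet case can be checked by enumeration. The sketch below assumes that each sheet contributes one binary outcome, namely whether its first and second OCR information match; the function name `match_patterns` is hypothetical. It covers only the single-image case and does not model the n-image case.

```python
from itertools import product

def match_patterns(m: int):
    """Enumerate every match (True) / mismatch (False) pattern over m sheets."""
    return list(product((True, False), repeat=m))

patterns = match_patterns(2)
for p in patterns:
    print(p)
print(len(patterns))  # 2 ** 2 patterns for m = 2
```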

The processing unit 13 performs various types of processing, for example the processing of the first OCR unit 131, the second OCR unit 132, the determination unit 133, the judgment unit 134, and the special processing unit 135. The processing unit 13 also makes the various decisions described in the flowcharts. Other processing is explained as it arises.

The first OCR unit 131 performs the first OCR processing on each of the two or more paper images and acquires two or more pieces of first OCR information.

OCR processing can be described as processing that cuts out an image from a predetermined area of a paper image, compares the cut-out image with patterns prepared in advance, and so on, thereby identifying a sequence of one or more characters (hereinafter, a character string) and acquiring the sequence of one or more character codes (hereinafter, a character code string) corresponding to the identified character string.

The predetermined area is, for example, an area where a numerical value (for example, an amount) is written, but it may be an area where other information (for example, the drawer name) is written, and its type does not matter. Each area is usually associated with an item name. An item name is information indicating the type of information written in the area, for example "amount" or "drawer", but the type of information it indicates does not matter.

The predetermined area is specified, for example, by detecting the item name in the paper image, but it may be specified by position information. Position information is information on the position of the area within the paper image. It is, for example, a pair of coordinates indicating a start point and an end point, such as (x1, y1), (x2, y2), but its format does not matter.
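Cutting out an area given by position information can be sketched as follows. The representation of the image as a list of pixel rows, the function name `crop`, and the convention that the end point is exclusive are assumptions made for this illustration; the patent does not fix these details.

```python
def crop(image, start, end):
    """Cut out the rectangle from start (x1, y1) up to, but not including, end (x2, y2).

    `image` is assumed to be a list of rows, each row a list of pixel values.
    """
    (x1, y1), (x2, y2) = start, end
    return [row[x1:x2] for row in image[y1:y2]]

# A hypothetical 4x4 "paper image" whose pixel values are just their indices.
image = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [8, 9, 10, 11],
    [12, 13, 14, 15],
]
amount_area = crop(image, (1, 1), (3, 3))
print(amount_area)
```

The cut-out region would then be passed to pattern matching to identify the character string it contains.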

The first OCR processing is OCR processing by a first method, whereas the second OCR processing described later is OCR processing by a second method. A method can be regarded as a program that implements OCR, so the difference between the first and second methods is usually a difference in the program that implements OCR. However, the difference may instead be, for example, a difference in algorithm, such as a difference in the algorithm that implements image cut-out or pattern matching, a difference in the cut-out position, or a difference in the patterns prepared in advance, but its type does not matter.

第一ＯＣＲ情報とは、２以上の各用紙画像に対する第一のＯＣＲ処理により、当該用紙画像の予め決められた領域から取得される情報である。第一ＯＣＲ情報は、例えば、用紙の予め決められた領域に記載されている文字列に対応するコード列である。 The first OCR information is information obtained from a predetermined area of the paper image by first OCR processing on each of two or more paper images. The first OCR information is, for example, a code string corresponding to a character string written in a predetermined area of a sheet of paper.

なお、第一ＯＣＲ部１３１に関する上記各事項は、ＯＣＲ処理の方式の違いを除いて、後述する第二ＯＣＲ部１３２にも当てはまる。 Note that the above-mentioned matters regarding the first OCR unit 131 also apply to the second OCR unit 132, which will be described later, except for the difference in the OCR processing method.

Specifically, the first OCR unit 131, for example, performs the first OCR processing on paper image 1 and obtains first OCR information "1000 yen, Company A, ..." that includes information such as the amount and the drawer's name. Similarly, the first OCR unit 131 performs the first OCR processing on paper image 2 and obtains first OCR information "500 yen, Company B, ..." that includes information such as the amount and the drawer's name.

For each of the two or more sheets, the first OCR unit 131 may perform the first OCR processing on each of two or more different paper images received by the paper image receiving unit 121 and obtain two or more pieces of first OCR information.

Specifically, for example, for paper 1, the first OCR unit 131 performs the first OCR processing on paper image 1J read in JPEG format and obtains first OCR information "1000 yen, Company A, ...", and also performs the first OCR processing on paper image 1T read in TIF format and obtains first OCR information "1000 yen, Company A, ...".

Similarly, for paper 2, the first OCR unit 131 may perform the first OCR processing on paper image 2J read in JPEG format to obtain first OCR information "500 yen, Company B, ...", and may also perform the first OCR processing on paper image 2T read in TIF format to obtain first OCR information "500 yen, Company B, ...".

Note that two or more pieces of first OCR information obtained in this way from paper images of one sheet read in two or more different formats do not necessarily match. For example, for paper 1, first OCR information "1000 yen" may be obtained from paper image 1J while first OCR information "1001 yen" is obtained from paper image 1T, and for paper 2, first OCR information "500 yen" may be obtained from paper image 2J and also from paper image 2T.

Alternatively, for paper 1, the first OCR unit 131 may perform the first OCR processing on paper image 11 read the first time to obtain first OCR information "1000 yen, Company A, ...", and may also perform the first OCR processing on paper image 12 read the second time to obtain first OCR information "1000 yen, Company A, ...".

Similarly, for paper 2, the first OCR unit 131 may perform the first OCR processing on paper image 21 read the first time to obtain first OCR information "500 yen, Company B, ...", and may also perform the first OCR processing on paper image 22 read the second time to obtain first OCR information "500 yen, Company B, ...".

Note that two or more pieces of first OCR information obtained in this way from paper images of one sheet read two or more times also do not necessarily match. For example, for paper 1, first OCR information "1000 yen" may be obtained from paper image 11 and also from paper image 12, and for paper 2, first OCR information "505 yen" may be obtained from paper image 21 while first OCR information "500 yen" is obtained from paper image 22.

第二ＯＣＲ部１３２は、２以上の各用紙画像に対して第二のＯＣＲ処理を行い、２以上の第二ＯＣＲ情報を取得する。 The second OCR unit 132 performs second OCR processing on each of the two or more paper images and obtains two or more pieces of second OCR information.

第二のＯＣＲ処理とは、第二の方式によるＯＣＲ処理である。第二の方式は、通常、第一の方式とは異なる方式である。なお、第一の方式と第二の方式との違いについては、前述したので繰り返さない。 The second OCR processing is OCR processing using the second method. The second method is typically a different method than the first method. Note that the difference between the first method and the second method has been described above, so it will not be repeated.

第二ＯＣＲ部１３２による第二のＯＣＲ処理の対象は、通常、第一ＯＣＲ部１３１による第一のＯＣＲ処理の対象と共通である。つまり、用紙画像受付部１２１が受け付けた２以上の用紙画像の各々に対して、第一ＯＣＲ部１３１による第一のＯＣＲ処理と、第二ＯＣＲ部１３２による第二のＯＣＲ処理とが行われる。ただし、受け付けられた２以上の用紙画像の中には、第一のＯＣＲ処理または第二のＯＣＲ処理のどちらか一方しか行われない用紙画像が含まれていてもよい。 The target of the second OCR process by the second OCR unit 132 is usually the same as the target of the first OCR process by the first OCR unit 131. That is, a first OCR process by the first OCR unit 131 and a second OCR process by the second OCR unit 132 are performed on each of the two or more paper images received by the paper image receiving unit 121. However, the two or more accepted paper images may include a paper image on which only either the first OCR process or the second OCR process is performed.

第二ＯＣＲ情報とは、２以上の各用紙画像に対する第二のＯＣＲ処理により、当該用紙画像の予め決められた領域から取得される情報である。 The second OCR information is information obtained from a predetermined area of the paper image by performing second OCR processing on each of two or more paper images.

なお、第二ＯＣＲ情報に関する予め決められた領域は、通常、第一ＯＣＲ情報に関する予め決められた領域と同じ領域であり、従って、一の用紙に対する第一および第二の２つのＯＣＲ処理によって、基本的には、同じ２つのコード列が取得される。つまり、一の用紙内の同じ領域（例えば、金額等の数値が記載される領域：以下、第一領域）から取得される第一ＯＣＲ情報と第二ＯＣＲ情報は、通常、一致する。 Note that the predetermined area regarding the second OCR information is usually the same area as the predetermined area regarding the first OCR information. Basically, the same two code strings are obtained. That is, the first OCR information and the second OCR information obtained from the same area (for example, an area where a numerical value such as an amount of money is written: hereinafter referred to as the first area) within one sheet usually match.

However, even the first OCR information and the second OCR information obtained from the first area within one sheet do not necessarily match, depending on the accuracy of the OCR. For example, first OCR information "1000 yen" and second OCR information "1000 yen" may be obtained from the first area of paper image 1, while first OCR information "500 yen" and second OCR information "501 yen" are obtained from the first area of paper image 2.

The second OCR unit 132 may also, for example, perform the second OCR processing on each of two or more different paper images for each of the two or more sheets and obtain two or more pieces of second OCR information. This processing is the same as the processing in which the first OCR unit 131 performs the first OCR processing on each of two or more different paper images for each of the two or more sheets and obtains two or more pieces of first OCR information, so its description is omitted.

The determination unit 133 performs determination processing. The determination processing is processing that uses the statistical information stored in the statistical information storage unit 111, the two or more pieces of first OCR information acquired by the first OCR unit 131, and the two or more pieces of second OCR information acquired by the second OCR unit 132 to determine, for each of the two or more sheets, the correct OCR information corresponding to that sheet.

Here, the two or more pieces of first OCR information are the information that the first OCR unit 131 obtained by performing the first OCR processing, for each of the two or more sheets, on each of the two or more paper images received by the paper image receiving unit 121. Similarly, the two or more pieces of second OCR information are the information that the second OCR unit 132 obtained by performing the second OCR processing, for each of the two or more sheets, on each of the two or more paper images received by the paper image receiving unit 121.

正しいＯＣＲ情報とは、第一ＯＣＲ情報または第二ＯＣＲ情報のうち、用紙に記載されている情報と一致する情報である。正しいＯＣＲ情報は、例えば、“第一ＯＣＲ情報”、“第二ＯＣＲ情報”、または“第一ＯＣＲ情報および第二ＯＣＲ情報”のいずれかであるが、ＩＤ等でもよく、その形式は問わない。 Correct OCR information is information that matches the information written on the paper, out of the first OCR information or the second OCR information. The correct OCR information is, for example, "first OCR information", "second OCR information", or "first OCR information and second OCR information", but it may also be an ID, etc., and its format does not matter. .

決定部１３３は、統計情報と２以上の第一ＯＣＲ情報と２以上の第二ＯＣＲ情報とを用いて、２以上の各用紙ごとに、例えば、以下の手順で、正しいＯＣＲ情報を決定する。 The determining unit 133 determines correct OCR information for each of the two or more sheets using the statistical information, two or more pieces of first OCR information, and two or more pieces of second OCR information, for example, in accordance with the following procedure.

That is, the determination unit 133, for example, determines for each of M sheets (where M is an integer of 2 or more) whether the first OCR information and the second OCR information obtained from the paper image read from that sheet match. The determination unit 133 also, for example, obtains the number of sheets among the M sheets for which it determined that they do not match (hereinafter, the number of mismatched sheets m, where m is an integer of 0 or more), and further determines whether the number of mismatched sheets m is 1 or more.

Then, when it is determined that the number of mismatched sheets m is 1 or more, the determination unit 133 determines, for each of the M sheets, which of "first OCR information", "second OCR information", or "first OCR information and second OCR information" is the correct OCR information, as follows.

That is, (1) for a sheet whose first OCR information and second OCR information match, the correct OCR information is determined to be "first OCR information and second OCR information"; and (2) for the m sheets whose first OCR information and second OCR information do not match, the correct OCR information is determined to be the one corresponding to the combination, among the possible combinations of match and mismatch (that is, 2 to the m-th power combinations), for which the statistical information (for example, the total amount) obtained from that combination matches the stored statistical information (for example, the total amount).

なお、不一致用紙数ｍが０であると判別された場合は、Ｍ枚のどの用紙についても、“第一ＯＣＲ情報および第二ＯＣＲ情報”に決定される。 Note that if it is determined that the number m of mismatched sheets is 0, the "first OCR information and second OCR information" are determined for any of the M sheets.

また、統計情報と２以上の第一ＯＣＲ情報と２以上の第二ＯＣＲ情報とを用いた決定処理の詳細は、フローチャートおよび具体例で説明する。 Further, details of the determination process using statistical information, two or more first OCR information, and two or more second OCR information will be explained with a flowchart and a specific example.
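The procedure in (1) and (2) above can be sketched as follows, taking the total amount as the statistical information. This is a minimal sketch under stated assumptions, not the implementation of the determination unit 133: the function name `determine`, the label constants, and the use of integer amounts are hypothetical, and the sketch simply returns the first combination whose total matches, without handling the case where several combinations match.

```python
from itertools import product
from typing import Dict, List, Optional

# Hypothetical labels for the three possible results.
BOTH = "first OCR information and second OCR information"
FIRST = "first OCR information"
SECOND = "second OCR information"


def determine(first: List[int], second: List[int],
              stored_total: int) -> Optional[Dict[int, str]]:
    """Map each sheet index to a label, or return None if no combination
    of the mismatched sheets reproduces the stored total amount."""
    if len(first) != len(second):
        raise ValueError("one first and one second amount per sheet is required")

    mismatched = [i for i, (a, b) in enumerate(zip(first, second)) if a != b]
    # (1) Sheets whose two OCR results match are settled immediately.
    decision = {i: BOTH for i in range(len(first)) if i not in mismatched}
    if not mismatched:  # m == 0: every sheet is settled
        return decision

    matched_total = sum(first[i] for i in decision)
    # (2) Try all 2**m ways of trusting the first or the second result.
    for choice in product((FIRST, SECOND), repeat=len(mismatched)):
        total = matched_total + sum(
            first[i] if label == FIRST else second[i]
            for i, label in zip(mismatched, choice)
        )
        if total == stored_total:
            decision.update(zip(mismatched, choice))
            return decision
    return None


# Amounts from the example above: sheet 2 is read as 500 yen and 501 yen.
result = determine([1000, 500], [1000, 501], stored_total=1500)
```

With the stored total of 1500 yen, only the combination that trusts the first OCR information for the second sheet reproduces the total. A `None` result corresponds to the case where correct OCR information cannot be obtained and the mismatched sheets are left for a human check.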

または、決定部１３３は、例えば、統計情報格納部１１１に格納されている統計情報と、第一ＯＣＲ部１３１が取得した４以上の第一ＯＣＲ情報と、第二ＯＣＲ部１３２が取得した４以上の第二ＯＣＲ情報とを用いて、２以上の各用紙に対応するＯＣＲ情報であり、正しいＯＣＲ情報を決定してもよい。 Alternatively, the determining unit 133 may, for example, use the statistical information stored in the statistical information storage unit 111, four or more pieces of first OCR information acquired by the first OCR unit 131, and four or more pieces of first OCR information acquired by the second OCR unit 132. The second OCR information may be used to determine correct OCR information that corresponds to two or more sheets.

Here, the four or more pieces of first OCR information are the information that the first OCR unit 131 obtained by performing the first OCR processing, for each of the two or more sheets, on each of the two or more different paper images received by the paper image receiving unit 121. Similarly, the four or more pieces of second OCR information are the information that the second OCR unit 132 obtained by performing the second OCR processing, for each of the two or more sheets, on each of the two or more different paper images received by the paper image receiving unit 121.

Note that the determination processing using the statistical information, four or more pieces of first OCR information, and four or more pieces of second OCR information is basically the same as the determination processing using the statistical information, two or more pieces of first OCR information, and two or more pieces of second OCR information (described above), except that (1) and (2) above are changed to, for example, (1a) and (2a) below.

That is, assuming that n different paper images (where n is an integer of 2 or more) are accepted for each of the M sheets, the first OCR unit 131 performs the first OCR processing on each of the n different paper images, obtains n pieces of first OCR information, and accumulates the n pieces of first OCR information in the storage unit 11, while the second OCR unit 132 performs the second OCR processing on each of the n different paper images, obtains n pieces of second OCR information, and accumulates the n pieces of second OCR information in the storage unit 11 in association with the n pieces of first OCR information. As a result, for each of the M sheets, n pairs of first OCR information and second OCR information (hereinafter sometimes referred to as "pair information") are stored in the storage unit 11.

Then, for each of the M sheets, the correct OCR information is determined as follows: (1a) for a sheet for which at least one of the n pieces of pair information has matching first OCR information and second OCR information, it is determined to be "first OCR information and second OCR information"; and (2a) for a sheet for which no such pair information exists, it is determined to be the one corresponding to the combination, among the possible combinations of match and mismatch (that is, {(2 to the m-th power) × n} combinations), for which the statistical information (for example, the total amount) obtained from that combination matches the stored statistical information (for example, the total amount).
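The variant with n pieces of pair information per sheet can be sketched in the same spirit. The sketch below is an assumption-laden illustration: the function name `determine_amounts` is hypothetical, a sheet is settled with the amount of its first agreeing pair, and for unsettled sheets every distinct amount read from any of the n images is tried as a candidate. That enumeration is a simplification chosen for this example and does not reproduce the combination count given in the text.

```python
from itertools import product
from typing import Dict, List, Optional, Tuple

# One piece of pair information: (first OCR amount, second OCR amount).
Pair = Tuple[int, int]


def determine_amounts(sheets: List[List[Pair]],
                      stored_total: int) -> Optional[List[int]]:
    """Return one amount per sheet, or None if no candidate combination
    reproduces the stored total amount."""
    settled: Dict[int, int] = {}
    candidates: Dict[int, List[int]] = {}
    for idx, pairs in enumerate(sheets):
        agreed = [a for a, b in pairs if a == b]
        if agreed:
            # (1a) At least one image yields matching first and second results.
            settled[idx] = agreed[0]
        else:
            # (2a) No agreeing pair: keep every distinct amount as a candidate.
            candidates[idx] = sorted({v for pair in pairs for v in pair})

    if not candidates:  # every sheet is already settled
        return [settled[i] for i in range(len(sheets))]

    open_ids = list(candidates)
    base = sum(settled.values())
    for combo in product(*(candidates[i] for i in open_ids)):
        if base + sum(combo) == stored_total:
            amounts = dict(settled)
            amounts.update(zip(open_ids, combo))
            return [amounts[i] for i in range(len(sheets))]
    return None


# Two sheets, two images each (for example, JPEG and TIF).
sheets = [
    [(1000, 1001), (1000, 1000)],  # second image agrees: settled at 1000
    [(505, 500), (500, 501)],      # no image agrees: candidates 500, 501, 505
]
amounts = determine_amounts(sheets, stored_total=1500)
```

Here the first sheet is settled by its second image, and the stored total then singles out 500 yen for the second sheet.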

The judgment unit 134 judges whether an identifier acquired by the first OCR unit 131 or the second OCR unit 132 is a special identifier. The judgment unit 134 judges that the acquired identifier is a special identifier when, for example, it matches any of the one or more special identifiers stored in the special identifier storage unit 112.

なお、金額等の情報に関する一致は、通常、完全一致であるが、識別子に関する一致は、必ずしも完全一致でなくてもよい。識別子に関する一致は、例えば、識別子を構成する１または２以上の文字のうち、一致する文字の数または割合が、閾値を超える又は閾値以上であることでもよい。 Note that although a match regarding information such as an amount is usually a perfect match, a match regarding an identifier does not necessarily have to be a perfect match. A match regarding an identifier may be, for example, that the number or proportion of matching characters among one or more characters constituting the identifier exceeds a threshold value or is equal to or greater than a threshold value.
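A match that is not an exact match can be sketched as a position-by-position comparison against a threshold. The function names `match_ratio` and `is_special`, the normalization by the longer string, and the threshold value of 0.8 are all assumptions for illustration; the description only states that the number or proportion of matching characters is compared with a threshold.

```python
from typing import Iterable


def match_ratio(candidate: str, registered: str) -> float:
    """Proportion of character positions that agree, relative to the
    longer of the two strings."""
    longest = max(len(candidate), len(registered))
    if longest == 0:
        return 1.0
    same = sum(1 for a, b in zip(candidate, registered) if a == b)
    return same / longest


def is_special(identifier: str, special_identifiers: Iterable[str],
               threshold: float = 0.8) -> bool:
    """True if the identifier matches any stored special identifier at or
    above the threshold (the 'equal to or greater than' variant)."""
    return any(match_ratio(identifier, s) >= threshold
               for s in special_identifiers)


special_identifiers = ["AA Co., Ltd."]
# OCR misread the final period as a comma: 11 of 12 characters agree.
misread_hit = is_special("AA Co., Ltd,", special_identifiers)
other_drawer = is_special("BB Holdings", special_identifiers)
```

Tolerating a small number of misread characters keeps a special identifier from being missed because of the very OCR errors the special processing is meant to counter.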

The special processing unit 135 performs predetermined special processing on each of the one or more sheets on which an identifier judged by the judgment unit 134 to be a special identifier appears. As described above, the special processing is processing performed only on sheets on which a special identifier appears, and is, for example, processing for increasing the accuracy of the determination processing by the determination unit 133.

Specifically, the special processing may be, for example, processing of accepting two or more different paper images for one sheet, as described above. The different paper images are, as described above, for example, images obtained by reading one sheet in different formats, but may also be images obtained by reading one sheet twice in the same format. That is, it is preferable that the special processing unit 135 performs the special processing of accepting two or more different paper images, for example in JPEG format and TIF format, only for each of the one or more sheets on which an identifier judged by the judgment unit 134 to be a special identifier appears, and that, for the other sheets, it does not perform such special processing but performs normal acceptance processing that accepts only an image in one format such as JPEG.

Note that the special processing may also be, for example, processing of obtaining two or more different paper images by converting the format of one accepted paper image into another format (for example, conversion from JPEG format to TIF format or GIF format).

または、特別処理は、例えば、第一または第二の少なくとも一方のＯＣＲ処理の精度を高める処理でもよく、結果として、決定部１３３による決定処理の精度が高まる処理であれば、その種類は問わない。 Alternatively, the special process may be, for example, a process that increases the accuracy of at least one of the first or second OCR process, and any type of special process may be used as long as it increases the accuracy of the determination process by the determination unit 133. .

なお、ＯＣＲ処理の精度を高める処理とは、例えば、より高い解像度の用紙画像を受け付ける処理、受け付けた用紙画像を補正する処理などであるが、結果として、ＯＣＲ処理の精度が高まる処理であれば、その種類は問わない。 Processing that increases the accuracy of OCR processing includes, for example, processing that accepts paper images with higher resolution, processing that corrects accepted paper images, etc., but any processing that increases the accuracy of OCR processing as a result , the type does not matter.

Because the special processing unit 135 performs the special processing described above only on each of the one or more sheets on which an identifier judged by the judgment unit 134 to be a special identifier appears, the accuracy of the determination processing by the determination unit 133 can be increased while an increase in processing load is suppressed.

Specifically, when the special identifier is, for example, the name of a drawer (for example, "AA Co., Ltd.") who issues many sheets, such as bills, that are prone to misrecognition, the special processing is performed on sheets on which that drawer name appears, so the occurrence of misrecognition is suppressed. On the other hand, no special processing is performed on sheets on which a drawer name that is not a special identifier (for example, "BB Holdings") appears, so an increase in processing load is suppressed.
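The trade-off between accuracy and processing load can be illustrated with a small sketch that chooses the image formats to accept per sheet. The function names `formats_to_accept` and `ocr_runs`, the exact-match test on the drawer name, and the counting of one OCR run per engine per image are assumptions made only for this example.

```python
from typing import List, Set

OCR_ENGINES = 2  # the first and the second OCR processing


def formats_to_accept(drawer_name: str, special_identifiers: Set[str]) -> List[str]:
    """Accept two formats only for sheets carrying a special identifier;
    all other sheets get the normal single-format acceptance."""
    if drawer_name in special_identifiers:
        return ["JPEG", "TIF"]  # special processing
    return ["JPEG"]             # normal acceptance processing


def ocr_runs(drawer_names: List[str], special_identifiers: Set[str]) -> int:
    """Total OCR runs for a bundle: one run per engine per accepted image."""
    return sum(OCR_ENGINES * len(formats_to_accept(name, special_identifiers))
               for name in drawer_names)


special = {"AA Co., Ltd."}
bundle = ["AA Co., Ltd.", "BB Holdings", "BB Holdings"]
selective_runs = ocr_runs(bundle, special)    # special processing on one sheet
uniform_runs = OCR_ENGINES * 2 * len(bundle)  # two formats for every sheet
```

For this hypothetical bundle of three sheets, the selective approach needs 8 OCR runs, compared with 12 if two formats were accepted for every sheet.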

The output unit 14 outputs various types of information. The various types of information include, for example, inspection information described later. The output by the output unit 14 is usually display on a display, but may also be, for example, printing on a printer, sound output from a speaker, delivery of a processing result to another processing device or another program, storage on a recording medium, or transmission to an external device; its form does not matter.

The inspection information is information regarding the correct OCR information, determined by the determination unit 133, that corresponds to each of the two or more sheets. The inspection information may be, for example, a collection of sets each consisting of a paper identifier and correct OCR information (hereinafter, set information). The paper identifier is information that identifies a sheet. The paper identifier is, for example, a number such as "0001" or "0002", but may also be "first sheet" or "second sheet", or any other information that can identify the sheet.

なお、組情報は、例えば、正しいＯＣＲ情報に対応する情報も有していてもよく、その形式は問わない。対応する情報とは、例えば、正しいＯＣＲ情報が“第ｋＯＣＲ情報”（ただし、ｋは自然数）である場合は、“第ｋＯＣＲ情報”に対応する第ｋのＯＣＲ処理によって取得された、金額等の数値である。正しいＯＣＲ情報に対応する情報は、通常、用紙に記載されている情報と一致する。 Note that the set information may also include, for example, information corresponding to correct OCR information, and its format does not matter. For example, when the correct OCR information is "k-th OCR information" (k is a natural number), the corresponding information is the amount of money, etc. obtained by the k-th OCR process corresponding to "k-th OCR information". It is a numerical value. Information corresponding to correct OCR information usually matches the information written on the paper.

また、出力部１４は、例えば、前述したｍ枚の不一致用紙について、正しいＯＣＲ情報が得られなかった場合に、当該ｍ枚の不一致用紙の用紙識別子群を出力することは好適である。これによって、ユーザは、出力された用紙識別子群に対応する１以上の各用紙についてのみ、効率よくチェックが行える。 Further, it is preferable that the output unit 14 outputs the paper identifier group of the m mismatched sheets, for example, when correct OCR information cannot be obtained for the m mismatched sheets mentioned above. This allows the user to efficiently check only one or more sheets corresponding to the output sheet identifier group.

端末装置２を構成する端末格納部２１は、各種の情報を格納し得る。各種の情報とは、例えば、端末識別子である。端末識別子とは、端末装置２を識別する情報である。端末識別子は、例えば、ＭＡＣアドレスやＩＰアドレス等のアドレス、電話番号、ＩＤなどであるが、端末装置２のユーザを識別するユーザ識別子でもよく、端末装置２を識別し得る情報であれば何でもよい。ユーザ識別子は、例えば、メールアドレス、電話番号、住所・氏名、ＩＤなどであるが、端末装置２のユーザを識別し得る情報であれば何でもよい。 The terminal storage unit 21 that constitutes the terminal device 2 can store various types of information. The various types of information are, for example, terminal identifiers. The terminal identifier is information that identifies the terminal device 2. The terminal identifier is, for example, an address such as a MAC address or an IP address, a telephone number, an ID, etc., but it may also be a user identifier that identifies the user of the terminal device 2, or any information that can identify the terminal device 2. . The user identifier may be, for example, an e-mail address, telephone number, address/name, ID, etc., but any information that can identify the user of the terminal device 2 may be used.

端末受付部２２は、各種の情報を受け付ける。各種の情報とは、例えば、前述した用紙画像である。例えば、スキャナやカメラ等の光学読み込みデバイスを介して、２以上の各用紙の画像が読み込まれ、端末受付部２２は、当該読み込まれた２以上の各用紙画像を受け付ける。 The terminal reception unit 22 receives various types of information. The various types of information include, for example, the above-mentioned paper image. For example, two or more images of each sheet are read through an optical reading device such as a scanner or a camera, and the terminal reception unit 22 receives the two or more images of each of the read sheets.

なお、読み込まれた２以上の各用紙画像は、例えば、端末格納部２１に蓄積され、端末受付部２２は、端末格納部２１に格納されている２以上の用紙画像（以下、用紙画像群）を一括的に受け付けてもよい。 Note that the two or more read paper images are stored in, for example, the terminal storage unit 21, and the terminal reception unit 22 receives the two or more paper images (hereinafter referred to as a paper image group) stored in the terminal storage unit 21. may be accepted all at once.

端末処理部２３は、各種の処理を行う。各種の処理とは、例えば、フローチャートで説明する各種の判別などである。また、端末処理部２３は、例えば、受け付けられた情報の、送信される情報への変換なども行う。 The terminal processing unit 23 performs various processes. The various types of processing include, for example, various types of determination described in flowcharts. The terminal processing unit 23 also converts received information into transmitted information, for example.

端末送信部２４は、各種の情報を送信する。各種の情報とは、例えば、用紙画像などである。端末送信部２４は、例えば、端末受付部２２が受け付けた２以上の各用紙画像を、端末格納部２１に格納されている端末識別子と対に、情報処理装置１に送信する。 The terminal transmitter 24 transmits various types of information. The various types of information include, for example, paper images. The terminal transmitting unit 24 transmits, for example, each of the two or more paper images received by the terminal receiving unit 22 to the information processing device 1 in pairs with the terminal identifier stored in the terminal storage unit 21.

The terminal receiving unit 25 receives various types of information. The various types of information include, for example, the inspection results and the paper identifier group described above. The terminal receiving unit 25 receives inspection results, a paper identifier group, or the like from the information processing device 1, for example, in response to the terminal transmitting unit 24 having transmitted two or more paper images or the like to the information processing device 1.

The terminal output unit 26 outputs various types of information. The various types of information include, for example, inspection results and a paper identifier group. The output by the terminal output unit 26 is usually display on a display, but may also be, for example, storage on a recording medium; its form does not matter. The terminal output unit 26 displays on a display, for example, information such as the inspection results or the paper identifier group received by the terminal receiving unit 25.

Based on the inspection results, the user can confirm that information has been acquired accurately from each of the two or more paper images. Here, acquisition being accurate means, for example, that for all of the two or more sheets, the information such as the amount written on the sheet matches the information such as the amount obtained by OCR processing from the paper image read from that sheet.

When a paper identifier group is output, the user can efficiently check only the one or more sheets for which information is likely not to have been acquired accurately. Here, acquisition not being accurate means, for example, that for at least one of the two or more sheets, the information such as the amount written on the sheet does not match the information such as the amount obtained by OCR processing from the paper image read from that sheet.

格納部１１、統計情報格納部１１１、特別識別子格納部１１２、および端末格納部２１は、例えば、ハードディスクやフラッシュメモリといった不揮発性の記録媒体が好適であるが、ＲＡＭなど揮発性の記録媒体でも実現可能である。 The storage unit 11, the statistical information storage unit 111, the special identifier storage unit 112, and the terminal storage unit 21 are preferably nonvolatile recording media such as a hard disk or flash memory, but they can also be realized by volatile recording media such as RAM. It is possible.

格納部１１等に情報が記憶される過程は問わない。例えば、記録媒体を介して情報が格納部１１等で記憶されるようになってもよく、ネットワークや通信回線等を介して送信された情報が格納部１１等で記憶されるようになってもよく、あるいは、入力デバイスを介して入力された情報が格納部１１等で記憶されるようになってもよい。入力デバイスは、例えば、キーボード、マウス、タッチパネル、カメラ等、何でもよい。 The process by which information is stored in the storage unit 11 or the like does not matter. For example, information may be stored in the storage unit 11 or the like via a recording medium, or information transmitted via a network or communication line may be stored in the storage unit 11 or the like. Alternatively, information input via an input device may be stored in the storage unit 11 or the like. The input device may be anything, such as a keyboard, mouse, touch panel, camera, etc.

受付部１２、用紙画像受付部１２１、および端末受付部２２は、入力デバイスを含むと考えても、含まないと考えてもよい。受付部１２等は、入力デバイスのドライバーソフトによって、または入力デバイスとそのドライバーソフトとで実現され得る。 The reception unit 12, paper image reception unit 121, and terminal reception unit 22 may or may not include input devices. The reception unit 12 and the like can be realized by the driver software of the input device, or by the input device and its driver software.

処理部１３、第一ＯＣＲ部１３１、第二ＯＣＲ部１３２、決定部１３３、判断部１３４、特別処理部１３５、および端末処理部２３は、通常、ＭＰＵやメモリ等から実現され得る。処理部１３等の処理手順は、通常、ソフトウェアで実現され、当該ソフトウェアはＲＯＭ等の記録媒体に記録されている。ただし、処理手順は、ハードウェア（専用回路）で実現してもよい。 The processing section 13, the first OCR section 131, the second OCR section 132, the determination section 133, the judgment section 134, the special processing section 135, and the terminal processing section 23 can usually be realized by an MPU, a memory, or the like. The processing procedures of the processing unit 13 and the like are usually realized by software, and the software is recorded on a recording medium such as a ROM. However, the processing procedure may be realized by hardware (dedicated circuit).

出力部１４、および端末出力部２６は、ディスプレイやスピーカー等の出力デバイスを含むと考えても含まないと考えてもよい。出力部１４等は、出力デバイスのドライバーソフトによって、または出力デバイスとそのドライバーソフトとで実現され得る。 The output unit 14 and the terminal output unit 26 may or may not include output devices such as a display and a speaker. The output unit 14 and the like can be realized by the driver software of the output device, or by the output device and its driver software.

端末送信部２４、および出力部１４の送信機能は、通常、無線または有線の通信手段（例えば、ＮＩＣ(Network interface controller)やモデム等の通信モジュール）で実現されるが、放送手段（例えば、放送モジュール）で実現されてもよい。 The transmission functions of the terminal transmitting unit 24 and the output unit 14 are usually realized by wireless or wired communication means (for example, a communication module such as a NIC (network interface controller) or a modem), but may also be realized by broadcasting means (for example, a broadcast module).

端末受信部２５、および受付部１２等の受信機能は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段（例えば、放送受信モジュール）で実現されてもよい。 The receiving functions of the terminal receiving unit 25, the reception unit 12, and the like are usually realized by wireless or wired communication means, but may also be realized by means for receiving broadcasts (for example, a broadcast receiving module).

次に、情報処理システムの動作について図２～図４のフローチャートを用いて説明する。 Next, the operation of the information processing system will be explained using flowcharts shown in FIGS. 2 to 4.

図２は、情報処理装置１の動作を説明するフローチャートである。なお、このフローチャートにおいて、用紙画像受付部１２１による受け付けは、通常、端末装置２からの受信であり、出力部１４による出力は、通常、端末装置２への送信である。用紙画像受付部１２１は、端末装置２から、端末識別子と対に用紙画像群を受信するものとする。 FIG. 2 is a flowchart illustrating the operation of the information processing device 1. Note that in this flowchart, the reception by the paper image reception unit 121 is usually reception from the terminal device 2, and the output by the output unit 14 is usually transmission to the terminal device 2. It is assumed that the paper image receiving unit 121 receives a group of paper images paired with a terminal identifier from the terminal device 2 .

（ステップＳ２０１）処理部１３は、用紙画像受付部１２１が端末装置２から用紙画像群等を受信したか否かを判別する。用紙画像受付部１２１が端末装置２から用紙画像群等を受信したと判別された場合はステップＳ２０２に進み、受信していないと判別された場合はステップＳ２０１に戻る。 (Step S201) The processing unit 13 determines whether the paper image receiving unit 121 has received a group of paper images or the like from the terminal device 2. If it is determined that the paper image receiving unit 121 has received the paper image group etc. from the terminal device 2, the process advances to step S202, and if it is determined that the paper image group has not been received, the process returns to step S201.

（ステップＳ２０２）処理部１３は、ステップＳ２０１で受信された用紙画像群を構成する用紙画像の数を計数し、計数結果を変数Ｍにセットする。変数Ｍとは、読み込まれた用紙の枚数を示す変数である。 (Step S202) The processing unit 13 counts the number of paper images forming the paper image group received in step S201, and sets the counting result in variable M. The variable M is a variable indicating the number of sheets of paper that has been read.

（ステップＳ２０３）処理部１３は、変数Ｍが、統計情報格納部１１１に格納されている統計情報に対応付いた枚数情報と一致するか否かを判別する。変数Ｍが枚数情報と一致すると判別された場合はステップＳ２０５に進み、一致しないと判別された場合はステップＳ２０４に進む。 (Step S203) The processing unit 13 determines whether the variable M matches the number of sheets information associated with the statistical information stored in the statistical information storage unit 111. If it is determined that the variable M matches the sheet number information, the process advances to step S205, and if it is determined that they do not match, the process advances to step S204.

（ステップＳ２０４）出力部１４は、“枚数不一致”を、ステップＳ２０１で用紙画像群と対に受信された端末識別子で識別される端末装置２（以下、単に「端末装置２」と記す場合がある）に送信する。その後、ステップＳ２０１に戻る。 (Step S204) The output unit 14 transmits "number of sheets mismatch" to the terminal device 2 identified by the terminal identifier received in pair with the paper image group in step S201 (hereinafter sometimes simply referred to as "terminal device 2"). After that, the process returns to step S201.

（ステップＳ２０５）第一ＯＣＲ部１３１は、ステップＳ２０１で受信された用紙画像群に対する第一ＯＣＲ処理により、Ｍ個の第一ＯＣＲ情報を有する第一ＯＣＲ情報群を取得し、格納部１１に蓄積する。 (Step S205) The first OCR unit 131 acquires a first OCR information group having M pieces of first OCR information by performing the first OCR processing on the paper image group received in step S201, and accumulates it in the storage unit 11.

（ステップＳ２０６）第二ＯＣＲ部１３２は、ステップＳ２０１で受信された用紙画像群に対する第二ＯＣＲ処理により、Ｍ個の第二ＯＣＲ情報を有する第二ＯＣＲ情報群を取得し、ステップＳ２０５で取得された第一ＯＣＲ情報群に対応付けて格納部１１に蓄積する。これにより、格納部１１には、第一ＯＣＲ情報と第二ＯＣＲ情報との対が、Ｍ個、格納される。 (Step S206) The second OCR unit 132 acquires a second OCR information group having M pieces of second OCR information by performing the second OCR processing on the paper image group received in step S201, and accumulates it in the storage unit 11 in association with the first OCR information group acquired in step S205. As a result, the storage unit 11 stores M pairs of first OCR information and second OCR information.

（ステップＳ２０７）処理部１３は、変数ｉに初期値“１”をセットする。変数ｉとは、格納部１１に格納されているＭ個の対のうち、未選択の対を順番に選択していくための変数である。なお、変数ｉは、Ｍ枚の用紙のうち、未選択の用紙を順番に選択していくための変数である、といってもよい。 (Step S207) The processing unit 13 sets the initial value "1" to the variable i. The variable i is a variable for sequentially selecting unselected pairs among the M pairs stored in the storage unit 11. Note that the variable i can be said to be a variable for sequentially selecting unselected sheets among the M sheets.

（ステップＳ２０８）処理部１３は、ｉ番目の対があるか否かを判別する。ｉ番目の対があると判別された場合はステップＳ２０９に進み、ｉ番目の対がないと判別された場合はステップＳ２１３に進む。 (Step S208) The processing unit 13 determines whether there is an i-th pair. If it is determined that there is an i-th pair, the process advances to step S209, and if it is determined that there is no i-th pair, the process advances to step S213.

（ステップＳ２０９）処理部１３は、ｉ番目の対を構成する第一ＯＣＲ情報と第二ＯＣＲ情報が一致するか否かを判別する。ｉ番目の対を構成する第一ＯＣＲ情報と第二ＯＣＲ情報が一致すると判別された場合はステップＳ２１０に進み、一致しないと判別された場合はステップＳ２１１に進む。 (Step S209) The processing unit 13 determines whether the first OCR information and the second OCR information forming the i-th pair match. If it is determined that the first OCR information and the second OCR information constituting the i-th pair match, the process proceeds to step S210, and if it is determined that they do not match, the process proceeds to step S211.

（ステップＳ２１０）処理部１３は、ｉ番目の判別結果に“一致”をセットし、ステップＳ２１２に進む。 (Step S210) The processing unit 13 sets "match" to the i-th determination result, and proceeds to step S212.

（ステップＳ２１１）処理部１３は、ｉ番目の判別結果に“不一致”をセットし、ステップＳ２１２に進む。 (Step S211) The processing unit 13 sets "mismatch" to the i-th determination result, and proceeds to step S212.

（ステップＳ２１２）処理部１３は、変数ｉをインクリメントし、ステップＳ２０８に戻る。 (Step S212) The processing unit 13 increments the variable i, and returns to step S208.

（ステップＳ２１３）決定部１３３は、統計情報格納部１１１に格納されている統計情報（例えば、合計金額）と、ステップＳ２０５で取得された第一ＯＣＲ情報群と、ステップＳ２０６で取得された第二ＯＣＲ情報群とを用いて、正しいＯＣＲ情報群を決定する決定処理を実行する。なお、決定処理については、図３のフローチャートを用いて説明する。 (Step S213) The determining unit 133 selects the statistical information (for example, total amount) stored in the statistical information storage unit 111, the first OCR information group acquired in step S205, and the second OCR information group acquired in step S206. A determination process for determining the correct OCR information group is executed using the OCR information group. Note that the determination process will be explained using the flowchart of FIG. 3.

（ステップＳ２１４）処理部１３は、正しいＯＣＲ情報群が“Ｎｕｌｌ”であるか否かを判別する。なお、“Ｎｕｌｌ”とは、当該情報の不存在を示すフラグである。正しいＯＣＲ情報群が、“Ｎｕｌｌ”であると判別された場合はステップＳ２１５に進み、“Ｎｕｌｌ”でないと判別された場合はステップＳ２１６に進む。 (Step S214) The processing unit 13 determines whether the correct OCR information group is "Null". Note that "Null" is a flag indicating the absence of the information. If it is determined that the correct OCR information group is "Null", the process advances to step S215; if it is determined that the correct OCR information group is not "Null", the process advances to step S216.

（ステップＳ２１５）出力部１４は、ステップＳ２０９での判別結果が“不一致”である１または２以上の対（以下、対群）に対応する用紙識別子群を端末装置２に送信する。その後、ステップＳ２０１に戻る。 (Step S215) The output unit 14 transmits to the terminal device 2 a group of paper identifiers corresponding to the one or more pairs (hereinafter, the pair group) for which the determination result in step S209 is "mismatch". After that, the process returns to step S201.

（ステップＳ２１６）出力部１４は、正しいＯＣＲ情報群に関する検査結果を端末装置２に送信する。その後、ステップＳ２０１に戻る。 (Step S216) The output unit 14 transmits the inspection result regarding the correct OCR information group to the terminal device 2. After that, the process returns to step S201.

なお、図２のフローチャートにおいて、情報処理装置１の電源オンやプログラムの起動に応じて処理が開始し、電源オフや処理終了の割り込みにより処理は終了する。ただし、処理の開始または終了のトリガは問わない。 Note that in the flowchart of FIG. 2, the process starts when the information processing device 1 is powered on or a program is started, and ends when the power is turned off or an interrupt to end the process occurs. However, the trigger for starting or ending the process does not matter.
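
The sheet-count check and pairwise comparison in steps S202 to S212 can be sketched roughly as follows. This is only an illustrative Python sketch under stated assumptions, not the implementation of the embodiment: the function name `compare_ocr_results`, the two OCR callables, and the use of `None` to signal a sheet-count mismatch are all hypothetical names introduced here.

```python
from typing import Callable, List, Optional, Sequence, Tuple


def compare_ocr_results(
    images: Sequence[bytes],
    expected_count: int,
    first_ocr: Callable[[bytes], str],   # hypothetical stand-in for the first OCR unit 131
    second_ocr: Callable[[bytes], str],  # hypothetical stand-in for the second OCR unit 132
) -> Optional[List[Tuple[str, str, bool]]]:
    """Return (first, second, match) per sheet, or None on a sheet-count mismatch."""
    m_total = len(images)                          # S202: variable M
    if m_total != expected_count:                  # S203: compare with the form's sheet count
        return None                                # S204: caller reports the mismatch
    firsts = [first_ocr(img) for img in images]    # S205: first OCR information group
    seconds = [second_ocr(img) for img in images]  # S206: second OCR information group
    # S207 to S212: walk the M pairs and record match / mismatch for each
    return [(a, b, a == b) for a, b in zip(firsts, seconds)]
```

The per-pair results returned here correspond to the "match" and "mismatch" determination results that the determination process of step S213 consumes.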

図３は、ステップＳ２１３の決定処理を説明するフローチャートである。 FIG. 3 is a flowchart illustrating the determination process in step S213.

（ステップＳ３０１）決定部１３３は、ステップＳ２０９での判別結果が“不一致”である対を計数し、計数結果を変数ｍにセットする。変数ｍとは、Ｍ個の対のうち、第一ＯＣＲ情報と第二ＯＣＲ情報が不一致である対の個数を示す変数である。なお、変数ｍは、Ｍ枚の用紙のうち、第一ＯＣＲ情報と第二ＯＣＲ情報が不一致である用紙の枚数を示す変数である、といってもよい。 (Step S301) The determining unit 133 counts the pairs for which the determination result in step S209 is "unmatched", and sets the count result to the variable m. The variable m is a variable that indicates the number of pairs in which the first OCR information and the second OCR information do not match among the M pairs. Note that the variable m can be said to be a variable that indicates the number of sheets of paper for which the first OCR information and the second OCR information do not match among the M sheets.

（ステップＳ３０２）決定部１３３は、変数ｍが１以上であるか否かを判別する。変数ｍが、１以上であると判別された場合はステップＳ３０４に進み、１以上でない（つまり、０である）と判別された場合はステップＳ３０３に進む。 (Step S302) The determining unit 133 determines whether the variable m is 1 or more. If it is determined that the variable m is greater than or equal to 1, the process proceeds to step S304; if it is determined that the variable m is not greater than or equal to 1 (that is, it is 0), the process proceeds to step S303.

（ステップＳ３０３）決定部１３３は、第一ＯＣＲ情報群および第二ＯＣＲ情報群を、正しいＯＣＲ情報群に決定する。その後、上位の処理にリターンする。 (Step S303) The determining unit 133 determines the first OCR information group and the second OCR information group as correct OCR information groups. After that, the process returns to the upper level processing.

（ステップＳ３０４）決定部１３３は、ステップＳ２０９での判別結果が“不一致”であるｍ個の対に関し、２のｍ乗通りの組み合わせを作成する。 (Step S304) For the m pairs for which the determination result in step S209 is "mismatch", the determining unit 133 creates 2 to the power of m combinations.

（ステップＳ３０５）決定部１３３は、変数ｊに初期値“１”をセットする。変数ｊとは、ステップＳ３０４で作成された「２のｍ乗通りの組み合わせ」のうち、未選択の組み合わせを順番に選択していくための変数である。 (Step S305) The determining unit 133 sets the initial value "1" to the variable j. The variable j is a variable for sequentially selecting unselected combinations from among the "2 to the power of m combinations" created in step S304.

（ステップＳ３０６）決定部１３３は、ｊ番目の組み合わせがあるか否かを判別する。ｊ番目の組み合わせがあると判別された場合はステップＳ３０７に進み、ないと判別された場合はステップＳ３１２に進む。 (Step S306) The determining unit 133 determines whether there is a j-th combination. If it is determined that there is a j-th combination, the process advances to step S307; if it is determined that there is no combination, the process advances to step S312.

（ステップＳ３０７）決定部１３３は、ｊ番目の組み合わせに対応する統計情報（例えば、ｊ番目の合計金額）を取得する。 (Step S307) The determining unit 133 acquires statistical information (for example, the j-th total amount) corresponding to the j-th combination.

（ステップＳ３０８）決定部１３３は、ステップＳ３０７で取得された統計情報が、統計情報格納部１１１に格納されている統計情報（例えば、合計金額）と一致するか否かを判定する。取得された統計情報が格納されている統計情報と一致すると判定された場合はステップＳ３０９に進み、一致しないと判定された場合はステップＳ３１０に進む。 (Step S308) The determining unit 133 determines whether the statistical information acquired in step S307 matches the statistical information (for example, the total amount) stored in the statistical information storage unit 111. If it is determined that the acquired statistical information matches the stored statistical information, the process advances to step S309, and if it is determined that they do not match, the process advances to step S310.

（ステップＳ３０９）決定部１３３は、ｊ番目の判定結果に“一致”をセットし、ステップＳ３１１に進む。 (Step S309) The determining unit 133 sets "match" to the j-th determination result, and proceeds to step S311.

（ステップＳ３１０）決定部１３３は、ｊ番目の判定結果に“不一致”をセットし、ステップＳ３１１に進む。 (Step S310) The determining unit 133 sets "mismatch" to the j-th determination result, and proceeds to step S311.

（ステップＳ３１１）決定部１３３は、変数ｊをインクリメントし、ステップＳ３０６に戻る。 (Step S311) The determining unit 133 increments the variable j, and returns to step S306.

（ステップＳ３１２）決定部１３３は、ステップＳ３０８での判定結果が“一致”の組み合わせがあるか否かを判別する。ステップＳ３０８での判定結果が“一致”の組み合わせが、あると判別された場合はステップＳ３１３に進み、ないと判別された場合はステップＳ３１５に進む。 (Step S312) The determining unit 133 determines whether there is a combination for which the determination result in step S308 is a "match". If it is determined that there is a combination whose determination result in step S308 is "match", the process advances to step S313, and if it is determined that there is no combination, the process advances to step S315.

（ステップＳ３１３）決定部１３３は、ステップＳ３０８での判定結果が“一致”の組み合わせが、１つだけであるか否かを判別する。ステップＳ３０８での判定結果が“一致”の組み合わせが、１つだけであると判別された場合はステップＳ３１４に進み、１つだけでない（つまり、２つ以上である）と判別された場合はステップＳ３１５に進む。 (Step S313) The determining unit 133 determines whether there is only one combination for which the determination result in step S308 is "match". If it is determined that there is only one such combination, the process advances to step S314; if it is determined that there is not only one (that is, there are two or more), the process advances to step S315.

（ステップＳ３１４）決定部１３３は、正しいＯＣＲ情報群に、ステップＳ３０８での判定結果が“一致”である唯一の組み合わせをセットする。その後、上位の処理にリターンする。 (Step S314) The determining unit 133 sets the only combination for which the determination result in step S308 is "match" in the correct OCR information group. After that, the process returns to the upper level processing.

（ステップＳ３１５）決定部１３３は、正しいＯＣＲ情報群に“Ｎｕｌｌ”をセットする。その後、上位の処理にリターンする。 (Step S315) The determining unit 133 sets "Null" to the correct OCR information group. After that, the process returns to the upper level processing.
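
The determination process of FIG. 3 can be sketched as follows. This is a minimal illustration assuming that each piece of OCR information is an integer amount and that the statistical information is the total amount; the function name `decide_correct_amounts` and the use of `None` for the "Null" flag are hypothetical choices made here, not part of the embodiment.

```python
from itertools import product
from typing import List, Optional, Sequence, Tuple


def decide_correct_amounts(
    pairs: Sequence[Tuple[int, int]],
    expected_total: int,
) -> Optional[List[int]]:
    """pairs[i] is (first OCR amount, second OCR amount) for sheet i.

    Returns the amounts judged correct, or None (standing in for "Null").
    """
    mismatched = [i for i, (a, b) in enumerate(pairs) if a != b]  # S301: variable m
    if not mismatched:                                            # S302 / S303
        return [a for a, _ in pairs]
    hits: List[List[int]] = []
    # S304 to S311: try every one of the 2**m ways of trusting one OCR per mismatched sheet
    for choice in product((0, 1), repeat=len(mismatched)):
        amounts = [a for a, _ in pairs]
        for idx, pick in zip(mismatched, choice):
            amounts[idx] = pairs[idx][pick]
        if sum(amounts) == expected_total:                        # S308
            hits.append(amounts)
    # S312 to S315: accept only a unique matching combination
    return hits[0] if len(hits) == 1 else None
```

Because the number of combinations doubles with each mismatched sheet, a sketch like this is only practical when m stays small; when no combination or more than one combination reproduces the total, it returns `None`, mirroring step S315.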

図４は、端末装置２の動作を説明するフローチャートである。なお、このフローチャートにおいて、端末受付部２２は、端末格納部２１に格納されている用紙画像群を一括的に受け付けるものとする。 FIG. 4 is a flowchart illustrating the operation of the terminal device 2. In this flowchart, it is assumed that the terminal reception section 22 receives the paper image group stored in the terminal storage section 21 all at once.

（ステップＳ４０１）端末処理部２３は、端末受付部２２が用紙画像群を受け付けたか否かを判別する。端末受付部２２が用紙画像群を受け付けたと判別された場合はステップＳ４０２に進み、受け付けていないと判別された場合はステップＳ４０３に進む。 (Step S401) The terminal processing unit 23 determines whether the terminal reception unit 22 has accepted the paper image group. If it is determined that the terminal receiving unit 22 has accepted the paper image group, the process advances to step S402, and if it is determined that the group of paper images has not been received, the process advances to step S403.

（ステップＳ４０２）端末送信部２４は、ステップＳ４０１で受け付けられた用紙画像群を、端末格納部２１に格納されている端末識別子と対に、情報処理装置１に送信する。その後、ステップＳ４０１に戻る。 (Step S402) The terminal transmitting unit 24 transmits the paper image group accepted in step S401 to the information processing device 1 in pair with the terminal identifier stored in the terminal storage unit 21. After that, the process returns to step S401.

（ステップＳ４０３）端末処理部２３は、端末受信部２５が“枚数不一致”を受信したか否かを判別する。端末受信部２５が“枚数不一致”を受信したと判別された場合はステップＳ４０４に進み、受信していないと判別された場合はステップＳ４０５に進む。 (Step S403) The terminal processing unit 23 determines whether the terminal receiving unit 25 has received “number of sheets mismatch”. If it is determined that the terminal receiving unit 25 has received "number of sheets mismatch", the process advances to step S404, and if it is determined that the message has not been received, the process advances to step S405.

（ステップＳ４０４）端末出力部２６は、ステップＳ４０３で受信された“枚数不一致”を、ディスプレイ等の出力デバイスを介して出力する。その後、ステップＳ４０１に戻る。 (Step S404) The terminal output unit 26 outputs the "number of sheets mismatch" received in step S403 via an output device such as a display. After that, the process returns to step S401.

（ステップＳ４０５）端末処理部２３は、端末受信部２５が検査結果を受信したか否かを判別する。端末受信部２５が検査結果を受信したと判別された場合はステップＳ４０６に進み、受信していないと判別された場合はステップＳ４０７に進む。 (Step S405) The terminal processing unit 23 determines whether the terminal receiving unit 25 has received the test result. If it is determined that the terminal receiving unit 25 has received the test results, the process advances to step S406, and if it is determined that the test results have not been received, the process advances to step S407.

（ステップＳ４０６）端末出力部２６は、ステップＳ４０５で受信された検査結果を、ディスプレイ等の出力デバイスを介して出力する。その後、ステップＳ４０１に戻る。 (Step S406) The terminal output unit 26 outputs the test result received in step S405 via an output device such as a display. After that, the process returns to step S401.

（ステップＳ４０７）端末処理部２３は、端末受信部２５が用紙識別子群を受信したか否かを判別する。端末受信部２５が用紙識別子群を受信したと判別された場合はステップＳ４０８に進み、受信していないと判別された場合はステップＳ４０１に戻る。 (Step S407) The terminal processing unit 23 determines whether the terminal receiving unit 25 has received the paper identifier group. If it is determined that the terminal receiving unit 25 has received the sheet identifier group, the process advances to step S408, and if it is determined that the paper identifier group has not been received, the process returns to step S401.

（ステップＳ４０８）端末出力部２６は、ステップＳ４０７で受信された用紙識別子群を、ディスプレイ等の出力デバイスを介して出力する。その後、ステップＳ４０１に戻る。 (Step S408) The terminal output unit 26 outputs the paper identifier group received in step S407 via an output device such as a display. After that, the process returns to step S401.

なお、図４のフローチャートにおいて、端末装置２の電源オンやプログラムの起動に応じて処理が開始し、電源オフや処理終了の割り込みにより処理は終了する。ただし、処理の開始または終了のトリガは問わない。 Note that in the flowchart of FIG. 4, the process starts when the terminal device 2 is powered on or a program is started, and ends when the power is turned off or an interrupt to end the process occurs. However, the trigger for starting or ending the process does not matter.

なお、Ｍ枚の各用紙に対し、ｎ個の用紙画像を準備し、当該ｎ個の各用紙画像に対してＯＣＲ処理を行う場合、図２のフローチャートにおいて、一部のステップ（Ｓ２０２，Ｓ２０５～Ｓ２０９，Ｓ２１３）が、下記のように変更される。 Note that when n paper images are prepared for each of M sheets and OCR processing is performed on each of the n paper images, some steps (S202, S205 to S205) in the flowchart of FIG. S209, S213) are changed as follows.

用紙画像受付部１２１は、端末装置２から、端末識別子と対に、用紙画像群と変数ｎを受信する。変数ｎは、一の用紙に対して準備される用紙画像の数を示す変数である。 The paper image receiving unit 121 receives the paper image group and the variable n in pairs with the terminal identifier from the terminal device 2 . The variable n is a variable indicating the number of paper images prepared for one paper.

（ステップＳ２０２ａ）処理部１３は、ステップＳ２０１で受信された用紙画像群を構成する用紙画像の数を計数し、当該用紙画像数を変数ｎで除算した結果“用紙画像数／ｎ”を変数Ｍにセットする。 (Step S202a) The processing unit 13 counts the number of paper images constituting the paper image group received in step S201, and sets the result of dividing that number by the variable n, "number of paper images / n", in the variable M.

（ステップＳ２０５ａ）第一ＯＣＲ部１３１は、ステップＳ２０１で受信された用紙画像群に対する第一ＯＣＲ処理により、（Ｍ×ｎ）個の第一ＯＣＲ情報を有する第一ＯＣＲ情報群を取得し、格納部１１に蓄積する。 (Step S205a) The first OCR unit 131 acquires a first OCR information group having (M×n) pieces of first OCR information by performing the first OCR processing on the paper image group received in step S201, and accumulates it in the storage unit 11.

（ステップＳ２０６ａ）第二ＯＣＲ部１３２は、ステップＳ２０１で受信された用紙画像群に対する第二ＯＣＲ処理により、（Ｍ×ｎ）個の第二ＯＣＲ情報を有する第二ＯＣＲ情報群を取得し、ステップＳ２０５ａで取得された第一ＯＣＲ情報群に対応付けて格納部１１に蓄積する。これにより、格納部１１には、Ｍ枚の各用紙ごとに、第一ＯＣＲ情報と第二ＯＣＲ情報との対（対情報）がｎ個、格納される。 (Step S206a) The second OCR unit 132 acquires a second OCR information group having (M×n) pieces of second OCR information by performing the second OCR processing on the paper image group received in step S201, and accumulates it in the storage unit 11 in association with the first OCR information group acquired in step S205a. As a result, n pairs (pair information) of first OCR information and second OCR information are stored in the storage unit 11 for each of the M sheets.

（ステップＳ２０７ａ）処理部１３は、変数ｉに初期値“１”をセットする。ここでの変数ｉは、Ｍ枚の用紙のうち、未選択の用紙を順番に選択していくための変数である。 (Step S207a) The processing unit 13 sets the initial value "1" to the variable i. The variable i here is a variable for sequentially selecting unselected sheets among the M sheets.

（ステップＳ２０８ａ）処理部１３は、ｉ番目の用紙があるか否かを判別する。ｉ番目の用紙があると判別された場合はステップＳ２０９ａに進み、ｉ番目の用紙がないと判別された場合はステップＳ２１３ａに進む。 (Step S208a) The processing unit 13 determines whether or not there is an i-th sheet. If it is determined that there is an i-th sheet, the process advances to step S209a, and if it is determined that there is no i-th sheet, the process advances to step S213a.

（ステップＳ２０９ａ）処理部１３は、ｉ番目の用紙に対応するｎ個の対情報の中に、第一ＯＣＲ情報と第二ＯＣＲ情報が一致する対情報が少なくとも１つあるか否かを判別する。ｉ番目の用紙に対応するｎ個の対情報の中に、第一ＯＣＲ情報と第二ＯＣＲ情報が一致する対情報が、少なくとも１つあると判別された場合はステップＳ２１０に進み、１つもないと判別された場合はステップＳ２１１に進む。 (Step S209a) The processing unit 13 determines whether there is at least one pair of information in which the first OCR information and the second OCR information match among the n pieces of pair information corresponding to the i-th sheet. . If it is determined that there is at least one pair of information in which the first OCR information and the second OCR information match among the n pieces of pair information corresponding to the i-th sheet, the process advances to step S210, and there is no pair of information. If it is determined that this is the case, the process advances to step S211.

（ステップＳ２１３ａ）決定部１３３は、統計情報格納部１１１に格納されている統計情報（例えば、合計金額）と、ステップＳ２０５ａで取得された第一ＯＣＲ情報群と、ステップＳ２０６ａで取得された第二ＯＣＲ情報群とを用いて、正しいＯＣＲ情報群を決定する決定処理を実行する。 (Step S213a) The determining unit 133 selects the statistical information (for example, the total amount) stored in the statistical information storage unit 111, the first OCR information group acquired in step S205a, and the second OCR information group acquired in step S206a. A determination process for determining the correct OCR information group is executed using the OCR information group.
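
The per-sheet judgment of steps S202a and S207a to S209a, in which a sheet counts as a match when at least one of its n image pairs agrees, can be sketched as follows. The sketch assumes that the n pairs belonging to one sheet are stored consecutively, which the description does not specify; the function name `sheet_match_flags` is likewise hypothetical.

```python
from typing import List, Sequence, Tuple


def sheet_match_flags(pairs: Sequence[Tuple[str, str]], n: int) -> List[bool]:
    """pairs holds M*n (first OCR, second OCR) results, n consecutive entries per sheet."""
    if n <= 0 or len(pairs) % n != 0:
        raise ValueError("number of pairs must be a positive multiple of n")
    m_total = len(pairs) // n                          # S202a: M = image count / n
    flags: List[bool] = []
    for i in range(m_total):                           # S207a / S208a: visit each sheet
        group = pairs[i * n:(i + 1) * n]               # the n pairs of the i-th sheet
        flags.append(any(a == b for a, b in group))    # S209a: at least one pair agrees
    return flags
```

For instance, with n = 2 a sheet whose first image yields agreeing results is treated as a match even if its second image does not.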

また、図３のフローチャートでも、一部のステップ（Ｓ３０１，Ｓ３０４）が下記のように変更される。 Also, in the flowchart of FIG. 3, some steps (S301, S304) are changed as follows.

（ステップＳ３０１ａ）決定部１３３は、ステップＳ２０９ａでの判別結果が“不一致”である用紙の数を計数し、計数結果を変数ｍにセットする。ここでの変数ｍは、Ｍ枚の用紙のうち、第一ＯＣＲ情報と第二ＯＣＲ情報が不一致である用紙の枚数を示す変数である。 (Step S301a) The determining unit 133 counts the number of sheets for which the determination result in step S209a is "unmatched", and sets the count result to a variable m. The variable m here is a variable indicating the number of sheets for which the first OCR information and the second OCR information do not match among the M sheets.

（ステップＳ３０４ａ）決定部１３３は、ステップＳ２０９ａでの判別結果が“不一致”であるｍ枚の各用紙に関し、｛（２のｍ乗）×ｎ｝通りの組み合わせを作成する。 (Step S304a) The determining unit 133 creates {(2 to the power of m)×n} combinations for each of the m sheets for which the determination result in step S209a is “unmatched”.

さらに、図４のフローチャートでも、一部のステップ（Ｓ４０１，Ｓ４０２）が下記のように変更される。 Furthermore, in the flowchart of FIG. 4 as well, some steps (S401, S402) are changed as follows.

（ステップＳ４０１ａ）端末処理部２３は、端末受付部２２が用紙画像群および変数ｎを受け付けたか否かを判別する。端末受付部２２が用紙画像群および変数ｎを受け付けたと判別された場合はステップＳ４０２ａに進み、受け付けていないと判別された場合はステップＳ４０３に進む。 (Step S401a) The terminal processing unit 23 determines whether the terminal reception unit 22 has received the paper image group and the variable n. If it is determined that the terminal reception unit 22 has received the paper image group and the variable n, the process advances to step S402a, and if it is determined that the terminal reception unit 22 has not received the paper image group and the variable n, the process advances to step S403.

（ステップＳ４０２ａ）端末送信部２４は、ステップＳ４０１ａで受け付けられた用紙画像群および変数ｎを、端末格納部２１に格納されている端末識別子と対に、情報処理装置１に送信する。その後、ステップＳ４０１ａに戻る。 (Step S402a) The terminal transmitting unit 24 transmits the paper image group and the variable n accepted in step S401a to the information processing device 1 in pair with the terminal identifier stored in the terminal storage unit 21. After that, the process returns to step S401a.

以下、本実施の形態における情報処理システムの具体的な動作例について説明する。なお、以下の説明は、種々の変更が可能であり、本発明の範囲を何ら制限するものではない。 A specific example of the operation of the information processing system in this embodiment will be described below. Note that the following description can be modified in various ways and is not intended to limit the scope of the present invention in any way.

（具体例１） (Specific example 1)
本例における用紙は、手形である。いま、各店から事務センターへ、例えば、図５に示すような３枚の用紙が、図６に示すような帳票と共に持ち込まれたとする。 The paper in this example is a bill. Suppose now that three sheets of paper as shown in FIG. 5 are brought from a store to the office center together with a form as shown in FIG. 6.

図５は、用紙群の一例を示す図である。用紙群とは、２枚以上の用紙の集合である。この用紙群は、３枚の用紙を有する。そのうち、１枚目の用紙には、金額「￥１，０００※」、および振出人「ＢＢホールディングス」などが記載されている。また、２枚目の用紙には、金額「￥２，０００※」、および振出人「ＡＡ株式会社」などが記載されている。さらに、３枚目の用紙には、金額「￥３，０００※」、および振出人「ＡＡ株式会社」などが記載されている。 FIG. 5 is a diagram showing an example of a paper group. A sheet group is a collection of two or more sheets. This paper group has three sheets. On the first sheet of paper, the amount "¥1,000*" and the drawer "BB Holdings" are written. Furthermore, on the second sheet of paper, the amount "¥2,000*" and the drawer "AA Co., Ltd." are written. Furthermore, on the third sheet, the amount "¥3,000*" and the drawer "AA Co., Ltd." are written.

図６は、帳票の一例を示す図である。この帳票には、統計情報である合計金額「６０００円」、および枚数「３枚」などの情報が記入されている。 FIG. 6 is a diagram showing an example of a form. In this form, statistical information such as the total amount "6,000 yen" and the number of sheets "3" are written.

上記３枚の用紙の画像がスキャナで読み込まれ、また、上記帳票に記入された合計金額“６０００円”等の情報がキーボード等の入力デバイスを介して入力されると、端末装置２において、図４のステップＳ４０１およびＳ４０２の処理が実行される。すなわち、端末受付部２２が、読み込まれた３つの用紙画像を有する用紙画像群、および読み込まれた合計金額“６０００円”等の情報を受け付け、端末送信部２４は、当該受け付けられた用紙画像群および合計金額等の情報を情報処理装置１に送信する。 When the images of the above three sheets are read by a scanner and information such as the total amount "6000 yen" written on the form is input via an input device such as a keyboard, the processing of steps S401 and S402 in FIG. 4 is executed in the terminal device 2. That is, the terminal reception unit 22 accepts the paper image group having the three read paper images and information such as the total amount "6000 yen", and the terminal transmitting unit 24 transmits the accepted paper image group and the information such as the total amount to the information processing device 1.

情報処理装置１において、用紙画像受付部１２１が上記用紙画像群および合計金額等の情報を受信し、処理部１３等は、当該受信された合計金額等の情報を統計情報格納部１１１に蓄積すると共に、当該受信された用紙画像群、および統計情報格納部１１１に格納されている合計金額等の情報を用いて、図２のステップＳ２０１～Ｓ２１６の処理を実行する。これによって、例えば、図７に示すような、２以上（本例では、３つ）のＯＣＲ関連情報が、格納部１１に蓄積される。 In the information processing device 1, the paper image reception unit 121 receives the paper image group and the information such as the total amount, and the processing unit 13 and the like accumulate the received information such as the total amount in the statistical information storage unit 111 and execute the processing of steps S201 to S216 in FIG. 2 using the received paper image group and the information such as the total amount stored in the statistical information storage unit 111. As a result, two or more (in this example, three) pieces of OCR-related information, as shown in FIG. 7 for example, are accumulated in the storage unit 11.

図７は、ＯＣＲ関連情報のデータ構造図である。ＯＣＲ関連情報とは、用紙画像群に対して行われたＯＣＲの結果に関する情報である。ＯＣＲ関連情報は、第一ＯＣＲ情報、第二ＯＣＲ情報、判別結果、および識別子（本例では、振出人名）などを有する。 FIG. 7 is a data structure diagram of OCR related information. The OCR-related information is information regarding the results of OCR performed on a group of paper images. The OCR-related information includes first OCR information, second OCR information, a determination result, an identifier (in this example, the name of the issuer), and the like.

また、格納部１１に格納される２以上のＯＣＲ関連情報には、不一致情報が対応付いている。不一致情報とは、第一ＯＣＲ情報と第二ＯＣＲ情報が一致しない１または２以上の用紙に関する情報である。不一致情報は、１または２以上の用紙識別子を含む。不一致情報は、具体的には、例えば、“１枚目”や“２枚目，３枚目”等であるが、その形式は問わない。 Further, two or more pieces of OCR related information stored in the storage unit 11 are associated with mismatch information. The mismatch information is information regarding one or more sheets for which the first OCR information and the second OCR information do not match. The mismatch information includes one or more paper identifiers. Specifically, the mismatch information is, for example, "first page", "second page, third page", etc., but its format does not matter.

さらに、格納される２以上の各ＯＣＲ関連情報に、ＩＤ（例えば、“１”，“２”等）が対応付いている。なお、以下では、格納される２以上のＯＣＲ関連情報を「ＯＣＲ関連情報群」と記す場合がある。 Further, each of the two or more pieces of OCR related information stored is associated with an ID (for example, "1", "2", etc.). Note that, hereinafter, two or more pieces of OCR-related information stored may be referred to as an "OCR-related information group."

なお、図７は、ＯＣＲ関連情報群の一例も示している。この例は、判別結果が一致しない用紙が１枚のみの場合である。例えば、ＩＤ“１”が対応付いたＯＣＲ関連情報（以下、ＯＣＲ関連情報１）は、第一ＯＣＲ情報“１０００円”、第二ＯＣＲ情報“１０００円”、判別結果“一致”、および識別子“ＢＢホールディングス”などを有する。また、ＩＤ“２”が対応付いたＯＣＲ関連情報（ＯＣＲ関連情報２）は、第一ＯＣＲ情報“２０００円”、第二ＯＣＲ情報“２１００円”、判別結果“不一致”、および識別子“ＡＡ株式会社”などを有する。同様に、ＯＣＲ関連情報３は、第一ＯＣＲ情報“３０００円”、第二ＯＣＲ情報“３０００円”、判別結果“一致”、および識別子“ＡＡ株式会社”などを有する。そして、ＯＣＲ関連情報１～３に、不一致情報“２枚目”が対応付いている。 Note that FIG. 7 also shows an example of an OCR-related information group. In this example, the first and second OCR information disagree for only one sheet. For example, the OCR-related information associated with ID "1" (hereinafter, OCR-related information 1) has the first OCR information "1000 yen", the second OCR information "1000 yen", the determination result "match", and the identifier "BB Holdings". The OCR-related information associated with ID "2" (OCR-related information 2) has the first OCR information "2000 yen", the second OCR information "2100 yen", the determination result "mismatch", and the identifier "AA Co., Ltd.". Similarly, OCR-related information 3 has the first OCR information "3000 yen", the second OCR information "3000 yen", the determination result "match", and the identifier "AA Co., Ltd.". The mismatch information "second sheet" is associated with OCR-related information 1 to 3.
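
One possible in-memory representation of the OCR-related information group of FIG. 7 is sketched below. The class name `OcrRelatedInfo`, its field names, and the helper `mismatch_info` are hypothetical and chosen here for illustration; the description leaves the concrete data format open, and the amounts are held as plain integers by assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class OcrRelatedInfo:
    record_id: int    # ID associated with the OCR-related information ("1", "2", ...)
    first_ocr: int    # first OCR information (amount in yen)
    second_ocr: int   # second OCR information (amount in yen)
    identifier: str   # identifier (the drawer name in this example)

    @property
    def matches(self) -> bool:
        """Determination result: True for "match", False for "mismatch"."""
        return self.first_ocr == self.second_ocr


def mismatch_info(records: List[OcrRelatedInfo]) -> List[int]:
    """Return the IDs of the sheets whose two OCR results disagree."""
    return [r.record_id for r in records if not r.matches]


# The three records of the FIG. 7 example.
FIG7_EXAMPLE = [
    OcrRelatedInfo(1, 1000, 1000, "BB Holdings"),
    OcrRelatedInfo(2, 2000, 2100, "AA Co., Ltd."),
    OcrRelatedInfo(3, 3000, 3000, "AA Co., Ltd."),
]
```

With these records, `mismatch_info(FIG7_EXAMPLE)` yields the single ID 2, which corresponds to the mismatch information "second sheet".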

情報処理装置１において、決定部１３３が、図７のＯＣＲ関連情報群を用いて、図２のステップＳ２１３（詳しくは、図３のステップＳ３０１～Ｓ３１５）の決定処理を実行することによって、例えば、図８に示すような、１以上の判定情報が、格納部１１に蓄積される。 In the information processing device 1, the determining unit 133 executes the determination process of step S213 in FIG. 2 (specifically, steps S301 to S315 in FIG. 3) using the OCR-related information group of FIG. 7, whereby one or more pieces of determination information, as shown in FIG. 8 for example, are accumulated in the storage unit 11.

図８は、判定情報のデータ構造図である。判定情報とは、決定処理の経過および結果に関する情報である。判定情報は、不一致情報、組み合わせ、合計金額判定結果、および正しいＯＣＲ情報群を有する。組み合わせとは、不一致情報が有する１または２以上の各用紙識別子で識別される用紙群に関する、可能な全ての一致・不一致の組み合わせである。組み合わせの数は、前述したように、不一致情報が有する用紙識別子の数ｍ（つまり、図３のフローチャートにおける変数ｍ：以下、単に「ｍ」と記す場合がある）に対して、｛２のｍ乗｝通りとなる。判定結果とは、ステップＳ３０８の判定の結果である。 FIG. 8 is a data structure diagram of determination information. The determination information is information regarding the progress and result of the determination process. The determination information has mismatch information, combinations, total amounts with their determination results, and a correct OCR information group. The combinations are all possible combinations of correct and incorrect readings for the group of sheets identified by the one or more paper identifiers included in the mismatch information. As described above, for the number m of paper identifiers included in the mismatch information (that is, the variable m in the flowchart of FIG. 3; hereinafter sometimes simply written as "m"), the number of combinations is 2 to the power of m. The determination result is the result of the determination in step S308.

なお、格納される１以上の各判定情報には、ＩＤ（例えば、“１”，“２”等）が対応付いている。以下では、格納される１以上の判定情報を「判定情報群」と記し、個々の判定情報を「判定情報１」、「判定情報２」等のように記す場合がある。 Note that each of the one or more pieces of stored determination information is associated with an ID (for example, "1", "2", etc.). Hereinafter, one or more pieces of stored determination information may be referred to as a "determination information group", and individual determination information may be referred to as "determination information 1", "determination information 2", etc.

なお、図８は、判定情報群の一例も示している。この例は、｛不一致情報＝２枚目、ｍ＝１｝の場合である。判定情報群は、ただ１つの判定情報１を有する。 Note that FIG. 8 also shows an example of the determination information group. This example is a case of {mismatch information=2nd sheet, m=1}. The determination information group has only one determination information 1.

判定情報１は、不一致情報“２枚目”、および２通りの組み合わせを含む。第一の組み合わせは、第一ＯＣＲ情報が正、かつ第二ＯＣＲ情報が誤（以下、｛○，×｝のように記す場合がある）であり、第二の組み合わせは、｛×，○｝である。 Determination information 1 includes the mismatch information "second sheet" and two combinations. The first combination is that the first OCR information is correct and the second OCR information is incorrect (hereinafter sometimes written as {○, ×}), and the second combination is {×, ○}.

また、判定情報１は、第一の組み合わせ｛○，×｝に対応する合計金額“６０００円”と判定結果“一致”、および第二の組み合わせ｛×，○｝に対応する合計金額“６１００円”と判定結果“不一致”、も含む。さらに、判定情報１は、正しいＯＣＲ情報群“２枚目：第一ＯＣＲ情報”をも含む。 In addition, determination information 1 includes the total amount “6000 yen” corresponding to the first combination {○, ×} and the determination result “match”, and the total amount “6100 yen” corresponding to the second combination {×, ○} ” and the judgment result “inconsistency” are also included. Furthermore, the determination information 1 also includes a correct OCR information group "second sheet: first OCR information".

なお、第一の組み合わせ｛○，×｝に対応する合計金額“６０００円”は、図７のＯＣＲ情報群を用いて、例えば、決定部１３３によって、次のように取得される。すなわち、決定部１３３は、ＯＣＲ情報１が有する第一または第二のＯＣＲ情報“１０００円”と、ＯＣＲ情報２が有する第一ＯＣＲ情報“２０００円”と、ＯＣＲ情報３が有する第一または第二のＯＣＲ情報“３０００円”とを加算し、加算結果“６０００円”を取得する。 Note that the total amount "6000 yen" corresponding to the first combination {○, ×} is obtained, for example, by the determining unit 133 as follows, using the OCR information group shown in FIG. 7. That is, the determining unit 133 adds the first or second OCR information "1000 yen" of OCR information 1, the first OCR information "2000 yen" of OCR information 2, and the first or second OCR information "3000 yen" of OCR information 3, and obtains the addition result "6000 yen".

同様に、第二の組み合わせ｛×，○｝に対応する合計金額“６１００円”は、ＯＣＲ情報１が有する第一または第二のＯＣＲ情報“１０００円”と、ＯＣＲ情報２が有する第二ＯＣＲ情報“２１００円”と、ＯＣＲ情報３が有する第一または第二のＯＣＲ情報“３０００円”とを加算することにより取得される。 Similarly, the total amount "6100 yen" corresponding to the second combination {×, ○} is obtained by adding the first or second OCR information "1000 yen" of OCR information 1, the second OCR information "2100 yen" of OCR information 2, and the first or second OCR information "3000 yen" of OCR information 3.

また、第一の組み合わせ｛○，×｝に対応する判定結果“一致”は、第一の組み合わせ｛○，×｝に対応する合計金額“６０００円”と、統計情報格納部１１１の合計金額“６０００円”とが一致するか否かを判別した結果として取得される。同様に、第二の組み合わせ｛×，○｝に対応する判定結果“不一致”は、第二の組み合わせ｛×，○｝に対応する合計金額“６１００円”と、統計情報格納部１１１の合計金額“６０００円”とが一致するか否かを判別した結果として取得される。 Further, the determination result "match" corresponding to the first combination {○, ×} is obtained as the result of determining whether the total amount "6000 yen" corresponding to the first combination {○, ×} matches the total amount "6000 yen" in the statistical information storage unit 111. Similarly, the determination result "mismatch" corresponding to the second combination {×, ○} is obtained as the result of determining whether the total amount "6100 yen" corresponding to the second combination {×, ○} matches the total amount "6000 yen" in the statistical information storage unit 111.

さらに、正しいＯＣＲ情報群“２枚目：第一ＯＣＲ情報”は、不一致情報“２枚目”と、判定結果が“一致”である第一の組み合わせ｛○，×｝の“○”に対応する“第一ＯＣＲ情報”とを用いて取得される。 Furthermore, the correct OCR information group "second sheet: first OCR information" is obtained using the mismatch information "second sheet" and the "first OCR information" corresponding to the "○" of the first combination {○, ×}, whose determination result is "match".
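The derivation above (list the mismatched sheets, enumerate the 2^m correct/incorrect combinations, and compare each total with the stored total) can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function name `decide_correct_ocr`, its parameters, and the list-based data layout are assumptions introduced here.

```python
from itertools import product

def decide_correct_ocr(first, second, expected_total):
    """Hypothetical helper: find which OCR result to trust on each mismatched sheet.

    first, second: per-sheet amounts from the first and second OCR passes.
    expected_total: the stored total amount (the statistical information).
    Returns (mismatch_pages, matches); each entry of matches maps a 1-based
    sheet number to "first" or "second" for a combination whose total agrees.
    """
    # Sheets on which the two OCR passes disagree (the mismatch information).
    mismatch = [i for i, (a, b) in enumerate(zip(first, second)) if a != b]
    # Sheets on which both passes agree contribute a fixed amount.
    base = sum(a for a, b in zip(first, second) if a == b)

    matches = []
    # 2**m combinations, where m is the number of mismatched sheets.
    for picks in product(("first", "second"), repeat=len(mismatch)):
        total = base + sum(
            first[i] if pick == "first" else second[i]
            for i, pick in zip(mismatch, picks)
        )
        if total == expected_total:
            matches.append({i + 1: pick for i, pick in zip(mismatch, picks)})
    return [i + 1 for i in mismatch], matches


# The m = 1 example: sheet 2 reads 2000 yen (first OCR) and 2100 yen (second OCR).
pages, matches = decide_correct_ocr([1000, 2000, 3000], [1000, 2100, 3000], 6000)
print(pages, matches)
```

When `matches` holds exactly one entry, the correct OCR information group is determined automatically; an empty list corresponds to the case in which every combination is a mismatch and the user has to choose.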

さらに、情報処理装置１において、処理部１３および出力部１４が、図８の判定結果を用いて、図３のステップＳ２１４～Ｓ２１５の処理を実行し、そして、端末装置２において、図４のステップＳ４０５～Ｓ４０８の処理が実行されることによって、例えば、図９に示すような、１以上の検査結果が、端末装置２のディスプレイに表示される。 Further, in the information processing device 1, the processing unit 13 and the output unit 14 execute the processing of steps S214 to S215 in FIG. 3 using the determination result in FIG. 8, and the terminal device 2 executes the processing of steps S405 to S408 in FIG. 4, whereby one or more inspection results, for example as shown in FIG. 9, are displayed on the display of the terminal device 2.

図９は、出力される検査結果の構造図である。検査結果は、３つの項目「不一致」，「第一ＯＣＲ」，および「第二ＯＣＲ」を含む。「不一致」は、判定情報が有する不一致情報に対応する項目であり、「第一ＯＣＲ」および「第二ＯＣＲ」は、判定情報が有する組み合わせを構成する第一ＯＣＲ情報および第二ＯＣＲ情報に対応する項目である。 FIG. 9 is a structural diagram of the inspection result that is output. The inspection result includes three items: "mismatch", "first OCR", and "second OCR". "Mismatch" is an item corresponding to the mismatch information included in the determination information, and "first OCR" and "second OCR" are items corresponding to the first OCR information and second OCR information that constitute the combination included in the determination information.

「不一致」には、不一致情報（図８等参照）が有する１以上のページ識別子が表記される。「第一ＯＣＲ」および「第二ＯＣＲ」には、当該１以上の各ページ識別子に対応付けて、第一および第二の２つのＯＣＲ処理による２つの認識結果（例えば、｛１０００円，１０００円｝や｛２０００円，２１００円｝等）と、当該２つの認識結果に対応する２つの判定結果（｛○，×｝または｛×，○｝のいずれか）とが表記される。 The "mismatch" item shows the one or more page identifiers included in the mismatch information (see FIG. 8, etc.). The "first OCR" and "second OCR" items show, in association with each of the one or more page identifiers, the two recognition results obtained by the first and second OCR processes (for example, {1000 yen, 1000 yen} or {2000 yen, 2100 yen}) and the two determination results corresponding to those recognition results (either {○, ×} or {×, ○}).

なお、図９は、ディスプレイに表示される検査結果の一例も示している。この検査結果では、「不一致」の項目に「２枚目」が表示され、「第一ＯＣＲ」および「第二ＯＣＲ」の項目に、｛２０００円，２１００円｝と｛○，×｝とが表示されている。 Note that FIG. 9 also shows an example of the inspection result displayed on the display. In this inspection result, "2nd sheet" is displayed in the "mismatch" item, and {2000 yen, 2100 yen} and {○, ×} are displayed in the "first OCR" and "second OCR" items.

これによって、ユーザは、３枚の用紙のうち、２枚目の用紙において、第一ＯＣＲおよび第二ＯＣＲによる２つの認識結果に不一致が生じており、第一ＯＣＲ、第二ＯＣＲによる認識結果は、それぞれ２０００円、２１００円であり、前者が正しく、後者が誤りと判定されたことが分かる。 From this, the user can see that, of the three sheets, the two recognition results obtained by the first OCR and the second OCR disagree for the second sheet, that the recognition results of the first OCR and the second OCR are 2000 yen and 2100 yen respectively, and that the former was judged correct and the latter incorrect.

なお、ディスプレイには、ＯＫボタン等も表示されており、上記の判定結果は、ＯＫボタンの押下に応じて確定される。 Note that an OK button and the like are also displayed on the display, and the above determination result is confirmed in response to pressing the OK button.

なお、図８の判定情報において、２つの判定結果がいずれも“不一致”である場合は、例えば、図１０に示す選択画面が表示されてもよい。図１０は、選択画面の一例を示す図である。選択画面とは、判定結果がいずれも“不一致”である場合に、第一ＯＣＲおよび第二ＯＣＲによる２つの認識結果を提示して、ユーザによる選択を受け付けるための画面である。選択画面には、例えば、「どちらが正しいか判定できません。正しい方を選択して下さい。」等の文字列と共に、図９の検査結果において、判定結果｛○，×｝を、判定不可能を示す情報（例えば、｛？，？｝等）に変更したものが表示される。ただし、選択画面の構造や表現は問わない。 Note that if both determination results in the determination information of FIG. 8 are "mismatch", the selection screen shown in FIG. 10, for example, may be displayed. FIG. 10 is a diagram showing an example of the selection screen. The selection screen is a screen that, when both determination results are "mismatch", presents the two recognition results obtained by the first OCR and the second OCR and accepts a selection by the user. The selection screen displays, for example, a character string such as "It cannot be determined which is correct. Please select the correct one." together with the inspection result of FIG. 9 in which the determination result {○, ×} has been changed to information indicating that determination is impossible (for example, {?, ?}). However, the structure and presentation of the selection screen are not limited.

これによって、ユーザは、２枚目の用紙で第一ＯＣＲおよび第二ＯＣＲによる２つの認識結果に不一致が生じており、正誤の判定ができなかったことが分かる。そこで、ユーザは、２枚目の用紙を確認し、“￥２，０００※”が記載されていることから、認識結果“２０００円”に対応する第一ＯＣＲを選択する。これにより、判定不可能を示す｛？，？｝等の情報は、ユーザの選択による判定結果｛○，×｝に置き換えられ、２枚目の用紙の金額は“２０００円”に確定される。 From this, the user can see that the two recognition results obtained by the first OCR and the second OCR disagree for the second sheet and that which is correct could not be determined. The user therefore checks the second sheet and, since "¥2,000※" is written on it, selects the first OCR, which corresponds to the recognition result "2000 yen". As a result, the information such as {?, ?} indicating that determination is impossible is replaced with the determination result {○, ×} based on the user's selection, and the amount for the second sheet is confirmed as "2000 yen".
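As a rough sketch of how the rows of the inspection result and the selection screen relate, the snippet below builds one row per mismatched sheet and marks it {○, ×}, {×, ○}, or {?, ?} depending on whether a choice is known. The function `inspection_rows`, its arguments, and the tuple layout are hypothetical and are not taken from the specification.

```python
def inspection_rows(mismatch_pages, first, second, choice=None):
    """Hypothetical helper: build display rows for the mismatched sheets.

    choice maps a 1-based sheet number to "first" or "second". A sheet with
    no entry is shown as undecided, as on the selection screen.
    """
    rows = []
    for page in mismatch_pages:
        picked = (choice or {}).get(page)
        if picked == "first":
            marks = ("○", "×")   # first OCR judged correct
        elif picked == "second":
            marks = ("×", "○")   # second OCR judged correct
        else:
            marks = ("?", "?")   # determination impossible; ask the user
        rows.append((f"sheet {page}", first[page - 1], second[page - 1], marks))
    return rows


first = [1000, 2000, 3000]
second = [1000, 2100, 3000]
print(inspection_rows([2], first, second, {2: "first"}))  # inspection result
print(inspection_rows([2], first, second))                # selection screen
```

Once the user selects one of the two results, calling the same function again with that selection as `choice` replaces the undecided marks, which mirrors the replacement of {?, ?} described above.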

なお、不一致が生じた用紙の数が２枚（すなわち、ｍ＝２）の場合、ＯＣＲ関連情報群、判定情報群、検査結果、および選択画面は、例えば、図１１～図１４のようになる。この例では、２枚目および３枚目で不一致が生じている。 Note that when the number of sheets on which a mismatch occurred is two (that is, m = 2), the OCR-related information group, determination information group, inspection result, and selection screen are as shown in FIGS. 11 to 14, for example. In this example, mismatches occur on the second and third sheets.

図１１は、ＯＣＲ関連情報群の他の一例（ｍ＝２の場合）を示す図である。このＯＣＲ関連情報群を構成するＯＣＲ関連情報１～３のうち、ＯＣＲ関連情報１，２は、図７の例と同様であるが、ＯＣＲ関連情報３において、第一ＯＣＲ情報が“３００１円”であり、判定結果は“不一致”となっている。従って、不一致情報は、“２，３枚目”である。 FIG. 11 is a diagram showing another example of the OCR-related information group (in the case of m=2). Of the OCR related information 1 to 3 that make up this OCR related information group, OCR related information 1 and 2 are the same as the example in FIG. 7, but in OCR related information 3, the first OCR information is "3001 yen". , and the determination result is “no match”. Therefore, the mismatch information is "2nd and 3rd sheets".

図１２は、判定情報群の他の一例（ｍ＝２、不一致情報＝２，３枚目の場合）を示す図である。この判定情報群は、２枚目・３枚目について可能な４通り（つまり、２の２乗）の組み合わせに関する４つの判定情報１～４を有する。このうち、判定情報２において、合計金額が“６０００円”、判定結果が“一致”であることから、正しいＯＣＲ情報群は、“２枚目：第一ＯＣＲ情報”，および“３枚目：第二ＯＣＲ情報”となっている。 FIG. 12 is a diagram showing another example of the determination information group (m=2, mismatch information=2nd, 3rd sheet). This judgment information group has four pieces of judgment information 1 to 4 regarding four possible combinations (that is, the square of 2) of the second and third sheets. Among these, in judgment information 2, the total amount is "6000 yen" and the judgment result is "match", so the correct OCR information group is "2nd sheet: first OCR information" and "3rd sheet: "Second OCR Information".

図１３は、検査結果の他の一例（ｍ＝２、不一致情報＝２，３枚目の場合）を示す図である。この検査結果は、図９の検査結果に対し、３枚目に関する情報を追加したものである。なお、２枚目に関する情報は、図９と同様である。すなわち、図１３の検査結果では、「不一致」の項目に「３枚目」が追加され、「第一ＯＣＲ」および「第二ＯＣＲ」の項目に、「３枚目」に対応する｛３００１円，３０００円｝と｛×，○｝とが追加されている。 FIG. 13 is a diagram showing another example of the inspection result (m = 2, mismatch information = 2nd and 3rd sheets). This inspection result is obtained by adding information regarding the third sheet to the inspection result of FIG. 9. The information regarding the second sheet is the same as in FIG. 9. That is, in the inspection result of FIG. 13, "3rd sheet" is added to the "mismatch" item, and {3001 yen, 3000 yen} and {×, ○} corresponding to the "3rd sheet" are added to the "first OCR" and "second OCR" items.

これによって、ユーザは、３枚目の用紙においても、第一ＯＣＲおよび第二ＯＣＲによる２つの認識結果に不一致が生じており、３枚目に関しては、第一ＯＣＲ、第二ＯＣＲによる認識結果は、それぞれ３００１円、３０００円であり、前者が誤りであり、後者が正しいと判定されたことが分かる。なお、３枚目に関する判定結果もまた、ＯＫボタンの押下に応じて、２枚目に関する判定結果と共に確定される。 From this, the user can see that the two recognition results obtained by the first OCR and the second OCR also disagree for the third sheet, that for the third sheet the recognition results of the first OCR and the second OCR are 3001 yen and 3000 yen respectively, and that the former was judged incorrect and the latter correct. Note that the determination result for the third sheet is also confirmed, together with the determination result for the second sheet, in response to pressing the OK button.

図１４は、選択画面の他の一例（ｍ＝２、不一致情報＝２，３枚目の場合）を示す図である。この選択画面は、図１０の選択画面に対し、３枚目に関する情報を追加したものである。なお、２枚目に関する情報は、図１０と同様である。すなわち、図１４の選択画面では、「不一致」の項目に「３枚目」が追加され、「第一ＯＣＲ」および「第二ＯＣＲ」の項目に、「３枚目」に対応する｛３００１円，３０００円｝と｛？，？｝とが追加されている。 FIG. 14 is a diagram showing another example of the selection screen (m = 2, mismatch information = 2nd and 3rd sheets). This selection screen is obtained by adding information regarding the third sheet to the selection screen of FIG. 10. The information regarding the second sheet is the same as in FIG. 10. That is, on the selection screen of FIG. 14, "3rd sheet" is added to the "mismatch" item, and {3001 yen, 3000 yen} and {?, ?} corresponding to the "3rd sheet" are added to the "first OCR" and "second OCR" items.

これによって、ユーザは、３枚目の用紙でも、第一ＯＣＲおよび第二ＯＣＲによる２つの認識結果に不一致が生じており、正誤の判定ができなかったことが分かる。そこで、ユーザは、３枚目の用紙を確認し、“￥３，０００※”が記載されていることから、認識結果“３０００円”に対応する第二ＯＣＲを選択する。これにより、３枚目に対応付いた｛？，？｝等の情報は、ユーザの選択による判定結果｛×，○｝に置き換えられ、３枚目の用紙の金額は“３０００円”に確定される。 From this, the user can see that the two recognition results obtained by the first OCR and the second OCR also disagree for the third sheet and that which is correct could not be determined. The user therefore checks the third sheet and, since "¥3,000※" is written on it, selects the second OCR, which corresponds to the recognition result "3000 yen". As a result, the information such as {?, ?} associated with the third sheet is replaced with the determination result {×, ○} based on the user's selection, and the amount for the third sheet is confirmed as "3000 yen".

なお、ｍ＝３の場合、ＯＣＲ関連情報群、および判定情報群は、例えば、図１５、および図１６のようになる。この例では、１枚目から３枚目の全てで不一致が生じている。なお、検査結果、および選択画面については、図示を省略している。 Note that when m=3, the OCR-related information group and the determination information group are as shown in FIGS. 15 and 16, for example. In this example, mismatches occur in all of the first to third sheets. Note that illustration of the test results and selection screen is omitted.

図１５は、ＯＣＲ関連情報群のその他の一例（ｍ＝３の場合）を示す図である。このＯＣＲ関連情報群を構成するＯＣＲ関連情報１～３のうち、ＯＣＲ関連情報２，３は、図１１の例と同様であるが、ＯＣＲ関連情報１において、第一ＯＣＲ情報が“１０５０円”であり、判定結果は“不一致”となっている。従って、不一致情報は、“１～３枚目”である。 FIG. 15 is a diagram showing another example of the OCR-related information group (in the case where m=3). Of the OCR related information 1 to 3 that make up this OCR related information group, OCR related information 2 and 3 are the same as the example in FIG. 11, but in OCR related information 1, the first OCR information is "1050 yen". , and the determination result is “no match”. Therefore, the mismatch information is "1st to 3rd sheets".

図１６は、判定情報群のその他の一例（ｍ＝３、不一致情報＝１～３枚目の場合）を示す図である。この判定情報群は、１～３枚目について可能な８通り（つまり、２の３乗）の組み合わせに関する８つの判定情報１～８を有する。このうち、判定情報６において、合計金額が“６０００円”、判定結果が“一致”であることから、正しいＯＣＲ情報群は、“１枚目：第二ＯＣＲ情報”，“２枚目：第一ＯＣＲ情報”および“３枚目：第二ＯＣＲ情報”となっている。 FIG. 16 is a diagram showing another example of the determination information group (m = 3, mismatch information = 1st to 3rd sheets). This determination information group has eight pieces of determination information 1 to 8 for the eight possible combinations (that is, 2 to the third power) of the first to third sheets. Of these, in determination information 6, the total amount is "6000 yen" and the determination result is "match", so the correct OCR information group is "1st sheet: second OCR information", "2nd sheet: first OCR information", and "3rd sheet: second OCR information".
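The m = 3 case can be worked through numerically. The sketch below numbers the eight combinations in one enumeration order that is consistent with the IDs cited in the text (the first sheet varies slowest, and "first OCR" precedes "second OCR"). That ordering, like the function name `enumerate_determinations`, is an assumption, since FIG. 16 itself is not reproduced here.

```python
from itertools import product

# Amounts from FIG. 15 as described in the text: every sheet is mismatched.
first = {1: 1050, 2: 2000, 3: 3001}    # first OCR information per sheet
second = {1: 1000, 2: 2100, 3: 3000}   # second OCR information per sheet
expected_total = 6000                  # stored total amount

def enumerate_determinations(first, second, expected_total):
    """Hypothetical helper: list (id, picks, total, is_match) for all 2**m combinations."""
    pages = sorted(first)
    records = []
    combos = product(("first", "second"), repeat=len(pages))
    for det_id, picks in enumerate(combos, start=1):
        total = sum(
            (first if pick == "first" else second)[page]
            for page, pick in zip(pages, picks)
        )
        records.append((det_id, picks, total, total == expected_total))
    return records

for record in enumerate_determinations(first, second, expected_total):
    print(record)
```

Under this ordering the totals are 6051, 6050, 6151, 6150, 6001, 6000, 6101, and 6100 yen, so only the sixth combination (second, first, second) equals the stored 6000 yen.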

従って、この場合の検査結果は、図１３の検査結果に対して、さらに、１枚目に関する情報を追加したものとなる。すなわち、この場合の検査結果では、「不一致」の項目に「１枚目」が追加され、「第一ＯＣＲ」および「第二ＯＣＲ」の項目に、「１枚目」に対応する｛１０５０円，１０００円｝と｛×，○｝とが追加される。なお、この検査結果に対するユーザの指示、および当該指示に応じた処理部１３等の処理は、前述と同様である。 Therefore, the inspection result in this case is the one in which information regarding the first sheet is further added to the inspection result in FIG. That is, in the inspection results in this case, "1st sheet" is added to the "mismatch" item, and the "1st sheet" corresponding to "1st sheet" is added to the "1st OCR" and "2nd OCR" items. , 1000 yen} and {x, ○} are added. Note that the user's instructions regarding this test result and the processing performed by the processing unit 13 and the like in response to the instructions are the same as described above.

また、この場合の選択画面は、図１４の選択画面に対して、さらに、１枚目に関する情報を追加したものとなる。すなわち、この場合の選択画面では、「不一致」の項目に「１枚目」が追加され、「第一ＯＣＲ」および「第二ＯＣＲ」の項目に、「１枚目」に対応する｛１０５０円，１０００円｝と｛？，？｝とが追加される。なお、この選択画面に対するユーザの指示、および当該指示に応じた処理部１３等の処理は、前述と同様である。 Further, the selection screen in this case is the selection screen shown in FIG. 14 with information regarding the first sheet added. In other words, on the selection screen in this case, "1st sheet" is added to the "unmatched" item, and the "1st sheet" corresponding to "1st sheet" is added to the "1st OCR" and "2nd OCR" items. , 1000 yen} and {? ，? } is added. Note that the user's instructions on this selection screen and the processing performed by the processing unit 13 and the like in response to the instructions are the same as described above.

（具体例２） (Specific example 2)
なお、上記具体例１において、３枚の各用紙ごとに、２つの画像（例えば、ＪＰＥＧおよびＴＩＦの２種類の画像）を準備し、計６枚の用紙画像を有する用紙画像群等を用いて、決定処理を行ってもよい。具体例２の詳しい説明は省略する。 In specific example 1 above, two images (for example, two types of images, JPEG and TIF) may be prepared for each of the three sheets, and the determination process may be performed using a paper image group or the like having a total of six paper images. A detailed description of specific example 2 is omitted.

（具体例３） (Specific example 3)
または、上記具体例１において、特別識別子格納部１１２に、１以上の特別識別子（例えば、誤認識が生じ易い手形等の用紙を多く振り出す振出人の振出人名）が格納されていてもよい。第一ＯＣＲ部１３１または第二ＯＣＲ部１３２が、各用紙画像に対するＯＣＲ処理により振出人名をも取得し、判断部１３４は、３枚の各用紙画像から取得された振出人名が、特別識別子格納部１１２に格納されている振出人名と一致するか否かを判定してもよい。そして、取得された振出人名が、格納されている振出人名と一致すると判定されるほど類似している場合に、認識の精度を高めるための特別処理（例えば、用紙画像に対する補正、一の用紙につき２以上の用紙画像を準備する等）を行ってもよい。 Alternatively, in specific example 1 above, the special identifier storage unit 112 may store one or more special identifiers (for example, the name of a drawer who issues many sheets, such as bills, that are prone to misrecognition). The first OCR unit 131 or the second OCR unit 132 may also acquire the drawer's name through OCR processing of each paper image, and the determining unit 134 may determine whether the drawer's name acquired from each of the three paper images matches a drawer's name stored in the special identifier storage unit 112. Then, if the acquired drawer's name is similar enough to the stored drawer's name to be judged a match, special processing for increasing recognition accuracy (for example, correcting the paper image, or preparing two or more paper images for one sheet) may be performed.
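The specification does not say how "similar enough to be judged a match" is measured. One possible sketch, assuming a character-level similarity ratio from Python's standard `difflib` module and an arbitrarily chosen threshold of 0.8, is shown below; `is_special_identifier` and the sample names are hypothetical.

```python
from difflib import SequenceMatcher

def is_special_identifier(name, special_names, threshold=0.8):
    """Hypothetical check: is an OCR'd drawer name close enough to a stored one?

    The similarity measure and the 0.8 threshold are illustrative assumptions,
    not values given in the specification.
    """
    return any(
        SequenceMatcher(None, name, stored).ratio() >= threshold
        for stored in special_names
    )


stored_names = ["AA Corporation"]
print(is_special_identifier("AA Corporation", stored_names))  # exact match
print(is_special_identifier("AA Corporaton", stored_names))   # one character dropped by OCR
print(is_special_identifier("BB Holdings", stored_names))     # a different drawer
```

Tolerating small differences matters here because the drawer's name is itself read by OCR and may contain the same kind of recognition errors as the amount.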

（具体例４） (Specific example 4)
上記具体例３において、特別処理は、一の用紙につき２以上の用紙画像を準備する処理であり、特別識別子格納部１１２には、誤読数が閾値より多いと処理部１３によって判別された振出人の識別子が格納されてもよい。誤読数とは、一の振出人名に対応する２以上の用紙に関し、認識結果が誤りであると決定部１３３によって判定された回数、または割合である。 In specific example 3 above, the special processing may be processing that prepares two or more paper images for one sheet, and the special identifier storage unit 112 may store the identifier of a drawer whose misread count has been determined by the processing unit 13 to exceed a threshold. The misread count is the number of times, or the ratio of times, that the determining unit 133 has judged a recognition result to be incorrect for two or more sheets corresponding to one drawer's name.

割合とは、第一ＯＣＲ部１３１または第二ＯＣＲ部１３２によって認識された回数に対する、決定部１３３によって誤りであると判定された回数の割合であり、例えば、０～１の数値、または百分率（％）等で表現されるが、その形式は問わない。 The ratio is the ratio of the number of times the determining unit 133 judged a result to be incorrect to the number of times recognition was performed by the first OCR unit 131 or the second OCR unit 132. It is expressed, for example, as a value from 0 to 1 or as a percentage (%), but the format is not limited.

特別識別子格納部１１２には、例えば、図１７に示すような、１または２以上の特別識別子関連情報が格納されてもよい。図１７は、特別識別子関連情報のデータ構造図である。特別識別子関連情報とは、特別識別子に関連する情報である。特別識別子関連情報は、識別子（本例では、振出人名）、誤読数、および特別識別子フラグを有する。ここでの誤読数は、誤りであると判定された回数である。 The special identifier storage unit 112 may store one or more pieces of special identifier related information as shown in FIG. 17, for example. FIG. 17 is a data structure diagram of special identifier related information. Special identifier related information is information related to a special identifier. The special identifier related information includes an identifier (in this example, the drawer's name), the number of misreads, and a special identifier flag. The number of misreadings here is the number of times it was determined to be an error.

特別識別子フラグとは、当該識別子が特別識別子か否かを示す情報である。特別識別子フラグは、例えば、当該識別子が、特別識別子であることを示す“ＯＮ”、特別識別子でないことを示す“ＯＦＦ”の間で切り替わる。特別識別子フラグ“ＯＮ”が対応付いた識別子は、特別識別子であると見なされ、特別識別子フラグ“ＯＦＦ”が対応付いた識別子は、特別識別子でないと見なされる。 The special identifier flag is information indicating whether the identifier is a special identifier. The special identifier flag switches between, for example, "ON", indicating that the identifier is a special identifier, and "OFF", indicating that the identifier is not a special identifier. An identifier associated with the special identifier flag "ON" is regarded as a special identifier, and an identifier associated with the special identifier flag "OFF" is regarded as not being a special identifier.

ＩＤ“１”が対応付いた特別識別子関連情報１は、識別子“ＡＡ株式会社”、誤読数“３”、および特別識別子フラグ“ＯＮ”を有する。同様に、特別識別子関連情報２は、識別子“ＢＢホールディングス”、誤読数“１”、および特別識別子フラグ“ＯＦＦ”を有する。 The special identifier related information 1 associated with the ID "1" has the identifier "AA Corporation", the number of misreadings "3", and the special identifier flag "ON". Similarly, the special identifier related information 2 has the identifier "BB Holdings", the number of misreadings "1", and the special identifier flag "OFF".

詳しくは、特別識別子格納部１１２に格納されている誤読数と特別識別子フラグの初期値は、それぞれ“０”、“ＯＦＦ”である。特別識別子格納部１１２に格納されているいずれかの識別子が表出した用紙画像から、第一ＯＣＲ部１３１または第二ＯＣＲ部１３２によって金額等の情報が取得され、当該取得された情報について、決定部１３３が「誤り」と判定する度に、処理部１３は、当該識別子に対応する誤読数をインクリメントする。そして、処理部１３は、当該インクリメント後の誤読数が閾値（ここでは、“２”）を超えたか否かを判別し、閾値を超えたと判別した場合に、当該識別子に対応する特別識別子フラグを“ＯＦＦ”から“ＯＮ”に切り替える。 Specifically, the initial values of the number of misreads and the special identifier flag stored in the special identifier storage unit 112 are "0" and "OFF", respectively. The first OCR unit 131 or the second OCR unit 132 acquires information such as the amount from the paper image in which any of the identifiers stored in the special identifier storage unit 112 is displayed, and determines the acquired information. Each time the unit 133 determines an "error", the processing unit 13 increments the number of misreadings corresponding to the identifier. Then, the processing unit 13 determines whether or not the number of misreadings after the increment has exceeded a threshold (here, "2"), and when determining that it has exceeded the threshold, sets the special identifier flag corresponding to the identifier. Switch from “OFF” to “ON”.
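The counter-and-flag update described above can be sketched as a small record type. The class `SpecialIdentifierRecord`, the function `record_misread`, and the use of a boolean for the "ON"/"OFF" flag are assumptions made for illustration; the threshold of 2 follows the example in the text.

```python
from dataclasses import dataclass

@dataclass
class SpecialIdentifierRecord:
    """Hypothetical model of one piece of special identifier related information."""
    identifier: str            # here, the drawer's name
    misread_count: int = 0     # initial value "0"
    special: bool = False      # special identifier flag; False corresponds to "OFF"

def record_misread(record, threshold=2):
    """Increment the misread count and turn the flag ON once it exceeds the threshold."""
    record.misread_count += 1
    if record.misread_count > threshold:
        record.special = True
    return record


aa = SpecialIdentifierRecord("AA Corporation")
for _ in range(3):             # three results judged "incorrect"
    record_misread(aa)
bb = record_misread(SpecialIdentifierRecord("BB Holdings"))
print(aa)   # count 3 exceeds the threshold, so the flag is ON
print(bb)   # count 1 does not, so the flag stays OFF
```

With these updates the two records end in the same state as special identifier related information 1 and 2 in FIG. 17: a misread count of 3 with the flag ON, and a misread count of 1 with the flag OFF.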

いま、特別識別子格納部１１２に、図１７に示す２つの特別識別子関連情報１，２が格納されているとすると、特別識別子フラグが“ＯＮ”である“ＡＡ株式会社”が表出した用紙群についてのみ、一の用紙につき２つの用紙画像が準備され、特別識別子フラグが“ＯＦＦ”である“ＢＢホールディングス”が表出した用紙については、一の用紙画像しか準備されない。 Assuming now that the special identifier storage unit 112 stores the two pieces of special identifier related information 1 and 2 shown in FIG. 17, two paper images per sheet are prepared only for the group of sheets on which "AA Corporation", whose special identifier flag is "ON", appears; for sheets on which "BB Holdings", whose special identifier flag is "OFF", appears, only one paper image is prepared.

この場合、前述した図７のＯＣＲ情報群が有するＯＣＲ関連情報２は、例えば、図１８に示すＯＣＲ関連情報２のようになる。図１８は、ＯＣＲ関連情報群のさらにその他の一例を示す図である。図１８に示すＯＣＲ関連情報群が有するＯＣＲ関連情報２は、２番目の用紙を基に準備された２つの用紙画像（例えば、１番目がＴＩＦ画像、２番目がＪＰＥＧ画像など）に対応する２つのＯＣＲ関連情報を含んでいる。 In this case, OCR-related information 2 in the OCR information group of FIG. 7 described above becomes, for example, the OCR-related information 2 shown in FIG. 18. FIG. 18 is a diagram showing still another example of the OCR-related information group. OCR-related information 2 in the OCR-related information group shown in FIG. 18 contains two pieces of OCR-related information corresponding to two paper images prepared from the second sheet (for example, the first being a TIF image and the second a JPEG image).

上記２つのＯＣＲ関連情報のうち、１番目のＯＣＲ関連情報は、第一ＯＣＲ情報“２０００円”，第二ＯＣＲ情報“２１００円”，および判別結果“不一致”を含み、２番目のＯＣＲ関連情報は、第一ＯＣＲ情報“２０００円”，第二ＯＣＲ情報“２０００円”，および判別結果“一致”を含む。 Among the above two OCR related information, the first OCR related information includes the first OCR information "2000 yen", the second OCR information "2100 yen", and the determination result "mismatch", and the second OCR related information includes the first OCR information "2000 yen", the second OCR information "2000 yen", and the determination result "match".

従って、ＯＣＲ関連情報２において、少なくとも１つの判別結果が“一致”となったため、図１８のＯＣＲ情報群に対応付いた不一致情報は“Ｎｕｌｌ”となる。これにより、処理負荷の増加を抑制しつつ、ＯＣＲの精度を高めることができる。 Therefore, in OCR related information 2, at least one determination result is "match", so the mismatch information associated with the OCR information group in FIG. 18 is "Null". Thereby, the accuracy of OCR can be improved while suppressing an increase in processing load.
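The rule used here (a sheet counts as mismatched only if the two OCR results disagree for every image prepared from it) can be sketched as follows. The function name `mismatch_pages` and the dictionary layout are assumptions, and an empty list stands in for the "Null" mismatch information.

```python
def mismatch_pages(ocr_by_sheet):
    """Hypothetical helper: return sheets whose OCR results disagree on every image.

    ocr_by_sheet maps a 1-based sheet number to a list of
    (first_ocr_amount, second_ocr_amount) pairs, one pair per paper image.
    """
    return [
        page
        for page, pairs in sorted(ocr_by_sheet.items())
        if not any(a == b for a, b in pairs)
    ]


# FIG. 18 as described: sheet 2 has two images, and the second one agrees.
fig18 = {
    1: [(1000, 1000)],
    2: [(2000, 2100), (2000, 2000)],
    3: [(3000, 3000)],
}
print(mismatch_pages(fig18))   # empty list, i.e. mismatch information "Null"
```

Because only the sheets of flagged drawers carry a second image, the extra OCR work is limited to the cases that are most likely to benefit from it.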

なお、不一致情報は“Ｎｕｌｌ”である場合、端末装置２のディスプレイには、例えば、図１９に示すような確認画面が表示されてもよい。図１９は、確認画面の一例を示す図である。確認画面とは、用紙画像群を構成する全ての用紙画像について、第一および第二の２つのＯＣＲ処理による２つの認識結果が一致したことを確認する画面である。確認画面は、「全用紙で２つのＯＣＲが一致しました。・・・」等の文字列と、ＯＫボタンとを有する。ＯＫボタンが押下されると、用紙画像群に対する全ての認識結果が確定される。 Note that if the mismatch information is "Null", a confirmation screen as shown in FIG. 19 may be displayed on the display of the terminal device 2, for example. FIG. 19 is a diagram showing an example of a confirmation screen. The confirmation screen is a screen for confirming that the two recognition results obtained by the first and second OCR processes match for all paper images constituting the paper image group. The confirmation screen includes a character string such as "Two OCRs matched for all sheets..." and an OK button. When the OK button is pressed, all recognition results for the paper image group are confirmed.

以上、本実施の形態によれば、統計情報格納部１１１に、２以上の各用紙に記載されている情報に対する統計処理結果である統計情報が格納され、情報処理装置１は、２以上の各用紙を読み込んだ画像である用紙画像を受け付け、２以上の各用紙画像に対して、第一のＯＣＲ処理を行い、予め決められた領域の情報である２以上の第一ＯＣＲ情報を取得し、２以上の各用紙画像に対して第二のＯＣＲ処理を行い、予め決められた領域の情報である２以上の第二ＯＣＲ情報を取得し、統計情報と２以上の第一ＯＣＲ情報と２以上の第二ＯＣＲ情報とを用いて、２以上の各用紙に対応するＯＣＲ情報であり、正しいＯＣＲ情報を決定し、決定した２以上の各用紙に対応する正しいＯＣＲ情報に関する情報である検査情報を出力する。 As described above, according to the present embodiment, the statistical information storage unit 111 stores statistical information that is the result of statistical processing of the information written on each of two or more sheets. The information processing device 1 receives paper images, which are images read from each of the two or more sheets; performs first OCR processing on each of the two or more paper images to acquire two or more pieces of first OCR information, which is information in a predetermined area; performs second OCR processing on each of the two or more paper images to acquire two or more pieces of second OCR information, which is information in a predetermined area; determines correct OCR information, which is OCR information corresponding to each of the two or more sheets, using the statistical information, the two or more pieces of first OCR information, and the two or more pieces of second OCR information; and outputs inspection information, which is information regarding the determined correct OCR information corresponding to each of the two or more sheets.

こうして、２以上の各用紙ごとに、第一のＯＣＲ処理による第一ＯＣＲ情報と、第二のＯＣＲ処理による第二ＯＣＲ情報とを取得して、格納されている統計情報と、当該取得した２以上の第一ＯＣＲ情報および２以上の第二ＯＣＲ情報とを用いて、正しいＯＣＲ情報を決定することで、ＯＣＲの精度の不足が補われる結果、情報処理装置１は、用紙に表記されている情報を的確に取得できる。なお、的確に取得できることは、用紙に表記されている情報と一致する情報を、当該用紙の用紙画像からＯＣＲによって取得できることである。 In this way, for each of the two or more sheets, the first OCR information from the first OCR process and the second OCR information from the second OCR process are acquired, and the correct OCR information is determined using the stored statistical information together with the acquired two or more pieces of first OCR information and two or more pieces of second OCR information. This compensates for the insufficient accuracy of OCR, so that the information processing device 1 can accurately acquire the information written on the paper. Here, accurate acquisition means that information matching the information written on a sheet can be acquired by OCR from the paper image of that sheet.

従って、用紙に記載されている情報と、用紙画像からＯＣＲにより取得された情報との、人によるチェックの手間を効果的に軽減できる。 Therefore, it is possible to effectively reduce the effort required for a person to check the information written on the paper and the information obtained from the paper image by OCR.

また、上記構成において、統計情報は、２以上の各用紙に記載されている数値の和であることにより、情報処理装置１は、ＯＣＲの精度不足を簡単かつ効果的に補うことができる。 Furthermore, in the above configuration, since the statistical information is the sum of the numerical values written on two or more sheets, the information processing device 1 can easily and effectively compensate for the lack of precision in OCR.

また、情報処理装置１は、２以上の各用紙に対して、２以上の異なる用紙画像を受け付け、２以上の各用紙ごとに、２以上の異なる各用紙画像に対して、第一のＯＣＲ処理を行い、予め決められた領域の情報である２以上の第一ＯＣＲ情報を取得し、２以上の各用紙ごとに、２以上の異なる各用紙画像に対して、第二のＯＣＲ処理を行い、予め決められた領域の情報である２以上の第二ＯＣＲ情報を取得し、統計情報と４以上の第一ＯＣＲ情報と４以上の第二ＯＣＲ情報とを用いて、２以上の各用紙に対応するＯＣＲ情報であり、正しいＯＣＲ情報を決定する。 Further, the information processing device 1 receives two or more different paper images for each of the two or more sheets; for each of the two or more sheets, performs first OCR processing on each of the two or more different paper images to acquire two or more pieces of first OCR information, which is information in a predetermined area; for each of the two or more sheets, performs second OCR processing on each of the two or more different paper images to acquire two or more pieces of second OCR information, which is information in a predetermined area; and determines correct OCR information, which is OCR information corresponding to each of the two or more sheets, using the statistical information, the four or more pieces of first OCR information, and the four or more pieces of second OCR information.

こうして、２以上の各用紙ごとに、２以上の異なる用紙画像を取得し、２以上の各用紙画像ごとに、第一のＯＣＲ処理による第一ＯＣＲ情報と、第二のＯＣＲ処理による第二ＯＣＲ情報とを取得し、格納されている統計情報と、当該取得した４以上の第一ＯＣＲ情報および４以上の第二ＯＣＲ情報とを用いて、正しいＯＣＲ情報を決定することで、ＯＣＲの精度の不足を、より効果的に補うことができる。 In this way, two or more different paper images are acquired for each of the two or more paper images, and for each of the two or more paper images, the first OCR information is obtained by the first OCR processing, and the second OCR information is obtained by the second OCR processing. The accuracy of OCR can be improved by determining the correct OCR information using the stored statistical information and the acquired 4 or more first OCR information and 4 or more second OCR information. Deficiencies can be compensated for more effectively.

また、上記構成において、２以上の各用紙には、識別子が表出しており、情報処理装置１は、２以上の各用紙画像に対するＯＣＲ処理により、予め決められた領域から識別子を取得し、特別識別子格納部１１２には、予め決められた特別処理を行う識別子である１以上の特別識別子が格納されており、第一のＯＣＲ処理または第二のＯＣＲ処理によって取得した識別子が特別識別子であるか否かを判断し、特別識別子であると判断した識別子が表出した用紙に対して、予め決められた処理であり、第一ＯＣＲ処理または第二ＯＣＲ処理の精度を高めるための処理である特別処理を行う。 Further, in the above configuration, an identifier appears on each of the two or more sheets, and the information processing device 1 acquires the identifier from a predetermined area by OCR processing of each of the two or more paper images. The special identifier storage unit 112 stores one or more special identifiers, which are identifiers for which predetermined special processing is performed. The information processing device 1 judges whether the identifier acquired by the first OCR processing or the second OCR processing is a special identifier and, for a sheet on which an identifier judged to be a special identifier appears, performs the special processing, which is predetermined processing for increasing the accuracy of the first OCR processing or the second OCR processing.

こうして、特別識別子が表出している用紙に対してのみ特別処理を行うことで、情報処理装置１は、処理負荷の増大を抑制しつつ、ＯＣＲの精度の不足を補うことができる。 In this way, by performing special processing only on the paper on which the special identifier is exposed, the information processing apparatus 1 can compensate for the lack of OCR accuracy while suppressing an increase in processing load.

また、上記構成において、２以上の各用紙は、手形であることにより、情報処理装置１は、手形に表記されている金額等の情報を的確に取得できる。 Furthermore, in the above configuration, since each of the two or more sheets is a bill, the information processing device 1 can accurately acquire information such as the amount written on the bill.

さらに、本実施の形態における処理は、ソフトウェアで実現しても良い。そして、このソフトウェアをソフトウェアダウンロード等により配布しても良い。また、このソフトウェアをＣＤ－ＲＯＭなどの記録媒体に記録して流布しても良い。なお、このことは、本明細書における他の実施の形態においても該当する。 Furthermore, the processing in this embodiment may be realized by software. Then, this software may be distributed by software download or the like. Furthermore, this software may be recorded on a recording medium such as a CD-ROM and distributed. Note that this also applies to other embodiments in this specification.

なお、本実施の形態における情報処理装置１を実現するソフトウェアは、例えば、以下のようなプログラムである。つまり、このプログラムは、２以上の各用紙に記載されている情報に対する統計処理結果である統計情報が格納される統計情報格納部１１１にアクセス可能なコンピュータを、前記２以上の各用紙を読み込んだ画像である用紙画像を受け付ける用紙画像受付部１２１と、前記２以上の各用紙画像に対して、第一のＯＣＲ処理を行い、予め決められた領域の情報である２以上の第一ＯＣＲ情報を取得する第一ＯＣＲ部１３１と、前記２以上の各用紙画像に対して第二のＯＣＲ処理を行い、予め決められた領域の情報である２以上の第二ＯＣＲ情報を取得する第二ＯＣＲ部１３２と、前記統計情報と前記２以上の第一ＯＣＲ情報と前記２以上の第二ＯＣＲ情報とを用いて、前記２以上の各用紙に対応するＯＣＲ情報であり、正しいＯＣＲ情報を決定する決定部１３３と、前記決定部１３３が決定した２以上の各用紙に対応する正しいＯＣＲ情報に関する情報である検査情報を出力する出力部１４として機能させるためのプログラムである。 Note that the software that implements the information processing device 1 in this embodiment is, for example, the following program. That is, this program causes a computer that can access the statistical information storage unit 111, which stores statistical information that is the result of statistical processing of the information written on each of two or more sheets, to function as: a paper image receiving unit 121 that receives paper images, which are images read from each of the two or more sheets; a first OCR unit 131 that performs first OCR processing on each of the two or more paper images and acquires two or more pieces of first OCR information, which is information in a predetermined area; a second OCR unit 132 that performs second OCR processing on each of the two or more paper images and acquires two or more pieces of second OCR information, which is information in a predetermined area; a determining unit 133 that determines correct OCR information, which is OCR information corresponding to each of the two or more sheets, using the statistical information, the two or more pieces of first OCR information, and the two or more pieces of second OCR information; and an output unit 14 that outputs inspection information, which is information regarding the correct OCR information corresponding to each of the two or more sheets determined by the determining unit 133.

Although the information processing device 1 in this embodiment includes two OCR units, the first and the second (131 and 132), it may further include, for example, a third OCR unit (not shown).

The third OCR unit performs third OCR processing on each of the two or more paper images and obtains two or more pieces of third OCR information, each being information in a predetermined area. The determination unit 133 uses the statistical information stored in the statistical information storage unit 111, the two or more pieces of first OCR information obtained by the first OCR unit 131, the two or more pieces of second OCR information obtained by the second OCR unit 132, and the two or more pieces of third OCR information obtained by the third OCR unit to determine the correct OCR information corresponding to each of the two or more sheets.

This compensates more effectively for the insufficient accuracy of OCR and, as a result, improves the accuracy with which the information written on each sheet is read.

Further increasing the number of OCR units can be expected to improve the reading accuracy still more.
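Why an additional OCR unit can help may be seen in a small sketch that generalizes the determination to any number of OCR units. As before, it assumes the statistical information is the total of the amounts and that each OCR unit yields one integer per sheet. The name `consistent_combinations` and the sample readings are hypothetical.

```python
from itertools import product


def consistent_combinations(readings_per_unit, expected_total):
    """Return every per-sheet selection whose sum equals expected_total.

    readings_per_unit: one list of integer amounts per OCR unit
    (hypothetical outputs; all lists must have one entry per sheet).
    """
    if len({len(r) for r in readings_per_unit}) > 1:
        raise ValueError("every OCR unit must return one reading per sheet")

    # zip(*...) regroups the readings by sheet; a set drops duplicates.
    per_sheet = [sorted(set(sheet)) for sheet in zip(*readings_per_unit)]

    # Exhaustive search, suitable only for small bundles.
    return [list(c) for c in product(*per_sheet) if sum(c) == expected_total]


unit1 = [1000, 2500, 300]
unit2 = [1000, 2800, 300]
unit3 = [1000, 2600, 300]

# With two units, neither reading of the second sheet fits the total 3900.
print(consistent_combinations([unit1, unit2], 3900))         # -> []
# The third unit supplies the reading that makes the total consistent.
print(consistent_combinations([unit1, unit2, unit3], 3900))  # -> [[1000, 2600, 300]]
```

Each added OCR unit can only enlarge the candidate set per sheet, so a correct reading missed by the other units may become available. The price in this sketch is a larger search space and a higher chance of several combinations matching the total.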

FIG. 20 is an external view of a computer system 900 that executes the program in this embodiment to implement the information processing device 1 and the like. This embodiment can be realized by computer hardware and a computer program executed on it. In FIG. 20, the computer system 900 includes a computer 901 including a disk drive 905, a keyboard 902, a mouse 903, a display 904, and an optical reading device 917. The entire system, including the keyboard 902, the mouse 903, the display 904, and the optical reading device 917, may also be called a computer.

FIG. 21 is a diagram showing an example of the internal configuration of the computer system 900. In FIG. 21, the computer 901 includes, in addition to the disk drive 905: an MPU 911; a ROM 912 for storing programs such as a boot-up program; a RAM 913 that is connected to the MPU 911, temporarily stores instructions of application programs, and provides temporary storage space; a storage 914 that stores application programs, system programs, and data; a bus 915 that interconnects the MPU 911, the ROM 912, and the like; a network card 916 that provides a connection to a network such as an external network or an internal network; and an optical reading device 917. The storage 914 is, for example, a hard disk, an SSD, or flash memory. The optical reading device 917 is, for example, a camera or a scanner.

A program that causes the computer system 900 to execute the functions of the information processing device 1 and the like may be stored on a disk 921 such as a DVD or CD-ROM, inserted into the disk drive 905, and transferred to the storage 914. Alternatively, the program may be sent to the computer 901 via a network and stored in the storage 914. The program is loaded into the RAM 913 when it is executed. The program may also be loaded directly from the disk 921 or from the network. Further, instead of the disk 921, the program may be read into the computer system 900 via another removable recording medium (for example, a DVD or a memory card).

The program need not necessarily include an operating system (OS), a third-party program, or the like that causes the computer 901 to execute the functions of the information processing device 1 and the like. The program may include only those instructions that call the appropriate functions or modules in a controlled manner so that the desired results are obtained. How the computer system 900 operates is well known, and a detailed description is omitted.

The computer system 900 described above is a server or a desktop PC, but the terminal device 2 may be implemented as a mobile terminal such as a tablet, a smartphone, or a notebook PC. In that case, it is desirable, for example, that the keyboard 902 and the mouse 903 be replaced with a touch panel, the disk drive 905 with a memory card slot, and the disk 921 with a memory card.

However, the above is only an example, and the hardware configuration of the computer that implements the information processing device 1 and the like is not limited.

In the above program, the transmission step of transmitting information, the reception step of receiving information, and the like do not include processing performed by hardware, for example, processing performed by a modem or an interface card in the transmission step (processing that can only be performed by hardware).

The above program may be executed by a single computer or by multiple computers. That is, one computer may perform centralized processing, or multiple computers may perform distributed processing.

In the above embodiments, it goes without saying that two or more communication means present in one device (such as the terminal transmission unit 24 and the terminal reception unit 25) may be physically realized by a single medium.

In the above embodiments, each process (each function) may be realized by centralized processing in a single device (system) or by distributed processing across multiple devices.

It goes without saying that the present invention is not limited to the above embodiments, that various modifications are possible, and that these are also included within the scope of the present invention.

As described above, the information processing device according to the present invention compensates for the insufficient accuracy of OCR and can thereby accurately obtain the information written on a sheet, and it is useful as an information processing device and the like.

1 Information processing device
2 Terminal device
11 Storage unit
12 Reception unit
13 Processing unit
14 Output unit
21 Terminal storage unit
22 Terminal reception unit
23 Terminal processing unit
24 Terminal transmission unit
25 Terminal reception unit
26 Terminal output unit
111 Statistical information storage unit
112 Special identifier storage unit
121 Paper image reception unit
131 First OCR unit
132 Second OCR unit
133 Determination unit
134 Judgment unit
135 Special processing unit

Claims

a statistical information storage unit that stores statistical information that is the result of statistical processing of the information written on each of the two or more sheets;
a paper image reception unit that receives a paper image that is an image obtained by reading each of the two or more papers;
a first OCR unit that performs first OCR processing on each of the two or more paper images and obtains two or more pieces of first OCR information that is information on a predetermined area;
a second OCR unit that performs second OCR processing on each of the two or more paper images and obtains two or more pieces of second OCR information that is information on a predetermined area;
a determining unit that determines correct OCR information that is OCR information corresponding to each of the two or more sheets using the statistical information, the two or more first OCR information, and the two or more second OCR information;
an output unit that outputs inspection information that is information regarding correct OCR information corresponding to each of the two or more sheets determined by the determination unit,
The information processing device, wherein the statistical information is a sum of numerical values written on each of the two or more sheets.

a statistical information storage unit that stores statistical information that is the result of statistical processing of the information written on each of the two or more sheets;
a paper image reception unit that receives a paper image that is an image obtained by reading each of the two or more papers;
a first OCR unit that performs first OCR processing on each of the two or more paper images and obtains two or more pieces of first OCR information that is information on a predetermined area;
a second OCR unit that performs second OCR processing on each of the two or more paper images and obtains two or more pieces of second OCR information that is information on a predetermined area;
a determining unit that determines correct OCR information that is OCR information corresponding to each of the two or more sheets using the statistical information, the two or more first OCR information, and the two or more second OCR information;
an output unit that outputs inspection information that is information regarding correct OCR information corresponding to each of the two or more sheets determined by the determination unit,
wherein the paper image reception unit receives two or more different paper images for each of the two or more sheets,
the first OCR unit performs the first OCR processing on each of the two or more different paper images for each of the two or more sheets to obtain four or more pieces of first OCR information, each being information on a predetermined area,
the second OCR unit performs the second OCR processing on each of the two or more different paper images for each of the two or more sheets to obtain four or more pieces of second OCR information, each being information on a predetermined area, and
the determining unit uses the statistical information, the four or more pieces of first OCR information, and the four or more pieces of second OCR information to determine correct OCR information that is OCR information corresponding to each of the two or more sheets.

a statistical information storage unit that stores statistical information that is the result of statistical processing of the information written on each of the two or more sheets;
a paper image reception unit that receives a paper image that is an image obtained by reading each of the two or more papers;
a first OCR unit that performs first OCR processing on each of the two or more paper images and obtains two or more pieces of first OCR information that is information on a predetermined area;
a second OCR unit that performs second OCR processing on each of the two or more paper images and obtains two or more pieces of second OCR information that is information on a predetermined area;
a determining unit that determines correct OCR information that is OCR information corresponding to each of the two or more sheets using the statistical information, the two or more first OCR information, and the two or more second OCR information;
an output unit that outputs inspection information that is information regarding correct OCR information corresponding to each of the two or more sheets determined by the determination unit,
wherein an identifier is displayed on each of the two or more sheets,
the first OCR unit or the second OCR unit obtains the identifier from a predetermined area by OCR processing on each of the two or more paper images, and
the information processing device further comprises:
a special identifier storage unit that stores one or more special identifiers, each being an identifier for which predetermined special processing is to be performed;
a judgment unit that judges whether the identifier obtained by the first OCR unit or the second OCR unit is a special identifier; and
a special processing unit that performs special processing, the special processing being predetermined processing for a sheet on which an identifier judged by the judgment unit to be a special identifier is displayed, and being processing for increasing the accuracy of the determination processing in which the determining unit determines the correct OCR information.

An information processing method realized by a statistical information storage unit in which statistical information that is the result of statistical processing of the information written on each of two or more sheets is stored, a paper image reception unit, a first OCR unit, a second OCR unit, a determining unit, and an output unit, the method comprising:
a paper image receiving step in which the paper image reception unit receives a paper image that is an image obtained by reading each of the two or more sheets;
a first OCR step in which the first OCR unit performs first OCR processing on each of the two or more paper images to obtain two or more pieces of first OCR information that is information on a predetermined area;
a second OCR step in which the second OCR unit performs second OCR processing on each of the two or more paper images to obtain two or more pieces of second OCR information that is information on a predetermined area;
a determining step in which the determining unit determines correct OCR information that is OCR information corresponding to each of the two or more sheets using the statistical information, the two or more pieces of first OCR information, and the two or more pieces of second OCR information; and
an output step in which the output unit outputs inspection information that is information regarding the correct OCR information corresponding to each of the two or more sheets determined by the determining unit,
wherein the statistical information is the sum of numerical values written on each of the two or more sheets.

A program for causing a computer that can access a statistical information storage unit, in which statistical information that is the result of statistical processing of the information written on each of two or more sheets is stored, to function as:
a paper image reception unit that receives a paper image that is an image obtained by reading each of the two or more sheets;
a first OCR unit that performs first OCR processing on each of the two or more paper images and obtains two or more pieces of first OCR information that is information on a predetermined area;
a second OCR unit that performs second OCR processing on each of the two or more paper images and obtains two or more pieces of second OCR information that is information on a predetermined area;
a determining unit that determines correct OCR information that is OCR information corresponding to each of the two or more sheets using the statistical information, the two or more pieces of first OCR information, and the two or more pieces of second OCR information; and
an output unit that outputs inspection information that is information regarding the correct OCR information corresponding to each of the two or more sheets determined by the determining unit,
wherein the statistical information is the sum of numerical values written on each of the two or more sheets.

A program for causing a computer that can access a statistical information storage unit, in which statistical information that is the result of statistical processing of the information written on each of two or more sheets is stored, to function as:
a paper image reception unit that receives a paper image that is an image obtained by reading each of the two or more sheets;
a first OCR unit that performs first OCR processing on each of the two or more paper images and obtains two or more pieces of first OCR information that is information on a predetermined area;
a second OCR unit that performs second OCR processing on each of the two or more paper images and obtains two or more pieces of second OCR information that is information on a predetermined area;
a determining unit that determines correct OCR information that is OCR information corresponding to each of the two or more sheets using the statistical information, the two or more pieces of first OCR information, and the two or more pieces of second OCR information; and
an output unit that outputs inspection information that is information regarding the correct OCR information corresponding to each of the two or more sheets determined by the determining unit,
wherein the paper image reception unit receives two or more different paper images for each of the two or more sheets,
the first OCR unit performs the first OCR processing on each of the two or more different paper images for each of the two or more sheets to obtain four or more pieces of first OCR information, each being information on a predetermined area,
the second OCR unit performs the second OCR processing on each of the two or more different paper images for each of the two or more sheets to obtain four or more pieces of second OCR information, each being information on a predetermined area, and
the determining unit uses the statistical information, the four or more pieces of first OCR information, and the four or more pieces of second OCR information to determine correct OCR information that is OCR information corresponding to each of the two or more sheets.