JP2016200899A

JP2016200899A - Information processing apparatus, information processing method, and program

Info

Publication number: JP2016200899A
Application number: JP2015079233A
Authority: JP
Inventors: 健一阿部; Kenichi Abe
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2015-04-08
Filing date: 2015-04-08
Publication date: 2016-12-01

Abstract

PROBLEM TO BE SOLVED: To estimate an appropriate attribute value of a document image.SOLUTION: An information processing apparatus includes: recognition means of extracting attribute values of document images by applying recognition processing to each of a plurality of document images belonging to a document image group; result determination means for determining whether the recognition processing has been successful or not, in accordance with a recognition condition; document type specifying means for specifying a document type of the document image; reference image specifying means for specifying a document image determined to have succeeded in recognition processing and having the identified document type, as a reference image, from among the document images belonging to the document image group; and estimating means for estimating an attribute value of the document image determined to have failed in recognition processing, on the basis of an attribute value of the document image specified by the reference image specifying means.SELECTED DRAWING: Figure 1

Description

本発明は、情報処理装置、情報処理方法及びプログラムに関する。 The present invention relates to an information processing apparatus, an information processing method, and a program.

近年、病院等の医療機関において情報化が進み、患者の診断情報を管理する電子カルテを導入する機関が増えている。電子カルテ上には、患者の診断履歴、投薬情報、手術情報、患部画像、単純Ｘ線画像等が一元管理され、データの共有や再利用が簡単にできるようになってきている。また、近年のストレージの大容量化によって、院内で利用する文書や、ＣＴ、ＭＲＩ、単純Ｘ線等のデジタル画像を大量にストレージに格納できるようになった。 In recent years, computerization has progressed in medical institutions such as hospitals, and an increasing number of institutions have introduced electronic medical records that manage patient diagnosis information. On the electronic medical record, patient diagnosis history, medication information, surgery information, affected area images, simple X-ray images, and the like are centrally managed, and data can be easily shared and reused. In addition, with the recent increase in storage capacity, documents used in the hospital and digital images such as CT, MRI, and simple X-rays can be stored in large amounts in the storage.

しかしながら、電子カルテ等基幹システムが電子化されている病院においてもなお院内には紙文書が存在している。このような病院においては、生成される紙文書のうち保存が必要なものは電子化し、文書種別や患者情報を属性値として付与して文書管理システムや電子カルテで保存するという運用をしている。中規模以上の病院では、一日に数千枚の紙文書スキャンが発生することもあり、紙文書を電子化するための人員を割いていることも珍しくない。 However, even in hospitals where the core system such as electronic medical records is digitized, paper documents still exist in the hospital. In such hospitals, generated paper documents that need to be stored are digitized, and document types and patient information are assigned as attribute values and stored in a document management system or electronic medical record. . In hospitals of medium or larger scale, thousands of paper document scans may occur every day, and it is not uncommon to have personnel to digitize paper documents.

このような紙文書のスキャン読み取り作業を支援する試みとして、特許文献１には、スキャンにより得られた文書画像から文字認識により文書タイトル等を抽出してユーザに提示し、目視確認及び修正の上、登録する技術が開示されている。また、特許文献２には、１次元バーコードや２次元コードを用いて紙文書に文書種別や患者情報を印字し、スキャンにより得られた文書画像からバーコードを読み取って文書種別や患者情報を復元する技術が開示されている。 As an attempt to support such a scan reading operation of a paper document, Patent Document 1 discloses that a document title or the like is extracted from a document image obtained by scanning by character recognition and presented to a user for visual confirmation and correction. A technique for registering is disclosed. Further, Patent Document 2 prints a document type and patient information on a paper document using a one-dimensional barcode or a two-dimensional code, reads the barcode from a document image obtained by scanning, and displays the document type and patient information. A technique for restoring is disclosed.

特開２０００−１４８７８８号公報JP 2000-148788 A 特開２０１３−１３１０４４号公報JP 2013-131044 A

しかしながら、文字認識をベースとする方法では、文書によっては十分な認識精度が得られないという問題がある。また、紙文書にＱＲコード（登録商標）を印字する方法は文字認識に比べ認識精度で有利であるが、スキャン読み取り対象の情報を紙文書に印字する装置を備えている必要がある。このため、印字のための装置を備えていない病院や診療所から発行された紙文書に対応できない。また、ＱＲコードの読み取りに失敗する可能性もある。 However, a method based on character recognition has a problem that sufficient recognition accuracy cannot be obtained depending on a document. In addition, a method of printing a QR code (registered trademark) on a paper document is more advantageous in recognition accuracy than character recognition, but it is necessary to have a device for printing information to be scanned and read on a paper document. For this reason, it is not possible to deal with paper documents issued from hospitals or clinics that do not have a printing device. In addition, there is a possibility that reading the QR code may fail.

文書画像の認識処理を利用しつつ、文書画像の正しい属性値を推定することを目的とする。 An object is to estimate a correct attribute value of a document image while using a document image recognition process.

そこで、本発明は、情報処理装置であって、文書画像群に属する複数の文書画像それぞれに対する認識処理により、前記文書画像の属性値を抽出する認識手段と、認識条件にしたがい認識処理に成功したか否かを判定する結果判定手段と、前記文書画像の文書タイプを特定する文書タイプ特定手段と、前記文書画像群に属する前記文書画像の中から、前記認識処理に成功したと判定され、かつ特定された文書タイプが参照タイプの前記文書画像を参照画像として特定する参照画像特定手段と、前記参照画像特定手段により特定された前記文書画像の前記属性値に基づいて、前記認識処理に失敗したと判定された前記文書画像の前記属性値を推定する推定手段とを有することを特徴とする。 Therefore, the present invention is an information processing apparatus, and by recognition processing for each of a plurality of document images belonging to a document image group, the recognition means for extracting the attribute value of the document image and the recognition processing succeeded according to the recognition conditions. A result determination unit that determines whether or not, a document type identification unit that identifies a document type of the document image, a determination that the recognition process has been successful from the document images belonging to the document image group, and Based on the attribute value of the document image specified by the reference image specifying means for specifying the document image whose reference type is the reference type as the reference image, the recognition processing has failed. And estimating means for estimating the attribute value of the document image determined as.

本発明によれば、文書画像の正しい属性値を推定することができる。 According to the present invention, a correct attribute value of a document image can be estimated.

文書管理システムを示す図である。It is a figure which shows a document management system. サーバ装置のハードウェア構成を示す図である。It is a figure which shows the hardware constitutions of a server apparatus. 文書画像登録処理を示すフローチャートである。It is a flowchart which shows a document image registration process. 属性値抽出処理を示すフローチャートである。It is a flowchart which shows an attribute value extraction process. 属性パターンの説明図である。It is explanatory drawing of an attribute pattern. 属性パターン判定処理を示すフローチャートである。It is a flowchart which shows an attribute pattern determination process. 連続確率算出処理の説明図である。It is explanatory drawing of a continuous probability calculation process. 属性値推定処理を示すフローチャートである。It is a flowchart which shows an attribute value estimation process. 参照タイプテーブルの一例を示す図である。It is a figure which shows an example of a reference type table. 第１の参照画像検出処理を示すフローチャートである。It is a flowchart which shows a 1st reference image detection process. 文書タイプテーブルの一例を示す図である。It is a figure which shows an example of a document type table. 属性値推定処理の説明図である。It is explanatory drawing of an attribute value estimation process. 確認画面の一例を示す図である。It is a figure which shows an example of a confirmation screen. 修正履歴テーブルのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of a correction history table. 参照テーブル生成処理を示すフローチャートである。It is a flowchart which shows a reference table production | generation process. 変更例に係る確認画面の一例を示す図である。It is a figure which shows an example of the confirmation screen which concerns on the example of a change. 変更例に係る確認画面の一例を示す図である。It is a figure which shows an example of the confirmation screen which concerns on the example of a change. 変更例に係る確認画面の一例を示す図である。It is a figure which shows an example of the confirmation screen which concerns on the example of a change.

以下、本発明の実施形態について図面に基づいて説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

（第１の実施形態）
図１は、第１の実施形態に係る文書管理システム１００を示す図である。文書管理システム１００は、紙媒体から文書画像を生成し、文書画像内に示された属性値を抽出し、文書画像と属性値とを紐付けて登録、管理する。文書管理システム１００は、読取装置１１０と、情報処理装置の一例としてのサーバ装置１２０と、クライアント装置１３０とを有しており、これらの装置は、有線又は無線のネットワーク１４０を介して通信接続されている。なお、本実施形態においては、文書管理システム１００が、医療文書を示す文書画像を管理する場合を例に説明する。 (First embodiment)
FIG. 1 is a diagram illustrating a document management system 100 according to the first embodiment. The document management system 100 generates a document image from a paper medium, extracts an attribute value indicated in the document image, registers and manages the document image and the attribute value in association with each other. The document management system 100 includes a reading device 110, a server device 120 as an example of an information processing device, and a client device 130, and these devices are communicatively connected via a wired or wireless network 140. ing. In the present embodiment, a case where the document management system 100 manages a document image indicating a medical document will be described as an example.

読取装置１１０は、紙文書をスキャンして文書画像を生成する。読取装置１１０は、例えば、オートドキュメントフィーダ（以下ＡＤＦ）という自動給紙機構を備えたスキャナ装置である。また、他の例としては、読取装置１１０は、紙文書を撮影するデジタルカメラであってもよい。読取装置１１０は、文書画像生成部１１１と送受信部１１２とを有している。文書画像生成部１１１は、入力された紙文書を電子的に読み取り（スキャンし）、１枚の紙文書から１つの文書画像を生成する。送受信部１１２は、ネットワーク１４０を介して、外部装置とデータの送受信を行う。ユーザによって読取装置１１０に複数の紙原稿がセットされると、紙原稿は１ページずつ不図示のスキャナ部に送られる。そして、文書画像生成部１１１は、スキャナ部の画像読取処理の結果に基づいて、複数の紙原稿それぞれに対する、複数の文書画像を生成する。なお、本実施形態においては、複数の紙文書からなる紙文書群がまとめてスキャンされ、これに対応して、複数の文書画像を含む文書画像群が生成されるものとする。文書画像群は、送受信部１１２によりサーバ装置１２０に送信される。 The reading device 110 scans a paper document and generates a document image. The reading device 110 is, for example, a scanner device provided with an automatic paper feeding mechanism called an auto document feeder (hereinafter referred to as ADF). As another example, the reading device 110 may be a digital camera that captures a paper document. The reading device 110 includes a document image generation unit 111 and a transmission / reception unit 112. The document image generation unit 111 electronically reads (scans) an input paper document and generates one document image from one paper document. The transmission / reception unit 112 transmits / receives data to / from an external device via the network 140. When a plurality of paper originals are set on the reading device 110 by the user, the paper originals are sent page by page to a scanner unit (not shown). Then, the document image generation unit 111 generates a plurality of document images for each of the plurality of paper originals based on the result of the image reading process of the scanner unit. In the present embodiment, it is assumed that a paper document group composed of a plurality of paper documents is scanned together and a document image group including a plurality of document images is generated correspondingly. The document image group is transmitted to the server apparatus 120 by the transmission / reception unit 112.

サーバ装置１２０は、読取装置１１０により生成された文書画像を登録、管理する。サーバ装置１２０は、送受信部１２１と、属性値抽出部１２２と、属性パターン特定部１２３と、属性値推定部１２４と、画像管理部１２５と、修正管理部１２６と、参照タイプテーブル生成部１２７と、を有している。送受信部１２１は、ネットワーク１４０を介して、外部装置とデータの送受信を行う。送受信部１２１は、例えば読取装置１１０から文書画像群を受信する。属性値抽出部１２２は、文書画像群に含まれる各文書画像から認識技術を用いて文字やバーコードで表された属性値を抽出する。属性パターン特定部１２３は、文書画像群の属性パターンを特定する。属性パターンは、文書画像群に含まれる複数の文書画像が示す属性値に関する特徴を示す情報である。属性値推定部１２４は、属性パターンに基づいて、文書画像群の属性値を推定する。画像管理部１２５は、文書画像の登録、管理を行う。修正管理部１２６は、属性値抽出部１２２により得られた属性値の修正等を行い、修正処理に係る履歴情報を記録する。参照タイプテーブル生成部１２７は、修正処理に係る履歴情報に基づいて、参照タイプテーブルを生成する。なお、参照タイプテーブル、及び参照タイプテーブルを生成する処理については、後述する。 The server device 120 registers and manages the document image generated by the reading device 110. The server apparatus 120 includes a transmission / reception unit 121, an attribute value extraction unit 122, an attribute pattern specification unit 123, an attribute value estimation unit 124, an image management unit 125, a modification management unit 126, and a reference type table generation unit 127. ,have. The transmission / reception unit 121 transmits / receives data to / from an external device via the network 140. The transmission / reception unit 121 receives a document image group from the reading device 110, for example. The attribute value extraction unit 122 extracts an attribute value represented by a character or a barcode from each document image included in the document image group using a recognition technique. The attribute pattern specifying unit 123 specifies the attribute pattern of the document image group. The attribute pattern is information indicating characteristics relating to attribute values indicated by a plurality of document images included in the document image group. The attribute value estimation unit 124 estimates the attribute value of the document image group based on the attribute pattern. The image management unit 125 registers and manages document images. The correction management unit 126 corrects the attribute value obtained by the attribute value extraction unit 122 and records history information related to the correction process. The reference type table generation unit 127 generates a reference type table based on the history information related to the correction process. The reference type table and the process for generating the reference type table will be described later.

クライアント装置１３０は、読取装置１１０により生成された文書画像や、文書画像の属性値をユーザがチェックするための手段を提供する。クライアント装置１３０は、表示処理部１３１と、指示受付部１３２と、送受信部１３３と、を有している。表示処理部１３１は、文書画像等を表示画面に表示する。指示受付部１３２は、ユーザから入力された指示を受け付ける。送受信部１３３は、ネットワーク１４０を介して、外部装置とデータの送受信を行う。なお、文書管理システム１００の構成は、本実施形態に限定されるものではない。他の例としては、サーバ装置１２０とクライアント装置１３０は、一体に設けられていてもよい。 The client device 130 provides a means for the user to check the document image generated by the reading device 110 and the attribute value of the document image. The client device 130 includes a display processing unit 131, an instruction receiving unit 132, and a transmission / reception unit 133. The display processing unit 131 displays a document image or the like on the display screen. The instruction receiving unit 132 receives an instruction input from the user. The transmission / reception unit 133 transmits / receives data to / from an external device via the network 140. The configuration of the document management system 100 is not limited to this embodiment. As another example, the server apparatus 120 and the client apparatus 130 may be provided integrally.

図２は、サーバ装置１２０のハードウェア構成を示す図である。サーバ装置１２０は、ＣＰＵ２０１と、ＲＯＭ２０２と、ＲＡＭ２０３と、ＨＤＤ２０４と、表示部２０５と、入力部２０６と、ネットワークＩ／Ｆ部２０７とを有している。ＣＰＵ２０１は、ＲＯＭ２０２に記憶された制御プログラムを読み出して各種処理を実行する。ＲＡＭ２０３は、ＣＰＵ２０１の主メモリ、ワークエリア等の一時記憶領域として用いられる。ＨＤＤ２０４は、画像データや各種プログラム等各種情報を記憶する。なお、後述するサーバ装置１２０の機能や処理は、ＣＰＵ２０１がＲＯＭ２０２又はＨＤＤ２０４に格納されているプログラムを読み出し、このプログラムを実行することにより実現されるものである。 FIG. 2 is a diagram illustrating a hardware configuration of the server device 120. The server device 120 includes a CPU 201, a ROM 202, a RAM 203, an HDD 204, a display unit 205, an input unit 206, and a network I / F unit 207. The CPU 201 reads a control program stored in the ROM 202 and executes various processes. The RAM 203 is used as a temporary storage area such as a main memory and a work area for the CPU 201. The HDD 204 stores various information such as image data and various programs. Note that the functions and processing of the server device 120 described later are realized by the CPU 201 reading a program stored in the ROM 202 or the HDD 204 and executing this program.

表示部２０５は、各種情報を表示する。入力部２０６は、キーボードやマウスを有し、ユーザによる各種操作を受け付ける。ネットワークＩ／Ｆ部２０７は、ネットワークを介して画像形成装置等の外部装置との通信処理を行う。なお、読取装置１１０、クライアント装置１３０のハードウェア構成は、図２を参照しつつ説明したサーバ装置１２０のハードウェア構成とほぼ同様である。但し、読取装置１１０は、文書画像を電子的に読み取るスキャナ部をさらに備えているものとする。 The display unit 205 displays various information. The input unit 206 has a keyboard and a mouse and accepts various operations by the user. A network I / F unit 207 performs communication processing with an external apparatus such as an image forming apparatus via a network. Note that the hardware configurations of the reading device 110 and the client device 130 are substantially the same as the hardware configuration of the server device 120 described with reference to FIG. However, the reading device 110 further includes a scanner unit that electronically reads a document image.

図３は、サーバ装置１２０による文書画像登録処理を示すフローチャートである。文書画像登録処理は、読取装置１１０から受信した文書画像群に含まれる文書画像を登録する処理である。なお、本実施形態においては、読取装置１１０が医療文書を入力原稿として、医療文書に対応する文書画像を生成し、一連のスキャン動作により得られた複数の文書画像を文書画像群としてサーバ装置１２０に送信するものとする。文書画像登録処理において、サーバ装置１２０は、文字認識技術により文書画像から文書種別、患者識別情報（患者ＩＤ）としての患者名、診療科識別情報（診療科ＩＤ）としての診療科名の属性値を抽出する。そして、サーバ装置１２０は、抽出した属性値をユーザが目視確認した上で文書画像と属性値とを対応付けて登録する。なお、文書種別、患者名（患者ＩＤ）、診療科名（診療科ＩＤ）を属性種別、各属性種別の値を属性値と称することとする。なお、各属性値は、英数字等を組み合わせてユニークに採番した文字列であってもよい。 FIG. 3 is a flowchart showing document image registration processing by the server apparatus 120. The document image registration process is a process for registering a document image included in a document image group received from the reading device 110. In the present embodiment, the reading device 110 generates a document image corresponding to a medical document using a medical document as an input original, and the server device 120 uses a plurality of document images obtained by a series of scanning operations as a document image group. Shall be sent to In the document image registration processing, the server device 120 uses the character recognition technology to determine the document type from the document image, the patient name as patient identification information (patient ID), and the attribute value of the clinical department name as clinical department identification information (clinical department ID). To extract. Then, the server apparatus 120 registers the document image and the attribute value in association with each other after the user visually confirms the extracted attribute value. The document type, patient name (patient ID), and department name (clinical department ID) are referred to as attribute types, and the value of each attribute type is referred to as an attribute value. Each attribute value may be a character string uniquely numbered by combining alphanumeric characters and the like.

以下、図３を参照しつつ、文書画像登録処理の概略について説明し、各処理の詳細については後述する。Ｓ３０１において、送受信部１２１は、ネットワーク１４０を介して読取装置１１０から文書画像群を受信する。Ｓ３０１において受信した文書画像群を処理対象として、以下の処理が行われる。すなわち、Ｓ３０１に続いて、Ｓ３０２において、属性値抽出部１２２は、Ｓ３０１において受信した文書画像群に含まれる複数の文書画像それぞれに対し、文字認識処理を行うことにより、文書画像の属性値を抽出する。本実施形態においては、属性値抽出部１２２は、文書種別、患者ＩＤ及び診療科名の属性値を抽出する。 Hereinafter, the outline of the document image registration process will be described with reference to FIG. 3, and details of each process will be described later. In step S 301, the transmission / reception unit 121 receives a document image group from the reading device 110 via the network 140. The following processing is performed on the document image group received in S301 as a processing target. That is, following S301, in S302, the attribute value extraction unit 122 performs character recognition processing on each of the plurality of document images included in the document image group received in S301, thereby extracting the attribute value of the document image. To do. In the present embodiment, the attribute value extracting unit 122 extracts attribute values of document type, patient ID, and department name.

次に、Ｓ３０３において、属性パターン特定部１２３は、ステップＳ３０１において受信した文書画像群の属性パターンを特定する。次に、Ｓ３０４において、属性値推定部１２４は、属性値抽出処理（Ｓ３０２）において得られた属性値に基づいて、認識処理に失敗したと判定された文書画像を含む、文書画像群の一部の文書画像に対し、属性値抽出結果よりも正確な属性値を推定する。 In step S303, the attribute pattern specifying unit 123 specifies the attribute pattern of the document image group received in step S301. Next, in S304, the attribute value estimation unit 124 includes a part of the document image group including the document image determined to have failed in the recognition process based on the attribute value obtained in the attribute value extraction process (S302). More accurate attribute values are estimated for the document image than the attribute value extraction result.

Ｓ３０４の処理が完了すると、送受信部１２１は、処理対象の文書画像と、抽出された属性値及び推定された属性値をクライアント装置１３０に送信する。この処理は、認識結果及び推定結果を出力する出力処理の一例である。クライアント装置１３０では、表示処理部１３１が、文書画像と属性値とを表示部２０５に表示する。ユーザは、表示内容を確認し、適宜、推定された属性値に対する修正指示を入力し、また文書画像の登録指示を入力する。指示受付部１３２が修正指示を受け付けると、送受信部１３３は、サーバ装置１２０に修正指示を送信する。また、指示受付部１３２が登録指示を受け付けると、送受信部１３３は、サーバ装置１２０に登録指示を送信する。 When the process of S304 is completed, the transmission / reception unit 121 transmits the processing target document image, the extracted attribute value, and the estimated attribute value to the client device 130. This process is an example of an output process for outputting a recognition result and an estimation result. In the client device 130, the display processing unit 131 displays the document image and the attribute value on the display unit 205. The user confirms the display contents, inputs a correction instruction for the estimated attribute value as appropriate, and inputs a document image registration instruction. When the instruction receiving unit 132 receives the correction instruction, the transmission / reception unit 133 transmits the correction instruction to the server device 120. When the instruction receiving unit 132 receives a registration instruction, the transmission / reception unit 133 transmits a registration instruction to the server device 120.

Ｓ３０５において、修正管理部１２６は、送受信部１２１がクライアント装置１３０から修正指示を受信したか否かを判定する。修正管理部１２６は、修正指示を受け付けた場合には（Ｓ３０５でＹｅｓ）、処理をＳ３０６へ進める。修正管理部１２６は、修正指示を受け付けなかった場合には（Ｓ３０５でＮｏ）、処理をＳ３０７へ進める。Ｓ３０６において、修正管理部１２６は、修正指示に従い、属性値を修正する。次に、Ｓ３０７において、画像管理部１２５は、送受信部１２１が登録指示を受信したか否かを判定する。画像管理部１２５は、登録指示を受信した場合には（Ｓ３０７でＹｅｓ）、処理をＳ３０８へ進める。画像管理部１２５は、登録指示を受信しなかった場合には（Ｓ３０７でＮｏ）、処理をＳ３０５へ進める。 In step S 305, the correction management unit 126 determines whether the transmission / reception unit 121 has received a correction instruction from the client device 130. When the correction management unit 126 receives a correction instruction (Yes in S305), the correction management unit 126 advances the process to S306. If the correction management unit 126 has not received the correction instruction (No in S305), the process proceeds to S307. In S306, the correction management unit 126 corrects the attribute value according to the correction instruction. In step S 307, the image management unit 125 determines whether the transmission / reception unit 121 has received a registration instruction. If the registration instruction is received (Yes in S307), the image management unit 125 advances the process to S308. If the registration instruction is not received (No in S307), the image management unit 125 advances the process to S305.

Ｓ３０８において、修正管理部１２６は、修正履歴を記録する。なお、修正が行われなかった場合には、修正がなかったことを示す情報を修正履歴として記録する。次に、Ｓ３０９において、画像管理部１２５は、文書画像を属性値に対応付けて、ＨＤＤ２０４等の記憶部に登録する。以上で、文書登録処理が完了する。 In S308, the correction management unit 126 records the correction history. If no correction has been made, information indicating no correction is recorded as a correction history. In step S309, the image management unit 125 registers the document image in the storage unit such as the HDD 204 in association with the attribute value. This completes the document registration process.

以上のように、本実施形態に係る文書管理システム１００は、スキャンした文書から認識処理により抽出した属性値だけでなく、抽出した属性値から推定した属性値をユーザに提示することができる。そして、文書管理システム１００は、ユーザが目視確認し、正しい属性値に修正した上で、文書画像に正しい属性値を付与して登録することができる。 As described above, the document management system 100 according to the present embodiment can present not only the attribute value extracted from the scanned document by the recognition process but also the attribute value estimated from the extracted attribute value to the user. Then, the document management system 100 can register the document image with the correct attribute value after the user visually confirms and corrects the attribute value.

図４は、図３を参照しつつ説明した属性値抽出処理（Ｓ３０２）における詳細な処理を示すフローチャートである。属性値抽出処理において、属性値抽出部１２２は、文書種別、患者ＩＤ及び診療科ＩＤを文字認識により抽出し、抽出元の文書画像と、属性値とを対応付ける。図４に示す処理は、１つの属性値に対する属性値抽出処理であり、属性値抽出部１２２は、抽出すべき属性値の数だけ、属性値抽出処理を実行することにより、上記３つの属性値を抽出し、３つの属性値を抽出元の１つの文書画像に対応付ける。 FIG. 4 is a flowchart showing detailed processing in the attribute value extraction processing (S302) described with reference to FIG. In the attribute value extraction process, the attribute value extraction unit 122 extracts the document type, patient ID, and medical department ID by character recognition, and associates the extraction source document image with the attribute value. The process shown in FIG. 4 is an attribute value extraction process for one attribute value, and the attribute value extraction unit 122 executes the attribute value extraction process for the number of attribute values to be extracted, thereby performing the above three attribute values. And the three attribute values are associated with one document image of the extraction source.

Ｓ４０１において、属性値抽出部１２２は、処理対象の文書画像群から１つの文書画像を処理対象として選択する。次に、Ｓ４０２において、属性値抽出部１２２は、抽出対象の属性値が含まれる可能性が高い領域を属性領域として抽出する。属性領域を抽出する処理には、既知の文書レイアウト解析技術や文字認識技術を利用することができる。例えば、文書種別についは、一般に文書タイトル（文書種別）は文書の上部にあり本文よりもフォントサイズが大きいという知識を利用することができる。すなわち、属性値抽出部１２２は、文書レイアウト解析により文書画像内の文字領域の塊を文字矩形として抽出し、抽出された文字矩形の座標値やサイズから文書タイトルらしさを計算して属性領域を抽出する。また、患者名や診療科名は、それらが記載されていることを示すラベル「患者名：」「診療科名：」等の直後に記載されている可能性が高いという知識を利用することができる。すなわち、属性値抽出部１２２は、文書内から文字認識により上記のラベルを抽出し、直後に出現する文字矩形領域を属性領域として抽出すればよい。 In step S401, the attribute value extraction unit 122 selects one document image as a processing target from the processing target document image group. In step S 402, the attribute value extraction unit 122 extracts an area that is likely to include the attribute value to be extracted as an attribute area. A known document layout analysis technique or a character recognition technique can be used for the process of extracting the attribute area. For example, for the document type, it is possible to use the knowledge that the document title (document type) is generally at the top of the document and has a larger font size than the text. That is, the attribute value extraction unit 122 extracts a block of character areas in the document image as a character rectangle by document layout analysis, and calculates an attribute area by calculating the likelihood of the document title from the coordinate value and size of the extracted character rectangle. To do. In addition, it is possible to use the knowledge that the patient name and the department name are likely to be described immediately after the labels “patient name:”, “clinical department name:” indicating that they are described. it can. That is, the attribute value extraction unit 122 may extract the label from the document by character recognition, and extract a character rectangular area that appears immediately after as an attribute area.

次に、Ｓ４０３において、属性値抽出部１２２は、属性領域が抽出されたか否かの判定を行う。属性値抽出部１２２は、属性領域が抽出できた場合には（Ｓ４０３でＹｅｓ）、処理をＳ４０６へ進める。属性値抽出部１２２は、属性領域が抽出できなかった場合には（Ｓ４０３でＮｏ）、処理をＳ４０４へ進める。Ｓ４０４において、属性値抽出部１２２は、認識処理に失敗したと判定する。次に、Ｓ４０５において、属性値抽出部１２２は、処理対象の文書画像の属性種別に対して属性値「Ｕｎｋｎｏｗｎ」を設定する。例えば、患者ＩＤを処理対象としている場合には、属性値抽出部１２２は、文書画像の患者ＩＤの属性種別に対して、属性値「Ｕｎｋｎｏｗｎ」を設定する。 Next, in S403, the attribute value extraction unit 122 determines whether or not an attribute region has been extracted. If the attribute area can be extracted (Yes in S403), the attribute value extraction unit 122 advances the process to S406. If the attribute value extraction unit 122 cannot extract the attribute area (No in S403), the process proceeds to S404. In S404, the attribute value extraction unit 122 determines that the recognition process has failed. In step S 405, the attribute value extraction unit 122 sets an attribute value “Unknown” for the attribute type of the document image to be processed. For example, when the patient ID is the processing target, the attribute value extraction unit 122 sets the attribute value “Unknown” for the attribute type of the patient ID of the document image.

一方、Ｓ４０６において、属性値抽出部１２２は、属性領域の文字列矩形画像に対して文字認識処理により、文字列情報を生成する。ここで、文字列情報は属性値であり、Ｓ４０６の処理は、文書画像群に属する複数の文書画像それぞれに対する認識処理により、文書画像の属性値を抽出する認識処理の一例である。次に、Ｓ４０７において、属性値抽出部１２２は、抽出した文字列情報が抽出対象の属性種別に対して定められた特徴条件を満たすか否かを判定する。ここで、特徴条件とは、抽出された文字列が辞書に登録されていることである。ここで、辞書は、特徴的に出現する文字列が登録されたものであり、サーバ装置１２０のＨＤＤ２０４等に予め記憶されているものとする。属性値抽出部１２２は、Ｓ４０６において抽出された文字列情報と辞書に登録された登録文字列との間で部分文字列マッチングを行い、マッチング結果に応じて、特徴条件を満たすか否かを判定する。 On the other hand, in S 406, the attribute value extraction unit 122 generates character string information by character recognition processing on the character string rectangular image in the attribute area. Here, the character string information is an attribute value, and the process of S406 is an example of a recognition process for extracting an attribute value of a document image by a recognition process for each of a plurality of document images belonging to the document image group. In step S 407, the attribute value extraction unit 122 determines whether the extracted character string information satisfies a feature condition defined for the attribute type to be extracted. Here, the feature condition is that the extracted character string is registered in the dictionary. Here, it is assumed that a character string that appears characteristically is registered in the dictionary and stored in advance in the HDD 204 or the like of the server device 120. The attribute value extraction unit 122 performs partial character string matching between the character string information extracted in S406 and the registered character string registered in the dictionary, and determines whether or not the feature condition is satisfied according to the matching result. To do.

例えば、文書種別の例では、辞書に、「問診票」、「同意書」、「説明書」、「診療情報提供書」等医療文書のタイトルに頻出する単語が予め登録されているものとする。そして、属性値抽出部１２２は、文字列情報に辞書に登録されている単語が存在する場合に、特徴条件を満たすと判定し、存在しない場合に特徴条件を満たさないと判定する。また、診療科ＩＤの例では、辞書に、「脳神経外科」、「整形外科」、「呼吸器内科」、「循環器内科」、「眼科」、「皮膚科」等の病院の診療科名が頻出する単語として登録されているものとする。 For example, in the document type example, it is assumed that words frequently appearing in the titles of medical documents such as “inquiry form”, “consent form”, “instructions”, “medical information provision form” are registered in the dictionary in advance. . The attribute value extraction unit 122 determines that the feature condition is satisfied when a word registered in the dictionary exists in the character string information, and determines that the feature condition is not satisfied when the word does not exist. In the example of the department ID, the names of hospital departments such as “neurosurgery”, “orthopedics”, “respiratory medicine”, “cardiovascular medicine”, “ophthalmology”, “dermatology” are listed in the dictionary. Assume that it is registered as a frequently occurring word.

また、他の例としては、属性値抽出部１２２は、文字列情報が、抽出対象の属性種別が示す文字数や文字の種類等の特徴条件を満たすか否かを判定してもよい。また、属性種別として数値やアルファベットを用いた患者ＩＤを利用する場合には、適用先の病院の患者ＩＤの文字種（数値、アルファベット）や文字数のパターンを特徴条件として予め登録しておいてもよい。 As another example, the attribute value extraction unit 122 may determine whether or not the character string information satisfies a feature condition such as the number of characters and the type of characters indicated by the attribute type to be extracted. In addition, when a patient ID using a numerical value or alphabet is used as the attribute type, the character type (numerical value, alphabet) of the patient ID of the application destination hospital or the pattern of the number of characters may be registered in advance as a feature condition. .

次に、Ｓ４０８において、属性値抽出部１２２は、文字列情報が特徴条件を満たす場合には（Ｓ４０８でＹｅｓ）、処理をＳ４０９へ進める。属性値抽出部１２２は、文字列情報が特徴条件を満たさない場合には（Ｓ４０８でＮｏ）、処理をＳ４０４へ進める。Ｓ４０９において、属性値抽出部１２２は、認識処理に成功したと判定する。次に、Ｓ４１０において、文字列情報を処理対象の文書画像の属性種別に対する属性値として設定する。次に、Ｓ４１１において、属性値抽出部１２２は、処理対象の文書画像群に含まれるすべての文書画像に対し、Ｓ４０１〜Ｓ４１０の処理を行ったか否かを判定する。属性値抽出部１２２は、すべての文書画像に対する処理が終了している場合には（Ｓ４１１でＹｅｓ）、属性値抽出処理を終了する。属性値抽出部１２２は、未処理の文書画像が存在する場合には（Ｓ４１１でＮｏ）、処理をＳ４０１へ進める。このように、属性値抽出部１２２は、属性領域が存在すること（Ｓ４０３）、認識により得られた文字列情報が特徴条件を満たすこと（Ｓ４０８）を認識条件として、認識条件に従い認識処理に成功したか否かを判定する。すなわち、Ｓ４０３、Ｓ４０８、Ｓ４０４、Ｓ４０９の処理は、認識条件に従い認識処理に成功したか否かを判定する結果判定処理の一例である。 Next, in S408, the attribute value extraction unit 122 advances the process to S409 when the character string information satisfies the characteristic condition (Yes in S408). If the character string information does not satisfy the feature condition (No in S408), the attribute value extraction unit 122 advances the process to S404. In step S409, the attribute value extraction unit 122 determines that the recognition process has been successful. In step S410, character string information is set as an attribute value for the attribute type of the document image to be processed. In step S 411, the attribute value extraction unit 122 determines whether the processing in steps S 401 to S 410 has been performed on all document images included in the document image group to be processed. The attribute value extraction unit 122 ends the attribute value extraction process when the processing for all the document images has been completed (Yes in S411). If there is an unprocessed document image (No in S411), the attribute value extraction unit 122 advances the process to S401. As described above, the attribute value extraction unit 122 succeeds in the recognition process according to the recognition condition with the attribute region existing (S403) and the character string information obtained by the recognition satisfying the characteristic condition (S408). Determine whether or not. That is, the processes of S403, S408, S404, and S409 are an example of a result determination process that determines whether the recognition process is successful according to the recognition conditions.

図５は、図３を参照しつつ説明した属性パターン判定処理（Ｓ３０３）において判定される属性パターンの説明図である。図５（ａ）に示すように、病棟等では、患者毎に文書管理される場合が多い。例えば、患者フォルダ５００には、病棟に入院している患者に対応する医療文書が患者毎にファイリングされて管理されている。このようなケースにおいては、患者フォルダ５００に入れられた紙文書群がＡＤＦを利用した連続スキャンにより、一括して電子化されることが多い。これにより、紙文書群に対応する文書画像群５０１が得られる。５０２は、文書画像群５０１の各文書画像の属性値である。例えば、文書画像群５０１に含まれる文書画像５１０の、文書種別の属性値は「紹介状」５１１であり、患者ＩＤの属性値は「山田太郎」５１２であり、診療科ＩＤの属性値は「耳鼻科」５１３である。図５（ａ）に示すように、患者フォルダ５００の紙文書群から得られた文書画像群５０１においては、患者ＩＤの属性種別に対し、同一の属性値が連続して出現する傾向がある。 FIG. 5 is an explanatory diagram of attribute patterns determined in the attribute pattern determination process (S303) described with reference to FIG. As shown in FIG. 5A, documents are often managed for each patient in a ward or the like. For example, in the patient folder 500, medical documents corresponding to patients admitted to a ward are filed and managed for each patient. In such a case, a group of paper documents placed in the patient folder 500 is often digitized collectively by continuous scanning using ADF. As a result, a document image group 501 corresponding to the paper document group is obtained. Reference numeral 502 denotes an attribute value of each document image in the document image group 501. For example, in the document image 510 included in the document image group 501, the attribute value of the document type is “introduction letter” 511, the attribute value of the patient ID is “Taro Yamada” 512, and the attribute value of the department ID is “ "Otolaryngology" 513. As shown in FIG. 5A, in the document image group 501 obtained from the paper document group of the patient folder 500, the same attribute value tends to appear continuously for the attribute type of the patient ID.

また、図５（ｂ）に示すように、病院内の受付等では、文書種別毎に文書管理される場合が多い。例えば、問診票トレイ５２０には、受付で複数の患者により記入された問診票がすべて管理されている。このようなケースにおいては、問診票トレイ５２０に入れられた紙文書群に対応する文書画像群５２１において、文書種別の同一の属性値が連続して出現する傾向がある。また、図５（ｃ）に示すように、病院内の各診療科等では、特定の診療科名の医療文書が管理される場合が多い。例えば、耳鼻科の診療科トレイ５３０には、耳鼻科内で利用する紙文書が管理されている。このようなケースにおいては、診療科トレイ５３０に入れられた紙文書群に対応する文書画像群５３１において、診療科ＩＤの同一の属性値が連続して出現する傾向がある。 Further, as shown in FIG. 5B, in the hospital reception or the like, there are many cases where documents are managed for each document type. For example, in the questionnaire sheet tray 520, all the questionnaire sheets filled by a plurality of patients at the reception are managed. In such a case, the same attribute value of the document type tends to appear continuously in the document image group 521 corresponding to the paper document group placed in the questionnaire slip tray 520. In addition, as shown in FIG. 5C, in each department in a hospital, a medical document having a specific department name is often managed. For example, a paper document used in the otolaryngology is managed in the medical department tray 530 of the otolaryngology. In such a case, the same attribute value of the medical department ID tends to appear continuously in the document image group 531 corresponding to the paper document group placed in the medical department tray 530.

このように、文書画像群に属する複数の文書画像の属性値はランダムではなく、スキャン順に相当する配列順において一定の傾向を示すものである。本実施形態に係る属性パターン特定部１２３は、処理対象の文書画像が患者ＩＤ連続パターン、文書種別連続パターン、診療科ＩＤ連続パターンの３つの属性パターンのいずれかを判定する。ここで、患者ＩＤ連続パターンは、図５（ａ）を参照しつつ説明したように患者ＩＤの同一属性値が連続するパターンである。文書種別連続パターンは、図５（ｂ）を参照しつつ説明したように、文書種別の同一属性値が連続するパターンである。診療科ＩＤ連続パターンは、図５（ｃ）を参照しつつ説明したように、診療科ＩＤの同一属性値が連続するパターンである。 As described above, the attribute values of a plurality of document images belonging to the document image group are not random, and show a certain tendency in the arrangement order corresponding to the scan order. The attribute pattern specifying unit 123 according to the present embodiment determines whether the document image to be processed is one of three attribute patterns: a patient ID continuous pattern, a document type continuous pattern, and a medical department ID continuous pattern. Here, the patient ID continuous pattern is a pattern in which the same attribute value of the patient ID continues as described with reference to FIG. As described with reference to FIG. 5B, the document type continuous pattern is a pattern in which the same attribute value of the document type is continuous. As described with reference to FIG. 5C, the clinical department ID continuous pattern is a pattern in which the same attribute value of the medical department ID is continuous.

図６は、図３を参照しつつ説明した属性パターン判定処理（Ｓ３０３）における詳細な処理を示すフローチャートである。ここでは、図７（ａ）に示すスキャンジョブＪ１の文書画像群７００の文書画像ｄＩ（ｔ１）〜ｄＩ（ｔ８）を処理対象とする場合を例に説明する。なお、処理対象の文書画像群７００に含まれる複数の文書画像は、スキャンによる生成順に対応した配列順に並んでいるものとする。図７（ａ）には、説明の便宜上、文書画像の属性値のうち、属性種別「患者ＩＤ」の属性値のみを示している。 FIG. 6 is a flowchart showing detailed processing in the attribute pattern determination processing (S303) described with reference to FIG. Here, a case where the document images dI (t1) to dI (t8) of the document image group 700 of the scan job J1 shown in FIG. Note that a plurality of document images included in the document image group 700 to be processed are arranged in an arrangement order corresponding to the generation order by scanning. For convenience of explanation, FIG. 7A shows only the attribute value of the attribute type “patient ID” among the attribute values of the document image.

Ｓ６０１において、属性パターン特定部１２３は、属性種別を設定し、処理対象の文書画像群中の文書画像の、設定された属性種別に対応する属性値を処理対象として特定する。以下、患者ＩＤの属性種別が設定された場合を例に説明する。この場合、図７（ａ）の例においては、属性種別「患者ＩＤ」（ａｔｔ＿Ｐ）とその属性値とが処理対象として特定される。 In step S601, the attribute pattern specifying unit 123 sets an attribute type, and specifies, as a processing target, an attribute value corresponding to the set attribute type of the document image in the processing target document image group. Hereinafter, a case where the attribute type of the patient ID is set will be described as an example. In this case, in the example of FIG. 7A, the attribute type “patient ID” (att_P) and its attribute value are specified as the processing target.

次に、Ｓ６０２において、属性パターン特定部１２３は、処理対象の複数の文書画像の配列順において、Ｓ６０１において設定された属性種別の同一の属性値が連続する確率、即ち連続確率を算出する。具体的には、属性パターン特定部１２３は、（式１）により連続確率を算出する。また、ｆ（Ｘ、Ｙ）は、次式により算出される。

属性ａｔｔ＿Ｘに対する属性値が連続する割合Ｐ（ａｔｔ＿Ｘ）を計算する式は、処理対象とする属性値の数をｎとすると、（式１）のように表される。（式１）は、ある属性の系列に対し、隣接する属性値が同じ値をカウントし、最大カウント数ｎ−１（全て同じ値であった場合）で正規化したものである。 Next, in step S602, the attribute pattern specifying unit 123 calculates a probability that the same attribute value of the attribute type set in step S601 continues, that is, a continuous probability, in the arrangement order of the plurality of document images to be processed. Specifically, the attribute pattern specifying unit 123 calculates the continuity probability using (Equation 1). F (X, Y) is calculated by the following equation.

An expression for calculating the ratio P (att_X) where the attribute values for the attribute att_X are continuous is expressed as (Expression 1), where n is the number of attribute values to be processed. (Expression 1) is obtained by counting the same value of adjacent attribute values for a certain attribute series, and normalizing by a maximum count number n−1 (when all are the same value).

図７を参照しつつ、連続確率算出処理（Ｓ６０２）について具体的に説明する。図７（ａ）は、連続入力されたスキャンジョブＪ１から得られた文書画像群７００を示す図である。文書画像群７００の各文書画像は、同一スキャンジョブ内で読み取られた順に管理される。このスキャン順を文書画像の配列順とする。文書画像群７００の配列順を、１番目から順に、ｄＩ（ｔ１）、ｄＩ（ｔ２）、・・・、ｄＩ（ｔｎ）で表す。また、７０１には、各文書画像の患者ＩＤを示す属性ａｔｔ＿Ｐを示している。文書画像ｄＩ（ｔｎ）に対応する属性値をａｔｔ＿Ｐ（ｔｎ）で表す。また、７０２には、各文書画像の属性値を示している。 The continuous probability calculation process (S602) will be specifically described with reference to FIG. FIG. 7A is a diagram showing a document image group 700 obtained from the continuously input scan job J1. Each document image in the document image group 700 is managed in the order read in the same scan job. This scan order is the document image arrangement order. The order of arrangement of the document image group 700 is represented by dI (t1), dI (t2),..., DI (tn) in order from the first. Reference numeral 701 denotes an attribute att_P indicating the patient ID of each document image. An attribute value corresponding to the document image dI (tn) is represented by att_P (tn). Reference numeral 702 denotes attribute values of each document image.

例えば、図７（ａ）に示す文書画像群７００においては、（式２）により、配列順において隣接する文書画像の属性値が同じ値であれば「１」、異なる値であれば「０」が算出される。図７（ａ）の例では、属性種別「患者ＩＤ」の属性値ａｔｔ＿Ｐ（ｔ１）とａｔｔ＿Ｐ（ｔ２）はいずれも「山田太郎」である。このため、「１」が算出される。一方、属性値ａｔｔ＿Ｐ（ｔ２）とａｔｔ＿Ｐ（ｔ３）は異なるため、「０」が算出される。文書画像群７００においては、属性値の数ｎは８個、連続する属性値の数は５個である。したがって、（式１）より、連続確率「０．７１」が算出される。
Ｐ（ａｔｔ＿Ｐ）＝５／（８−１）＝０．７１ For example, in the document image group 700 shown in FIG. 7A, according to (Equation 2), “1” is used if the attribute values of adjacent document images in the arrangement order are the same, and “0” if they are different. Is calculated. In the example of FIG. 7A, the attribute values att_P (t1) and att_P (t2) of the attribute type “patient ID” are both “Taro Yamada”. For this reason, “1” is calculated. On the other hand, since the attribute values att_P (t2) and att_P (t3) are different, “0” is calculated. In the document image group 700, the number n of attribute values is 8, and the number of consecutive attribute values is 5. Therefore, the continuity probability “0.71” is calculated from (Equation 1).
P (att_P) = 5 / (8-1) = 0.71

一方、図７（ｂ）に示すスキャンジョブＪ２の文書画像群７１０においては、属性値の数ｎは８個、連続する属性値の数は０個である。したがって、（式１）より、連続確率「０」が算出される。
Ｐ（ａｔｔ＿Ｐ）＝０／（８−１）＝０ On the other hand, in the document image group 710 of the scan job J2 shown in FIG. 7B, the number n of attribute values is 8, and the number of consecutive attribute values is 0. Therefore, the continuity probability “0” is calculated from (Equation 1).
P (att_P) = 0 / (8-1) = 0

このように、（式１）は、連続する属性値が多い程高い値を示す。１つの文書画像群において、ある属性種別の属性値がすべて異なる値である場合には、連続確率は、最小値「０」となり、すべて同じ値である場合には、連続確率は最大値「１」となる。 Thus, (Expression 1) indicates a higher value as the number of consecutive attribute values increases. If the attribute values of a certain attribute type are all different values in one document image group, the continuous probability is the minimum value “0”, and if all are the same value, the continuous probability is the maximum value “1”. "

次に、Ｓ６０３において、属性パターン特定部１２３は、連続確率が閾値以上か否かを判定する。属性パターン特定部１２３は、連続確率が閾値以上の場合には（Ｓ６０３でＹｅｓ）、処理をＳ６０４へ進める。Ｓ６０４において、属性パターン特定部１２３は、処理対象の文書群の属性パターンが、属性種別に対応する連続パターンであると判定する。例えば、患者ＩＤの属性種別を処理対象としている場合には、患者ＩＤ連続パターンと判定する。一方、Ｓ６０３において、属性パターン特定部１２３は、連続確率が閾値未満の場合には（Ｓ６０３でＮｏ）、処理対象とする属性種別に対応する連続パターンではないと判定し、属性パターン判定処理を終了する。ここで、Ｓ６０１〜Ｓ６０４の処理は、属性値の連続性に基づいて、属性パターンを特定する属性パターン特定処理の一例である。 In step S 603, the attribute pattern specifying unit 123 determines whether the continuity probability is greater than or equal to a threshold value. The attribute pattern specific | specification part 123 advances a process to S604, when a continuous probability is more than a threshold value (it is Yes at S603). In step S604, the attribute pattern specifying unit 123 determines that the attribute pattern of the document group to be processed is a continuous pattern corresponding to the attribute type. For example, when the attribute type of the patient ID is a processing target, it is determined as a patient ID continuous pattern. On the other hand, in S603, when the continuous probability is less than the threshold (No in S603), the attribute pattern specifying unit 123 determines that the pattern is not a continuous pattern corresponding to the attribute type to be processed, and ends the attribute pattern determination process. To do. Here, the processing of S601 to S604 is an example of attribute pattern specifying processing for specifying an attribute pattern based on the continuity of attribute values.

属性値抽出部１２２は、処理対象の文書画像群の３つの属性種別それぞれを処理対象として、図６に示す処理を実行することにより、文書画像群の属性パターンを判定する。これにより、１つの文書画像群に対し、１又は２以上の属性パターンが特定される。 The attribute value extraction unit 122 determines the attribute pattern of the document image group by executing the process shown in FIG. 6 for each of the three attribute types of the document image group to be processed. Thereby, one or more attribute patterns are specified for one document image group.

図８は、図３を参照しつつ説明した属性値推定処理（Ｓ３０４）における詳細な処理を示すフローチャートである。Ｓ８０１において、属性値推定部１２４は、属性パターン特定処理（Ｓ３０３）において処理対象の文書画像群に対して特定された属性パターンにおいて連続する属性種別を推定対象として設定する。以下、推定対象が患者ＩＤの属性種別である場合を例に説明する。 FIG. 8 is a flowchart showing detailed processing in the attribute value estimation processing (S304) described with reference to FIG. In step S 801, the attribute value estimation unit 124 sets, as an estimation target, consecutive attribute types in the attribute pattern specified for the document image group to be processed in the attribute pattern specification process (S 303). Hereinafter, a case where the estimation target is the attribute type of the patient ID will be described as an example.

次に、Ｓ８０２において、属性値推定部１２４は、処理対象の文書画像群の属性パターンに基づいて、参照タイプを特定する。ここで、参照タイプとは、推定対象の属性種別の属性値の認識率が高い文書タイプである。図９は、参照タイプテーブルの一例を示す図である。参照タイプテーブル９００は、例えば、ＨＤＤ１０４等に記憶されているものとする。参照タイプテーブル９００は、属性パターンにおいて連続する属性種別と、参照タイプとして設定されている文書タイプとを対応付けて記憶している。すなわち、Ｓ８０２の処理は、属性パターンに基づいて、文書画像群の参照タイプを特定する参照タイプ特定処理の一例である。患者ＩＤ連続パターンにおいては、図９の参照タイプテーブル９００から、患者ＩＤの属性種別に対応する、「Ａ」、「Ｂ」、「Ｆ」の文書タイプが参照タイプとして特定される。 In step S 802, the attribute value estimation unit 124 identifies a reference type based on the attribute pattern of the document image group to be processed. Here, the reference type is a document type having a high recognition rate of the attribute value of the attribute type to be estimated. FIG. 9 is a diagram illustrating an example of the reference type table. It is assumed that the reference type table 900 is stored in the HDD 104 or the like, for example. The reference type table 900 stores attribute types consecutive in the attribute pattern in association with document types set as reference types. That is, the process of S802 is an example of a reference type specifying process that specifies the reference type of the document image group based on the attribute pattern. In the patient ID continuous pattern, the document types “A”, “B”, and “F” corresponding to the attribute type of the patient ID are specified as reference types from the reference type table 900 of FIG.

次に、Ｓ８０３において、属性値推定部１２４は、文書画像群の複数の文書画像の中から第１の参照画像を検索する。ここで、第１の参照画像とは、推定対象の属性種別の属性値を推定する際に、同種別の属性値を参照する文書画像である。ここで、図１０を参照しつつ第１の参照画像検索処理（Ｓ８０３）について詳述する。 In step S 803, the attribute value estimation unit 124 searches for a first reference image from a plurality of document images in the document image group. Here, the first reference image is a document image that refers to the attribute value of the same type when estimating the attribute value of the attribute type to be estimated. Here, the first reference image search process (S803) will be described in detail with reference to FIG.

図１０は、第１の参照画像検出処理（Ｓ８０３）における詳細な処理を示すフローチャートである。Ｓ１００１において、属性値推定部１２４は、処理対象の文書画像群の複数の文書画像のうち配列順が１番目の文書画像を処理対象として設定する。以下、処理対象の文書画像を対象画像と称する。次に、Ｓ１００２において、属性値推定部１２４は、属性値抽出処理（Ｓ３０２）において、対象画像の認識処理に成功したか否かを判定する。なお、成功している場合には、対象画像の推定対象の文書種別に対し、属性値が対応付けられている。また、失敗している場合には、対象画像の推定対象の文書種別に対し「Ｕｎｋｎｏｗｎ」が対応付けられている。属性値推定部１２４は、認識処理に成功している場合には（Ｓ１００２でＹｅｓ）、処理をＳ１００３へ進める。属性パターン特定部１２３は、認識処理に失敗している場合には（Ｓ１００２でＮｏ）、処理をＳ１００７へ進める。 FIG. 10 is a flowchart showing detailed processing in the first reference image detection processing (S803). In step S 1001, the attribute value estimation unit 124 sets a document image with the first arrangement order as a processing target among a plurality of document images in the processing target document image group. Hereinafter, the document image to be processed is referred to as a target image. In step S 1002, the attribute value estimation unit 124 determines whether the target image recognition process has succeeded in the attribute value extraction process (S 302). In the case of success, the attribute value is associated with the document type to be estimated of the target image. In addition, when it has failed, “Unknown” is associated with the document type of the estimation target of the target image. If the recognition process is successful (Yes in S1002), the attribute value estimation unit 124 advances the process to S1003. If the recognition process has failed (No in S1002), the attribute pattern specifying unit 123 advances the process to S1007.

Ｓ１００３において、属性値推定部１２４は、対象画像の特徴量を算出する。属性値推定部１２４は、例えば、対象画像を解析してテキスト領域と図表領域、背景領域を抽出し、抽出されたそれぞれの領域の割合を示すレイアウト特徴を求める。属性値推定部１２４はさらに、テキスト領域に占めるひらがな及びカタカナ、漢字、アルファベット、数字の割合を示すテキスト特徴、図表領域の色のＲＧＢ成分の割合のベクトル表現を求める。そして、属性値推定部１２４は、これらに基づいて特徴量を算出する。なお、特徴量算出処理については、例えば、特開２００６−３１８２１９号公報を参照することができる。なお、特徴量は、文書画像の特徴を示すものであればよく、特徴量の具体的な種類、特徴量を算出するための処理は、実施形態に限定されるものではない。 In S1003, the attribute value estimation unit 124 calculates the feature amount of the target image. For example, the attribute value estimation unit 124 analyzes the target image to extract a text area, a chart area, and a background area, and obtains a layout feature indicating the ratio of each extracted area. The attribute value estimation unit 124 further obtains a text representation indicating the ratio of hiragana, katakana, kanji, alphabet, and numbers in the text area, and a vector representation of the ratio of the RGB component of the color of the chart area. Then, the attribute value estimation unit 124 calculates a feature amount based on these. For the feature amount calculation processing, for example, Japanese Patent Laid-Open No. 2006-318219 can be referred to. It should be noted that the feature amount only needs to indicate the feature of the document image, and the specific type of feature amount and the processing for calculating the feature amount are not limited to the embodiment.

次に、Ｓ１００４において、属性値推定部１２４は、文書タイプテーブルを参照し、特徴量に基づいて、文書タイプを特定する。図１１は、文書タイプテーブルの一例を示す図である。文書タイプテーブル１１００は、例えば、ＨＤＤ１０４等に記憶されているものとする。文書タイプテーブル１１００は、文書タイプと特徴量とを対応付けて記憶している。文書タイプには、病院内で使用される文書タイプが予め登録される。文書タイプには、例えば、病院内で使用するすべての文書のフォーマットが、異なる文書タイプとして登録されているものとする。また、他の例としては、類似するフォーマットや種別のフォーマットを複数の文書タイプにグルーピングし、文書タイプには、各グループの文書タイプが登録されてもよい。また、他の例としては、既知の文書分類技術や文書クラスタリング技術を用いて類似する文書を分類し、文書タイプには、それぞれの分類項目が文書タイプとして登録されてもよい。特徴量には、対応する文書タイプの文書画像の特徴量ベクトルを特徴量としてされている。 In step S1004, the attribute value estimation unit 124 refers to the document type table and identifies the document type based on the feature amount. FIG. 11 is a diagram illustrating an example of a document type table. The document type table 1100 is assumed to be stored in the HDD 104, for example. The document type table 1100 stores document types and feature amounts in association with each other. In the document type, a document type used in the hospital is registered in advance. In the document type, for example, all document formats used in the hospital are registered as different document types. As another example, similar formats or types of formats may be grouped into a plurality of document types, and the document types of each group may be registered in the document types. As another example, similar documents may be classified using a known document classification technique or document clustering technique, and each classification item may be registered as a document type in the document type. The feature quantity includes a feature quantity vector of a document image of a corresponding document type as a feature quantity.

属性値推定部１２４は、文書タイプテーブル１１００の特徴量と、Ｓ１００３において算出した特徴量の類似度を算出し、類似度が閾値以上となる文書タイプを対象画像の文書タイプとして特定する。なお、属性値推定部１２４は、類似度が最大となる文書タイプを対象画像の文書タイプとして特定してもよい。ここで、Ｓ１００４の処理は、文書画像の文書タイプを特定する文書タイプ特定処理の一例である。 The attribute value estimation unit 124 calculates the similarity between the feature amount of the document type table 1100 and the feature amount calculated in S1003, and specifies the document type with the similarity equal to or greater than the threshold as the document type of the target image. Note that the attribute value estimation unit 124 may specify the document type having the maximum similarity as the document type of the target image. Here, the process of S1004 is an example of a document type specifying process for specifying the document type of the document image.

次に、Ｓ１００５において、属性値推定部１２４は、対象画像の文書タイプが参照タイプか否かを確認する。属性値推定部１２４は、対象画像の文書タイプが参照タイプの場合には（Ｓ１００５でＹｅｓ）、処理をＳ１００６へ進める。属性パターン特定部１２３は、対象画像の文書タイプが参照タイプでない場合には（Ｓ１００５でＮｏ）、処理をＳ１００７へ進める。Ｓ１００６において、属性値推定部１２４は、対象画像を第１の参照画像に設定し、第１の参照画像検索処理を終了する。Ｓ１００７において、属性値推定部１２４は、文書画像群の配列順において、対象画像の次の文書画像が存在するか否かを確認する。属性値推定部１２４は、次の文書画像が存在する場合には（Ｓ１００７でＹｅｓ）、処理をＳ１００８へ進める。属性値推定部１２４は、次の文書画像が存在しない場合には（Ｓ１００７でＮｏ）、処理をＳ１００９へ進める。 In step S1005, the attribute value estimation unit 124 checks whether the document type of the target image is a reference type. If the document type of the target image is the reference type (Yes in S1005), the attribute value estimation unit 124 advances the process to S1006. If the document type of the target image is not the reference type (No in S1005), the attribute pattern specifying unit 123 advances the process to S1007. In step S1006, the attribute value estimation unit 124 sets the target image as the first reference image, and ends the first reference image search process. In step S 1007, the attribute value estimation unit 124 checks whether there is a document image next to the target image in the document image group arrangement order. If the next document image exists (Yes in S1007), the attribute value estimation unit 124 advances the process to S1008. If the next document image does not exist (No in S1007), the attribute value estimation unit 124 advances the process to S1009.

Ｓ１００８において、属性値推定部１２４は、次の文書画像を新たな処理対象に設定し、処理をＳ１００２へ進める。Ｓ１００９において、属性値推定部１２４は、第１の参照画像は存在しないと判定し、第１の参照画像検索処理を終了する。このように、第１の参照画像検索処理（Ｓ８０３）において、文書画像群の文書画像のうち、認識処理に成功し、かつ参照タイプの文書画像で、かつ配列順が最も前の文書画像が第１の文書画像として特定される。すなわち、Ｓ８０３の処理は、文書画像群に属する文書画像の中から、認識処理に成功したと判定され、かつ特定された文書タイプが参照タイプの文書画像の中から、さらに配列順に基づいて、参照画像として特定する参照画像特定処理の一例である。 In step S1008, the attribute value estimation unit 124 sets the next document image as a new processing target, and advances the processing to step S1002. In step S1009, the attribute value estimation unit 124 determines that the first reference image does not exist, and ends the first reference image search process. As described above, in the first reference image search process (S803), among the document images in the document image group, the document image that has been successfully recognized and is the reference type document image and that has the earliest arrangement order is the first. Specified as one document image. In other words, the process of S803 determines that the recognition process is successful from the document images belonging to the document image group, and the specified document type is referred to from the reference-type document images based on the arrangement order. It is an example of the reference image specific process specified as an image.

図８に戻り、Ｓ８０３の処理の後、Ｓ８０４において、属性値推定部１２４は、Ｓ８０３において第１の参照画像が検出されたか否かを判定する。属性値推定部１２４は、第１の参照画像が検出された場合には（Ｓ８０４でＹｅｓ）、処理をＳ８０５へ進める。属性値推定部１２４は、第１の参照画像が検出されなかった場合には（Ｓ８０４でＮｏ）、属性推定処理（Ｓ３０４）を終了する。Ｓ８０５において、属性値推定部１２４は、第１の参照画像に設定された文書画像以降の配列順の文書画像を検索対象として、第２の参照画像を検索する。ここで、第２の検索画像は、第１の参照画像と同様に、推定対象の属性種別の属性値を推定する際に、同種別の属性値を参照する文書画像である。Ｓ８０５における処理は、Ｓ８０３における処理とほぼ同様である。ただし、Ｓ８０５においては、属性値推定部１２４は、図１０に示すＳ１００１において、第１の参照画像の次の配列順の文書画像を処理対象に設定する。そして、属性値推定部１２４は、認識処理に成功し、かつ文書タイプが参照タイプの文書画像を第２の参照画像として特定する。 Returning to FIG. 8, after the processing of S803, in S804, the attribute value estimation unit 124 determines whether or not the first reference image is detected in S803. If the first reference image is detected (Yes in S804), the attribute value estimation unit 124 advances the process to S805. If the first reference image is not detected (No in S804), the attribute value estimation unit 124 ends the attribute estimation process (S304). In step S 805, the attribute value estimation unit 124 searches for the second reference image by using the document images in the order of arrangement after the document image set as the first reference image as a search target. Here, like the first reference image, the second search image is a document image that refers to the attribute value of the same type when estimating the attribute value of the attribute type to be estimated. The process in S805 is almost the same as the process in S803. However, in S805, the attribute value estimation unit 124 sets a document image in the arrangement order next to the first reference image as a processing target in S1001 illustrated in FIG. Then, the attribute value estimation unit 124 identifies a document image having a successful recognition process and a reference document type as the second reference image.

次に、Ｓ８０６において、１２４は、Ｓ８０５において第２の参照画像が検出されたか否かを判定する。属性値推定部１２４は、第２の参照画像が検出された場合には（Ｓ８０６でＹｅｓ）、処理をＳ８０７へ進める。属性値推定部１２４は、第２の参照画像が検出されなかった場合には（Ｓ８０６でＮｏ）、属性推定処理（Ｓ３０４）を終了する。Ｓ８０７において、属性値推定部１２４は、第１の参照画像と第２の参照画像の患者ＩＤの属性種別の属性値を比較する。次に、Ｓ８０８において、属性値推定部１２４は、両属性値が一致する場合には（Ｓ８０８でＹｅｓ）、処理をＳ８０９へ進める。 Next, in S806, 124 determines whether a second reference image has been detected in S805. If the second reference image is detected (Yes in S806), the attribute value estimation unit 124 advances the process to S807. If the second reference image is not detected (No in S806), the attribute value estimation unit 124 ends the attribute estimation process (S304). In S807, the attribute value estimation unit 124 compares the attribute value of the attribute type of the patient ID of the first reference image and the second reference image. Next, in S808, when the two attribute values match (Yes in S808), the attribute value estimation unit 124 advances the process to S809.

Ｓ８０９において、属性値推定部１２４は、文書画像群の配列順において、第１の参照画像と第２の参照画像の間に位置する文書画像を推定対象の文書画像として設定する。そして、属性値推定部１２４は、推定対象の文書画像に対し、一致した属性値を推定結果として対応付ける。そして、属性値推定部１２４は、処理をＳ８１１へ進める。また、Ｓ８１０において、属性値推定部１２４は、文書画像群の配列順において、第１の参照画像と第２の参照画像の間に位置する文書画像を推定対象の文書画像として設定する。そして、属性値推定部１２４は、推定対象の文書画像に対し、第１の参照画像及び第２の参照画像それぞれの推定対象の属性値を推定結果として対応付ける。そして、属性値推定部１２４は、処理をＳ８１１へ進める。Ｓ８１１において、属性値推定部１２４は、第２の参照画像を第１の参照画像に設定し、処理をＳ８０５へ進める。なお、処理対象の文書画像群に対して、複数の属性パターンが特定されている場合には、サーバ装置１２０は、各属性パターンにおいて連続する属性種別を推定対象に設定し、図８に示す属性値推定処理を実行するものとする。なお、本実施形態においては、このように、属性値推定部１２４は、推定対象の文書画像よりも配列順が前の文書画像と、推定対象の文書画像よりも配列順が後の文書画像と、を参照画像として特定するものとする。 In step S809, the attribute value estimation unit 124 sets a document image positioned between the first reference image and the second reference image in the document image group arrangement order as a document image to be estimated. Then, the attribute value estimation unit 124 associates the matched attribute value with the estimation target document image as the estimation result. Then, the attribute value estimation unit 124 advances the process to S811. In S810, the attribute value estimation unit 124 sets a document image located between the first reference image and the second reference image in the document image group arrangement order as a document image to be estimated. Then, the attribute value estimation unit 124 associates the estimation target attribute values of the first reference image and the second reference image as estimation results with the estimation target document image. Then, the attribute value estimation unit 124 advances the process to S811. In S811, the attribute value estimation unit 124 sets the second reference image as the first reference image, and the process proceeds to S805. When a plurality of attribute patterns are specified for the document image group to be processed, the server apparatus 120 sets consecutive attribute types in each attribute pattern as estimation targets, and the attributes shown in FIG. It is assumed that value estimation processing is executed. In this embodiment, as described above, the attribute value estimation unit 124 includes a document image whose arrangement order is earlier than the estimation target document image, and a document image whose arrangement order is later than the estimation target document image. Are specified as reference images.

図１２は、属性値推定処理（Ｓ３０４）の説明図である。１２００は、スキャンジョブＪ１の文書画像群である。１２０１には、各文書画像の属性を示し、１２０２には属性値を示している。図１２の例においては、説明の便宜上、患者ＩＤとして患者名に替えて患者番号を属性値として示している。１２１０は、各文書画像に対して、属性値抽出処理（Ｓ３０１）において抽出された属性値を示している。文書画像群１２００のうち、文書画像ｄＩ（ｔ１）、ｄＩ（ｔ３）、ｄＩ（ｔ６）、ｄＩ（ｔ８）は、認識に成功したと判断され、かつ文書タイプが参照タイプの文書画像であるものとする。一方、文書画像ｄＩ（ｔ２）、ｄＩ（ｔ４）、ｄＩ（ｔ５）、ｄＩ（ｔ７）は、認識に成功したと判断され、かつ文書タイプが参照タイプの文書画像、以外の文書であるものとする。 FIG. 12 is an explanatory diagram of the attribute value estimation process (S304). Reference numeral 1200 denotes a document image group of the scan job J1. Reference numeral 1201 denotes an attribute of each document image, and reference numeral 1202 denotes an attribute value. In the example of FIG. 12, for convenience of explanation, the patient number is shown as the attribute value instead of the patient name as the patient ID. Reference numeral 1210 denotes an attribute value extracted in the attribute value extraction process (S301) for each document image. Of the document image group 1200, the document images dI (t1), dI (t3), dI (t6), and dI (t8) are determined to have been successfully recognized, and the document type is a reference type document image. And On the other hand, the document images dI (t2), dI (t4), dI (t5), and dI (t7) are determined to have been successfully recognized, and the document type is a document other than the reference type document image. To do.

以上の前提において、属性値推定処理（Ｓ３０４）における詳細な処理を具体的に説明する。この場合、Ｓ８０３において、属性値推定部１２４は、認識処理に成功し、かつ文書タイプが参照タイプの文書画像のうち、配列順において最初に位置する文書画像ｄＩ（ｔ１）を第１の参照画像に設定する。第１の参照画像が存在するため（Ｓ８０４でＹｅｓ）、属性値推定部１２４は、処理をＳ８０５へ進める。そして、Ｓ８０５において、属性値推定部１２４は、認識処理に成功し、かつ文書タイプが参照タイプの文書画像のうち、文書画像ｄＩ（ｔ１）より後の配列順の文書画像ｄＩ（ｔ３）を第２の参照画像に設定する。第２の参照画像が存在するため（Ｓ８０６でＹｅｓ）、属性値推定部１２４は、処理をＳ８０７へ進める。 Based on the above assumptions, detailed processing in the attribute value estimation processing (S304) will be specifically described. In this case, in S803, the attribute value estimation unit 124 succeeds in the recognition processing, and among the document images whose document type is the reference type, the document image dI (t1) that is first positioned in the arrangement order is the first reference image. Set to. Since the first reference image exists (Yes in S804), the attribute value estimation unit 124 advances the process to S805. In step S805, the attribute value estimation unit 124 selects the document image dI (t3) in the order of arrangement after the document image dI (t1) among the document images having the recognition process and the document type being the reference type. 2 reference image. Since the second reference image exists (Yes in S806), the attribute value estimation unit 124 advances the process to S807.

そして、Ｓ８０７において、属性値推定部１２４は、文書画像ｄＩ（ｔ１）と文書画像ｄＩ（ｔ３）の患者ＩＤ、ａｔｔ＿Ｐ（ｔ１)とａｔｔ＿Ｐ（ｔ３)を比較する。患者ＩＤは両方とも「１２３４５６」で等しいため（Ｓ８０８でＹｅｓ）、属性値推定部１２４は、処理をＳ８０９へ進める。Ｓ８０９において、属性値推定部１２４は、文書画像ｄＩ（ｔ１）と文書画像ｄＩ（ｔ３）の属性値「１２３４５６」を、文書画像ｄＩ（ｔ１）と文書画像ｄＩ（ｔ３）の間の文書画像ｄＩ（ｔ２）の属性値として推定する。なお、文書画像ｄＩ（ｔ３）に対しては抽出結果が得られているものとする。このように、属性値推定部１２４は、抽出結果が得られている場合にも、正しい属性値を推定し、推定結果を文書画像に対応付けるものとする。 In step S807, the attribute value estimation unit 124 compares the patient IDs att_P (t1) and att_P (t3) of the document image dI (t1) and the document image dI (t3). Since both the patient IDs are equal to “123456” (Yes in S808), the attribute value estimation unit 124 advances the process to S809. In step S809, the attribute value estimation unit 124 determines the attribute value “123456” of the document image dI (t1) and the document image dI (t3), and the document image dI between the document image dI (t1) and the document image dI (t3). Estimated as the attribute value of (t2). It is assumed that an extraction result has been obtained for the document image dI (t3). As described above, the attribute value estimation unit 124 estimates the correct attribute value even when the extraction result is obtained, and associates the estimation result with the document image.

その後、Ｓ８１１において、属性値推定部１２４は、文書画像ｄＩ（ｔ３）を第１の参照画像に設定し、処理をＳ８０５へ進め、文書画像ｄＩ（ｔ６）を第２の参照画像に設定する。その後、属性値推定部１２４は、処理をＳ８０７へ進め、両属性値を比較する。両属性値は一致しないので、処理をＳ８１０へ進める。Ｓ８１０において、属性値推定部１２４は、文書画像ｄＩ（ｔ１）と文書画像ｄＩ（ｔ３）の間の文書画像ｄＩ（ｔ４）、文書画像ｄＩ（ｔ５）を推定対象の文書画像として設定する。そして、属性値推定部１２４は、文書画像ｄＩ（ｔ３）の属性値「１２３４５６」及び文書画像ｄＩ（ｔ６）の属性値「６５４３２１」それぞれを、推定対象の文書画像の推定結果として、推定対象の文書画像に対応付ける。これにより、認識に失敗したと判定され、「Ｕｎｋｎｏｗｎ」が設定された文書画像に対して、正しい属性値を推定し、これを対応付けることができる。 Thereafter, in S811, the attribute value estimation unit 124 sets the document image dI (t3) as the first reference image, advances the processing to S805, and sets the document image dI (t6) as the second reference image. Thereafter, the attribute value estimation unit 124 advances the process to S807 and compares both attribute values. Since both attribute values do not match, the process proceeds to S810. In S810, the attribute value estimation unit 124 sets the document image dI (t4) and the document image dI (t5) between the document image dI (t1) and the document image dI (t3) as the estimation target document image. Then, the attribute value estimation unit 124 uses the attribute value “123456” of the document image dI (t3) and the attribute value “654321” of the document image dI (t6) as the estimation result of the estimation target document image. Associate with document image. Thereby, it is determined that the recognition has failed, and a correct attribute value can be estimated and associated with the document image in which “Unknown” is set.

さらに、属性値推定処理（Ｓ３０４）の後、送受信部１２１は、文書画像と、属性値抽出処理（Ｓ３０２）の抽出結果１２１０と、属性値推定処理（Ｓ３０４）の推定結果１２２０と、を対応付けて、クライアント装置１３０に送信する。そして、クライアント装置１３０は、図１２に示すように、文書画像と、抽出結果１２１０と、推定結果１２２０とを対応付けて、表示部２０５に表示する。これにより、ユーザは、抽出結果だけでなく、推定結果を確認し、文書画像の属性値の登録や修正を行うことができる。 Further, after the attribute value estimation process (S304), the transmission / reception unit 121 associates the document image with the extraction result 1210 of the attribute value extraction process (S302) and the estimation result 1220 of the attribute value estimation process (S304). To the client device 130. Then, as illustrated in FIG. 12, the client device 130 displays the document image, the extraction result 1210, and the estimation result 1220 in association with each other on the display unit 205. Thus, the user can confirm not only the extraction result but also the estimation result, and can register or modify the attribute value of the document image.

図１３は、クライアント装置１３０の表示処理部１３１により表示される確認画面の一例を示す図である。確認画面１３００は、クライアント装置１３０が文書画像、抽出結果及び推定結果を受信した後に表示される。１３０１は、処理対象の文書画像群の文書画像を表示する領域である。文書画像は、ユーザが内容を確認するために十分なサイズで表示される。１３０２，１３０３，１３０４は、１３０１に表示されている文書画像から属性値抽出処理（Ｓ３０２）において抽出された属性値を示す領域である。１３０２は、抽出された文書種別を編集可能な状態で表示する領域である。１３０３は、抽出された患者ＩＤを編集可能な状態で表示する領域である。１３０４は、抽出された診療科ＩＤを編集可能な状態で表示する領域である。 FIG. 13 is a diagram illustrating an example of a confirmation screen displayed by the display processing unit 131 of the client device 130. The confirmation screen 1300 is displayed after the client device 130 receives the document image, the extraction result, and the estimation result. Reference numeral 1301 denotes an area for displaying document images of a document image group to be processed. The document image is displayed in a size sufficient for the user to confirm the contents. Reference numerals 1302, 1303, and 1304 denote areas indicating attribute values extracted in the attribute value extraction process (S302) from the document image displayed in 1301. An area 1302 displays the extracted document type in an editable state. An area 1303 displays the extracted patient ID in an editable state. An area 1304 displays the extracted department ID in an editable state.

１３０５，１３０６は、同一の文書画像群に含まれる複数の文書画像のサムネイルを表示する領域である。１３０５は、抽出された属性値がユーザにより確認され、文書画像とその属性値とが既に登録されている文書画像を示すサムネイル画像である。１３０６は、抽出した属性値の確認が完了していない文書画像、すなわち未登録の文書画像を示すサムネイル画像である。登録済み文書画像のサムネイル画像１３０５には、未登録文書画像のサムネイル画像１３０６と区別するためのチェックマークアイコンが表示される。 Reference numerals 1305 and 1306 denote areas for displaying thumbnails of a plurality of document images included in the same document image group. Reference numeral 1305 denotes a thumbnail image indicating a document image in which the extracted attribute value is confirmed by the user and the document image and its attribute value are already registered. Reference numeral 1306 denotes a thumbnail image indicating a document image whose extracted attribute value has not been confirmed, that is, an unregistered document image. In the thumbnail image 1305 of the registered document image, a check mark icon for distinguishing it from the thumbnail image 1306 of the unregistered document image is displayed.

１３０７は、属性値推定処理（Ｓ３０４）の推定結果を示す領域である。１３０７が選択されると、指示受付部１３２は、１３０７に表示中の属性値への修正指示を受け付ける。なお、図１３においては、１つの属性値に対して１つの推定結果のみを示しているが、複数の推定結果が得られた場合には、複数の推定結果を表示してもよい。１３０８は、１３０１に表示されている文書画像の登録指示を入力するためのボタンである。すなわち、１３０８が選択されると、指示受付部１３２は、表示中の文書画像及び属性値の登録指示を受け付ける。 Reference numeral 1307 denotes an area indicating the estimation result of the attribute value estimation process (S304). When 1307 is selected, the instruction receiving unit 132 receives a correction instruction for the attribute value being displayed in 1307. In FIG. 13, only one estimation result is shown for one attribute value, but when a plurality of estimation results are obtained, a plurality of estimation results may be displayed. Reference numeral 1308 denotes a button for inputting a registration instruction for the document image displayed in 1301. That is, when 1308 is selected, the instruction receiving unit 132 receives a registration instruction for the document image being displayed and the attribute value.

確認画面１３００は、ユーザがクライアント装置１３０上でスキャン情報を登録するアプリケーションを立ち上げた際に表示される。なお、初期状態においては、処理対象の文書画像群のうち、１番目の配列順の文書画像が１３０１に表示され、１番目の文書画像の属性値が１３０２，１３０３，１３０４に表示される。図１３の例では、文書種別の抽出結果には属性値の抽出に失敗したことを示す「Ｕｎｋｎｏｗｎ」が表示されており、患者ＩＤには正しい患者ＩＤ「山田太郎」が、診療科には正しい診療ＩＤ「耳鼻咽喉科」が表示されている。さらに、図１３の例では、１３０２の抽出結果に対し、推定結果が存在するため、１３０７に推定結果が、「修正候補」という見出しのリスボックスで表示される。 The confirmation screen 1300 is displayed when the user launches an application for registering scan information on the client device 130. In the initial state, the first document image in the order of arrangement in the group of document images to be processed is displayed in 1301, and the attribute values of the first document image are displayed in 1302, 1303, and 1304. In the example of FIG. 13, “Unknown” indicating that the extraction of the attribute value has failed is displayed in the document type extraction result, the correct patient ID “Taro Yamada” is correct for the patient ID, and correct for the clinical department The medical treatment ID “Otolaryngology” is displayed. Further, in the example of FIG. 13, since there are estimation results for the extraction results of 1302, the estimation results are displayed in a list box 1307 entitled “Correction candidates”.

１３０２，１３０３，１３０４に表示されている属性値が、１３０１に表示されている文書画像中に表示される属性値と異なる場合は、ユーザは、それぞれの領域に表示されている属性値を編集して書き換える。さらに、ユーザがリストボックス１３０７内の修正候補「問診票」を選択することにより、１３０２の内容「Ｕｎｋｎｏｗｍ」を１３０７の内容「問診票」に書き変えることもできる。 When the attribute values displayed in 1302, 1303, and 1304 are different from the attribute values displayed in the document image displayed in 1301, the user edits the attribute values displayed in the respective areas. And rewrite. Further, when the user selects a correction candidate “interview form” in the list box 1307, the content “Unknown” of 1302 can be rewritten to the content “interview form” 1307.

ユーザが、１３０２、１３０３、１３０４に表示されている属性値に誤りがないことを確認して登録ボタン１３０８を押下したとする。この場合、１３０６に表示されていた未登録であることを示す、１３０１の文書画像に対応する文書サムネイルの表示状態は、登録済みであることを示す１３０５の表示状態へ移行する。さらに、１３０１には、次の配列順の文書画像が表示され、ユーザは、未登録の文書画像をすべて確認、登録するまで、同様の確認作業を繰り返す。 Assume that the user confirms that the attribute values displayed in 1302, 1303, and 1304 are correct and presses the registration button 1308. In this case, the display state of the document thumbnail corresponding to the document image 1301 indicating that it is not registered displayed in 1306 shifts to the display state 1305 indicating that it is already registered. Further, a document image in the next arrangement order is displayed in 1301, and the user repeats the same confirmation operation until all unregistered document images are confirmed and registered.

このように、文書画像から認識技術によって抽出した属性値だけでなく、推定結果としての属性値をユーザに提示し、ユーザが確認した上で登録させることにより、文書画像に誤った属性値が付与されるリスクを減らすことができる。また、認識技術により抽出に失敗した属性値に対して他の属性値を推奨し、ユーザに選択させることにより、ユーザの編集操作の負荷を低減させることができる。 In this way, not only the attribute value extracted from the document image by the recognition technique but also the attribute value as the estimation result is presented to the user, and the user confirms and registers the attribute value, thereby giving the document image an incorrect attribute value. The risk of being lost. In addition, it is possible to reduce the load of the user's editing operation by recommending other attribute values for the attribute values that have failed to be extracted by the recognition technique and causing the user to select them.

また、このとき、送受信部１３３は、１３０１に表示されている文書画像の登録指示をサーバ装置１２０に送信する。そして、サーバ装置１２０は、登録指示を受信すると、図３に示す文書画像登録処理において、処理をＳ３０８へ進める。さらに、サーバ装置１２０は、Ｓ３０８において、確認画面１３００におけるユーザによる修正の有無と、修正後の属性値とを、文書画像に対応付けて修正履歴として記録する。 At this time, the transmission / reception unit 133 transmits a registration instruction for the document image displayed in 1301 to the server apparatus 120. Then, when receiving the registration instruction, the server apparatus 120 advances the process to S308 in the document image registration process shown in FIG. Further, in S308, the server device 120 records the presence / absence of correction by the user on the confirmation screen 1300 and the corrected attribute value as a correction history in association with the document image.

図１４は、修正履歴テーブルのデータ構成の一例を示す図である。修正履歴テーブル１４００は、リレーショナルデータベースのテーブルである。修正履歴テーブル１４００は、例えばＨＤＤ２０４等の記憶部に格納されている。１４０１は、各文書画像を示す文書画像名が格納される列である。１４０２は、文書画像が属する文書画像群の属性パターンを示す列である。１４０３は、文書タイプを示す列である。文書タイプは、文書画像のレイアウトの特徴等から判定され格納される。１４０４は、属性値抽出処理（Ｓ３０２）において抽出された属性値としての患者ＩＤを格納する列である。なお、ここでは、図１２に対応して患者ＩＤとして患者番号を示している。１４０５は、属性値抽出処理（Ｓ３０２）において抽出された属性値に対する修正の有無と、修正後の属性値とを格納する列である。すなわち、修正がなく属性値抽出処理（Ｓ３０２）において抽出された属性値が文書画像に対応付けて登録された場合には、「修正なし」が格納される。また、修正が行われた場合には、修正後の患者ＩＤが格納される。なお、図１４には、患者ＩＤの修正履歴のみを示しているが、サーバ装置１２０は、文書種別、診療科ＩＤに対しても同様のテーブルを記憶しているものとする。 FIG. 14 is a diagram illustrating an example of a data structure of the correction history table. The revision history table 1400 is a relational database table. The correction history table 1400 is stored in a storage unit such as the HDD 204, for example. A column 1401 stores document image names indicating the respective document images. Reference numeral 1402 denotes a column indicating the attribute pattern of the document image group to which the document image belongs. Reference numeral 1403 denotes a column indicating the document type. The document type is determined and stored from the layout characteristics of the document image. A column 1404 stores the patient ID as the attribute value extracted in the attribute value extraction process (S302). Here, a patient number is shown as a patient ID corresponding to FIG. A column 1405 stores whether or not the attribute value extracted in the attribute value extraction process (S302) is corrected, and the corrected attribute value. That is, if there is no correction and the attribute value extracted in the attribute value extraction process (S302) is registered in association with the document image, “no correction” is stored. When correction is performed, the corrected patient ID is stored. 14 shows only the correction history of the patient ID, it is assumed that the server device 120 stores a similar table for the document type and the medical department ID.

図１５は、参照タイプテーブル生成部１２７による参照タイプテーブル生成処理を示すフローチャートである。参照タイプテーブル生成部１２７は、参照タイプテーブル生成処理において、修正履歴テーブル１４００から参照タイプテーブル９００を生成する。Ｓ１５０１において、参照タイプテーブル生成部１２７は、未処理の属性パターンを１つ設定する。次に、Ｓ１５０２において、参照タイプテーブル生成部１２７は、登録済みの文書画像のうち、設定された属性パターンの文書画像を処理対象として設定する。そして、参照タイプテーブル生成部１２７は、文書タイプ毎に、対象とする文書タイプのすべての文書画像に対する、「修正なし」の修正情報が記録されている文書画像の割合を算出する。図１４の例においては、文書タイプＡを対象とした場合、対象の文書タイプ（文書タイプＡ）のすべての文書画像の数が２、「修正なし」の文書画像の数が２であり、修正なしの割合は１となる。一方、文書タイプＢを対象とした場合、対象の文書タイプ（文書タイプＢ）のすべての文書画像の数が２、「修正なし」の文書画像の数が０であり、修正なしの割合は０となる。また、文書タイプＣでは、修正なしの割合は０．５となる。 FIG. 15 is a flowchart showing reference type table generation processing by the reference type table generation unit 127. The reference type table generation unit 127 generates the reference type table 900 from the modification history table 1400 in the reference type table generation process. In step S1501, the reference type table generation unit 127 sets one unprocessed attribute pattern. In step S 1502, the reference type table generation unit 127 sets a document image having a set attribute pattern among the registered document images as a processing target. Then, the reference type table generation unit 127 calculates, for each document type, the ratio of the document image in which the correction information “No correction” is recorded with respect to all the document images of the target document type. In the example of FIG. 14, when the document type A is the target, the number of all document images of the target document type (document type A) is 2, and the number of “uncorrected” document images is 2. The ratio of none is 1. On the other hand, when the document type B is the target, the number of all document images of the target document type (document type B) is 2, the number of “uncorrected” document images is 0, and the ratio of no correction is 0. It becomes. In the document type C, the ratio without correction is 0.5.

次に、Ｓ１５０３において、参照タイプテーブル生成部１２７は、修正なしの割合が閾値以上の文書タイプを参照タイプとして決定する。次に、Ｓ１５０４において、参照タイプテーブル生成部１２７は、すべての属性パターンに対してＳ１５０１〜Ｓ１５０３の処理を行ったか否かを判定する。参照タイプテーブル生成部１２７は、未処理の属性パターンが存在する場合には（Ｓ１５０４でＹｅｓ）、処理をＳ１５０１へ進める。参照タイプテーブル生成部１２７は、未処理の属性パターンが存在しない場合には（Ｓ１５０４でＮｏ）、処理をＳ１５０５へ進める。Ｓ１５０５において、参照タイプテーブル生成部１２７は、各属性パターンにおいて連続する属性種別と、各属性パターンに対してＳ１５０３において決定した参照タイプとを対応付けた参照タイプテーブル９００を生成する。以上で、参照テーブル生成処理が完了する。 In step S 1503, the reference type table generation unit 127 determines a document type for which the ratio of no correction is equal to or greater than a threshold as the reference type. In step S1504, the reference type table generation unit 127 determines whether the processing in steps S1501 to S1503 has been performed on all attribute patterns. If there is an unprocessed attribute pattern (Yes in S1504), the reference type table generation unit 127 advances the process to S1501. When there is no unprocessed attribute pattern (No in S1504), the reference type table generation unit 127 advances the process to S1505. In S1505, the reference type table generation unit 127 generates a reference type table 900 in which attribute types that are consecutive in each attribute pattern are associated with the reference types determined in S1503 for each attribute pattern. This completes the reference table generation process.

以上のように、本実施形態にかかる文書管理システム１００によれば、文書画像の認識処理を利用しつつ、文書画像の正しい属性値を推定することができる。すなわち、文書管理システム１００は、認識処理による抽出に失敗した属性値についても正しい値を推定し、これをユーザに提示することができる。また、文書管理システム１００は、認識処理により抽出された属性値についても、誤りの可能性を推定し、これをユーザに提示することができる。したがって、ユーザは、文書登録時には、表示画面に表示された抽出結果及び推定結果の中から、正しい属性値を選択する操作を行うだけでよく、ユーザによる編集作業の負荷を軽減することができる。 As described above, according to the document management system 100 according to the present embodiment, it is possible to estimate a correct attribute value of a document image while using the document image recognition process. That is, the document management system 100 can estimate a correct value for an attribute value that has failed to be extracted by the recognition process, and can present this value to the user. Also, the document management system 100 can estimate the possibility of an error for the attribute value extracted by the recognition process and present it to the user. Therefore, at the time of document registration, the user only needs to perform an operation of selecting a correct attribute value from the extraction result and the estimation result displayed on the display screen, and the burden of editing work by the user can be reduced.

なお、第１の実施形態に係る文書管理システム１００の第１の変更例としては、サーバ装置１２０の属性値抽出部１２２は、文字認識に替えて、１次元又は２次元のバーコードに対する認識処理により、属性値を抽出することとしてもよい。 As a first modification of the document management system 100 according to the first embodiment, the attribute value extraction unit 122 of the server device 120 recognizes a one-dimensional or two-dimensional barcode instead of character recognition. Thus, the attribute value may be extracted.

次に、第２の変更例について説明する。例えば、受付や診療科等文書が生成される際にオンデマンドで文書をスキャンする分散スキャン環境が考えられる。このような環境においては、サーバ装置１２０の属性パターン特定部１２３は、スキャンに利用されたデバイス、スキャンが行われた場所、又はスキャンが行われた部署に基づいて、画像文書群の属性パターンを特定してもよい。 Next, a second modification example will be described. For example, there may be a distributed scanning environment in which a document is scanned on demand when a document such as a reception or a department is generated. In such an environment, the attribute pattern specifying unit 123 of the server device 120 determines the attribute pattern of the image document group based on the device used for the scan, the location where the scan was performed, or the department where the scan was performed. You may specify.

具体的には、送受信部１２１は、読取装置１１０から文書画像群と共に、文書画像群の関連情報を取得する。ここで、関連情報には、スキャンに利用されたデバイス、スキャンが行われた場所又はスキャンが行われた部署が示されているものとする。またサーバ装置１２０は、関連情報と属性パターンとを対応付ける属性パターンテーブルを記憶部に記憶しているものとする。そして、属性パターン特定部１２３は、この属性パターンテーブルを参照し、関連情報から属性パターンを特定する。さらに、サーバ装置１２０は、過去に処理対象とした文書画像群の関連情報と、文書画像群に対する属性パターンの特定結果とを対応付けて履歴として記録しておき、この履歴情報に基づいて、属性パターンテーブルを生成してもよい。 Specifically, the transmission / reception unit 121 acquires related information of the document image group together with the document image group from the reading device 110. Here, it is assumed that the related information indicates the device used for the scan, the location where the scan was performed, or the department where the scan was performed. In addition, the server device 120 stores an attribute pattern table that associates related information with attribute patterns in the storage unit. And the attribute pattern specific | specification part 123 specifies this attribute pattern from related information with reference to this attribute pattern table. Further, the server device 120 records the related information of the document image group that has been processed in the past and the attribute pattern identification result for the document image group in association with each other as a history, and based on this history information, the attribute information is recorded. A pattern table may be generated.

また、第３の変更例としては、サーバ装置１２０の属性値抽出部１２２が認識処理の成否を判定するための認識条件は実施形態に限定されるものではない。属性値抽出部１２２は、例えば、認識精度が閾値以上の場合に成功と判定するという認識条件に従い、認識結果の成否を判定してもよい。 As a third modification, the recognition condition for the attribute value extraction unit 122 of the server device 120 to determine the success or failure of the recognition process is not limited to the embodiment. For example, the attribute value extraction unit 122 may determine the success or failure of the recognition result according to the recognition condition that the recognition accuracy is determined to be successful when the recognition accuracy is equal to or greater than a threshold value.

また、第４の変更例としては、クライアント装置１３０は、サーバ装置１２０から文書画像、抽出結果及び推定結果を受信した後に、これらの情報を表示すればよく、そのための具体的な表示態様等は実施形態に限定されるものではない。図１６は、第４の変更例に係る確認画面を示す図である。確認画面１６００において、１６０１は、処理対象の文書画像群に属する文書画像を表示する領域である。図１６（ａ）においては、領域１６０１には、２つの文書画像１６０２，１６０３が表示されているが、領域１６０１に表示される文書画像の数は特に限定されるものではない。本例においては、ユーザ操作に応じて、領域１６０１に表示する文書画像を切り替えることができる。また、領域１６０１には、各文書画像１６０２，１６０３それぞれに対応付けて、各文書画像１６０２，１６０３のスキャン日時１６０４，１６０５が表示されている。 Further, as a fourth modification example, the client device 130 may display these information after receiving the document image, the extraction result, and the estimation result from the server device 120. It is not limited to the embodiment. FIG. 16 is a diagram illustrating a confirmation screen according to the fourth modification example. In the confirmation screen 1600, reference numeral 1601 denotes an area for displaying document images belonging to the document image group to be processed. In FIG. 16A, two document images 1602 and 1603 are displayed in the area 1601, but the number of document images displayed in the area 1601 is not particularly limited. In this example, the document image displayed in the area 1601 can be switched according to a user operation. In an area 1601, scan dates 1604 and 1605 of the document images 1602 and 1603 are displayed in association with the document images 1602 and 1603, respectively.

さらに、文書画像１６０２，１６０３のうち、文字認識に用いられた領域、すなわち属性領域は、マーカ表示される。文字画像１６０２においては、患者ＩＤが認識された領域１６０６、診療科ＩＤが認識された領域１６０７及び文書種別が認識された１６０８がマーカ表示されている。同様に、文書画像１６０３においても、患者ＩＤが認識された領域１６０９、診療科ＩＤが認識された領域１６１０及び文書種別が認識された１６０１１がマーカ表示されている。なお、本例においては、属性領域をマーカ表示することとしたが、表示処理部１３１は、属性領域を他の領域と識別可能に表示すればよく、そのための具体的な処理は実施形態に限定されるものではない。他の例としては、表示処理部１３１は、属性領域の外枠を表示してもよい。また、患者ＩＤが認識された領域１６０６，１６０９、診療科ＩＤが認識された領域１６０７，１６１０及び文書種別が認識された１６０８，１６１１をそれぞれ異なる表示形態で強調することとしてもよい。例えば、患者ＩＤが認識された領域１６０６，１６０９は赤色のマーカが引かれたように強調する。これに対し、診療科ＩＤが認識された領域１６０７，１６１０は青色のマーカが引かれたように強調し、文書種別が認識された１６０８，１６１１は黄色のマーカが引かれたように強調する。このようにすれば、操作者は一見して患者ＩＤ、診療科ＩＤ、文書種別を容易に区別することが可能となる。 Further, in the document images 1602 and 1603, an area used for character recognition, that is, an attribute area is displayed as a marker. In the character image 1602, an area 1606 where the patient ID is recognized, an area 1607 where the department ID is recognized, and 1608 where the document type is recognized are displayed as markers. Similarly, in the document image 1603, an area 1609 where the patient ID is recognized, an area 1610 where the department ID is recognized, and 16011 where the document type is recognized are displayed as markers. In this example, the attribute area is displayed as a marker. However, the display processing unit 131 only needs to display the attribute area so that it can be distinguished from other areas, and specific processing for that purpose is limited to the embodiment. Is not to be done. As another example, the display processing unit 131 may display an outer frame of the attribute area. The areas 1606 and 1609 in which the patient ID is recognized, the areas 1607 and 1610 in which the medical department ID is recognized, and 1608 and 1611 in which the document type is recognized may be highlighted in different display forms. For example, the areas 1606 and 1609 in which the patient ID is recognized are emphasized as if a red marker is drawn. On the other hand, the areas 1607 and 1610 in which the department ID is recognized are emphasized as if a blue marker is drawn, and the areas 1608 and 1611 in which the document type is recognized are emphasized as if a yellow marker is drawn. In this way, the operator can easily distinguish patient ID, department ID, and document type at a glance.

ユーザにより領域１６０１に表示中のいずれかの文書画像が選択されると、選択された文書画像には、選択中であることを示すチェックマークが表示される。図１６の例では、文書画像１６０２が選択され、文書画像１６０２上に、チェックマーク１６１２が表示されている。１６２０は、選択中の文書画像に対する認識結果を表示する認識結果領域である。認識結果領域１６２０には、患者ＩＤ、診療科ＩＤ及び文書種別の認識結果１６２１，１６２２，１６２３が表示されている。 When one of the document images displayed in the area 1601 is selected by the user, a check mark indicating that the document image is being selected is displayed on the selected document image. In the example of FIG. 16, a document image 1602 is selected, and a check mark 1612 is displayed on the document image 1602. Reference numeral 1620 denotes a recognition result area for displaying a recognition result for the selected document image. In the recognition result area 1620, patient ID, medical department ID, and document type recognition results 1621, 1622, and 1623 are displayed.

また、１６３０は、選択中の文書画像に対する登録情報を表示する登録情報領域である。登録情報領域１６３０の表示内容を図１６（ｂ）に示す。登録情報領域１６３０には、患者ＩＤ欄１６３１、診療科ＩＤ欄１６３４及び文書種別欄１６３５が表示されている。各欄１６３１〜１６３５には、初期値として各項の認識結果が表示されるものとする。ユーザは、各欄の内容を確認し、誤りがある場合には、正しい情報を入力することができる。 Reference numeral 1630 denotes a registration information area for displaying registration information for the selected document image. The display contents of the registration information area 1630 are shown in FIG. In the registration information area 1630, a patient ID column 1631, a department ID column 1634, and a document type column 1635 are displayed. In each of the columns 1631 to 1635, the recognition result of each item is displayed as an initial value. The user can check the contents of each column and input correct information if there is an error.

さらに、患者ＩＤ欄１６３１に対応する検索ボタン１６３２が表示され、ユーザは、検索ボタン１６３２を選択することにより、他の患者ＩＤの検索を行うことができる。また、患者ＩＤ欄１６３１に対応付けて患者情報１６３３が表示されている。患者情報１６３３は、患者ＩＤ欄１６３１に表示される患者ＩＤに対応付けられている患者情報である。なお、クライアント装置１３０は、初期値として、又はユーザ操作に応じて患者ＩＤが患者ＩＤ欄１６３１に表示されると、表示された患者ＩＤをサーバ装置１２０に送信する。サーバ装置１２０は、患者ＩＤと患者情報を対応付けた患者ＤＢを有しており、受信した患者ＩＤに対応する患者情報をクライアント装置１３０に送る。そして、クライアント装置１３０は、受信した患者情報を患者情報１６３３として表示する。なお、患者ＤＢは、サーバ装置１２０以外の不図示の装置が備えるものとしてもよい。この場合、クライアント装置は、患者ＤＢを備える装置から患者情報を取得すればよい。 Further, a search button 1632 corresponding to the patient ID column 1631 is displayed, and the user can search for another patient ID by selecting the search button 1632. Further, patient information 1633 is displayed in association with the patient ID column 1631. Patient information 1633 is patient information associated with the patient ID displayed in the patient ID column 1631. The client device 130 transmits the displayed patient ID to the server device 120 when the patient ID is displayed in the patient ID column 1631 as an initial value or according to a user operation. The server device 120 has a patient DB in which a patient ID and patient information are associated with each other, and sends patient information corresponding to the received patient ID to the client device 130. Then, the client device 130 displays the received patient information as patient information 1633. The patient DB may be provided in a device (not shown) other than the server device 120. In this case, the client device may acquire patient information from a device including the patient DB.

日時欄１６３６には、「日時を指定する」のチェック欄１６３７と、「スキャン日時を使う」のチェック欄１６３８が表示されている。ユーザは、いずれかを選択することができる。さらに、ユーザは、「日時を指定する」のチェック欄１６３７を選択した場合には、入力欄１６３８に日時を入力する。コメント欄１６４０には、ユーザが自由に情報を入力することができる。登録ボタン１６４１が選択されると、登録情報領域１６３０の表示内容が登録される。 In the date / time column 1636, a check column 1637 for “specify date / time” and a check column 1638 for “use scan date / time” are displayed. The user can select either. Further, when the user selects the “Specify date and time” check field 1637, the user inputs the date and time in the input field 1638. In the comment field 1640, the user can freely input information. When the registration button 1641 is selected, the display contents of the registration information area 1630 are registered.

さらに、図１７に示すように、拡大枠１７００により文書画像の一部が指定されると、右上の領域１７０１に拡大枠１７００内の文書画像が拡大表示される。これにより、ユーザは、文書画像の内容をより詳しく確認することができる。また、認識結果を示すマーカ表示が行われた状態で文書画像が拡大されるため、細かい箇所の認識結果の適否を操作者は容易に把握することが可能となる。例えば、患者ＩＤの末尾一文字にマーカ表示がされてない場合には操作者は認識ミスと判断することができる。 Further, as shown in FIG. 17, when a part of the document image is designated by the enlargement frame 1700, the document image in the enlargement frame 1700 is enlarged and displayed in the upper right area 1701. Thereby, the user can confirm the content of the document image in more detail. In addition, since the document image is enlarged in a state in which the marker indicating the recognition result is displayed, it is possible for the operator to easily grasp the suitability of the recognition result of a fine part. For example, when the marker is not displayed at the last character of the patient ID, the operator can determine that it is a recognition error.

なお、図１７に示す例では、領域１７０１は図１６における認識結果領域１６２０と同じ位置に設けられているが認識結果領域１６２０と異なる位置に設けることとしてもよい。例えば、領域１７０１を認識結果領域１６２０の横方向または縦方向に並べて表示させることとしてもよいし、領域１７０１をポップアップウインドウに表示させることとしてもよい。認識結果領域１６２０の表示内容と領域１７０１に表示される拡大された文書画像を見比べることにより、文書画像における文字が小さい場合においても認識結果の適否を操作者は容易に把握することが可能となる。 In the example illustrated in FIG. 17, the area 1701 is provided at the same position as the recognition result area 1620 in FIG. 16, but may be provided at a position different from the recognition result area 1620. For example, the area 1701 may be displayed side by side in the horizontal direction or the vertical direction of the recognition result area 1620, or the area 1701 may be displayed in a pop-up window. By comparing the display contents of the recognition result area 1620 with the enlarged document image displayed in the area 1701, the operator can easily grasp the suitability of the recognition result even when the characters in the document image are small. .

また、第５の変更例としては、文書管理システム１００は、複数ページに渡る文書を処理対象としてもよい。図１８（ａ）は、１つの文書が複数ページに渡る文書を含む文書群に対する確認画面を示す図である。ここで、文書画像１８００，１８０１は、いずれも、患者ＩＤ「１２３４５６」、診療科「耳鼻咽喉科」、文書種別「問診票」の認識結果が得られているものとする。また、文書画像１８０２，１８０２は、いずれも、患者ＩＤ「６５４３２１」、診療科「耳鼻咽喉科」、文書種別「同意書」の認識結果が得られているものとする。この場合、ユーザは、属性の等しい文書画像１８００，１８０１を選択した上で、登録情報領域１６３０の表示内容の一括登録指示を入力することができる。この場合、サーバ装置１２０は、クライアント装置１３０を介して、文書画像１８００，１８０１に対する一括登録指示を受信する。そして、サーバ装置１２０は、一括登録指示に従い、文書画像１８００，１８０１それぞれに対し、登録情報領域１６３０の表示内容を対応付けて登録する。 Further, as a fifth modification example, the document management system 100 may process a document extending over a plurality of pages. FIG. 18A is a diagram showing a confirmation screen for a document group including a document in which one document covers a plurality of pages. Here, it is assumed that the document images 1800 and 1801 have obtained recognition results for the patient ID “123456”, the clinical department “otolaryngology”, and the document type “inquiry sheet”. In addition, it is assumed that the document images 1802 and 1802 have obtained the recognition results of the patient ID “654321”, the clinical department “otolaryngology”, and the document type “consent form”. In this case, the user can input a batch registration instruction for display contents in the registration information area 1630 after selecting document images 1800 and 1801 having the same attributes. In this case, the server apparatus 120 receives a batch registration instruction for the document images 1800 and 1801 via the client apparatus 130. Then, according to the batch registration instruction, the server apparatus 120 registers the display contents of the registration information area 1630 in association with each of the document images 1800 and 1801.

また、ユーザが誤って属性値の異なる文書画像１８００，１８０２を選択した上で一括登録指示を入力する場合も考えられる。この場合、サーバ装置１２０は、文書画像１８００と文書画像１８０２の認識結果が異なることから、登録内容に誤りがあると推定する。そして、サーバ装置１２０は、誤りの可能性がある属性種別をクライアント装置１３０に送信する。これに対し、クライアント装置１３０は、誤りの可能性があることを示す情報を表示する。図１８（ｂ）は、患者ＩＤ及び文書種別に誤りの可能性がある場合の認識結果領域１６２０における表示例を示す図である。文書画像１８００，１８０２の認識結果において、患者ＩＤ及び文書種別が異なっている。そこで、表示処理部１３１は、一括登録指示に係る患者ＩＤと共に、「患者ＩＤが誤っている可能性があります」といった誤りの可能性をユーザに通知する情報を表示する。さらに、一括登録指示に係る文書種別と共に、「文書種別が誤っている可能性があります」といった通知情報を表示する。これにより、ユーザは、誤りの可能性を確認することができる。 It is also conceivable that the user erroneously selects document images 1800 and 1802 having different attribute values and inputs a batch registration instruction. In this case, the server device 120 estimates that there is an error in the registered content because the recognition results of the document image 1800 and the document image 1802 are different. Then, the server device 120 transmits to the client device 130 an attribute type that may be erroneous. On the other hand, the client device 130 displays information indicating that there is a possibility of an error. FIG. 18B is a diagram illustrating a display example in the recognition result area 1620 when there is a possibility of an error in the patient ID and the document type. In the recognition results of the document images 1800 and 1802, the patient ID and the document type are different. Therefore, the display processing unit 131 displays information notifying the user of the possibility of an error such as “the patient ID may be incorrect” together with the patient ID related to the batch registration instruction. Further, notification information such as “the document type may be incorrect” is displayed together with the document type related to the batch registration instruction. Thereby, the user can confirm the possibility of an error.

なお、図１８（ｂ）に示す例では、診療科の認識結果の表示は省略しているが、図１６の認識結果領域１６２０と同様に診療科の認識結果を表示することとしてもよい。また、登録しようとする複数の文書画像間において診療科の認識結果が異なる場合には「診療科が誤っている可能性があります」等の警告を表示することとしてもよい。なお、警告が表示された場合であっても登録ボタン１６４１を用いて文書画像を登録できるようにしてもよいし、登録を禁止することとしてもよい。また、異なる患者ＩＤの文書画像が選択されている場合には、図１６に示した認識結果領域１６２０にはスキャン日時が新しい文書画像の認識結果を表示することとしてもよいし、スキャン日時が古い文書画像の認識結果を表示することとしてもよい。なお、異なる患者ＩＤの文書画像が選択されている場合には、登録情報領域１６３０に、患者ＩＤ欄１６３１等にスキャン日時が新しい文書画像の認識結果を表示することとしてもよいし、スキャン日時が古い文書画像の認識結果を表示することとしてもよい。 In the example shown in FIG. 18B, the display of the recognition result of the medical department is omitted, but the recognition result of the medical department may be displayed as in the recognition result area 1620 of FIG. Further, when the recognition result of the medical department is different among a plurality of document images to be registered, a warning such as “the medical department may be wrong” may be displayed. Even when a warning is displayed, the document image may be registered using the registration button 1641, or registration may be prohibited. If document images with different patient IDs are selected, the recognition result area 1620 shown in FIG. 16 may display the recognition result of the document image with a new scan date or the old scan date. The recognition result of the document image may be displayed. When a document image with a different patient ID is selected, a recognition result of a document image with a new scan date may be displayed in the registration information area 1630 in the patient ID column 1631 or the like. The recognition result of the old document image may be displayed.

＜その他の実施形態＞
また、本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（プログラム）を、ネットワーク又は各種記憶媒体を介してシステム或いは装置に供給する。そして、そのシステム或いは装置のコンピュータ（又はＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。 <Other embodiments>
The present invention can also be realized by executing the following processing. That is, software (program) that realizes the functions of the above-described embodiments is supplied to a system or apparatus via a network or various storage media. Then, the computer (or CPU, MPU, etc.) of the system or apparatus reads and executes the program.

以上、上述した各実施形態によれば、文書画像の正しい属性値を推定することができる。 As mentioned above, according to each embodiment mentioned above, the correct attribute value of a document image can be estimated.

以上、本発明の好ましい実施形態について詳述したが、本発明は係る特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 The preferred embodiments of the present invention have been described in detail above, but the present invention is not limited to such specific embodiments, and various modifications can be made within the scope of the gist of the present invention described in the claims.・ Change is possible.

１１０読取装置
１２０サーバ装置
１３０クライアント装置
１２２属性値抽出部
１２４属性値推定部 110 Reading device 120 Server device 130 Client device 122 Attribute value extraction unit 124 Attribute value estimation unit

Claims

Recognition means for extracting attribute values of the document image by recognition processing for each of the plurality of document images belonging to the document image group;
A result judging means for judging whether or not the recognition processing is successful according to the recognition condition;
Document type specifying means for specifying the document type of the document image;
A reference image specifying means for determining, from the document images belonging to the document image group, that the recognition process has been successful, and specifying the document image of which the specified document type is a reference type as a reference image;
And estimating means for estimating the attribute value of the document image determined to have failed in the recognition process based on the attribute value of the document image specified by the reference image specifying means. Information processing device.

Attribute pattern specifying means for specifying an attribute pattern based on continuity of attribute values of the document image group;
A reference type specifying unit for specifying the reference type of the document image group based on the attribute pattern;
The information processing apparatus according to claim 1, wherein the reference image specifying unit specifies the document image of the reference type specified by the reference type specifying unit as the reference image.

The reference image specifying means specifies the reference image based on an arrangement order from the document images determined to have succeeded in the recognition process and specified with the reference type. Item 3. The information processing apparatus according to item 1 or 2.

The reference image specifying means includes the document image whose arrangement order is earlier than the document image for which the recognition process has failed, and the document image whose arrangement order is after the document image for which the recognition process has failed. The information processing apparatus according to claim 3, wherein the information processing apparatus is specified as the reference image.

5. The information processing apparatus according to claim 1, wherein the document type specifying unit specifies the document type based on a feature amount of the document image. 6.

The document image is an image showing a medical document,
6. The device according to claim 1, wherein the recognizing unit extracts at least one of a document type, patient identification information, and clinical department identification information as the attribute value of the document image. Information processing device.

The attribute pattern specifying unit is configured to determine the attribute pattern based on at least one of identification information of a device used for scanning related to generation of the document image group, a location where the scan is performed, and a department where the scan is performed. The information processing apparatus according to claim 2, wherein the information processing apparatus is specified.

The information processing apparatus according to claim 1, further comprising an output unit that outputs a recognition result by the recognition unit and an estimation result by the estimation unit.

An information processing method executed by an information processing apparatus,
A recognition step of extracting attribute values of the document image by a recognition process for each of the plurality of document images belonging to the document image group;
A result determination step for determining whether or not the recognition process is successful according to the recognition condition;
A document type specifying step for specifying a document type of the document image;
A reference image specifying step of determining, from among the document images belonging to the document image group, that the recognition processing has been successful, and specifying the document image whose specified document type is a reference type as a reference image;
An estimation step of estimating the attribute value of the document image determined to have failed in the recognition process based on the attribute value of the document image specified in the reference image specifying step. Information processing method.

Computer
Recognition means for extracting attribute values of the document image by recognition processing for each of the plurality of document images belonging to the document image group;
A result judging means for judging whether or not the recognition processing is successful according to the recognition condition;
Document type specifying means for specifying the document type of the document image;
A reference image specifying means for determining, from the document images belonging to the document image group, that the recognition process has been successful, and specifying the document image of which the specified document type is a reference type as a reference image;
A program for functioning as an estimation unit that estimates the attribute value of the document image determined to have failed in the recognition processing based on the attribute value of the document image specified by the reference image specifying unit.