JP4982587B2

JP4982587B2 - Data entry system and data entry method

Info

Publication number: JP4982587B2
Application number: JP2010106999A
Authority: JP
Inventors: 彰夫古畑; 由明黒沢; 博之水谷; 外志正土橋; 直朗小平; 智久鈴木
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2010-05-07
Filing date: 2010-05-07
Publication date: 2012-07-25
Anticipated expiration: 2030-05-07
Also published as: JP2011237905A

Description

本発明の実施形態は、データエントリシステムおよびデータエントリ方法に関する。 Embodiments described herein relate generally to a data entry system and a data entry method.

例えば申込書などの帳票の画像を画面に表示し、エントリ作業者が、その画面を見ながら、帳票の情報をキーボードで入力（キー入力）してゆく業務のためのデータエントリシステムがある。 For example, there is a data entry system for business in which an image of a form such as an application form is displayed on a screen, and an entry worker inputs information on the form with a keyboard (key input) while viewing the screen.

ＦＡＸ等で受信した紙の帳票の内容をデータ化する際に、キー入力によってエントリすることが一般に行われている。 In general, entry is made by key input when converting the contents of a paper form received by FAX or the like into data.

しかし、この作業は、非熟練者が作業した場合、タイプミスや変換ミスなどの問題が起きやすい。また、例えば住所や名前などで難読文字を含む場合、キーエントリ自体が困難になることもある。 However, this work tends to cause problems such as typographical errors and conversion errors when an unskilled person works. For example, when an obfuscated character is included in an address or name, the key entry itself may be difficult.

このような難読文字の入力に対応する技術として、例えば手書きした文字の画像を文字認識する技術がある。この技術の場合、手書きされた文字だけでなくその後に続く候補文字列または候補単語を提示することで、すべての文字を記入することなく文字入力を行える（例えば特許文献１参照）。 As a technique corresponding to the input of such an obfuscated character, for example, there is a technique for recognizing a handwritten character image. In the case of this technique, character input can be performed without entering all characters by presenting not only handwritten characters but also candidate character strings or candidate words that follow (see, for example, Patent Document 1).

特開平８−９６０８５号公報JP-A-8-96085

上述した従来の技術は、人が記入する文字または単語が正しいという前提であり、キー入力のようにタイプミスを起こす可能性のあるデータエントリ作業には適さないという問題がある。 The above-described conventional technique is based on the premise that a character or word written by a person is correct, and has a problem that it is not suitable for a data entry operation that may cause a typo such as key input.

また、帳票に書かれた文字を光学的（機械的）に読み取って得た画像から文字を認識することは１００％確実であるというわけではなく、誤読が少なからず発生する。このため、誤りが許されないシステムでは、認識した文字の検査に多大な時間と労力を要するという問題がある。 Moreover, it is not 100% certain to recognize a character from an image obtained by optically (mechanically) reading a character written on a form, and misreading occurs in some cases. For this reason, in a system in which no error is allowed, there is a problem that it takes a lot of time and labor to inspect the recognized character.

また、光学的な文字読み取り処理で、例えば帳票の画像の住所記載欄を文字認識したときに、例えば「神奈川県」という文字列を「神寮川県」というように、連続する文字の列の中間の文字を誤読した場合、該当する位置の文字だけを訂正しようとすると、入力の変換回数が多くなってしまい、その作業に多くの手間がかかるという問題がある。 In addition, when optical address reading processing recognizes characters in the address entry field of a form image, for example, the character string “Kanagawa” is changed to a string of consecutive characters such as “Kyodogawa”. When an intermediate character is misread, there is a problem that if only the character at the corresponding position is to be corrected, the number of times of input conversion increases, and the work takes much time.

本発明はこのような課題を解決するためになされたもので、作業者が帳票の画像を見ながらデータエントリ作業を行う上で、入力ミスをカバーしつつ容易にかつ効率良くデータエントリ作業を行えるよう支援することができるデータエントリシステムおよびデータエントリ方法を提供することを目的とする。 The present invention has been made to solve such a problem. When a worker performs data entry work while looking at a form image, the data entry work can be performed easily and efficiently while covering input errors. It is an object of the present invention to provide a data entry system and a data entry method that can support the above.

上記の課題を解決するために、実施形態のデータエントリ装置は、入力装置および表示装置を備えたデータエントリ装置において、キーエントリ対象の文字が記載された帳票を光学的に読み取って得られた画像が記憶された画像記憶部と、前記キーエントリ対象の文字を繋げた文字列の候補の構造体が記憶可能な構造体記憶部と、前記画像記憶部に記憶された画像を読み出し、読み出した画像を文字列に沿って切り出し、切り出した領域の画像から各文字を認識し文字を位置とスコアで紐付けた文字列の構造体を生成し、前記構造体記憶部に記憶する文字認識部と、前記帳票の画像と、前記帳票に記載されている文字を入力するための入力ボックスとを有する画面を前記表示装置に表示し、前記入力ボックスへのキー入力を受け付ける画面制御部と、受け付けた前記キーに関連する文字または文字列の候補を、前記構造体記憶部の構造体から読み出し前記入力ボックスに変換候補として表示、または前記入力ボックスの近傍に参照用の情報として表示するエントリ支援部とを備える。 In order to solve the above-described problem, an image obtained by optically reading a form on which a key entry target character is described in a data entry device including an input device and a display device is provided in the data entry device of the embodiment. Is stored in the image storage unit, a structure storage unit capable of storing a candidate structure of a character string connecting the key entry target characters, and an image stored in the image storage unit. A character recognition unit that recognizes each character from the image of the cut region, generates a character string structure in which the character is associated with a position and a score, and stores the structure in the structure storage unit; A screen including an image of the form and an input box for inputting characters described in the form is displayed on the display device, and a screen system for accepting key input to the input box. And characters or character string candidates related to the received key are read from the structure of the structure storage unit and displayed as conversion candidates in the input box, or displayed as reference information in the vicinity of the input box An entry support unit.

また、実施形態のデータエントリ方法は、入力装置、記憶装置および表示装置を備えたデータエントリ装置におけるデータエントリ方法において、キーエントリ対象の文字が記載された帳票を光学的に読み取って得られた画像を前記記憶装置に記憶するステップと、前記記憶装置に記憶された画像を読み出し、読み出した画像を文字列に沿って切り出し、切り出した領域の画像から各文字を認識し文字を位置とスコアで紐付けた文字列の構造体を生成し、前記記憶装置に記憶するステップと、前記帳票の画像と、前記帳票に記載されている文字を入力するための入力ボックスとを有する画面を前記表示装置に表示し、前記入力ボックスへのキー入力を受け付けるステップと、受け付けた前記キーに関連する文字または文字列の候補を、前記構造体から読み出し前記入力ボックスに変換候補として表示、または前記入力ボックスの近傍に参照用の情報として表示するステップとを有する。 Further, the data entry method of the embodiment is an image obtained by optically reading a form on which a key entry target character is described in the data entry method in a data entry device including an input device, a storage device, and a display device. Are stored in the storage device, the image stored in the storage device is read out, the read image is cut out along the character string, each character is recognized from the image of the cut out region, and the character is linked by the position and the score. A screen having a step of generating a structure of attached character strings and storing the structure in the storage device, an image of the form, and an input box for inputting characters described in the form is displayed on the display device. Displaying a key input to the input box, and characters or character string candidates related to the received key. And a step of displaying the display as a conversion candidate to read the input box from the body, or in the vicinity of the input box as the information for reference.

本発明の一実施形態のデータエントリシステムの構成を示す図である。It is a figure which shows the structure of the data entry system of one Embodiment of this invention. 認識結果記憶部のデータを示す図である。It is a figure which shows the data of a recognition result memory | storage part. 文字認識処理を示すフローチャートである。It is a flowchart which shows a character recognition process. 認識結果構造体の作成動作を示すフローチャートである。It is a flowchart which shows creation operation of a recognition result structure. 認識結果構造体を作成する過程を説明するための図である。It is a figure for demonstrating the process which produces a recognition result structure. 文字候補構造体の作成手順の一例を示す図である。It is a figure which shows an example of the preparation procedure of a character candidate structure. データエントリ画面の一例を示す図である。It is a figure which shows an example of a data entry screen. キーエントリ支援動作を示すフローチャートである。It is a flowchart which shows key entry assistance operation | movement. 認識結果構造体の中から最も尤もらしい文字列から補完文字列を生成する動作を説明するための図である。It is a figure for demonstrating the operation | movement which produces | generates a complement character string from the most likely character string from the recognition result structure. 認識結果構造体の一部分が欠落して作成された場合の例を示す図である。It is a figure which shows the example when a part of recognition result structure is missing and produced. 図１０の認識結果構造体で未確定になることを説明するため図である。It is a figure for demonstrating becoming undecided in the recognition result structure of FIG. 中抜けした文字を補完する動作を示すフローチャートである。It is a flowchart which shows the operation | movement which complements the missing character. 中抜けした文字を補完する動作を説明するための図である。It is a figure for demonstrating the operation | movement which complements the missing character. 中抜けした文字を補完する動作を説明するための図である。It is a figure for demonstrating the operation | movement which complements the missing character. 認識結果構造体の各要素が候補文字およびそれぞれの類似度を保持している様子を示す図である。It is a figure which shows a mode that each element of a recognition result structure hold | maintains a candidate character and each similarity. 入力された文字を利用した場合と認識結果１位を利用した場合の両者の文字列を表示した例を示す図である。It is a figure which shows the example which displayed the character string of both the case where the input character is utilized, and the case where the recognition result 1st place is utilized.

以下、図面を参照して、本発明の一つの実施の形態のデータエントリシステムを詳細に説明する。図１は第１実施形態のデータエントリシステムの構成を示す図である。 Hereinafter, a data entry system according to an embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 is a diagram showing the configuration of the data entry system of the first embodiment.

（第１実施形態）
図１に示すように、この第１実施形態のデータエントリシステムは、画像読取装置または画像入力装置としてのイメージスキャナ１（以下「スキャナ１」と称す）と、例えばテキスト入力やデータエントリ開始操作や終了操作などの指示操作を行うキーボード２などのテキスト入力装置と、データエントリ装置としてのコンピュータ本体３と、モニタ４などの表示装置とを備えている。テキスト入力装置としては、例えばマウスやトラックボールとＧＵＩとの組み合わせで画面上に表示したカナやローマ字のテキストをクリック操作して指定する機器も含まれる。 (First embodiment)
As shown in FIG. 1, the data entry system according to the first embodiment includes an image scanner 1 (hereinafter referred to as “scanner 1”) as an image reading device or an image input device, a text input, a data entry start operation, A text input device such as a keyboard 2 for performing an instruction operation such as an end operation, a computer main body 3 as a data entry device, and a display device such as a monitor 4 are provided. The text input device includes, for example, a device that is specified by clicking and specifying kana or roman text displayed on the screen by a combination of a mouse, a trackball, and a GUI.

スキャナ１は、例えばＣＣＤなどの撮像素子をライン状に配置したＣＣＤラインセンサとこのＣＣＤラインセンサへ帳票を連続して供給するための給紙機構を備えるものである。この他、スキャナ１は、帳票を一枚ずつ読み取るためのフラットベットを備えるものであってもよい。 The scanner 1 includes, for example, a CCD line sensor in which imaging elements such as CCDs are arranged in a line, and a paper feed mechanism for continuously supplying a form to the CCD line sensor. In addition, the scanner 1 may be provided with a flat bed for reading a form one by one.

スキャナ１は、ＣＣＤラインセンサにより読取対象物（申込書などの帳票）をスキャニングし（読み取り）、帳票イメージ（デジタル画像）としてコンピュータ本体３に入力する。帳票は、例えば注文書（図３参照）などのように、文字の記入位置が予め決められており、予め設定された注文書の書式に基づいて帳票イメージ（デジタル画像）を文字認識処理し、注文書に記載された文字のテキストデータが得られるものとする。 The scanner 1 scans (reads) an object to be read (form such as an application form) by a CCD line sensor and inputs it to the computer main body 3 as a form image (digital image). The form has a predetermined character entry position, such as an order form (see FIG. 3), for example, and the form image (digital image) is character-recognized based on a preset order form format. It is assumed that text data of characters described in the purchase order is obtained.

この実施形態では、画像入力装置としてスキャナ１を例示したが、「帳票を光学的に読み取った画像」を入力するものであれば、例えばネットワーク上のサーバやネットワーク回線、その通信インターフェースなども画像入力装置に含まれる。この他、画像入力装置としては、例えばデジタルカメラなど、様々な形態のものを許容する。 In this embodiment, the scanner 1 is exemplified as the image input device. However, if an “image obtained by optically reading a form” is input, for example, a server on a network, a network line, a communication interface thereof, and the like can be input. Included in the device. In addition, various types of image input devices such as a digital camera are allowed.

モニタ４は、コンピュータ本体３から出力されたデータ入力用の画面（図７のデータエントリ画面５０）を表示する。すなわち、モニタ４は、帳票を光学的に読み取った帳票イメージ（デジタル画像）を表示する表示部として機能する。 The monitor 4 displays a data input screen (data entry screen 50 in FIG. 7) output from the computer main body 3. That is, the monitor 4 functions as a display unit that displays a form image (digital image) obtained by optically reading a form.

コンピュータ本体３は、入力受付部３１、メモリ３２、文字認識部３３、文字認識辞書３４、日本語変換辞書としての基本辞書３６、エントリ部３５などを備えている。メモリ３２は、画像記憶部３２ａおよび認識結果記憶部３２ｂを有している。 The computer main body 3 includes an input receiving unit 31, a memory 32, a character recognition unit 33, a character recognition dictionary 34, a basic dictionary 36 as a Japanese conversion dictionary, an entry unit 35, and the like. The memory 32 includes an image storage unit 32a and a recognition result storage unit 32b.

文字認識辞書３４には、文字認識のための一般的な辞書データ（文字の画像（または文字の特徴ベクトル）とテキストデータとを対応付けたもの）が多数格納されている。つまりこの文字認識辞書３４は、文字認識のために利用される辞書である。この文字認識辞書３４とは別に、メモリ３２には例えば「ローマ字」または「かな」等でキー入力されたときに参照される日本語変換のための基本辞書３６が設けられている。基本辞書３６はキー入力された文字のコードとテキストデータとを対応付けたものである。 The character recognition dictionary 34 stores a large number of general dictionary data for character recognition (character images (or character feature vectors) associated with text data). That is, the character recognition dictionary 34 is a dictionary used for character recognition. In addition to the character recognition dictionary 34, the memory 32 is provided with a basic dictionary 36 for Japanese conversion that is referred to when a key is input using, for example, “Romaji” or “Kana”. The basic dictionary 36 associates the character code input by the key with text data.

図７のデータエントリ画面５０の入力ボックス５２〜５５に、カーソルが表示された状態でキーボード２から入力されたキーに対応する日本語の文字または文字列の候補が基本辞書３６から読み出されて入力ボックス５２〜５５に表示される。基本辞書３６は、キーボード２から入力されたキーに対応する変換候補の文字が格納された日本語変換辞書であり、例えば「にほん」と入力することで、「日本」という漢字の文字を出力するためのものである。 Japanese characters or character string candidates corresponding to the keys input from the keyboard 2 in a state where the cursor is displayed in the input boxes 52 to 55 of the data entry screen 50 of FIG. It is displayed in the input boxes 52-55. The basic dictionary 36 is a Japanese conversion dictionary in which conversion candidate characters corresponding to keys input from the keyboard 2 are stored. For example, by inputting “Nihon”, a kanji character “Japan” is output. Is for.

画像記憶部３２ａには、スキャナ１から入力された帳票イメージや文字イメージ領域などが保存される。すなわち、この画像記憶部３２ａには、キーエントリ対象の文字が記載された帳票をスキャナ１が光学的に読み取って得られた画像が記憶される。 The image storage unit 32a stores a form image, a character image area, and the like input from the scanner 1. That is, the image storage unit 32a stores an image obtained by the scanner 1 optically reading a form on which a key entry target character is written.

認識結果記憶部３２ｂには、文字認識部３３により認識された文字認識結果が記憶される。文字認識結果は、例えば図２に示すように、認識した各文字を文字候補として各候補に付加されたスコアと対応付けた文字要素（ノード）の構造体（テーブル）として記憶される。文字要素（ノード）の構造体（テーブル）を認識結果構造体という。 The recognition result storage unit 32b stores the character recognition result recognized by the character recognition unit 33. For example, as shown in FIG. 2, the character recognition result is stored as a structure (table) of character elements (nodes) in which each recognized character is associated with a score added to each candidate as a character candidate. A structure (table) of character elements (nodes) is called a recognition result structure.

この認識結果構造体は、帳票イメージを文字行毎に切り出し、切り出した領域の画像から文字列を認識し、文字列を構成する文字と文字との関係をスコアで紐付けた構造体である。この認識結果構造体の具体的な作成方法については後述する動作説明で詳述する。 This recognition result structure is a structure in which a form image is cut out for each character line, a character string is recognized from an image of the cut-out area, and the relationship between the characters and characters constituting the character string is linked with a score. A specific method of creating the recognition result structure will be described in detail in the operation description to be described later.

メモリ３２には、キーボード２から入力されたテキストデータが保持される。この他、メモリ３２には、読み取り元の帳票に対する帳票イメージのフォーマット情報（これを「帳票フォーマット」という）およびデータエントリ画面５０を構成するための入力フィールド定義情報などが記憶されている。 The memory 32 holds text data input from the keyboard 2. In addition, the memory 32 stores form image format information (hereinafter referred to as “form format”) for the form to be read, input field definition information for configuring the data entry screen 50, and the like.

帳票フォーマットは、読み取り元の帳票に対応して帳票イメージのどの位置にどういった情報が記載されているかといった情報である。すなわち、メモリ３２には、帳票における文字記載部分の配置を含む帳票フォーマットが記憶されている。 The form format is information indicating what information is written in which position of the form image corresponding to the form of the reading source. In other words, the memory 32 stores a form format including the arrangement of character description portions in the form.

すなわち、メモリ３２は、入力されたデータを一時保存すると共に、文字認識部３３およびエントリ部３５がそれぞれの処理を行うための作業領域（データの一時記憶領域）として機能する。メモリ３２および文字認識辞書３４以外の要素（入力受付部３１、文字認識部３３、エントリ部３５など）は、ハードディスク装置にインストールされたプログラムを、コンピュータ本体３に設けられたＣＰＵが実行することで実現される。なお上記各要素はソフトウェアに限らずハードウェアで実現してもよい。 That is, the memory 32 temporarily stores input data and functions as a work area (temporary data storage area) for the character recognition unit 33 and the entry unit 35 to perform respective processes. Elements other than the memory 32 and the character recognition dictionary 34 (input receiving unit 31, character recognition unit 33, entry unit 35, etc.) are executed by a CPU provided in the computer main body 3 by executing a program installed in the hard disk device. Realized. Each of the above elements may be realized not only by software but also by hardware.

入力受付部３１は、スキャナ１から入力される帳票イメージ（デジタル画像）を受け付け、メモリ３２の画像記憶部３２ａに記憶する。また入力受付部３１は、キーボード２からキー入力されたテキストデータ等の入力データを受け付け、エントリ部３５に渡す。 The input receiving unit 31 receives a form image (digital image) input from the scanner 1 and stores it in the image storage unit 32 a of the memory 32. The input receiving unit 31 receives input data such as text data input from the keyboard 2 and passes it to the entry unit 35.

文字認識部３３は、メモリ３２の帳票フォーマットに従い、帳票イメージから文字列のイメージを文字行毎に切り出し、切り出した文字行に対して文字認識辞書３４を参照して文字認識処理を実行し、抽出された各文字行の文字認識結果を用いて文字列を繋げた構造体を構成（作成）し、認識結果記憶部３２ｂに記憶する。なお帳票フォーマットは、データエントリ作業の際にエントリ作業者によるキー操作で入力される場合もある。 In accordance with the form format of the memory 32, the character recognition unit 33 cuts out a character string image from the form image for each character line, performs character recognition processing with reference to the character recognition dictionary 34 for the cut out character line, and extracts the extracted character line. A structure in which the character strings are connected is constructed (created) using the character recognition result of each character line, and is stored in the recognition result storage unit 32b. Note that the form format may be input by a key operation by an entry worker during data entry work.

すなわち、文字認識部３３は、画像記憶部３２ａに記憶された画像を文字列の領域単位に切り出し、切り出した文字列の領域を認識可能な最小単位で切り出し、切り出した最小単位の画像に対して文字認識辞書３４を用いて文字または文字要素を認識し、認識結果の隣り合う文字または文字要素どうしを組み合わせて、文字列の行領域毎に構成した文字列候補の構造体（これを「認識結果構造体」という）を生成し、認識結果記憶部３２ｂへ記憶する。換言すると、この認識結果構造体は、帳票イメージから切り出した文字行の領域より各文字を認識し文字を位置とスコアで紐付け、文字どうしのつながりを示す構造体である。文字どうしのつながりをパスという。 That is, the character recognizing unit 33 cuts out the image stored in the image storage unit 32a in character string region units, cuts out the cut out character string region in a recognizable minimum unit, and extracts the cut out minimum unit image. The character recognition dictionary 34 is used to recognize characters or character elements, and adjacent character or character elements of the recognition result are combined to form a character string candidate structure configured for each row region of the character string (this is referred to as “recognition result”). Structure ”) and is stored in the recognition result storage unit 32b. In other words, this recognition result structure is a structure that recognizes each character from the region of the character line cut out from the form image, associates the character with the position and the score, and indicates the connection between the characters. The connection between letters is called a path.

帳票フォーマットは、帳票イメージ内のどの位置にどの項目の文字イメージ領域が存在するかといった配置データであり、例えば注文書などであれば、注文主の氏名、届け先の住所、注文品の番号、金額、代金などの文字イメージ領域の切り出し範囲を示す座標情報（始点（左上角）と終点（右下角））である。座標情報は、例えば帳票イメージの左上角を基準点（ｘ０，ｙ０など）とする２次元平面の座標である。 The form format is arrangement data such as which item's character image area exists at which position in the form image. For example, in the case of an order form, the orderer's name, delivery address, order number, amount , Coordinate information (start point (upper left corner) and end point (lower right corner)) indicating the cutout range of the character image area such as a price. The coordinate information is, for example, coordinates on a two-dimensional plane with the upper left corner of the form image as a reference point (x0, y0, etc.).

入力フィールド定義情報は、データエントリ画面５０にテキスト入力ボックスを表示する位置を示す座標情報（始点）、入力される文字種（漢字／英数字、半角／全角等）、文字数、禁則文字、色（背景、文字）、入力順序等を定義したものである。入力順序は、複数のテキスト入力ボックスの中でテキストを入力すべき対象のテキスト入力ボックスをアクティブにする順序と、アクティブにされたテキスト入力ボックスに対応する文字イメージ領域（枠）を帳票イメージ上に指し示すための情報などである。 The input field definition information includes coordinate information (start point) indicating the position where the text input box is displayed on the data entry screen 50, the type of characters to be input (kanji / alphanumeric, half-width / full-width, etc.), the number of characters, prohibited characters, color (background , Characters), input order, and the like. The input order is the order of activating the target text input box to which text should be input among multiple text input boxes, and the character image area (frame) corresponding to the activated text input box on the form image. Information for pointing.

エントリ部３５は、入力フィールド定義情報に従って、帳票の画像５１とテキストの入力ボックス５２とを有するデータエントリ画面５０（図７参照）を表示し、データエントリ画面５０の入力ボックス５２に入力されたキーに対応して基本辞書３６および認識結果記憶部３２ｂから読み出され、確定した文字または文字列（テキストデータ）をキーエントリのデータとしてメモリ３２またはハードディスク装置へ保存する。 The entry unit 35 displays a data entry screen 50 (see FIG. 7) having a form image 51 and a text input box 52 in accordance with the input field definition information, and the keys entered in the input box 52 of the data entry screen 50 The character or character string (text data) read out from the basic dictionary 36 and the recognition result storage unit 32b and stored in the memory 32 or the hard disk device as key entry data.

すなわち、エントリ部３５は、帳票の画像５１と、帳票に記載されている文字を入力するための入力ボックス５２とを有するデータエントリ画面５０をモニタ４へ出力（表示）し、データエントリ画面５０の入力ボックス５２へのキー入力を受け付ける画面制御部として機能する。 That is, the entry unit 35 outputs (displays) a data entry screen 50 having a form image 51 and an input box 52 for inputting characters described in the form to the monitor 4. It functions as a screen control unit that accepts key input to the input box 52.

またエントリ部３５は、受け付けたキーに対応する変換候補の文字を基本辞書３６から検索し、入力ボックス５２に表示すると共に、キーに関連する文字または文字列を、認識結果記憶部３２ｂの認識結果構造体から読み出し画面の例えば入力ボックス５２の直下など）に基本辞書３６と同等の変換候補または参照用の情報として表示するエントリ支援部として機能する。 The entry unit 35 searches the basic dictionary 36 for a conversion candidate character corresponding to the received key, displays it in the input box 52, and displays the character or character string associated with the key in the recognition result storage unit 32b. It functions as an entry support unit that displays as a conversion candidate equivalent to the basic dictionary 36 or information for reference on the screen read out from the structure, for example, directly below the input box 52.

以下、図３乃至図６を参照してこのデータエントリシステムの動作を説明する。 The operation of this data entry system will be described below with reference to FIGS.

図３に示すように、例えば注文書４１などの帳票を元文書として、スキャナ１で読み取って得られた帳票イメージ４２をデータエントリ画面５０に表示させながらデータエントリを行う例について説明する。なお注文書４１の他に、例えば保険やカードなどの申込書など、他種の帳票についても同様の処理を行うことができる。またこの例の帳票に限らず本やノートなどの一般文書をデータエントリの処理対象にしてもよい。 As shown in FIG. 3, an example in which data entry is performed while a form image 42 obtained by reading with the scanner 1 is displayed on the data entry screen 50 using a form such as an order form 41 as an original document will be described. In addition to the order form 41, the same processing can be performed for other types of forms such as an application form for insurance or a card. Further, not only the form in this example but also a general document such as a book or a notebook may be set as a data entry processing target.

このデータエントリシステムでは、処理対象の注文書４１の帳票コードがキー入力により指定され、データエントリ開始操作が行われると、スキャナ１により注文書４１がスキャニングされて、帳票イメージ４２がコンピュータ本体３に入力される。帳票コードは、スキャニングの際に、帳票イメージから検出してもよい。 In this data entry system, when the form code of the order form 41 to be processed is designated by key input and a data entry start operation is performed, the order form 41 is scanned by the scanner 1 and the form image 42 is transferred to the computer main body 3. Entered. The form code may be detected from the form image at the time of scanning.

コンピュータ本体３では、スキャナ１から入力された帳票イメージ４２を入力受付部３１が受け付け、画像記憶部３２ａに記憶する。帳票イメージは、例えばＪＰＥＧ形式またはＰＮＧ形式等である。 In the computer main body 3, the input receiving unit 31 receives the form image 42 input from the scanner 1, and stores it in the image storage unit 32a. The form image is, for example, in JPEG format or PNG format.

次に、文字認識部３３は、画像記憶部３２ａに記憶された帳票イメージを読み出し、読み出した帳票イメージに対して文字認識を行い、記入内容の認識結果構造体を作成し、認識結果記憶部３２ｂに記憶しておく。 Next, the character recognition unit 33 reads out the form image stored in the image storage unit 32a, performs character recognition on the read out form image, creates a recognition result structure of the entry contents, and recognizes the recognition result storage unit 32b. Remember it.

ここで、図４乃至図６を参照して文字認識部３３が認識結果構造体を作成する処理を詳述する。文字認識部３３は、画像記憶部３２ａから帳票イメージを読み出し、読み出した帳票イメージから文字行の領域を抽出する（図４のステップＳ１０１）。 Here, the process in which the character recognition unit 33 creates the recognition result structure will be described in detail with reference to FIGS. The character recognition unit 33 reads a form image from the image storage unit 32a, and extracts a character line area from the read form image (step S101 in FIG. 4).

そして、文字認識部３３は、抽出した文字行の領域の画像を２値化する（ステップＳ１０２）。 Then, the character recognition unit 33 binarizes the extracted image of the region of the character line (step S102).

文字認識部３３は、２値化した文字行の画像各々から、文字の候補の位置を切り出し（ステップＳ１０３）、画像中から文字の画像領域の位置を全て抽出する。文字切り出しの方法としては、例えば射影ヒストグラムを参照し、空白部分で切りだすなどの方法を用いる。なお、文字の候補の切り出し手段としては、この他にも例えばエントリ対象となる帳票のモデルを定義し、これに従って文字列の位置を決定するなどしてもよい。 The character recognition unit 33 cuts out the positions of the character candidates from each of the binarized character line images (step S103), and extracts all the positions of the character image areas from the image. As a method for character extraction, for example, a method of referring to a projection histogram and cutting out at a blank portion is used. In addition to this, as a means for extracting character candidates, for example, a model of a form to be entered may be defined, and the position of the character string may be determined in accordance with the model.

次に、文字認識部３３は、抽出した位置に従って文字の画像領域を切り出し、切り出した画像領域の統合処理を行い（ステップＳ１０４）、文字候補を生成し（ステップＳ１０５）、生成した文字候補に対して文字認識を行い（ステップＳ１０６）、認識結果構造体を構成（作成）する（ステップＳ１０７）。 Next, the character recognition unit 33 cuts out an image area of the character according to the extracted position, performs an integration process of the cut out image area (step S104), generates a character candidate (step S105), and generates a character candidate. Character recognition is performed (step S106), and a recognition result structure is constructed (created) (step S107).

図５を参照して上記過程での画像の変化を、順をおって説明する。図５に示すように、抽出した文字行の位置に従い処理対象の文字行画像６１を切り出し、切り出した文字行画像６１に対して大津の方法などにより２値化処理を行う（図５のステップＳ１０２）。なお、２値化処理にはこの他にも例えばKittlerの方法など、既知の２値化手法を用いることもできる。 The change of the image in the above process will be described in order with reference to FIG. As shown in FIG. 5, a character line image 61 to be processed is cut out according to the position of the extracted character line, and binarization processing is performed on the cut out character line image 61 by Otsu's method or the like (step S102 in FIG. 5). ). In addition, for the binarization process, a known binarization method such as the Kittler method can also be used.

次に、２値画像６２を対象とし、文字成分候補を切り出す。具体的には、例えば黒画素の数を行に直行する方向に射影したヒストグラムを作成し、ヒストグラム上で値が０である領域をすべて検出し、領域それぞれの中央を文字成分候補の区切りとする。 Next, character component candidates are extracted from the binary image 62. Specifically, for example, a histogram in which the number of black pixels is projected in a direction perpendicular to the line is created, all areas having a value of 0 on the histogram are detected, and the center of each area is defined as a character component candidate delimiter. .

なおこの他、例えばいわゆるペリフェラル特徴を取得し、その値が大きいところ、あるいは変化の大きいところを候補とするなど、他の手法を用いて候補を取得してもよい。 In addition to this, for example, a so-called peripheral feature may be acquired, and a candidate may be acquired by using another method, such as a place where the value is large or a place where the change is large.

次に、文字認識部３３は、文字成分候補の統合判定（検査）を行い、統合可能なものは統合し、文字候補構造体を作成する（図５のステップＳ１０４）。文字候補構造体の作成手順の一例を図６に示す。 Next, the character recognition unit 33 performs integration determination (inspection) of the character component candidates, integrates those that can be integrated, and creates a character candidate structure (step S104 in FIG. 5). An example of the procedure for creating the character candidate structure is shown in FIG.

図６に示すように、文字候補構造体の各要素は、文字成分候補そのものか、または文字成分候補を複数個連結したものである。まず文字成分候補を全て文字候補であるものとして文字候補構造体（文字どうしのパス）を作成する。 As shown in FIG. 6, each element of the character candidate structure is a character component candidate itself or a plurality of character component candidates connected to each other. First, a character candidate structure (path between characters) is created assuming that all character component candidates are character candidates.

次に、文字候補構造体において、隣接する文字候補どうしが連結可能かどうかを判定し、連結可能な場合はその組み合わせを探索する。なお、連結可能と判定する条件としては、連結した場合の文字成分の縦横比（幅）／（高さ）を基に、（縦横比）＜（閾値）を満たすものとする。 Next, in the character candidate structure, it is determined whether or not adjacent character candidates can be connected. If the character candidates can be connected, the combination is searched. As a condition for determining that connection is possible, it is assumed that (aspect ratio) <(threshold) is satisfied on the basis of the aspect ratio (width) / (height) of the character components when connected.

本例では、閾値を例えば1.2とするが、この値は他の値でもよく、例えば連結した場合に生成される文字候補に対して文字認識を行い、一定の類似度が得られた場合に連結するなど、異なる基準を採用しても良い。 In this example, the threshold value is set to 1.2, for example, but this value may be another value. For example, character recognition is performed on a character candidate generated when concatenated and a certain degree of similarity is obtained. Different criteria may be employed, such as

そして、連結して新たな文字候補を生成する組み合わせについて、それぞれを単独で文字として扱うものと並列にその要素を挿入する。この手順を連結する文字成分の組がなくなるまで繰り返すことにより、文字候補構造体を構成できる。 And about the combination which couple | bonds and produces | generates a new character candidate, the element is inserted in parallel with what treats each as a character independently. A character candidate structure can be constructed by repeating this procedure until there are no more pairs of character components to be connected.

最後に、上記処理により作成した文字候補構造体の各文字候補の領域に対して文字認識を行い、最終的に、図５のように、文字候補どうしの組み合わせを変えて異なるパスで結んだ認識結果構造体６５を作成する。 Finally, character recognition is performed on each character candidate area of the character candidate structure created by the above processing, and finally recognition is performed by changing the combination of character candidates and connecting them with different paths as shown in FIG. A result structure 65 is created.

この例では、文字認識の手法として、例えば部分空間法などを用いている。なお、この他、例えば最近傍法などの既知の異なる文字認識方法を用いてもよい。なお、本例においては、認識結果構造体６５の各要素には、該当する文字候補構造体の要素に対して文字認識を行った結果として取得された文字の情報が含まれるものとする。 In this example, for example, a subspace method is used as a character recognition method. In addition, other known different character recognition methods such as the nearest neighbor method may be used. In this example, each element of the recognition result structure 65 includes character information acquired as a result of performing character recognition on the element of the corresponding character candidate structure.

次に、図７乃至図９を参照してキーエントリ作業の際の動作を説明する。このシステムでは、キーエントリ作業の際のエントリ候補をエントリ作業者に提示する辞書として文字認識結果として認識結果記憶部３２ｂに記憶されている認識結果構造体を用いる。 Next, the operation during the key entry work will be described with reference to FIGS. In this system, a recognition result structure stored in the recognition result storage unit 32b as a character recognition result is used as a dictionary for presenting entry candidates for key entry work to the entry worker.

まず、図７を参照してデータエントリ画面５０について説明する。このシステムの場合、エントリ作業者によるキー操作でデータエントリの開始操作が行われると、エントリ部３５は、データエントリ画面５０をモニタ４へ出力し、モニタ４に、図７に示すようなデータエントリ画面５０が表示される。 First, the data entry screen 50 will be described with reference to FIG. In the case of this system, when a data entry start operation is performed by a key operation by an entry operator, the entry unit 35 outputs a data entry screen 50 to the monitor 4, and the data entry as shown in FIG. A screen 50 is displayed.

データエントリ画面５０には、帳票イメージとして注文書画像５１が画面の左側に表示され、画面の右側に、複数の入力ボックス５２〜５５が縦に並べられて表示される。また各入力ボックス５２〜５５の近傍には、認識結果記憶部３２ｂの認識結果構造体を検索した結果の文字列候補５６が参照用として表示される。文字列候補５６は変換候補として入力ボックス５２〜５５内に表示してもよい。 On the data entry screen 50, a purchase order image 51 is displayed on the left side of the screen as a form image, and a plurality of input boxes 52 to 55 are vertically arranged on the right side of the screen. Further, in the vicinity of each of the input boxes 52 to 55, a character string candidate 56 as a result of searching the recognition result structure in the recognition result storage unit 32b is displayed for reference. The character string candidates 56 may be displayed in the input boxes 52 to 55 as conversion candidates.

データエントリ画面５０に表示された入力ボックス５２〜５５のうち、最上部のものから順にアクティブにされ、エントリ作業者がテキストを入力する際に、注文書画像の対応する文字イメージ領域が枠で囲われて表示される。この例では、始めにアクティブにされる入力ボックス５２は、注文主の氏名を入力するテキスト入力ボックスである。 The input boxes 52 to 55 displayed on the data entry screen 50 are activated in order from the top, and when the entry worker inputs text, the corresponding character image area of the purchase order image is surrounded by a frame. Displayed. In this example, the input box 52 activated first is a text input box for inputting the name of the orderer.

入力ボックス５２にテキストを入力後、エンターキーなどを操作して確定することで、入力ボックス５２に入力されたテキストデータ（入力データ）がエントリ部３５によりメモリ３２に保存されると同時に、直下の入力ボックス５３がアクティブに切り換えられ、住所の入力が可能になる。 By inputting the text in the input box 52 and then confirming it by operating the enter key or the like, the text data (input data) input in the input box 52 is stored in the memory 32 by the entry unit 35, and at the same time The input box 53 is switched to active so that an address can be input.

エントリ部３５は、エントリ作業者からのキー入力を受け付ける。エントリ作業者は帳票に記載されたデータをそのとおり入力し、画像を文字データ化する、いわゆるデータエントリ処理を行うものとする。 The entry unit 35 receives a key input from the entry worker. The entry worker inputs the data described in the form as it is, and performs so-called data entry processing for converting the image into character data.

エントリ部３５は、まず、スキャナ１によって取得された帳票イメージを画像記憶部３２ａから読み出してデータエントリ画面５０内の左側に表示する。本例では、帳票イメージデータを取り込んだコンピュータ本体３の画面上に表示したが、コンピュータ本体３とは別にＬＡＮなどを介して接続した他のデータエントリ専用のコンピュータに表示してもよい。 The entry unit 35 first reads out the form image acquired by the scanner 1 from the image storage unit 32 a and displays it on the left side in the data entry screen 50. In this example, the form image data is displayed on the screen of the computer main body 3 that has taken in. However, it may be displayed on another computer dedicated to data entry that is connected to the computer main body 3 via a LAN or the like.

データエントリ画面５０内の右側の入力ボックス５２に、エントリ作業者がキー入力すると、入力受付部３１は、エントリ作業者からのキー入力を受け付け、エントリ部３５に渡す。 When the entry worker inputs a key into the input box 52 on the right side in the data entry screen 50, the input receiving unit 31 receives the key input from the entry worker and passes it to the entry unit 35.

すると、エントリ部３５は、受け取ったキーをキーワードにして認識結果記憶部３２ｂから認識結果構造体を取得すると共に（図８のステップＳ２０１）、入力されたキーに対応するテキストデータを基本辞書３６から取得する（ステップＳ２０２）。 Then, the entry unit 35 acquires the recognition result structure from the recognition result storage unit 32b using the received key as a keyword (step S201 in FIG. 8), and the text data corresponding to the input key from the basic dictionary 36. Obtain (step S202).

データの取得が終了すると（ステップＳ２０３）、エントリ部３５は、取得したテキストデータと認識結果構造体の文字列とを照合し（ステップＳ２０４）、取得したテキストデータの列（文字列）と一致（マッチ）する文字列を認識結果構造体上で探索（検索）する（ステップＳ２０５）。 When the data acquisition ends (step S203), the entry unit 35 collates the acquired text data with the character string of the recognition result structure (step S204), and matches the acquired text data string (character string) ( A character string to be matched is searched (searched) on the recognition result structure (step S205).

探索の結果、一致する部分が認識結果構造体上に存在した場合（ステップＳ２０５のＹｅｓ）、エントリ部３５は、認識結果構造体の一致する部分の続きの部分に対して最も可能性の高い文字列を決定し、その文字列を入力候補として補完して（ステップＳ２０６）、入力ボックス５２に表示する（ステップＳ２０７）。なお一致する部分の続きの部分に対して最も可能性の高い文字列を、入力されたものとは別に入力ボックス５２の下に表示し、エントリ作業者に提示してもよい。 As a result of the search, if there is a matching part on the recognition result structure (Yes in step S205), the entry unit 35 determines the most likely character for the subsequent part of the matching part of the recognition result structure. A column is determined, and the character string is complemented as an input candidate (step S206) and displayed in the input box 52 (step S207). Note that the most likely character string for the continuation of the matching part may be displayed below the input box 52 separately from the input and presented to the entry worker.

なお、本例における最も可能性の高い文字列の決定方法としては、文字候補の画像としての縦横比を用い、平均縦横比が最も１に近くなるパス上の文字を繋げた文字列を前記最も可能性の高い文字列とするものとする。 Note that the most probable method for determining a character string in this example is to use an aspect ratio as an image of a character candidate, and use a character string that connects characters on a path whose average aspect ratio is closest to 1 to the character string. It is assumed that the character string is highly likely.

以下、本例では、エントリ作業者が、データエントリ画面５０の「お届け先」の入力ボックス５３（入力フィールド）に、文字列を『神奈川』まで入力した場合の例で説明する。 Hereinafter, in this example, an example in which the entry worker inputs a character string up to “Kanagawa” in the “Destination” input box 53 (input field) of the data entry screen 50 will be described.

エントリ部３５は、まず、認識結果記憶部３２ｂの認識結果構造体の中で、今回キー入力された文字列である『神奈川』を検索する。なお、文字列を検索する場合には、入力された文字列は文字列画像先頭からのものであること、および検索結果が認識結果構造体内で連続であること、を前提とし、探索開始点を認識結果構造体の先頭として探索する。 First, the entry unit 35 searches the recognition result structure in the recognition result storage unit 32b for “Kanagawa”, which is a character string key-inputted this time. When searching for a character string, it is assumed that the input character string is from the beginning of the character string image and that the search result is continuous in the recognition result structure. Search as the head of the recognition result structure.

本例においては、図９内の黒太枠で示す位置が一致する。これに並行するパス、すなわち文字列『神奈川』に含まれる文字成分候補が利用されている全てのパスは、正解である可能性がないため、候補から削除する。本例では、図９内の薄色表示になった部分が該当する。 In this example, the positions indicated by the thick black frames in FIG. 9 match. Paths parallel to this, that is, all paths in which the character component candidates included in the character string “Kanagawa” are used are not likely to be correct, and are deleted from the candidates. In this example, the portion that is displayed in light color in FIG.

但し、例えば郵便番号と併記される住所など確定してもよい部分を含む場合には、探索開始点を確定できない部分の先頭として同様に処理してもよい。上記の例であれば、住所は市町村まではエントリの必要なく確定できるため認識結果をそのまま採用し、エントリは街区の先頭から行うようにしてもよい。 However, for example, when a portion that can be confirmed such as an address written together with a zip code is included, the search start point may be processed in the same manner as the head of the portion that cannot be confirmed. In the above example, since the address can be determined up to the municipality without the need for entry, the recognition result may be adopted as it is, and entry may be made from the head of the block.

さらに、上記の場合でも、探索の開始点として上記２箇所を設定し、街区までを確信度の高い部分、街区以下を確信度の低い部分として、どちらからでも開始でくるようにしてもよい。この場合、確信度の低い部分から入力が開始された場合には、確信度の高い部分は認識結果を自動的に補完するようにしてもよい。 Further, even in the above case, the above-mentioned two places may be set as the search starting points, and it may be started from either of the areas with a high degree of certainty up to the block and with a low degree of certainty below the block. In this case, when the input is started from a portion with a low certainty factor, the recognition result may be automatically complemented with a portion with a high certainty factor.

最後に、残りの部分に対して最も尤もらしい候補を連結することで補完文字列を生成する。この例においては、平均縦横比が最も１に近くなるパスを最も尤もらしいパスとし、そのパス上の文字列を連結した『県川崎市』を補完文字列とする。 Finally, a complementary character string is generated by concatenating the most likely candidates for the remaining portions. In this example, the path with the average aspect ratio closest to 1 is set as the most likely path, and “prefecture Kawasaki city” obtained by concatenating the character strings on the path is set as the supplementary character string.

最後に、入力された文字列『神奈川』に補完文字列『県川崎市』を連結した『神奈川県川崎市』を入力候補文字列として提示する。この入力候補文字列が正しい場合、エントリ作業者は上記を確定するのみでエントリを終了することができ、データエントリを容易にかつ効率よく行うことができる。 Finally, “Kawasaki City, Kanagawa Prefecture”, which is obtained by concatenating the input character string “Kanagawa” with the complementary character string “Kawasaki City, Kanagawa”, is presented as an input candidate character string. If this input candidate character string is correct, the entry operator can end the entry only by confirming the above, and data entry can be performed easily and efficiently.

このようにこの第１実施形態のデータエントリシステムによれば、事前にエントリすべき帳票の認識を行って構成した認識結果構造体を、エントリ作業者がキーエントリする際の予測用辞書として用いることで、非熟練者であってもエントリ作業を容易にかつ効率よく行うことができる。またエントリ作業者がキー入力を間違えた場合でも、帳票の画像を用いて作成した認識結果構造体から該当する文字候補を表示するので、入力ミスをカバーできる。 As described above, according to the data entry system of the first embodiment, the recognition result structure configured by recognizing the form to be entered in advance is used as a prediction dictionary when the entry worker performs key entry. Thus, even an unskilled person can easily and efficiently perform entry work. Even if the entry operator makes a mistake in key input, the corresponding character candidate is displayed from the recognition result structure created using the form image, so that input mistakes can be covered.

（第２実施形態）
以下、図１０乃至図１４を参照して第２実施形態を説明する。第２実施形態としては、キー入力された文字列の中に、認識結果記憶部３２ｂの認識結果構造体に含まれない文字が存在する場合の処理を説明する。 (Second Embodiment)
Hereinafter, the second embodiment will be described with reference to FIGS. 10 to 14. As the second embodiment, a process will be described in the case where a character that is not included in the recognition result structure of the recognition result storage unit 32b exists in the character string that is key-input.

図１０に示す認識結果構造体において、例えば『神奈川県』の『川』の部分が欠落してしまった場合を例にして説明する。なお、エントリ部３５の前まで処理は上述した説明と同様のため、設明を省略する。 In the recognition result structure shown in FIG. 10, for example, a case where the “river” portion of “Kanagawa Prefecture” is missing will be described as an example. Since the processing up to the entry unit 35 is the same as described above, the description is omitted.

まず、エントリ作業者が入力ボックス５３に、図１１に示すように『神奈川』までキー入力したものとする。これまでの実施例と同様に認識結果構造体内から『神奈川』を検索したものとしても、前述の通り検索結果は認識結果構造体内で連続している必要があるが、この例では『奈』には『１』と『い』だけしかパスはなく、『川』という文字に続くパスはない。このため、キー入力された『神奈川』のうち、最後の『川』という文字が認識結果構造体とマッチせず、入力ボックス５３の情報が確定しない。 First, it is assumed that the entry worker has entered keys in the input box 53 up to “Kanagawa” as shown in FIG. As in the previous examples, even if “Kanagawa” is searched from within the recognition result structure, the search results need to be continuous within the recognition result structure as described above. Has only “1” and “I” paths, and no path following the word “river”. For this reason, among the “Kanagawa” key input, the last character “river” does not match the recognition result structure, and the information in the input box 53 is not fixed.

この場合、エントリ部３５は、各文字で認識結果構造体と位置合わせを行い、マッチする部分の有無を確かめる。 In this case, the entry unit 35 performs alignment with the recognition result structure for each character, and checks whether there is a matching portion.

この例では、『神奈』の部分が認識結果構造体とマッチするので、まずマッチする『神奈』の部分に該当する部分を上記実施例と同様に確定させる。 In this example, since the “KANA” part matches the recognition result structure, the part corresponding to the matching “KANA” part is first determined in the same manner as in the above embodiment.

『川』の部分の欠落によって認識結果構造体のデータとマッチしないことから、『川』が確定しない。このような例の場合の処理方法として、キー入力を優先する方法とその後にキー入力された文字から穴埋めする方法の２つの方法があり、以下にそれぞれ記載する。 The lack of the “river” part does not match the data in the recognition result structure, so the “river” is not fixed. There are two methods of processing in such an example, a method of giving priority to key input and a method of filling a hole from a character input thereafter, which will be described below.

まず、キー入力を優先する方法について説明する。この場合、エントリ部３５は、補完処理を行わず、図１１に示すように『神奈川』まで入力されたキーに対応するテキストデータを基本辞書３６から読み出してキー入力されたとおり『神奈川』を入力ボックス５３に表示する。エントリ作業者がこの表示内容を見て、変換の正否を判断し、正しいと判断した場合は確定操作を行うことで『神奈川』を確定する。 First, a method for giving priority to key input will be described. In this case, the entry unit 35 does not perform the complement processing, and reads the text data corresponding to the key input up to “Kanagawa” from the basic dictionary 36 and inputs “Kanagawa” as the key is input, as shown in FIG. Displayed in box 53. The entry worker looks at this display content to determine whether conversion is correct or not, and when it is determined to be correct, confirms “Kanagawa” by performing a confirmation operation.

次に、欠落している文字を補完により穴埋めする方法を説明する。この場合、エントリ作業者により、入力ボックス５３に『神奈川』が入力された後、続けて、入力ボックス５３に『県川崎市…』がキー入力されると、エントリ部３５は、まず各文字で位置合わせを行い（図１２のステップＳ３０１）、未確定部分のうち、キー入力で基本辞書３６から読み出されたテキストデータと認識結果構造体のデータとでマッチする部分があるか否かを判定し（ステップＳ３０２）、マッチする部分がない場合（ステップＳ３０２のＮｏ）は、チェックＮＧとして（ステップＳ３０３）、処理を終了する。 Next, a method of filling a missing character by complementing will be described. In this case, when “Kanagawa” is input to the input box 53 by the entry worker and then “Kawasaki-shi ...” is key-inputted to the input box 53, the entry unit 35 first displays each character. Alignment is performed (step S301 in FIG. 12), and it is determined whether or not there is a matching portion between the text data read from the basic dictionary 36 by key input and the data of the recognition result structure among the unconfirmed portions. If there is no matching part (No in step S302), the check is NG (step S303) and the process is terminated.

またマッチする部分がある場合（ステップＳ３０２のＹｅｓ）、さらにマッチする部分が完全一致するか否かを判定し（ステップＳ３０４）、完全一致しない場合（ステップＳ３０４のＮｏ）、テキストデータの文字列の並び順と文字列の領域の幅から中抜けしている文字があるか否かを判定し（ステップＳ３０５のＮｏ）、中抜けであった場合（ステップＳ３０５のＹｅｓ）、抜けた部分を切り出し再認識する（ステップＳ３０６）。 If there is a matching part (Yes in step S302), it is further determined whether or not the matching part is completely matched (step S304). If there is no perfect matching (No in step S304), the character string of the text data It is determined whether or not there is a missing character from the arrangement order and the width of the character string area (No in step S305). If it is a missing character (Yes in step S305), the missing part is cut out and re-executed. Recognize (step S306).

すなわち、エントリ部３５は、未確定部分のうち、最も確定した部分『神奈』の近くに存在する、マッチする文字候補を検索する。 That is, the entry unit 35 searches for a matching character candidate that exists near the most determined part “Kana” among the unconfirmed parts.

本例の場合、図１３に示すように、新たに太枠となった『県』がマッチする。以下この『県』までがマッチしたものとして、上記第１実施形態と同様に処理を行うことで、欠落している文字（『川』）の存在する範囲が確定し、その範囲を切り出し、その部分を再認識することで、欠落している文字である『川』を導出し、この『川』とキー入力された文字列である『神奈川』の『川』とがマッチしたことで（ステップＳ３０７のＹｅｓ）、マッチングＯＫとして（ステップＳ３０８）、図１４に示すように、『神奈川県川崎市…』という補完処理結果が得られ、認識結果構造体に誤りがあった場合でもキーエントリを正しく行うことができる。 In the case of this example, as shown in FIG. Assuming that “prefecture” has been matched, by performing the same processing as in the first embodiment, the range in which the missing character (“river”) exists is determined, the range is cut out, By re-recognizing the part, the missing character “river” was derived, and this “river” was matched with the “river” of “Kanagawa”, which was the keyed-in character string (step) As shown in FIG. 14, a complementary processing result of “Kawasaki City, Kanagawa Pref.” Is obtained as matching OK (Yes in S307), and even if the recognition result structure has an error, the key entry is correct. It can be carried out.

このようにこの第２実施形態のデータエントリシステムによれば、文字認識結果構造体から読み出したデータに欠落箇所が合った場合に欠落箇所近傍で確定した文字と後続して得られるテキストデータとの関係からその部分を補完するので、事前にエントリすべき帳票の認識を行って構成した文字認識結果構造体に誤りがあった場合でも、キーエントリを正しく行うことができる。 As described above, according to the data entry system of the second embodiment, when the missing part matches the data read from the character recognition result structure, the character determined in the vicinity of the missing part and the text data obtained thereafter are obtained. Since that part is complemented from the relationship, even if there is an error in the character recognition result structure configured by recognizing the form to be entered in advance, the key entry can be performed correctly.

（第３実施形態）
以下、図１５、図１６を参照して第３実施形態を説明する。但し、第３実施形態においては、文字認識部３３が文字認識結果構造体を作成する際に、認識結果文字列ではなく、認識結果候補と各候補の類似度を返すように文字認識部３３を構成し、認識結果構造体の各要素は複数の候補文字、およびそれぞれの類似度を保持するものとする。 (Third embodiment)
Hereinafter, the third embodiment will be described with reference to FIGS. 15 and 16. However, in the third embodiment, when the character recognizing unit 33 creates the character recognition result structure, the character recognizing unit 33 returns the recognition result candidate and the similarity between each candidate instead of the recognition result character string. It is assumed that each element of the recognition result structure holds a plurality of candidate characters and respective similarities.

文字認識部３３が行う文字認識までの処理手順については第１実施形態と同様に実行するものとする。この第３実施形態では、図１５に示すように、認識結果構造体の各要素が候補文字およびそれぞれの類似度を保持している。 The processing procedure up to character recognition performed by the character recognition unit 33 is executed in the same manner as in the first embodiment. In the third embodiment, as shown in FIG. 15, each element of the recognition result structure holds a candidate character and each similarity.

次に、エントリ部３５により、エントリ作業者によるデータエントリを受け付ける。本例では、データエントリ画面５０のお届け先の入力ボックス５３に、エントリ作業者が、例えば『神奈川県川崎市』と入力したものとする。 Next, the entry unit 35 accepts a data entry by an entry worker. In this example, it is assumed that the entry worker has entered, for example, “Kawasaki City, Kanagawa Prefecture” in the destination box 53 of the data entry screen 50.

この場合、エントリ部３５は、まず、認識結果構造体の中で、入力された文字列を検索する。但し、本例においては各文字候補が複数の候補を有しているため、２位以下についても探索を行い、一致した部分の平均順位が最も小さい部分を一致した部分とする。また、これに限らず、例えば平均類似度が最も高くなる部分など、異なる基準で一致した部分を決定してもよい。 In this case, the entry unit 35 first searches the input character string in the recognition result structure. However, since each character candidate has a plurality of candidates in this example, a search is performed for the second and lower ranks, and the portion with the lowest average rank of the matched portions is determined as the matched portion. Further, the present invention is not limited to this, and for example, a portion that matches on different criteria, such as a portion having the highest average similarity, may be determined.

本例では、図１６内に黒太枠で示す部分が一致する。次に、入力内部に１位候補ではない文字を用いている例がないか探索する。本例においては、『倚』が該当する。存在する場合、図１６の最下段に示すように、「入力された文字を利用した場合」と「認識結果１位を利用した場合」の両者の文字列を表示し、正しいほうを選択できるようにする。 In this example, the portions indicated by thick black frames in FIG. Next, the input is searched for an example using a character that is not the first candidate. In this example, “倚” corresponds. If it exists, as shown at the bottom of FIG. 16, the character strings of both “when using the input character” and “when using the first recognition result” are displayed so that the correct one can be selected. To.

なお本例では文字列を単純に並べて表示しているが、該当する部分のみ２重に提示してもよく、その際には例えば色を変えるなどして分かりやすくしてもよい。 In this example, the character strings are simply arranged and displayed. However, only the corresponding part may be presented in a double manner. In this case, for example, the color may be changed for easy understanding.

また１位候補ではない文字を用いている場合、該文字候補において入力された文字での類似度が認識候補１位の類似度と十分に近い場合などには上記のような表示を行わず、無条件に入力された文字を採用するようにしてもよい。上記により、エントリ作業者は、誤読を起こすような誤りやすい文字であっても確実に正解を入力することが可能となる。 In addition, when a character that is not the first candidate is used, when the similarity of the character input in the character candidate is sufficiently close to the first candidate of the recognition candidate, the above display is not performed. You may make it employ | adopt the character input unconditionally. As described above, the entry worker can surely input the correct answer even if it is an error-prone character that causes misreading.

また、文字認識結果とエントリ作業の結果が比較されているため、入力作業を行うことで文字認識結果の確認作業が同時に行われており、この結果高い精度の入力を行うことができる。 In addition, since the character recognition result and the entry work result are compared, the input work is performed to confirm the character recognition result at the same time. As a result, high-accuracy input can be performed.

なお、上述した実施形態では入力を開始した時点で補完処理を開始しているが、入力を開始していない場合にも補完処理を行って表示し、エントリ作業者に、補完した結果を最初から提示するようにしてもよい。この処理を行う場合に表示する文字列は、認識結果構造体全体から求めた最も尤もらしいパス上の文字列とすればよい。 In the above-described embodiment, the supplement processing is started when input is started. However, even when input is not started, the supplement processing is performed and displayed, and the result of the supplement is displayed to the entry worker from the beginning. You may make it show. The character string displayed when this processing is performed may be a character string on the most likely path obtained from the entire recognition result structure.

補完した結果が正しければそのまま確定するだけでよく、また補完した結果に誤りがある場合はそのまま入力を続ければよいので、簡単なエントリ作業を実現することができる。 If the complemented result is correct, it is only necessary to confirm it as it is, and if there is an error in the complemented result, it is only necessary to continue input, so that a simple entry operation can be realized.

また、これまで説明した例において、文字列の一致探索の開始点は限定されているが、これを全ての認識結果構造体の要素から開始するようにしてもよい。このようにすれば、上記のように入力前から補完結果を提示するような場合に修正が必要な部分を含む文字列のみをエントリする修正を行うことができ、単一文字を入力するよりも容易に修正を行うことが可能になる。 In the example described so far, the starting point of the character string matching search is limited, but this may be started from all elements of the recognition result structure. In this way, it is possible to perform correction by entering only a character string including a portion that needs to be corrected when a complementary result is presented before input as described above, which is easier than inputting a single character. It becomes possible to make corrections.

このようにこの第３実施形態のデータエントリシステムによれば、「入力された文字を利用した場合」と「認識結果１位を利用した場合」の両者の文字列を表示し、正しいほうを選択できるようにすることで、例えば文字列の間の文字が間違っていた場合の修正を選択操作だけで行えるので、エントリ作業を効率的に行うことができる。なお入力された文字そのものを利用する以外に、例えば入力された文字列を変換辞書で変換したものを利用してもよい。 As described above, according to the data entry system of the third embodiment, the character strings of “when the input character is used” and “when the first recognition result is used” are displayed, and the correct one is selected. By making it possible, for example, correction when a character between character strings is wrong can be performed only by a selection operation, so that entry work can be performed efficiently. In addition to using the input character itself, for example, an input character string converted by a conversion dictionary may be used.

上述した各実施形態の結果、エントリ作業者が自身のキー入力ミスをカバーしつつ帳票の画像を見ながらデータエントリ作業を効率良くかつ確実に行えるよう支援することができる。 As a result of the above-described embodiments, it is possible to assist the entry worker so that the data entry operation can be performed efficiently and reliably while looking at the image of the form while covering his / her key input mistake.

なお、本願発明は、上記実施形態のみに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形してもよい。例えば各構成要素を、コンピュータのハードディスク装置などのストレージにインストールしたプログラムで実現してもよい。 In addition, this invention is not limited only to the said embodiment, You may deform | transform a component in the range which does not deviate from the summary in an implementation stage. For example, each component may be realized by a program installed in a storage such as a hard disk device of a computer.

また上記プログラムを、コンピュータ読取可能な電子媒体：electronic mediaに記憶しておき、プログラムを電子媒体からコンピュータに読み取らせることで本発明の機能をコンピュータが実現するようにしてもよい。電子媒体としては、例えばＣＤ−ＲＯＭ等の記録媒体やフラッシュメモリ、リムーバブルメディア：Removable media等が含まれる。 The program may be stored in a computer-readable electronic medium, and the computer may realize the functions of the present invention by causing the computer to read the program from the electronic medium. Examples of the electronic medium include a recording medium such as a CD-ROM, flash memory, and removable media.

さらに、ネットワークを介して接続した異なるコンピュータに構成要素を分散して記憶し、各構成要素を機能させたコンピュータ間で通信することで実現してもよい。 Further, the configuration may be realized by distributing and storing components in different computers connected via a network, and communicating between computers in which the components are functioning.

１…スキャナ、２…キーボード、３…コンピュータ本体、４…モニタ、５０…データエントリ画面、３１…入力受付部、３２…メモリ、３２ａ…画像記憶部、３２ｂ…認識結果記憶部、３３…文字認識部、３４…文字認識辞書、３５…エントリ部、３６…基本辞書。 DESCRIPTION OF SYMBOLS 1 ... Scanner, 2 ... Keyboard, 3 ... Computer main body, 4 ... Monitor, 50 ... Data entry screen, 31 ... Input reception part, 32 ... Memory, 32a ... Image storage part, 32b ... Recognition result storage part, 33 ... Character recognition 34, character recognition dictionary, 35, entry portion, 36, basic dictionary.

Claims

In a data entry device comprising an input device and a display device,
An image storage unit that stores an image obtained by optically reading a form in which characters for key entry are described;
A structure storage unit capable of storing a candidate structure of a character string connecting the key entry target characters;
Reads out the image stored in the image storage unit, cuts out the read image along the character string, recognizes each character from the image of the cut-out area, and generates a character string structure in which the character is linked by position and score A character recognition unit stored in the structure storage unit;
A screen control unit for displaying a screen having an image of the form and an input box for inputting characters described in the form on the display device, and receiving a key input to the input box;
Entry support for reading a character or character string candidate associated with the received key from the structure in the structure storage unit and displaying it as a conversion candidate in the input box or as reference information in the vicinity of the input box A data entry device.

The entry support unit
The key conversion candidate character or character string obtained from the basic dictionary by the search is modified to match the character or character string candidate read from the character string candidate structure in the structure storage unit. 2. The data entry device according to claim 1, wherein the corrected character string is displayed in or near the input box.

The entry support unit
When reading the character or character string related to the key from the character string candidate structure in the structure storage unit, the character or character string corresponding to the input key is complemented, 2. The data entry device according to claim 1, wherein the complemented character or character string is displayed in or near the input box.

In a data entry method in a data entry device including an input device, a storage device, and a display device,
Storing in the storage device an image obtained by optically reading a form in which characters for key entry are described;
Reads out the image stored in the storage device, cuts out the read image along the character string, recognizes each character from the image of the cut out region, and generates a character string structure in which the character is linked by the position and the score. Storing in the storage device;
Displaying a screen having an image of the form and an input box for inputting characters described in the form on the display device, and receiving a key input to the input box;
A step of reading a candidate of a character or a character string related to the accepted key from the structure and displaying it as a conversion candidate in the input box, or displaying it as information for reference in the vicinity of the input box. Data entry method.