JP2011081454A

JP2011081454A - Information processing apparatus, information processing method, program and recording medium

Info

Publication number: JP2011081454A
Application number: JP2009230922A
Authority: JP
Inventors: Chie Hideishi; 千絵秀石; Chikayuki Koyama; 至幸小山; Mitsuhiro Toya; 充宏斗谷
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2009-10-02
Filing date: 2009-10-02
Publication date: 2011-04-21
Anticipated expiration: 2029-10-02
Also published as: JP5417113B2

Abstract

PROBLEM TO BE SOLVED: To provide an information processing apparatus, along with an information processing method, program and recording medium, capable of correcting an error in a recognized character string to a correct character, without using dictionaries other than recognition dictionaries or word dictionaries. SOLUTION: A matching part 18, based on the similarity calculated from the feature value of a character image and the feature value of characters in a character recognition dictionary 14, extracts candidate characters from the first to the fifth candidates. A character string generation unit 19 generates a recognized result character string consisting of the first candidate characters. A word string generation unit 21 generates a word character string using a word dictionary 24 from the recognized result character string. A replacement decision unit 22 compares the recognized result character string and the word character string and determines whether the character should be replaced or not based on the coincidence of the characters and the similarity of the characters. A replacing part 23 replaces the character determined to be replaced by the replacement decision unit 22 to the character in the word character string and transmits and outputs the replaced recognized result character string to an output device 3. COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、文章などの文字が記載された原稿を光学的に読み取って文字認識を行い、認識した文字を出力する情報処理装置、情報処理方法、プログラムおよび記録媒体に関し、特に認識結果の誤りを訂正することができる情報処理装置、情報処理方法、プログラムおよび記録媒体に関する。 The present invention relates to an information processing apparatus, an information processing method, a program, and a recording medium that optically read a document on which characters such as sentences are described, perform character recognition, and output the recognized characters. The present invention relates to an information processing apparatus, an information processing method, a program, and a recording medium that can be corrected.

光学式文字認識の分野では、文章などの文字が記載された原稿を光学的に読み取って文字認識を行い、認識した文字を出力する光学式文字読取装置などの情報処理装置は、言語処理を用いた後処理を行うことによって文字認識誤りを訂正し、認識精度を向上させるワードコレクト技術を採用している。 In the field of optical character recognition, an information processing device such as an optical character reader that optically reads a document on which characters such as sentences are written, performs character recognition, and outputs the recognized character uses language processing. It employs a word correct technology that corrects character recognition errors and improves recognition accuracy by performing post-processing.

ワードコレクト技術としては、たとえば多くの単語を収録した単語辞書を用意しておき、認識した候補文字によって構成される文字列（以下「認識単語」という）が、その単語辞書に収録される単語と一致するか否かを判定し、一致した場合にはその単語を正しいとみなして、認識単語を単語辞書の一致した単語に置き換える技術が一般的に用いられている。 As a word correction technique, for example, a word dictionary containing many words is prepared, and a character string composed of recognized candidate characters (hereinafter referred to as “recognized words”) is stored in the word dictionary. A technique is generally used in which it is determined whether or not the words match, and when the words match, the word is regarded as correct and the recognized word is replaced with the word matched in the word dictionary.

しかしながら、認識単語を単語辞書に収録されている単語で単純に置き換えるのでは、認識結果が正しくても、単語が単語辞書に収録されていなければ、認識単語が単語辞書に収録される単語と一致しないと判定され、誤コレクトつまり誤った単語に置き換えてしまう可能性がある。 However, simply replacing a recognized word with a word recorded in the word dictionary means that if the recognition result is correct but the word is not recorded in the word dictionary, the recognized word matches the word recorded in the word dictionary. There is a possibility that it will be replaced with an incorrect correction, that is, an incorrect word.

誤コレクトによる認識率低下を防ぐために、認識した候補文字から単語を作成し、作成した単語に類似する単語もしくは完全に一致する単語を単語辞書から検索し、検索した単語と認識単語との類似度を計算することによって、認識単語を出力あるいは単語辞書の単語と置き換えて出力する方法が提案されている（たとえば特許文献１参照）。 In order to prevent the recognition rate from being reduced due to erroneous correction, a word is created from the recognized candidate characters, a word similar to the created word or a completely matching word is searched from the word dictionary, and the similarity between the searched word and the recognized word A method has been proposed in which the recognition word is output or replaced with a word in the word dictionary by calculating (see Patent Document 1, for example).

また、認識される１文字ごとに、たとえば第１位〜第５位の候補文字のように複数の候補文字を抽出し、認識した文字から単語を形成する際に、複数の候補文字を組み合わせて単語を形成することによって、より高い精度で正しい単語に置き換える方法が提案されている(たとえば特許文献２参照)。しかし、この方法においても、複数の候補文字の中に正しい単語を構成することができる文字が含まれていない可能性があり、正しい置き換えを行うことができない場合がある。 For each recognized character, a plurality of candidate characters are extracted, such as the first to fifth candidate characters, and a plurality of candidate characters are combined when a word is formed from the recognized characters. A method of replacing a correct word with higher accuracy by forming a word has been proposed (for example, see Patent Document 2). However, even in this method, there is a possibility that a character that can form a correct word is not included in the plurality of candidate characters, and there is a case where correct replacement cannot be performed.

複数の候補文字の中に正しい文字がない場合にも正しい単語に置き換える方法として、類似する文字を分類した類似性文字辞書を使用して正しい単語に置き換える方法が提案されている。この方法では、複数の候補文字の中に類似性文字辞書の文字に一致する文字がある場合は、その分類に属する類似文字を候補文字に追加する。そして、類似文字が追加された候補文字から単語を形成し、形成した単語と単語辞書の単語とを照合して、一致する単語を正しい文字列として出力する（たとえば特許文献３参照）。 As a method of replacing a correct word even when there is no correct character among a plurality of candidate characters, a method of replacing it with a correct word using a similar character dictionary in which similar characters are classified has been proposed. In this method, when there is a character that matches a character in the similarity character dictionary among a plurality of candidate characters, a similar character belonging to the classification is added to the candidate character. Then, a word is formed from candidate characters to which similar characters are added, the formed word is compared with a word in the word dictionary, and the matching word is output as a correct character string (see, for example, Patent Document 3).

特開平１０−１３４１５０号公報JP 10-134150 A 特開昭５８−４４９０号公報JP 58-4490 A 特開平５−１３５２１１号公報Japanese Patent Laid-Open No. 5-135211

しかしながら、特許文献３に記載される方法のように、類似性文字辞書を用いる場合、文字を認識する文字認識辞書以外に類似性文字辞書をメモリなどの記憶装置に記憶しておく必要があり、記憶装置の記憶容量が増加する。また、辞書に収録される各文字に対して、どれだけの数の類似文字を収録するかにも依存するが、あまりに少ない数では意味をなさないので、より多くの文字を収録する必要があり、辞書を記憶するために多くの記憶容量が必要になるという問題がある。 However, when using a similar character dictionary like the method described in Patent Document 3, it is necessary to store the similar character dictionary in a storage device such as a memory in addition to the character recognition dictionary for recognizing characters. The storage capacity of the storage device increases. Also, depending on how many similar characters are recorded for each character recorded in the dictionary, too few characters make sense, so more characters must be recorded. There is a problem that a large storage capacity is required to store the dictionary.

さらに、類似性文字辞書から抽出される文字の中に正しい文字が必ずしも含まれているとは限らず、それらをすべて候補文字として追加し、単語辞書に存在する単語と類似するもしくは完全に一致する単語を検索して、置き換えるのは処理が複雑になり処理時間も増加することになる。 Furthermore, the characters extracted from the similarity character dictionary do not necessarily include the correct characters, and all of them are added as candidate characters, and are similar or completely coincide with the words existing in the word dictionary. Searching for and replacing a word complicates the processing and increases the processing time.

本発明の目的は、文字認識辞書および単語辞書以外の辞書を用いることなく、認識した文字の誤りを正しい文字に訂正することができる情報処理装置、情報処理方法、プログラムおよび記録媒体を提供することである。 An object of the present invention is to provide an information processing apparatus, an information processing method, a program, and a recording medium that can correct an error of a recognized character to a correct character without using a dictionary other than a character recognition dictionary and a word dictionary. It is.

本発明は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部と、
文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す文字列行切り出し部と、
文字列行切り出し部によって切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す文字切り出し部と、
文字切り出し部によって切り出された文字画像から各文字画像が表す文字の特徴量を抽出する特徴抽出部と、
特徴抽出部によって抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する選択部と、
選択部によって選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する文字列生成部と、
文字列生成部によって生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する単語列生成部と、
文字列生成部によって生成された認識結果文字列と単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出部によって抽出された特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する置き換え判定部と、
置き換え判定部によって置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える置き換え部とを含むことを特徴とする情報処理装置である。 The present invention relates to a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character;
A character string row cutout unit that cuts out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout unit that cuts out one character image representing each character from the character string image cut out by the character string line cutout unit;
A feature extraction unit that extracts a feature amount of a character represented by each character image from the character image cut out by the character cutout unit;
The feature amount extracted by the feature extraction unit is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection unit for selecting at least one from the character recognition dictionary;
A character string generation unit that generates a recognition result character string by combining candidate characters for each character image selected by the selection unit;
The candidate characters constituting the recognition result character string generated by the character string generation unit are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation unit that extracts a character string or a word that matches only some characters and generates a word character string composed of the extracted words;
If there is a different character by comparing the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at the corresponding position, feature extraction is performed for the different characters The feature amount extracted by the unit is compared with the feature amount stored in association with the character that matches the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit. A replacement determination unit that determines to replace a character in the recognition result character string with a corresponding character in the word character string when the comparison result satisfies a predetermined comparison criterion;
An information processing apparatus comprising: a replacement unit that replaces a character in a recognition result character string determined to be replaced by a replacement determination unit with a character at a corresponding position in a word character string.

また本発明は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部と、
文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す文字列行切り出し部と、
文字列行切り出し部によって切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す文字切り出し部と、
文字切り出し部によって切り出された文字画像から各文字画像が表す文字の特徴量を抽出する特徴抽出部と、
特徴抽出部によって抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する選択部と、
選択部によって選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する文字列生成部と、
文字列生成部によって生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する単語列生成部と、
文字列生成部によって生成された認識結果文字列と単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、記憶部に記憶される文字認識辞書に収録された特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する置き換え判定部と、
置き換え判定部によって置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する文字で置き換える置き換え部とを含むことを特徴とする情報処理装置である。 The present invention also includes a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character,
A character string row cutout unit that cuts out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout unit that cuts out one character image representing each character from the character string image cut out by the character string line cutout unit;
A feature extraction unit that extracts a feature amount of a character represented by each character image from the character image cut out by the character cutout unit;
The feature amount extracted by the feature extraction unit is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection unit for selecting at least one from the character recognition dictionary;
A character string generation unit that generates a recognition result character string by combining candidate characters for each character image selected by the selection unit;
The candidate characters constituting the recognition result character string generated by the character string generation unit are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation unit that extracts a character string or a word that matches only some characters and generates a word character string composed of the extracted words;
When there is a different character by comparing the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at the corresponding position, the storage unit Among the feature quantities recorded in the character recognition dictionary stored in the memory, and the feature quantities stored in association with the characters matching the characters in the recognition result character string, and the features recorded in the character recognition dictionary stored in the storage unit The feature amount stored in association with the character that matches the character of the word character string is compared, and when the comparison result satisfies a predetermined comparison criterion, the character in the recognition result character string is A replacement determination unit that determines to replace with a corresponding character;
An information processing apparatus comprising: a replacement unit that replaces a character in a recognition result character string determined to be replaced by a replacement determination unit with a corresponding character in a word character string.

また本発明は、前記置き換え判定部は、前記文字列生成部によって生成された認識結果文字列と前記単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、前記特徴抽出部によって抽出された特徴量と、前記記憶部に記憶される文字認識辞書に収録された特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較して類似度を算出し、算出した類似度が予め定める閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定することを特徴とする。 In the present invention, the replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position. If there are different characters, for the different characters, the character that matches the character of the word character string out of the feature amount extracted by the feature extraction unit and the feature amount recorded in the character recognition dictionary stored in the storage unit The similarity is calculated by comparing with the feature quantity stored in association with each other, and when the calculated similarity is equal to or greater than a predetermined threshold, the character in the recognition result character string is replaced with the corresponding character in the word character string. It is characterized by determining to replace.

また本発明は、前記置き換え判定部は、前記文字列生成部によって生成された認識結果文字列と前記単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、前記記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、前記記憶部に記憶される文字認識辞書に収録された特徴量のうち単語文字列の文字の一致する文字に対応付けて記憶される特徴量とを比較して類似度を算出し、算出した類似度が予め定める閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定することを特徴とする。 In the present invention, the replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position. If there are different characters, for the different characters, the feature amount stored in association with the character that matches the character of the recognition result character string among the feature amounts recorded in the character recognition dictionary stored in the storage unit, and The similarity is calculated by comparing the feature amount recorded in the character recognition dictionary stored in the storage unit with the feature amount stored in association with the character that matches the character of the word character string, and the calculated similarity When the degree is equal to or greater than a predetermined threshold, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string.

また本発明は、前記置き換え判定部は、前記算出した類似度が予め定める閾値未満であるとき、前記単語列生成部が前記単語文字列を生成したときに、前記単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語であって、前記単語文字列に含まれない単語が抽出されていた場合は、前記算出した類似度が、予め定める閾値未満の値である予め定める第２の閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定することを特徴とする。 Further, the present invention provides the replacement determination unit, when the calculated similarity is less than a predetermined threshold, and when the word string generation unit generates the word character string, When a word string that matches all characters, or a word that matches only some characters and is not included in the word character string, the calculated similarity is less than a predetermined threshold value. When the value is equal to or greater than a predetermined second threshold value, the character string in the recognition result character string is determined to be replaced with the corresponding character in the word character string.

また本発明は、前記置き換え判定部は、前記算出した類似度が予め定める閾値未満であるとき、前記異なる文字を含む単語の文字数が予め定める基準文字数以上である場合は、前記算出した類似度が、予め定める閾値未満の値である予め定める第２の閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定することを特徴とする。 In the present invention, when the calculated similarity is less than a predetermined threshold, and the number of characters of the word including the different characters is equal to or greater than a predetermined reference number, the calculated similarity is When it is equal to or more than a second predetermined threshold value that is a value less than a predetermined threshold value, it is determined that a character in the recognition result character string is replaced with a corresponding character in the word character string.

また本発明は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部を含む情報処理装置で、記憶部に記憶される文字認識辞書および単語辞書を用いて文字を認識する情報処理方法であって、
文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す文字列行切り出し工程と、
文字列行切り出し工程で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す文字切り出し工程と、
文字切り出し工程で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する特徴抽出工程と、
特徴抽出工程で抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する選択工程と、
選択工程で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する文字列生成工程と、
文字列生成工程で生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する単語列生成工程と、
文字列生成工程で生成された認識結果文字列と単語列生成工程で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出工程で抽出された特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する置き換え判定工程と、
置き換え判定工程で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える置き換え工程とを含むことを特徴とする情報処理方法である。 Further, the present invention provides an information processing including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character. An information processing method for recognizing characters using a character recognition dictionary and a word dictionary stored in a storage unit in an apparatus,
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
If there are different characters by comparing the recognition result character string generated in the character string generation process and the word character string generated in the word string generation process for each character at the corresponding position, feature extraction is performed for the different characters The feature amount extracted in the process is compared with the feature amount stored in association with the character matching the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit. A replacement determination step for determining that a character in the recognition result character string is replaced with a corresponding character in the word character string when the comparison result satisfies a predetermined comparison criterion;
A replacement step of replacing a character in the recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in the word character string.

また本発明は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部を含む情報処理装置で、記憶部に記憶される文字認識辞書および単語辞書を用いて文字を認識する情報処理方法であって、
文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す文字列行切り出し工程と、
文字列行切り出し工程で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す文字切り出し工程と、
文字切り出し工程で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する特徴抽出工程と、
特徴抽出工程で抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する選択工程と、
選択工程で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する文字列生成工程と、
文字列生成工程で生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する単語列生成工程と、
文字列生成工程で生成された認識結果文字列と単語列生成工程で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する置き換え判定工程と、
置き換え判定工程で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える置き換え工程とを含むことを特徴とする情報処理方法である。 Further, the present invention provides an information processing including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character. An information processing method for recognizing characters using a character recognition dictionary and a word dictionary stored in a storage unit in an apparatus,
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
When there are different characters by comparing the recognition result character string generated in the character string generation step and the word character string generated in the word string generation step for each character at the corresponding position, the storage unit Among the feature quantities recorded in the character recognition dictionary stored in the memory, and the feature quantities stored in association with the characters matching the characters in the recognition result character string, and the characters recorded in the character recognition dictionary stored in the storage unit When the comparison result meets a predetermined comparison criterion, the character in the recognition result character string is converted to the word character string. A replacement determination step for determining to replace with the corresponding character in
A replacement step of replacing a character in the recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in the word character string.

また本発明は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部を含むコンピュータに、
文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す文字列行切り出し工程と、
文字列行切り出し工程で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す文字切り出し工程と、
文字切り出し工程で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する特徴抽出工程と、
特徴抽出工程で抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する選択工程と、
選択工程で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する文字列生成工程と、
文字列生成工程で生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する単語列生成工程と、
文字列生成工程で生成された認識結果文字列と単語列生成工程で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出工程で抽出された特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する置き換え判定工程と、
置き換え判定工程で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える置き換え工程とを実行させるためのプログラムである。 In addition, the present invention provides a computer including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character. ,
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
If there are different characters by comparing the recognition result character string generated in the character string generation process and the word character string generated in the word string generation process for each character at the corresponding position, feature extraction is performed for the different characters The feature amount extracted in the process is compared with the feature amount stored in association with the character matching the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit. A replacement determination step for determining that a character in the recognition result character string is replaced with a corresponding character in the word character string when the comparison result satisfies a predetermined comparison criterion;
This is a program for executing a replacement step of replacing a character in a recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in a word character string.

また本発明は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部を含むコンピュータに、
文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す文字列行切り出し工程と、
文字列行切り出し工程で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す文字切り出し工程と、
文字切り出し工程で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する特徴抽出工程と、
特徴抽出工程で抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する選択工程と、
選択工程で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する文字列生成工程と、
文字列生成工程で生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する単語列生成工程と、
文字列生成工程で生成された認識結果文字列と単語列生成工程で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する置き換え判定工程と、
置き換え判定工程で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える置き換え工程とを実行させるためのプログラムである。
また本発明は、前記プログラムを記録したコンピュータ読取り可能な記録媒体である。 In addition, the present invention provides a computer including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character. ,
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
When there are different characters by comparing the recognition result character string generated in the character string generation step and the word character string generated in the word string generation step for each character at the corresponding position, the storage unit Among the feature quantities recorded in the character recognition dictionary stored in the memory, and the feature quantities stored in association with the characters matching the characters in the recognition result character string, and the characters recorded in the character recognition dictionary stored in the storage unit When the comparison result meets a predetermined comparison criterion, the character in the recognition result character string is converted to the word character string. A replacement determination step for determining to replace with the corresponding character in
This is a program for executing a replacement step of replacing a character in a recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in a word character string.
The present invention is also a computer-readable recording medium on which the program is recorded.

本発明によれば、記憶部は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する。文字列行切り出し部は、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。文字切り出し部は、文字列行切り出し部によって切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。特徴抽出部は、文字切り出し部によって切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。選択部は、特徴抽出部によって抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する。文字列生成部は、選択部によって選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。単語列生成部は、文字列生成部によって生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。置き換え判定部は、文字列生成部によって生成された認識結果文字列と単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出部によって抽出された特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、置き換え部は、置き換え判定部によって置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 According to the present invention, the storage unit stores a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a word dictionary that records a word composed of at least one character. The character string line cutout unit cuts out a character string image representing a character string for each line from an image of a document in which the character string line that is a character string is described. The character cutout unit cuts out one character image representing each character from the character string image cut out by the character string row cutout unit. The feature extraction unit extracts a feature amount of a character represented by each character image from the character image cut out by the character cutout unit. The selection unit compares the feature amount extracted by the feature extraction unit with the feature amount recorded in the character recognition dictionary stored in the storage unit, and each character represented by the character image is recognized as a character represented by the character image. At least one candidate character to be obtained is selected from the character recognition dictionary. The character string generation unit generates a recognition result character string by combining candidate characters for each character image selected by the selection unit. The word string generation unit collates the candidate characters constituting the recognition result character string generated by the character string generation unit with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary And a character string that matches all characters, or a word that matches only some characters, and generates a word character string composed of the extracted words. The replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position, and if there is a different character, a different character For the feature amount extracted by the feature extraction unit and the feature amount stored in association with the character that matches the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit When the comparison result satisfies a predetermined comparison criterion, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. Then, the replacement unit replaces the character in the recognition result character string determined to be replaced by the replacement determination unit with the character at the corresponding position in the word character string.

したがって、文字認識辞書および単語辞書以外の辞書を用いることなく、認識した文字の誤りを正しい文字に訂正することができる。文字認識辞書および単語辞書以外の辞書を必要としないので、記憶容量の増加を回避することができる。 Therefore, it is possible to correct the recognized character error to a correct character without using a dictionary other than the character recognition dictionary and the word dictionary. Since no dictionary other than the character recognition dictionary and the word dictionary is required, an increase in storage capacity can be avoided.

また本発明によれば、記憶部は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する。文字列行切り出し部は、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。文字切り出し部は、文字列行切り出し部によって切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。特徴抽出部は、文字切り出し部によって切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。選択部は、特徴抽出部によって抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する。文字列生成部は、選択部によって選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。単語列生成部は、文字列生成部によって生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。置き換え判定部は、文字列生成部によって生成された認識結果文字列と単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、置き換え部は、置き換え判定部によって置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 Further, according to the present invention, the storage unit stores a character recognition dictionary that records a character and a feature value obtained by quantifying the character feature, and a word dictionary that records a word composed of at least one character. . The character string line cutout unit cuts out a character string image representing a character string for each line from an image of a document in which the character string line that is a character string is described. The character cutout unit cuts out one character image representing each character from the character string image cut out by the character string row cutout unit. The feature extraction unit extracts a feature amount of a character represented by each character image from the character image cut out by the character cutout unit. The selection unit compares the feature amount extracted by the feature extraction unit with the feature amount recorded in the character recognition dictionary stored in the storage unit, and each character represented by the character image is recognized as a character represented by the character image. At least one candidate character to be obtained is selected from the character recognition dictionary. The character string generation unit generates a recognition result character string by combining candidate characters for each character image selected by the selection unit. The word string generation unit collates the candidate characters constituting the recognition result character string generated by the character string generation unit with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary And a character string that matches all characters, or a word that matches only some characters, and generates a word character string composed of the extracted words. The replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position, and if there is a different character, a different character For the feature quantity recorded in the character recognition dictionary stored in the storage unit, the feature quantity stored in association with the character that matches the character of the recognition result character string, and the character recognition dictionary stored in the storage unit Is compared with the feature quantity stored in association with the character that matches the character of the word character string, and when the comparison result satisfies a predetermined comparison criterion, It is determined that the character is replaced with the corresponding character in the word string. Then, the replacement unit replaces the character in the recognition result character string determined to be replaced by the replacement determination unit with the character at the corresponding position in the word character string.

また本発明によれば、前記置き換え判定部は、前記文字列生成部によって生成された認識結果文字列と前記単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、前記特徴抽出部によって抽出された特徴量と、前記記憶部に記憶される文字認識辞書に収録された特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較して類似度を算出し、算出した類似度が予め定める閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。したがって、特徴量による類似度を指標とするので、簡単な計算で文字の類似を判定することができる。 According to the invention, the replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position. If there are different characters, the different character matches the character of the word character string among the feature amount extracted by the feature extraction unit and the feature amount recorded in the character recognition dictionary stored in the storage unit. The similarity is calculated by comparing with the feature value stored in association with the character to be matched, and when the calculated similarity is equal to or greater than a predetermined threshold, the character in the recognition result character string corresponds to the word character string. Judged to replace with a character. Therefore, since the similarity based on the feature amount is used as an index, the similarity of characters can be determined with a simple calculation.

また本発明によれば、前記置き換え判定部は、前記文字列生成部によって生成された認識結果文字列と前記単語列生成部によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、前記記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、前記記憶部に記憶される文字認識辞書に収録された特徴量のうち単語文字列の文字の一致する文字に対応付けて記憶される特徴量とを比較して類似度を算出し、算出した類似度が予め定める閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。したがって、特徴量による類似度を指標とするので、簡単な計算で文字の類似を判定することができる。 According to the invention, the replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position. If there is a different character, the feature value stored for the different character is associated with the character that matches the character of the recognition result character string among the feature values recorded in the character recognition dictionary stored in the storage unit. And the feature quantity recorded in the character recognition dictionary stored in the storage unit in association with the character that matches the character of the word character string to calculate the degree of similarity. When the similarity is equal to or greater than a predetermined threshold, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. Therefore, since the similarity based on the feature amount is used as an index, the similarity of characters can be determined with a simple calculation.

また本発明によれば、前記置き換え判定部は、前記算出した類似度が予め定める閾値未満であるとき、前記単語列生成部が前記単語文字列を生成したときに、前記単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語であって、前記単語文字列に含まれない単語が抽出されていた場合は、前記算出した類似度が、予め定める閾値未満の値である予め定める第２の閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。 According to the invention, the replacement determination unit is recorded in the word dictionary when the calculated similarity is less than a predetermined threshold, and when the word string generation unit generates the word character string. When a word string that matches all characters or a word that matches only some characters and is not included in the word character string, the calculated similarity is determined in advance. When the value is equal to or greater than a predetermined second threshold value that is less than the threshold value, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string.

したがって、単語の候補が１つしか生成されていない場合は、その単語の候補は、正解である可能性が高く、類似度が予め定める閾値未満であっても、最適な単語として置き換えることができる。たとえば、光学式文字認識では、「かすれ」あるいは「にじみ」が原因で、複数の文字からなる単語のうちの１つの文字が正しく認識されない場合、類似度が低くなり、予め定める閾値未満になることがあるが、単語辞書から抽出された単語が１つである場合、正解である可能性は非常に高く、この単語を用いて修正することによって、正しい文字に訂正することができるという効果がある。 Therefore, when only one word candidate is generated, the word candidate is likely to be correct and can be replaced as an optimum word even if the similarity is less than a predetermined threshold. . For example, in optical character recognition, if one character of a word consisting of a plurality of characters is not correctly recognized due to “faint” or “smear”, the degree of similarity will be low and will be less than a predetermined threshold. However, if there is only one word extracted from the word dictionary, there is a high possibility that it is correct, and there is an effect that it can be corrected to a correct character by correcting using this word. .

また本発明によれば、前記置き換え判定部は、前記算出した類似度が予め定める閾値未満であるとき、前記異なる文字を含む単語の文字数が予め定める基準文字数以上である場合は、前記算出した類似度が、予め定める閾値未満の値である予め定める第２の閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。したがって、文字数が多い単語については、類似度が予め定める閾値未満であっても、正解である可能性が高く、最適な単語を選択して置き換えることができる。 According to the present invention, the replacement determination unit determines that the calculated similarity is greater than the predetermined reference number of characters including the different characters when the calculated similarity is less than a predetermined threshold. When the degree is equal to or greater than a predetermined second threshold value that is a value less than a predetermined threshold value, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. Therefore, for words with a large number of characters, even if the degree of similarity is less than a predetermined threshold value, there is a high possibility that the word is correct, and an optimal word can be selected and replaced.

また本発明によれば、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部を含む情報処理装置で、記憶部に記憶される文字認識辞書および単語辞書を用いて文字を認識するにあたって、文字列行切り出し工程では、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。文字切り出し工程では、文字列行切り出し工程で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。特徴抽出工程では、文字切り出し工程で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。選択工程では、特徴抽出工程で抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する。文字列生成工程では、選択工程で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。単語列生成工程では、文字列生成工程で生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。置き換え判定工程では、文字列生成工程で生成された認識結果文字列と単語列生成工程で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出工程で抽出された特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、置き換え工程では、置き換え判定工程で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 In addition, according to the present invention, a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character are included. In the information processing apparatus, when a character is recognized using the character recognition dictionary and the word dictionary stored in the storage unit, a character string is extracted from an image of a document in which a character string row that is a character string is described in the character string row cutout process. A character string image representing is cut out for each line. In the character cutout step, a character image representing each character is cut out character by character from the character string image cut out in the character string row cutout step. In the feature extraction step, the feature amount of the character represented by each character image is extracted from the character image cut out in the character cutout step. In the selection step, the feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and each character represented by the character image is recognized as a character represented by the character image. At least one candidate character to be obtained is selected from the character recognition dictionary. In the character string generation step, a recognition result character string is generated by combining candidate characters for each character image selected in the selection step. In the word string generation step, the candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary And a character string that matches all characters, or a word that matches only some characters, and generates a word character string composed of the extracted words. In the replacement determination step, if there is a different character by comparing the recognition result character string generated in the character string generation step and the word character string generated in the word string generation step for each character at the corresponding position, a different character About the feature amount extracted in the feature extraction step and the feature stored in association with the character that matches the character of the word character string out of the feature amount of the character recorded in the character recognition dictionary stored in the storage unit When the comparison result satisfies a predetermined comparison criterion, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. In the replacement step, the character in the recognition result character string determined to be replaced in the replacement determination step is replaced with the character at the corresponding position in the word character string.

したがって、文字認識辞書および単語辞書以外の辞書を用いることなく、認識した文字の誤りを正しい文字に訂正することができる。文字認識辞書および単語辞書以外の辞書を必要としないので、情報処理装置の記憶容量の増加を回避することができる。 Therefore, it is possible to correct the recognized character error to a correct character without using a dictionary other than the character recognition dictionary and the word dictionary. Since no dictionary other than the character recognition dictionary and word dictionary is required, an increase in the storage capacity of the information processing apparatus can be avoided.

また本発明によれば、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書、および少なくとも１つの文字からなる単語を収録する単語辞書を記憶する記憶部を含む情報処理装置で、記憶部に記憶される文字認識辞書および単語辞書を用いて文字を認識するにあたって、文字列行切り出し工程では、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。文字切り出し工程では、文字列行切り出し工程で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。特徴抽出工程では、文字切り出し工程で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。選択工程では、特徴抽出工程で抽出された特徴量と記憶部に記憶される文字認識辞書に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書から少なくとも１つ選択する。文字列生成工程では、選択工程で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。単語列生成工程では、文字列生成工程で生成された認識結果文字列を構成する候補文字と、記憶部に記憶される単語辞書に収録された単語とを照合し、単語辞書に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。置き換え判定工程では、文字列生成工程で生成された認識結果文字列と単語列生成工程で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、記憶部に記憶される文字認識辞書に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、記憶部に記憶される文字認識辞書に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、置き換え工程では、置き換え判定工程で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 In addition, according to the present invention, a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character are included. In the information processing apparatus, when a character is recognized using the character recognition dictionary and the word dictionary stored in the storage unit, a character string is extracted from an image of a document in which a character string row that is a character string is described in the character string row cutout process. A character string image representing is cut out for each line. In the character cutout step, a character image representing each character is cut out character by character from the character string image cut out in the character string row cutout step. In the feature extraction step, the feature amount of the character represented by each character image is extracted from the character image cut out in the character cutout step. In the selection step, the feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and each character represented by the character image is recognized as a character represented by the character image. At least one candidate character to be obtained is selected from the character recognition dictionary. In the character string generation step, a recognition result character string is generated by combining candidate characters for each character image selected in the selection step. In the word string generation step, the candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary And a character string that matches all characters, or a word that matches only some characters, and generates a word character string composed of the extracted words. In the replacement determination step, if there is a different character by comparing the recognition result character string generated in the character string generation step and the word character string generated in the word string generation step for each character at the corresponding position, a different character For the feature quantity recorded in the character recognition dictionary stored in the storage unit, the feature quantity stored in association with the character that matches the character of the recognition result character string, and the character recognition dictionary stored in the storage unit Is compared with the feature quantity stored in association with the character that matches the character of the word character string, and when the comparison result satisfies a predetermined comparison criterion, It is determined that the character is replaced with the corresponding character in the word string. In the replacement step, the character in the recognition result character string determined to be replaced in the replacement determination step is replaced with the character at the corresponding position in the word character string.

また本発明によれば、コンピュータに情報処理方法の各工程を実行させるためのプログラムとして提供することができる。 Moreover, according to this invention, it can provide as a program for making a computer perform each process of an information processing method.

また本発明によれば、コンピュータに前記情報処理方法の各工程を実行させるためのプログラムを記録したコンピュータ読み取り可能な記録媒体として提供することができる。 Furthermore, according to the present invention, it is possible to provide a computer-readable recording medium that records a program for causing a computer to execute each step of the information processing method.

本発明の一実施形態である情報処理装置１の構成を示すブロック図である。It is a block diagram which shows the structure of the information processing apparatus 1 which is one Embodiment of this invention. 入力装置２から受信した画像データが表す画像４０の一例を示す図である。It is a figure which shows an example of the image 40 which the image data received from the input device 2 represents. 文字列行切り出し部１５によって切り出された文字列行画像４１の一例を示す図である。It is a figure which shows an example of the character string line image 41 cut out by the character string line cutout part. 文字切り出し部１６によって切り出された文字画像４２の一例を示す図である。It is a figure which shows an example of the character image 42 cut out by the character cutout part 16. FIG. 特徴抽出部１７によって抽出された特徴量を説明するための図である。It is a figure for demonstrating the feature-value extracted by the feature extraction part. 候補文字４３および認識結果文字列４４の一例を示す図である。It is a figure which shows an example of the candidate character 43 and the recognition result character string 44. FIG. 単語列生成部２１によって生成された単語文字列４５の一例を示す図である。It is a figure which shows an example of the word character string 45 produced | generated by the word sequence production | generation part 21. FIG. 認識結果文字列と単語文字列との比較例４６を示す図である。It is a figure which shows the comparative example 46 of a recognition result character string and a word character string. 単語文字列の文字の特徴量および対応する位置にある候補文字の特徴量の一例を示す図である。It is a figure which shows an example of the feature-value of the character of a word character string, and the feature-value of the candidate character in a corresponding position. 情報処理装置１、入力装置２および出力装置３が実行する文字認識処理の処理手順を示すフローチャートである。4 is a flowchart illustrating a processing procedure of character recognition processing executed by the information processing device 1, the input device 2, and the output device 3. 後処理部２０が実行する第１の後処理の処理手順を示すフローチャートである。4 is a flowchart illustrating a processing procedure of a first post-processing executed by the post-processing unit 20. 文字切り出し部１６によって切り出された文字画像５２の一例を示す図である。It is a figure which shows an example of the character image 52 cut out by the character cutout part. 候補文字５３および認識結果文字列５４の一例を示す図である。It is a figure which shows an example of the candidate character 53 and the recognition result character string 54. FIG. 単語列生成部２１によって生成された単語文字列５５の一例を示す図である。It is a figure which shows an example of the word character string 55 produced | generated by the word sequence production | generation part 21. FIG. 認識結果文字列と単語文字列との比較例５６を示す図である。It is a figure which shows the comparative example 56 of a recognition result character string and a word character string. 単語文字列の文字の特徴量および対応する位置にある候補文字の特徴量の一例を示す図である。It is a figure which shows an example of the feature-value of the character of a word character string, and the feature-value of the candidate character in a corresponding position. 後処理部２０ａが実行する第２の後処理の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the 2nd post-process which the post-processing part 20a performs. 入力装置２から受信した画像データが表す画像６０の一例を示す図である。It is a figure which shows an example of the image 60 which the image data received from the input device 2 represents. 文字列行切り出し部１５によって切り出された文字列行画像６１の一例を示す図である。It is a figure which shows an example of the character string line image 61 cut out by the character string line cutout part. 文字切り出し部１６によって切り出された文字画像６２の一例を示す図である。It is a figure which shows an example of the character image 62 cut out by the character cutout part 16. FIG.

候補文字６３および認識結果文字列６４の一例を示す図である。It is a figure which shows an example of the candidate character 63 and the recognition result character string 64. FIG. 単語列生成部２１によって生成された単語文字列６５の一例を示す図である。It is a figure which shows an example of the word character string 65 produced | generated by the word sequence production | generation part 21. FIG. 認識結果文字列と単語文字列との比較例６６を示す図である。It is a figure which shows the comparative example 66 of a recognition result character string and a word character string. 単語文字列の文字の特徴量および対応する位置にある候補文字の特徴量の一例を示す図である。It is a figure which shows an example of the feature-value of the character of a word character string, and the feature-value of the candidate character in a corresponding position. 後処理部２０ｂが実行する第３の後処理の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the 3rd post-process which the post-processing part 20b performs.

図１は、本発明の第１の実施形態である情報処理装置１の構成を示すブロック図である。本発明に係る情報処理方法は、情報処理装置１によって実行される。情報処理装置１は、入力装置２および出力装置３に接続され、文章などが記載された原稿を入力装置２によって画像として読み取り、読み取った画像を表す画像データに基づいて文字認識を行い、認識した文字からなる文字列の誤りを正しい文字列に訂正して、出力装置３に出力する装置である。 FIG. 1 is a block diagram showing the configuration of the information processing apparatus 1 according to the first embodiment of the present invention. The information processing method according to the present invention is executed by the information processing apparatus 1. The information processing device 1 is connected to the input device 2 and the output device 3, reads a document on which text or the like is written as an image by the input device 2, performs character recognition based on image data representing the read image, and recognizes the character This is a device that corrects an error in a character string consisting of characters into a correct character string and outputs the corrected character string to the output device 3.

入力装置２は、たとえば原稿を画像として読み取るスキャナなどの画像読み取り装置であり、読み取った画像を表す画像データを情報処理装置１に送信する。出力装置３は、たとえば液晶ディスプレイによって構成されるモニタなどの表示装置であり、情報処理装置１から受信する情報を画面に表示する。図１に示した実施形態では、入力装置２および出力装置３は、情報処理装置１とは独立した装置として構成したが、情報処理装置１に含めてもよい。 The input device 2 is an image reading device such as a scanner that reads a document as an image, and transmits image data representing the read image to the information processing device 1. The output device 3 is a display device such as a monitor configured by a liquid crystal display, for example, and displays information received from the information processing device 1 on a screen. In the embodiment illustrated in FIG. 1, the input device 2 and the output device 3 are configured as devices independent of the information processing device 1, but may be included in the information processing device 1.

情報処理装置１は、たとえばパーソナルコンピュータなどのコンピュータによって構成され、制御部１１、入出力制御部１２、メモリ１３、文字認識辞書１４、文字列行切り出し部１５、文字切り出し部１６、特徴抽出部１７、マッチング部１８、文字列生成部１９および後処理部２０を含んで構成される。後処理部２０は、単語列生成部２１、置き換え判定部２２、置き換え部２３および単語辞書２４を含んで構成される。 The information processing apparatus 1 is configured by a computer such as a personal computer, for example, and includes a control unit 11, an input / output control unit 12, a memory 13, a character recognition dictionary 14, a character string line cutout unit 15, a character cutout unit 16, and a feature extraction unit 17. , A matching unit 18, a character string generation unit 19, and a post-processing unit 20. The post-processing unit 20 includes a word string generation unit 21, a replacement determination unit 22, a replacement unit 23, and a word dictionary 24.

制御部１１は、たとえば図示しない中央処理装置（Central Processing Unit；略称ＣＰＵ）によって構成される。入出力制御部１２は、制御部１１と入力装置２との間の情報の送受信、および制御部１１と出力装置３との間の情報の送受信を制御する。記憶部であるメモリ１３は、たとえば半導体メモリ装置あるいはハードディスク装置などの記憶装置によって構成され、制御部１１によって実行されるプログラム、およびプログラムを実行するために必要な情報を記憶する。メモリ１３に記憶される情報は、制御部１１によって読み書きされる。制御部１１は、メモリ１３に記憶されるプログラムを実行することによって、入出力制御部１２を制御するとともに、文字列行切り出し部１５、文字切り出し部１６、特徴抽出部１７、マッチング部１８、文字列生成部１９および後処理部２０などの機能を実現する。本実施形態では、メモリ１３を情報処理装置１に含めているが、情報処理装置１に接続される独立した記憶装置によって構成してもよいし、あるいは着脱可能な記録媒体によって構成してもよい。 The control part 11 is comprised by the central processing unit (Central Processing Unit; abbreviation CPU) which is not shown in figure, for example. The input / output control unit 12 controls transmission / reception of information between the control unit 11 and the input device 2 and transmission / reception of information between the control unit 11 and the output device 3. The memory 13 as a storage unit is configured by a storage device such as a semiconductor memory device or a hard disk device, and stores a program executed by the control unit 11 and information necessary for executing the program. Information stored in the memory 13 is read and written by the control unit 11. The control unit 11 controls the input / output control unit 12 by executing a program stored in the memory 13, as well as a character string line cutout unit 15, a character cutout unit 16, a feature extraction unit 17, a matching unit 18, a character Functions such as the column generation unit 19 and the post-processing unit 20 are realized. In the present embodiment, the memory 13 is included in the information processing apparatus 1, but may be configured by an independent storage device connected to the information processing apparatus 1, or may be configured by a removable recording medium. .

文字認識辞書１４は、文字コードで表わされる文字と文字の特徴を数値化した特徴量とを対応付けて収録した辞書である。単語辞書２４は、少なくとも１つの文字からなる単語を収録する辞書である。単語辞書２４にも文字ごとの文字コードが記憶されている。文字認識辞書１４および単語辞書２４は、メモリ１３に記憶される。 The character recognition dictionary 14 is a dictionary that records a character represented by a character code and a feature value obtained by quantifying the character feature in association with each other. The word dictionary 24 is a dictionary that records words composed of at least one character. The word dictionary 24 also stores character codes for each character. The character recognition dictionary 14 and the word dictionary 24 are stored in the memory 13.

制御部１１は、入出力制御部１２によって、入力装置２から送信される画像データを受信し、受信した画像データを文字列行切り出し部１５に送る。文字列行切り出し部１５は、制御部１１から受け取った画像データが表す画像から文字列を表す文字列行画像を行ごとに切り出す。文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列行画像から、文字を１文字ずつ表す矩形画像である文字画像を切り出す。特徴抽出部１７は、文字切り出し部１６によって切り出された文字画像が表す文字の特徴を数値化した特徴量を抽出する。 The control unit 11 receives the image data transmitted from the input device 2 by the input / output control unit 12 and sends the received image data to the character string line cutout unit 15. The character string row cutout unit 15 cuts out a character string row image representing a character string from the image represented by the image data received from the control unit 11 for each line. The character cutout unit 16 cuts out a character image that is a rectangular image representing characters one by one from the character string row image cut out by the character string row cutout unit 15. The feature extraction unit 17 extracts a feature amount obtained by quantifying the character feature represented by the character image cut out by the character cutout unit 16.

選択部であるマッチング部１８は、文字認識辞書１４とのマッチング処理を行う。マッチング処理は、文字画像の特徴と文字認識辞書１４に収録されている文字の特徴とを比較し、どれだけ似ているかを数値化する処理である。具体的には、マッチング部１８は、特徴抽出部１７によって抽出された特徴量と、文字認識辞書１４に収録されているすべての文字の特徴量とを比較して、文字画像が表す文字と文字認識辞書１４に収録された文字との類似度を算出する。そして、算出した類似度に基づいて、文字画像が表す文字の候補文字を少なくとも１つ、たとえば第１候補〜第５候補の候補文字を抽出する。文字列生成部１９は、マッチング部１８によって抽出された候補文字に示される候補文字のうち、第１候補の候補文字を左から順に並べた認識結果文字列を生成する。 The matching unit 18 serving as a selection unit performs matching processing with the character recognition dictionary 14. The matching process is a process of comparing the characteristics of the character image with the characteristics of the characters recorded in the character recognition dictionary 14 and quantifying how much they are similar. Specifically, the matching unit 18 compares the feature amount extracted by the feature extraction unit 17 with the feature amount of all the characters recorded in the character recognition dictionary 14 to determine the character and character represented by the character image. The similarity with the characters recorded in the recognition dictionary 14 is calculated. Then, based on the calculated similarity, at least one candidate character of the character represented by the character image, for example, a first to fifth candidate character is extracted. The character string generation unit 19 generates a recognition result character string in which the candidate characters of the first candidate are arranged in order from the left among the candidate characters indicated by the candidate characters extracted by the matching unit 18.

後処理部２０は、第１の後処理を行う。第１の後処理では、単語列生成部２１は、文字列生成部１９によって生成された認識結果文字列を構成する各文字の複数の候補文字を組み合わせて、単語辞書２４と照合し、最適な単語を選択することで構成される単語文字列を生成する。置き換え判定部２２は、単語列生成部２１によって生成された単語文字列と、文字列生成部１９によって生成された認識結果文字列とを比較し、比較結果が予め定める比較基準を満たすとき、文字を置き換えると判定する。置き換え部２３は、置き換え判定部２２によって置き換えると判定された文字について、認識結果文字列中の文字を単語文字列中の文字で置き換え、置き換えた認識結果文字列を出力装置３に出力する。以下、文字列行切り出し部１５、文字切り出し部１６、特徴抽出部１７、マッチング部１８、文字列生成部１９および後処理部２０を詳細に説明する。 The post-processing unit 20 performs a first post-processing. In the first post-processing, the word string generation unit 21 combines a plurality of candidate characters of each character constituting the recognition result character string generated by the character string generation unit 19 and collates with the word dictionary 24 to obtain the optimum A word string composed by selecting a word is generated. The replacement determination unit 22 compares the word character string generated by the word string generation unit 21 with the recognition result character string generated by the character string generation unit 19, and when the comparison result satisfies a predetermined comparison criterion, Is determined to be replaced. The replacement unit 23 replaces the character in the recognition result character string with the character in the word character string for the character determined to be replaced by the replacement determination unit 22, and outputs the replaced recognition result character string to the output device 3. Hereinafter, the character string row cutout unit 15, the character cutout unit 16, the feature extraction unit 17, the matching unit 18, the character string generation unit 19, and the post-processing unit 20 will be described in detail.

図２は、入力装置２から受信した画像データが表す画像４０の一例を示す図である。入力装置２から受信した画像データが表す画像４０は、文字列行が記載された原稿を入力装置２によって読み取った画像である。文字列行は、少なくとも１つの行からなる文字列である。文字列は、横書きでもよいし、縦書きでもよい。横書きのときは、横方向が文字列であり、縦方向が行である。縦書きのときは、縦方向が文字列で、横方向が行である。画像は、カラー画像でもよいが、本実施形態では、画像は白黒画像であるとして、以下説明する。白画素からなる部分は背景部分であり、黒画素からなる部分が文字部分である。 FIG. 2 is a diagram illustrating an example of the image 40 represented by the image data received from the input device 2. An image 40 represented by image data received from the input device 2 is an image obtained by reading a document on which a character string row is written by the input device 2. The character string line is a character string composed of at least one line. The character string may be written horizontally or vertically. In horizontal writing, the horizontal direction is a character string, and the vertical direction is a line. In vertical writing, the vertical direction is a character string and the horizontal direction is a line. The image may be a color image, but in the present embodiment, the following description will be made assuming that the image is a monochrome image. The portion composed of white pixels is the background portion, and the portion composed of black pixels is the character portion.

図２に示した画像４０の例には、２行にわたる文字列行が示されており、第１行目には、「入居者の保険医療制度についてアンケートを実施。」と示され、第２行目には、「協力、理解を得るために事前に説明会を行った。」と示されている。 In the example of the image 40 shown in FIG. 2, a character string line extending over two lines is shown. The first line indicates “A questionnaire about the resident's insurance medical system.” The line says, “We held a briefing session in advance to gain cooperation and understanding.”

図３は、文字列行切り出し部１５によって切り出された文字列行画像４１の一例を示す図である。文字列行切り出し部１５は、制御部１１が入出力制御部１２によって入力装置２から受信した画像データを制御部１１から受け取り。受け取った画像データが表す画像から文字列を表す文字列行画像を行ごとに切り出す。 FIG. 3 is a diagram illustrating an example of the character string row image 41 cut out by the character string row cutout unit 15. The character string line segmentation unit 15 receives image data received from the input device 2 by the input / output control unit 12 by the control unit 11 from the control unit 11. A character string row image representing a character string is cut out for each row from the image represented by the received image data.

文字列行画像を切り出す方法は、周知の方法でよく、本実施形態では、たとえば、文字列行切り出し部１５は、画像内の空白部分つまり白画素部分、および文字部分つまり黒画素部分の分布から、文字列行画像を切り出す。具体的には、行と行との間には一定の連続する白画素の分布があることを利用して、文字列行画像を切り出す。文字列行画像を切り出す方法は、この方法に限定されるものではなく、これ以外のどのような方法を用いてもよい。 The method for cutting out the character string row image may be a well-known method. In the present embodiment, for example, the character string row cutout unit 15 determines from the distribution of blank portions, that is, white pixel portions, and character portions, that is, black pixel portions in the image. Cut out the character string line image. Specifically, a character string row image is cut out by utilizing the fact that there is a certain distribution of white pixels between rows. The method of cutting out the character string line image is not limited to this method, and any other method may be used.

図３に示した文字列行画像４１の例には、文字列行画像４１１，４１２の２つの文字列行画像が示されている。文字列行画像４１１は、「入居者の保険医療制度について、」という文字列が表示された画像であり、文字列行画像４１２は、「協力、理解を得るために事前に説明会を行った。」という文字列が表示された画像である。 In the example of the character string row image 41 illustrated in FIG. 3, two character string row images of the character string row images 411 and 412 are illustrated. The character string line image 411 is an image in which the character string “about the resident's insurance medical system” is displayed, and the character string line image 412 is “an explanatory meeting was performed in advance to obtain cooperation and understanding”. Is an image on which a character string “.” Is displayed.

図４は、文字切り出し部１６によって切り出された文字画像４２の一例を示す図である。文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列行画像から、文字を１文字ずつ矩形画像である文字画像として切り出す。 FIG. 4 is a diagram illustrating an example of the character image 42 cut out by the character cutout unit 16. The character cutout unit 16 cuts out characters one by one from the character string row image cut out by the character string row cutout unit 15 as a character image that is a rectangular image.

文字切り出し手法も周知の方法でよいが、文字の上下左右方向の各々一番突出する部分の座標を最大値として外接矩形で切り出すことを条件とする。本実施形態では、座標は、たとえば図２に示した画像４０の左上の頂点を原点とし、画像４０に向かって左右方向をＸ軸、上下方向をＹ軸とする座標系における座標である。文字画像は、Ｘ軸に平行な２辺とＹ軸に平行な２辺とからなる４辺で囲まれる矩形の画像であり、１つの文字を含む最小の大きさの矩形画像である。 A well-known method may be used as the character cutout method, provided that the character is cut out in a circumscribed rectangle with the coordinates of the most protruding portions in the vertical and horizontal directions as the maximum value. In the present embodiment, the coordinates are coordinates in a coordinate system in which, for example, the upper left vertex of the image 40 illustrated in FIG. 2 is the origin, the left and right directions toward the image 40 are the X axis, and the vertical direction is the Y axis. The character image is a rectangular image surrounded by four sides including two sides parallel to the X axis and two sides parallel to the Y axis, and is a rectangular image having a minimum size including one character.

図４に示した文字画像４２の例には、１５文字の文字画像が示されている。最初の文字画像４２１は、「入」という文字を切り出した文字画像であり、第２番目以降に「居」、「者」、「の」、「保」、「険」、「医」、「療」、「制」、「度」、「に」、「つ」、「い」、「て」および「、」の各文字の文字画像が示されている。 In the example of the character image 42 shown in FIG. 4, a character image of 15 characters is shown. The first character image 421 is a character image obtained by cutting out the character “ON”, and “I”, “person”, “NO”, “K”, “K”, “medicine”, “ Character images of the characters “medicine”, “control”, “degree”, “ni”, “tsu”, “i”, “te” and “,” are shown.

図５は、特徴抽出部１７によって抽出された特徴量を説明するための図である。特徴抽出部１７は、文字切り出し部１６によって切り出された文字画像が表す文字の特徴を数値化した特徴量を抽出する。特徴抽出部１７は、後処理部２０のために、抽出した特徴量を文字画像ごとにメモリ１３に記憶しておく。 FIG. 5 is a diagram for explaining the feature amount extracted by the feature extraction unit 17. The feature extraction unit 17 extracts a feature amount obtained by quantifying the character feature represented by the character image cut out by the character cutout unit 16. The feature extraction unit 17 stores the extracted feature amount in the memory 13 for each character image for the post-processing unit 20.

特徴抽出の手法としては、「メッシュ特徴」と呼ばれる手法がよく知られており、本実施形態でも、特徴抽出部１７は、「メッシュ特徴」と呼ばれる手法を用いて特徴量を抽出する。「メッシュ特徴」とは、切り出された矩形画像である文字画像を、メッシュに分割し、各メッシュにおける黒画素数を数値化して正規化を行う処理である。 As a feature extraction method, a method called “mesh feature” is well known, and in this embodiment, the feature extraction unit 17 extracts a feature amount using a method called “mesh feature”. “Mesh feature” is a process of dividing a character image, which is a cut-out rectangular image, into meshes, and quantifying the number of black pixels in each mesh for normalization.

図５（ａ）は「あ」という文字の文字画像３０を縦横８×８＝６４のメッシュ３１に分割して区切った図である。図５（ｂ）は、図５（ａ）に示した各メッシュ３１における黒画素数を数値化して正規化した特徴データ３２の一例を示す図である。特徴データ３２は、それぞれのメッシュ３１の文字部分の特徴量を、対応する位置にある８×８のメッシュ３３ごとに表したデータである。対応する位置とは、８×８のメッシュの中で同じ位置にあるということである。 FIG. 5A is a diagram in which the character image 30 of the character “A” is divided into 8 × 8 = 64 meshes 31 and divided. FIG. 5B is a diagram illustrating an example of the feature data 32 obtained by quantifying and normalizing the number of black pixels in each mesh 31 illustrated in FIG. The feature data 32 is data representing the feature amount of the character portion of each mesh 31 for each 8 × 8 mesh 33 at the corresponding position. The corresponding position means that they are at the same position in the 8 × 8 mesh.

たとえば最上段の各メッシュ３３は、左から順に、第１番目のメッシュ３１の特徴量が「０」であることを示し、第２番目のメッシュ３１の特徴量が「０」であることを示し、第３番目のメッシュ３１の特徴量が「１５」であることを示し、第４番目のメッシュ３１の特徴量が「３４」であることを示し、第５番目のメッシュ３１の特徴量が「０」であることを示し、第６番目のメッシュ３１の特徴量が「０」であることを示し、第７番目のメッシュ３１の特徴量が「３」であり、第８番目のメッシュ３１の特徴量が「０」であることを示している。 For example, each mesh 33 in the uppermost row indicates that the feature amount of the first mesh 31 is “0” and the feature amount of the second mesh 31 is “0” in order from the left. , The feature amount of the third mesh 31 is “15”, the feature amount of the fourth mesh 31 is “34”, and the feature amount of the fifth mesh 31 is “ 0 ”, the feature amount of the sixth mesh 31 is“ 0 ”, the feature amount of the seventh mesh 31 is“ 3 ”, and the feature amount of the eighth mesh 31 is It indicates that the feature amount is “0”.

図６は、候補文字４３および認識結果文字列４４の一例を示す図である。図６（ａ）は、マッチング部１８によって抽出された候補文字４３の一例を示す。候補文字４３は、文字画像４２が表す文字として、マッチング部１８によって抽出された候補の文字である。マッチング部１８は、特徴抽出部１７によって抽出された特徴量と、文字認識辞書１４に収録されているすべての文字の特徴量とを比較して、文字画像が表す文字と文字認識辞書１４に収録された文字との類似度を算出し、算出した類似度に基づいて、文字画像が表す文字の候補文字を少なくとも１つ抽出する。本実施形態では、類似度の高い文字から最大で５つの候補文字、第１候補〜第５候補の候補文字を抽出する。 FIG. 6 is a diagram illustrating an example of the candidate character 43 and the recognition result character string 44. FIG. 6A shows an example of the candidate character 43 extracted by the matching unit 18. The candidate character 43 is a candidate character extracted by the matching unit 18 as a character represented by the character image 42. The matching unit 18 compares the feature amount extracted by the feature extraction unit 17 with the feature amount of all characters recorded in the character recognition dictionary 14 and records them in the character recognition dictionary 14 and the character represented by the character image. The degree of similarity with the calculated character is calculated, and at least one candidate character for the character represented by the character image is extracted based on the calculated degree of similarity. In the present embodiment, a maximum of five candidate characters, a first candidate to a fifth candidate character, are extracted from characters with high similarity.

類似度は、文字画像が表す文字と文字認識辞書１４に収録された文字とが、どれだけ似ているかを評価するための指標であり、類似度が高いほどより似ていることを示す。本実施形態では、類似度として、単純類似度と呼ばれる指標を用いてどれだけ似ているかを評価する。単純類似度Ｓは、式（１）で算出する。 The similarity is an index for evaluating how similar the character represented by the character image and the character recorded in the character recognition dictionary 14 are, and indicates that the similarity is higher as the similarity is higher. In the present embodiment, the degree of similarity is evaluated using an index called simple similarity as the similarity. The simple similarity S is calculated by equation (1).

ここに、文字画像が示す文字の特徴データをＸ、文字認識辞書１４に収録されている文字の特徴データをＹ、文字画像の第ｍ番目のメッシュの特徴量をｘ_ｍ、文字認識辞書１４に収録されている文字の第ｍ番目のメッシュの特徴量をｙ_ｍ、メッシュの数をｋとしている。メッシュの第ｍ番目とは、本実施形態では、８×８のメッシュであるので、ｋ＝６４であり、最上段の左から順にｍ＝１，２，３，……８であり、右端までくると、次段の左端からｍ＝９，１０，１１，……であり、最下段の右端がｍ＝６４である。 Here, the feature data of the character indicated by the character image is X, the feature data of the character recorded in the character recognition dictionary 14 is Y, the feature amount of the mth mesh of the character image is x _m , and the character recognition dictionary 14 The feature amount of the m-th mesh of the recorded characters is y _m , and the number of meshes is k. In the present embodiment, the m-th mesh is an 8 × 8 mesh, so k = 64, m = 1, 2, 3,... 8 in order from the left in the uppermost stage, up to the right end. When coming, m = 9, 10, 11,... From the left end of the next stage, and m = 64 at the right end of the lowermost stage.

式（１）の最右辺の分母は、文字画像が表す文字の特徴データおよび文字認識辞書１４に収録されている文字の特徴データを正規化しておくと一定となるので、式（１）で計算を行うのは、分子についてのみ計算を行えばよい。本実施形態では、８×８のメッシュについてメッシュ特徴を用いているので、分子は、式（２）のように表すことができる。 The rightmost denominator of equation (1) becomes constant when the character feature data represented by the character image and the character feature data recorded in the character recognition dictionary 14 are normalized. It is only necessary to perform calculations for molecules. In this embodiment, since the mesh feature is used for an 8 × 8 mesh, the numerator can be expressed as shown in Equation (2).

式（２）を用いると、計算結果の値が非常に大きくなって扱いにくいので、本実施形態では、式（３）に示すように式（２）の計算結果を「５１２」で除算した値を、単純類似度Ｓとして用いる。 If the expression (2) is used, the value of the calculation result becomes very large and difficult to handle. In this embodiment, the value obtained by dividing the calculation result of the expression (2) by “512” as shown in the expression (3) Is used as the simple similarity S.

式（３）によって、単純類似度Ｓの最大値は「１２８」になるように設定される。
特徴抽出部１７は、文字切り出し部１６によって切り出された文字画像ごとに、文字認識辞書１４に収録されている文字との類似度を式（３）によって計算する。そして、文字認識辞書１４に収録されている文字から、類似度が高い順に第１候補〜第５候補の５つまでの候補文字を抽出する。候補文字は、文字画像が表す文字としての候補となる文字である。類似度が、予め定める最低基準未満であるときは、候補文字として抽出しない。予め定める最低基準は、たとえば「８０」である。本実施形態では、予め定める最低基準を「８０」としたが、この値に限定されるものではない。また、予め定める最低基準による判断を行うことなく、常に第１候補〜第５候補までを抽出するようにしてもよい。本実施形態では、候補文字を第５候補までとしたが、これに限定されるものではなく、たとえば第１０候補まで抽出してもよい。ただし、文字列が長くなった場合、あるいは認識する文字列が増加した場合は、記憶するデータ量も増加することに注意する必要がある。 According to Expression (3), the maximum value of the simple similarity S is set to be “128”.
The feature extraction unit 17 calculates the similarity with the characters recorded in the character recognition dictionary 14 for each character image cut out by the character cutout unit 16 according to Expression (3). Then, from the characters recorded in the character recognition dictionary 14, up to five candidate characters from the first candidate to the fifth candidate are extracted in descending order of similarity. A candidate character is a character that is a candidate for the character represented by the character image. If the similarity is less than a predetermined minimum standard, it is not extracted as a candidate character. The predetermined minimum standard is, for example, “80”. In the present embodiment, the predetermined minimum standard is “80”, but is not limited to this value. Alternatively, the first to fifth candidates may always be extracted without making a determination based on a predetermined minimum standard. In the present embodiment, the candidate characters are up to the fifth candidate. However, the present invention is not limited to this, and for example, up to the tenth candidate may be extracted. However, it should be noted that the amount of data to be stored increases when the character string becomes longer or when the number of recognized character strings increases.

図６（ａ）に示した候補文字４３は、最上段に入力文字を左から順に配置し、第２段目に最上段のそれぞれの文字に対する第１候補の候補文字、第３段目に最上段のそれぞれの文字に対する第２候補の候補文字、第４段目に最上段のそれぞれの文字に対する第３候補の候補文字、第５段目に最上段のそれぞれの文字に対する第４候補の候補文字、第６段目に最上段のそれぞれの文字に対する第５候補の候補文字を示している。入力文字は、候補文字の本来の文字、つまり原稿に記載されていた文字であり、以下「正解文字」ともいう。各候補文字の下側に付している数字は、その候補文字の類似度を表している。第１候補は、類似度が最も高い候補文字であり、以降類似度が高い順に、第２候補、第３候補、第４候補および第５候補の候補文字となる。 In the candidate character 43 shown in FIG. 6A, input characters are arranged in order from the left in the top row, the first candidate candidate character for each character in the top row in the second row, and the first character in the third row. Second candidate candidate character for each character in the upper row, Third candidate character for each uppermost character in the fourth row, Fourth candidate candidate character for each uppermost character in the fifth row In the sixth row, the fifth candidate character for each character in the uppermost row is shown. The input character is the original character of the candidate character, that is, the character described in the manuscript, and is hereinafter also referred to as “correct answer character”. The number attached to the lower side of each candidate character represents the similarity of the candidate character. The first candidate is a candidate character having the highest similarity, and thereafter becomes a candidate character of the second candidate, the third candidate, the fourth candidate, and the fifth candidate in descending order of similarity.

図６（ａ）に示した候補文字４３の例では、最上段に、入力文字として、「入」、「居」、「者」、「の」、「保」、「険」、「医」、「療」、「制」、「度」、「に」、「つ」、「い」、「て」および「、」が示されている。第１候補〜第５候補については、たとえば正解文字「入」については、第１候補「入１２１」、第２候補「人１２０」、第３候補「大１１５」および第４候補「λ １１０」が示され、第５候補は、空白である。 In the example of the candidate character 43 shown in FIG. 6A, “input”, “residence”, “person”, “no”, “protection”, “drug”, “medicine” are input characters at the top. , “Medicine”, “control”, “degree”, “ni”, “tsu”, “i”, “te” and “,” are shown. For the first to fifth candidates, for example, for the correct character “ON”, the first candidate “ON 121”, the second candidate “person 120”, the third candidate “large 115”, and the fourth candidate “λ 110”. And the fifth candidate is blank.

図６（ｂ）は、文字列生成部１９によって生成された認識結果文字列４４の一例を示す。文字列生成部１９は、マッチング部１８によって抽出された候補文字４３に示される候補文字のうち、第１候補の候補文字を左から順に並べた認識結果文字列４４を生成する。文字列生成部１９は、後処理部２０のために、候補文字４３をメモリ１３に記憶しておく。図６（ｂ）に示した例では、認識結果文字列４４として「入居青の振険医療制度について、」なる文字列が示されている。後処理部２０による処理を行わない場合は、この認識結果文字列４４を、認識結果として出力することになる。本実施形態では、この後、後処理部２０によって第１の後処理が実行される。 FIG. 6B shows an example of the recognition result character string 44 generated by the character string generation unit 19. The character string generation unit 19 generates a recognition result character string 44 in which the candidate characters of the first candidate are arranged in order from the left among the candidate characters indicated by the candidate character 43 extracted by the matching unit 18. The character string generation unit 19 stores candidate characters 43 in the memory 13 for the post-processing unit 20. In the example shown in FIG. 6B, the recognition result character string 44 is a character string “Regarding the resident blue tremor medical system”. When the processing by the post-processing unit 20 is not performed, the recognition result character string 44 is output as a recognition result. In the present embodiment, thereafter, the first post-processing is executed by the post-processing unit 20.

後処理部２０は、第１の後処理を実行する。すなわち、文字列生成部１９によって生成された認識結果文字列４４を構成する各文字の複数の候補文字を組み合わせて、単語辞書２４と照合し、最適な単語を選択することで構成される単語文字列を生成する。そして、生成した単語文字列と、文字列生成部１９によって生成された認識結果文字列４４とを比較して、認識結果文字列４４中の誤りを修正し、修正した認識結果文字列を出力装置３に出力する。第１の後処理は、後処理部２０に含まれる単語列生成部２１、置き換え判定部２２および置き換え部２３によって実行される。 The post-processing unit 20 executes a first post-processing. That is, a word character configured by combining a plurality of candidate characters of each character constituting the recognition result character string 44 generated by the character string generation unit 19 and collating with the word dictionary 24 and selecting an optimum word Generate a column. Then, the generated word character string and the recognition result character string 44 generated by the character string generation unit 19 are compared to correct an error in the recognition result character string 44, and the corrected recognition result character string is output. 3 is output. The first post-processing is executed by the word string generation unit 21, the replacement determination unit 22, and the replacement unit 23 included in the post-processing unit 20.

具体的には、単語列生成部２１は、文字列生成部１９によって生成された認識結果文字列４４を構成する各文字の複数の候補文字を組み合わせて複数生成される文字列と、単語辞書２４に収録されている単語とを照合して単語の候補を生成し、生成した単語の候補の中から最適な単語を選択することによって構成される単語文字列を生成する。 Specifically, the word string generation unit 21 includes a character string generated by combining a plurality of candidate characters of each character constituting the recognition result character string 44 generated by the character string generation unit 19, and the word dictionary 24. The word candidate is generated by collating with the words recorded in, and a word character string formed by selecting an optimum word from the generated word candidates is generated.

候補文字から単語の候補を生成する方法には、形態素解析を用いる方法などがあるが、どのような方法を用いてもよい。本実施形態では、認識結果文字列４４の第１文字目から順に、各候補文字について、単語辞書２４に収録されている単語に完全に一致する文字列、もしくは部分的に一致する文字列を単語の候補として生成する。 A method for generating word candidates from candidate characters includes a method using morphological analysis, and any method may be used. In the present embodiment, in order from the first character of the recognition result character string 44, for each candidate character, a character string that completely matches or partially matches a word recorded in the word dictionary 24 is a word. Generate as a candidate.

たとえば、図６（ｂ）に示した認識結果文字列４４のうち、第１文字目〜第３文字目の文字列は、「入居青」である。ここで、単語辞書２４に「入居者」という単語が存在していれば、この「入居者」が「入居青」に対応する単語の候補として抽出される。また、「入居」という単語が単語辞書２４に収録されていれば「入居」も単語の候補になる。 For example, in the recognition result character string 44 shown in FIG. 6B, the character string of the first character to the third character is “occupied blue”. Here, if the word “resident” exists in the word dictionary 24, this “resident” is extracted as a word candidate corresponding to “resident blue”. Further, if the word “resident” is recorded in the word dictionary 24, “resident” is also a word candidate.

「入居」および「入居者」が単語の候補として単語辞書２４から抽出された場合は、もし、認識結果文字列４４の第３文字目の「青」という第１候補の候補文字に対応する第２候補〜第５候補の候補文字の中に「者」の文字があれば、「入居者」という単語を構成する文字を、認識結果文字列４４を構成する文字の候補文字で生成することができる。第１候補の候補文字に対応する第２候補〜第５候補の候補文字とは、入力文字が第１候補の候補文字と同じ第２候補〜第５候補の候補文字ということである。そして、「入居者」という単語の候補は「入居」という単語の候補よりも単語長が長いので、単語長が長いほうの「入居者」を最適な単語として選択する。 If “resident” and “resident” are extracted from the word dictionary 24 as word candidates, the third candidate “blue” in the recognition result character string 44 corresponds to the first candidate candidate character “blue”. If there is a character “person” among the candidate characters of the second candidate to the fifth candidate, a character constituting the word “resident” may be generated as a candidate character of the character constituting the recognition result character string 44. it can. The candidate characters of the second candidate to the fifth candidate corresponding to the candidate character of the first candidate are the candidate characters of the second candidate to the fifth candidate whose input characters are the same as the candidate characters of the first candidate. Then, since the word candidate “resident” has a longer word length than the word candidate “resident”, the “resident” having the longer word length is selected as the optimum word.

また、もし文字「青」の候補文字に対応する第２候補〜第５候補の候補文字の中に「者」という文字が存在しない場合、つまり図６（ａ）のような場合は、認識結果文字列４４を構成する文字と一致する文字の文字数が同じであれば、単語を構成する文字の文字数が大きいほうの単語を選択する。つまり、「入居」という単語の候補および「入居者」という単語の候補は、どちらも「入居」という文字列が、認識結果文字列４４の第１，２文字目の文字列と一致している。そして「入居」という単語の候補は単語を構成する文字数が２であり、「入居者」という単語の候補は単語を構成する文字数が３であることから、「入居者」を選択することになる。これは、部分的に一致する文字列を最適な単語として選択する例である。以下部分的に一致することを「部分一致」ともいう。 If the character “person” does not exist among the candidate characters of the second to fifth candidates corresponding to the candidate character “blue”, that is, as shown in FIG. 6A, the recognition result If the number of characters matching the characters constituting the character string 44 is the same, the word having the larger number of characters constituting the word is selected. That is, in both the word candidate “resident” and the word candidate “resident”, the character string “resident” matches the first and second character strings of the recognition result character string 44. . Since the word candidate “resident” has 2 characters, the word “resident” has 3 characters. Therefore, “resident” is selected. . This is an example of selecting a partially matching character string as an optimum word. Hereinafter, partial matching is also referred to as “partial matching”.

単語列生成部２１は、認識結果文字列４４の第４文字目についても、単語辞書２４に収録されている単語と完全に一致する文字列、もしくは部分的に一致する文字列を最適な単語として選択する。単語列生成部２１は、この処理を、認識結果文字列４４の最終文字、たとえば図６（ｂ）に示した例では「、」の文字に対する最適な単語を選択するまで繰り返す。 For the fourth character of the recognition result character string 44, the word string generation unit 21 sets a character string that completely or partially matches a word recorded in the word dictionary 24 as an optimum word. select. The word string generation unit 21 repeats this process until the optimum word for the last character of the recognition result character string 44, for example, the character “,” in the example shown in FIG. 6B is selected.

ただし、この処理においては、すべての認識結果文字列４４を構成する候補文字以外の候補文字の組合せを用いて単語辞書２４と照合し、単語辞書２４に収録された単語との部分一致も許容する構成としているので、以下のようなことを考慮する必要がある。 However, in this process, the word dictionary 24 is collated using a combination of candidate characters other than the candidate characters constituting all the recognition result character strings 44, and partial matching with the words recorded in the word dictionary 24 is allowed. Since it is configured, it is necessary to consider the following.

たとえば、図６（ｂ）に示した認識結果文字列４４を構成する第５，６文字目は、正しくは「保険」という文字列であるが、マッチング結果では認識結果文字列４４の第５，６文字目は「振険」となっており、単語辞書２４を検索した場合、部分一致となる「保険」および「振替」などの単語が単語の候補として生成される。すなわち、図６（ａ）に示した候補文字４３には、「保」および「替」の文字がそれぞれ「保」の候補文字および「険」の候補文字に存在しないため、完全に一致する単語の候補がなく、部分一致となる複数の単語の候補が生成される。たとえば「険」の第３候補である「陰」と部分一致である「山陰」、および「険」の第２候補である「陵」と部分一致である「丘陵」などの部分一致となる単語が単語辞書２４に収録されていれば、単語の候補として生成される。このようにして生成された単語の候補の一例を図７に示す。 For example, the fifth and sixth characters constituting the recognition result character string 44 shown in FIG. 6B are correctly the character string “insurance”, but the matching result shows the fifth and fifth characters of the recognition result character string 44. The sixth character is “Short”, and when the word dictionary 24 is searched, words such as “insurance” and “transfer” that partially match are generated as word candidates. That is, in the candidate character 43 shown in FIG. 6A, the characters “K” and “K” are not present in the “K” candidate character and the “K” candidate character, respectively. There are no candidates, and a plurality of word candidates that are partially matched are generated. For example, a word that partially matches such as “San-in” that is partially matched with “Yin”, which is the third candidate for “Steep”, and “Hill” that is partially matched with “Ling” that is the second candidate for “Steep”. Is recorded in the word dictionary 24, it is generated as a word candidate. An example of the word candidates generated in this way is shown in FIG.

図７は、単語列生成部２１によって生成された単語文字列４５の一例を示す図である。図７に示した単語文字列４５の例には、最適な単語たとえば単語４５１、および複数生成された単語の候補たとえば単語の候補４５２が混在している。単語の候補４５２には、４つの単語「保険」、「振替」、「山陰」および「丘陵」が生成されている。複数の候補がある場合は、単語列生成部２１は、以下のような３つの条件で優先順位をつけて、最適な単語を選択し、最適な単語のみからなる単語文字列を生成する。以下、単語の候補が２文字からなる場合を例にして説明する。 FIG. 7 is a diagram illustrating an example of the word character string 45 generated by the word string generation unit 21. In the example of the word character string 45 shown in FIG. 7, an optimum word such as a word 451 and a plurality of generated word candidates such as a word candidate 452 are mixed. In the word candidate 452, four words “insurance”, “transfer”, “sanin”, and “hill” are generated. When there are a plurality of candidates, the word string generation unit 21 assigns priorities according to the following three conditions, selects an optimal word, and generates a word character string including only the optimal word. Hereinafter, a case where a word candidate consists of two characters will be described as an example.

条件１は、第１番目の文字が候補文字内に存在する文字であり、かつ第２番目の文字が候補文字内に存在する文字であるという条件である。条件２は、第１番目の文字が候補文字内に存在する文字であり、かつ第２番目の文字が候補文字内に存在しない文字であるという条件、または第１番目の文字が候補文字内に存在しない文字であり、かつ第２番目の文字が候補文字内に存在する文字であるという条件である。条件３は、２文字の類似度が最大であるという条件である。２文字の類似度とは、単語の候補と、認識結果文字列の文字のうちその単語の候補に対応する位置にある２文字との類似度である。対応する位置とは、認識結果文字列および単語文字列のそれぞれの先頭文字からの順番が同じ位置ということである。２文字の類似度は、２文字を構成する各文字の類似度の平均値である。本実施形態では、２文字の類似度の平均値を用いているが、２文字の類似度の合計値を用いてもよい。 Condition 1 is a condition in which the first character is a character that exists in the candidate character and the second character is a character that exists in the candidate character. Condition 2 is that the first character is a character that exists in the candidate character and the second character is a character that does not exist in the candidate character, or the first character is in the candidate character. The condition is that the character does not exist and the second character is a character that exists in the candidate character. Condition 3 is a condition that the similarity between two characters is the maximum. The similarity between two characters is the similarity between a word candidate and two characters at positions corresponding to the word candidate among the characters of the recognition result character string. The corresponding position means that the recognition result character string and the word character string have the same order from the first character. The similarity between two characters is an average value of the similarity between the characters constituting the two characters. In this embodiment, the average value of the similarity of two characters is used, but the total value of the similarity of two characters may be used.

単語列生成部２１は、複数の単語の候補の中に、条件１を満たす単語の候補が１つであれば、その単語の候補を最適な単語として選択する。条件１を満たす単語の候補がなく、かつ条件２を満たす単語の候補が１つであれば、その単語の候補を最適な単語として選択する。条件１を満たす単語の候補が複数あるときまたは条件２を満たす単語の候補が複数あるとき、条件３を満たす単語の候補を最適な単語として選択する。 If there is one word candidate satisfying the condition 1 among a plurality of word candidates, the word string generation unit 21 selects the word candidate as the optimum word. If there is no word candidate satisfying the condition 1 and there is one word candidate satisfying the condition 2, the word candidate is selected as the optimum word. When there are a plurality of word candidates satisfying the condition 1, or when there are a plurality of word candidates satisfying the condition 2, a word candidate satisfying the condition 3 is selected as an optimum word.

たとえば図７に示した第５，６文字目の単語の候補４５２の場合、単語の候補として、「保険」、「振替」、「山陰」および「丘陵」が生成されている。これらの単語の候補は、いずれも条件１を満たさず、条件２を満たすので、条件３によって最適な単語を選択する。それぞれの類似度は、「保険」が「１１８」であり、「振替」が「１１７」であり、「山陰」が「１１４」であり、「丘陵」が「１１６」である。したがって、単語列生成部２１は、これら４つの単語の候補の中から最大の類似度「１１８」である「保険」を最適な単語として選択する。 For example, in the case of the word candidate 452 of the fifth and sixth characters shown in FIG. 7, “insurance”, “transfer”, “San-in”, and “hill” are generated as word candidates. None of these word candidates satisfies condition 1 but satisfies condition 2, and therefore, an optimal word is selected according to condition 3. As for the respective similarities, “insurance” is “118”, “transfer” is “117”, “sanin” is “114”, and “hill” is “116”. Therefore, the word string generation unit 21 selects “insurance” having the maximum similarity “118” from among these four word candidates as the optimum word.

本実施形態では、最適な単語を上述した条件に従って選択したが、このような条件だけに限らず、候補文字のうち第１候補および第２候補の候補文字を優先として、第３候補以降の候補を使用せずに構成することができる単語のみを選択するような構成にしてもよい。 In the present embodiment, the optimum word is selected according to the above-described conditions. However, the candidate is not limited to such a condition, and the first candidate and the second candidate candidate characters are prioritized among the candidate characters, and candidates after the third candidate are selected. It may be configured to select only words that can be configured without using.

図８は、認識結果文字列と単語文字列との比較例４６を示す図である。比較例４６は、上段が文字列生成部１９によって生成された認識結果文字列であり、下段が単語列生成部２１によって生成された単語文字列である。図８に示した下段の単語文字列は、図７に示した単語文字列４５で最適な単語として選択された単語からなる文字列である。比較例４６は、単語文字列の各々の単語を構成する文字数で区切った矩形によって、認識結果文字列と単語文字列とを区切って示している。 FIG. 8 is a diagram illustrating a comparative example 46 between the recognition result character string and the word character string. In the comparative example 46, the upper row is a recognition result character string generated by the character string generator 19, and the lower row is a word character string generated by the word string generator 21. The lower word character string shown in FIG. 8 is a character string made up of words selected as the optimum word in the word character string 45 shown in FIG. In the comparative example 46, the recognition result character string and the word character string are separated by a rectangle divided by the number of characters constituting each word of the word character string.

置き換え判定部２２は、文字列生成部１９で生成された認識結果文字列と単語列生成部２１によって生成された単語文字列とを、単語ごとに先頭文字から一文字ずつ比較して、異なるか否かを判定する。文字が異なるか否かは、たとえば認識結果文字列中の文字の文字コードと、単語文字列中の文字の文字コードとが異なるか否かによって判定することができる。文字コードが異なるときに、文字が異なると判定し、文字コードが一致するときに、文字が異ならないと判定する。文字コードは、たとえば文字認識辞書１４および単語辞書２４のそれぞれに、文字ごとに収録しておくことによって実現可能である。本実施形態では、先頭文字から一文字ずつ比較するが、単語の最後尾の文字から先頭文字の方向に順番に一文字ずつ比較してもよい。 The replacement determination unit 22 compares the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 by comparing one character at a time from the first character for each word. Determine whether. Whether or not the characters are different can be determined, for example, based on whether or not the character code of the character in the recognition result character string is different from the character code of the character in the word character string. When the character codes are different, it is determined that the characters are different, and when the character codes match, it is determined that the characters are not different. The character code can be realized, for example, by recording each character in the character recognition dictionary 14 and the word dictionary 24. In this embodiment, the characters are compared one by one from the first character, but may be compared one by one in order from the last character of the word to the first character.

置き換え判定部２２は、単語を構成するすべての文字について、異なる文字がない場合、つまり同じ文字である場合、単語を構成するすべての文字について、置き換えると判定する。 The replacement determination unit 22 determines that all characters constituting the word are replaced when there are no different characters for all the characters constituting the word, that is, when the characters are the same.

置き換え判定部２２は、異なる文字がある場合、さらに、単語を構成する文字の中で異なると判定された文字と同じ文字が、認識結果文字列を構成する候補文字以外の対応する位置にある候補文字の中にあるか否かを判定する。対応する位置にあるとは、単語文字列中の文字の先頭からの位置と、認識結果文字列の先頭からの位置とが同じである位置にあるということである。単語を構成する文字の中で異なると判定された文字と同じ文字が、認識結果文字列を構成する候補文字以外の対応する位置にある候補文字の中にすべてある場合、置き換え判定部２２は、単語を構成するすべての文字について、置き換えると判定する。 When there are different characters, the replacement determination unit 22 further has a candidate in which the same character as the character determined to be different among the characters constituting the word is at a corresponding position other than the candidate characters constituting the recognition result character string. Determine if it is in a character. To be in the corresponding position means that the position from the beginning of the character in the word character string is the same as the position from the beginning of the recognition result character string. When the same character as the character determined to be different among the characters constituting the word is all in the candidate characters at the corresponding positions other than the candidate characters constituting the recognition result character string, the replacement determining unit 22 It is determined that all characters constituting the word are replaced.

すなわち、単語を構成する文字と同じ文字が、認識結果文字列の対応する位置にある候補文字の中にすべてある場合、置き換え判定部２２は、単語を構成するすべての文字について、置き換えると判定する。 In other words, if all the characters that are the same as the characters that make up the word are among the candidate characters at the corresponding positions in the recognition result character string, the replacement determination unit 22 determines that all the characters that make up the word are replaced. .

本実施形態では、置き換え判定部２２は、このように候補文字の中に一致する候補文字があるか否かを検索したが、単語列生成部２１が単語文字列を生成するとき、各単語が部分一致の単語であり、かつ単語を構成する文字が候補文字に存在しない文字であるか否かを示す情報をメモリ１３に記憶しておき、置き換え判定部２２は、メモリ１３に記憶されるその情報を参照することによって、候補文字にあるか否かを判定してもよい。 In the present embodiment, the replacement determination unit 22 searches for whether there is a matching candidate character among the candidate characters in this way, but when the word string generation unit 21 generates a word character string, each word is Information indicating whether or not a character that is a partially matching word and a character constituting the word is a character that does not exist in the candidate character is stored in the memory 13, and the replacement determination unit 22 stores the information stored in the memory 13. By referring to the information, it may be determined whether the candidate character is present.

置き換え判定部２２は、単語を構成する文字のうちいずれかの文字と同じ文字が、認識結果文字列の対応する位置にある候補文字の中にない場合、つまり単語を構成する文字のうち認識結果文字列の対応する位置にある第１候補〜第５候補の候補文字の中に同じ文字がない場合、単語を構成する文字のうち認識結果文字列の対応する位置にある第１候補〜第５候補の候補文字の中に同じ文字がない文字について、さらに、類似度による置き換えの要否を判定する。以下、単語を構成する文字のうち認識結果文字列の対応する位置にある第１候補〜第５候補の候補文字の中に同じ文字のない文字のことを、単語を構成する文字のうち候補文字の中に同じ文字がない文字という。 If the same character as any one of the characters constituting the word is not among the candidate characters at the corresponding position in the recognition result character string, that is, the replacement determination unit 22 recognizes the recognition result among the characters constituting the word. If there is no same character among the first to fifth candidate characters at the corresponding position of the character string, the first to fifth candidates at the corresponding position of the recognition result character string among the characters constituting the word For characters that do not have the same character among the candidate characters, it is further determined whether or not replacement is necessary based on the similarity. Hereinafter, among the characters constituting the word, the characters that do not have the same character among the first to fifth candidate characters at the corresponding positions in the recognition result character string are the candidate characters among the characters constituting the word. Characters that do not have the same character in

具体的には、置き換え判定部２２は、単語を構成する文字のうち候補文字の中に同じ文字がない文字の特徴量と、認識結果文字列を構成する対応する位置にある候補文字、つまり第１候補の候補文字の特徴量とから、式（３）によって、これらの特徴量の類似度を計算する。認識結果文字列を構成する対応する位置にある候補文字の特徴量は、特徴抽出部１７によって抽出された特徴量であり、メモリ１３に記憶されている。そして、計算した類似度が予め定める閾値以上であるとき、単語を構成する文字のうち候補文字の中に同じ文字がない文字について、置き換えると判定し、計算した類似度が予め定める閾値未満であるとき、単語を構成する文字のうち候補文字の中に同じ文字がない文字について、置き換えないと判定する。単語を構成する文字のうち候補文字の中に同じ文字がある文字については、置き換えると判定する。予め定める閾値は、任意の値を設定することができる。たとえば式（３）で計算される類似度の最大類似度＝１２８の７割の値とする場合、「８９」が予め定める閾値となる。類似度が予め定める閾値以上であることは、予め定める比較基準を満たすことの一例である。 Specifically, the replacement determination unit 22 determines the feature amount of the character that does not have the same character among the candidate characters, and the candidate character at the corresponding position that forms the recognition result character string. Based on the feature amount of one candidate character, the similarity between these feature amounts is calculated by Equation (3). The feature amount of the candidate character at the corresponding position constituting the recognition result character string is the feature amount extracted by the feature extraction unit 17 and stored in the memory 13. Then, when the calculated similarity is equal to or greater than a predetermined threshold, it is determined that a character that does not have the same character among candidate characters among characters constituting the word is determined to be replaced, and the calculated similarity is less than a predetermined threshold At this time, it is determined that a character that does not have the same character among the candidate characters among the characters constituting the word is not replaced. It is determined that a character having the same character among the candidate characters among the characters constituting the word is replaced. An arbitrary value can be set as the predetermined threshold. For example, when the maximum similarity of the degree of similarity calculated by the expression (3) = 70% of 128, “89” is a predetermined threshold value. That the similarity is equal to or higher than a predetermined threshold is an example of satisfying a predetermined comparison criterion.

たとえば、図８に示した単語文字列の単語「入居者」は、図８に示した認識結果文字列の中の文字列「入居青」に対応する単語であり、前から順番に１文字ずつ比較していくと、第３文字目が異なっている。この場合、単語文字列の単語「入居者」の第３文字目の文字「者」が、認識結果文字列の対応する第３文字目の候補文字に存在するかをチェックする。図６（ａ）に示した候補文字４３を参照すると、第３文字目の第１候補の候補文字は、文字「者」とは異なる文字「青」であり、また第２候補以降の候補文字に「者」という文字は存在しない。 For example, the word “resident” in the word character string shown in FIG. 8 is a word corresponding to the character string “resident blue” in the recognition result character string shown in FIG. When compared, the third character is different. In this case, it is checked whether the third character “person” of the word “resident” in the word character string exists in the corresponding third character candidate character in the recognition result character string. Referring to the candidate character 43 shown in FIG. 6A, the third candidate character of the first candidate is the character “blue” different from the character “person”, and the candidate characters after the second candidate Does not have the word “person”.

すなわち、図８に示した例では、２点鎖線４６１で囲まれた文字「青」と文字「者」とが一致せず、単語を構成する文字「者」が、対応する位置にある第１候補〜第５候補の候補文字に存在しないので、置き換え判定部２２は、文字認識辞書１４に収録されている文字「者」の特徴量と、認識結果文字列を構成する対応する候補文字の特徴量とを比較して類似度を計算する。図８に示した例では、認識結果文字列を構成する対応する位置にある候補文字の特徴量とは、図４に示した文字画像４２のうち第３文字目の文字画像が表す文字について、マッチング部１８によって第１候補の候補文字「青」として抽出された文字の特徴量である。特徴抽出部１７によって抽出された特徴量は、メモリ１３に記憶されているので、置き換え判定部２２は、メモリ１３を参照することによって、再度特徴量を計算することなく、特徴抽出部１７によって抽出された特徴量を知ることができる。 That is, in the example shown in FIG. 8, the character “blue” and the character “person” surrounded by the two-dot chain line 461 do not match, and the character “person” constituting the word is in the corresponding position. Since it does not exist in the candidate characters of the candidate to the fifth candidate, the replacement determination unit 22 features the feature amount of the character “person” recorded in the character recognition dictionary 14 and the feature of the corresponding candidate character constituting the recognition result character string. Compare the quantity and calculate the similarity. In the example shown in FIG. 8, the feature amount of the candidate character at the corresponding position constituting the recognition result character string is the character represented by the third character image in the character image 42 shown in FIG. This is the feature amount of the character extracted as the first candidate character “blue” by the matching unit 18. Since the feature amount extracted by the feature extraction unit 17 is stored in the memory 13, the replacement determination unit 22 refers to the memory 13 and extracts it by the feature extraction unit 17 without calculating the feature amount again. It is possible to know the feature amount.

図９は、単語文字列の文字の特徴量および対応する位置にある候補文字の特徴量の一例を示す図である。図９（ａ）は、候補文字「青」の特徴量を示す特徴データ３９ａである。候補文字「青」は、図４に示した文字画像４２のうち第３文字目の文字画像が表す文字について、マッチング部１８によって第１候補の候補文字「青」として抽出された文字である。図９（ｂ）は、文字認識辞書１４に収録されている文字「者」の特徴量を示す特徴データ３２ａである。実際の計算は省略するが、候補文字「青」の特徴量と文字認識辞書１４に収録されている文字「者」の特徴量との類似度は、「１０８」である。ここに、類似度の最大値は１２８である。候補文字「青」および文字認識辞書１４に収録されている文字「者」の例では、置き換え判定部２２は、計算した類似度「１０８」が予め定める閾値＝８９以上の類似度であるので、置き換えると判定する。 FIG. 9 is a diagram illustrating an example of character feature amounts of word character strings and feature amounts of candidate characters at corresponding positions. FIG. 9A shows feature data 39a indicating the feature amount of the candidate character “blue”. The candidate character “blue” is a character extracted by the matching unit 18 as the first candidate candidate character “blue” for the character represented by the third character image in the character image 42 shown in FIG. 4. FIG. 9B is feature data 32 a indicating the feature amount of the character “person” recorded in the character recognition dictionary 14. Although the actual calculation is omitted, the similarity between the feature amount of the candidate character “blue” and the feature amount of the character “person” recorded in the character recognition dictionary 14 is “108”. Here, the maximum value of the similarity is 128. In the example of the candidate character “blue” and the character “person” recorded in the character recognition dictionary 14, the replacement determination unit 22 has a calculated similarity “108” that is a similarity equal to or higher than a predetermined threshold = 89. Determine to replace.

置き換え判定部２２は、単語文字列を構成する文字のうち候補文字の中に同じ文字がない文字を含むすべての単語について、類似度によって置き換えるか否かを判定する。図８に示した例では、単語文字列を構成する文字のうち候補文字の中に同じ文字がない文字は、文字「者」以外に、文字「保」があり、置き換え判定部２２は、２点鎖線４６２で囲まれた文字「振」および文字「保」についても類似度を計算し、計算した類似度に基づいて、置き換えるか否かを判定する。 The replacement determination unit 22 determines whether or not to replace all words including characters that do not have the same character among the candidate characters among the characters constituting the word character string, based on the similarity. In the example shown in FIG. 8, among the characters constituting the word character string, characters that do not have the same character among the candidate characters include the character “preservation” in addition to the character “person”. Similarities are also calculated for the character “wa” and the character “K” enclosed by the dotted line 462, and it is determined whether or not to replace based on the calculated similarity.

図９（ｃ）は、候補文字「振」の特徴量を示す特徴データ３９ｂである。図９（ｄ）は、文字認識辞書１４に収録されている文字「保」の特徴量を示す特徴データ３２ｂである。実際の計算は省略するが、候補文字「振」の特徴量と文字認識辞書１４に収録されている文字「保」の特徴量との類似度は、「１１０」である。置き換え判定部２２は、候補文字「振」の特徴量と文字認識辞書１４に収録されている文字「保」の特徴量との類似度「１１０」が予め定める閾値＝８９以上であるので、置き換えると判定する。 FIG. 9C shows feature data 39b indicating the feature amount of the candidate character “shake”. FIG. 9D shows feature data 32 b indicating the feature amount of the character “K” stored in the character recognition dictionary 14. Although the actual calculation is omitted, the degree of similarity between the feature amount of the candidate character “Ring” and the feature amount of the character “K” stored in the character recognition dictionary 14 is “110”. The replacement determination unit 22 replaces the similarity “110” between the feature amount of the candidate character “shake” and the feature amount of the character “K” recorded in the character recognition dictionary 14 because the predetermined threshold value is 89 or more. Is determined.

本実施形態では、置き換えるか否かの判定を、類似度が予め定める閾値以上であるときに置き換えると判定したが、これに限定されるものではない。たとえば、候補文字の特徴量と文字認識辞書１４に収録されている文字の特徴量とから計算した類似度と、対象となる文字番目の第１候補文字の類似度との差がＮ以下であるときに、置き換えると判定するようにしてもよい。Ｎは、任意の正の整数である。 In the present embodiment, the determination as to whether or not to replace is determined to be performed when the similarity is equal to or higher than a predetermined threshold, but the present invention is not limited to this. For example, the difference between the similarity calculated from the feature amount of the candidate character and the feature amount of the character recorded in the character recognition dictionary 14 and the similarity of the first candidate character of the target character is N or less. Sometimes, it may be determined to replace. N is an arbitrary positive integer.

置き換え部２３は、認識結果文字列のうち置き換えると判定された文字について、認識結果文字列の文字を単語文字列の文字で置き換え、置き換えた認識結果文字列を出力装置３に送信して出力させる。認識結果文字列のすべての文字について置き換えないと判定されたときは、置き換えを行うことなく、認識結果文字列を出力装置３に送信して出力させる。 The replacement unit 23 replaces the character of the recognition result character string with the character of the word character string for the character determined to be replaced in the recognition result character string, and transmits the replaced recognition result character string to the output device 3 for output. . When it is determined not to replace all the characters in the recognition result character string, the recognition result character string is transmitted to the output device 3 and output without replacement.

たとえば図８に示した例では、２点鎖線４６１で囲まれた文字「青」および文字「者」、ならびに２点鎖線４６２で囲まれた文字「振」および文字「保」はいずれも、置き換え判定部２２によって置き換えると判定されているので、置き換え部２３は、認識結果文字列中の文字「青」を単語文字列中の文字「者」で置き換えるとともに、認識結果文字列中の文字「振」を単語文字列中の文字「保」で置き換える。 For example, in the example shown in FIG. 8, the characters “blue” and “person” surrounded by the two-dot chain line 461 and the characters “shake” and character “K” enclosed by the two-dot chain line 462 are replaced. Since it is determined to be replaced by the determination unit 22, the replacement unit 23 replaces the character “blue” in the recognition result character string with the character “person” in the word character string, and also replaces the character “vibration” in the recognition result character string. ”Is replaced with the character“ K ”in the word string.

２点鎖線４６１で囲まれた文字「青」および文字「者」、ならびに２点鎖線４６２で囲まれた文字「振」および文字「保」以外の文字は、いずれも同じ文字であるので、置き換えると判定されており、置き換え部２３は、認識結果文字列の文字を、単語文字列の同じ文字と置き換える。置き換え部２３は、置き換えた認識結果文字列を出力装置３に送信して、出力させる。 Characters other than the characters “blue” and the character “person” enclosed by the two-dot chain line 461 and the characters “wa” and the character “preservation” enclosed by the two-dot chain line 462 are the same characters, and thus are replaced. The replacement unit 23 replaces the character in the recognition result character string with the same character in the word character string. The replacement unit 23 sends the replaced recognition result character string to the output device 3 for output.

すなわち、認識結果文字列の「入居青」が単語文字列の「入居者」に置き換えられ、認識結果文字列の「振険」が単語文字列の「保険」に置き換えられることになる。したがって、置き換え部２３によって、図６（ｂ）に示した認識結果文字列４４、つまり「入居青の振険医療制度について、」が「入居者の保険医療制度について、」に置き換えられ、置き換えられた認識結果文字列である「入居者の保険医療制度について、」が出力装置３に送られて出力される。 That is, the recognition result character string “resident blue” is replaced with the word character string “resident”, and the recognition result character string “insurance” is replaced with the word character string “insurance”. Accordingly, the replacement unit 23 replaces and replaces the recognition result character string 44 shown in FIG. 6B, that is, “about the resident blue tremor medical system” with “about the resident insurance medical system”. The recognition result character string “about the resident's insurance medical system” is sent to the output device 3 and output.

図１０は、情報処理装置１、入力装置２および出力装置３が実行する文字認識処理の処理手順を示すフローチャートである。たとえば、入力装置２に原稿がセットされ、入力装置２の図示しない操作部によって原稿の読み取り指示が行われると、ステップＡ１に移る。 FIG. 10 is a flowchart illustrating a processing procedure of character recognition processing executed by the information processing device 1, the input device 2, and the output device 3. For example, when a document is set on the input device 2 and a document reading instruction is given by an operation unit (not shown) of the input device 2, the process proceeds to step A1.

ステップＡ１では、入力装置２は原稿を画像として読込み、読込んだ画像を表す画像データを情報処理装置１に送信する。情報処理装置１では、制御部１１は、入出力制御部１２によって、入力装置２から送信される画像データを受信し、受信した画像データを文字列行切り出し部１５に送る。ステップＡ２では、文字列行切り出し部１５は、制御部１１から受け取った画像データが表す画像から文字列を表す文字列行画像を行ごとに切り出す。ステップＡ３では、文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列行画像から、文字を１文字ずつ表す矩形画像である文字画像を切り出す。 In step A1, the input device 2 reads the document as an image, and transmits image data representing the read image to the information processing device 1. In the information processing device 1, the control unit 11 receives the image data transmitted from the input device 2 by the input / output control unit 12, and sends the received image data to the character string row cutout unit 15. In step A2, the character string row cutout unit 15 cuts out the character string row image representing the character string from the image represented by the image data received from the control unit 11 for each row. In step A3, the character cutout unit 16 cuts out a character image that is a rectangular image representing characters one by one from the character string row image cut out by the character string row cutout unit 15.

ステップＡ４では、特徴抽出部１７は、文字切り出し部１６によって切り出された文字画像が表す文字の特徴を数値化した特徴量を抽出する。ステップＡ５では、マッチング部１８は、文字認識辞書１４とのマッチングを行う。具体的には、マッチング部１８は、特徴抽出部１７によって抽出された特徴量と、文字認識辞書１４に収録されているすべての文字の特徴量とを比較して、文字画像が表す文字と文字認識辞書１４に収録された文字との類似度を算出し、算出した類似度に基づいて、文字画像が表す文字の候補文字を少なくとも１つ、たとえば第１候補〜第５候補までの候補文字を抽出する。 In step A 4, the feature extraction unit 17 extracts a feature amount obtained by quantifying the character feature represented by the character image cut out by the character cutout unit 16. In step A5, the matching unit 18 performs matching with the character recognition dictionary 14. Specifically, the matching unit 18 compares the feature amount extracted by the feature extraction unit 17 with the feature amount of all the characters recorded in the character recognition dictionary 14 to determine the character and character represented by the character image. The degree of similarity with the characters recorded in the recognition dictionary 14 is calculated, and at least one candidate character for the character represented by the character image, for example, the first to fifth candidate characters is calculated based on the calculated similarity. Extract.

ステップＡ６では、文字列生成部１９は、マッチング部１８によって抽出された候補文字のうち、第１候補の候補文字を左から順に並べた認識結果文字列を生成する。ステップＡ７では、後処理部２０は、第１の後処理を行い、文字認識処理を終了する。 In step A6, the character string generation unit 19 generates a recognition result character string in which candidate characters of the first candidate are arranged in order from the left among the candidate characters extracted by the matching unit 18. In step A7, the post-processing unit 20 performs the first post-processing and ends the character recognition processing.

図１０に示したステップＡ２は、文字列行切り出し工程に相当し、図１０に示したステップＡ３は、文字切り出し工程に相当し、図１０に示したステップＡ４は、特徴抽出工程に相当し、図１０に示したステップＡ５は、選択工程に相当し、図１０に示したステップＡ６は、文字列生成工程に相当する。 Step A2 shown in FIG. 10 corresponds to a character string row cutout process, step A3 shown in FIG. 10 corresponds to a character cutout process, step A4 shown in FIG. 10 corresponds to a feature extraction process, Step A5 shown in FIG. 10 corresponds to a selection process, and step A6 shown in FIG. 10 corresponds to a character string generation process.

図１１は、後処理部２０が実行する第１の後処理の処理手順を示すフローチャートである。図１０に示したステップＡ７が実行されると、ステップＢ１に移る。 FIG. 11 is a flowchart showing the processing procedure of the first post-processing executed by the post-processing unit 20. When step A7 shown in FIG. 10 is executed, the process proceeds to step B1.

ステップＢ１では、単語列生成部２１は、文字列生成部１９によって生成された認識結果文字列から単語文字列を生成する。具体的には、文字列生成部１９によって生成された認識結果文字列を構成する各文字の複数の候補文字を組み合わせて複数生成される文字列と、単語辞書２４に収録されている単語とを照合して単語の候補を生成し、生成した単語の候補の中から最適な単語を選択し、選択した最適な単語から構成される単語文字列を生成する。 In step B1, the word string generation unit 21 generates a word character string from the recognition result character string generated by the character string generation unit 19. Specifically, a character string generated by combining a plurality of candidate characters of each character constituting the recognition result character string generated by the character string generation unit 19 and a word recorded in the word dictionary 24 Word candidates are generated by collation, an optimal word is selected from the generated word candidates, and a word character string composed of the selected optimal word is generated.

ステップＢ２では、置き換え判定部２２は、単語文字列を構成する単語ごとに、単語を構成する文字列内のすべての文字において、認識結果文字列は存在するか否かを判定する。すなわち、初めてステップＢ２を実行するときは、単語文字列を構成する最初の単語を構成するすべての文字について、第２回目以降にステップＢ２を実行するときは、前回処理した単語の次に単語を構成するすべての文字について、認識結果文字列の対応する位置にある第１候補〜第５候補の中に同じ文字があるか否かを判定する。以下、ステップＢ２で単語文字列を構成する単語のうち処理の対象としている単語のことを「対象としている単語」という。 In step B 2, the replacement determination unit 22 determines whether a recognition result character string exists for all characters in the character string constituting the word for each word constituting the word character string. That is, when step B2 is executed for the first time, for all characters constituting the first word constituting the word character string, when step B2 is executed after the second time, the word is processed after the previously processed word. It is determined whether or not there is the same character among the first to fifth candidates at the corresponding position in the recognition result character string for all the constituent characters. Hereinafter, the word which is the target of processing among the words constituting the word character string in step B2 is referred to as “target word”.

単語文字列の単語を構成するすべての文字について、認識結果文字列の対応する位置にある第１候補〜第５候補の中に同じ文字があると、単語を構成する文字列内のすべての文字において、認識結果文字列は存在すると判定し、ステップＢ３に進む。単語文字列の単語を構成するいずれかの文字について、認識結果文字列の対応する位置にある第１候補〜第５候補の中に同じ文字がないと、単語を構成する文字列内のすべての文字において、認識結果文字列は存在しないと判定し、ステップＢ５に進む。 For all characters that constitute a word in the word character string, if there is the same character among the first to fifth candidates at the corresponding position in the recognition result character string, all characters in the character string that constitute the word In step S3, it is determined that the recognition result character string exists, and the process proceeds to step B3. If any of the characters constituting the word of the word character string does not have the same character among the first to fifth candidates at the corresponding position in the recognition result character string, all of the characters in the character string constituting the word In the character, it is determined that the recognition result character string does not exist, and the process proceeds to Step B5.

置き換え判定部２２は、ステップＢ３に進むときは、認識結果文字列のうち判定が行われた文字について、置き換えると判定する。ステップＢ５に進むときは、単語を構成する文字について、認識結果文字列の対応する位置にある第１候補〜第５候補の中に同じ文字があると判定された文字があるときは、認識結果文字列のうちその判定が行われた文字について、置き換えると判定する。 When the process proceeds to step B3, the replacement determination unit 22 determines to replace the determined character in the recognition result character string. When the process proceeds to step B5, if there is a character that is determined to have the same character among the first to fifth candidates at the corresponding position in the recognition result character string for the characters constituting the word, the recognition result It is determined that the character that has been determined in the character string is to be replaced.

ステップＢ３では、置き換え部２３は、対象文字について単語文字を出力する。ステップＢ３での対象文字とは、置き換え部２３によって置き換えると判定された文字である。単語文字とは、単語文字列を構成する文字である。ステップＢ３での出力は、メモリ１３に記憶することである。すなわち、置き換え部２３は、対象としている単語のうち、置き換え部２３によって置き換えると判定された文字を、単語文字列の文字で置き換えて、置き換えた単語をメモリ１３に記憶する。このとき、同じ単語の中に置き換えないと判定された文字があるときは、置き換えないと判定された文字については、置き換えは行われない。 In step B3, the replacement unit 23 outputs a word character for the target character. The target character in step B3 is a character determined to be replaced by the replacement unit 23. A word character is a character which comprises a word character string. The output at step B3 is to be stored in the memory 13. That is, the replacement unit 23 replaces the character determined to be replaced by the replacement unit 23 among the target words with the character of the word character string, and stores the replaced word in the memory 13. At this time, if there is a character determined not to be replaced in the same word, the character determined not to be replaced is not replaced.

ステップＢ４では、単語文字列を構成するすべての単語について処理を行ったか否かを判定する。ステップＢ４での処理とは、ステップＢ２，Ｂ３，Ｂ５〜Ｂ７の処理である。単語文字列を構成するすべての単語について処理を行ったときは、置き換え部２３は、メモリ１３に記憶した単語から構成される認識結果文字列を出力装置３に送信して出力し、第１の後処理を終了する。単語文字列を構成するすべての単語について処理を行っていないときは、ステップＢ２に戻る。 In step B4, it is determined whether or not processing has been performed for all words constituting the word character string. The process in step B4 is the process in steps B2, B3, B5 to B7. When the processing has been performed for all the words constituting the word character string, the replacement unit 23 transmits the recognition result character string composed of the words stored in the memory 13 to the output device 3 and outputs the recognition result character string. End post-processing. When processing has not been performed for all the words constituting the word character string, the process returns to step B2.

ステップＢ５では、置き換え判定部２２は、単語辞書の文字の特徴量と認識文字の特徴量とを比較する。単語辞書の文字とは、ステップＢ２で、単語文字列の単語を構成する文字のうち候補文字の中に同じ文字がないと判定された文字である。認識文字とは、認識結果文字列を構成する候補文字のうち対応する位置にある候補文字、つまり第１候補の候補文字である。すなわち、ステップＢ５では、置き換え判定部２２は、単語文字列の単語を構成する文字のうち候補文字の中に同じ文字がない文字の特徴量と、認識結果文字列の対応する位置にある第１候補の候補文字の特徴量とを比較する。置き換え判定部２２は、比較のために、単語文字列の単語を構成する文字のうち候補文字の中に同じ文字がない文字の特徴量と、認識結果文字列の対応する位置にある第１候補の候補文字の特徴量との類似度を計算する。 In step B5, the replacement determination unit 22 compares the character feature value of the word dictionary with the feature value of the recognized character. The characters in the word dictionary are characters that have been determined in step B2 that no candidate character has the same character among the characters constituting the word in the word character string. The recognized character is a candidate character at a corresponding position among candidate characters constituting the recognition result character string, that is, a candidate character of the first candidate. That is, in step B5, the replacement determination unit 22 sets the first feature character in the character that does not have the same character among the candidate characters among the characters constituting the word of the word character string and the first position at the corresponding position of the recognition result character string. The feature amount of the candidate character candidate is compared. For the purpose of comparison, the replacement determining unit 22 compares the feature amount of characters that do not have the same character among the candidate characters among the characters constituting the words of the word character string, and the first candidate at the corresponding position in the recognition result character string. The similarity with the feature amount of the candidate character is calculated.

ステップＢ６では、置き換え判定部２２は、特徴量の比較から、予め定める閾値以上の類似性が得られたか否かを判定する。ステップＢ５で計算した類似度が予め定める閾値以上であるとき、特徴量の比較から、予め定める閾値以上の類似性が得られたと判定し、ステップＢ３に進む。ステップＢ５で計算した類似度が予め定める閾値未満であるとき、特徴量の比較から、予め定める閾値以上の類似性が得られなかったと判定し、ステップＢ７に進む。 In step B 6, the replacement determination unit 22 determines whether similarity equal to or greater than a predetermined threshold is obtained from the comparison of feature amounts. When the similarity calculated in step B5 is equal to or greater than a predetermined threshold, it is determined from the feature amount comparison that similarity equal to or greater than the predetermined threshold is obtained, and the process proceeds to step B3. When the degree of similarity calculated in step B5 is less than a predetermined threshold, it is determined from the feature amount comparison that no similarity equal to or higher than the predetermined threshold has been obtained, and the process proceeds to step B7.

置き換え判定部２２は、ステップＢ３に進むとき、認識結果文字列のうち予め定める閾値以上の類似性が得られたと判定された文字について、置き換えると判定する。ステップＢ７に進むときは、認識結果文字列のうち予め定める閾値以上の類似性が得られなかったと判定された文字について、置き換えないと判定する。 When the process proceeds to step B3, the replacement determination unit 22 determines to replace a character determined to have a similarity equal to or greater than a predetermined threshold in the recognition result character string. When the process proceeds to step B7, it is determined that the character that is determined to have no similarity equal to or greater than a predetermined threshold in the recognition result character string is not replaced.

ステップＢ７では、置き換え部２３は、対象文字について認識文字を出力して、ステップＢ４に進む。ステップＢ７での対象文字とは、置き換え部２３によって置き換えないと判定された文字である。ステップＢ７での出力は、メモリ１３に記憶することである。すなわち、置き換え部２３は、対象としている単語のうち、置き換え部２３によって置き換えないと判定された文字を、単語文字列の文字で置き換えることなく、認対象としている単語をメモリ１３に記憶する。このとき、同じ単語の中に置き換えると判定された文字があるときは、置き換えると判定された文字については、認識結果文字列の文字を単語文字列の文字で置き換える。 In step B7, the replacement unit 23 outputs a recognized character for the target character, and proceeds to step B4. The target character in step B7 is a character determined not to be replaced by the replacement unit 23. The output at step B7 is to be stored in the memory 13. That is, the replacement unit 23 stores the word to be recognized in the memory 13 without replacing the characters determined to be not replaced by the replacement unit 23 among the target words with the characters of the word character string. At this time, if there is a character determined to be replaced in the same word, the character in the recognition result character string is replaced with the character in the word character string for the character determined to be replaced.

図１１に示したステップＢ１は、単語列生成工程に相当し、図１１に示したステップＢ２，Ｂ４〜Ｂ６は、置き換え判定工程に相当し、図１１に示したステップＢ３，Ｂ７は、置き換え工程に相当する。 Step B1 shown in FIG. 11 corresponds to a word string generation process, steps B2, B4 to B6 shown in FIG. 11 correspond to a replacement determination process, and steps B3 and B7 shown in FIG. It corresponds to.

本実施形態では、置き換え部２３は、置き換えられた文字および置き換えられていない文字から構成される単語を一旦メモリ１３に記憶し、すべての単語について処理が行われた後、メモリ１３に記憶した単語から認識結果文字列を構成して、出力装置３に送信して出力したが、メモリ１３に記憶することなく、ステップＢ３あるいはステップＢ７で、置き換えられた文字および置き換えられていない文字から構成される単語を、単語ごとに出力装置３に送信して出力する構成としてもよい。 In the present embodiment, the replacement unit 23 temporarily stores words composed of replaced characters and non-replaced characters in the memory 13, and after all the words have been processed, the words stored in the memory 13. Although the recognition result character string is constructed from the above and transmitted to the output device 3 and outputted, it is composed of the replaced character and the non-replaced character in step B3 or step B7 without being stored in the memory 13. It is good also as a structure which transmits and outputs the word to the output device 3 for every word.

次に、本発明の第２の実施形態である情報処理装置１ａについて説明する。本発明に係る情報処理方法は、情報処理装置１ａによって実行される。情報処理装置１ａの構成は、後処理部２０ａを除き、図１に示した情報処理装置１の構成と同じであり、重複を避けるために、以下相違する部分についてのみ説明する。後処理部２０ａが実行する処理を、以下第２の後処理という。後処理部２０ａは、置き換え判定部２２ａを除き、図１に示した後処理部２０の構成と同じであり、重複を避けるために、以下相違する部分についてのみ説明する。 Next, the information processing apparatus 1a which is the 2nd Embodiment of this invention is demonstrated. The information processing method according to the present invention is executed by the information processing apparatus 1a. The configuration of the information processing apparatus 1a is the same as the configuration of the information processing apparatus 1 shown in FIG. 1 except for the post-processing unit 20a, and only different parts will be described below in order to avoid duplication. The processing executed by the post-processing unit 20a is hereinafter referred to as second post-processing. The post-processing unit 20a is the same as the configuration of the post-processing unit 20 shown in FIG. 1 except for the replacement determination unit 22a, and only different parts will be described below in order to avoid duplication.

置き換え判定部２２ａは、最初、単語列生成部２１により生成された単語文字列を構成する単語について類似単語が抽出されていたか否かをメモリ１３に記憶する。類似単語は、単語列生成部２１によって単語文字列が生成される際、単語の候補が複数生成されたとき、その複数の単語の候補のうち類似度が最も高い単語の候補を除く残余の単語の候補のことである。この処理は、第１の実施形態の置き換え判定部２２では行われなかった処理である。 The replacement determination unit 22a first stores in the memory 13 whether or not similar words have been extracted for the words constituting the word character string generated by the word string generation unit 21. When a word character string is generated by the word string generator 21, when a plurality of word candidates are generated, the remaining words other than the word candidate having the highest similarity among the plurality of word candidates are generated. It is a candidate. This process is a process that was not performed by the replacement determination unit 22 of the first embodiment.

次に、置き換え判定部２２ａは、第１の実施形態の置き換え判定部２２と同様に、単語文字列の単語を構成する文字と同じ文字が、認識結果文字列の対応する位置にある候補文字の中にあるか否かを判定する。単語文字列の単語を構成する文字と同じ文字が、認識結果文字列の対応する位置にある候補文字の中にすべてある場合、置き換え判定部２２は、単語を構成するすべての文字について、置き換えると判定する。 Next, similarly to the replacement determination unit 22 of the first embodiment, the replacement determination unit 22a detects the candidate character that has the same character as the character constituting the word of the word character string at the corresponding position in the recognition result character string. Determine if it is inside. When the same characters as the characters constituting the word in the word character string are all in the candidate characters at the corresponding positions in the recognition result character string, the replacement determination unit 22 replaces all the characters constituting the word. judge.

単語を構成する文字のうち認識結果文字列の対応する位置にある候補文字の中に同じ文字がない場合、置き換え判定部２２ａは、第１の実施形態の置き換え判定部２２と同様に、単語を構成する文字のうち候補文字の中に同じ文字がない文字の特徴量と、認識結果文字列を構成する対応する候補文字、つまり第１候補の候補文字の特徴量とから、式（３）によって、これらの特徴量の類似度を計算する。そして、計算した類似度が予め定める閾値以上であるとき、単語を構成する文字のうち候補文字の中に同じ文字がない文字に対応する位置にある認識結果文字列の文字について、置き換えると判定する。 When there is no same character among the candidate characters at the corresponding position in the recognition result character string among the characters constituting the word, the replacement determination unit 22a selects the word as in the replacement determination unit 22 of the first embodiment. From the feature amount of characters that do not have the same character among the candidate characters and the corresponding candidate characters constituting the recognition result character string, that is, the feature amount of the candidate character of the first candidate, among the constituent characters, according to equation (3) The similarity between these feature quantities is calculated. Then, when the calculated similarity is equal to or greater than a predetermined threshold, it is determined to replace a character in the recognition result character string at a position corresponding to a character that does not have the same character among candidate characters among characters constituting the word. .

計算した類似度が予め定める閾値未満であるとき、置き換え判定部２２ａは、第１の実施形態の置き換え判定部２２とは異なり、さらに、計算した類似度が予め定める閾値未満であると判定された文字を含む単語に類似単語があるか否かを、メモリ１３を参照して判定する。計算した類似度が予め定める閾値未満であると判定された文字を含む単語に類似単語がない場合、計算した類似度が予め定める閾値未満であると判定された文字を含む単語は、類似度が高い場合と同様に確信度が高いと判定し、置き換えると判定する。計算した類似度が予め定める閾値未満であると判定された文字を含む単語に類似単語がある場合、置き換えないと判定する。
以下、具体的な例示として、図３に示した文字列行画像４１２を用いて、第２の実施形態である情報処理装置１ａの処理を説明する。 When the calculated similarity is less than a predetermined threshold, the replacement determination unit 22a is different from the replacement determination unit 22 of the first embodiment, and it is further determined that the calculated similarity is less than a predetermined threshold. It is determined with reference to the memory 13 whether or not there are similar words in the words including characters. When there is no similar word in a word including a character determined to have a calculated similarity less than a predetermined threshold, a word including a character determined to have a calculated similarity less than a predetermined threshold has a similarity It is determined that the certainty level is high as in the case of high, and it is determined to replace. When there is a similar word in a word including a character that has been determined that the calculated similarity is less than a predetermined threshold, it is determined not to be replaced.
Hereinafter, as a specific example, the processing of the information processing apparatus 1a according to the second embodiment will be described using the character string row image 412 illustrated in FIG.

図１２は、文字切り出し部１６によって切り出された文字画像５２の一例を示す図である。文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列行画像から、文字を１文字ずつ矩形画像として切り出す。図１２に示した文字画像５２の例には、図３に示した文字列行画像４１２から切り出された２２文字の文字画像が示されている。最初の文字画像５２１は、「協」という文字を切り出した文字画像であり、第２番目以降に「力」、「，」、「理」、「解」、「を」、「得」、「る」、「た」、「め」、「に」、「事」、「前」、「に」、「説」、「明」、「会」、「を」、「行」、「っ」、「た」および「。」の各文字の文字画像が示されている。 FIG. 12 is a diagram illustrating an example of the character image 52 cut out by the character cutout unit 16. The character cutout unit 16 cuts out characters one by one as a rectangular image from the character string row image cut out by the character string row cutout unit 15. In the example of the character image 52 shown in FIG. 12, a character image of 22 characters cut out from the character string row image 412 shown in FIG. 3 is shown. The first character image 521 is a character image obtained by cutting out the character “Kyo”, and “power”, “,”, “reason”, “solution”, “o”, “gain”, “ ``, '' `` Ta '', `` me '', `` ni '', `` thing '', `` before '', `` ni '', `` theory '', `` akira '', `` meeting '', `` to '', `` row '', `` tsu '' , “Ta” and “.” Are shown as character images.

図１３は、候補文字５３および認識結果文字列５４の一例を示す図である。図１３（ａ）は、マッチング部１８によって抽出された候補文字５３の一例を示す。候補文字５３は、文字画像５２が表す文字として、マッチング部１８によって抽出された候補文字である。図１３（ａ）に示した候補文字５３の例では、最上段に入力文字、つまり候補文字の正解文字として、「協」、「力」、「，」、「理」、「解」、「を」、「得」、「る」、「た」、「め」、「に」、「事」、「前」、「に」、「説」、「明」、「会」、「を」、「行」、「っ」、「た」および「。」が示されている。第１候補〜第５候補については、たとえば正解文字「協」については、第１候補「協１２１」、第２候補「脇１１８」、第３候補「摺１１０」、第４候補「堪１０８」および第５候補「洋１０４」が示されている。 FIG. 13 is a diagram illustrating an example of the candidate character 53 and the recognition result character string 54. FIG. 13A shows an example of the candidate character 53 extracted by the matching unit 18. The candidate character 53 is a candidate character extracted by the matching unit 18 as a character represented by the character image 52. In the example of the candidate character 53 shown in FIG. 13A, “Kyo”, “Power”, “,”, “Science”, “Solution”, “ "," "Gain", "ru", "ta", "me", "ni", "thing", "previous", "ni", "theory", "akira", "meeting", "to" , “Line”, “tsu”, “ta” and “.” Are shown. As for the first to fifth candidates, for example, for the correct character “Kyo”, the first candidate “Kyo 121”, the second candidate “Waki 118”, the third candidate “Suri 110”, and the fourth candidate “Enjoyment 108”. And the fifth candidate “Western 104” is shown.

図１３（ｂ）は、文字列生成部１９によって生成された認識結果文字列５４の一例を示す。文字列生成部１９は、マッチング部１８によって抽出された候補文字５３に示される候補文字のうち、第１候補の文字を左から順に並べた認識結果文字列５４を生成する。文字列生成部１９は、後処理部２０ａのために、候補文字４３をメモリ１３に記憶しておく。図１３（ｂ）に示した例では、認識結果文字列５４として「協力，埋斛を得るあめに事間に説明会を行った。」なる文字列が示されている。後処理部２０ａによる処理を行わない場合は、この認識結果文字列５４を、認識結果として出力することになる。本実施形態では、この後、後処理部２０ａによって第２の後処理が実行される。 FIG. 13B shows an example of the recognition result character string 54 generated by the character string generation unit 19. The character string generation unit 19 generates a recognition result character string 54 in which the first candidate characters are arranged in order from the left among the candidate characters indicated by the candidate character 53 extracted by the matching unit 18. The character string generation unit 19 stores candidate characters 43 in the memory 13 for the post-processing unit 20a. In the example shown in FIG. 13B, the recognition result character string 54 is a character string “A briefing was held between days to obtain cooperation and buried”. When the processing by the post-processing unit 20a is not performed, the recognition result character string 54 is output as a recognition result. In the present embodiment, thereafter, the second post-processing is executed by the post-processing unit 20a.

第２の後処理では、まず、単語列生成部２１は、第１の実施形態と同様に、認識結果文字列の第１文字目から順に、各候補文字について、単語辞書２４に収録されている単語に完全に一致する文字列、もしくは部分的に一致する文字列を単語の候補として生成する。このようにして生成された単語文字列の例を、図１４に示す。 In the second post-processing, first, the word string generation unit 21 stores each candidate character in the word dictionary 24 in order from the first character of the recognition result character string, as in the first embodiment. A character string that completely matches or partially matches a word is generated as a word candidate. An example of the word character string generated in this way is shown in FIG.

第２の実施形態では、第１の実施形態とは異なり、単語列生成部２１は、単語の候補が複数生成されたか否かを，後処理部２０ａのために，メモリ１３に記憶しておく。具体的には、生成した単語文字列に含まれる単語の候補が、複数の単語の候補が生成された単語の候補の１つである場合は、類似単語が抽出されたことをメモリ１３に記憶し、生成した単語文字列に含まれる単語の候補が、１つの単語の候補しか生成されなかった単語の候補である場合は、類似単語が抽出されないことをメモリ１３に記憶する。類似単語は、生成した複数の単語の候補のうち生成した単語文字列に含まれない単語の候補のことである。 In the second embodiment, unlike the first embodiment, the word string generation unit 21 stores in the memory 13 whether or not a plurality of word candidates are generated for the post-processing unit 20a. . Specifically, when the word candidate included in the generated word character string is one of the word candidates from which a plurality of word candidates are generated, the fact that similar words have been extracted is stored in the memory 13. If the word candidate included in the generated word character string is a word candidate for which only one word candidate has been generated, the memory 13 stores that similar words are not extracted. The similar word is a word candidate that is not included in the generated word character string among the plurality of generated word candidates.

図１４は、単語列生成部２１によって生成された単語文字列５５の一例を示す図である。図１４に示した単語文字列５５の例には、最適な単語たとえば単語５５１、および複数生成された単語の候補たとえば単語の候補５５２，５５３が混在している。単語の候補５５２には、２つの単語の候補「得る」および「得ろ」が生成され、単語の候補５５３には、２つの単語の候補「事前」および「専門」が生成されている。 FIG. 14 is a diagram illustrating an example of the word character string 55 generated by the word string generation unit 21. In the example of the word character string 55 shown in FIG. 14, an optimal word such as a word 551 and a plurality of generated word candidates such as word candidates 552 and 553 are mixed. Two word candidates “Get” and “Get” are generated for the word candidate 552, and two word candidates “advance” and “specialty” are generated for the word candidate 553.

次に、単語列生成部２１は、単語文字列５５のうち複数の単語の候補があるものについて最適な単語を選択し、最適な単語の候補のみからなる単語文字列を生成する。図１４に示した例では、単語の候補５５２、５５３に複数の単語の候補があるので、単語列生成部２１は、第１の実施形態と同じ３つの条件で優先順位をつけて、最適な単語を選択する。 Next, the word string generation unit 21 selects an optimal word for a word character string 55 having a plurality of word candidates, and generates a word character string including only the optimal word candidates. In the example shown in FIG. 14, since there are a plurality of word candidates in the word candidates 552 and 553, the word string generation unit 21 assigns priorities under the same three conditions as in the first embodiment, Select a word.

たとえば図１４に示した例では、単語の候補５５２「得る」および「得ろ」は、ともに条件１を満たすので、条件３によって最適な単語を選択する。「得る」および「得ろ」のうち、「得る」の方が類似度が大きいので、「得る」が選択される。「事前」および「専門」については、「事前」は条件１を満たすが、「専門」は条件１を満たさず、条件２を満たすので、優先順位の高い条件を満たす「事前」が選択される。 For example, in the example shown in FIG. 14, the word candidates 552 “Get” and “Get” both satisfy the condition 1, so the optimum word is selected according to the condition 3. Of “Get” and “Get”, “Get” has a higher similarity, so “Get” is selected. For “advanced” and “specialized”, “advanced” satisfies condition 1, but “advanced” does not satisfy condition 1 but satisfies condition 2; therefore, “advanced” that satisfies a high priority condition is selected. .

図１５は、認識結果文字列と単語文字列との比較例５６を示す図である。比較例５６は、上段が文字列生成部１９によって生成された認識結果文字列であり、下段が単語列生成部２１によって生成された単語文字列である。図１５に示した下段の単語文字列は、図１４に示した単語文字列５５で最適な単語として選択された単語からなる文字列である。比較例５６は、単語文字列の各々の単語を構成する文字数で区切った矩形によって、認識結果文字列と単語文字列とを区切って示している。 FIG. 15 is a diagram illustrating a comparative example 56 of the recognition result character string and the word character string. In the comparative example 56, the upper row is a recognition result character string generated by the character string generator 19, and the lower row is a word character string generated by the word string generator 21. The lower word character string shown in FIG. 15 is a character string made up of words selected as optimum words in the word character string 55 shown in FIG. In the comparative example 56, the recognition result character string and the word character string are separated by a rectangle divided by the number of characters constituting each word of the word character string.

続いて、置き換え判定部２２ａは、文字列生成部１９で生成された認識結果文字列と単語列生成部２１により生成された単語文字列とを、単語ごとに先頭文字から一文字ずつ比較して、置き換えるか否かを判定する。置き換えるか否かの判定は、以下を除き、第１の実施形態と同じである。 Subsequently, the replacement determination unit 22a compares the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 one by one from the first character for each word, Determine whether to replace. The determination as to whether or not to replace is the same as in the first embodiment except for the following.

第１の実施形態では、置き換え判定部２２は、計算した類似度が予め定める閾値未満であるとき、単語を構成する文字のうち候補文字の中に同じ文字がない文字について、置き換えないと判定したが、第２の実施形態では、置き換え判定部２２ａは、計算した類似度が予め定める閾値未満であるとき、類似単語が抽出されたか否かを判定する。置き換え判定部２２ａは、メモリ１３を参照し、メモリ１３に類似単語が抽出されたことが記憶されていないと、置き換えると判定し、メモリ１３に類似単語が抽出されたことが記憶されていると、置き換えないと判定する。 In the first embodiment, when the calculated similarity is less than a predetermined threshold, the replacement determination unit 22 determines that a character that does not have the same character among the candidate characters among the characters constituting the word is not replaced. However, in the second embodiment, the replacement determination unit 22a determines whether a similar word has been extracted when the calculated similarity is less than a predetermined threshold. If the replacement determination unit 22a refers to the memory 13 and does not store that the similar word is extracted in the memory 13, the replacement determination unit 22a determines that the replacement is performed and stores that the similar word is extracted in the memory 13. It is determined that it will not be replaced.

たとえば図１５に示した例について、置き換え判定部２２ａによる置き換えるか否かの判定を説明する。前から順番に文字を比較していくと第４文字目と第５文字目、つまり２点鎖線５６１で囲んだ部分にある文字が異なる。すなわち、認識結果文字列では「埋斛」であり、単語文字列では「理解」である。この実施形態では、認識結果文字列と単語文字列とを前から順番に比較するが、後ろから順番に比較するような構成でもよい。 For example, in the example shown in FIG. 15, determination as to whether or not the replacement determination unit 22a replaces will be described. When the characters are compared in order from the front, the characters in the fourth character and the fifth character, that is, the portion surrounded by the two-dot chain line 561 are different. That is, the recognition result character string is “buried”, and the word character string is “understanding”. In this embodiment, the recognition result character string and the word character string are compared in order from the front, but may be configured to compare in order from the back.

置き換え判定部２２ａは、まず単語文字列中の第４文字目の「理」という文字が、認識結果文字列の対応する位置にある候補文字に存在するか否かをチェックする。図１３（ａ）に示した候補文字５３には、第４文字目の第２候補の候補文字に「理」という文字がある。したがって、「理」という文字は候補文字にあるので、「理」という文字については、それ以上の処理は行わない。次に、第５文字目の「解」という文字が、認識結果文字列の対応する位置にある候補文字に存在するか否かをチェックする。図１３（ａ）に示した候補文字５３には、第５文字目の第１候補の候補文字は「斛」であり、第２候補の候補文字以降にも「解」という文字は存在しない。 The replacement determination unit 22a first checks whether or not the fourth character “Ri” in the word character string exists in the candidate character at the corresponding position in the recognition result character string. In the candidate character 53 shown in FIG. 13A, there is a character “RI” in the candidate character of the second candidate of the fourth character. Therefore, since the character “RI” is in the candidate character, no further processing is performed on the character “RI”. Next, it is checked whether or not the fifth character “solution” exists in the candidate character at the corresponding position in the recognition result character string. In the candidate character 53 shown in FIG. 13A, the candidate character of the first candidate of the fifth character is “斛”, and the character “solution” does not exist after the candidate character of the second candidate.

単語「理解」を構成する「解」という文字が認識結果文字列の候補文字の中に含まれていないので、置き換え判定部２２ａは、次に、類似度による置き換えるか否かの判定を行う。このとき、仮に、単語「理解」のすべての文字が認識結果文字列の候補文字に含まれている場合は、置き換えると判定する。 Since the character “solution” constituting the word “understanding” is not included in the candidate characters of the recognition result character string, the replacement determination unit 22a next determines whether or not to replace based on the similarity. At this time, if all characters of the word “understanding” are included in the candidate characters of the recognition result character string, it is determined to be replaced.

置き換え判定部２２ａは、単語文字列中の文字「解」が、認識結果文字列内の対応する位置にある候補文字に存在しないので、文字認識辞書１４に収録されている文字「解」の特徴量と、図１２に示した文字画像５２のうち第５文字目の文字画像から抽出された特徴量とを比較して、類似度を抽出する。図１２に示した文字画像５２のうち第５文字目の文字画像から抽出された特徴量は、メモリ１３に記憶されている。 Since the character “solution” in the word character string does not exist in the candidate character at the corresponding position in the recognition result character string, the replacement determination unit 22a has the feature of the character “solution” recorded in the character recognition dictionary 14. The degree of similarity is extracted by comparing the amount with the feature amount extracted from the character image of the fifth character in the character image 52 shown in FIG. The feature amount extracted from the character image of the fifth character in the character image 52 shown in FIG. 12 is stored in the memory 13.

図１６は、単語文字列の文字の特徴量および対応する位置にある候補文字の特徴量の一例を示す図である。図１６（ａ）は、候補文字「斛」の特徴量を示す特徴データ３９ｃである。候補文字「斛」は、図１２に示した文字画像５２のうち第５文字目の文字画像が表す文字について、マッチング部１８によって第１候補の候補文字「斛」として抽出された文字である。図１６（ｂ）は、文字認識辞書１４に収録されている文字「解」の特徴量を示す特徴データ３２ｃである。式（３）による実際の計算は省略するが、候補文字「斛」の特徴量と文字認識辞書１４に収録されている文字「解」の特徴量との類似度は、「１１２」である。ここに、類似度の最大値は１２８である。 FIG. 16 is a diagram illustrating an example of character feature amounts of word character strings and feature amounts of candidate characters at corresponding positions. FIG. 16A shows feature data 39c indicating the feature amount of the candidate character “斛”. The candidate character “斛” is a character extracted by the matching unit 18 as the first candidate candidate character “斛” for the character represented by the fifth character image in the character image 52 shown in FIG. 12. FIG. 16B is feature data 32 c indicating the feature amount of the character “solution” recorded in the character recognition dictionary 14. Although the actual calculation by Expression (3) is omitted, the similarity between the feature amount of the candidate character “斛” and the feature amount of the character “solution” recorded in the character recognition dictionary 14 is “112”. Here, the maximum value of the similarity is 128.

予め定める閾値は、任意の値として設定可能であるが、たとえば予め定める閾値を第１候補の候補文字との類似度の差が５以内であるとする場合、第５文字目の第１候補の候補文字は「斛」であり、文字「斛」の類似度は、図１３（ａ）に「１１９」と示されているので、予め定める閾値は「１１９」から「５」を減算した「１１４」になる。第５文字目に対応する文字画像と文字「解」との類似度「１１２」は、予め定める閾値＝１１４未満であり、類似単語の有無の判定が行われる。予め定める閾値以上の類似度である場合は置き換えると判定される。 The predetermined threshold value can be set as an arbitrary value. For example, when the predetermined threshold value is less than 5 in similarity with the first candidate candidate character, the first candidate of the fifth character The candidate character is “斛”, and the similarity of the character “斛” is indicated as “119” in FIG. 13A. Therefore, the predetermined threshold is “114” obtained by subtracting “5” from “119”. "become. The similarity “112” between the character image corresponding to the fifth character and the character “solution” is less than a predetermined threshold = 114, and the presence / absence of a similar word is determined. If the degree of similarity is equal to or greater than a predetermined threshold, it is determined to replace.

類似単語の有無の判定では、置き換え判定部２２ａは、「理解」という単語のほかに類似単語が抽出されていたか否かを、メモリ１３を参照して判定する。類似単語が抽出されたことがメモリ１３に記憶されていなければ、「埋斛」という文字列に対して、「理解」という単語は確信度が高いとして、予め定める閾値以上の類似度であったときと同じ処理を行うと判断し、置き換えると判定する。もし、類似単語が抽出されたことがメモリ１３に記憶されていれば、「埋斛」という文字列に対して、「理解」という単語は確信度が高くないとして、予め定める閾値未満の類似度であったときと同じ処理を行うと判断し、置き換えないと判定する。 In determining whether or not there is a similar word, the replacement determining unit 22a refers to the memory 13 to determine whether or not a similar word has been extracted in addition to the word “understanding”. If it is not stored in the memory 13 that the similar word has been extracted, the word “understanding” has a high degree of certainty with respect to the character string “buried”, and the degree of similarity is equal to or higher than a predetermined threshold. It is determined that the same processing is performed, and replacement is determined. If it is stored in the memory 13 that similar words have been extracted, it is determined that the word “understanding” is not highly reliable with respect to the character string “buried”, and the degree of similarity is less than a predetermined threshold. It is determined that the same processing is performed as in the case of, and it is determined not to be replaced.

第２の実施形態では、類似単語が抽出されたことがメモリ１３に記憶されていなければ、無条件に確信度が高いとしたが、無条件に確信度が高いとするのではなく、文字画像と文字「解」との類似度を判定する予め定める閾値を最大類似度＝１２８の５割に再設定するなどして、予め定める閾値を低く設定し直して判定する構成としてもよい。 In the second embodiment, if the fact that similar words have been extracted is not stored in the memory 13, the certainty factor is unconditionally high. However, the certainty factor is not unconditionally high. For example, the predetermined threshold value for determining the similarity between the character “solution” and the character “solution” may be reset to 50% of the maximum similarity = 128.

第２の後処理では、最後に、置き換え部２３は、第１の実施形態と同様に、認識結果文字列のうち置き換えると判定された文字について、認識結果文字列の文字を単語文字列の文字で置き換え、置き換えた認識結果文字列を出力装置３に送信して出力させる。認識結果文字列のすべての文字について置き換えないと判定されたときは、置き換えを行うことなく、認識結果文字列を出力装置３に送信して出力させる。 In the second post-processing, finally, the replacement unit 23 replaces the characters in the recognition result character string with the characters in the word character string for the characters determined to be replaced in the recognition result character string, as in the first embodiment. And the replaced recognition result character string is transmitted to the output device 3 for output. When it is determined not to replace all the characters in the recognition result character string, the recognition result character string is transmitted to the output device 3 and output without replacement.

図１５に示した単語文字列のうち「理解」という単語は、類似単語が抽出されたことがメモリ１３に記憶されていないので、図１５に示した認識結果文字列のうち「埋斛」という文字列に対して、「理解」という単語は確信度が高いとして、予め定める閾値以上の類似度であったときと同様に、置き換えると判定される。したがって、置き換え部２３は、認識結果文字列のうち「埋斛」という文字列を、単語文字列中の「理解」という単語に置き換えて、出力装置３に送信して出力する。すなわち、単語文字列中の単語「理解」を出力することによって、「埋斛」として誤認識された単語を正しくコレクト、つまり修正することができる。 Since the word “understanding” in the word character string shown in FIG. 15 is not stored in the memory 13 that the similar word is extracted, it is called “buried” in the recognition result character string shown in FIG. For the character string, it is determined that the word “understanding” has a high certainty factor, and is replaced in the same manner as when the similarity is equal to or higher than a predetermined threshold. Therefore, the replacement unit 23 replaces the character string “buried” in the recognition result character string with the word “understanding” in the word character string, and transmits it to the output device 3 for output. That is, by outputting the word “understanding” in the word character string, the word erroneously recognized as “buried” can be correctly corrected, that is, corrected.

図１５に示した例では、認識結果文字列中の「埋斛」の「埋」と単語文字列中の「理解」の「理」と以外に、認識結果文字列中の「あめに」の「あ」と単語文字列中の「ために」の「た」と、また、認識結果文字列中の「事間」の「間」と単語文字列中の「事前」の「前」との文字がそれぞれ異なるので、類似度による置き換えの要否が判定される。 In the example shown in FIG. 15, in addition to “buried” of “buried” in the recognition result character string and “reason” of “understanding” in the word character string, “ameni” in the recognition result character string “A” and “Ta” of “For” in the word string, “Between” of “Things” in the recognition result string, and “Previous” of “Pre” in the word string Since the characters are different from each other, it is determined whether or not replacement is necessary based on the similarity.

これらの比較結果についての説明は省略するが、第２の実施形態においては、仮に認識結果文字列中の文字と単語文字列中の文字とが異なって、類似度が予め定める閾値未満であったとしても、単語文字列を構成する単語の確信度が高ければ、無条件に認識結果文字列中の文字を単語文字列中の文字で置き換えることよって、正しい認識結果を得ることができる。あるいは、仮に認識結果文字列中の文字と単語文字列中の文字とが異なって、類似度が予め定める閾値未満であったとしても、予め定める閾値を低く設定し直して、類似度が設定し直した予め定める閾値以上であるとき、単語文字列を構成する単語の確信度が高いとして、認識結果文字列中の文字を単語文字列中の文字で置き換えることによって、正しい認識結果を得ることができる。 Although explanations of these comparison results are omitted, in the second embodiment, the characters in the recognition result character string are different from the characters in the word character string, and the similarity is less than a predetermined threshold. However, if the certainty of the words constituting the word character string is high, a correct recognition result can be obtained by unconditionally replacing the characters in the recognition result character string with the characters in the word character string. Alternatively, even if the character in the recognition result character string is different from the character in the word character string, and the similarity is less than a predetermined threshold, the predetermined threshold is reset to a lower value and the similarity is set. When the corrected threshold value is equal to or higher than the predetermined threshold, it is possible to obtain a correct recognition result by replacing characters in the recognition result character string with characters in the word character string, assuming that the certainty of the words constituting the word character string is high. it can.

図１７は、後処理部２０ａが実行する第２の後処理の処理手順を示すフローチャートである。情報処理装置１ａ、入力装置２および出力装置３が実行する文字認識処理は、情報処理装置１、入力装置２および出力装置３が実行する文字認識処理と同じであり、重複を避けるために説明は省略する。第２の実施形態では、図１０に示したステップＡ７が実行されると、ステップＣ１に移る。ステップＣ１，Ｃ３〜Ｃ６，Ｃ９は、それぞれ図１１に示したステップＢ１〜Ｂ５，Ｂ７と同じであり、説明は省略する。 FIG. 17 is a flowchart showing the procedure of the second post-processing executed by the post-processing unit 20a. The character recognition processing executed by the information processing device 1a, the input device 2 and the output device 3 is the same as the character recognition processing executed by the information processing device 1, the input device 2 and the output device 3, and will not be described in order to avoid duplication. Omitted. In the second embodiment, when step A7 shown in FIG. 10 is executed, the process proceeds to step C1. Steps C1, C3 to C6 and C9 are the same as steps B1 to B5 and B7 shown in FIG.

ステップＣ２では、単語列生成部２１は、生成した単語文字列を構成する単語において類似単語が抽出されたか否かを、生成した単語文字列を構成する単語ごとに、メモリ１３に記憶する。すなわち、生成した単語文字列に含まれる単語が、生成された複数の単語の候補の中の１つの単語の候補であるとき、類似単語が抽出されたことをメモリ１３に記憶し、生成した単語文字列に含まれる単語が、１つのみ生成された単語の候補であるとき、類似単語が抽出されないことをメモリ１３に記憶する。 In Step C2, the word string generation unit 21 stores, in the memory 13, for each word constituting the generated word character string, whether or not a similar word has been extracted from the words constituting the generated word character string. That is, when the word included in the generated word character string is one word candidate among the plurality of generated word candidates, the fact that a similar word has been extracted is stored in the memory 13, and the generated word When the word included in the character string is a candidate for only one generated word, it is stored in the memory 13 that a similar word is not extracted.

ステップＣ７では、置き換え判定部２２ａは、特徴量の比較から、予め定める閾値以上の類似性が得られたか否かを判定する。ステップＣ６で計算した類似度が予め定める閾値以上であるとき、特徴量の比較から、予め定める閾値以上の類似性が得られたと判定し、ステップＣ４に進む。ステップＣ６で計算した類似度が予め定める閾値未満であるとき、特徴量の比較から、予め定める閾値以上の類似性が得られなかったと判定し、ステップＣ８に進む。 In step C7, the replacement determination unit 22a determines whether similarity equal to or higher than a predetermined threshold is obtained from the comparison of feature amounts. When the similarity calculated in step C6 is equal to or greater than a predetermined threshold, it is determined from the feature amount comparison that similarity equal to or greater than the predetermined threshold is obtained, and the process proceeds to step C4. When the degree of similarity calculated in step C6 is less than a predetermined threshold, it is determined from the feature amount comparison that no similarity equal to or higher than the predetermined threshold has been obtained, and the process proceeds to step C8.

予め定める閾値は、任意の値として設定可能であるが、本実施形態では、たとえば予め定める閾値は、第１候補の候補文字との類似度から「５」を減算した値である。図１６（ａ）に示した例では、第５文字目の第１候補の候補文字は「斛」であり、文字「斛」の類似度は、図１３（ａ）に「１１９」と示されているので、予め定める閾値は「１１４」になる。 Although the predetermined threshold value can be set as an arbitrary value, in the present embodiment, for example, the predetermined threshold value is a value obtained by subtracting “5” from the similarity with the candidate character of the first candidate. In the example shown in FIG. 16A, the candidate character of the first candidate for the fifth character is “斛”, and the similarity of the character “斛” is shown as “119” in FIG. Therefore, the predetermined threshold value is “114”.

ステップＣ８では、置き換え判定部２２ａは、対象としている単語について、ステップＣ２において類似単語が抽出されたか否かを判定する。対象としている単語、つまり単語文字列のうちステップＣ３で処理の対象としている単語について、メモリ１３に類似単語が抽出されたことが記憶されていると、ステップＣ９に進み、対象としている単語について、メモリ１３に類似単語が抽出されていないことが記憶されていると、ステップＣ４に進む。置き換え判定部２２ａは、ステップＣ４に進むとき、認識結果文字列のうち予め定める閾値以上の類似性が得られたと判定された文字について、置き換えると判定する。ステップＣ９に進むときは、認識結果文字列のうち予め定める閾値以上の類似性が得られなかったと判定された文字について、置き換えないと判定する。 In step C8, the replacement determination unit 22a determines whether or not a similar word has been extracted in step C2 for the target word. If it is stored in the memory 13 that similar words have been extracted for the word to be processed, that is, the word to be processed in step C3 in the word character string, the process proceeds to step C9. If it is stored in the memory 13 that no similar word has been extracted, the process proceeds to step C4. When the process proceeds to step C4, the replacement determination unit 22a determines to replace a character determined to have a similarity equal to or greater than a predetermined threshold in the recognition result character string. When the process proceeds to step C9, it is determined that the character that is determined not to have a similarity equal to or higher than a predetermined threshold in the recognition result character string is not replaced.

図１７に示したステップＣ１，Ｃ２は、単語列生成工程に相当し、図１７に示したステップＣ３，Ｃ５〜Ｃ８は、置き換え判定工程に相当し、図１７に示したステップＣ４，Ｃ９は、置き換え工程に相当する。 Steps C1 and C2 shown in FIG. 17 correspond to a word string generation process, steps C3 and C5 to C8 shown in FIG. 17 correspond to a replacement determination process, and steps C4 and C9 shown in FIG. This corresponds to the replacement process.

次に、本発明の第３の実施形態である情報処理装置１ｂについて説明する。本発明に係る情報処理方法は、情報処理装置１ｂによって実行される。情報処理装置１ｂの構成は、後処理部２０ｂを除き、図１に示した情報処理装置１の構成と同じであり、重複を避けるために、以下相違する部分についてのみ説明する。後処理部２０ｂが実行する処理を、以下第３の後処理という。後処理部２０ｂは、置き換え判定部２２ｂを除き、図１に示した後処理部２０の構成と同じであり、重複を避けるために、以下相違する部分についてのみ説明する。 Next, the information processing apparatus 1b which is the 3rd Embodiment of this invention is demonstrated. The information processing method according to the present invention is executed by the information processing apparatus 1b. The configuration of the information processing apparatus 1b is the same as the configuration of the information processing apparatus 1 shown in FIG. 1 except for the post-processing unit 20b, and only different parts will be described below in order to avoid duplication. The processing executed by the post-processing unit 20b is hereinafter referred to as third post-processing. The post-processing unit 20b is the same as the configuration of the post-processing unit 20 shown in FIG. 1 except for the replacement determination unit 22b, and only different parts will be described below in order to avoid duplication.

置き換え判定部２２ｂは、最初、第１の実施形態の置き換え判定部２２と同様に、単語文字列の単語を構成する文字と同じ文字が、認識結果文字列の対応する位置にある候補文字の中にあるか否かを判定する。単語文字列の単語を構成する文字と同じ文字が、認識結果文字列の対応する位置にある候補文字の中にすべてある場合、置き換え判定部２２は、単語を構成するすべての文字について、置き換えると判定する。 The replacement determination unit 22b, first, in the same way as the replacement determination unit 22 of the first embodiment, the same character as the character constituting the word of the word character string is among the candidate characters at the corresponding position of the recognition result character string. It is determined whether or not. When the same characters as the characters constituting the word in the word character string are all in the candidate characters at the corresponding positions in the recognition result character string, the replacement determination unit 22 replaces all the characters constituting the word. judge.

単語を構成する文字のうち認識結果文字列の対応する位置にある候補文字の中に同じ文字がない場合、置き換え判定部２２ｂは、第１の実施形態の置き換え判定部２２と同様に、単語を構成する文字のうち候補文字の中に同じ文字がない文字の特徴量と、認識結果文字列を構成する対応する候補文字、つまり第１候補の候補文字の特徴量とから、式（３）によって、これらの特徴量の類似度を計算する。そして、計算した類似度が予め定める閾値以上であるとき、単語を構成する文字のうち候補文字の中に同じ文字がない文字に対応する位置にある認識結果文字列の文字について、置き換えると判定する。 When there is no same character among the candidate characters at the corresponding position in the recognition result character string among the characters constituting the word, the replacement determination unit 22b determines the word as in the replacement determination unit 22 of the first embodiment. From the feature amount of characters that do not have the same character among the candidate characters and the corresponding candidate characters constituting the recognition result character string, that is, the feature amount of the candidate character of the first candidate, among the constituent characters, according to equation (3) The similarity between these feature quantities is calculated. Then, when the calculated similarity is equal to or greater than a predetermined threshold, it is determined to replace a character in the recognition result character string at a position corresponding to a character that does not have the same character among candidate characters among characters constituting the word. .

計算した類似度が予め定める閾値未満であるとき、置き換え判定部２２ｂは、第１の実施形態の置き換え判定部２２とは異なり、さらに、対象としている単語の文字数が予め定める基準文字数以上か否かを判定する。対象としている単語は、単語文字列を構成する単語のうち類似度を判定している文字を含む単語である。予め定める基準文字数以上でない場合、置き換えないと判定する。 When the calculated similarity is less than a predetermined threshold, the replacement determination unit 22b is different from the replacement determination unit 22 of the first embodiment, and whether or not the number of characters of the target word is equal to or greater than a predetermined reference character number. Determine. The target word is a word including a character whose similarity is determined among words constituting the word character string. If it is not equal to or more than the predetermined standard number of characters, it is determined not to be replaced.

予め定める基準文字数以上である場合、予め定める閾値を低めに設定し直し、類似度が設定し直した予め定める閾値以上であるとき、置き換えると判定する。これは、文字数が長い文字列の場合に、単語辞書２４に収録される単語と完全に一致もしくは部分一致して照合できるということは、単語文字列の単語として選択された単語が最適である確度が高いと考えられ、予め定める基準文字数以上の単語については、類似度の判定に使用する予め定める閾値を低めに設定しなおす。類似度が設定し直した予め定める閾値未満であるとき、置き換えないと判定する。 When the number of characters is equal to or greater than the predetermined reference character number, the predetermined threshold value is reset to a lower value, and when the similarity is equal to or higher than the predetermined threshold value that is reset, it is determined to be replaced. This is because, in the case of a character string having a long number of characters, the word selected in the word character string can be matched with the word recorded in the word dictionary 24 completely or partially. For words that are considered to be high and have a predetermined number of reference characters or more, a predetermined threshold value used for determining the similarity is set to a lower value. When the similarity is less than a preset threshold that has been reset, it is determined that the replacement is not performed.

図１８は、入力装置２から受信した画像データが表す画像６０の一例を示す図である。入力装置２から受信した画像データが表す画像６０は、文字列行が記載された原稿を入力装置２によって読み取った画像である。画像６０には、文字列行が表示されている。図１３に示した画像６０の例には、１行の文字列行「車庫証明の取得後、提出する必要がある。」が表示されている。 FIG. 18 is a diagram illustrating an example of an image 60 represented by image data received from the input device 2. An image 60 represented by image data received from the input device 2 is an image obtained by reading a document on which a character string row is written by the input device 2. In the image 60, a character string line is displayed. In the example of the image 60 shown in FIG. 13, one character string row “Need to submit after obtaining the garage certificate” is displayed.

以下、第３の実施形態の具体的な例示として、図１８に示した画像６０を用いて、情報処理装置１ｂの処理を説明する。 Hereinafter, as a specific example of the third embodiment, the processing of the information processing apparatus 1b will be described using the image 60 illustrated in FIG.

図１９は、文字列行切り出し部１５によって切り出された文字列行画像６１の一例を示す図である。文字列行切り出し部１５は、制御部１１が入出力制御部１２によって入力装置２から受信した画像データを制御部１１から受け取り、受け取った画像データが表す画像から文字列を表す文字列行画像を行ごとに切り出す。図１９に示した文字列行画像６１の例には、文字列行画像６１１の１つの文字列行画像が示されている。文字列行画像６１１は、「車庫証明の取得後、提出する必要がある。」という文字列が表示された画像である。 FIG. 19 is a diagram illustrating an example of the character string row image 61 cut out by the character string row cutout unit 15. The character string row cutout unit 15 receives the image data received from the input device 2 by the control unit 11 from the input device 2 by the input / output control unit 12, and generates a character string row image representing a character string from the image represented by the received image data. Cut out line by line. In the example of the character string row image 61 illustrated in FIG. 19, one character string row image of the character string row image 611 is illustrated. The character string row image 611 is an image in which a character string “It is necessary to submit after obtaining the garage certificate” is displayed.

図２０は、文字切り出し部１６によって切り出された文字画像６２の一例を示す図である。文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列行画像から、文字を１文字ずつ矩形画像として切り出す。図２０に示した文字画像６２の例には、１９文字の文字画像が示されている。最初の文字画像６２１は、「車」という文字を切り出した文字画像であり、第２番目以降に「庫」、「証」、「明」、「の」、「取」、「得」、「後」、「、」、「提」、「出」、「す」、「る」、「必」、「要」、「が」、「あ」、「る」および「。」の各文字の文字画像が示されている。 FIG. 20 is a diagram illustrating an example of a character image 62 cut out by the character cutout unit 16. The character cutout unit 16 cuts out characters one by one as a rectangular image from the character string row image cut out by the character string row cutout unit 15. In the example of the character image 62 shown in FIG. 20, a character image of 19 characters is shown. The first character image 621 is a character image obtained by cutting out the character “car”. The second and subsequent characters are “warehouse”, “proof”, “light”, “no”, “take”, “gain”, “ After, “,”, “proposed”, “out”, “su”, “ru”, “necessary”, “necessary”, “ga”, “a”, “ru” and “.” A character image is shown.

図２１は、候補文字６３および認識結果文字列６４の一例を示す図である。図２１（ａ）は、マッチング部１８によって抽出された候補文字６３の一例を示す。候補文字６３は、文字画像６２が表す文字として、マッチング部１８によって抽出された候補文字である。図２１（ａ）に示した候補文字５３の例では、最上段に入力文字、つまり候補文字の正解文字として、「車」、「庫」、「証」、「明」、「の」、「取」、「得」、「後」、「、」、「提」、「出」、「す」、「る」、「必」、「要」、「が」、「あ」、「る」および「。」が示されている。第１候補〜第５候補については、たとえば正解文字「車」については、第１候補「事１２３」、第２候補「専１２０」、第３候補「実１１９」、および第４候補「要１１８」が示され、第５候補は空白である。 FIG. 21 is a diagram illustrating an example of the candidate character 63 and the recognition result character string 64. FIG. 21A shows an example of the candidate character 63 extracted by the matching unit 18. The candidate character 63 is a candidate character extracted by the matching unit 18 as a character represented by the character image 62. In the example of the candidate character 53 shown in FIG. 21A, the input character at the top, that is, the correct character of the candidate character is “car”, “storage”, “proof”, “light”, “no”, “ `` Take '', `` gain '', `` after '', ``, '', `` suggest '', `` out '', `` su '', `` ru '', `` must '', `` must '', `` ga '', `` a '', `` ru '' And “.” Are shown. For the first to fifth candidates, for example, for the correct character “car”, the first candidate “Matter 123”, the second candidate “Exclusive 120”, the third candidate “Act 119”, and the fourth candidate “Required 118”. ”And the fifth candidate is blank.

図２１（ｂ）は、文字列生成部１９によって生成された認識結果文字列６４の一例を示す。文字列生成部１９は、マッチング部１８によって抽出された候補文字６３に示される候補文字のうち、第１候補の文字を左から順に並べた認識結果文字列６４を生成する。文字列生成部１９は、後処理部２０ｂのために、候補文字６３をメモリ１３に記憶しておく。図２１（ｂ）に示した例では、認識結果文字列６４として「事庫証明の取得後、提出する必要がある。」なる文字列が示されている。後処理部２０ｂによる処理を行わない場合は、この認識結果文字列６４を、認識結果として出力することになる。本実施形態では、この後、後処理部２０ｂによって第３の後処理が実行される。 FIG. 21B shows an example of the recognition result character string 64 generated by the character string generation unit 19. The character string generation unit 19 generates a recognition result character string 64 in which the first candidate characters are arranged in order from the left among the candidate characters indicated by the candidate character 63 extracted by the matching unit 18. The character string generation unit 19 stores the candidate character 63 in the memory 13 for the post-processing unit 20b. In the example shown in FIG. 21B, a character string “It is necessary to submit after obtaining the case certificate” is shown as the recognition result character string 64. When the processing by the post-processing unit 20b is not performed, the recognition result character string 64 is output as a recognition result. In the present embodiment, thereafter, the third post-processing is executed by the post-processing unit 20b.

第３の後処理では、まず、単語列生成部２１は、第１の実施形態と同様に、認識結果文字列の第１文字目から順に、各候補文字について、単語辞書２４に収録されている単語に完全に一致する文字列、もしくは部分的に一致する文字列を単語の候補として生成する。このようにして生成された単語文字列の例を、図２２に示す。 In the third post-processing, first, the word string generation unit 21 records each candidate character in the word dictionary 24 in order from the first character of the recognition result character string, as in the first embodiment. A character string that completely matches or partially matches a word is generated as a word candidate. An example of the word character string generated in this way is shown in FIG.

図２２は、単語列生成部２１によって生成された単語文字列６５の一例を示す図である。図２２に示した単語文字列６５の例には、最適な単語たとえば単語６５１からなる単語文字列が示されている。単語文字列６５には、複数の単語の候補は示されていない。単語列生成部２１は、図２１（ｂ）に示した認識結果文字列６４のうち第１文字目から第４文字目の文字列「事庫証明」について、単語辞書２４に「車庫証明」という単語が収録されていれば、この「車庫証明」は「事庫証明」とは第１文字目の「車」と「事」とのみが異なる部分的に一致する文字列だと判断し、「事庫証明」に対応する単語として「車庫証明」を抽出する。他にも「事故」あるいは「車庫」といった単語を抽出することができるが、どれも完全一致ではなく、また「車庫証明」よりも文字列長が短いので、最終的には、文字列長の最も長い「車庫証明」のみを抽出する。「車庫証明」は部分一致の単語として抽出される。 FIG. 22 is a diagram illustrating an example of the word character string 65 generated by the word string generation unit 21. In the example of the word character string 65 shown in FIG. 22, a word character string made up of an optimum word, for example, the word 651 is shown. The word character string 65 does not show a plurality of word candidates. The word string generation unit 21 refers to the word dictionary 24 as “garage proof” for the character string “case certificate” of the first to fourth characters in the recognition result character string 64 shown in FIG. If the word is recorded, this “garage proof” is determined to be a partially matching character string in which only the first characters “car” and “thing” differ from the “case proof”. The “garage certificate” is extracted as a word corresponding to the “case certificate”. Other words such as “accident” or “garage” can be extracted, but none of them are exact matches, and the string length is shorter than “garage proof”. Only the longest “garage certificate” is extracted. The “garage proof” is extracted as a partially matched word.

図２３は、認識結果文字列と単語文字列との比較例６６を示す図である。比較例６６は、上段が文字列生成部１９によって生成された認識結果文字列であり、下段が単語列生成部２１によって生成された単語文字列である。図２３に示した下段の単語文字列は、図２２に示した単語文字列６５で最適な単語として選択された単語からなる文字列である。比較例６６は、単語文字列の各々の単語を構成する文字数で区切った矩形によって、認識結果文字列と単語文字列とを区切って示している。 FIG. 23 is a diagram illustrating a comparative example 66 of the recognition result character string and the word character string. In the comparative example 66, the upper row is a recognition result character string generated by the character string generator 19, and the lower row is a word character string generated by the word string generator 21. The lower word character string shown in FIG. 23 is a character string made up of words selected as optimum words in the word character string 65 shown in FIG. In the comparative example 66, the recognition result character string and the word character string are separated by a rectangle divided by the number of characters constituting each word of the word character string.

続いて、置き換え判定部２２ｂは、文字列生成部１９で生成された認識結果文字列と単語列生成部２１により生成された単語文字列とを、単語ごとに先頭文字から一文字ずつ比較して、置き換えるか否かを判定する。置き換えるか否かの判定は、以下を除き、第１の実施形態と同じである。 Subsequently, the replacement determination unit 22b compares the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 one by one from the first character for each word, Determine whether to replace. The determination as to whether or not to replace is the same as in the first embodiment except for the following.

第１の実施形態では、置き換え判定部２２は、計算した類似度が予め定める閾値未満であるとき、単語を構成する文字のうち候補文字の中に同じ文字がない文字について、置き換えないと判定したが、第３の実施形態では、置き換え判定部２２ｂは、計算した類似度が予め定める閾値未満であるとき、さらに、対象としている単語の文字数が予め定める基準文字数以上か否かを判定する。置き換え判定部２２ｂは、対象としている単語の文字数が予め定める基準文字数以上でない場合、置き換えないと判定する。対象としている単語の文字数が予め定める基準文字数以上である場合、予め定める閾値を低めに設定し直し、類似度が設定し直した予め定める閾値以上であるとき、置き換えると判定し、類似度が設定し直した予め定める閾値以上でないとき、置き換えないと判定する。 In the first embodiment, when the calculated similarity is less than a predetermined threshold, the replacement determination unit 22 determines that a character that does not have the same character among the candidate characters among the characters constituting the word is not replaced. However, in the third embodiment, when the calculated similarity is less than a predetermined threshold, the replacement determination unit 22b further determines whether or not the number of characters of the target word is greater than or equal to a predetermined reference character number. The replacement determination unit 22b determines that the replacement is not performed when the number of characters of the target word is not greater than or equal to a predetermined reference number of characters. If the number of characters of the target word is equal to or greater than the predetermined reference character number, the predetermined threshold is reset to a lower value, and when the similarity is equal to or higher than the predetermined threshold that has been reset, it is determined to replace and the similarity is set. If it is not equal to or more than the predetermined threshold value, it is determined not to be replaced.

たとえば図２３に示した例について、置き換え判定部２２ｂによる置き換えるか否かの判定を説明する。図２３に示した単語文字列の単語「車庫証明」は、図２３に示した認識結果文字列の単語「事庫証明」に対応し、前から順番に文字を１文字ずつ比較していくと、第１文字目が異なる。置き換え判定部２２ｂは、単語辞書２４から得られた「車」という文字が、認識結果文字列の対応する位置にある候補文字に存在するかをチェックする。この実施形態では、認識結果文字列と単語文字列とを前から順番に比較するが、後ろから順番に比較するような構成でもよい。 For example, with respect to the example shown in FIG. 23, the determination as to whether or not to replace by the replacement determination unit 22b will be described. The word “garage proof” of the word character string shown in FIG. 23 corresponds to the word “case proof” of the recognition result character string shown in FIG. 23, and the characters are compared one by one in order from the front. The first character is different. The replacement determination unit 22b checks whether the character “car” obtained from the word dictionary 24 exists in the candidate character at the corresponding position in the recognition result character string. In this embodiment, the recognition result character string and the word character string are compared in order from the front, but may be configured to compare in order from the back.

図２１（ａ）に示した候補文字６３には、認識結果文字列の第１文字目の第１候補の候補文字「事」が示され、第２候補以下の候補文字以降にも「車」いう文字は存在しない。単語文字列の単語「車庫証明」の「車」が認識結果文字列の候補文字に含まれていない、つまり存在しないので、置き換え判定部２２ｂは、類似度による置き換えるか否かの判定を行う。このとき、仮に、単語文字列の単語「車庫証明」のすべての文字が認識結果文字列の候補文字に含まれている場合は、置き換え判定部２２ｂは、置き換えると判定する。 In the candidate character 63 shown in FIG. 21A, the candidate character “thing” of the first candidate of the first character of the recognition result character string is shown, and “car” after the candidate character below the second candidate. There is no such character. Since the “car” of the word “garage proof” in the word character string is not included in the candidate characters of the recognition result character string, that is, does not exist, the replacement determination unit 22b determines whether or not to replace based on the similarity. At this time, if all characters of the word “garage certificate” in the word character string are included in the candidate characters of the recognition result character string, the replacement determination unit 22b determines to replace.

置き換え判定部２２ｂは、認識結果文字列中の文字と単語文字列中の文字とに異なる文字があった場合、認識結果文字列中の文字の特徴量と単語文字列中の文字の特徴量とを比較して、類似度を抽出する。たとえば、図２３に示した２点鎖線６６１によって囲んだ文字「事」と文字「車」とが異なるので、文字認識辞書１４に収録されている文字「車」の特徴量と、認識結果文字列の第１文字目の文字画像から抽出された特徴量とを比較して類似度を抽出する。認識結果文字列の第１文字目の文字画像から抽出された特徴量、つまり図２０に示した文字画像６２のうち第１文字目の文字画像６２１から抽出された特徴量は、メモリ１３に記憶されている。 When there is a different character between the character in the recognition result character string and the character in the word character string, the replacement determination unit 22b determines the character feature amount in the recognition result character string and the character feature amount in the word character string. Are compared to extract the degree of similarity. For example, since the character “thing” and the character “car” enclosed by the two-dot chain line 661 shown in FIG. 23 are different, the feature amount of the character “car” recorded in the character recognition dictionary 14 and the recognition result character string The similarity is extracted by comparing with the feature amount extracted from the character image of the first character. The feature amount extracted from the character image of the first character of the recognition result character string, that is, the feature amount extracted from the character image 621 of the first character in the character image 62 shown in FIG. Has been.

図２４は、単語文字列の文字の特徴量および対応する位置にある候補文字の特徴量の一例を示す図である。図２４（ａ）は、候補文字「事」の特徴量を示す特徴データ３９ｄである。候補文字「事」は、図２０に示した文字画像６２のうち第１文字目の文字画像６２１が表す文字について、マッチング部１８によって第１候補の候補文字「事」として抽出された文字である。図２４（ｂ）は、文字認識辞書１４に収録されている文字「車」の特徴量を示す特徴データ３２ｄである。式（３）による実際の計算は省略するが、候補文字「事」の特徴量と文字認識辞書１４に収録されている文字「車」の特徴量との類似度は、「１１６」である。ここに、類似度の最大値は１２８である。 FIG. 24 is a diagram illustrating an example of character feature amounts of word character strings and feature amounts of candidate characters at corresponding positions. FIG. 24A shows feature data 39d indicating the feature amount of the candidate character “thing”. The candidate character “thing” is a character extracted by the matching unit 18 as the first candidate candidate character “thing” for the character represented by the first character image 621 in the character image 62 shown in FIG. . FIG. 24B shows feature data 32 d indicating the feature amount of the character “car” recorded in the character recognition dictionary 14. Although the actual calculation by Expression (3) is omitted, the similarity between the feature amount of the candidate character “thing” and the feature amount of the character “car” recorded in the character recognition dictionary 14 is “116”. Here, the maximum value of the similarity is 128.

予め定める閾値は、任意の値として設定可能であるが、たとえば予め定める閾値を第１候補の候補文字との類似度の差が５以内であるとする場合、第１文字目の第１候補の候補文字は「事」であり、文字「事」の類似度は、図２１（ａ）に「１２３」と示されているので、予め定める閾値は「１２３」から「５」を減算した「１１８」になる。第１文字目に対応する文字画像と文字「車」との類似度「１１６」は、予め定める閾値「１１８」未満であり、文字数の判定が行われる。予め定める閾値以上の類似度である場合は、置き換えると判定される。 The predetermined threshold can be set as an arbitrary value. For example, when the predetermined threshold is a difference in similarity between the first candidate candidate character and the difference is within 5, the first candidate of the first character The candidate character is “Thing”, and the similarity of the character “Thing” is indicated as “123” in FIG. 21A. Therefore, the predetermined threshold is “118” obtained by subtracting “5” from “123”. "become. The similarity “116” between the character image corresponding to the first character and the character “car” is less than a predetermined threshold “118”, and the number of characters is determined. If the degree of similarity is equal to or greater than a predetermined threshold, it is determined to replace.

文字数の判定では、置き換え判定部２２ｂは、対象としている単語の文字数が予め定める基準文字数以上であるか否かを判定する。対象としている単語の文字数が予め定める基準文字数以上でない場合、置き換え判定部２２ｂは、置き換えないと判定する。文字数が長い文字列の場合、単語辞書２４に収録された単語と完全に一致もしくは部分一致して照合することができるということは、抽出した単語が最適である確度が高いと考えられ、予め定める基準文字数以上の単語については、類似度の判定に使用する予め定める閾値を低めに設定し直す。 In the determination of the number of characters, the replacement determination unit 22b determines whether or not the number of characters of the target word is greater than or equal to a predetermined reference number of characters. If the number of characters of the target word is not equal to or greater than the predetermined number of characters, the replacement determination unit 22b determines not to replace. In the case of a character string having a long number of characters, the fact that it can be matched completely or partially with a word recorded in the word dictionary 24 is considered to have a high probability that the extracted word is optimal, and is determined in advance. For words that exceed the number of reference characters, the predetermined threshold value used for similarity determination is reset to a lower value.

予め定める基準文字数は、たとえば「４」であり、設定し直した予め定める閾値は、第１候補の候補文字の類似度から「１０」を減算した値、あるいは最大類似度＝１２８の８割、たとえば１０２とする。置き換え判定部２２ｂは、類似度が設定し直した予め定める閾値以上であると、置き換えると判定し、類似度が設定し直した予め定める閾値未満であると、置き換えないと判定する。 The predetermined reference character number is, for example, “4”, and the predetermined threshold value that is reset is a value obtained by subtracting “10” from the similarity of the candidate character of the first candidate, or 80% of the maximum similarity = 128, For example, 102. The replacement determination unit 22b determines that the replacement is performed when the similarity is equal to or higher than a predetermined threshold that has been reset, and determines that the replacement is not performed when the similarity is less than the predetermined threshold that has been reset.

図２３に示した第１文字目の例では、設定し直した予め定める閾値を、第１候補の候補文字の類似度から「１０」を減算した値とする場合、設定し直した予め定める閾値は「１１３」であり、第１文字目に対応する文字画像と文字「車」との類似度「１１６」は、設定し直した予め定める閾値以上であり、置き換え判定部２２ｂは、置き換えると判定する。 In the example of the first character shown in FIG. 23, when the preset threshold value that has been reset is a value obtained by subtracting “10” from the similarity of the candidate character of the first candidate, the preset threshold value that has been reset Is “113”, and the similarity “116” between the character image corresponding to the first character and the character “car” is equal to or higher than the preset threshold value, and the replacement determination unit 22b determines to replace To do.

第３の後処理では、最後に、置き換え部２３は、第１の実施形態と同様に、認識結果文字列のうち置き換えると判定された文字について、認識結果文字列の文字を単語文字列の文字で置き換え、置き換えた認識結果文字列を出力装置３に送信して出力させる。認識結果文字列のすべての文字について置き換えないと判定されたときは、置き換えを行うことなく、認識結果文字列を出力装置３に送信して出力させる。 In the third post-processing, finally, the replacement unit 23 replaces the characters of the recognition result character string with the characters of the word character string for the characters determined to be replaced among the recognition result character strings, as in the first embodiment. And the replaced recognition result character string is transmitted to the output device 3 for output. When it is determined not to replace all the characters in the recognition result character string, the recognition result character string is transmitted to the output device 3 and output without replacement.

図２３に示した認識結果文字列のうち単語「事庫証明」の「事」が、置き換えると判定されるので、置き換え部２３は、認識結果文字列の単語「事庫証明」の「事」を単語文字列の「車庫証明」の「車」で置き換え、置き換えた単語「車庫証明」を出力装置３に送信して出力させる。すなわち、認識結果文字列を構成する単語のうち誤認識された単語「事庫証明」の「事」を、単語文字列の「車庫証明」の「車」でコレクト、つまり修正して出力することができる。したがって、情報処理装置１ｂは、「車庫証明の取得後、提出する必要がある。」という文字列を認識結果として出力することができる。 Since it is determined that the “thing” of the word “case certificate” in the recognition result character string illustrated in FIG. 23 is to be replaced, the replacement unit 23 performs “thing” of the word “case certificate” of the recognition result character string. Is replaced with "car" in the word character string "garage certificate", and the replaced word "garage certificate" is transmitted to the output device 3 for output. That is, the word “thing” of the misrecognized word “case certificate” of the words constituting the recognition result character string is collected by the “car” of the word string “garage certificate”, that is, corrected and output. Can do. Therefore, the information processing apparatus 1b can output the character string “It is necessary to submit after obtaining the garage certificate” as the recognition result.

図２５は、後処理部２０ｂが実行する第３の後処理の処理手順を示すフローチャートである。情報処理装置１ｂ、入力装置２および出力装置３が実行する文字認識処理は、情報処理装置１、入力装置２および出力装置３が実行する文字認識処理と同じであり、重複を避けるために説明は省略する。第３の実施形態では、図１０に示したステップＡ７が実行されると、ステップＤ１に移る。ステップＤ１〜Ｄ５，Ｄ１０は、それぞれ図１１に示したステップＢ１〜Ｂ５，Ｂ７と同じであり、説明は省略する。 FIG. 25 is a flowchart illustrating a processing procedure of third post-processing executed by the post-processing unit 20b. The character recognition processing executed by the information processing device 1b, the input device 2 and the output device 3 is the same as the character recognition processing executed by the information processing device 1, the input device 2 and the output device 3, and will not be described in order to avoid duplication. Omitted. In the third embodiment, when step A7 shown in FIG. 10 is executed, the process proceeds to step D1. Steps D1 to D5 and D10 are the same as steps B1 to B5 and B7 shown in FIG.

ステップＤ６では、置き換え判定部２２ｂは、特徴量の比較から、予め定める閾値以上の類似性が得られたか否かを判定する。ステップＤ５で計算した類似度が予め定める閾値以上であるとき、特徴量の比較から、予め定める閾値以上の類似性が得られたと判定し、ステップＤ３に進む。ステップＤ５で計算した類似度が予め定める閾値未満であるとき、特徴量の比較から、予め定める閾値以上の類似性が得られなかったと判定し、ステップＤ７に進む。 In step D6, the replacement determination unit 22b determines whether similarity equal to or greater than a predetermined threshold is obtained from the comparison of feature amounts. When the similarity calculated in step D5 is equal to or greater than a predetermined threshold, it is determined from the feature amount comparison that similarity equal to or greater than the predetermined threshold is obtained, and the process proceeds to step D3. When the degree of similarity calculated in step D5 is less than a predetermined threshold, it is determined from the feature amount comparison that no similarity equal to or higher than the predetermined threshold has been obtained, and the process proceeds to step D7.

ステップＤ７では、置き換え判定部２２ｂは、対象としている単語の文字数が予め定める基準文字数Ｍ以上であるか否かを判定する。対象としている単語の文字数が予め定める基準文字数Ｍ以上であるとき、ステップＤ８に進み、対象としている単語の文字数が予め定める基準文字数Ｍ未満であるとき、ステップＤ１０に進む。置き換え判定部２２ｂは、ステップＤ１０に進むとき、置き換えないと判定する。ステップＤ８では、置き換え判定部２２ｂは、予め定める閾値を低い値に再設定する。 In step D7, the replacement determination unit 22b determines whether or not the number of characters of the target word is equal to or greater than a predetermined reference number M. When the number of characters of the target word is greater than or equal to the predetermined reference character number M, the process proceeds to step D8, and when the number of characters of the target word is less than the predetermined reference character number M, the process proceeds to step D10. The replacement determination unit 22b determines not to replace when proceeding to step D10. In step D8, the replacement determination unit 22b resets the predetermined threshold value to a low value.

ステップＤ９では、置き換え判定部２２ｂは、類似度が再設定した予め定める閾値以上であるか否かを判定する。類似度が再設定した予め定める閾値以上であるとき、ステップＤ３に進み、類似度が再設定した予め定める閾値未満であるとき、ステップＤ１０に進む。置き換え判定部２２ｂは、ステップＤ３に進むとき、置き換えると判定し、ステップＤ１０に進むとき、置き換えないと判定する。 In step D9, the replacement determination unit 22b determines whether or not the similarity is equal to or higher than a preset threshold value. When the similarity is equal to or higher than the preset predetermined threshold, the process proceeds to step D3, and when the similarity is less than the preset predetermined threshold, the process proceeds to step D10. The replacement determination unit 22b determines to replace when proceeding to step D3, and determines not to replace when proceeding to step D10.

図２５に示したステップＤ１は、単語列生成工程に相当し、図２５に示したステップＤ２，Ｄ４〜Ｄ９は、置き換え判定工程に相当し、図２５に示したステップＤ３，Ｄ１０は、置き換え工程に相当する。 Step D1 shown in FIG. 25 corresponds to a word string generation process, steps D2 and D4 to D9 shown in FIG. 25 correspond to a replacement determination process, and steps D3 and D10 shown in FIG. It corresponds to.

このように、上述した第１〜３の実施形態では、文字認識辞書１４および単語辞書２４以外の辞書、たとえば類似性文字辞書などの他の辞書を用いることなく、簡単な処理で、認識文字列内の誤りを正しい文字に訂正することができる。文字認識辞書１４および単語辞書２４以外の辞書を必要としないので、記憶容量の増加を回避することができる。 Thus, in the first to third embodiments described above, a recognized character string can be obtained by a simple process without using a dictionary other than the character recognition dictionary 14 and the word dictionary 24, for example, another dictionary such as a similarity character dictionary. The error in can be corrected to the correct character. Since no dictionary other than the character recognition dictionary 14 and the word dictionary 24 is required, an increase in storage capacity can be avoided.

また、上述した第１〜３の実施形態では、認識結果文字列中の文字と単語文字列中の文字とが異なる場合に、異なっている文字について、認識結果文字列中の対応する文字番目の文字画像から抽出される特徴データと、単語文字列中の文字に対応する文字認識辞書内の特徴データとから算出される類似度によって、認識結果文字列中の文字を単語文字列中の文字で置き換えるか否かを判定する構成としたが、これに限定されるものではない。たとえば、認識結果文字列中の第１候補の候補文字に対応する文字認識辞書１４に収録される文字の特徴データと、単語文字列中の文字に対応する文字認識辞書１４に収録される文字の特徴データとから算出される類似度によって、認識結果文字列中の文字を単語文字列中の文字で置き換える構成とすることもできる。 In the first to third embodiments described above, when the character in the recognition result character string is different from the character in the word character string, the corresponding character number in the recognition result character string is changed for the different character. Depending on the similarity calculated from the feature data extracted from the character image and the feature data in the character recognition dictionary corresponding to the character in the word character string, the character in the recognition result character string is replaced with the character in the word character string. Although it was set as the structure which determines whether it replaces, it is not limited to this. For example, the character feature data recorded in the character recognition dictionary 14 corresponding to the first candidate candidate character in the recognition result character string, and the character recorded in the character recognition dictionary 14 corresponding to the character in the word character string. Depending on the similarity calculated from the feature data, a character in the recognition result character string may be replaced with a character in the word character string.

また、上述した第１〜３の実施形態では、文字の特徴量を比較して、文字同士がどれだけ似ているかを数値化した類似度として、単純類似度を用いたが、これに限定されるものではなく、たとえば単純類似度からさらに計算することによって算出される複合類似度を用いることも可能である。 In the first to third embodiments described above, simple similarity is used as the similarity obtained by comparing the character feature amounts and quantifying how much the characters are similar to each other. However, the present invention is not limited to this. For example, it is also possible to use composite similarity calculated by further calculating from simple similarity.

また、上述した第１〜３の実施形態では、文章が記載された原稿を読込み、読込んだ原稿の画像にあらわされる文章の文字列を単語に分割して、コレクト処理を行う例を示したが、これに限定されるものではない。たとえば、住所を記載した文字列を読込み、単語辞書を住所辞書に置き換えてコレクト処理を行う際にも、適用可能であり、誤認識した住所を正しい住所にコレクト、つまり修正することが可能である。単語辞書を住所辞書に置き換えて、住所の文字列に適用してコレクト処理を行う場合にも、類似性文字辞書などの他の辞書を必要としないので、メモリ１３の追加容量を必要とせず、また簡単な方法で正しいコレクト処理を行うことができる。 In the first to third embodiments described above, an example in which a document in which a document is written is read, a character string of the document represented in the image of the read document is divided into words, and collect processing is performed is shown. However, the present invention is not limited to this. For example, it is also applicable when reading a character string describing an address and replacing the word dictionary with an address dictionary to perform a collect process. It is possible to correct a misrecognized address to a correct address, that is, to correct it. . Even when the word dictionary is replaced with the address dictionary and applied to the address character string to perform the collect process, no other dictionary such as a similarity character dictionary is required, so that an additional capacity of the memory 13 is not required. In addition, correct collection processing can be performed by a simple method.

上述した実施の形態では、入出力制御部１２を制御するとともに、文字列行切り出し部１５、文字切り出し部１６、特徴抽出部１７、マッチング部１８、文字列生成部１９および後処理部２０，２０ａ，２０ｂなどの機能を実現するプログラムは、情報処理装置１，１ａ，１ｂのメモリ１３などの記憶装置に記憶されるが、この記憶装置に限定されるものではなく、コンピュータで読取り可能な記録媒体に記録されていてもよい。記録媒体は、たとえば図示しない外部記憶装置としてプログラム読取装置を情報処理装置１，１ａ，１ｂに設け、そこに記録媒体を挿入することによって読取り可能な記録媒体であってもよいし、あるいは他の装置の記憶装置であってもよい。 In the above-described embodiment, the input / output control unit 12 is controlled, and the character string row cutout unit 15, the character cutout unit 16, the feature extraction unit 17, the matching unit 18, the character string generation unit 19, and the post-processing units 20 and 20a. , 20b and the like are stored in a storage device such as the memory 13 of the information processing apparatus 1, 1a, 1b, but are not limited to this storage device, and are a computer-readable recording medium May be recorded. The recording medium may be, for example, a recording medium that is readable by providing a program reading device as an external storage device (not shown) in the information processing apparatus 1, 1a, 1b and inserting the recording medium therein. It may be a storage device of the device.

いずれの記録媒体であっても、記憶されているプログラムがコンピュータからアクセスされて実行される構成であればよい。あるいはいずれの記録媒体であっても、プログラムが読み出され、読み出されたプログラムが、記憶装置のプログラム記憶エリアに記憶されて、そのプログラムが実行される構成であってもよい。 Any recording medium may be used as long as the stored program is accessed from a computer and executed. Alternatively, any recording medium may be configured such that the program is read, the read program is stored in the program storage area of the storage device, and the program is executed.

情報処理装置１，１ａ，１ｂと分離可能に構成される記録媒体は、たとえば磁気テープ／カセットテープなどのテープ系の記録媒体、フレキシブルディスク／ハードディスクなどの磁気ディスクもしくはＣＤ−ＲＯＭ（Compact Disk Read Only Memory）／ＭＯ（
Magneto Optical disk）／ＭＤ（Mini Disc）／ＤＶＤ（Digital Versatile Disk）／ＣＤ−Ｒ（Compact Disk Recordable）／ブルーレイディスクなどの光ディスクのディスク系の記録媒体、ＩＣ（Integrated Circuit）カード（メモリカードを含む）／光カードなどのカード系の記録媒体、またはマスクＲＯＭ／ＥＰＲＯＭ（Erasable Programmable
Read Only Memory）／ＥＥＰＲＯＭ（Electrically Erasable Programmable Read Only
Memory）／フラッシュＲＯＭなどの半導体メモリを含む固定的にプログラムを担持する記録媒体であってもよい。 The recording medium configured to be separable from the information processing apparatuses 1, 1a, 1b is, for example, a tape-type recording medium such as a magnetic tape / cassette tape, a magnetic disk such as a flexible disk / hard disk, or a CD-ROM (Compact Disk Read Only). Memory) / MO (
Magnet-type optical disks (MDs) / MDs (Mini Discs) / DVDs (Digital Versatile Disks) / CD-Rs (Compact Disk Recordables) / Blu-rays and other optical discs, IC (Integrated Circuit) cards (including memory cards) ) / Card-type recording media such as optical cards, or mask ROM / EPROM (Erasable Programmable)
Read Only Memory) / EEPROM (Electrically Erasable Programmable Read Only
Memory) / a recording medium that carries a fixed program including a semiconductor memory such as a flash ROM.

また、情報処理装置１，１ａ，１ｂを通信ネットワークと接続可能に構成し、通信ネットワークを介して上記プログラムを供給してもよい。この通信ネットワークとしては、特に限定されず、たとえば、インターネット、イントラネット、エキストラネット、ＬＡＮ（Local Area Network）、ＩＳＤＮ（Integrated Services Digital Network）、ＶＡＮ（Value Added Network）、ＣＡＴＶ（Community Antenna Television）通信網、仮想専用網（Virtual Private Network）、電話回線網、移動体通信網、または衛星通信網など通信ネットワークが利用可能である。また、通信ネットワークを構成する伝送媒体としては、特に限定されず、たとえば、ＩＥＥＥ１３９４、ＵＳＢ（Universal Serial Bus）、電力線搬送、ケーブルＴＶ回線、電話線、ＡＤＳＬ（Asymmetric Digital Subscriber
Line）回線等の有線でも、ＩｒＤＡ（Infrared Data Association）あるいはリモートコントロールで用いられる赤外線、Ｂｌｕｅｔｏｏｔｈ（登録商標）、８０２．１１無線、ＨＤＲ（High Data Rate）、携帯電話網、衛星回線、地上波デジタル網などの無線でも利用可能である。なお、本発明は、上記プログラムが電子的な伝送で具現化された、搬送波に埋め込まれたコンピュータデータ信号の形態でも実現され得る。 The information processing apparatuses 1, 1a, and 1b may be configured to be connectable to a communication network, and the program may be supplied via the communication network. The communication network is not particularly limited. For example, the Internet, intranet, extranet, LAN (Local Area Network), ISDN (Integrated Services Digital Network), VAN (Value Added Network), CATV (Community Antenna Television) communication network. A communication network such as a virtual private network, a telephone line network, a mobile communication network, or a satellite communication network can be used. Also, the transmission medium constituting the communication network is not particularly limited. For example, IEEE1394, USB (Universal Serial Bus), power line carrier, cable TV line, telephone line, ADSL (Asymmetric Digital Subscriber)
Line (wire) lines, etc., IrDA (Infrared Data Association) or infrared light used for remote control, Bluetooth (registered trademark), 802.11 wireless, HDR (High Data Rate), mobile phone network, satellite line, terrestrial digital It can also be used wirelessly such as on the network. The present invention can also be realized in the form of a computer data signal embedded in a carrier wave in which the program is embodied by electronic transmission.

このように、メモリ１３は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書１４、および少なくとも１つの文字からなる単語を収録する単語辞書２４を記憶する。文字列行切り出し部１５は、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。特徴抽出部１７は、文字切り出し部１６によって切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。マッチング部１８は、特徴抽出部１７によって抽出された特徴量とメモリ１３に記憶される文字認識辞書１４に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書１４から少なくとも１つ選択する。文字列生成部１９は、マッチング部１８によって選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。単語列生成部２１は、文字列生成部１９によって生成された認識結果文字列を構成する候補文字と、メモリ１３に記憶される単語辞書２４に収録された単語とを照合し、単語辞書２４に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。置き換え判定部２２，２２ａ，２２ｂは、文字列生成部１９によって生成された認識結果文字列と単語列生成部２１によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出部１７によって抽出された特徴量と、メモリ１３に記憶される文字認識辞書１４に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、置き換え部２３は、置き換え判定部２２，２２ａ，２２ｂによって置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 As described above, the memory 13 stores the character recognition dictionary 14 that records the character and the feature value obtained by quantifying the feature of the character in association with each other, and the word dictionary 24 that records the word composed of at least one character. The character string line cutout unit 15 cuts out a character string image representing a character string for each line from an image of a document on which a character string line that is a character string is described. The character cutout unit 16 cuts out character images representing each character one by one from the character string image cut out by the character string row cutout unit 15. The feature extraction unit 17 extracts the feature amount of the character represented by each character image from the character image cut out by the character cutout unit 16. The matching unit 18 compares the feature amount extracted by the feature extraction unit 17 with the feature amount recorded in the character recognition dictionary 14 stored in the memory 13, and for each character represented by the character image, the character represented by the character image. At least one candidate character that can be recognized as is selected from the character recognition dictionary 14. The character string generation unit 19 generates a recognition result character string by combining candidate characters for each character image selected by the matching unit 18. The word string generation unit 21 collates the candidate characters constituting the recognition result character string generated by the character string generation unit 19 with the words recorded in the word dictionary 24 stored in the memory 13, and stores them in the word dictionary 24. A character string in which all the characters match the recorded word or a word in which only a part of the characters match is extracted, and a word character string including the extracted words is generated. The replacement determination units 22, 22 a, and 22 b differ by comparing the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 for each character at a corresponding position. When there is a character, the different character matches the character of the word character string among the feature amount extracted by the feature extraction unit 17 and the character feature amount recorded in the character recognition dictionary 14 stored in the memory 13. It compares with the feature-value memorize | stored matched with a character, and when a comparison result satisfy | fills the comparison criterion defined beforehand, it determines with replacing the character in a recognition result character string with the corresponding character in a word character string. Then, the replacement unit 23 replaces the character in the recognition result character string determined to be replaced by the replacement determination unit 22, 22a, 22b with the character at the corresponding position in the word character string.

したがって、文字認識辞書１４および単語辞書２４以外の辞書を用いることなく、認識した文字の誤りを正しい文字に訂正することができる。文字認識辞書１４および単語辞書２４以外の辞書を必要としないので、記憶容量の増加を回避することができる。 Therefore, it is possible to correct a recognized character error to a correct character without using a dictionary other than the character recognition dictionary 14 and the word dictionary 24. Since no dictionary other than the character recognition dictionary 14 and the word dictionary 24 is required, an increase in storage capacity can be avoided.

さらに、メモリ１３は、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書１４、および少なくとも１つの文字からなる単語を収録する単語辞書２４を記憶する。文字列行切り出し部１５は、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。文字切り出し部１６は、文字列行切り出し部１５によって切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。特徴抽出部１７は、文字切り出し部１６によって切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。マッチング部１８は、特徴抽出部１７によって抽出された特徴量とメモリ１３に記憶される文字認識辞書１４に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書１４から少なくとも１つ選択する。文字列生成部１９は、マッチング部１８によって選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。単語列生成部２１は、文字列生成部１９によって生成された認識結果文字列を構成する候補文字と、メモリ１３に記憶される単語辞書２４に収録された単語とを照合し、単語辞書２４に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。置き換え判定部２２，２２ａ，２２ｂは、文字列生成部１９によって生成された認識結果文字列と単語列生成部２１によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、メモリ１３に記憶される文字認識辞書１４に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、メモリ１３に記憶される文字認識辞書１４に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、置き換え部２３は、置き換え判定部２２，２２ａ，２２ｂによって置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 Furthermore, the memory 13 stores a character recognition dictionary 14 that records characters and feature values obtained by quantifying the features of the characters in association with each other, and a word dictionary 24 that records words composed of at least one character. The character string line cutout unit 15 cuts out a character string image representing a character string for each line from an image of a document on which a character string line that is a character string is described. The character cutout unit 16 cuts out character images representing each character one by one from the character string image cut out by the character string row cutout unit 15. The feature extraction unit 17 extracts the feature amount of the character represented by each character image from the character image cut out by the character cutout unit 16. The matching unit 18 compares the feature amount extracted by the feature extraction unit 17 with the feature amount recorded in the character recognition dictionary 14 stored in the memory 13, and for each character represented by the character image, the character represented by the character image. At least one candidate character that can be recognized as is selected from the character recognition dictionary 14. The character string generation unit 19 generates a recognition result character string by combining candidate characters for each character image selected by the matching unit 18. The word string generation unit 21 collates the candidate characters constituting the recognition result character string generated by the character string generation unit 19 with the words recorded in the word dictionary 24 stored in the memory 13, and stores them in the word dictionary 24. A character string in which all the characters match the recorded word or a word in which only a part of the characters match is extracted, and a word character string including the extracted words is generated. The replacement determination units 22, 22 a, and 22 b differ by comparing the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 for each character at a corresponding position. When there is a character, for different characters, the feature amount stored in association with the character that matches the character of the recognition result character string among the feature amounts recorded in the character recognition dictionary 14 stored in the memory 13, and the memory 13 is compared with the feature amount stored in association with the character that matches the character of the word character string among the feature amounts of the characters recorded in the character recognition dictionary 14 stored in 13, and a comparison criterion determined in advance by the comparison result is determined. When it is satisfied, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. Then, the replacement unit 23 replaces the character in the recognition result character string determined to be replaced by the replacement determination unit 22, 22a, 22b with the character at the corresponding position in the word character string.

さらに、置き換え判定部２２，２２ａ，２２ｂは、文字列生成部１９によって生成された認識結果文字列と単語列生成部２１によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、特徴抽出部１７によって抽出された特徴量と、メモリ１３に記憶される文字認識辞書１４に収録された特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較して類似度を算出し、算出した類似度が予め定める閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。したがって、特徴量による類似度を指標とするので、簡単な計算で文字の類似を判定することができる。 Furthermore, the replacement determination units 22, 22 a, and 22 b compare the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 for each character at a corresponding position. If there is a different character, the different character matches the character of the word character string among the feature amount extracted by the feature extraction unit 17 and the feature amount recorded in the character recognition dictionary 14 stored in the memory 13. The similarity is calculated by comparing the feature quantity stored in association with the character, and when the calculated similarity is equal to or greater than a predetermined threshold, the character in the recognition result character string is converted to the corresponding character in the word character string. It is determined to be replaced with. Therefore, since the similarity based on the feature amount is used as an index, the similarity of characters can be determined with a simple calculation.

さらに、置き換え判定部２２，２２ａ，２２ｂは、文字列生成部１９によって生成された認識結果文字列と単語列生成部２１によって生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、メモリ１３に記憶される文字認識辞書１４に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、メモリ１３に記憶される文字認識辞書１４に収録された特徴量のうち単語文字列の文字の一致する文字に対応付けて記憶される特徴量とを比較して類似度を算出し、算出した類似度が予め定める閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。したがって、特徴量による類似度を指標とするので、簡単な計算で文字の類似を判定することができる。 Furthermore, the replacement determination units 22, 22 a, and 22 b compare the recognition result character string generated by the character string generation unit 19 and the word character string generated by the word string generation unit 21 for each character at a corresponding position. If there is a different character, the different character is stored in association with the character that matches the character of the recognition result character string among the feature values recorded in the character recognition dictionary 14 stored in the memory 13. The similarity is calculated by comparing the feature quantity recorded in the character recognition dictionary 14 stored in the memory 13 with the feature quantity stored in association with the matching character of the word character string. When the similarity is equal to or greater than a predetermined threshold, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. Therefore, since the similarity based on the feature amount is used as an index, the similarity of characters can be determined with a simple calculation.

さらに、置き換え判定部２２，２２ａ，２２ｂは、前記算出した類似度が予め定める閾値未満であるとき、単語列生成部２１が前記単語文字列を生成したときに、単語辞書２４に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語であって、前記単語文字列に含まれない単語が抽出されていた場合は、前記算出した類似度が、予め定める閾値未満の値である予め定める第２の閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。 Further, the replacement determination unit 22, 22a, 22b is configured to store the words recorded in the word dictionary 24 when the word string generation unit 21 generates the word character string when the calculated similarity is less than a predetermined threshold. And a character string that matches all characters, or a word that matches only a part of the characters and is not included in the word character string, the calculated similarity is a predetermined threshold value. When the value is equal to or greater than a predetermined second threshold value that is less than the value, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string.

したがって、単語の候補が１つしか生成されていない場合は、その単語の候補は、正解である可能性が高く、類似度が予め定める閾値未満であっても、最適な単語として置き換えることができる。たとえば、光学式文字認識では、「かすれ」あるいは「にじみ」が原因で、複数の文字からなる単語のうちの１つの文字が正しく認識されない場合、類似度が低くなり、予め定める閾値未満になることがあるが、単語辞書２４から抽出された単語が１つである場合、正解である可能性は非常に高く、この単語を用いて修正することによって、正しい文字に訂正することができるという効果がある。 Therefore, when only one word candidate is generated, the word candidate is likely to be correct and can be replaced as an optimum word even if the similarity is less than a predetermined threshold. . For example, in optical character recognition, if one character of a word consisting of a plurality of characters is not correctly recognized due to “faint” or “smear”, the degree of similarity will be low and will be less than a predetermined threshold. However, if there is only one word extracted from the word dictionary 24, there is a very high possibility that it is a correct answer, and there is an effect that it can be corrected to a correct character by correcting using this word. is there.

さらに、置き換え判定部２２，２２ａ，２２ｂは、前記算出した類似度が予め定める閾値未満であるとき、前記異なる文字を含む単語の文字数が予め定める基準文字数以上である場合は、前記算出した類似度が、予め定める閾値未満の値である予め定める第２の閾値以上であるとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。したがって、文字数が多い単語については、類似度が予め定める閾値未満であっても、正解である可能性が高く、最適な単語を選択して置き換えることができる。 Furthermore, when the calculated similarity is less than a predetermined threshold and the number of characters of the word including the different characters is greater than or equal to a predetermined reference number, the replacement determination unit 22, 22a, 22b Is greater than or equal to a predetermined second threshold value that is a value less than a predetermined threshold value, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string. Therefore, for words with a large number of characters, even if the degree of similarity is less than a predetermined threshold value, there is a high possibility that the word is correct, and an optimal word can be selected and replaced.

さらに、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書１４、および少なくとも１つの文字からなる単語を収録する単語辞書２４を記憶するメモリ１３を含む情報処理装置で、メモリ１３に記憶される文字認識辞書１４および単語辞書２４を用いて文字を認識するにあたって、図１０に示したステップＡ２では、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。図１０に示したステップＡ３では、図１０に示したステップＡ２で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。図１０に示したステップＡ４では、図１０に示したステップＡ３で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。図１０に示したステップＡ５では、図１０に示したステップＡ４で抽出された特徴量とメモリ１３に記憶される文字認識辞書１４に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書１４から少なくとも１つ選択する。図１０に示したステップＡ６では、図１０に示したステップＡ５で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。図１１に示したステップＢ１、図１７に示したステップＣ１，Ｃ２、または図２５に示したステップＤ１では、図１０に示したステップＡ６で生成された認識結果文字列を構成する候補文字と、メモリ１３に記憶される単語辞書２４に収録された単語とを照合し、単語辞書２４に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。図１１に示したステップＢ２，Ｂ４〜Ｂ６、図１７に示したステップＣ３，Ｃ５〜Ｃ８、または図２５に示したステップＤ２，Ｄ４〜Ｄ９では、図１０に示したステップＡ６で生成された認識結果文字列と図１１に示したステップＢ１、図１７に示したステップＣ１，Ｃ２、または図２５に示したステップＤ１で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、図１０に示したステップＡ４で抽出された特徴量と、メモリ１３に記憶される文字認識辞書１４に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、図１１に示したステップＢ３，Ｂ７、図１７に示したステップＣ４，Ｃ９、または図２５に示したステップＤ３，Ｄ１０では、図１１に示したステップＢ２，Ｂ４〜Ｂ６、図１７に示したステップＣ３，Ｃ５〜Ｃ８、または図２５に示したステップＤ２，Ｄ４〜Ｄ９で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 Furthermore, the information processing apparatus includes a memory 13 for storing a character recognition dictionary 14 that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a word dictionary 24 that records a word composed of at least one character. Thus, when recognizing characters using the character recognition dictionary 14 and the word dictionary 24 stored in the memory 13, in step A2 shown in FIG. 10, characters are extracted from an image of a document in which character string rows that are character strings are described. A character string image representing a column is cut out for each row. In step A3 shown in FIG. 10, a character image representing each character is cut out character by character from the character string image cut out in step A2 shown in FIG. In step A4 shown in FIG. 10, the feature amount of the character represented by each character image is extracted from the character image cut out in step A3 shown in FIG. In step A5 shown in FIG. 10, the feature quantity extracted in step A4 shown in FIG. 10 is compared with the feature quantity recorded in the character recognition dictionary 14 stored in the memory 13, and each character represented by the character image is displayed. Then, at least one candidate character that can be recognized as the character represented by the character image is selected from the character recognition dictionary 14. In step A6 shown in FIG. 10, a recognition result character string is generated by combining candidate characters for each character image selected in step A5 shown in FIG. In step B1 shown in FIG. 11, step C1 and C2 shown in FIG. 17, or step D1 shown in FIG. 25, candidate characters constituting the recognition result character string generated in step A6 shown in FIG. The words recorded in the word dictionary 24 stored in the memory 13 are collated, and a character string in which all characters match the word recorded in the word dictionary 24 or a word in which only some characters match is extracted, Generate a word string consisting of the extracted words. In steps B2, B4 to B6 shown in FIG. 11, steps C3, C5 to C8 shown in FIG. 17, or steps D2, D4 to D9 shown in FIG. 25, the recognition generated in step A6 shown in FIG. The result character string and the word character string generated in step B1 shown in FIG. 11, step C1, C2 shown in FIG. 17, or step D1 shown in FIG. 25 are compared for each character at the corresponding position. In the case where there are different characters, for the different characters, among the feature values extracted in step A4 shown in FIG. 10 and the character feature values recorded in the character recognition dictionary 14 stored in the memory 13, The feature quantity stored in association with the character that matches the character is compared, and when the comparison result satisfies a predetermined comparison criterion, the character in the recognition result character string is replaced with the corresponding character in the word character string. Replace the judges. Then, in steps B3 and B7 shown in FIG. 11, steps C4 and C9 shown in FIG. 17, or steps D3 and D10 shown in FIG. 25, steps B2, B4 to B6 shown in FIG. The characters in the recognition result character string determined to be replaced in steps C3, C5 to C8 or steps D2, D4 to D9 shown in FIG. 25 are replaced with the characters at the corresponding positions in the word character string.

したがって、文字認識辞書１４および単語辞書２４以外の辞書を用いることなく、認識した文字の誤りを正しい文字に訂正することができる。文字認識辞書１４および単語辞書２４以外の辞書を必要としないので、情報処理装置の記憶容量の増加を回避することができる。 Therefore, it is possible to correct a recognized character error to a correct character without using a dictionary other than the character recognition dictionary 14 and the word dictionary 24. Since no dictionary other than the character recognition dictionary 14 and the word dictionary 24 is required, an increase in the storage capacity of the information processing apparatus can be avoided.

さらに、文字とその文字の特徴を数値化した特徴量とを対応付けて収録する文字認識辞書１４、および少なくとも１つの文字からなる単語を収録する単語辞書２４を記憶するメモリ１３を含む情報処理装置で、メモリ１３に記憶される文字認識辞書１４および単語辞書２４を用いて文字を認識するにあたって、図１０に示したステップＡ２では、文字列である文字列行が記載された原稿の画像から文字列を表す文字列画像を行ごとに切り出す。図１０に示したステップＡ３では、図１０に示したステップＡ２で切り出された文字列画像から各文字を表す文字画像を１文字ずつ切り出す。図１０に示したステップＡ４では、図１０に示したステップＡ３で切り出された文字画像から各文字画像が表す文字の特徴量を抽出する。図１０に示したステップＡ５では、図１０に示したステップＡ４で抽出された特徴量とメモリ１３に記憶される文字認識辞書１４に収録された特徴量とを比較し、文字画像が表す文字ごとに、文字画像が表す文字として認識され得る候補文字を文字認識辞書１４から少なくとも１つ選択する。図１０に示したステップＡ６では、図１０に示したステップＡ５で選択された文字画像ごとの候補文字を組み合わせて認識結果文字列を生成する。図１１に示したステップＢ１、図１７に示したステップＣ１，Ｃ２、または図２５に示したステップＤ１では、図１０に示したステップＡ６で生成された認識結果文字列を構成する候補文字と、メモリ１３に記憶される単語辞書２４に収録された単語とを照合し、単語辞書２４に収録された単語と全文字が一致する文字列、または一部の文字のみが一致する単語を抽出し、抽出した単語からなる単語文字列を生成する。図１１に示したステップＢ２，Ｂ４〜Ｂ６、図１７に示したステップＣ３，Ｃ５〜Ｃ８、または図２５に示したステップＤ２，Ｄ４〜Ｄ９では、図１０に示したステップＡ６で生成された認識結果文字列と図１１に示したステップＢ１、図１７に示したステップＣ１，Ｃ２、または図２５に示したステップＤ１で生成された単語文字列とを対応する位置にある文字ごとに比較して異なる文字がある場合、異なる文字については、メモリ１３に記憶される文字認識辞書１４に収録された特徴量のうち認識結果文字列の文字に一致する文字に対応付けて記憶される特徴量と、メモリ１３に記憶される文字認識辞書１４に収録された文字の特徴量のうち単語文字列の文字に一致する文字に対応付けて記憶される特徴量とを比較し、比較結果が予め定める比較基準を満たすとき、認識結果文字列中の文字を単語文字列中の対応する文字で置き換えると判定する。そして、図１１に示したステップＢ３，Ｂ７、図１７に示したステップＣ４，Ｃ９、または図２５に示したステップＤ３，Ｄ１０では、図１１に示したステップＢ２，Ｂ４〜Ｂ６、図１７に示したステップＣ３，Ｃ５〜Ｃ８、または図２５に示したステップＤ２，Ｄ４〜Ｄ９で置き換えると判定された認識結果文字列中の文字を単語文字列中の対応する位置にある文字で置き換える。 Furthermore, the information processing apparatus includes a memory 13 for storing a character recognition dictionary 14 that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a word dictionary 24 that records a word composed of at least one character. Thus, when recognizing characters using the character recognition dictionary 14 and the word dictionary 24 stored in the memory 13, in step A2 shown in FIG. 10, characters are extracted from an image of a document in which character string rows that are character strings are described. A character string image representing a column is cut out for each row. In step A3 shown in FIG. 10, a character image representing each character is cut out character by character from the character string image cut out in step A2 shown in FIG. In step A4 shown in FIG. 10, the feature amount of the character represented by each character image is extracted from the character image cut out in step A3 shown in FIG. In step A5 shown in FIG. 10, the feature quantity extracted in step A4 shown in FIG. 10 is compared with the feature quantity recorded in the character recognition dictionary 14 stored in the memory 13, and each character represented by the character image is displayed. Then, at least one candidate character that can be recognized as the character represented by the character image is selected from the character recognition dictionary 14. In step A6 shown in FIG. 10, a recognition result character string is generated by combining candidate characters for each character image selected in step A5 shown in FIG. In step B1 shown in FIG. 11, step C1 and C2 shown in FIG. 17, or step D1 shown in FIG. 25, candidate characters constituting the recognition result character string generated in step A6 shown in FIG. The words recorded in the word dictionary 24 stored in the memory 13 are collated, and a character string in which all characters match the word recorded in the word dictionary 24 or a word in which only some characters match is extracted, Generate a word string consisting of the extracted words. In steps B2, B4 to B6 shown in FIG. 11, steps C3, C5 to C8 shown in FIG. 17, or steps D2, D4 to D9 shown in FIG. 25, the recognition generated in step A6 shown in FIG. The result character string and the word character string generated in step B1 shown in FIG. 11, step C1, C2 shown in FIG. 17, or step D1 shown in FIG. 25 are compared for each character at the corresponding position. If there are different characters, for the different characters, among the feature amounts recorded in the character recognition dictionary 14 stored in the memory 13, a feature amount stored in association with a character that matches the character of the recognition result character string; Of the character feature quantities recorded in the character recognition dictionary 14 stored in the memory 13, the feature quantities stored in association with the characters matching the characters in the word character string are compared, and the comparison result is determined in advance. When meeting the comparison reference that determines that replaces the character recognition result in the character string in the corresponding character in the word string. Then, in steps B3 and B7 shown in FIG. 11, steps C4 and C9 shown in FIG. 17, or steps D3 and D10 shown in FIG. 25, steps B2, B4 to B6 shown in FIG. The characters in the recognition result character string determined to be replaced in steps C3, C5 to C8 or steps D2, D4 to D9 shown in FIG. 25 are replaced with the characters at the corresponding positions in the word character string.

さらに、コンピュータに情報処理方法の各工程を実行させるためのプログラムとして提供することができる。 Furthermore, it can provide as a program for making a computer perform each process of the information processing method.

さらに、コンピュータに前記情報処理方法の各工程を実行させるためのプログラムを記録したコンピュータ読み取り可能な記録媒体として提供することができる。 Furthermore, the present invention can be provided as a computer-readable recording medium that records a program for causing a computer to execute each step of the information processing method.

１，１ａ，１ｂ情報処理装置
２入力装置
３出力装置
１１制御部
１２入出力制御部
１３メモリ
１４文字認識辞書
１５文字列行切り出し部
１６文字切り出し部
１７特徴抽出部
１８マッチング部
１９文字列生成部
２０，２０ａ，２０ｂ後処理部
２１単語列生成部
２２，２２ａ，２２ｂ置き換え判定部
２３置き換え部
２４単語辞書
３０文字
３１メッシュ
３２特徴量
３３メッシュ特徴量 1, 1a, 1b Information processing device 2 Input device 3 Output device 11 Control unit 12 Input / output control unit 13 Memory 14 Character recognition dictionary 15 Character string row cutout unit 16 Character cutout unit 17 Feature extraction unit 18 Matching unit 19 Character string generation unit 20, 20a, 20b Post-processing unit 21 Word string generation unit 22, 22a, 22b Replacement determination unit 23 Replacement unit 24 Word dictionary 30 Character 31 Mesh 32 Feature amount 33 Mesh feature amount

Claims

A storage unit for storing a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other;
A character string row cutout unit that cuts out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout unit that cuts out one character image representing each character from the character string image cut out by the character string line cutout unit;
A feature extraction unit that extracts a feature amount of a character represented by each character image from the character image cut out by the character cutout unit;
The feature amount extracted by the feature extraction unit is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection unit for selecting at least one from the character recognition dictionary;
A character string generation unit that generates a recognition result character string by combining candidate characters for each character image selected by the selection unit;
The candidate characters constituting the recognition result character string generated by the character string generation unit are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation unit that extracts a character string or a word that matches only some characters and generates a word character string composed of the extracted words;
If there is a different character by comparing the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at the corresponding position, feature extraction is performed for the different characters The feature amount extracted by the unit is compared with the feature amount stored in association with the character that matches the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit. A replacement determination unit that determines whether to replace a character in the recognition result character string with a corresponding character in the word character string based on the comparison result;
An information processing apparatus comprising: a replacement unit that replaces a character in a recognition result character string determined to be replaced by a replacement determination unit with a character at a corresponding position in a word character string.

A storage unit for storing a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other;
A character string row cutout unit that cuts out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout unit that cuts out one character image representing each character from the character string image cut out by the character string line cutout unit;
A feature extraction unit that extracts a feature amount of a character represented by each character image from the character image cut out by the character cutout unit;
The feature amount extracted by the feature extraction unit is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection unit for selecting at least one from the character recognition dictionary;
A character string generation unit that generates a recognition result character string by combining candidate characters for each character image selected by the selection unit;
The candidate characters constituting the recognition result character string generated by the character string generation unit are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation unit that extracts a character string or a word that matches only some characters and generates a word character string composed of the extracted words;
When there is a different character by comparing the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at the corresponding position, the storage unit Among the feature quantities recorded in the character recognition dictionary stored in the memory, and the feature quantities stored in association with the characters matching the characters in the recognition result character string, and the features recorded in the character recognition dictionary stored in the storage unit The feature amount stored in association with the character that matches the character of the word character string is compared, and when the comparison result satisfies a predetermined comparison criterion, the character in the recognition result character string is A replacement determination unit that determines to replace with a corresponding character;
An information processing apparatus comprising: a replacement unit that replaces a character in a recognition result character string determined to be replaced by a replacement determination unit with a corresponding character in a word character string.

When the replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position and has different characters The different characters are stored in association with the character extracted from the feature extraction unit and the character matching the character of the word character string among the feature values recorded in the character recognition dictionary stored in the storage unit. The similarity is calculated by comparing with the feature amount to be determined, and when the calculated similarity is equal to or greater than a predetermined threshold, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string The information processing apparatus according to claim 1.

When the replacement determination unit compares the recognition result character string generated by the character string generation unit and the word character string generated by the word string generation unit for each character at a corresponding position and has different characters For different characters, feature quantities stored in association with characters that match the characters in the recognition result character string among the feature quantities recorded in the character recognition dictionary stored in the storage unit, and stored in the storage unit Among the feature quantities recorded in the character recognition dictionary to be compared with the feature quantity stored in association with the character that matches the character of the word character string, and the similarity is calculated, and the calculated similarity is a predetermined threshold value When it is above, it determines with replacing the character in a recognition result character string with the corresponding character in a word character string, The information processing apparatus of Claim 2 characterized by the above-mentioned.

When the calculated similarity is less than a predetermined threshold, when the word string generation unit generates the word character string, the replacement determination unit matches all words with the words recorded in the word dictionary. If a word that matches only a character string or a part of characters and a word that is not included in the word character string has been extracted, the calculated similarity is a value that is less than a predetermined threshold value. 5. The information processing apparatus according to claim 3, wherein when it is equal to or more than a predetermined second threshold value, it is determined that the character in the recognition result character string is replaced with the corresponding character in the word character string.

When the calculated similarity is less than a predetermined threshold, and the number of characters of the word including the different characters is equal to or greater than a predetermined reference number, the calculated similarity is less than a predetermined threshold 5. The information according to claim 3, wherein when the value is equal to or greater than a predetermined second threshold value, the character in the recognition result character string is determined to be replaced with the corresponding character in the word character string. Processing equipment.

An information processing apparatus including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character. An information processing method for recognizing characters using a character recognition dictionary and a word dictionary stored in
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
If there are different characters by comparing the recognition result character string generated in the character string generation process and the word character string generated in the word string generation process for each character at the corresponding position, feature extraction is performed for the different characters The feature amount extracted in the process is compared with the feature amount stored in association with the character matching the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit. A replacement determination step for determining that a character in the recognition result character string is replaced with a corresponding character in the word character string when the comparison result satisfies a predetermined comparison criterion;
A replacement step of replacing a character in the recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in the word character string.

An information processing apparatus including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word composed of at least one character. An information processing method for recognizing characters using a character recognition dictionary and a word dictionary stored in
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
When there are different characters by comparing the recognition result character string generated in the character string generation step and the word character string generated in the word string generation step for each character at the corresponding position, the storage unit Among the feature quantities recorded in the character recognition dictionary stored in the memory, and the feature quantities stored in association with the characters matching the characters in the recognition result character string, and the characters recorded in the character recognition dictionary stored in the storage unit When the comparison result meets a predetermined comparison criterion, the character in the recognition result character string is converted to the word character string. A replacement determination step for determining to replace with the corresponding character in
A replacement step of replacing a character in the recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in the word character string.

In a computer including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word including at least one character,
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
If there are different characters by comparing the recognition result character string generated in the character string generation process and the word character string generated in the word string generation process for each character at the corresponding position, feature extraction is performed for the different characters The feature amount extracted in the process is compared with the feature amount stored in association with the character matching the character of the word character string among the character feature amounts recorded in the character recognition dictionary stored in the storage unit. A replacement determination step for determining that the character in the recognition result character string is replaced with the corresponding character in the word character string when the comparison result satisfies a predetermined comparison criterion;
A program for executing a replacement step of replacing a character in a recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in a word character string.

In a computer including a character recognition dictionary that records a character and a feature value obtained by quantifying the character of the character in association with each other, and a storage unit that stores a word dictionary that records a word including at least one character,
A character string row cutting step for cutting out a character string image representing a character string for each line from an image of a document in which character string lines that are character strings are described;
A character cutout step of cutting out one character image representing each character from the character string image cut out in the character string row cutout step;
A feature extraction step of extracting the feature amount of the character represented by each character image from the character image cut out in the character cutout step;
The feature amount extracted in the feature extraction step is compared with the feature amount recorded in the character recognition dictionary stored in the storage unit, and for each character represented by the character image, a candidate character that can be recognized as the character represented by the character image is determined. A selection step of selecting at least one from the character recognition dictionary;
A character string generation step of generating a recognition result character string by combining candidate characters for each character image selected in the selection step;
The candidate characters constituting the recognition result character string generated in the character string generation step are compared with the words recorded in the word dictionary stored in the storage unit, and the words recorded in the word dictionary match all the characters. A word string generation step of extracting a word string or a word that matches only some characters, and generating a word string composed of the extracted words;
When there are different characters by comparing the recognition result character string generated in the character string generation step and the word character string generated in the word string generation step for each character at the corresponding position, the storage unit Among the feature quantities recorded in the character recognition dictionary stored in the memory, and the feature quantities stored in association with the characters matching the characters in the recognition result character string, and the characters recorded in the character recognition dictionary stored in the storage unit When the comparison result meets a predetermined comparison criterion, the character in the recognition result character string is converted to the word character string. A replacement determination step of determining whether to replace with a corresponding character in
A program for executing a replacement step of replacing a character in a recognition result character string determined to be replaced in the replacement determination step with a character at a corresponding position in a word character string.

The computer-readable recording medium which recorded the program of Claim 9 or 10.