JPH0614375B2

JPH0614375B2 - Character input device

Info

Publication number: JPH0614375B2
Application number: JP58241992A
Authority: JP
Inventors: 健二藤方; 宣之武市
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1983-12-23
Filing date: 1983-12-23
Publication date: 1994-02-23
Anticipated expiration: 2009-02-23
Also published as: JPS60134992A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は文字認識、いわゆるＯＣＲ（Optical Charact
er Recognition）の後処理方式あるいは音声認識の後
処理方式に係り、特に一意に認識できないいわゆる不読
文字を、言語としての観点から評価して決定する文字入
力装置に関する。DETAILED DESCRIPTION OF THE INVENTION Field of Application of the Invention The present invention relates to character recognition, so-called OCR (Optical Charact).
er Recognition) post-processing method or voice recognition post-processing method, and particularly to a character input device that evaluates and determines a so-called unreadable character that cannot be uniquely recognized from the viewpoint of language.

[Background of the Invention]

手書き文字や印刷文字をそのパターンを調べて文字認識
する場合において、書かれている文字を正しく認識でき
ない不読文字の場合でも、何番目かの候補には正しい文
字が含まれている場合が多い。従来、このような場合の
正しい候補を選び出すための文字認識の後処理方式とし
て、認識した文字列を単語辞書と比較することによって
正誤を判定する方式が知られており、すでに住所などの
単語入力について効果があることが報告されているが、
一般の文章入力に対しては、単語数が膨大でかつ処理が
複雑で時間がかかり、実用は困難であつた。なお、従来
の文字認識方式については、例えば、特開昭５８−４８
１８１号、５８−１６６４９０号を参照。When recognizing handwritten characters or printed characters by checking their patterns, even if the characters are not readable and cannot be recognized correctly, some candidates often include correct characters. . Conventionally, as a post-processing method of character recognition for selecting a correct candidate in such a case, a method of determining correctness by comparing a recognized character string with a word dictionary is already known. Has been reported to be effective,
For general text input, the number of words is enormous, the processing is complicated and time-consuming, and it is difficult to put into practical use. Regarding the conventional character recognition method, for example, Japanese Patent Laid-Open No. 58-48
181, 58-166490.

[Object of the Invention]

本発明の目的は、一般の文章に対しても適用可能な言語
解析による認識後処理機能により、入力文字の修正工数
を削減可能な文字認識による文字入力装置を提供するこ
とにある。An object of the present invention is to provide a character input device by character recognition that can reduce the number of correction steps for input characters by a post-recognition processing function by language analysis that can be applied to general sentences.

以下の説明では日文語の場合を例にとり説明するが、他
の言語たとえば英語、韓国語、中国語等においても実施
可能である。また以下とくに文字認識の出力について説
明するが、音声認識による認識出力についても実施可能
である。In the following description, the case of the Japanese language will be described as an example, but the present invention can also be implemented in other languages such as English, Korean, and Chinese. Further, although the output of character recognition will be described below in particular, recognition output by voice recognition can also be implemented.

[Outline of Invention]

本発明は、文字または音声を認識し、認識結果を少なく
とも文字コード、類似度、およびその類似度の順位の組
で構成される文字候補として出力する確認手段と、前記
文字候補を処理単位となる文字列に区切る文字列区切り
判定手段と、前記処理単位となる文字列内の各文字候補
を組み合わせて複数の文字列候補を作成する文字列候補
作成手段と、前記複数の文字列候補について言語として
の妥当性を示す言語尤度を出力する言語解析手段とを備
えた文字入力装置において、前記複数の文字列候補につ
いて求めた言語尤度のうち、最大の言語尤度が所定の言
語尤度閾値を越える場合には、前記最大の言語尤度に対
応する文字列候補を出力し、前記最大の言語尤度が所定
の言語尤度閾値以下の場合には、前記類似度の順位が第
１位の文字候補から成る文字列候補を出力する文字列候
補選択手段を備えたことを特徴とする。The present invention recognizes a character or a voice and outputs a recognition result as a character candidate composed of at least a set of a character code, a similarity and a rank of the similarity, and the character candidate as a processing unit. A character string delimiter determining unit that delimits a character string, a character string candidate creating unit that creates a plurality of character string candidates by combining each character candidate in the character string that is the processing unit, and a language for the plurality of character string candidates In a character input device provided with a language analysis unit that outputs a language likelihood indicating the validity of, the maximum language likelihood among the language likelihoods obtained for the plurality of character string candidates is a predetermined language likelihood threshold. When the maximum linguistic likelihood is less than or equal to a predetermined linguistic likelihood threshold, the similarity rank is ranked first. Character candidates Characterized by comprising a string candidate selection means for outputting a character string candidates made.

Example of Invention

第１図は文字原稿１を文字認識部２によつて文字認識
し、その結果を日本文解析部３によつて評価して最適な
認識文字候補を選択し、テキストデータ４としてコード
化するための処理手順の概要である。本発明はこのうち
の日本文解析部３の構成と制御に関し、必要最小限の日
本文解析処理を行なうとともに、日本文解析の評価値と
認識類似度の両者によつて最適な認識文字候補を選択出
力するものである。FIG. 1 shows that the character manuscript 1 is character-recognized by the character recognition unit 2 and the result is evaluated by the Japanese sentence analysis unit 3 to select the optimum recognized character candidate and code it as the text data 4. 2 is an outline of a processing procedure of. The present invention relates to the configuration and control of the Japanese sentence analysis unit 3 among them, performs the minimum necessary Japanese sentence analysis processing, and determines the optimum recognized character candidate based on both the evaluation value of the Japanese sentence analysis and the recognition similarity. It is for selective output.

以下、本発明の一実施例を第２図の構成図、第３図のフ
ローチヤート、第４図から第６図までのデータにより説
明する。An embodiment of the present invention will be described below with reference to the configuration diagram of FIG. 2, the flow chart of FIG. 3, and the data of FIGS. 4 to 6.

第２図の文字認識部１１は手書きあるいは印刷文書の文
字原稿２１に書かれた文字を認識し、各文字に対する認
識結果を複数の文字コードと類似度および順位の組で構
成される認識文字候補２２として出力する。例えば「大
きい犬だ。」という文を認識した場合の認識文字候補２
２は第４図に示した形式で出力される。すなわち各文字
の候補数３１、文字コードイと類似度ロから成る文字候
補３２が出力され、順位は類似度の大きい上から順に１
位，２位，３位…となる。The character recognition unit 11 shown in FIG. 2 recognizes characters written on a character original 21 of a handwritten or printed document, and recognizes the recognition result for each character as a recognized character candidate composed of a plurality of character codes, similarity and rank. 22 is output. For example, the recognition character candidate 2 when the sentence "It is a big dog." Is recognized.
2 is output in the format shown in FIG. That is, the number of candidates 31 for each character and the character candidate 32 consisting of the character code a and the similarity degree b are output, and the rank is 1 in descending order of similarity degree.
The second place, the third place ...

文字列区切り判定部１２は、認識文字候補２２を、後続
の日本文解析処理が簡単になるように、解析処理単位に
区切るものであり、第１位の文字候補が句読点やスペー
スであるところで区切る。第４図の例では「大きり犬
だ」が１つの解析処理単位となる。The character string delimiter determination unit 12 divides the recognized character candidates 22 into analysis processing units so as to simplify the subsequent Japanese sentence analysis processing, and separates the first character candidates at punctuation marks or spaces. . In the example of FIG. 4, “large dog” is one analysis processing unit.

つぎに類似度閾値判定部１３では解析処理単位の各文字
候補の類似度を所定の類似度閾値と比較し、日本文解析
の対象とすべき文字列候補作成のための組合せ文字候補
２３を抽出する。具体的には第５図に示したように所定
の類似度閾値を越える文字候補については、各文字毎に
その候補数を求め、組合せ文字候補数４１として対応す
る組合せ文字候補４２に対応して記録する。ただし所定
の類似度閾値を越える文字候補がない場合はその組合せ
文字候補数は１とする。第５図は所定の類似度閾値を６
０とした場合の組合せ文字候補である。Next, the similarity threshold determination unit 13 compares the similarity of each character candidate in the analysis processing unit with a predetermined similarity threshold, and extracts a combined character candidate 23 for creating a character string candidate to be subjected to Japanese sentence analysis. To do. Specifically, as shown in FIG. 5, for character candidates that exceed a predetermined similarity threshold, the number of candidates is calculated for each character, and the number of combined character candidates 41 corresponds to the corresponding combined character candidate 42. Record. However, if there is no character candidate that exceeds the predetermined similarity threshold, the number of combined character candidates is set to 1. FIG. 5 shows that the predetermined similarity threshold is 6
It is a combination character candidate when 0 is set.

文字列候補作成部１４は、組合せ文字候補２３を組合せ
て文字列候補２４を作る。第５図の組合せ文字候補に対
しては、組合せ文字候補数４１の積に相当する組合せが
でき、第６図に示すように１２種の文字列候補２４が出
力される。The character string candidate creating unit 14 creates the character string candidate 24 by combining the combined character candidates 23. With respect to the combination character candidates of FIG. 5, a combination corresponding to the product of the combination character candidate number 41 is formed, and 12 kinds of character string candidates 24 are output as shown in FIG.

日本文解析部１５は与えられた文字列について日本文と
しての妥当性をチエツクするものであり、公知の技術を
用いて実現できる。簡単な方法としては、文字列を単語
辞書と比較し、対応する単語の有無や品詞あるいは単語
の使用頻度を得、また前後の文字列と文法的に接続可能
性を評価して言語尤度を求めることができる。さらに進
んだ方法としては、文節で区切られていないベタ書き文
についても文献（「ベタ書き文の仮名漢字変換」，昭和
５２年度電子通信学会情報部門全国大会９１）などに見
られる技術を用いて日本文の尤度を求めることができ
る。日本文解析手法については詳細説明は省略し、文字
列に対して日本語尤度Ｊを出力する機能を有するものと
して扱う。第２図の日本文解析部１５では、文字列候補
２４の各々について日本語尤度２５を求める。日本文解
析部１５による処理は、第３図のフローチヤートの上部
に示すように、文字列候補２４すべてについて日本語尤
度２５を求め終るまで続けられ、その結果は第６図で示
したようになる。第６図の例では日本語尤度は５段階評
価で表わされており、３番目の文字列候補の尤度が５で
最大である。The Japanese sentence analysis unit 15 checks the validity of a given character string as a Japanese sentence and can be realized by using a known technique. A simple method is to compare the character string with a word dictionary, obtain the presence or absence of the corresponding word, the part of speech, or the frequency of use of the word, and evaluate the linguistic likelihood by evaluating the grammatical connection with the preceding and following character strings. You can ask. As a further advanced method, the technique found in the literature (“Kana-to-Kanji conversion of solid written sentences”, 1987 National Institute of Electronics and Communication Information Division National Convention 91) is also used for solid written sentences that are not separated by clauses. The likelihood of Japanese sentences can be calculated. A detailed description of the Japanese sentence analysis method is omitted, and the Japanese sentence analysis method is treated as having a function of outputting the Japanese likelihood J to a character string. The Japanese sentence analysis unit 15 in FIG. 2 obtains the Japanese likelihood 25 for each of the character string candidates 24. The processing by the Japanese sentence analysis unit 15 is continued until the Japanese likelihood 25 is obtained for all the character string candidates 24, as shown in the upper part of the flow chart in FIG. 3, and the result is as shown in FIG. become. In the example of FIG. 6, the Japanese likelihood is represented by a 5-step evaluation, and the likelihood of the third character string candidate is 5, which is the maximum.

文字列候補選択部１６では、日本語尤度２５を比較と、
最大な日本語尤度を検出する。この最大な日本語尤度を
Ｊ_MAXとする。また予め定めた日本語尤度閾値をＪ_THと
する。第３図のフローチヤートの下半分に示したよう
に、Ｊ_MAXとＪ_THを比較して、Ｊ_MAX＞Ｊ_THの場合はＪ
_MAXに対応する文字列を最適文字列出力２６として選択
出力する。第６図では３番目の文字列候補「大きい犬
だ」の日本語尤度が最大で、Ｊ_MAX＝５であるので、Ｊ
_TH＝４と仮定した場合には「大きい犬だ」が出力され
る。Ｊ_MAX≦Ｊ_THの場合には、当該文字列は日本文とし
ての体をなしていないことになり、日本文解析によつて
も正しい文字列候補の判定ができないので、第１位の文
字候補から成る文字列を文字列候補２４の中から選択し
て最適文字列出力２６として出力する。In the character string candidate selection unit 16, comparing the Japanese likelihood 25,
Detect the maximum Japanese likelihood. Let this maximum Japanese likelihood be J _MAX . A predetermined Japanese likelihood threshold is J _TH . As shown in the lower half of the flow chart in Fig. 3, J _MAX and J _TH are compared, and when J _MAX > J _TH , J
_The character string corresponding to _MAX is selected and output as the optimum character string output 26. In FIG. 6, the Japanese likelihood of the third character string candidate “big dog” is J _MAX = 5, so J
_{If TH} = 4, "Big dog" is output. If J _MAX ≤ J _TH, the character string does not form the body of a Japanese sentence, and the correct character string candidate cannot be determined even by the Japanese sentence analysis. A character string consisting of is selected from the character string candidates 24 and output as the optimum character string output 26.

この結果を具体的に説明するとつぎのようになる。まず
Ｊ_MAX≦Ｊ_THとなるのは、文字候補の中に正解が存在し
ない場合に生じ易い。具体例を第７図に示すが、これは
「日本国民は」を文字認識させた場合に「日」という字
が認識できず、候補に含まれていない例である。この場
合には隣接する文字の誤つた候補と組合わされた文字列
候補の日本語尤度が最大になる。すなわち２番目の誤つ
た候補と組合された「白木国民は」という文字列候補の
日本語尤度が最大値Ｊ_MAXとなり、このままでは正しく
認識された２番目の「本」という文字まで巻き添えを受
けて改悪されることになるが、Ｊ_MAX≦Ｊ_THの場合の処
理によりこのような併害を無くすことが可能となり、第
１位の文字候補から成る「白本国民は」が出力される。The result will be specifically described as follows. First, J _MAX ≦ J _TH is likely to occur when there is no correct answer in the character candidates. A specific example is shown in FIG. 7. This is an example in which the character "day" cannot be recognized when "Japanese nationals" is recognized, and it is not included in the candidates. In this case, the Japanese likelihood of the character string candidate combined with the erroneous candidate of the adjacent character is maximized. That is, the Japanese likelihood of the character string candidate "Shiraki Kuniwa" combined with the second wrong candidate is the maximum value J _MAX . However, the processing in the case of J _MAX ≦ J _TH can eliminate such a harm, and “Shiramoto nationals” consisting of the first character candidate is output.

なお、本実施例において認識文字候補の類似度が非常に
低い場合には、正しい組合せ文字候補が存在しない可能
性が大であり、その場合には無駄に日本文解析するのを
止めるように制御することが望ましい。そのため制御を
追加した実施例のフローチヤートを第８図に示す。すな
わち、類似度閾値判定部１３において、所定の類似度閾
値の照合に先立つて、それよりも低い足切り類似度閾値
と第１位の文字候補の類似度を比較し、足切り類似度閾
値以下の第１位の文字候補の類似度が存在する場合は文
字列候補作成部１４において類似度が第１位の文字候補
から成る文字列を作成し、これを直接に最適文字列出力
２６として出力する。In this example, if the similarity of the recognized character candidates is very low, there is a high possibility that a correct combination character candidate does not exist, and in that case, control is performed so as to stop wasted Japanese sentence analysis. It is desirable to do. Therefore, the flow chart of the embodiment in which control is added is shown in FIG. That is, in the similarity threshold determination unit 13, prior to the matching of the predetermined similarity threshold, the lower threshold of the foot cut similarity and the similarity of the first character candidate are compared, and the threshold of the foot cut similarity is equal to or less than the threshold. If the similarity of the first character candidate of No. 1 exists, the character string candidate creating unit 14 creates a character string composed of the character candidates having the first similarity, and directly outputs this as the optimum character string output 26. To do.

つぎに本発明をワードプロセツサの入力アダプタとして
用いた実施例を第９図により説明する。本実施例は、Ｏ
ＣＲ６３をワードプロセツサ６１の入力手段として用い
るものである。文書入力制御装置６２はＯＣＲ６３によ
る手書き文書や印刷文書の認識入力、認識結果の認識文
字候補メモリ６４への格納、言語解析すべき文字列候補
の作成とその文字列候補メモリ６５への格納、言語解析
装置６６の制御、処理結果の文字列出力用テキストメモ
リ６７への格納などの、各制御を行なう。Next, an embodiment in which the present invention is used as an input adapter of a word processor will be described with reference to FIG. In this embodiment, O
The CR 63 is used as an input means of the word processor 61. The document input control device 62 recognizes a handwritten document or a printed document by the OCR 63, stores the recognition result in the recognized character candidate memory 64, creates a character string candidate to be subjected to language analysis and stores it in the character string candidate memory 65, language. Each control such as control of the analysis device 66 and storage of the processing result in the character string output text memory 67 is performed.

文書入力制御装置６２はマイクロプロセツサにより構成
され、認識文字候補メモリ６４、文字列候補メモリ６
５、文字列出力用テキストメモリ６７はマイクロプロセ
ツサのメモリであり、OCR６３や言語解析装置６６はそ
れぞれマイクロプロセツサの付加装置として構成され
る。The document input control device 62 is composed of a microprocessor, and has a recognized character candidate memory 64 and a character string candidate memory 6.
5. The character string output text memory 67 is a memory of the microprocessor, and the OCR 63 and the language analysis device 66 are each configured as an additional device of the microprocessor.

本実施例によれば、ＯＣＲで読みとつた文字のうち、一
意に決定できない不読文字を、言語としての尤度という
別の観点から評価することにより決定することが可能と
なり、ワードプロセツサの初期入力用にＯＣＲを使用す
ることが可能となる。According to the present embodiment, among the characters read by OCR, it is possible to determine an unread character that cannot be uniquely determined by evaluating it from another viewpoint of likelihood as a language. It is possible to use OCR for initial input.

ここで、漢字ＯＣＲの代りに音声認識装置を用いた場合
でも同様の効果が得られることは言うまでもなく、音声
入力によるワードプロセツサの実現も可能となる。Here, it goes without saying that the same effect can be obtained even when a voice recognition device is used instead of the Kanji OCR, and a word processor can be realized by voice input.

〔The invention's effect〕

以上述べたように、本発明によれば、まず言語解析すべ
き文字列候補を必要最小限に絞ることにより、処理時間
の減少を図ることができる。つぎに、言語解析によつて
も評価の良くない文字列については、認識類似度第１位
の文字候補を選択出力するため、隣接の誤認識文字につ
られて誤つた文字候補が文として成り立つ場合の誤判定
の弊害を無くすることができ、言語解析の適用による文
字認識後処理の効果を高めることができるので、ＯＣＲ
や音声認識を一般の文章入力に利用することが可能とな
る。As described above, according to the present invention, the processing time can be reduced by first narrowing down the character string candidates to be subjected to language analysis to the necessary minimum. Next, in the case of a character string that is not well evaluated by linguistic analysis, the character candidate with the first highest degree of recognition similarity is selected and output. Since it is possible to eliminate the adverse effect of erroneous determination of, and to enhance the effect of character recognition post-processing by applying language analysis, OCR
And voice recognition can be used for general text input.

[Brief description of drawings]

第１図は本発明の概要を示す図、第２図は本発明の一実
施例、第３図はそのフローチヤート、第４図は認識文字
候補、第５図は組合せ文字候補、第６図は文字列候補と
その日本語尤度、第７図は組合せ候補の他の例、第８図
は第２図の構成において制御機能を追加した場合のフロ
ーチヤート、第９図は他の実施例を示す図である。１１……文字認識部、１２……文字列区切り判定部、１
３……類似度閾値判定部、１４……文字列候補作成部、
１５……日本文解析部、１６……文字列候補選択部。FIG. 1 is a diagram showing an outline of the present invention, FIG. 2 is an embodiment of the present invention, FIG. 3 is its flow chart, FIG. 4 is a recognition character candidate, FIG. 5 is a combination character candidate, and FIG. Is a character string candidate and its Japanese likelihood, FIG. 7 is another example of a combination candidate, FIG. 8 is a flow chart when a control function is added in the configuration of FIG. 2, and FIG. 9 is another embodiment. FIG. 11 ... Character recognition unit, 12 ... Character string delimitation determination unit, 1
3 ... Similarity threshold determination unit, 14 ... Character string candidate creation unit,
15: Japanese sentence analysis unit, 16: Character string candidate selection unit.

Claims

[Claims]

1. A recognition unit for recognizing a character or a voice and outputting a recognition result as a character candidate composed of at least a set of a character code, a similarity and a rank of the similarity, and the character candidate as a processing unit. A character string delimiter determining unit that delimits the character string into a character string, a character string candidate creating unit that creates a plurality of character string candidates by combining each character candidate in the character string serving as the processing unit, and a language for the plurality of character string candidates. In a character input device provided with a language analysis unit that outputs a language likelihood indicating the validity as, among the language likelihoods obtained for the plurality of character string candidates,
When the maximum linguistic likelihood exceeds a predetermined linguistic likelihood threshold, a character string candidate corresponding to the maximum linguistic likelihood is output, and when the maximum linguistic likelihood is less than or equal to the predetermined linguistic likelihood threshold. In the character input device, there is provided a character string candidate selecting means for outputting a character string candidate composed of the character candidates having the first rank of similarity.

2. The character string delimiter according to claim 1, wherein the character string which is the processing unit and is divided by the character string delimiter determination unit has a predetermined similarity among the character candidates having the first rank. When there is a character candidate that is equal to or less than the cutoff similarity threshold, the character string candidate creating unit determines whether the first rank of the similarity rank is the first.
A character input device characterized in that a character string composed of character candidates for positions is created and directly output.