JP2903779B2

JP2903779B2 - Character string recognition method and apparatus

Info

Publication number: JP2903779B2
Application number: JP3155039A
Authority: JP
Inventors: 弥生小林
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1991-06-27
Filing date: 1991-06-27
Publication date: 1999-06-14
Anticipated expiration: 2014-06-14
Also published as: JPH056464A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は文字列認識方法及びその
装置に関し、特に光学的文字読み取り装置（ＯＣＲ）に
おける文字列認識方法及びその装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for character string recognition, and more particularly to a method and apparatus for character string recognition in an optical character reader (OCR).

【０００２】[0002]

【従来の技術】文字列認識方法及びその装置において、
光学的文字読み取り装置の帳票は、文字記入枠が設けら
れ、筆記者がその枠に１文字ずつを記入する方法がとら
れていた。これに対して、罫線のような緩い書式制約の
元で筆記された文字を認識するための技術が文字列認識
である。2. Description of the Related Art In a character string recognition method and apparatus,
The form of the optical character reading device is provided with a character entry frame, and a writer writes a character in the frame one by one. On the other hand, a technique for recognizing a character written under loose format constraints such as ruled lines is character string recognition.

【０００３】文字列認識においては、文字切り出しの問
題が１つの重要な課題である。特に日本語のように一文
字が複数の連結したパタンに分かれる場合があり、ま
た、手書きのように隣接する文字パタンが接触する場合
には、必ずしもパタン間の余白やパタンの形状だけから
切り出すことは困難である。そこで、文字候補となるパ
タンをまず認識し、その結果を用いて文字列として最適
なパタンの組み合わせを探索する方法が知られている。
この方法については、例えば、文献「候補文字ラティス
法による枠無し筆記文字列のオンライン認識」、電子情
報通信学会（Ｄ）Ｊ６８−Ｄ，Ｎｏ．４、１９８５、７
６５〜７７２頁）に記載されている。In character string recognition, the problem of character segmentation is one important problem. In particular, one character may be divided into multiple connected patterns as in Japanese, and when adjacent character patterns touch each other as in handwriting, it is not always necessary to cut out only from the space between patterns and the shape of the pattern. Have difficulty. Therefore, a method of first recognizing a pattern serving as a character candidate and using the result to search for an optimal combination of patterns as a character string is known.
This method is described in, for example, the document “Online Recognition of Handwritten Character Strings without Frames by Candidate Character Lattice Method”, IEICE (D) J68-D, No. 4, 1985, 7
65-772).

【０００４】しかしながら、漢字には「明」のように偏
と旁に分かれて、それぞれが文字と認識できる文字もあ
り、このような文字が含まれていると上記のような方法
では正しく認識することは困難である。このような場合
にも精度良く認識するために、言語的な知識を利用した
文字列認識方法が知られている。例えば文献「言語情報
を利用した手書き文字列からの文字切り出しと認識」、
電子情報通信学会論文誌（Ｄ）、Ｊ６９−Ｄ，Ｎｏ．
９、１９８６、１２９２−１３０１頁に記載されてい
る。[0004] However, some Chinese characters, such as "Akira", are separated from each other and can be recognized as characters. If such characters are included, the above method correctly recognizes them. It is difficult. A character string recognition method using linguistic knowledge is known in order to perform recognition with high accuracy even in such a case. For example, the document "Character segmentation and recognition from handwritten character strings using linguistic information",
IEICE Transactions (D), J69-D, No.
9, 1986, pp.1292-1301.

【０００５】[0005]

【発明が解決しようとする課題】上述した従来の文字列
認識方法及びその装置では、文字候補を組み合わせてで
きる文字列候補の中に、正しい認識結果が含まれていれ
ば、単語照合によりこれを認識結果として判断すること
は可能であるが、含まれない場合には、正しい認識を行
うことは困難である。In the above-described conventional character string recognition method and apparatus, if a character string candidate formed by combining character candidates includes a correct recognition result, the character string candidate is recognized by word matching. Although it is possible to judge as a recognition result, if it is not included, it is difficult to perform correct recognition.

【０００６】また、図２のように文字パタンが誤認識さ
れている場合には、上述のように認識結果だけを利用し
て文字列候補を選出すると、正しい認識結果が数の限定
された文字列候補の中に含まれないことが多くなり、単
語照合によっても認識誤りを修正できなくなる。In the case where a character pattern is erroneously recognized as shown in FIG. 2, if a character string candidate is selected using only the recognition result as described above, the correct recognition result will be a character having a limited number of characters. In many cases, it is not included in the column candidates, and the recognition error cannot be corrected even by word matching.

【０００７】ところで、認識対象が住所のように予め限
定されていて、２つの文字が連続して発生する場合は非
常に限定される。例えば、ある地域の住所に限定した場
合に、「東」の次に「京」が書かれることはあるが、
「野」が書かれることはないといった知識が得られる。
従って、文字列候補を選出するときに、単に個別のパタ
ンの文字らしさの評価値だけでなく、隣接する文字との
整合性を文字遷移情報という上位の知識を用いて評価す
ることにより、たとえ、個別のパタンが誤認識され、下
位の候補として認識された場合にも、文字遷移情報によ
り、文字列候補を有効に選ぶことが可能になり、単語照
合が効果的に利用できる。[0007] Incidentally, the case where the recognition target is limited in advance like an address and two characters occur consecutively is very limited. For example, if the address is limited to a certain area, "K" may be written after "East",
The knowledge that "field" is never written is obtained.
Therefore, when selecting a character string candidate, by evaluating not only the evaluation value of the character-likeness of an individual pattern but also the consistency with adjacent characters using the higher-level knowledge of character transition information, Even when an individual pattern is erroneously recognized and is recognized as a lower candidate, character transition information can be used to effectively select a character string candidate, and word matching can be used effectively.

【０００８】本発明の目的は、認識対象が予め限定され
ている場合に、対象に関する知識として、文字間の遷移
情報を予め文字遷移評価値として記憶し、これを用いる
ことにより、誤認識された場合にもこれを修正すること
を可能にする。[0008] An object of the present invention is that when recognition targets are limited in advance, transition information between characters is stored in advance as character transition evaluation values as knowledge about the objects, and this is used to make misrecognition. In this case it is possible to correct this.

【０００９】[0009]

【課題を解決するための手段】本発明の文字列認識方法
は、光学的に帳票から読み取った文字列画像から候補文
字パタンを抽出し、前記候補文字パタンと文字認識辞書
の標準パタンとを照合する認識処理の結果により文字評
価値を算出し、前記候補文字パタンにより前記文字列画
像の文字数を仮定し、前記文字評価値及び予め単語辞書
から算出した文字間の遷移情報を表す文字遷移評価値に
より前記文字列画像から前記文字数を切り出して文字コ
ード列として出力し、前記文字コード列と前記単語辞書
に格納してある単語とを照合し、前記文字コード列の全
ての文字が照合された場合には前記文字コード列を前記
文字列画像の認識結果として出力し、照合しない場合に
は可能な文字数の範囲内で前記文字数を変更して前記文
字切り出し処理及び前記単語照合処理を反復し、可能な
文字数の範囲の全てに対して照合されなかった場合に
は、可能な文字数の範囲内の最適な照合結果を前記文字
列画像の認識結果として出力する。According to the character string recognition method of the present invention, a candidate character pattern is extracted from a character string image optically read from a form, and the candidate character pattern is compared with a standard pattern of a character recognition dictionary. The character evaluation value is calculated based on the result of the recognition processing to be performed, the number of characters of the character string image is assumed by the candidate character pattern, the character evaluation value and a character transition evaluation value representing transition information between characters calculated from a word dictionary in advance. When the number of characters is cut out from the character string image and output as a character code string, the character code string is compared with a word stored in the word dictionary, and all the characters in the character code string are matched. Output the character code string as a recognition result of the character string image, and when collation is not performed, change the number of characters within the range of the number of possible characters and perform the character cutout processing. It said word collation processing repeated, if it is not matched against the entire range of number of characters that can outputs a best matching result of the range of the number of characters allowed in the recognition result of the character string image.

【００１０】本発明の文字列認識装置は、光学的な走査
で読み取った文字列画像を格納する文字列画像記憶手段
と、前記文字列画像記憶手段から読み出された文字列画
像により文字候補となる候補文字パタンを抽出し格納す
る候補文字パタン抽出手段と、標準パタンを記憶する文
字認識辞書記憶手段と、前記候補文字パタン抽出手段か
ら読み出された候補文字パタンと前記文字認識辞書記憶
手段から読み出された標準パタンとを照合する認識処理
により文字コードと認識結果とから得られる文字評価値
を格納する文字認識手段と、前記候補文字パタン抽出手
段から読み出された候補文字パタンにより前記文字列画
像の文字数を仮定し、認識結果検定部から文字数が入力
された場合には可能な範囲内で文字数を変更する文字数
設定手段と、前記文字数設定手段からの文字数と前記文
字認識手段から読み出された候補文字パタンの文字評価
値と文字遷移評価値記憶手段から読み出された文字間の
遷移情報を表す文字遷移評価値とにより複数の候補文字
パタンから構成される文字列候補の内最適なものを探索
し、前記文字列候補と認識結果である文字コード列とを
格納する文字切り出し手段と、前記文字切り出し手段か
らの文字コード列と、単語辞書記憶手段からの単語とを
照合し、前記文字コード列の全ての文字が照合された場
合には前記文字コード列を文字列認識結果出力手段に出
力し、照合しない場合には照合文字列を記憶するととも
に、文字数を前記文字数設定手段に出力し、可能な文字
数の全てに対して照合されなかった場合には、格納され
ている照合文字列の内で最適な照合文字列を前記文字列
認識結果出力手段に出力する認識結果検定部と、前記認
識結果検定部からの文字照合結果を出力する文字列認識
結果出力手段とを有する。A character string recognizing device according to the present invention comprises a character string image storage means for storing a character string image read by optical scanning, and a character candidate stored in the character string image read from the character string image storage means. Candidate character pattern extraction means for extracting and storing candidate character patterns, character recognition dictionary storage means for storing standard patterns, and candidate character patterns read from the candidate character pattern extraction means and the character recognition dictionary storage means. Character recognition means for storing a character evaluation value obtained from a character code and a recognition result by a recognition process for collating with the read standard pattern; and the character recognition means for storing the character by the candidate character pattern read from the candidate character pattern extraction means. Assuming the number of characters in the column image, when the number of characters is input from the recognition result test unit, the number of characters setting means for changing the number of characters within a possible range; The number of characters from the character number setting unit, the character evaluation value of the candidate character pattern read from the character recognition unit, and the character transition evaluation value representing the transition information between characters read from the character transition evaluation value storage unit, A character cutout unit that searches for an optimum one of character string candidates composed of candidate character patterns and stores the character string candidates and a character code string as a recognition result, and a character code string from the character cutout unit. Comparing the word with the word from the word dictionary storage means, outputting the character code string to the character string recognition result output means when all the characters of the character code string are matched, In addition to storing the column, the number of characters is output to the number-of-characters setting means, and if collation has not been performed for all of the possible number of characters, the optimal collation is performed among the stored collation character strings. It has a recognition result verification unit that outputs a string on the character string recognition result output means, and a character string recognition result output means for outputting a character collation result from said recognition result verification unit.

【００１１】[0011]

【作用】本発明の文字列認識方法について説明する。以
下の説明では、縦書き文字列画像を例として用いるが、
横書きの文字列についても同様に適用できる。図３
（ａ）は手書き文字列画像の例である。入力された文字
列画像に対して、文字候補となるパタン（候補文字パタ
ン）を抽出する。図３（ｂ）はその抽出結果を表したラ
ティスである。ラティスの各ノードが文字列画像の切り
出し候補位置に対応し、２つのノードを結ぶリンクが、
対応する切り出し位置に挟まれたパタンで、候補文字パ
タンを表す。The character string recognition method according to the present invention will be described. In the following description, a vertically written character string image is used as an example.
The same can be applied to a horizontally written character string. FIG.
(A) is an example of a handwritten character string image. From the input character string image, a pattern that becomes a character candidate (candidate character pattern) is extracted. FIG. 3B is a lattice showing the extraction result. Each node of the lattice corresponds to the cutout candidate position of the character string image, and a link connecting the two nodes is
A pattern sandwiched between corresponding cutout positions indicates a candidate character pattern.

【００１２】各候補文字パタンと文字認識辞書の標準パ
タンとを照合し、候補文字パタンの認識を行い、これに
基づいて候補文字パタンの文字である信頼性を表す文字
評価値を算出する。文字評価値の算出方法として、例え
ば特願平１−３３４３４７「文字列認識方法及びその装
置」が知られている。これにより、候補文字パタンが文
字らしいほど高い評価値を得ることが可能である。Each candidate character pattern is compared with a standard pattern in a character recognition dictionary to recognize the candidate character pattern, and a character evaluation value representing the reliability of the character of the candidate character pattern is calculated based on the recognition result. As a method of calculating a character evaluation value, for example, Japanese Patent Application No. 1-334347, "Character String Recognition Method and Apparatus" is known. As a result, it is possible to obtain a higher evaluation value as the candidate character pattern looks like a character.

【００１３】候補文字パタンを組み合わせて構成される
文字列候補に対して、単語照合を行い、文字列認識を行
うが、一般にはすべての組み合わせを照合すると処理量
が膨大となる。そのために、ある文字の次に別の文字が
発生することがあるかどうかを文字遷移評価値として、
予め単語辞書があるいは学習データから算出しておく。
文字遷移評価値としては、例えば文字Ｃｉの次に文字Ｃ
ｊがくることがある場合には１、そうでない場合には０
として、２値で表す方法、あるいはカテゴリの出現確率
という形で算出する方法等がある。この文字遷移評価値
と文字評価値を利用して文字列候補を絞ることによっ
て、たとえ文字パタンが図２のように誤認識されて文字
評価値が低く、下位の候補として認識されていても、文
字遷移評価値により文字列候補の中に選出することが可
能となる。Word matching is performed on character string candidates formed by combining candidate character patterns, and character string recognition is performed. In general, however, if all combinations are compared, the processing amount becomes enormous. Therefore, whether or not another character may occur after one character is used as a character transition evaluation value.
It is calculated in advance from a word dictionary or from learning data.
The character transition evaluation value is, for example, the character C followed by the character C
1 if j can come, 0 otherwise
As a method, there is a method of expressing in binary form, a method of calculating in the form of a category appearance probability, or the like. By narrowing down the character string candidates using the character transition evaluation value and the character evaluation value, even if the character pattern is erroneously recognized as shown in FIG. 2 and the character evaluation value is low, the character pattern is recognized as a lower candidate. It is possible to select among character string candidates by the character transition evaluation value.

【００１４】具体的には、文字数の違いによる文字列候
補の評価誤差を生じないために、文字数の等しい文字列
候補毎に、候補を絞る。このときに、文字列候補に含ま
れる候補文字パタンの文字評価値及び文字遷移評価値の
和を文字列評価値として、文字列評価値の最大となるも
のを探索し、これを候補とする方法を採る。これによっ
て、最終的には文字列候補として互いに文字数が異なる
ものが選出されるが、この中から単語照合によって、最
も良く照合した文字列を認識結果として出力する処理に
より、たとえ文字パタンが誤認識されても文字遷移情報
を利用することにより、これを修正し、正しく認識する
ことが可能となる。More specifically, candidates are narrowed down for each character string candidate having the same number of characters in order to prevent an evaluation error of the character string candidates due to the difference in the number of characters. At this time, the sum of the character evaluation value and the character transition evaluation value of the candidate character pattern included in the character string candidate is used as the character string evaluation value, and the search for the largest character string evaluation value is performed. Take. As a result, character strings having different numbers of characters are finally selected as character string candidates. However, by performing a process of outputting the best-matched character string as a recognition result by word matching, even if a character pattern is erroneously recognized. However, by using the character transition information, this can be corrected and correctly recognized.

【００１５】[0015]

【実施例】次に本発明について図面を参照して説明す
る。図１は本発明の一実施例を示すブロック図である。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the present invention.

【００１６】文字画像記憶部１は、２値化された文字列
画像を格納する。候補文字パタン抽出部２は、文字列画
像記憶部１から読み出した文字列画像信号１１により文
字候補となるパタンを抽出して、これをラティスとして
記憶装置に格納する。文字候補となるパタンの抽出方法
としては、例えば以下のようにして実現できる。まず、
文字列画像から図形としてのまとまりの最小単位のパタ
ンを抽出するために、文字列画像に対してその文字列方
向と垂直な方向に黒画素を計数して得られる投影関数を
ｆ（ｘ）とする。関数ｆ（ｘ）に対して式（１）で表さ
れるガウシァン関数ｇ（ｘ）を用いて、式（２）の計算
を施し、平滑化した投影関数ｆ′（ｘ）を算出する。The character image storage unit 1 stores a binarized character string image. The candidate character pattern extraction unit 2 extracts a character candidate pattern from the character string image signal 11 read from the character string image storage unit 1 and stores it as a lattice in a storage device. A method of extracting a pattern serving as a character candidate can be realized, for example, as follows. First,
In order to extract a pattern of a minimum unit of a set as a figure from a character string image, a projection function obtained by counting black pixels in the character string image in a direction perpendicular to the character string direction is represented by f (x). I do. Using the Gaussian function g (x) expressed by the equation (1) for the function f (x), the equation (2) is calculated to calculate a smoothed projection function f ′ (x).

【００１７】 [0017]

【００１８】この投影関数ｆ′（ｘ）の極小値を与える
全てのｘ座標を算出し、この点を切り出し候補位置Ｎｉ
（ｉ＝０，…，Ｋ）とし、この点を通り文字列方向と垂
直な方向の直線を切り出し候補線とし、この隣接する２
つの切り出し候補線に挟まれたパタンを処理の最小パタ
ンとする。文字列画像から平滑化した投影関数を算出
し、その極小点を算出する。また、隣接する文字が接触
したり、重なったりする場合にも、その分離点は投影関
数ｆ′（ｘ）の極小点となることが予想される。従っ
て、このように画像を細かく分割しておくことにより、
たとえ文字と文字が接触したり、重なったりしていて
も、これを分離しておくことが可能になる。この切り出
し候補位置を記憶手段に格納しておく。All the x-coordinates which give the minimum value of the projection function f '(x) are calculated, and this point is cut out to the candidate position Ni.
(I = 0,..., K), and a straight line passing through this point and perpendicular to the character string direction is cut out as a candidate line.
A pattern sandwiched between two extraction candidate lines is set as a minimum pattern for processing. A smoothed projection function is calculated from the character string image, and its minimum point is calculated. Also, when adjacent characters touch or overlap, the separation point is expected to be the minimum point of the projection function f ′ (x). Therefore, by dividing the image finely in this way,
Even if characters touch or overlap with each other, it is possible to separate them. This cutout candidate position is stored in the storage means.

【００１９】次に異なる２つの切り出し候補線に挟まれ
たパタンが１文字であるか否かを判断し、１文字と判断
されたパタンを候補文字パタンとして記憶手段に格納す
る。ここで、１文字である可能性のあるパタンは、判定
基準（１）及び（２）により判定する。Next, it is determined whether or not the pattern sandwiched between two different cutout candidate lines is one character, and the pattern determined to be one character is stored in the storage means as a candidate character pattern. Here, the pattern that may be one character is determined based on the determination criteria (1) and (2).

【００２０】判定基準（１）はパタンの外接矩形の縦横
比（ｗ／ｈ）が、０．５≦ｒ≦２．５ならばそのパタン
は候補文字パタンとする（図４）。判定基準（２）はパ
タンの内部の余白（ｓ）の文字幅（ｗ）に対する比が
０．５以下ならば、候補文字パタンとする（図４）。The criterion (1) is that if the aspect ratio (w / h) of the circumscribed rectangle of the pattern is 0.5 ≦ r ≦ 2.5, the pattern is a candidate character pattern (FIG. 4). For the criterion (2), if the ratio of the margin (s) inside the pattern to the character width (w) is 0.5 or less, it is determined as a candidate character pattern (FIG. 4).

【００２１】文字認識部４は、候補文字パタン抽出部２
から信号１２として各候補文字パタンを読み込み、文字
認識辞書記憶部３から信号１３として標準パタンを読み
込み、候補文字パタンの認識を行い、認識結果の候補そ
れぞれに対する信頼性を示す文字評価値を算出し（図
５）、候補文字パタンの文字コードと文字評価値を記憶
手段に記憶する。候補文字パタンの各認識候補に対する
信頼性が０から１までの値で表され、１に近いほど信頼
性が高いと評価されることになる。文字数設定部５は候
補文字パタン抽出部２から全ての候補文字パタンを信号
１４として読み込み、候補文字パタンを基に、入力され
た文字列画像に対して可能な文字数を設定し、この文字
数を信号１５として出力する。The character recognition unit 4 includes a candidate character pattern extraction unit 2
, Each candidate character pattern is read as a signal 12, and a standard pattern is read as a signal 13 from the character recognition dictionary storage unit 3, the candidate character pattern is recognized, and a character evaluation value indicating the reliability of each candidate of the recognition result is calculated. (FIG. 5), the character code of the candidate character pattern and the character evaluation value are stored in the storage means. The reliability of the candidate character pattern for each recognition candidate is represented by a value from 0 to 1, and the closer to 1, the higher the reliability is evaluated. The character number setting unit 5 reads all the candidate character patterns from the candidate character pattern extraction unit 2 as a signal 14, sets the number of possible characters for the input character string image based on the candidate character patterns, and signals this number of characters as a signal. 15 is output.

【００２２】次に図６を参照して、具体的な処理動作を
説明する。まず、ステップ６１ではレジスタに文字数ｘ
として０をセットする。ステップ６２では、ｘ＝ｘ＋１
と文字数をインクリメントして、ステップ６３で文字数
ｘが最大値Ｋ以下であるか否かを判断する。ステップ６
３で、ｘが最大値Ｋ以下の場合はステップ６４におい
て、入力文字列画像がｘ個の候補文字パタンから構成さ
れるか否かを判断する。これは、切り出し候補位置Ｎｉ
（ｉ＝０，…，Ｋ）をノードとし、候補文字パタンｌ
（ｉ，ｊ）をリンクとする２端子有向グラフの始点から
終点に至るパスの内、ｘ個のリンクを通るパスが存在す
るかどうかを調べることにより容易に実現できる。ステ
ップ６４において肯定された場合は、文字数ｘを信号１
５として出力する。否定された場合は、ステップ６２に
戻る。ステップ６３で文字数ｘが最大値ｋを越えた場合
は、信号１５を通して処理の終了を伝える。Next, a specific processing operation will be described with reference to FIG. First, in step 61, the number of characters x is stored in the register.
Is set to 0. In step 62, x = x + 1
And the number of characters are incremented, and it is determined in step 63 whether or not the number of characters x is equal to or less than the maximum value K. Step 6
If x is equal to or smaller than the maximum value K in step 3, it is determined in step 64 whether or not the input character string image is composed of x candidate character patterns. This is the cut candidate position Ni
(I = 0,..., K) as nodes, and a candidate character pattern l
This can be easily realized by checking whether or not there is a path passing through x links among paths from the start point to the end point of the two-terminal directed graph having (i, j) as a link. If affirmative in step 64, the number of characters x
Output as 5. If not, the process returns to step 62. If the number x of characters exceeds the maximum value k in step 63, the end of the processing is notified through a signal 15.

【００２３】文字遷往評価値記憶部６は、カテゴリＣｉ
の次にカテゴリＣｊが発生するかどうかを予め単語辞書
から文字遷移評価値ｔ（Ｃｉ，Ｃｊ）として算出し、こ
れを通常の記憶装置に格納しておく。文字遷移評価値ｔ
としては、例えばカテゴリＣｉの次のカテゴリＣｊが発
生することがある場合には１で、無い場合には０の２値
で表す。The character history evaluation value storage unit 6 stores the category Ci
Is calculated in advance from the word dictionary as a character transition evaluation value t (Ci, Cj), and this is stored in a normal storage device. Character transition evaluation value t
Is represented by a binary value, for example, 1 when a category Cj next to the category Ci may occur, and 0 when there is no category Cj.

【００２４】文字切り出し部７は文字数設定部５から信
号１５として文字数ｘを読み込み、文字認識部４から信
号１６として文字評価値を読み込み、文字遷移評価値記
憶部６から信号１７として文字遷移評価値を読み込み、
ｘ個の候補文字パタンから構成される文字列候補の内、
最適なものを文字評価値と文字遷移評価値とを使って決
定し、その切り出し位置及び文字コード列を記憶する。The character cutout unit 7 reads the number of characters x as the signal 15 from the number of characters setting unit 5, reads the character evaluation value as the signal 16 from the character recognition unit 4, and reads the character transition evaluation value as the signal 17 from the character transition evaluation value storage unit 6. And load
Among the character string candidates composed of x candidate character patterns,
The optimum one is determined using the character evaluation value and the character transition evaluation value, and the cutout position and the character code string are stored.

【００２５】ｘ個の文字からなる最適な文字列の決定
は、以下のようにして実現できる。ｓ（ｔ）（ｔ＝０，
…，ｘ）を第ｔ番目文字の終了位置（切り出し位置）と
する。ｓ（０）＝０，ｓ（ｘ）＝Ｋである。また、ｒ
（ｔ）（ｔ＝１，…，ｘ）を第ｔ番目の候補文字パタン
の認識結果の順位とする。文字切り出し部７では、文字
評価値と文字遷移評価値により算出される式（３）を最
大にする切り出し位置ｓ（ｔ）と認識結果の順位ｒ
（ｔ）を、動的計画法を用いて決定する。但し、切り出
し候補位置ＮｉとＮｊに挟まれた候補文字パタンｌ
（ｉ，ｊ）の認識結果として、第１位から第Ｒ位までの
文字コードをｃ（ｉ，ｊ，ｋ）（ｋ＝１，…，Ｒ）及び
その文字評価値ｗ（ｉ，ｊ，ｋ）と表す。また、文字Ｃ
ｉの次に文字Ｃｊが発生する時の文字遷移評価値をｔ
（Ｃｉ，Ｃｊ）と表し、文字Ｃｉが文字列の先頭に位置
するかどうかを表す評価値をｔ′（Ｃｉ）で表す。パラ
メータαは、文字遷移評価値の文字評価値に対する影響
を制御するためのパラメータである。これにより、各文
字のパタンの文字である信頼性が高く、かつ直前の文字
に対する出現の可能性の大きい文字が最適な文字列とし
て決定される。Determination of an optimal character string consisting of x characters can be realized as follows. s (t) (t = 0,
..., x) is the end position (cutout position) of the t-th character. s (0) = 0 and s (x) = K. Also, r
(T) (t = 1,..., X) is the rank of the recognition result of the t-th candidate character pattern. In the character cutout unit 7, a cutout position s (t) that maximizes the expression (3) calculated from the character evaluation value and the character transition evaluation value, and a rank r of the recognition result.
(T) is determined using dynamic programming. However, the candidate character pattern 1 sandwiched between the cutout candidate positions Ni and Nj
As the recognition result of (i, j), the character codes from the first place to the R-th place are c (i, j, k) (k = 1,..., R) and the character evaluation value w (i, j, k). Also, the letter C
Character transition evaluation value when character Cj occurs after i is t
(Ci, Cj), and an evaluation value indicating whether the character Ci is located at the head of the character string is represented by t ′ (Ci). The parameter α is a parameter for controlling the influence of the character transition evaluation value on the character evaluation value. As a result, a character having high reliability as a character of the pattern of each character and having a high possibility of appearing with respect to the immediately preceding character is determined as an optimal character string.

【００２６】 [0026]

【００２７】ここで、最適な文字列の探索は式（４）と
（５）を用いることにより、実現できる。但し、φｔ
（ｎ_t-1，ｎ_t，ｒ_t-1，ｒ_t）はｔ個の候補文字パタ
ンからなる文字列候補の内、ｔ−１番目のパタンの終了
位置がｎ_t-1で、第ｒ_t-1位の認識結果を採用し、ｔ番
目のパタンの終了位置がｎ_tで、第ｒ_t位の認識結果を
採用する文字列候補の評価値の中で最大の評価値を表
す。また、式（４）の中で、ｔ＝２の場合には式（６）
により算出する。Here, the search for the optimum character string can be realized by using equations (4) and (5). Where φt
(N _t-1, n _t, r _t-1, r _t) of the string candidates consisting of t number of candidate character patterns, the end position of the t-1 th pattern is at n _t-1, the r adopted recognition results of _t-1 position, at t-th pattern of end position n _t, represents the largest evaluation value among the evaluation values of the character string candidates to adopt a recognition result of the r _t position. Also, in equation (4), when t = 2, equation (6)
It is calculated by:

【００２８】単語辞書記憶部８には、単語としての文字
コード列を記憶手段によって記憶しておく。認識結果検
定部９は文字切り出し部７から信号１８として文字コー
ド列を読み込み、単語辞書記憶部８に格納してある各単
語の文字コード列を信号１９として読み込み、文字コー
ド列の意味が成立するかどうかを検定する。具体的には
以下のような手段により実現できる。The word dictionary storage unit 8 stores a character code string as a word by a storage unit. The recognition result test unit 9 reads the character code string as the signal 18 from the character cutout unit 7, reads the character code string of each word stored in the word dictionary storage unit 8 as the signal 19, and the meaning of the character code string is established. Test whether Specifically, it can be realized by the following means.

【００２９】文字コード列をｃ₀，ｃ₁，…ｃ_xとす
る。文字コード列の先頭の文字コードｃ₀から単語辞書
のすべての単語と照合を行う。１文字でも照合された単
語があれば、その単語及び先頭の文字位置と最後の文字
位置および一致した文字数を記憶手段により記憶する。
例えば、図７（ａ）では単語「東京都」に対して、一致
文字数３、先頭文字位置Ｃ₀，末端文字位置Ｃ₂を記憶
する。次に照合する文字位置を１つずつずらして同様に
単語辞書と照合を行う。このようにして、文字コード列
の各文字位置を先頭として可能な単語を全て記憶する。
その結果を示したのが図７（ｂ）である。この照合され
た単語候補を組み合わせてできる文字列の中から、最も
一致文字数の多い文字列ｄ₀ｄ₁…ｄ_yを探索する。The character code string is assumed to be c ₀ , c ₁ ,... C _x . Do all of the words and collation of the word dictionary of the character code string beginning from the character code c _0. If there is at least one matched word, the word, the first character position, the last character position, and the number of matched characters are stored by the storage means.
For example, in FIG. 7A, the number of matching characters, the leading character position C ₀ , and the terminal character position C ₂ are stored for the word “Tokyo”. Next, the position of the character to be collated is shifted one by one, and collation with the word dictionary is similarly performed. In this way, all possible words are stored starting from each character position in the character code string.
FIG. 7B shows the result. From the character string that can be a combination of the collated word candidate, searching for the best matching Longer strings d ₀ d ₁ ... d _y.

【００３０】文字コード列ｃ₀，ｃ₁，…ｃ_xの全ての
文字が照合結果の文字列ｄ₀ｄ₁…ｄ_yと一致した場合
は、この文字コード列を文字列認識結果出力部１０に信
号２１として出力し、処理を終了する。そうでない場合
は、文字数設定部５に信号２０として文字数ｘを出力す
る。文字数設定部５では、ステップ６２で文字数ｘを入
力し、前述と同様の処理を行う。The character code string c _0, c _1, ... every character collation result string c _x d ₀ d ₁ ... If a match with a d _y, the character string recognition result output unit 10 to the character code string Is output as a signal 21 and the process is terminated. Otherwise, the number of characters x is output as the signal 20 to the number-of-characters setting section 5. The number-of-characters setting section 5 inputs the number of characters x in step 62, and performs the same processing as described above.

【００３１】また、文字数設定部５から処理終了の信号
が送られているときには、各文字数ｘに対して得られて
いる照合結果の文字コード列ｄ₀ ^xｄ₁ ^x…ｄ_y ^xの最
適なものを探索する。例えば、照合結果の文字コード列
ｄ₀ ^xｄ₁ ^x…ｄ_y ^xから、入力文字コード列ｃ₀ｃ₁
…ｃ_xとの一致文字数、照合単語数および照合結果の文
字数ｙを計数し、これらを基に最良の照合結果を以下の
ような基準を用いて探索する。基準１は最も一致文字数
の多い文字列コード列とし、基準２は基準１を満たす文
字コード列が複数ある場合には、それらの内で照合単語
数の最小の文字コード列とし、基準３は基準２を満たす
文字コード列が複数ある場合には、それらの内で文字数
ｙの最も多い文字コード列とする。When a processing end signal is sent from the number-of-characters setting section 5, the optimum character code string d ₀ ^x d ₁ ^x ... _Dy ^x of the collation result obtained for each number of characters x is obtained. Explore things. For example, the result of matching character code string _{^{_{^{d 0 x d 1 x ... d}}}} y x, the input character code string c ₀ c ₁
... C The number of characters matched with _x , the number of collation words, and the number y of characters of collation results are counted, and based on these, the best collation result is searched for using the following criteria. The criterion 1 is a character code string having the largest number of matching characters, the criterion 2 is a character code string having the smallest number of collation words among a plurality of character code strings satisfying the criterion 1, and the criterion 3 is a criterion When there are a plurality of character code strings satisfying 2, the character code string having the largest number of characters y among them is determined.

【００３２】文字列認識結果出力部１０は文字コード列
の先頭から認識結果を出力する。The character string recognition result output unit 10 outputs a recognition result from the beginning of the character code string.

【００３３】なお、本実施例では、縦書きの文字列を使
って説明したが、横書きの文字列にも本実施例と同様に
適用できる。また、候補文字パタンの抽出方法も、本実
施例に限らず、接触した文字でも分離することが可能な
方法ならば、支障なく適用できる。また、住所などの読
み取り対象が限定され、知識の階層化が可能な対象に対
しては、単語辞書として木（ツリー）構造の表現を用い
て単語を記憶しておくことにより、単語照合時の処理を
削減することが可能であると共に、より高精度の認識が
可能になる。Although the present embodiment has been described using a vertically written character string, the present invention can be applied to a horizontally written character string in the same manner as in the present embodiment. Also, the method of extracting the candidate character pattern is not limited to the present embodiment, and any method that can separate the touching character can be applied without any problem. In addition, for objects to be read, such as addresses, which can be hierarchized in knowledge, words are stored by using a tree (tree) structure expression as a word dictionary, so that words at the time of word matching can be stored. Processing can be reduced, and recognition with higher accuracy can be performed.

【００３４】また、文字遷移評価値についても本実施例
に限定されることはなく、例えば、式（７）で算出され
る評価値を用いても本実施例と同様の結果が得られる。The character transition evaluation value is not limited to the present embodiment. For example, the same result as in the present embodiment can be obtained by using the evaluation value calculated by the equation (7).

【００３５】 [0035]

【００３６】また、文字遷移評価値の算出のためのデー
タを単語辞書すべてを用いて行う必要もなく、学習デー
タとして、本装置に入力される可能性のあるもの、ある
いは出現頻度の多いデータに限っても良い。Further, it is not necessary to perform data for calculating the character transition evaluation value using all of the word dictionaries, and it is possible to use, as learning data, data that may be input to the apparatus or data having a high frequency of appearance. May be limited.

【００３７】[0037]

【発明の効果】以上説明したように本発明によれば、隣
接する文字が接触したり重なったりして、比較的自由に
筆記された文字列画像からも、文字を正確に切り出すこ
とを可能にすると共に、個別文字が誤認識されたときに
も単語情報を用いて修正することが可能である。As described above, according to the present invention, a character can be accurately cut out even from a character string image written relatively freely, with adjacent characters touching or overlapping. In addition, even when an individual character is erroneously recognized, it can be corrected using the word information.

[Brief description of the drawings]

【図１】本発明の文字列認識装置の一実施例を示すブロ
ック図である。FIG. 1 is a block diagram showing one embodiment of a character string recognition device of the present invention.

【図２】従来の文字列認識方法を説明するための図であ
る。FIG. 2 is a diagram for explaining a conventional character string recognition method.

【図３】本実施例の手書き文字列画像に対する候補文字
パタン抽出の説明図である。FIG. 3 is an explanatory diagram of extracting candidate character patterns from a handwritten character string image according to the embodiment;

【図４】本実施例の文字パタンの評価要素の説明図であ
る。FIG. 4 is an explanatory diagram of a character pattern evaluation element of the present embodiment.

【図５】本実施例の候補文字パタンの文字評価値の説明
図である。FIG. 5 is an explanatory diagram of a character evaluation value of a candidate character pattern according to the present embodiment.

【図６】本実施例の文字数設定部５の処理内容を示すフ
ローチャートである。FIG. 6 is a flowchart showing processing contents of a character number setting unit 5 of the embodiment.

【図７】本実施例の認識結果検定部９の単語照合の説明
図である。FIG. 7 is an explanatory diagram of word collation by the recognition result test unit 9 of the embodiment.

[Explanation of symbols]

１文字列画像記憶部２候補文字パタン抽出部３文字認識辞書記憶部４文字認識部５文字数設定部６文字遷移評価値記憶部７文字切り出し部８単語辞書記憶部９認識結果検定部１０文字列認識結果出力部 1 Character string image storage unit 2 Candidate character pattern extraction unit 3 Character recognition dictionary storage unit 4 Character recognition unit 5 Character number setting unit 6 Character transition evaluation value storage unit 7 Character cutout unit 8 Word dictionary storage unit 9 Recognition result test unit 10 Character string Recognition result output section

フロントページの続き (56)参考文献特開昭63−136291（ＪＰ，Ａ) 特開平２−238589（ＪＰ，Ａ) 特開昭63−103393（ＪＰ，Ａ) 特開平２−116988（ＪＰ，Ａ) 特開昭60−156190（ＪＰ，Ａ) 特開平４−92994（ＪＰ，Ａ) 特開平４−85687（ＪＰ，Ａ) 特開平３−225579（ＪＰ，Ａ) 特開昭59−216285（ＪＰ，Ａ) 特開昭61−220081（ＪＰ，Ａ) 特開平２−245889（ＪＰ，Ａ) 特開平２−165287（ＪＰ，Ａ) 特開平１−281586（ＪＰ，Ａ) 特開平２−91785（ＪＰ，Ａ) 特開平１−134585（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06K 9/62 G06K 9/72 G06K 9/34 Continuation of the front page (56) References JP-A-63-136291 (JP, A) JP-A-2-238589 (JP, A) JP-A-63-103393 (JP, A) JP-A-2-116988 (JP) JP-A-60-156190 (JP, A) JP-A-4-92994 (JP, A) JP-A-4-85687 (JP, A) JP-A-3-225579 (JP, A) 59-216285 (JP, A) JP-A-61-220081 (JP, A) JP-A-2-245889 (JP, A) JP-A-2-165287 (JP, A) JP-A-1-281586 (JP, A A) JP-A-2-91785 (JP, A) JP-A-1-134585 (JP, A) (58) Fields investigated (Int. Cl. ⁶ , DB name) G06K 9/62 G06K 9/72 G06K 9 / 34

Claims

(57) [Claims]

1. A character evaluation value is calculated based on a result of a recognition process of extracting a candidate character pattern from a character string image optically read from a form and comparing the candidate character pattern with a standard pattern of a character recognition dictionary. Assuming the number of characters of the character string image by the candidate character pattern, cut out the number of characters from the character string image by the character evaluation value and a character transition evaluation value representing transition information between characters calculated from a word dictionary in advance, and a character code Output as a sequence, collate the character code sequence with words stored in the word dictionary, and if all characters in the character code sequence are collated, recognize the character code sequence in the character string image. Output as a result, if not collated, change the number of characters within the range of the number of possible characters, repeat the character cutout process and the word matching process, and A character string recognizing method characterized in that, if collation is not performed for all of the range, an optimal collation result within the range of the number of possible characters is output as a recognition result of the character string image.

2. A character string image storage means for storing a character string image read by optical scanning, and a candidate character pattern to be a character candidate is extracted from the character string image read from the character string image storage means. Candidate character pattern extraction means for storing, character recognition dictionary storage means for storing standard patterns, candidate character patterns read from the candidate character pattern extraction means, and standard patterns read from the character recognition dictionary storage means. Character recognition means for storing a character evaluation value obtained from a character code and a recognition result by a recognition process for collating, and a candidate character pattern read from the candidate character pattern extraction means, assuming the number of characters of the character string image. A character number setting means for changing the number of characters within a possible range when the number of characters is input from the recognition result test unit; and a character number from the character number setting means. And a character evaluation value of a candidate character pattern read from the character recognition unit and a character transition evaluation value indicating transition information between characters read from the character transition evaluation value storage unit. Character extraction means for searching for an optimum one of the character string candidates, and storing the character string candidate and a character code string as a recognition result; a character code string from the character extraction means; and a word dictionary storage means. When all the characters of the character code string are matched, the character code string is output to the character string recognition result output means, and when not matched, the matching character string is stored. The number of characters is output to the number-of-characters setting means, and if collation is not performed for all of the possible number of characters, an optimum collation character string among the stored collation character strings is determined by the character string recognition result. A recognition result verification unit that outputs the force means, the recognition result verification unit string recognition apparatus characterized by having a character string recognition result output means for outputting a character collation result from.