JP2851865B2

JP2851865B2 - Character recognition device

Info

Publication number: JP2851865B2
Application number: JP1148382A
Authority: JP
Inventors: 正己久貝
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1989-06-13
Filing date: 1989-06-13
Publication date: 1999-01-27
Anticipated expiration: 2014-01-27
Also published as: JPH0314081A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、文字画像を文字コードに変換する文字認識
装置に関する。Description: TECHNICAL FIELD The present invention relates to a character recognition device that converts a character image into a character code.

[Conventional technology]

従来、文字認識装置は文字画像から抽出した特徴パタ
ーンと文字コードが判明している標準特徴パターンとの
距離計算を行い、最も両特徴パターンが類似する文字コ
ードを文字画像の認識結果として出力する。文字認識装
置に用いられている文字認識手法の一つとして文献H.A.
Glucksman“Classification of mixed font alphabetic
s by characteristic loci"Digestol 1st Ann.IEEE Com
puter Conf.,pp.137−141,1967に示されている文字認識
手法（以降、グラクスマンの方法と呼ぶ）は、文字の位
相的特徴を良くとらえているため、簡単な認識処理で英
数字・カタカナ程度の文字認識を比較的好結果に行える
ことが知られている。2. Description of the Related Art Conventionally, a character recognition device calculates a distance between a feature pattern extracted from a character image and a standard feature pattern whose character code is known, and outputs a character code having a similarity between the two feature patterns as a recognition result of the character image. Document HA is one of the character recognition methods used in character recognition devices.
Glucksman “Classification of mixed font alphabetic
s by characteristic loci "Digestol 1st Ann.IEEE Com
puter Conf., pp. 137-141, 1967, the character recognition method (hereinafter referred to as Graxman's method) captures the topological features of characters well, so it uses simple recognition processing for alphanumeric characters and characters. It is known that character recognition of about katakana can be performed relatively successfully.

グラクスマンの方法では、文字画像の背景部に着目
し、背景部を構成する白画像から上下左右方向に半直線
を延ばし文字線と交差する回数を計数する。そして計数
値を０回、１回、２回以上の３段階に分けてそれぞれ
“0"、“1"、“2"の数値にコード化し、１つの白画素に
ついて各方向ごとにこのコードを符す。次に、各白画素
についての3⁴＝81通りの文字線に対する上記コードの組
み合わせ（包囲状態情報）をテーブルとして予め作成し
ておき、認識対象の文字画像から包囲情報コードを抽出
し、上記テーブルの包囲情報コードと同じ包囲情報コー
ドをもつ白画素の個数を81次元の特徴ベクトルとして与
える。次にこの特徴ベクトルを文字コードが判明してい
る標準特徴ベクトルと照合することにより文字認識を行
っていた。In the method of Glaxman, focusing on the background portion of the character image, a half line is extended in the vertical and horizontal directions from the white image forming the background portion, and the number of times of intersection with the character line is counted. Then, the count value is divided into three stages of 0 times, 1 time, 2 times or more, and coded into numerical values of “0”, “1”, and “2”, respectively, and this code is coded for each direction for one white pixel. You. Next, a combination of the above codes (surrounding state information) for 3 ⁴ = 81 character lines for each white pixel is created in advance as a table, and the surrounding information code is extracted from the character image to be recognized. The number of white pixels having the same surrounding information code as the surrounding information code is given as an 81-dimensional feature vector. Next, character recognition is performed by comparing this feature vector with a standard feature vector whose character code is known.

[Problems to be solved by the invention]

しかしながら、上述のグラクスマンの方法に限らず認
識対象の文字から抽出した特徴パターン（ベクトル）と
文字コードが判明している文字から抽出した標準特徴パ
ターンとの距離計算を行って文字認識を行う場合、形状
が似ている文字を誤認識する場合がある。例えば、上述
のグラクスマンの方法では、Ｊをンに、９を？に、シを
ンに、ンをシに、ミを８に間違えることがあった。However, when character recognition is performed by performing a distance calculation between a feature pattern (vector) extracted from a character to be recognized and a standard feature pattern extracted from a character whose character code is known, without being limited to the above-mentioned Graxman method, Characters with similar shapes may be erroneously recognized. For example, in the above-mentioned Graxman's method, set J to n and 9 to? In some cases, he mistaken for Shi, N for Shi, and Mi for 8.

そこで、本発明の目的は、上述の不具合を解消し、従
来の文字認識装置の構成を大幅に変更することなく、よ
り文字認識精度を向上させることが可能な文字認識装置
を提供することにある。Accordingly, it is an object of the present invention to provide a character recognition device capable of solving the above-mentioned problems and improving the character recognition accuracy without significantly changing the configuration of the conventional character recognition device. .

[Means for solving the problem]

このような目的を達成するために本発明の第１形態
は、認識対象の文字画像から抽出した特徴パターンと文
字コード別に用意した標準特徴パターンとの距離計算を
行って、特徴パターンに近似する複数個の標準パターン
の文字コードを、文字認識候補として出力する文字識別
手段と、認識対象の文字画像から、画像認識手法を用い
て当該文字画像を構成する輪郭線の長さおよび個数の少
なくともいずれか一方の輪郭線特徴情報を検出する検出
手段と、輪郭線特徴情報を照合するため文字コード毎に
予め設定した標準輪郭線特徴情報を記憶した記憶手段
と、文字識別手段から出力された複数個の文字コードに
対応する標準輪郭線特徴情報を記憶手段から抽出し、抽
出した標準輪郭線特徴情報と検出手段により検出された
輪郭線特徴情報とが一致する文字コードを選択出力する
選択手段とを具えたことを特徴とする。In order to achieve such an object, the first embodiment of the present invention calculates a distance between a characteristic pattern extracted from a character image to be recognized and a standard characteristic pattern prepared for each character code, and calculates a plurality of distances approximate to the characteristic pattern. Character identification means for outputting the character codes of the standard patterns as character recognition candidates, and at least one of the length and the number of contour lines constituting the character image by using an image recognition method from the character image to be recognized. Detecting means for detecting one of the contour feature information; storage means for storing standard contour feature information preset for each character code for collating the contour feature information; The standard contour feature information corresponding to the character code is extracted from the storage means, and the extracted standard contour feature information matches the contour feature information detected by the detection means. Characterized in that comprises a selecting means for selectively outputting the character code that.

さらに本発明の第２形態は、第２輪郭線情報に含まれ
る個数は変動上限値と下限値とを有することを特徴とす
る。Further, the second mode of the present invention is characterized in that the number included in the second contour information has a fluctuation upper limit value and a lower limit value.

(Operation)

本発明では、文字毎にその文字を構成する輪郭線の長
さまたは個数が定まることに着目し、認識対象文字およ
び標準文字のそれぞれから抽出した特徴パターンとの距
離計算により複数の認識候補文字を選出し、輪郭線特徴
情報に基づき、選出した候補文字の中から文字認識結果
を選択照合するようにしたので、例えばグラクスマンの
手法を用いた文字認識を行う場合は、より人間の直感的
観測に近い識別を達成でき高精度の認識を可能とする。In the present invention, attention is paid to the fact that the length or the number of contour lines constituting the character is determined for each character, and a plurality of recognition candidate characters are calculated by calculating a distance from a feature pattern extracted from each of a recognition target character and a standard character. Based on selection and outline feature information, character recognition results are selected and collated from the selected candidate characters.For example, when performing character recognition using the Graxman method, more intuitive human observation Close recognition can be achieved and high-precision recognition is possible.

〔Example〕

以下に、図面を参照して本発明を詳細に説明する。 Hereinafter, the present invention will be described in detail with reference to the drawings.

第１図は本発明第１実施例の回路構成を示す。第１図
において１は、文書を読み込み２値画像データに変換す
る光電変換素子を用いた読取り部である。２は２値画像
データを記憶する画像メモリである。３は画像メモリ２
に記憶された２値画像データを１文字分の大きさに切り
出すための中央演算処理装置（CPU）を用いた文字切り
出し回路、４は切り出された文字画像データからグラク
スマン手法により文字の特徴ベクトルを抽出し、抽出し
た特徴ベクトルを第１識別回路５へ送り込む特徴抽出回
路であり、CPUを用いる。FIG. 1 shows a circuit configuration of a first embodiment of the present invention. In FIG. 1, reference numeral 1 denotes a reading unit using a photoelectric conversion element for reading a document and converting it into binary image data. Reference numeral 2 denotes an image memory for storing binary image data. 3 is an image memory 2
A character extraction circuit using a central processing unit (CPU) for extracting the binary image data stored in the CPU into a size of one character, and 4 extracts a character feature vector from the extracted character image data by the Graxman method. This is a feature extraction circuit that extracts and sends the extracted feature vector to the first identification circuit 5, and uses a CPU.

５は上記特徴ベクトルを受け取り、特徴ベクトルと辞
書部６内の各文字の平均ベクトルとの距離計算を行うCP
Uを用いた第１識別回路であり、距離の最も小さい方か
ら例えば３個の候補文字を選択し、該３個の候補文字の
文字コードを照合回路８へ送り込む。A CP 5 receives the feature vector and calculates a distance between the feature vector and an average vector of each character in the dictionary unit 6.
This is a first identification circuit using U, and selects, for example, three candidate characters from the smallest distance, and sends the character codes of the three candidate characters to the matching circuit 8.

７は画像メモリ２内の切り出された文字画像データか
ら輪郭線を抽出し、内側輪郭数と外側輪郭数の係数を求
めるCPUを用いた輪郭線数計数回路である。８はCPUを用
いた照合回路であり、輪郭線計数回路７で求まった内側
輪郭線数および外側輪郭線数と輪郭線情報テーブルの各
文字毎の内側輪郭線数および外側輪郭線数とを照合し、
第１認識回路５で得られた３個の候補文字のうち最も合
致する文字を選択しその文字コードを出力する。10は文
字コードを記憶する出力バッファである。Reference numeral 7 denotes a contour line number counting circuit using a CPU for extracting contour lines from the cut-out character image data in the image memory 2 and calculating coefficients of the number of inner contours and the number of outer contours. Reference numeral 8 denotes a matching circuit using a CPU, which compares the number of inner contour lines and the number of outer contour lines obtained by the contour line counting circuit 7 with the number of inner contour lines and the number of outer contour lines for each character in the contour line information table. And
The best matching character among the three candidate characters obtained by the first recognition circuit 5 is selected and its character code is output. An output buffer 10 stores a character code.

第２図は輪郭線情報テーブルの内容を示す。 FIG. 2 shows the contents of the contour line information table.

第２図において、輪郭線情報テーブル９には認識対象
とする文字の文字コードと文字コードと対応する内側輪
郭線数変動上限および外側輪郭線数変動上限が記憶され
ている。ここで、変動上限とは手書き文字の変動により
統計的に有り得る輪郭線数の上限を意味する。例えば
“E"という文字は通常の書き方では外側輪郭線数が“1"
であるが、文字線分を離して書くと外側輪郭線数が“2"
となるので、変動上限は“2"となる。なお、上記変動上
限に加え変動下限をも用いると文字の認識精度が高ま
る。In FIG. 2, the contour line information table 9 stores the character code of the character to be recognized and the upper limit of the number of inner contour lines and the upper limit of the change of the number of outer contour lines corresponding to the character code. Here, the upper limit of variation means the upper limit of the number of contour lines that can be statistically due to the variation of handwritten characters. For example, the letter “E” has the number of outer contour lines “1” in normal writing
However, if the characters are written apart from each other, the number of outer contours will be "2".
Therefore, the upper limit of the fluctuation is “2”. If the lower limit of variation is used in addition to the upper limit of variation, the accuracy of character recognition is increased.

第１図に戻り、本発明実施例の回路動作を第３図
（１），（２）のフローチャートを参照して説明する。Returning to FIG. 1, the circuit operation of the embodiment of the present invention will be described with reference to the flowcharts of FIGS. 3 (1) and 3 (2).

読取り部１が原稿から文書画像を入力すると、文書情
報は２値画像に変換されて画像メモリ２に記憶される
（第３図ステップS1）。次に文字切り出し回路３が動作
し、文書単位の２値画像データが１文字毎文字画像デー
タに切り出されて画像メモリ２の余白に書き込まれる
（ステップS2）。When the reading unit 1 inputs a document image from a document, the document information is converted into a binary image and stored in the image memory 2 (step S1 in FIG. 3). Next, the character cutout circuit 3 operates, and the binary image data for each document is cut out into character image data for each character and written in the margin of the image memory 2 (step S2).

次に特徴抽出回路４が動作し文字画像データからグラ
クスマンの手法により特徴ベクトルを作成し、この特徴
ベクトルを識別回路５へ出力する（ステップS11）。次
に識別回路５が働き、辞書部６内の各文字の平均ベクト
ルと特徴ベクトルのユークリッド距離を計算して距離値
の小さい順に３個の候補文字コードを検索（ソート）す
る（ステップS12〜S16）。ここまでは従来の文字認識回
路と同様の動作となる。次に３個の候補文字コードを距
離値の小さい順番に照合回路８へ渡す。Next, the feature extraction circuit 4 operates to create a feature vector from the character image data by the method of Graxman, and outputs this feature vector to the identification circuit 5 (step S11). Next, the identification circuit 5 operates, calculates the Euclidean distance between the average vector and the feature vector of each character in the dictionary unit 6, and searches (sorts) three candidate character codes in ascending order of distance value (steps S12 to S16). ). The operation so far is the same as that of the conventional character recognition circuit. Next, the three candidate character codes are passed to the matching circuit 8 in the order of the smallest distance value.

一方、特徴抽出回路４が動作開始するのと同時に輪郭
線数計数回路７が動作開始して、周知の輪郭追跡手法に
より文字画像データの内側輪郭線数と外側輪郭数を算出
し、算出結果を照合回路８へ送る（ステップS20）。照
合回路８は輪郭線数計数回路７および識別回路５からの
情報入力を終了すると、次の照合処理を行う。ここで、
入力した内側輪郭数をn_i、入力した外側輪郭数をn_o、第
１候補文字、第２候補文字および第３候補文字と対応す
る輪郭線情報テーブル９の内側輪郭数上限と外側輪郭数
上限をおのおのN_1i,N_1o,N_2i,N_2o,及びN_3i,N_3oと表わ
す。すなわち、照合処理（１）上記数値の比較を行い、比較結果がn_i≦N_1iかつn_1o≦
N_1oの時には第１候補文字をそのまま最終認識結果とし
てその文字コードを出力バッファ10へ出力する（ステッ
プS22→S26）。On the other hand, at the same time as the operation of the feature extraction circuit 4 starts, the number of contour lines counting circuit 7 starts to calculate the number of inner contour lines and the number of outer contours of the character image data by a well-known contour tracking method. It is sent to the matching circuit 8 (step S20). When the input of information from the contour line number counting circuit 7 and the identification circuit 5 is completed, the matching circuit 8 performs the next matching process. here,
Entered the inner contour number n _i, the outer contour number n _o input, the first candidate character, the second option character and the third candidate characters as the corresponding inner contour number and upper outer contour maximum number of contour information table 9 _{Are denoted as} N _1i , N _1o , N _2i , N _2o , and N _3i , N _3o , respectively. That is, the matching process (1) The above numerical values are compared, and the comparison result is n _i ≦ N _1i and n _1o ≦
In the case of _N1o , the first candidate character is output to the output buffer 10 as the final recognition result as it is as the final recognition result (step S22 → S26).

比較結果がn_i＞N_1iまたはn_o＞N_1oのときには以降の照
合処理（２）、（３）を行う。When the comparison result is n _i > N _1i or n _o > N _1o , the following collation processes (2) and (3) are performed.

（２）もし、輪郭数の大小関係がn_i≦N_2iかつn_o≦N_2oな
らば、第１候補文字と第２候補文字を入れ替える。そう
でなければ次の照合処理（３）を行う（ステップS23→S
25）。(2) If the magnitude relation of the number of contours is n _i ≦ N _2i and n _o ≦ N _2o , the first candidate character and the second candidate character are exchanged. Otherwise, the next matching process (3) is performed (step S23 → S
twenty five).

（３）もし輪郭数の大小関係がn_i≦N_3iかつn_o≦N_3oなら
ば、第１候補文字と第３候補文字を入れかえる（ステッ
プS24→S25）。(3) If the relationship of the number of contours is n _i ≦ N _3i and n _o ≦ N _3o , the first candidate character and the third candidate character are switched (step S24 → S25).

上述の（１）〜（３）の照合処理を行った後、照合回
路８は第１候補文字の文字コードを出力バッファ10に出
力する（ステップS26）。After performing the above-described matching processes (1) to (3), the matching circuit 8 outputs the character code of the first candidate character to the output buffer 10 (Step S26).

本実施例では認識候補を３個と個数限定しているが所
望に応じて一定個数に定めれば良く、また識別回路５で
行う距離計算結果をしきい値比較して、しきい値以下の
距離計算結果を有する文字コードすべてを認識候補とし
てもよい。In the present embodiment, the number of recognition candidates is limited to three. However, the number may be set to a predetermined number as desired. All the character codes having the distance calculation result may be set as recognition candidates.

[Other embodiments]

次に第２の実施例について説明する。本例は輪郭線情
報として輪郭数に代り、内側輪郭線長と外側輪郭線長を
利用した例である。Next, a second embodiment will be described. This example is an example in which the inner contour length and the outer contour length are used as the contour information instead of the number of contours.

この場合、第４図に示すように、輪郭線数計数回路７
が輪郭線長計算回路７′に代り、輪郭線情報テーブルの
記載内容が輪郭線から輪郭線長に代わる。In this case, as shown in FIG.
Replaces the contour length calculation circuit 7 ', and the content of the contour line information table changes from the contour line to the contour line length.

本例では第１の実施例と同様に識別回路５では各文字
毎の平均ベクトルと入力特徴ベクトルの距離計算がなさ
れ距離の小さい平均ベクトルを有する文字コード３個お
よびその距離値が照合回路８へ送られる。これらの距離
値を順にd₁,d₂,d₃で表わす。また認識候補として選択さ
れた文字コードを順にc₁,c₂,c₃で示す。輪郭線長計算回
路７で計算された入力文字画像の内側輪郭線長と外側輪
郭線長をそれぞれl_i,l_oで示す。In this example, as in the first embodiment, the identification circuit 5 calculates the distance between the average vector for each character and the input feature vector, and sends three character codes having an average vector with a small distance and the distance value to the matching circuit 8. Sent. These distance values are represented by d ₁ , d ₂ and d ₃ in order. The order indicated by c _1, c _2, c ₃ a character code selected as a recognition candidate. Inner contour line length of the calculated input character image in the contour line length calculating circuit 7 and the outer contour line length respectively indicated by l _i, l _o.

照合回路８では３つの各候補文字について輪郭線長に
関する距離が次の式で求められる。In the matching circuit 8, the distance with respect to the contour length for each of the three candidate characters is obtained by the following equation.

e_k＝（l_i−L_i ^(k)）^２＋（l_o−L_o ^(k)）^２ここでe_kは求めようとしている候補文字C_kについての
距離、L_i ^(k)、L_o ^(k)は候補文字C_kについての輪郭線情報
テーブル（第５図参照）の内側輪郭線の平均長と外側輪
郭線の平均長である。そして予め決定してある重みωを
使って新たな距離D_kを次式で計算する。e _k = (l _i −L _i ^(k) ) ² + (l _o −L _o ^(k) ) ² where e _k is the distance of the candidate character C _{k to be obtained} , L _i ^(k) , L _o ^(k) is the average length of the average length and the outer contour of the inner contour line of the contour line information table for the candidate character C _k (see FIG. 5). Then, a new distance _Dk is calculated by the following equation using the predetermined weight ω.

D_k＝d_k＋ωe_k （_ｋ＝1,2,3）そして、D_k（_ｋ＝1,2,3）の中で最小値の対応する文
字コードC_kを出力バッファへ送る。D _k = d _k + ωe _k ( _k = 1,2,3) Then, the corresponding character code C _k of the minimum value among D _k ( _k = 1,2,3) is sent to the output buffer.

〔The invention's effect〕

以上説明したように、本発明によれば文字毎にその文
字を構成する輪郭線の長さまたは個数が定まることに着
目し、認識対象文字および標準文字のそれぞれから抽出
した特徴パターンとの距離計算により認識候補文字を選
出し、選出した候補文字の中からそれぞれを輪郭線情報
に基づき、文字認識結果を選択照合するようにしたの
で、例えばグラクスマンの手法を用いた文字認識を行う
場合は、より人間の直感的観測に近い識別を達成でき高
精度の認識を可能とする。As described above, according to the present invention, attention is paid to the fact that the length or the number of contour lines constituting the character is determined for each character, and the distance calculation with the feature pattern extracted from each of the recognition target character and the standard character is performed. Is selected based on the outline information, and the character recognition result is selectively collated from among the selected candidate characters.For example, when performing character recognition using the method of Graxman, A classification close to intuitive human observation can be achieved, and highly accurate recognition is possible.

さらに、輪郭線の個数に上限値に加えて下限値を設け
ることにより、文字を構成する輪郭線がそれぞれ離れて
いる場合でも正確に文字認識を行うことができる。Further, by setting a lower limit value in addition to the upper limit value to the number of contour lines, character recognition can be performed accurately even when the contour lines forming the character are separated from each other.

[Brief description of the drawings]

第１図は本発明第１実施例の回路構成を示すブロック
図、第２図は本発明第１実施例の輪郭線情報テーブル９の内
容を示す説明図、第３図（１），（２）は本発明第１実施例の動作手順を
示すフローチャート、第４図は本発明第２実施例の回路構成を示すブロック
図、第５図は本発明第２実施例の輪郭線情報テーブルの内容
を示す説明図である。１……読取り部、２……画像メモリ、３……文字切り出し回路、４……特徴抽出回路、５……識別回路、６……辞書部、７……輪郭線計数回路、８……照合回路、９……輪郭線情報テーブル、 10……出力バッファ。FIG. 1 is a block diagram showing the circuit configuration of the first embodiment of the present invention, FIG. 2 is an explanatory diagram showing the contents of the contour line information table 9 of the first embodiment of the present invention, and FIGS. ) Is a flowchart showing the operation procedure of the first embodiment of the present invention, FIG. 4 is a block diagram showing the circuit configuration of the second embodiment of the present invention, and FIG. 5 is the contents of the contour line information table of the second embodiment of the present invention. FIG. 1 ... Reading unit, 2 ... Image memory, 3 ... Character extraction circuit, 4 ... Feature extraction circuit, 5 ... Identification circuit, 6 ... Dictionary unit, 7 ... Contour line counting circuit, 8 ... Collation Circuit 9 ... Contour line information table 10 ... Output buffer.

フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06K 9/46 G06K 9/62 G06K 9/68 G06K 9/03Continued on the front page (58) Fields surveyed (Int.Cl. ⁶ , DB name) G06K 9/46 G06K 9/62 G06K 9/68 G06K 9/03

Claims

(57) [Claims]

A distance calculation is performed between a feature pattern extracted from a character image to be recognized and a standard feature pattern prepared for each character code, and the character codes of the plurality of standard patterns approximate to the feature pattern are converted into a character code. Character identification means for outputting as a recognition candidate, and detecting, from the character image to be recognized, at least one of contour length feature information of the length and the number of contours constituting the character image by using an image recognition technique. Detecting means; storage means for storing standard contour feature information preset for each character code for collating the contour feature information; and the plurality of character codes corresponding to the plurality of character codes output from the character identifying means. Standard outline feature information is extracted from the storage unit, and the extracted standard outline feature information and the outline feature information detected by the detection unit are extracted. Matching character recognition apparatus, characterized in that it comprises a selection means for selectively outputting the character code.

2. The character recognition device according to claim 1, wherein the number of contour lines included in the standard contour line feature information has a fluctuation upper limit value and a lower limit value.