JP2574776B2

JP2574776B2 - Character recognition method

Info

Publication number: JP2574776B2
Application number: JP61251638A
Authority: JP
Inventors: 康明中野; 彰三門田; 浩道藤澤
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1986-10-24
Filing date: 1986-10-24
Publication date: 1997-01-22
Anticipated expiration: 2012-01-22
Also published as: JPS63106884A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は文字認識方法に係り、特に文字フオントが未
知の印刷文字パターンを高精度で認識する目的に好適な
文字認識方法に関する。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition method, and more particularly to a character recognition method suitable for the purpose of recognizing a printed character pattern whose character font is unknown with high accuracy.

[Conventional technology]

従来の印刷文字認識方式では、文字フオントが既知の
場合（これをシングルフオント文字認識と呼ぶ）にはそ
の文字フオントの標準パターンからなるパターン辞書を
使用して高精度で認識するようにしていた。しかし、文
字フオントが多数あつてその中のどれが使用されている
かが未知の場合（これをマルチフオント文字認識と呼
ぶ）には、対象となる全ての文字フオントについての標
準パターンからなるパターン辞書を使用する必要がある
ため、辞書メモリーが増加するとともに認識速度が低下
し、認識精度も悪化するなどの問題があつた。In the conventional print character recognition method, when a character font is known (this is called single-font character recognition), recognition is performed with high precision using a pattern dictionary including a standard pattern of the character font. However, if there are many fonts and it is not known which one is used (this is called multi-font recognition), a pattern dictionary consisting of standard patterns for all target fonts is used. Since it must be used, the dictionary memory increases, the recognition speed decreases, and the recognition accuracy deteriorates.

この問題に対処するため、例えば特公昭59-3793にお
いては複数個のパターン辞書を用意し、そのうちの１個
を用いて１行分の文字を認識し、認識結果の中のリジエ
クトの個数を計数して、リジエクト数が多い場合には他
の辞書に取り替えて認識を再度行う方法が記述されてい
る。また、特開昭53-51928においては複数のフオントに
対応する複数の判別器を用意し、その全てを用いて１フ
イールド分の文字を認識し、各判別器による認識結果の
中のリジエクトの個数を計数して、リジエクト数が最小
となるような認識結果を選択する方法が記述されてい
る。この方法では複数の判別器を用いているが、判別器
は１個として複数個のパターン辞書を順次切り替えて認
識するようにしても、速度は遅くなるが認識率の面では
同一の効果が得られることは明らかである。To deal with this problem, for example, Japanese Patent Publication No. 59-3793 prepares a plurality of pattern dictionaries, recognizes one line of characters using one of them, and counts the number of rejects in the recognition result. A method is described in which when the number of rejects is large, the dictionary is replaced with another dictionary and recognition is performed again. In Japanese Patent Application Laid-Open No. 53-51928, a plurality of discriminators corresponding to a plurality of fonts are prepared, all of them are used to recognize one field of characters, and the number of rejects in the recognition result by each discriminator is determined. And a method of selecting a recognition result that minimizes the number of rejects is described. In this method, a plurality of discriminators are used. However, even if one discriminator is used to sequentially switch and recognize a plurality of pattern dictionaries, the speed is reduced but the same effect is obtained in terms of the recognition rate. It is clear that

[Problems to be solved by the present invention]

特公昭59-3793あるいは特開昭53-51928のような従来
手法では、複数のフオントに対応して全文字カテゴリー
についての標準パターンを用意するため、（文字カテゴ
リー数）×（フオント数）だけの標準パターンが必要と
なり、辞書メモリが増大する問題があつた。In a conventional method such as Japanese Patent Publication No. 59-3793 or Japanese Patent Application Laid-Open No. 53-51928, standard patterns for all character categories are prepared corresponding to a plurality of fonts, so that only (number of character categories) × (number of fonts) is required. A standard pattern is required, and the dictionary memory is increased.

本発明の目的は、マルチフオント印刷文字に対して、
辞書メモリが比較的小さくて済み、シングルフオント印
刷文字なみの高い認識率が得られる文字認識方法を提供
することである。An object of the present invention is to print multi-font characters.
An object of the present invention is to provide a character recognition method that requires a relatively small dictionary memory and can obtain a recognition rate as high as that of a single font print character.

[Means for solving the problem]

本発明では上記の問題点を、辞書として文字フオント
別の標準パターンを用意するのではなく、文字カテゴリ
ごとに複数個の標準パターンを記憶させ、各文字カテゴ
リ内でフオントが異なつていても類似したパターンであ
る場合には同一の標準パターンとすることにより、記憶
させる標準パターンの個数を減少させることにより解決
する。その際、各文字フオントごとに各文字カテゴリに
対してどの標準パターンを使用するかを指定する情報を
あらかじめ記憶させたメモリを別に用意しておくことに
より、各フオントごとに別の辞書を用意したのと同一の
効果を得る。In the present invention, the above problem is solved by storing a plurality of standard patterns for each character category instead of preparing a standard pattern for each character font as a dictionary, and even if fonts are different in each character category. If the patterns are the same, the problem is solved by reducing the number of standard patterns to be stored by using the same standard pattern. At that time, a separate dictionary was prepared for each font by separately preparing a memory that previously stored information specifying which standard pattern to use for each character category for each font. Has the same effect as

[Action]

辞書として文字フオント別の標準パターンを用意せ
ず、文字カテゴリごとに複数個の標準パターンを記憶さ
せたものを使用するので、異なつた文字フオントであつ
ても同一カテゴリ内で類似している場合、それらに対す
る標準パターンを共用できるので、メモリが節約され
る。As a dictionary, a standard pattern for each character font is not prepared, and a dictionary in which a plurality of standard patterns are stored for each character category is used, so even if different character fonts are similar in the same category, Memory can be saved because standard patterns for them can be shared.

文字認識に際しては、未知文字フオントの文字パター
ンが与えられ、認識装置は、その文字カテゴリに属する
複数個の標準パターンの各々と未知文字フオントの入力
文字パターンとを比較し、最も類似した標準パターンを
決定する。各文字フオントについて、あらかじめ各文字
カテゴリごとに、どの標準パターンが対応するかを指定
しておく。複数個の（例えば１行分の）文字パターンの
認識を行つた後、上記の最も類似した標準パターンに対
応する文字フオント（複数個になることもある）を決定
し、その集計結果から入力文書の文字フオントを決定す
る。このようにして決定した文字フオント情報を用い
て、辞書中の該文字フオントに対応する標準パターンを
選択することができ、入力パターンを再度認識すれば、
実効的にシングルフオント文字認識と同等となり、高精
度の文字認識が可能となる。また、文書上の一部につい
て文字フオントを決定した後、文書上の残りの部分につ
いてはその文字フオントを仮定して文字認識を行うこと
ができ、高速に認識できる。At the time of character recognition, the character pattern of the unknown character font is given, and the recognition device compares each of the plurality of standard patterns belonging to the character category with the input character pattern of the unknown character font, and determines the most similar standard pattern. decide. For each character font, which standard pattern corresponds to each character category is specified in advance. After recognizing a plurality of character patterns (for example, for one line), a character font (may be plural) corresponding to the above-described most similar standard pattern is determined, and an input document is determined from the tally result. Determine the font of the character. Using the character font information determined in this way, a standard pattern corresponding to the character font in the dictionary can be selected, and if the input pattern is recognized again,
Effectively, it is equivalent to single-font character recognition, and high-precision character recognition becomes possible. After the character font is determined for a part of the document, the character recognition can be performed for the remaining part of the document assuming the character font, thereby enabling high-speed recognition.

本発明では文字フオントごとに指定する情報は、各文
字カテゴリごとに標準パターンの番号だけでよいので、
文字フオントごとにパターンを記憶する従来方式に比
べ、総メモリ量が大きく節約される。In the present invention, since the information to be specified for each character font need only be the standard pattern number for each character category,
Compared with the conventional method of storing a pattern for each character font, the total memory amount is greatly reduced.

また、同一フオント内の字形変動に対しても、変動パ
ターンそれぞれに対して辞書の中の複数個の標準パター
ンのいずれに相当するかを指定する情報を付加するだけ
で対処できる。In addition, it is possible to cope with the character shape variation in the same font simply by adding information specifying which of the plurality of standard patterns in the dictionary corresponds to each variation pattern.

〔Example〕

以下、本発明を図面を用いて詳細に説明する。 Hereinafter, the present invention will be described in detail with reference to the drawings.

第１図は本発明の文字認識方式の一実施例における処
理の流れをPAD（Problem Analysis Diagram）図で表し
たものである。第２図は第１図の処理を実施する装置の
構成を示すブロツク図である。装置の各部はバス１に接
続され、全体の動作は制御部２により制御される。FIG. 1 is a PAD (Problem Analysis Diagram) diagram showing the flow of processing in an embodiment of the character recognition system of the present invention. FIG. 2 is a block diagram showing the configuration of an apparatus for performing the processing of FIG. Each part of the device is connected to a bus 1, and the overall operation is controlled by a control unit 2.

第１図の処理内容を説明する。第１図（Ａ），（Ｂ）
は、図中の点で接続されている。第１図の（Ａ）は全
体の制御に関連した101〜104,138及び文字切り出し部分
105,文字フオント決定部分106〜126からなり、第１図
（Ｂ）は決定されたフオントを用いて、文字を認識する
部分127〜136からなる。101は処理中で用いられる各種
の変数の初期化である。この中にはフアイル３からパタ
ーン辞書を第２図のメモリ61に、標準パターン−文字フ
オント番号対応テーブルをメモリ62に読み込む処理を含
む。メモリ61,62は後述する63〜65とともにメモリ６の
一部をなす。パターン辞書及び標準パターン−文字フオ
ント番号対応テーブルの作成法は後述する。101の中に
は後述するスイツチSWを０にリセツトする処理も含む。The processing contents of FIG. 1 will be described. FIG. 1 (A), (B)
Are connected at points in the figure. FIG. 1A shows 101 to 104, 138 and character cutout portions related to the overall control.
FIG. 1 (B) is composed of portions 127 to 136 for recognizing characters using the determined font. Reference numeral 101 denotes initialization of various variables used during processing. This processing includes reading the pattern dictionary from the file 3 into the memory 61 of FIG. 2 and reading the standard pattern-character font number correspondence table into the memory 62. The memories 61 and 62 form a part of the memory 6 together with 63 to 65 described later. The method of creating the pattern dictionary and the standard pattern-character font number correspondence table will be described later. 101 includes processing for resetting a switch SW to be described later to 0.

第１図の102は文書画像の入力であり、第２図に示さ
れる文書４上の情報（文書画像）を光電変換装置５によ
り走査し、デイジタル化し、バス１を介してメモリ63に
格納することに対応する。Reference numeral 102 in FIG. 1 denotes a document image input. The information (document image) on the document 4 shown in FIG. 2 is scanned by the photoelectric conversion device 5, digitized, and stored in the memory 63 via the bus 1. Corresponding to

処理の説明の前に辞書の構成と、その引き方を述べ
る。辞書61のメモリ内でのデータ配置の例を第３図に示
す。第３図から分かるように、辞書の中にはまずカテゴ
リポインタテーブル301があり、カテゴリの総数Ｊだけ
エントリがあつて、その内容はカテゴリ名称Ｄ（ｊ）
と、各カテゴリごとの標準パターンテーブル群302-1〜3
02-Jのアドレスを指すポインタＢ（ｊ）からなる。各カ
テゴリごとの標準パターンテーブル302-jは、各カテゴ
リについて登録された標準パターンの個数Ｍ（ｊ）だけ
エントリがあつて、その各各が標準パターンの通番を示
す。すなわち、カテゴリｊの第ｉパターンＰ（j,i）
は、通番ｗを有することを示す。303はパターンアドレ
ステーブルであり、その内容は第ｗ標準パターン本体が
辞書本体部304において占めるアドレスＡ（ｗ）を指
す。辞書本体部304は総数Ｗ個の標準パターンのパター
ンデータあるいは特徴抽出で得られた特徴値を単純に並
べたものである。各パターンの長さが一定の場合には上
記のアドレスＡ（ｗ）は指定不要であるが、不定長のと
きは少なくとも先頭アドレスは指定する必要がある。終
了アドレスは例えばＡ（ｗ＋１）−１として計算でき
る。以上の説明から理解されるようにカテゴリｊが指定
されれば、301→302→303→304の順で、このｊに属する
Ｍ（ｊ）個の標準パターン各々の本体に到達できる。Before describing the processing, the configuration of the dictionary and how to look it up will be described. FIG. 3 shows an example of the data arrangement of the dictionary 61 in the memory. As can be seen from FIG. 3, there is a category pointer table 301 in the dictionary, and there are entries for the total number J of categories, and the contents are the category names D (j).
And standard pattern tables 302-1 to 3 for each category
It consists of a pointer B (j) pointing to the address 02-J. The standard pattern table 302-j for each category has entries for the number M (j) of standard patterns registered for each category, and each entry indicates the serial number of the standard pattern. That is, the i-th pattern P (j, i) of category j
Indicates that it has a serial number w. Reference numeral 303 denotes a pattern address table, the contents of which indicate an address A (w) occupied in the dictionary main unit 304 by the w-th standard pattern main unit. The dictionary main unit 304 simply arranges pattern data of a total of W standard patterns or feature values obtained by feature extraction. If the length of each pattern is constant, the above address A (w) does not need to be specified, but if the length is indefinite, it is necessary to specify at least the start address. The end address can be calculated, for example, as A (w + 1) -1. As will be understood from the above description, if the category j is specified, the main body of each of the M (j) standard patterns belonging to j can be reached in the order of 301 → 302 → 303 → 304.

次に、標準パターン−文字フオント番号対応テーブル
の構成と、その引き方を述べる。標準パターン−文字フ
オント番号対応テーブルのメモリ内でのデータ配置の例
を第４図に示す。第４図から分かるように、文字フオン
トの総数Ｋだけエントリを有する文字フオントポインタ
テーブル401があり、その内容は各文字フオントごとの
標準パターン対応テーブル群402-1〜402-Kのアドレスを
指す。標準パターン対応テーブル402-kの内部構造は、
拡大図403に示されている。この拡大図から分かるよう
に、文字フオントｋについて登録したパターン数ｑが先
頭のレコードで、以下カテゴリCa、パターン番号Paの組
み合わせがｑ組続く。この意味は、この文字フオントｋ
ではカテゴリCaの文字の字体としてパターン番号Paを使
用することを示す。同じカテゴリに対し複数のパターン
番号が登録されていてもよく、そのときは同一フオント
内の変形を表す。以上の説明により、標準パターン−文
字フオント番号対応テーブル62を参照すれば、各文字フ
オントが文字カテゴリごとにどの標準パターン番号を使
用するかの情報が得られることは明らかであろう。Next, the configuration of the standard pattern-character font number correspondence table and how to look it up will be described. FIG. 4 shows an example of the data arrangement in the memory of the standard pattern-character font number correspondence table. As can be seen from FIG. 4, there is a character font pointer table 401 having entries for the total number K of character fonts, and the contents indicate the addresses of the standard pattern correspondence table groups 402-1 to 402-K for each character font. The internal structure of the standard pattern correspondence table 402-k is
This is shown in the enlarged view 403. As can be seen from the enlarged view, the number q of patterns registered for the character font k is the first record, and the following q combinations of the category Ca and the pattern number Pa continue. This means that this font
Indicates that the pattern number Pa is used as the font of the character of the category Ca. A plurality of pattern numbers may be registered for the same category, in which case they represent deformations within the same font. By referring to the standard pattern-character font number correspondence table 62 described above, it is apparent that information on which standard pattern number each character font uses for each character category can be obtained.

第１図の説明に戻ると、第１図の103は文書画像から
文字行を抽出する処理である。ここでは、繰り返しの単
位は文書画像上の文字行としたが、文字行の代わりに、
文字フイールドすなわち文字行の中の帯状領域、あるい
は文字行領域すなわち複数の文字行で構成される領域で
あつてもよい。文書画像から、文字行あるいは文字領域
を抽出する手法は、例えば、電子通信学会論文誌第66-D
巻第１号111頁に掲載の秋山照雄・増田功両氏の論文
「書式指定情報によらない紙面構成要素抽出法」（昭和
58年１月）に紹介されている手法を用いればよい。Returning to the description of FIG. 1, reference numeral 103 in FIG. 1 is a process for extracting a character line from a document image. Here, the unit of repetition is the character line on the document image, but instead of the character line,
A character field, that is, a band-shaped area in a character line, or a character line area, that is, an area configured by a plurality of character lines may be used. A method of extracting a character line or a character region from a document image is described in, for example, IEICE Transactions No. 66-D.
Volume 1, Issue 111, by Teruo Akiyama and Isao Masuda, "Paper Constituent Elements Extraction Method without Format Specification Information" (Showa
The method introduced in Jan. '58 may be used.

第１図の104は文書画像上の文字行ｆ（ｆ＝１〜Ｆ）
について105〜113の処理を繰り返すループ制御である。
ここで、Ｆは文書上の文字行数である。104 in FIG. 1 is a character line f (f = 1 to F) on the document image
Is a loop control that repeats the processing of 105 to 113.
Here, F is the number of character lines on the document.

105は、上記の文字行（または文字フイールドあるい
は文字領域）から文字パターン群を切り出す処理であ
る。この処理は公知であるので詳細は省略する。切り出
された文字パターンは第２図のメモリ64に格納される。
Ｎは切り出された文字パターン数を表す。Reference numeral 105 denotes a process for extracting a character pattern group from the character line (or character field or character area). Since this processing is publicly known, its details are omitted. The extracted character pattern is stored in the memory 64 of FIG.
N represents the number of extracted character patterns.

106は、スイツチSWの値によつて分岐する部分で、SW
が０の場合は入力文書のフオントが未決定の場合に相当
し、107〜126の処理により文字フオントを決定する。106 is a portion that branches depending on the value of the switch SW.
Is 0, which corresponds to the case where the font of the input document has not been determined, and the character font is determined by the processing of 107 to 126.

107は切り出された文字パターンの各々について108〜
114の処理を繰り返すループ制御であり、ｎは文字パタ
ーン番号を示す。すなわち、108〜114の処理が一文字の
認識処理を表わす。108は以下の処理に先立つて類似度
の最大値R0を０に設定しておく初期化処理である。109
は辞書61中の各文字カテゴリについて110〜114の処理を
繰り返すループ制御であり、ｊはカテゴリ番号、Ｊはカ
テゴリの総数を示す。107 is for each of the cut out character patterns.
This is a loop control for repeating the processing of 114, where n indicates a character pattern number. That is, the processing of 108 to 114 represents the recognition processing of one character. Reference numeral 108 denotes initialization processing in which the similarity maximum value R0 is set to 0 prior to the following processing. 109
Is a loop control for repeating the processing of 110 to 114 for each character category in the dictionary 61, j is the category number, and J is the total number of categories.

110では第ｊ文字カテゴリに属する標準パターンの個
数Ｐ（ｊ）を取り出す。111はこのＰ（ｊ）個の標準パ
ターンについて112〜114の処理を繰り返すループ制御で
あり、ｉは文字カテゴリｊの中での標準パターン番号で
ある。i,jから、第３図に示したようにして、これに対
応する標準パターン通番ｗが定まる。112は第ｎ番目の
切り出された文字パターンと第ｗ番目の標準パターンと
を比較して類似度Ｒを求める部分である。標準パターン
が特徴抽出されたものである場合には、類似度計算に先
立つて入力パターンも特徴抽出する必要があることはい
うまでもない。類似度は第２図の制御部２のソフトウエ
アで求められるものとするが、専用のハードウエアを使
用してもよい。類似度の定義は通常の文字認識で使用さ
れる方法のうち適当なものを用いればよい。113は類似
度の最大値R0と現在の類似度Ｒとを比較する判定部であ
る。判定の結果、Ｒの方が大きいときは、114でR0を更
新し、そのときの文字番号ｊと標準パターン通番ｗをＸ
（1,n）及びＸ（2,n）として第２図のメモリ65の中に記
憶する。At 110, the number P (j) of standard patterns belonging to the j-th character category is extracted. 111 is a loop control for repeating the processing of 112 to 114 for the P (j) standard patterns, and i is a standard pattern number in the character category j. As shown in FIG. 3, the corresponding standard pattern serial number w is determined from i and j. Reference numeral 112 denotes a portion for comparing the n-th cut-out character pattern with the w-th standard pattern to obtain a similarity R. If the standard pattern has been subjected to feature extraction, it is needless to say that the input pattern also needs to be feature extracted prior to similarity calculation. Although the degree of similarity is determined by the software of the control unit 2 in FIG. 2, dedicated hardware may be used. The definition of the degree of similarity may use an appropriate method among the methods used in ordinary character recognition. A determination unit 113 compares the maximum similarity R0 with the current similarity R. If the result of the determination is that R is greater, R0 is updated at 114, and the character number j and standard pattern serial number w at that time are X
(1, n) and X (2, n) are stored in the memory 65 of FIG.

以上の説明から了解されるように、107のループが終
了したとき、メモリ65の中のＸ（1,n）及びＸ（2,n）
（ｎ＝１〜Ｎ）には、文字パターンｎに最も類似した文
字カテゴリと標準パターン通番が入つている。As will be understood from the above description, when the loop of 107 is completed, X (1, n) and X (2, n) in the memory 65 are set.
In (n = 1 to N), a character category most similar to the character pattern n and a standard pattern serial number are entered.

115〜124は文字フオントを識別する処理である。115
では最大一致度S0の初期値を０にリセツトする。116は
Ｍ個の文字フオントについて117〜121の処理を繰り返す
ループ制御であり、ｍは文字フオント番号を示す。117
では一致度Ｓの初期値を０にリセツトする。118は切り
出されたＮ個の文字パターンについて119〜121の処理を
繰り返すループ制御であり、ｎは文字パターン番号を示
す。119では文字カテゴリｊ＝Ｘ（1,n）と標準パターン
通番ｗ＝Ｘ（2,n）を取り出す。120では文字フオントｍ
をキー情報としてパターン−文字フオント番号対応テー
ブル62を参照することにより、文字フオントｍ、文字カ
テゴリｊ、標準パターン通番ｗの組み合わせが存在する
か否かを判定する。判定結果がYESの場合には、121で一
致度Ｓを１だけ増す。118のループが終了した後、122で
それまでの最大一致度S0とフオントｍに対する一致度Ｓ
とを比較し、S0＜Ｓならば123で最大一致度S0を更新す
るとともに文字フオント番号ｍをm0として記憶する。Reference numerals 115 to 124 denote character font identification processing. 115
Then, the initial value of the maximum matching degree S0 is reset to 0. Reference numeral 116 denotes a loop control for repeating the processing of steps 117 to 121 for M character fonts, and m indicates a character font number. 117
Then, the initial value of the coincidence S is reset to 0. Reference numeral 118 denotes loop control for repeating the processing of steps 119 to 121 for the cut out N character patterns, and n indicates a character pattern number. At 119, the character category j = X (1, n) and the standard pattern serial number w = X (2, n) are extracted. In 120, the font is m
Is used as key information to refer to the pattern-character font number correspondence table 62 to determine whether a combination of a character font m, a character category j, and a standard pattern serial number w exists. If the result of the determination is YES, the degree of coincidence S is increased by 1 at 121. After the loop 118 is completed, the maximum matching score S0 and the matching score S for the font m at 122 are reached.
And if S0 <S, the maximum matching score S0 is updated at 123 and the character font number m is stored as m0.

Ｍ個の文字フオントについて116のループが終了した
後、124で最大一致度S0を閾値θ・Ｎと比較する。ここ
でθは、０＜θ≦１なるパラメータである。S0が大きい
ときは、125でスイツチSWに１を代入し、文字フオント
が決定されたことを示す。そのときm0は入力されたＮ個
の文字パターンの文字フオントである。After the loop of 116 is completed for M character fonts, the maximum matching degree S0 is compared with the threshold value θ · N at 124. Here, θ is a parameter satisfying 0 <θ ≦ 1. When S0 is large, 1 is assigned to the switch SW at 125 to indicate that the character font has been determined. At that time, m0 is the character font of the inputted N character patterns.

124でS0が閾値より小さいときは、スイツチSWは０の
ままであり、文字フオントが決定不能であつたことを示
す。このとき、126で文字カテゴリＸ（1,n）（ｎ＝１〜
Ｎ）を、Ｎ個の文字パターンの認識結果として出力す
る。If S0 is smaller than the threshold value at 124, the switch SW remains 0, indicating that the character font could not be determined. At this time, at 126, the character category X (1, n) (n = 1 to
N) is output as the recognition result of the N character patterns.

127は、スイツチSWの値によつて分岐し、１の場合は
入力文書のフオントが決定されているので、128〜136で
上記のＮ個の文字パターンについて、文字フオントをm0
として認識を行う。127 branches according to the value of the switch SW, and in the case of 1, the font of the input document is determined.
As recognition.

128はＮ個の文字パターンについて129〜136の処理を
繰り返すループ制御であり、ｎは文字パターン番号を示
す。129〜136は上述した108〜114と同様であるが、133
の判定が入つている点が異なる。133は判定120と同様で
あり、文字フオントm0をキー情報としてパターン−文字
フオント番号対応テーブル62を参照することにより、文
字フオントm0、文字カテゴリｊ、標準パターン通番ｗの
組み合わせが存在するか否かを判定する。判定結果がYE
Sの場合にのみ134〜136の処理、すなわち類似度計算と
最大類似度の更新を行うものである。128のループが終
了した時点で、文字パターンｎ＝１〜Ｎに対しＸ（1,
n）及びＸ（2,n）に認識結果の文字カテゴリとその標準
パターン番号が求められているが、この場合には前者の
文字カテゴリのみが意味がある。137で文字カテゴリＸ
（1,n）（ｎ＝１〜Ｎ）を認識結果として出力装置７に
出力する。138は終了処理で、たとえば入力された文書
に対して出力装置７に作られたデータセツトのクローズ
処理などを行う。128 is a loop control for repeating the processing of 129 to 136 for N character patterns, and n indicates a character pattern number. 129-136 are the same as 108-114 above, but 133
Is different. 133 is the same as the determination 120.Referring to the pattern-character font number correspondence table 62 using the character font m0 as key information, it is determined whether or not a combination of the character font m0, the character category j, and the standard pattern serial number w exists. Is determined. Judgment result is YE
Only in the case of S, the processes of 134 to 136, that is, the similarity calculation and the update of the maximum similarity are performed. At the end of the loop of 128, X (1,1) for character patterns n = 1 to N
The character category of the recognition result and its standard pattern number are obtained for n) and X (2, n). In this case, only the former character category is significant. Character category X at 137
(1, n) (n = 1 to N) is output to the output device 7 as a recognition result. Reference numeral 138 denotes an end process, for example, a process of closing a data set created in the output device 7 for an input document.

なお、101〜138は一つのサブルーチンとして作成し、
文書が光電変換装置４にセツトされた時点で、あるいは
オペレータの指示により起動されるようにするとよい。Note that 101 to 138 are created as one subroutine,
It is preferable that the document is started when the document is set in the photoelectric conversion device 4 or in response to an instruction from an operator.

また、全体の処理を操作するためオペレータはコンソ
ール８を用いて制御部２と対話を行つているが、その内
容は公知の処理であるので、詳細説明は省略する。In addition, the operator uses the console 8 to interact with the control unit 2 in order to operate the entire process. However, since the content is a known process, the detailed description is omitted.

第５図は上記の実施例に示したように辞書を作成する
ための処理の一例のPAD図である。本処理は、あらかじ
めオフラインで行つておくものである。まず、501で総
計Ｉ個の文字フオント既知の学習用文字パターンが、カ
テゴリコードＣ（ｉ）及び文字フオントコードｍ（ｉ）
とともに入力される（ｉ＝１〜Ｉ）。502では、過去に
学習している辞書が学習用フアイル（図示せず）から読
み込まれる。辞書内の標準パターンの総数をＷとする。
ここでの、辞書の内部表現は第３図とは異なり、第６図
のようになつている。すなわち、辞書は標準パターン制
御テーブル601と本体部602からなる。標準パターン制御
テーブル601はＷ個のエントリ601-1〜601-Wを有し、そ
の各々は文字カテゴリコードＣ（ｗ）とアドレスポイン
タＡ（ｗ）からなる。アドレスポインタは本体部602に
おける各標準パターン本体の先頭アドレスを指す。文字
フオントコードテーブル603,604は第４図の401,402と同
じである。先の説明と同様、登録済みの文字フオントの
総数をＭとする。FIG. 5 is a PAD diagram of an example of processing for creating a dictionary as shown in the above embodiment. This processing is performed offline in advance. First, at 501, a learning character pattern with a total of I character fonts known is a category code C (i) and a character font code m (i).
(I = 1 to I). At 502, dictionaries that have been learned in the past are read from a learning file (not shown). Let W be the total number of standard patterns in the dictionary.
Here, the internal representation of the dictionary is different from that of FIG. 3 and is as shown in FIG. That is, the dictionary includes the standard pattern control table 601 and the main body 602. The standard pattern control table 601 has W entries 601-1 to 601-W, each of which is composed of a character category code C (w) and an address pointer A (w). The address pointer indicates the head address of each standard pattern main body in the main body 602. The character font code tables 603 and 604 are the same as 401 and 402 in FIG. M is the total number of registered fonts as described above.

第５図503は、入力文字Ｉ個について504〜514の処理
を行うループ制御である。ｉは文字パターンの通し番号
である。504では類似度の最大値R0を０にリセツトす
る。505は、登録済みの標準パターンＷ個について506〜
509の処理を行うループ制御である。ｗは標準パターン
の通し番号である。506では入力パターンｉのカテゴリ
Ｃ（ｉ）と標準パターンｗのカテゴリＤ（ｗ）とを比較
し、一致した場合のみ507〜509の処理を行う。507は、
入力文字パターンｉと標準パターンｗとを比較して類似
度Ｒを求める部分である。類似度の計算式及びパラメー
タは文字認識で使用するのと同一とする必要がある。50
8ではそれまでの類似度の最大値R0とＲを比較し、ＲがR
0を超えるとき、509でR0を更新し、ｗをwxとして記憶す
る。標準パターンＷ個について505のループが終了した
後、510で最大値R0と閾値αを比較し、R0がαより大な
らば、511でパターン登録総数Ｗを１だけ増し、512で辞
書本体部602の末尾に入力パターンｉを追加するととも
に、標準パターン制御テーブル601の第（Ｗ＋１）エン
トリ601−（Ｗ＋１）を追加する。513で（Ｗ＋１）をwx
として記憶する。FIG. 503 is a loop control for performing the processing of 504 to 514 for I input characters. i is the serial number of the character pattern. In step 504, the maximum similarity value R0 is reset to zero. 505 is 506 to W for the registered standard patterns.
This is a loop control for performing the process of 509. w is the serial number of the standard pattern. At 506, the category C (i) of the input pattern i is compared with the category D (w) of the standard pattern w, and the processes of 507 to 509 are performed only when they match. 507 is
This is a part for calculating the similarity R by comparing the input character pattern i with the standard pattern w. The calculation formula and parameters of the similarity need to be the same as those used in character recognition. 50
8 compares the maximum similarity value R0 up to that point with R, and R becomes R
If it exceeds 0, R0 is updated at 509 and w is stored as wx. After the loop of 505 for the W standard patterns is completed, the maximum value R0 is compared with the threshold α at 510, and if R0 is larger than α, the total number of registered patterns W is increased by 1 at 511, and the dictionary main body 602 at 512. Of the standard pattern control table 601 and the (W + 1) th entry 601- (W + 1). 513: (W + 1) wx
To be stored.

第５図514は、文字フオントコードテーブル603,604の
更新を行う部分であつて、入力文字の文字フオントコー
ドｍ（ｉ）をテーブル内部で探索し、未登録ならば、そ
の文字フオントのエントリを追加するなどの処理を行
い、さらに入力パターンｉのカテゴリＣ（ｉ）と標準パ
ターン番号wxの情報を用いて、標準パターン対応テーブ
ルの更新を行うものであるが、その詳細は省略する。Ｉ
個の入力パターンについて学習が終了した後メモリの内
容を学習用フアイルに出力しておく。FIG. 514 is a portion for updating the character font code tables 603 and 604. The character font code m (i) of the input character is searched in the table, and if not registered, the entry of the character font is added. And the like, and the standard pattern correspondence table is updated using the information of the category C (i) of the input pattern i and the standard pattern number wx, but the details are omitted. I
After learning is completed for the input patterns, the contents of the memory are output to a learning file.

以上の説明から分かるように、本学習処理はある程度
学習が進行した辞書に追加学習する場合を想定している
が、初期学習すなわちＷ＝０の状態から出発する場合で
も使用できる。その場合は別の処理により、学習用フア
イルのＷに０を書き込んでおけばよい。以上のようにし
て、学習が終了した後、この辞書を文字フオント認識に
使用するためには、学習用フアイル内の辞書の内部表現
（第６図）から第３図のように変換しフアイル３に格納
しておく必要があるが、その実現は容易であるから説明
を省略する。As can be understood from the above description, the present learning process is assumed to perform additional learning to a dictionary to which learning has progressed to some extent. In that case, 0 may be written to W of the learning file by another process. After the learning is completed as described above, in order to use this dictionary for character font recognition, the internal representation of the dictionary (FIG. 6) in the learning file is converted as shown in FIG. , But its realization is easy, so the description is omitted.

以上本発明の一実施例について説明したが、発明の本
質に影響を与えることなく各種の変更を加えることが可
能なことは言うまでもない。例えば、実施例では各入力
文字パターンに対し、最も類似した１個の標準パターン
を選定するとしたが、類似度の高い複数個の標準パター
ンを選出し、120の判定においてはその中のいずれかに
合致すれば判定結果をYESとするようにしてもよく、ま
た文字フオント一致度Ｓとして単に文字フオント、文字
カテゴリ、標準パターンの組み合わせの存在を1,0の二
値で加算するのではなく、各標準パターンに対する類似
度や類似度の順位などを加算するようにしてもよい。Although the embodiment of the present invention has been described above, it goes without saying that various changes can be made without affecting the essence of the invention. For example, in the embodiment, one standard pattern most similar to each input character pattern is selected. However, a plurality of standard patterns having high similarity are selected, and in the determination of 120, any one of the standard patterns is selected. If they match, the determination result may be set to YES. Also, instead of simply adding the combination of a character font, a character category, and a standard pattern as a binary value of 1,0 as the character font matching degree S, The similarity with respect to the standard pattern, the rank of the similarity, and the like may be added.

また、実施例では文字行に対して１個のフオントを決
定しているが、123の処理において最大一致度のフオン
トだけでなく、２位あるいは３位以下の一致度を有する
フオントも記憶するようにして、上位複数個のフオント
を用いて決定するようにしてもよい。Further, in the embodiment, one font is determined for a character line. However, in the process of 123, not only the font having the maximum matching degree but also the font having the second or third or lower matching degree is stored. Then, the determination may be made using a plurality of upper fonts.

また、実施例では文字フオントを決定するのに用いた
Ｎ個のパターンについて、フオントm0を固定して128〜1
36で再度認識するようにしているが、全フオントを用い
た認識結果の中からフオントm0に含まれるものだけを抽
出するようにして、再度認識する処理を省略することに
より高速化を図つてもよい。In the embodiment, the font m0 is fixed to 128 to 1 for the N patterns used to determine the character font.
Although re-recognition is performed at step 36, only those included in the font m0 are extracted from the recognition results using all fonts, and the speedup can be achieved by omitting the re-recognition processing. Good.

また、実施例では127の判定でSWが１の場合、フオン
トを固定して以後の認識を行うが、137で認識結果が得
られたとき、類似度を用いて認識結果の確からしさを評
価し、評価値が低いときは全部のフオントを用いて再度
認識するようにしてもよい。In the embodiment, when the SW is 1 in the determination of 127, the font is fixed and the subsequent recognition is performed. When the recognition result is obtained in 137, the likelihood of the recognition result is evaluated using the similarity. When the evaluation value is low, the recognition may be performed again using all the fonts.

また、実施例では最大類似度を与える標準パターンの
文字カテゴリを認識結果としたが、最大類似度の大きさ
及び次大類似度との差分値などを用いて認識結果の確か
らしさを評価し、評価値が低いときはリジエクトとする
ようにしてもよい。Further, in the embodiment, the character category of the standard pattern that gives the maximum similarity is used as the recognition result.However, the likelihood of the recognition result is evaluated using the magnitude of the maximum similarity and the difference value with the next-largest similarity. When the evaluation value is low, a reject may be made.

また実施例では、文字フオントごとに各文字カテゴリ
に対してどの標準パターンを使用しているかを示すテー
ブルを設けたが、このテーブルの代わりに、各標準パタ
ーンごとにそれを使用する文字フオントの情報を付随さ
せたテーブルを設けるようにしてもよい。In the embodiment, a table indicating which standard pattern is used for each character category is provided for each character font. Instead of this table, information of the character font using the standard pattern is used for each standard pattern. May be provided.

また実施例では、異なるカテゴリの間で類似した標準
パターンがある場合、それぞれのカテゴリに別々に標準
パターンを設けているが、これらを共用させることもで
きる。例えば、文字フオント甲の数字１（イチ）と文字
フオント乙の英小文字ｌ（エル）とを共用して同一の標
準パターンとし、それぞれのアドレスポインタがこの同
一の標準パターンを指すようにできる。そのための一手
法は、第５図の判定506を除去することである。In the embodiment, when there are similar standard patterns between different categories, the standard patterns are separately provided for each category. However, these can be shared. For example, the numeral 1 (Ichi) of the character font A and the lowercase letter l (L) of the character font B can be shared to form the same standard pattern, and each address pointer can point to the same standard pattern. One way to do this is to eliminate decision 506 in FIG.

また実施例では、１行分の文字パターンに対する認識
結果を直接用いて各フオントごとの一致度を評価してい
るが、１行分の文字パターンに対する認識結果について
言語処理あるいは算術演算を行つて、言語的にあるいは
数値としての妥当性が正しいような標準パターンのみに
ついて、各フオントごとの一致度を評価するようにして
もよい。Further, in the embodiment, the matching degree for each font is evaluated by directly using the recognition result for the character pattern for one line, but language processing or arithmetic operation is performed on the recognition result for the character pattern for one line, Only for standard patterns whose linguistic or numerical validity is correct, the degree of coincidence of each font may be evaluated.

〔The invention's effect〕

以上説明したごとく、本発明によれば文字フオント認
識用の辞書のにメモリ容量の増大を防いで、しかも多数
の文字フオントに対しシングルフオントなみの認識率が
達成できる文字認識方式が実現される。As described above, according to the present invention, a character recognition method that can prevent a memory capacity of a dictionary for character font recognition from increasing and achieve a recognition rate similar to a single font for a large number of character fonts is realized.

[Brief description of the drawings]

第１図は本発明の文字認識方式における処理の流れを示
す図、第２図は第１図の処理内容を実施する装置の構成を示す
ブロツク図、第3,4,6図は第１図の処理で使用される辞書の内部構造
の一例を説明する図、第５図は第１図の処理で使用される辞書の作成処理の流
れを示す図である。１……バス、２……制御部、３……フアイル、４……文
書、５……光電変換装置、６……メモリ、７……出力装
置、８……コンソール。FIG. 1 is a diagram showing the flow of processing in the character recognition system of the present invention, FIG. 2 is a block diagram showing the configuration of an apparatus for executing the processing contents of FIG. 1, and FIGS. 3, 4, and 6 are FIG. FIG. 5 is a view for explaining an example of the internal structure of a dictionary used in the processing of FIG. 5, and FIG. 5 is a view showing a flow of processing for creating a dictionary used in the processing of FIG. DESCRIPTION OF SYMBOLS 1 ... Bus, 2 ... Control part, 3 ... File, 4 ... Document, 5 ... Photoelectric conversion device, 6 ... Memory, 7 ... Output device, 8 ... Console.

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開昭58−78276（ＪＰ，Ａ) 特開昭59−109978（ＪＰ，Ａ) 特開昭60−237584（ＪＰ，Ａ) ──────────────────────────────────────────────────続き Continuation of the front page (56) References JP-A-58-78276 (JP, A) JP-A-59-109978 (JP, A) JP-A-60-237584 (JP, A)

Claims

(57) [Claims]

1. A pattern dictionary for storing a plurality of standard patterns for each character category in advance, and a standard for storing for each character category in the pattern dictionary which of the plurality of standard patterns a character font corresponds to. What is claimed is: 1. A character recognition method for performing character recognition having a pattern character font correspondence means, wherein: (a) a document image is scanned and digitized by a photoelectric conversion device, and a character pattern whose character font is undetermined is cut out from the document image (B) sequentially comparing the extracted character pattern whose character font is undetermined with a plurality of standard patterns of the character category in the pattern dictionary to determine a standard pattern similar to the character pattern whose character font is undetermined And (c) a character phone corresponding to the similar standard pattern determined above. Is obtained from the standard pattern character font correspondence means. (D) The above (b) and (c) are sequentially performed on each of the plurality of cut-out character patterns for which the character fonts are undetermined, and A character font having the highest matching degree among character fonts obtained for a character pattern whose character font has not been determined, is determined as the character font of the document image.

2. The standard pattern character font correspondence means according to claim 1, wherein a standard pattern of a character category corresponding to a character font determined as a character font of said document image is obtained by said standard pattern character font correspondence means. A character recognition method characterized by sequentially comparing a cut-out character pattern with a standard pattern of each character category corresponding to the determined character font, and performing character recognition on the document image.

3. The character font extracted from the document image according to claim 2, wherein the character font of the character pattern cut out from the document image is not registered in the standard pattern character font correspondence means. A character recognition method, characterized by updating the contents of the standard pattern character font correspondence means with respect to which of the plurality of standard patterns in the pattern dictionary corresponds to which standard pattern.