JPS6125284A

JPS6125284A - Character recognizing device

Info

Publication number: JPS6125284A
Application number: JP14638284A
Authority: JP
Inventors: Minoru Nagao; 永尾　実
Original assignee: Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1984-07-13
Filing date: 1984-07-13
Publication date: 1986-02-04
Anticipated expiration: 2009-01-26
Also published as: JPH067394B2

Abstract

PURPOSE:To improve efficiency of process by searching an untraced stroke on the basis of search specification information, and by discriminating unknown characters from the searched result, and by shortening time for referring to the dictionaries. CONSTITUTION:A character recognizing device is composed of as shown in the figure to repeat and carry out follow-up search of end & point of characters. When search is over and there is not any other end or point of characters, search character information decides whether it is ''1'' or not. If the decision is ''YES'', search is carried out whether or not untraced stroke information, referring to a region of RAM6. In this case, regarding the already searched stroke information converted to eight directions, presence or absence of the untraced stroke information can be easily discriminated because the information is converted into a space code, etc. accordingly. From the result of this discrimination, characters can be discriminated without collating operations.

Description

【発明の詳細な説明】〈発明の技術分野〉本発明は、未知のカナ、英字、数字等の文字を自動的に
読取って認識する文字認識装置に関する。DETAILED DESCRIPTION OF THE INVENTION Technical Field of the Invention The present invention relates to a character recognition device that automatically reads and recognizes unknown characters such as kana, alphabets, and numbers.

〈発明の背景〉従来この種の文字認識装置では、第６図のフローチャー
トに示す如（、用紙Ｐ上の文字を１文字毎に光電変換し
、これをさらに１１″、−１０１の２値パターンに変換
している。続いて上記２値パターンにつき後の処理を効
果的にするための前処理、すなわち文字が記憶されてい
る用紙Ｐ上の黒点などのノイズ処理や文字図形境界面の
平滑化などを含むｉ連の処理がなされる。次に文字認識
を行なうのに必要ないくつかの特徴（交点、分岐点、ル
ープ数、ストロークの長さ情報など）を抽出する特徴抽
出処理が行なわれる。この抽出結果に応じて多数の文字
より特徴が共通する文字のいくつかが候補文字として絞
り込まれる。この絞り込みにより第１段階の認識が終了
するが、候補文字が複数ある場合にはさらにその中から
唯一の文字の選択を行なうための詳細な識別処理がなさ
れる。この詳細な識別処理は一般には辞書照合処理とい
われる。ここで辞書とは各文字の特徴を一定の系列で格
納したもので通常ＲＯＭで構成されている。それゆえ上
述した候補文字の絞り込みは、特徴抽出によって得られ
た特徴群にもとづき、辞書の中から特定の文字群を選定
するーことをいい辞書誘導処理と呼ばれる。<Background of the Invention> Conventionally, in this type of character recognition device, as shown in the flowchart of FIG. Next, the above binary pattern is pre-processed to make subsequent processing more effective, i.e. noise processing such as black dots on paper P where characters are stored and smoothing of character/figure boundaries. Next, feature extraction processing is performed to extract several features (intersections, branch points, number of loops, stroke length information, etc.) necessary for character recognition. According to this extraction result, some of the characters with common characteristics are narrowed down as candidate characters.This narrowing down completes the first stage of recognition, but if there are multiple candidate characters, the candidate characters are further narrowed down. Detailed identification processing is performed to select the only character from the list.This detailed identification processing is generally referred to as dictionary matching processing.Here, a dictionary is a storage of the characteristics of each character in a fixed series. Usually, it consists of a ROM.Therefore, the above-mentioned narrowing down of candidate characters is called dictionary guidance processing, in which a specific group of characters is selected from a dictionary based on the feature group obtained by feature extraction.

上述したようにこの辞書誘導処理に続いて、辞書中に予
じめ記憶される各文字の標準となる特徴系列（以下標準
パターンという）と認識対象文字の対応する特徴系列の
一致度を検出する照合処理がなされる。この照合処理結
果にもとづき読取った文字が特定されたり、あるいは認
識不能として認識結果が出力される。As described above, following this dictionary guidance process, the degree of matching between the standard feature series of each character (hereinafter referred to as standard pattern) stored in advance in the dictionary and the corresponding feature series of the recognition target character is detected. Verification processing is performed. Based on the result of this collation process, the read character is specified, or the recognition result is output as unrecognizable.

次に辞書の一例について説明する。Next, an example of a dictionary will be explained.

第７図（Ａｌは、同図（Ｂｌのカナ文字「ア」の標準パ
ターンＤ１を図形化したものであり、ここに示した標準
パターンＤ１が辞書として記憶されている。すなわちこ
の辞書において文字「ア」は、３つの端点■と、１個の
分岐点［株］と、番号値で表わされる１０個の方向成分
（ベクトル）。Figure 7 (Al is a graphic representation of the standard pattern D1 of the kana character "A" in Figure 7 (Bl), and the standard pattern D1 shown here is stored as a dictionary. That is, in this dictionary, the character "A" is ``A'' has three end points ■, one branch point [stock], and 10 directional components (vectors) expressed by number values.

とで構成されている。なおここに示すベクトル番号値は
文字のストローク方向がそれぞれ第８図に示す０から７
までの８方向のいずれの方向であるかを区別するための
ものである。したがって上記標準パターンＤ１は文字「
ア」が端点■から始まって２・１・３・４・５というベ
クトルをこの順番で有した後分岐点■で終了する要素と
、端点■から始まりベクトル２・３・４・５および端点
■をこの順番で有する要素とを含むことを示している。It is made up of. Note that the vector number values shown here are in the stroke direction of the characters from 0 to 7 shown in Figure 8.
This is to distinguish which of the eight directions up to. Therefore, the standard pattern D1 is the character "
A'' starts from the endpoint ■ and has vectors 2, 1, 3, 4, 5 in this order and then ends at the branching point ■, and an element that starts from the endpoint ■ and has vectors 2, 3, 4, 5 and the endpoint ■ In this order, the elements that have the following elements are included.

これらの各要素を図示したものが第７図（Ｂｌの下段に
示すものである。A diagram illustrating each of these elements is shown in FIG. 7 (lower part of Bl).

ここで、第６図のフローチャートで説明した文字認識装
置のより具体的な処理手順を上記した第７図および、第
９図ないし第１３図を用いて説明する。Here, a more specific processing procedure of the character recognition device explained using the flowchart of FIG. 6 will be explained using FIG. 7 and FIGS. 9 to 13 described above.

用紙Ｐから読取られた文字は２値化処理、前処理されて
、画像メモリに記憶される。ここで記憶される文字パタ
ーンは第９図に示すように筆跡そのままの太目のもので
ある。この太目の文字パターンはいわゆる細め等の処理
が施され、第１０図に示すように文字の骨格を示す情報
に変換される。この変換にあたって前記骨格は、第１１
図に示すＡ（左右）、Ｂ（上下）、Ｃ（斜右上り）、Ｄ
（斜右下り）の４方向に正規化される。ここで行なう正
規化は第８図に示した８方向成分への区別を容易にする
ためのも９で、この変換処理に続いて読取った文字の特
徴を得る処理がなされる。この処理では、複数の文字特
徴（ループ、交点、分岐点、端点等）が抽出され、つぎ
にこれら文字特徴を用いて辞書誘導−処理が実施される
。その結果いくつかの候補文字に絞られると、つぎに未
知文字の特徴系列を求め、各候補文字の標準パターンと
辞書照合される。この特徴系列は、辞書すなわち第７図
（Ｂ）の標準パターンＤ１と同様に表現され、その結果
、第１２図に示すような■１３４■■３４■という特徴
系列が得られる。この特徴系列は読取対象文字すなわち
未知の文字のパターンを示すものであるから、以下未知
パターンと呼ぶ。Characters read from paper P are binarized and preprocessed, and then stored in an image memory. The character pattern stored here is a thick one that is the same as the handwriting, as shown in FIG. This thick character pattern is subjected to so-called thinning processing and converted into information indicating the skeleton of the character as shown in FIG. In this conversion, the skeleton is
A (left and right), B (up and down), C (diagonally upward to the right), and D shown in the diagram.
(diagonally downward to the right). The normalization performed here is 9 to facilitate differentiation into the eight directional components shown in FIG. 8. Following this conversion process, a process for obtaining the characteristics of the read character is performed. In this process, a plurality of character features (loops, intersections, branch points, end points, etc.) are extracted, and then dictionary guidance processing is performed using these character features. Once the candidate characters have been narrowed down to a few, the characteristic sequence of the unknown character is then determined and compared with the standard pattern of each candidate character in a dictionary. This feature series is expressed in the same way as the standard pattern D1 in the dictionary, ie, FIG. 7(B), and as a result, the feature series ■134■■34■ as shown in FIG. 12 is obtained. Since this feature series indicates a pattern of the character to be read, that is, an unknown character, it is hereinafter referred to as an unknown pattern.

以上のようにして得られた未知パターンは辞書誘導処理
の結果得られた各候補文字の標準パターンとその一致度
が照合される。金弟１２図に示す未知パターンと、第７
図例に示す「ア」の標準パターンとが照合されたとする
と、この照合処理は第１３図に示すように、標準パター
ン中の方向成分の順番と未知パターンの方向成分の順番
とが一致しているか等゛、一定の規則にしたがって行な
われる。この例の場合、未知パターンの方向成分は標準
パターンが持つ方向成分にすべて含まれ、かつ順番も一
致しているので、未知パターンはカナ文字「ア」として
認識され出力される。The unknown pattern obtained as described above is compared with the standard pattern of each candidate character obtained as a result of the dictionary guidance process for its degree of matching. The unknown pattern shown in Figure 12 and the 7th pattern
Assuming that the standard pattern of "A" shown in the figure example is matched, this matching process will be performed as shown in FIG. It is done according to certain rules. In this example, the unknown pattern's directional components are all included in the directional components of the standard pattern and also match in order, so the unknown pattern is recognized and output as a kana character "a".

一般的な文字認識装置においては、以上のようにして文
字認識がなされるが、上記説明で理解できるように文字
認識における照合処理においては、未知パターンを構成
している要素の数や方向成分の順番が重要であることが
わかる。In a general character recognition device, character recognition is performed as described above, but as can be understood from the above explanation, in the matching process in character recognition, the number of elements making up the unknown pattern and the directional component are It turns out that the order is important.

ところで第、１４図および第１５図に示す未知文字ｒＡ
ＪおよびｒＰＪの場合、この両者゛の文字特徴を抽出し
て比較すると、端点数、分岐点数、連結数等の各特徴が
共に一致している。従ってこれら文字については、辞書
誘導処理の結果、ｒＡＪおよびｒＰＪの両文字が候補文
字として絞り込まれることになる。このような場合、従
来の文字認識装置においては、未知パターンと各候補文
字ｒＡＪｒＦ？Ｊｉ標準パターンとの照合処理を画一的
に実行しており、これがため辞書照合動作の時間が長く
かかり、処理効率の低下を招く等の問題があった。By the way, the unknown character rA shown in FIGS. 14 and 15
In the case of J and rPJ, when character features of both characters are extracted and compared, each feature such as the number of end points, the number of branch points, and the number of connections are the same. Therefore, regarding these characters, as a result of the dictionary guidance process, both characters rAJ and rPJ are narrowed down as candidate characters. In such a case, in the conventional character recognition device, the unknown pattern and each candidate character rAJrF? The matching process with the Ji standard pattern is performed in a uniform manner, which causes problems such as the dictionary matching operation taking a long time and resulting in a decrease in processing efficiency.

〈発明の目的〉本発明は、上記した従来の文字認識装置の欠点を解消し
、辞書照合の処理時間を短縮し、高速且つ高精度の文字
認識装置を提供することを目的とする。<Object of the Invention> An object of the present invention is to eliminate the drawbacks of the conventional character recognition device described above, shorten the processing time for dictionary comparison, and provide a high-speed and highly accurate character recognition device.

〈発明の構成および効果〉上記目的を、達成するため、本発明では、標準パターン
を格納する辞書記憶手段に、各文字毎に未追跡ストロー
クの探索の要否を指定する探索指定情報を記憶させてお
き、辞書照合動作に際して未知文字の特徴系列を求める
とき、前記探索指定情報に基づき未追跡ストロークを探
索し、その探索結果から未知文字を識別するようにした
。<Configuration and Effects of the Invention> In order to achieve the above object, the present invention stores search designation information that designates whether or not to search for untracked strokes for each character in a dictionary storage means that stores standard patterns. In addition, when determining the feature sequence of an unknown character during a dictionary matching operation, an untracked stroke is searched based on the search designation information, and the unknown character is identified from the search result.

通常未知文字の特徴系列（未知パターン）を求める場合
、例えば第１６図に示す如く、文字の上側から文字端点
探索の走査を開始し、文字端点を検出したとき、その文
字端点からストローク追跡を行ない、文字端点や分岐点
て追跡を完了させている。かくして第１４図の未知文字
ｒＡＪについては第１７図（１）で示す文字の一部分の
ストローク追跡が実行され、また第１５図の未知文字ｒ
ＰＪについては第１８図（１）　（２）に示す文字全部
のストローク追跡が実行されることになる。従って本発
明の実施により、文字「Ａ」「Ｐ」について未埠跡スト
ロークの探索を指定したとき、未知文字ｒＡＪでは第１
７図（２）Ｉこ示す未追跡ストロークが存在しており、
この未追跡ストロークの有無からこれら両文字の区別−
が可能である。よって本発明によれば、パターン照合に
入る前の段階で文字を識別でき、辞書照合動作の時間を
短縮し得、処理効率の向上に貢献する等、発明目的を達
成した効果を奏する。Normally, when finding a feature sequence (unknown pattern) of an unknown character, for example, as shown in Figure 16, scanning for character endpoints is started from the top of the character, and when a character endpoint is detected, stroke tracing is performed from that character endpoint. , character endpoints and branching points to complete the tracking. Thus, for the unknown character rAJ in FIG. 14, the stroke tracing of the part of the character shown in FIG. 17(1) is executed, and for the unknown character rAJ in FIG.
For PJ, stroke tracing of all characters shown in FIGS. 18(1) and 18(2) is executed. Therefore, by implementing the present invention, when searching for unburied strokes for characters "A" and "P" is specified, for unknown characters rAJ, the first
Figure 7 (2) There is an untracked stroke shown in I.
Distinguish between these two characters based on the presence or absence of untracked strokes.
is possible. Therefore, according to the present invention, characters can be identified at a stage before pattern matching begins, the time required for dictionary matching operations can be shortened, and the object of the invention can be achieved, such as contributing to improvement of processing efficiency.

〈実施例の説明〉第１図は本発明にかかる文字認識装置を示し、用紙Ｐ上
の文字はＡ／Ｄ変換回路１で光学的に読取られデジタル
データに変換されて画像メモリ２に記憶される。−Ａ／
Ｄ変換回路１にはＣＰＵ３およびストローク検出回路４
が結合なされており、ｃｐｕ３は照合処理を含む文字認
識処理をプログヴムにしたがい実行し、またストローク
検出回路４は読取られた文字をたとえば第２図に示すよ
うに４方向ストロークに変換する。<Description of Embodiments> FIG. 1 shows a character recognition device according to the present invention, in which characters on paper P are optically read by an A/D conversion circuit 1, converted into digital data, and stored in an image memory 2. Ru. -A/
The D conversion circuit 1 includes a CPU 3 and a stroke detection circuit 4.
The CPU 3 executes character recognition processing including matching processing according to the program, and the stroke detection circuit 4 converts read characters into four-directional strokes as shown in FIG. 2, for example.

また図中、ＲＯＭ５は認識対象文字の標準パターンや未
追跡ストロークに関する探索指定情報を記憶する辞書お
よびプログラム用のメモリであり、ＲＡＭ５は入力文字
の４方向ストロークや８方向変換データの記憶用および
プログラム実行時のワークエリア用のメモリである。Ｒ
ＡＭ６の記憶内容は第２図に示すストローク抽出後のイ
メージに対応して格納されており、その具体的な内容は
第３図に示しである。In the figure, ROM5 is a dictionary and program memory for storing standard patterns of characters to be recognized and search designation information regarding untracked strokes, and RAM5 is a memory for storing 4-direction strokes and 8-direction conversion data of input characters, and a program memory. This is memory for the work area during execution. R
The memory contents of AM6 are stored corresponding to the image after stroke extraction shown in FIG. 2, and the specific contents are shown in FIG. 3.

第３図中、ＴＥＲＭは文字端点情報の格納領域であり、
Ｔ１は１つの端点、ＳＣはＣ方向のストロークを、ＳＬ
はそのＣ方向の追番を夫々示す。同様にＴ２は他の端点
を示しており、ＳＤはＤ方向のストロークであることを
示す。またり、Ｒは文字端点のストロークに対する位置
を一示しており、Ｌは端点位置がストロークの左側であ
ることを、またＫは同様に右側であることを夫々示す。In FIG. 3, TERM is a storage area for character end point information,
T1 is one end point, SC is the stroke in the C direction, SL
indicates the serial number in the C direction. Similarly, T2 indicates another end point, and SD indicates a stroke in the D direction. Further, R indicates the position of the character endpoint with respect to the stroke, L indicates that the endpoint position is on the left side of the stroke, and K indicates that the endpoint position is on the right side.

ＣＨＫＮはストローク間の接続情報（屈折点）、が格納
される領域であり、図示例では追番Ｓ１のＣ方向ストロ
ークのｋ（右側）と追番Ｓ１のＤ方向ストロークのしく
右側）とが屈折点として接続されていることを意味して
いる。ＣＨＢＫは文字分岐点情報の格納領域を示し、図
示例では、文字分岐点Ｂ１が追番Ｓ１のＡ方向ストロー
クのしく左側）に、また文字分岐点Ｂ２が追番Ｓ１のＡ
方向ストロークのｋ（右側）に存在位置することを意味
している。CHKN is an area where connection information (reflection points) between strokes is stored, and in the illustrated example, k (right side) of the C direction stroke of serial number S1 and the right side of the D direction stroke of serial number S1 are inflections. It means that they are connected as points. CHBK indicates a storage area for character branch point information, and in the illustrated example, the character branch point B1 is located at the left side of the stroke in the A direction of the serial number S1, and the character branch point B2 is located at the A direction of the serial number S1.
This means that it is located on the k (right side) of the direction stroke.

ＡＳＴＭはＡストロークの端点の２次元座標アドレス情
報が格納される領域であり、同様にＣ３ＴＭ、ＤＳＴＭ
はＣストロークおよびＤストロークの端点の２次元座標
アドレス情報が格納される領域である。またＦＯＮＴは
未知文字の特徴系列（未知パターン）を求める際の追跡
過程を一時記憶する領域であり、またＳＣＭは未知文字
の８方向に変換された特徴系列を格納する領域である。ASTM is an area where two-dimensional coordinate address information of the end point of the A stroke is stored, and similarly C3TM, DSTM
is an area where two-dimensional coordinate address information of the end points of the C stroke and D stroke is stored. Further, FONT is an area for temporarily storing the tracking process when finding a feature sequence (unknown pattern) of an unknown character, and SCM is an area for storing a feature sequence of the unknown character converted in eight directions.

第４図はＲＯＭ５の記憶内容のうち、辞書部分を例示し
たものであり、図示例は第１５図の文字ｒＰＪを示して
いる。この辞書には各文字格納領域毎に先ず未追跡スト
ロークの探索を指定する探索指定情報が、続いて標準パ
ターンを構成する特徴系列が、夫々格納しである。前記
探索指定情報は、２値データｒｌＪｒｏｊ　をもって表
わされ、データ「１」は未追跡ストロークが存在するか
否かを探索することを意味し、一方データ「０」は、未
追跡ストロークが存在するか否かを探索しないことを意
味している。FIG. 4 shows an example of the dictionary portion of the contents stored in the ROM 5, and the illustrated example shows the character rPJ in FIG. 15. This dictionary stores, for each character storage area, search designation information that designates a search for untracked strokes, followed by a feature series that constitutes a standard pattern. The search designation information is represented by binary data rlJroj, where data "1" means searching for the presence or absence of untracked strokes, while data "0" means searching for the presence or absence of untracked strokes. This means that we do not search for whether or not.

第５図は、上記実施例にかかる装置の照合処理動作を示
し、以下、辞書誘導処理により、候補文字が「Ａ」「Ｐ
」に絞られた場合における未知文字−ｒＡＪと文字ｒＰ
Ｊの標準パターンとの照合処理について説明する。FIG. 5 shows the collation processing operation of the apparatus according to the above embodiment.
Unknown characters - rAJ and character rP when narrowed down to ``
The process of matching J with the standard pattern will be explained.

まずステップ１１（以下ｒＳＴ１１ｊの如（示す）にお
いて、走査カウンタを初期化し、第２図に示すイメージ
を、ＲＡＭ５上で走査する。First, in step 11 (hereinafter referred to as rST11j), a scanning counter is initialized and the image shown in FIG. 2 is scanned on the RAM 5.

図示例では、第１７図に示す如く、上辺左側から右方へ
走査し−でゆくことになる。この時各走査点毎にＲＡＭ
５のＡＳＴＭ領域ＤＳＴＭ領域のストローク端点の２次
元アドレスと走査カウンタの座標が一致するか否かが比
較される（ＳＴ１２）。比較の結果側座標が一致しない
限り走査カウンタが更新され、この座標アドレス比較が
繰り返される（ｓＴｌ２．１４．１５）。そして走査カ
ウンタの内容とＲＡＭ・６・の端点２次元アドレスが例
えば座標（７，３）にて一致すると、このストローク端
点を仮の起点とし、次にこのストローク端点が文字端点
であるか否かを判定する（ＳＴＩ３）。この仮の起点の
座標（７，３）はＣ８ＴＭ領域に格納されており、Ｃス
トロークのストローク番号２の右側端点に割当する。In the illustrated example, as shown in FIG. 17, scanning is performed from the left side of the upper side to the right. At this time, the RAM for each scanning point
The two-dimensional address of the stroke end point in the ASTM area DSTM area No. 5 and the coordinates of the scan counter are compared to see if they match (ST12). Unless the coordinates on the comparison result side match, the scan counter is updated and this coordinate address comparison is repeated (sTl2.14.15). If the content of the scan counter and the two-dimensional address of the end point in RAM 6 match, for example, at coordinates (7, 3), this stroke end point is set as a temporary starting point, and then whether or not this stroke end point is a character end point is determined. (STI3). The coordinates (7, 3) of this temporary starting point are stored in the C8TM area, and are assigned to the right end point of stroke number 2 of the C stroke.

ところがこの端点情報はＴＥＲ’Ｍ領域において文字端
点の情報として存在しておらず、従ってストローク端点
−（ｓｃ、ｓｌ、ｋ）は文字端点てないと判定される。However, this end point information does not exist as character end point information in the TER'M area, and therefore, it is determined that the stroke end point -(sc, sl, k) is not a character end point.

これによりＳＴＩ　３の判定がｔ　Ｎ　０１となって、
５Ｔ１４．１５．１２の処理フローが繰返し実施される
。′ かくて走査カウンタの内容とＲＡＭ５の端点２次元アド
レスとは座標（３，２６）にて一致するが、この座標は
Ｃ５ＴＭｉ域においてＣストロークのストローク番号１
・の左側端点であるとして格納され、而もこのストロー
ク端点はＴＥＲＭ領域中区域中端点Ｔ１として存在して
いるから、端点（ｓｃ、ｓｌ、Ｌ）は文字端点であると
判断される。そこで文字端点マーク■をＲＡＭ６の３０
Ｍ領域に格納し、更にこの文字端点情報（ｓｃ、ｓｌ、
Ｌ）をＦＯＮＴ領域に格納する。As a result, the determination of STI 3 becomes t N 01,
The processing flow of 5T14.15.12 is executed repeatedly. 'Thus, the contents of the scan counter and the two-dimensional address of the end point in RAM5 match at the coordinates (3, 26), but this coordinate is stroke number 1 of the C stroke in the C5TMi area.
Since this stroke end point exists as the middle end point T1 of the TERM area, the end points (sc, sl, L) are determined to be character end points. Then, move the character end mark ■ to 30 in RAM6.
This character end point information (sc, sl,
L) is stored in the FONT area.

この文字端点の検出により５Ｔ１３がＷＹＥＳｌとなり
、つぎにｓＴｌ　６において、前記ＦＯＮＴ領域のスト
ローク情報を用いて、第８図に示す８方向データへの変
換処理が実行される。図示例のＣ方向ストロークで左か
ら右へのストロークは方向ｌＯ“　に変換され、この方
向データ１０ＩがＲＡＭ６の３０Ｍ領域に格納される。By detecting this character end point, 5T13 becomes WYES1, and then in sT16, the conversion process to the 8-direction data shown in FIG. 8 is executed using the stroke information of the FONT area. A stroke from left to right in the C direction stroke in the illustrated example is converted into a direction lO'', and this direction data 10I is stored in the 30M area of the RAM 6.

続いてＦＯＮＴ領域に格納したストロークからさらに接
続情報を探すために、このストロークの゛もう一方のス
トローク端点情報（ＳＣ，５１、Ｒ）にＦＯＮＴ領域を
変更する。そしてＣ方向ストロークのストローク番号１
の右側端（８））が文字端点であるか、あるいは屈折点
を構成しているかをＦＯＮＴ領域のデータを参照して、
ＴＥＲＭ領域およびＣＨＫＮ領域内の内容を調べてチェ
ックする。図示例ではＣＨＫＮ領域で（ＳＣ１Ｓ１、Ｒ
）を見出すことができ、５Ｔ１７の「接続情報有か」の
判定が”ＹＥＳ’となって、５Ｔ１６へ戻り、同様に８
方向変換処理が実行される。Next, in order to further search for connection information from the stroke stored in the FONT area, the FONT area is changed to the other stroke end point information (SC, 51, R) of this stroke. And stroke number 1 of C direction stroke
Refer to the data in the FONT area to determine whether the right end (8)) is a character end point or an inflection point.
Examine and check the contents in the TERM and CHKN areas. In the illustrated example, in the CHKN area (SC1S1, R
) can be found, and the determination of "Connection information available" of 5T17 becomes "YES", and the process returns to 5T16, and similarly 8.
A direction conversion process is executed.

かくして３０Ｍ領域に方向データｌ　Ｏｌ　１２１、更
には文字端点マーク■が格納された段階に至リ、ｓ’ｒ
１７の判定が＠　Ｎ　Ｑ　＃となって５ＴＩＩへ戻り、
再度走査カウンタを初期化して、前記文字端点の探索フ
ロー（ｓＴｌ　２．１３．１４．１５）が繰り返し実施
される。本実施例の未知文字「Ａ」の場合、他の文字端
点は存在しないから、５Ｔ１５の判定が’ＹＥＳ”とな
って５Ｔ１８へ進む。この５Ｔ１８ぼ、前記探索指定情
報が「１」か否かを判定しており、この場合、その判定
は’ＹＥＳ”　（第４図参照）であるから、つきにＳＴ
Ｉ　９において、ＲＡＭ５のＴ、　Ｅ　ＲＭ領域、ＣＨ
ＫＮ領域、ＣＨＢＫ領域を参照して、未追跡のストロー
ク情報゛が残っているか否かを探索する。この場合、８
方向変換された探索済のストローク情報については、そ
の都度スペースコード等に変換しであるため、容易に未
追跡ストローク情報の有無を判別できる。本実施例の場
合、ＣＨＢＫ領域にＡ方向の未追跡ストローク情報が存
在しており、従って５Ｔ２０の判定は”ＹＥＳ”となり
、この時点で未知文字は「Ｐ゛」でないと判定し得、５
Ｔ１１の照合動作なくして第１７図の処理フローを完了
する。In this way, we have reached the stage where the direction data lOl 121 and further the character end point mark ■ have been stored in the 30M area.
The judgment of 17 becomes @N Q # and returns to 5TII,
The scan counter is initialized again and the character endpoint search flow (sTl 2.13.14.15) is repeated. In the case of the unknown character "A" in this embodiment, there are no other character endpoints, so the determination at 5T15 is 'YES' and the process proceeds to 5T18.At this 5T18, whether or not the search designation information is "1" is determined. In this case, the decision is 'YES' (see Figure 4), so ST
In I9, T, E RM area of RAM5, CH
With reference to the KN area and CHBK area, it is searched whether or not untracked stroke information remains. In this case, 8
Since the searched stroke information whose direction has been changed is converted into a space code or the like each time, it is possible to easily determine the presence or absence of untracked stroke information. In the case of this embodiment, there is untracked stroke information in the A direction in the CHBK area, so the determination of 5T20 is "YES", and at this point it can be determined that the unknown character is not "P",
The processing flow in FIG. 17 is completed without the verification operation at T11.

[Brief explanation of drawings]

第１図はこの発明が実施される文字認識装置の回路ブロ
ック図、第２図は第１図に示す文字認識装置に人力され
る文字例を示す図、第３図は第１図に示す文字認識装置
のＲＡＭの格納領域配置例を示す図、第４図は同ＲＯＭ
の辞書格納領域の一文字分のデータ配置例を示す図、第
５図は第１図に示す文字認識装置の動作を説明するため
のフローチャート″、第６図は一般的に文字認識装置の
概略を説明するためのフローチャート、第７図（Ａ）　
（Ｂｌは文字例「ア」を方向成分に変換した標準パター
ン例を説明するための図、第８図は文字のストロークの
方向成分を８方向に正規化する場合の方向番号を示す図
、第９図は文字例「ア」の光学的に読取った状態でのパ
ターンを示す図、第１Ｏ図は第９図の文字「ア」を細め
てストロークの骨格のみを示した図、ｉｉ１図は４方向
成分を挽肌するための図、第１２図は第１０図に示す文
字「ア」を端点、方向成分、屈折点で表わす場合を説明
するための図、第１３図は第７図に示す文字「ア」の標
準パターンと第１０図、第１２図に示す文字の照合によ
る文字認識動作を説明するための図、第１４図は文字ｒ
ＡＪのストロークを示す図、第１５図は文字ｒＰＪのス
）ローフを示す図、第１６図は文字の探索走査を開始す
る起点辺を説明する図、第１７図（１）　（２）は第１
４図の文字「Ａ」の追跡ストロークおよび未追跡ストロ
ークを示す図、第１８図（１）　（２）は文字「Ｐ」の
追跡ストロークを示す図である。１・・・Ａ／Ｄ変換回路　２・・・画像メモリ３・・・
ＣＰＵ　　　　４・・・ストローク検出回路５、・・Ｒ
ＯＭ　　　　　６・・・ＲＡＭオ乙　図分８　図牙・０図　　　　冴７２図洲　／２　爾テ／３ｊコFIG. 1 is a circuit block diagram of a character recognition device in which the present invention is implemented, FIG. 2 is a diagram showing an example of characters manually entered into the character recognition device shown in FIG. 1, and FIG. 3 is a diagram showing the characters shown in FIG. 1. A diagram showing an example of the storage area arrangement of the RAM of the recognition device, FIG. 4 is the same ROM
5 is a flowchart for explaining the operation of the character recognition device shown in FIG. 1, and FIG. 6 is a general outline of the character recognition device. Flowchart for explanation, FIG. 7 (A)
(Bl is a diagram for explaining a standard pattern example in which the character example "A" is converted into a directional component, Figure 8 is a diagram showing direction numbers when the directional component of a character stroke is normalized into 8 directions, Figure 9 is a diagram showing the optically read pattern of the character "A", Figure 1O is a diagram showing only the stroke skeleton by narrowing the character "A" in Figure 9, and Figure ii1 is 4. A diagram for refining the directional component, Figure 12 is a diagram for explaining the case where the letter "A" shown in Figure 10 is represented by an end point, a directional component, and an inflection point. Figure 13 is the diagram shown in Figure 7. A diagram for explaining the character recognition operation by matching the standard pattern of the character "A" with the characters shown in Figures 10 and 12. Figure 14 is for the character r.
A diagram showing the stroke of AJ, Figure 15 is a diagram showing the stroke of character rPJ, Figure 16 is a diagram explaining the starting point edge for starting the character search scan, and Figures 17 (1) and (2) are diagrams showing the stroke of character rPJ. 1
Figures 18(1) and 18(2) show traced strokes and untracked strokes of the character "A" in Figure 4, and Figures 18(1) and 18(2) depict traced strokes of the character "P". 1... A/D conversion circuit 2... Image memory 3...
CPU 4...Stroke detection circuit 5,...R
OM 6...RAM Otsu Diagram 8 Zuga・0 Diagram Sae 72 Zuzu /2 Erte/3jko

Claims

[Claims] Means for optically reading unknown characters to obtain a character pattern, means for extracting features of the character pattern, means for storing the extracted features, and characters having common extracted features. means for guiding a group as candidate characters; dictionary storage means for storing a unique standard pattern for each character and search designation information for specifying a search for untracked strokes for each character; and the guided character. A character recognition device comprising matching means for searching for features of an unknown pattern for each character of a group based on the search designation information and comparing it with a standard pattern.