JPS6211388B2

JPS6211388B2 -

Info

Publication number: JPS6211388B2
Application number: JP54036499A
Authority: JP
Inventors: Nobuhiko Mori; Toshio Ishikawa; Takushi Senzaki
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1979-03-28
Filing date: 1979-03-28
Publication date: 1987-03-12
Also published as: JPS55129872A

Description

[Detailed description of the invention]

本発明は、文字特徴弁別回路に関する。印字文
字読取装置において、一回目で読み取れなかつた
文字を対象に、特徴抽出の視点を変えて、再度読
み取り処理を行うことがよくあるが、本発明はこ
の際に利用される特徴弁別回路である。印字文字読取装置で用いられている特徴抽出法
としては、代表的なものとして、ストローク・ア
ナリシス法と、パターン・マツチング法とがあ
る。ストローク・アナリシス法は初期の文字読取
装置で良く用いられた方法で、特別に設計された
特殊な文字を用いる必要がある。ストローク・ア
ナリシス法で用いられる文字の一例を第１図に示
す。これらの文字の判定は、上，中，下の水平
線、右上，右下，左上，左下の垂直線などが存在
するか否かで行われる。パターン・マツチング法
は、現在、印字文字読取装置で最も広く用いられ
ている方法であり、JISOCR―Ｂフオントはパタ
ーン・マツチング法で文字を読む場合に最も読み
易いように設計された文字である。第２図は
JISOCR―Ｂフオントの字形である。パターン・
マツチング法とは、入力文字と、全標準文字とを
重ね合せ、どの標準文字と最も良く合うかを調べ
て、入力文字の判定を行う方法であり、通常位置
ずれ補正が行われる。位置ずれ補正とは、入力文
字を標準文字の上で前後左右に動かし、最も良く
合う位置を捜し出す事である。この方法を用いる
と、OCR用として、特にきれいに印字された数
字は、殆ど問題なく読み取る事が出来る。しか
し、読取対象文字が、数字と英字と記号というよ
うに多くなつてくると、パターン・マツチング法
では似通つた文字の間の区別がつきにくくなつて
くる。JISOCR―Ｂフオントでの例を示すと、
（０ゼローＯオー），（Ｄ―Ｏオー），（１―？），
（２―Ｚ），（８―Ｂ）等が区別がつきにくい例で
ある。その理由は、パターン・マツチング法で
は、第３図に示すように、標準文字作成時に、文
字線の領域R₁と白地の領域R₃の間に相当広いど
ちらでもよい領域R₂を設けなければならないの
で、文字輪郭線の小さな変化をつかまえる事が出
来なくなつてしまうからである。従来、パターン・マツチング法を用いている印
字文字読取装置で用いられている、読めない文字
に対する処理法としては、次のような方法が知ら
れていた。一番目は、一つの標準文字では収容し
切れない変形文字に対して、変形標準文字を用意
し、変形標準文字を用いて再読み取りを行う方法
である。二番目は、量子化図形を作成する時の、
白と黒の間のしきい値を変えて、量子化図形の作
り直しをし、以後は同じ標準文字で再読み取りを
行う方法である。三番目は、もう一度走査からや
り直しをして、再読み取りを行う方法である。こ
れらの方法は、印字品質が悪く、読み取り対象文
字が数字だけの場合は、大変有効である。しかし
印字品質は比較的良いのだが、読み取り対象文字
が多い場合にはあまり有効ではない。その理由
は、読めない原因が違つているからで、数字だけ
の場合は、文字の形が悪い為に、どの標準文字と
も一致せず、読めない場合が殆どであるのに対
し、読み取り対象文字が多い場合は、文字の形は
比較的良く、標準文字とも良く合つているのだ
が、２つの標準文字の間の区別が出来ず、読めな
い場合が圧倒的に多い為である。したがつて、本発明の目的は、読み取り対象文
字が多く、２つの標準文字の間の区別が出来なく
て読めなかつた文字に対して、２つの標準文字の
間で最も差異のある所を極めて簡単に詳しく調べ
る事により、２つの内のどちらであるかを有効に
区別できる文字特徴弁別回路を提供することであ
る。次に本発明の原理を説明する。第４図は本発明
による装置で扱える入力図形（パタン）の一例で
あり、24×34メツシユで表わさせる。このような
図形は、まず第５図に示すように、記憶装置内の
長方形の記憶領域に、入力図形の中心と、長方形
の中心が一致するように記憶される。入力図形の
中心とは、高さの二等分線と、幅の二等分線の交
点である。次にこの様にして記憶された図形の外
輪郭線と、長方形の四辺との距離を３個所で計測
する。いま、入力図形が「ＤかＯ（オー）のどち
らかであるが、そのどちらであるかが判らない」
というものであれば、３個所の距離は第６図ａ，
ｂに示すように、L1行目，L2行目，L3行目の長
方形の左辺から図形までの長さとなる。この長さ
を第６図ａ，ｂに示すように、それぞれａ，ｂ，
ｃとすれば、両者を区別する為の値Ｐは次式で計
算される。Ｐ＝（ａ−ｂ）＋（ｃ−ｂ）＝ａ＋ｃ−2b このあと、Ｐの値を調べ、零の近くであればＤ
（第６図ａ）と判定し、ある値以上大きければＯ
（オー）（第６図ｂ）と判定する。ここでＰの値は第７図に示すように、文字線の
幅が太くなつても殆んど変らない。つまりＰは、
文字線の太さにあまり影響されない値である。
又、第８図に示すように、文字が傾いてもL1と
L2の間隔と、L2とL3の間隔がほぼ同じならば、
Ｐの値は殆ど変らない。つまりＰは、３個所
L₁，L₂，L₃の選び方を工夫すれば、文字の傾き
にもあまり影響されない。この３個所は入力文字
がどのような標準文字の間（組み合せ）で区別で
きなかつたものであるかによつてあらかじめ決め
ることができる。第９〜１３図はそれぞれＤとＯ（オー），０
（ゼロ）とＯ（オー），１と？，２とＺ，８とＢの
場合に、３個所L₁，L₂，L₃の位置を示す図であ
る。このように、Ｐの値を用いる事により、輪郭線
の小さな変化をかなり正確につかまえる事が可能
となる。なお、ａ，ｂ，ｃの長さを決定する時
は、第１４図に示すような、印字文字に良く現れ
る１行のへこみ（同図ａ）や、１行の切れ（同図
ｂ）や１行の突出（同図ｃ）や、１行のノイズ
（同図ｄ）等に影響されないように、上下の行
（列の場合は左右の列）を参照して決める必要が
ある。そのため本発明では外輪郭線の位置を計る
のに、連続した３行にて総合的に判断している。
すなわち３行のうち２行においてパターンが存在
する所を外輪郭線の位置としている。次に本発明の一実施例を示した図面を参照して
本発明を詳細にに説明する。第１５図は本発明の一実施例を示す図であり、
図において、区別すべき文字の組によつて、あら
かじめ定められたアドレスが、アドレス・レジス
タ１にセツトされ、メモリ２の内容が読み出され
る。メモリ２には区別すべき文字の組によつてあ
らかじめ決められている行の数（例えばＤとＯ
（オー）の組の場合は９行，17行，26行）が格納
されており、このデータはアドレスレジスタ３に
順番に出力される。またメモリ２には特徴の判定
の際に使用されるＰの値を比較するための閾値も
格納されており、このデータはレジスタ５，６に
出力される。アドレスレジスタ３に格納されたアドレスによ
り、メモリ４に格納されているパターンの特定の
行（本実施例の場合連続した３行）が、それぞれ
指定の行の数に対応して読み出される。すなわ
ち、９行が指定された場合は８行，９行，10行の
パターン（これを対象文字は異なるが第４図に示
した）が読みだされ、１行ずつシフトレジスタ
９，１０，１１に格納される。メモリ４に格納さ
れるパターンについては長方形の記憶領域の中心
にパターンの中心が存在するように正規化されて
いるものとする。連続した３行がシフトレジスタ９，１０，１１
に格納されると、次にシフト信号が存在する間、
クロツクパルスCL₄に同期して、シフトされてゆ
き、シフトレジスタ９，１０，１１における端の
ビツトの情報（パターンのその点が白か黒かとい
う情報）がフリツプフロツプ１２，１３，１４に
それぞれおくられてゆく。フリツプフロツプ１
２，１３，１４はシフトレジスタ９，１０，１１
からの情報が“１”すなわち“黒”のときセツト
されるが、このセツトされる位置はそれぞれの行
の外輪郭の位置に対応している。本実施例におい
ては第１０図に示したような場合にそなえて、フ
リツプフロツプ１２，１３，１４のうち２つがセ
ツトされたとき外輪郭の位置と判断している。そ
のためフリツプフロツプ１２，１３，１４の出力
は２ビツトのフルアダー１５に送られており、桁
上げがあつた場合オアーゲート１７を経て外輪郭
を検出したことを示すエンド信号を出力する。カウンタ１６は、外輪郭がパターンの最後まで
いつても検知されない場合、すなわち所定の行で
パターンが切れている場合にシフトを中止するた
めに、シフトパルスCL₄をカウントしたとえば３
１（１行は34メツシユ）となつたときキヤリー出
力をオアゲート１７に送る。またカウンタ１６か
らのキヤリー出力は読取り不能信号として外部へ
出力され、このパターンについては読取りができ
ないことを示す。この信号はＦ・Ｆ（図示せず）
等にセツトされる。１回目のシフトが終了すると、メモリ２から次
の行（例えば17行）を示すアドレスがレジスタ３
にセツトされメモリ４から次の３行（又は３列）
が読み出されてシフトレジスタ９，１０，１１に
格納され再び外輪郭が検出される。そして次に３
回目（例えば26行目）の外輪郭が検出される。次にＰを計算するためのに必要なａ，ｂ，ｃの
計数及びＰの計算について説明する。クロツクパ
ルスCL₄と同じ周波数で位相が互いに異なるクロ
ツクパルスCL₁，CL₃が別々にゲート２１，２２
に加わつている。すなわち、Ｐ＝ａ＋ｃ−2bを
計算するため、ａ，ｃを計数するときはクロツク
パルスCL₃のみを計数し、２ｂを計数するときは
クロツクパルスCL₁，CL₃をともに計数してい
る。ここで、ａ，ｃ，２ｂの計数の切替えはカウ
ンタ１９の制御によつている。ａ，ｃ，２ｂの値
はゲート２３を経てカウンタ２０に送られる。こ
こでゲート２１，２２の構造から、ゲート２２に
は反転されたクロツクパルス_３が供給されて
いる。Ｐの計算はカウンタ２０で行なわれるが、
Ｐを計算するためには加算と引算を制御する必要
があり、そのためにもカウンタ１９が存在してい
る。カウンタ１９はオアゲート１７からのエンド
信号を計数してゆくものであり、出力の最下位ビ
ツトが制御信号として利用されている。すなわ
ち、ａを計数するときはエンド信号は１個も入つ
ていないから“０”，ｂを計数するときは１個の
エンド信号が入つているから“１”，ｃを計数す
るときは２個のエンド信号が入つているから
“０”である。この制御信号はゲート２１の他方
の入力となつており、“０”の場合はクロツクパ
ルスCL₁は計数されない。またこの制御信号はカ
ウンタ２０のアツプ，ダウンを制御し“０”のと
きアツプであり、“１”のときダウンである。ゲート２３の他方の入力にはフリツプフロツプ
１８からシフト信号が供給されており、このシフ
ト信号が存在する場合にだけ計数が行なわれる。
すなわちフリツプフロツプ１８はスタート信号と
クロツクパルスCL₅のアンドでセツトされ、オア
ゲート１７からのエンド信号でリセツトされる。カウンタ２０で計数されるＰの値については負
になることもあり、複雑化を防ぐために固有の値
Ｖがカウンタ２０に始めから与えられて、カウン
タ２０から出力される計数値PVが負にならない
ようにしている。カウンタ２０からの計数量PV
は比較器７，８に供給されレジスタ５，６に格納
されている閾値と比較されて特徴弁別信号Ｘ，Ｙ
をそれぞれ出力する。次に弁別の条件の一例を示
す。０（ゼロ）とＯ（オー）との組み合せではＰ
≦５（PV≦13）のとき０（ゼロ），Ｐ≧７（PV
≧15）のときＯ（オー）である。ＤとＯとの組み
合せではＰ≦２（PV≦10）のときＤ，Ｐ≧３
（PV≧11）のときＯ（オー）である。１と？との
組み合せではＰ≦２（Ｐ≦10）のとき１，Ｐ≧４
（PV≧12）のとき？である。２とＺとの組み合せ
ではＰ≦６（PV≦14）のときＺ，Ｐ≧８（PV≧
16）のとき２である。８とＢとの組み合せではＰ
≦−４（PV≦４）のとき８，Ｐ≧−２（PV≧
６）のときＢである。文字の組み合せ（区別すべ
きカテゴリー）と、レジスタ５，６の内容と、Ｘ
＝“１”，Ｙ＝“１”のときの判定の関係を次表に
示す。 The present invention relates to a character feature discrimination circuit. In printed character reading devices, characters that cannot be read the first time are often read again by changing the viewpoint of feature extraction, and the present invention is a feature discrimination circuit used in this case. . Typical feature extraction methods used in printed character reading devices include a stroke analysis method and a pattern matching method. Stroke analysis is a method commonly used in early character reading devices and requires the use of specially designed special characters. An example of characters used in the stroke analysis method is shown in FIG. These characters are determined based on the presence or absence of horizontal lines at the top, middle, and bottom, vertical lines at the top right, bottom right, top left, and bottom left. The pattern matching method is currently the most widely used method in printed character reading devices, and the JISOCR-B font is a character designed to be the easiest to read when reading characters using the pattern matching method. Figure 2 is
JISOCR-B font shape. pattern·
The matching method is a method of superimposing an input character with all standard characters and determining which standard character matches the input character best, and usually involves correcting positional deviations. Misalignment correction involves moving the input character forward, backward, left, and right on top of the standard character to find the position that best matches it. Using this method, especially clearly printed numbers for OCR can be read without any problems. However, as the number of characters to be read increases, including numbers, letters, and symbols, it becomes difficult to distinguish between similar characters using the pattern matching method. An example using JISOCR-B font is:
(0 zero oh), (D-O oh), (1-?),
(2-Z), (8-B), etc. are examples that are difficult to distinguish. The reason for this is that in the pattern matching method, as shown in Figure 3, when creating standard characters, a fairly wide area _R2 , which can be either one, must be provided between the character line area _R1 and the blank area _R3 . This is because small changes in character outlines cannot be detected. Conventionally, the following method has been known as a method for processing unreadable characters used in a printed character reading device using a pattern matching method. The first method is to prepare a modified standard character for a modified character that cannot be accommodated in one standard character, and reread the modified standard character. The second thing is when creating a quantized figure,
This method involves changing the threshold between white and black, re-creating the quantized figure, and re-reading the same standard characters from then on. The third method is to start scanning again and reread. These methods are very effective when the print quality is poor and the characters to be read are only numbers. However, although the print quality is relatively good, it is not very effective when there are many characters to be read. The reason for this is that the causes of unreadability are different: in the case of only numbers, the characters are in bad shape and do not match any standard characters, making them unreadable in most cases, whereas the characters to be read are If there are many characters, the shape of the characters is relatively good and they match well with the standard characters, but the two standard characters cannot be distinguished and are often unreadable. Therefore, an object of the present invention is to identify the most different part between two standard characters for characters that cannot be read because there are many characters to be read and it is impossible to distinguish between the two standard characters. It is an object of the present invention to provide a character feature discriminating circuit that can effectively distinguish between two characters by simply examining the character in detail. Next, the principle of the present invention will be explained. FIG. 4 shows an example of an input figure (pattern) that can be handled by the apparatus according to the present invention, and is represented by a 24×34 mesh. As shown in FIG. 5, such a figure is first stored in a rectangular storage area in a storage device such that the center of the input figure coincides with the center of the rectangle. The center of the input figure is the intersection of the height bisector and the width bisector. Next, the distances between the outer contour of the figure stored in this manner and the four sides of the rectangle are measured at three locations. Now, the input shape is either D or O, but I don't know which one it is.
If so, the distances between the three locations are as shown in Figure 6a,
As shown in b, this is the length from the left side of the rectangle in the L1, L2, and L3 lines to the figure. As shown in Fig. 6 a, b, this length is a, b,
c, the value P for distinguishing between the two is calculated by the following formula. P=(a-b)+(c-b)=a+c-2b After this, check the value of P, and if it is close to zero, D
(Figure 6a), and if it is larger than a certain value, O
(Oh) (Fig. 6b) is determined. Here, as shown in FIG. 7, the value of P hardly changes even if the width of the character line becomes thicker. In other words, P is
This value is not affected much by the thickness of the character line.
Also, as shown in Figure 8, even if the letters are tilted, they will still be read as L1.
If the distance between L2 and the distance between L2 and L3 are almost the same, then
The value of P remains almost unchanged. In other words, P is in 3 places
If you choose L ₁ , L ₂ , and L ₃ carefully, it will not be affected much by the inclination of the letters. These three locations can be determined in advance depending on what kind of standard characters (combinations) the input characters are indistinguishable from each other. Figures 9 to 13 are D, O, and 0, respectively.
(Zero) and O (Oh), 1? , 2 and Z, and 8 and B, the positions of three locations L ₁ , L ₂ , and L ₃ are shown. In this way, by using the value of P, it becomes possible to detect small changes in the contour line quite accurately. When determining the lengths of a, b, and c, consider the dents in one line that often appear in printed characters (a in the same figure), breaks in one line (b in the same figure), etc., as shown in Figure 14. It is necessary to determine by referring to the upper and lower rows (in the case of columns, the left and right columns) so as not to be affected by the protrusion of one row (c in the same figure) or noise in one row (d in the same figure). Therefore, in the present invention, the position of the outer contour line is comprehensively determined based on three consecutive lines.
In other words, the location where the pattern exists in two of the three rows is defined as the position of the outer contour line. Next, the present invention will be explained in detail with reference to the drawings showing one embodiment of the present invention. FIG. 15 is a diagram showing an embodiment of the present invention,
In the figure, a predetermined address is set in address register 1 according to a set of characters to be distinguished, and the contents of memory 2 are read out. Memory 2 has a predetermined number of lines depending on the set of characters to be distinguished (for example, D and O).
In the case of the (O) set, rows 9, 17, and 26) are stored, and this data is output to the address register 3 in order. The memory 2 also stores a threshold value for comparing the value of P used in feature determination, and this data is output to registers 5 and 6. Based on the address stored in the address register 3, specific rows (three consecutive rows in this embodiment) of the pattern stored in the memory 4 are read out in correspondence with the designated number of rows. That is, if 9 lines are specified, a pattern of 8 lines, 9 lines, and 10 lines (this is shown in Figure 4, although the target characters are different) is read out, and the shift registers 9, 10, and 11 are read out one by one. is stored in It is assumed that the pattern stored in the memory 4 has been normalized so that the center of the pattern is located at the center of the rectangular storage area. Three consecutive lines are shift registers 9, 10, 11
, then while the shift signal is present,
It is shifted in synchronization with clock pulse _CL4 , and the information of the edge bits in shift registers 9, 10, and 11 (information as to whether that point in the pattern is white or black) is sent to flip-flops 12, 13, and 14, respectively. I'm going to go. flipflop 1
2, 13, 14 are shift registers 9, 10, 11
This is set when the information from the line is "1", that is, "black", and the position where it is set corresponds to the position of the outer contour of each row. In this embodiment, in preparation for the case shown in FIG. 10, the position of the outer contour is determined when two of the flip-flops 12, 13, and 14 are set. Therefore, the outputs of the flip-flops 12, 13, and 14 are sent to a 2-bit full adder 15, which outputs an end signal through an OR gate 17 when a carry occurs, indicating that the outer contour has been detected. A counter 16 counts shift pulses CL ₄ , e.g.
When it becomes 1 (one line is 34 meshes), the carry output is sent to the OR gate 17. Further, the carry output from the counter 16 is outputted to the outside as a read-unable signal, indicating that this pattern cannot be read. This signal is F・F (not shown)
etc. When the first shift is completed, the address indicating the next row (for example, row 17) from memory 2 is transferred to register 3.
The next three rows (or columns) from memory 4 are set to
is read out and stored in the shift registers 9, 10, and 11, and the outer contour is detected again. and then 3
The outer contour of the second time (for example, the 26th line) is detected. Next, the counting of a, b, c necessary to calculate P and the calculation of P will be explained. Clock pulses CL ₁ and CL ₃ having the same frequency as clock pulse CL ₄ and different phases are separately connected to gates 21 and 22.
are joining. That is, in order to calculate P=a+c-2b, only the clock pulse _CL3 is counted when counting a and c, and both clock pulses _CL1 and _CL3 are counted when counting 2b. Here, the switching of the counts a, c, and 2b is controlled by the counter 19. The values of a, c, and 2b are sent to the counter 20 via the gate 23. Here, due to the structure of gates 21 and 22, gate 22 is supplied with an inverted clock pulse ₃ . The calculation of P is performed by the counter 20,
In order to calculate P, it is necessary to control addition and subtraction, and the counter 19 is also provided for this purpose. The counter 19 counts the end signal from the OR gate 17, and the least significant bit of the output is used as a control signal. That is, when counting a, there is no end signal, so it is "0", when counting b, there is one end signal, so it is "1", and when counting c, it is "2". It is "0" because it contains end signals. This control signal is the other input of the gate 21, and when it is "0", the clock pulse _CL1 is not counted. This control signal also controls up and down of the counter 20, and when it is "0" it is up and when it is "1" it is down. The other input of gate 23 is supplied with a shift signal from flip-flop 18, and counting is performed only when this shift signal is present.
That is, flip-flop 18 is set by ANDing the start signal and clock pulse _CL5 , and is reset by the end signal from OR gate 17. The value of P counted by the counter 20 may be negative, so to prevent complications, a unique value V is given to the counter 20 from the beginning so that the count value PV output from the counter 20 does not become negative. That's what I do. Counting amount PV from counter 20
are supplied to comparators 7 and 8, and are compared with threshold values stored in registers 5 and 6 to obtain feature discrimination signals X and Y.
Output each. Next, an example of discrimination conditions will be shown. In the combination of 0 (zero) and O (oh), P
0 (zero) when ≦5 (PV≦13), P≧7 (PV
≧15), it is O (O). In the combination of D and O, when P≦2 (PV≦10), D, P≧3
When (PV≧11), it is O (O). 1? In combination with 1 when P≦2 (P≦10), P≧4
When (PV≧12)? It is. In the combination of 2 and Z, when P≦6 (PV≦14), Z, P≧8 (PV≧
16), it is 2. In the combination of 8 and B, P
8 when ≦-4 (PV≦4), P≧-2 (PV≧
6), it is B. The combination of characters (categories to be distinguished), the contents of registers 5 and 6, and the
The following table shows the determination relationship when Y=“1” and Y=“1”.

【表】クロツクパルスCL₁〜CL₅は周波数が等しく位
相の異なるクロツクパルスであり、クロツクパル
スCL₂は使用されていない。これはＰの計数を安
定に行うためにクロツクパルスCL₁とクロツクパ
ルスCL₃との間に充分な時間をもたせるためであ
る。カウンタ１９はノニシアル信号でクリアさ
れ、カウンタ２０はイニシアル信号で値Ｖがロー
ドされる。これらイニシアル信号、スタート信
号、シフト信号、クロツクパルスCL₁〜CL₅のタ
イミングを第１６図に示す。図において、期間
T₁，T₂，T₃がパターンがシフトレジスタ９，１
０，１１にセツトされる期間であり、期間t₁，
t₂，t₃がパターンがシフトレジスタ９，１０，１
１内でシフトされる期間である。以上の説明からも明らかなように、本発明によ
れば簡単で安価な回路で、２つの文字の間の小さ
な相違点を的確にとらえ、従来のパターンマツチ
ング法では弁別できなかつた文字を効果的に区別
する事が可能となる。[Table] Clock pulses CL ₁ to _{CL 5} have the same frequency and different phases, and clock pulse CL ₂ is not used. This is to provide sufficient time between clock pulse CL ₁ and clock pulse CL ₃ in order to stably count P. The counter 19 is cleared by the non-initial signal, and the counter 20 is loaded with the value V by the initial signal. The timings of these initial signals, start signals, shift signals, and clock pulses CL ₁ to CL ₅ are shown in FIG. In the figure, the period
T ₁ , T ₂ , T ₃ are pattern shift registers 9, 1
The period is set to 0, 11, and the period t ₁ ,
t ₂ and t ₃ are shift registers 9, 10, 1
This is the period that is shifted within 1. As is clear from the above explanation, according to the present invention, small differences between two characters can be accurately detected using a simple and inexpensive circuit, and characters that cannot be distinguished using conventional pattern matching methods can be effectively detected. It is possible to distinguish between them.

[Brief explanation of the drawing]

第１図はストローク・アナリシス法により識別
できる文字例を示す図、第２図はパターン・マツ
チング法により識別できるJISOCR―Ｂフオント
の文字例を示す図、第３図はパターン・マツチン
グ法によるパターン認識の際の規則の例を示す
図、第４図は入力図形の一例を示す図、第５図は
メモリの中心とパターンの中心を一致させること
を説明する図、第６図ａ，ｂは本発明の原理を説
明するための図、第７図は本発明の原理を説明す
るための図で、パターンが太い場合を示す。第８
図は本発明の原理を説明するための図でパターン
が傾いている場合を示す。第９図〜１３図は各種
文字の組み合せの場合のパターンと、外輪郭測定
位置を示す図、第１４図ａ，ｂ，ｃ，ｄはパター
ンにおけるへこみａ，切れｂ，突出ｃ，ノイズｄ
を示す図、第１５図は本発明の一実施例を示す
図、第１６図は第１５図における各種タイミング
信号の関係を示す図。 Figure 1 shows an example of characters that can be identified using the stroke analysis method, Figure 2 shows an example of characters in JISOCR-B font that can be identified using the pattern matching method, and Figure 3 shows pattern recognition using the pattern matching method. Figure 4 is a diagram showing an example of the input figure; Figure 5 is a diagram explaining how to match the center of the memory with the center of the pattern; A diagram for explaining the principle of the invention. FIG. 7 is a diagram for explaining the principle of the invention, and shows a case where the pattern is thick. 8th
The figure is a diagram for explaining the principle of the present invention and shows a case where the pattern is tilted. Figures 9 to 13 are diagrams showing patterns for various character combinations and outer contour measurement positions, Figure 14 a, b, c, and d are dents a, cuts b, protrusions c, and noise d in the patterns.
FIG. 15 is a diagram showing an embodiment of the present invention, and FIG. 16 is a diagram showing the relationship between various timing signals in FIG. 15.

Claims

[Claims]

1 Make sure the center of the character shape is in the fixed position,
a memory for storing a quantized figure; a measuring means for measuring the positions of at least three outer contour lines predetermined for each type of figure in the figure stored in the memory; Calculating means for calculating the relative positional relationship of the other two positions with respect to the middle position among the three positions, and comparing means for comparing the value obtained by the calculating means with a value given in advance and outputting a discrimination output. A character feature discrimination circuit comprising: