JPS62271088A

JPS62271088A - Pattern recognizing device

Info

Publication number: JPS62271088A
Application number: JP61113790A
Authority: JP
Inventors: Mitsumasa Sugiyama; 杉山　光正
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1986-05-20
Filing date: 1986-05-20
Publication date: 1987-11-25

Abstract

PURPOSE:To obtain a device which can recognize a continuous character, by considering enough a writing order variation at the time of generating a pattern, and thereafter, resolving the pattern into a fundamental stroke, and comparing it with a stroke of a pattern which is registered in advance. CONSTITUTION:When a writer writes a character to a tablet 1 with an input pen 2, a coordinate of the pen and up-down information of the pen are sent to a feature extracting part 3, and in the feature extracting part 3, from an inputted character pattern of one character portion, a feature is extracted and sent to a character discriminating part 4. In the character discriminating part 4, the feature which is extracted from the inputted character pattern is compared with a character pattern of each character which has been registered in advance in a dictionary part 5 and its similarity is derived, and the character having the highest similarity is outputted as a result of recognition. In such a case, the inputted pattern is resolved into a fundamental stroke, the resolved stroke information is compared with stroke information containing an end point coordinate of a stroke which is registered in advance, and based on a result of comparison, the pattern is recognized.

Description

【発明の詳細な説明】３、発明の詳細な説明［産業上の利用分腎］本発明はパターン認識装置、特に手書文字、記号、図形
等の筆跡情報をもとにして文字、記号、図形等を認識す
るパターン認識装置に関する。[Detailed Description of the Invention] 3. Detailed Description of the Invention [Industrial Application] The present invention is a pattern recognition device, in particular, a pattern recognition device that recognizes characters, symbols, and shapes based on handwriting information such as handwritten characters, symbols, and figures. The present invention relates to a pattern recognition device for recognizing figures, etc.

［従来の技術］従来、データタブレット等からペンの座標とペンのアッ
プダウン情報を入力し、データタブレット上に筆記され
た文字等のパターンを認識するオンラインパターン認識
では、筆順制限が厳しく、誤った筆順の文字等は誤認識
されるか、リジェクトされることが多かった。また、筆
順を自由にするオンラインパターン認識装置も試みられ
ていたが、筆順を制限する場合に比較して認識時間が長
くなるため認識対象パターン種の拡張が困難であったり
、あるいは筆順を自由にしたために、かえつて誤認識率
が高くなるという欠点があった。[Conventional technology] Conventionally, online pattern recognition, which recognizes the pattern of characters written on the data tablet by inputting pen coordinates and pen up/down information from a data tablet, has strict stroke order restrictions and is prone to errors. Characters in stroke order were often misrecognized or rejected. Additionally, online pattern recognition devices that allow for free stroke order have been attempted, but the recognition time is longer compared to when the stroke order is restricted, making it difficult to expand the types of patterns to be recognized. This has the disadvantage that the rate of misrecognition increases.

入カバターンを認識するという点において、データタブ
レット等からペンの座標とペンのアップダウン情報を入
力して認識するオンラインパターン認識の大きな特徴の
１つは筆順情報を利用できることである。筆順はパター
ン入力者によって異なり、必ずしも−通りとは限らない
が、筆順情報を利用することで、認識率が高く、かつ認
識時間の短い文字認識が完成するのならば、筆順情報も
大いに利用すベミである。In terms of recognizing incoming cover patterns, one of the major features of online pattern recognition, which is recognized by inputting pen coordinates and pen up-down information from a data tablet or the like, is that stroke order information can be used. The stroke order differs depending on the person inputting the pattern, and is not necessarily the same, but if by using stroke order information, character recognition can be achieved with a high recognition rate and a short recognition time, then stroke order information can also be used to a great extent. It's Bemi.

［発明が解決しようとする問題点］本発明は、パターン作成時の筆順変動を十分に考慮した
上で、パターンを基本ストロークに分解して、予め登録
されているパターンのストロークと比較することにより
、続は書き文字の認識率が高く、かつ認識時間の短いパ
ターン認識装置な提供する。[Problems to be Solved by the Invention] The present invention fully takes into account stroke order variations during pattern creation, and then decomposes the pattern into basic strokes and compares them with strokes of patterns registered in advance. The present invention provides a pattern recognition device that has a high recognition rate for written characters and a short recognition time.

［問題点を解決するための手段］この問題点を解決するための一手段として、本発明のパ
ターン認識装置は、入力されたパターンを基本ストロー
クに分解して、分解されて得たストローク情報と予め登
録されているストロークの端点座標を含むストローク情
報とを比較する比較手段と、該比較手段の比較結果に基
づいて前記入力されたパターンを認識する認識手段とを
備える。[Means for Solving the Problem] As a means for solving this problem, the pattern recognition device of the present invention decomposes an input pattern into basic strokes and uses the stroke information obtained by decomposition and the The apparatus includes a comparison means for comparing stroke information including stroke end point coordinates registered in advance, and a recognition means for recognizing the input pattern based on a comparison result of the comparison means.

［作用コかかる構成において、入力されたパターンは基本ストロ
ークに分解されて、予め登録されているストローク情報
との比較によって、パターンが認識される。[Operation] In this configuration, the input pattern is decomposed into basic strokes, and the pattern is recognized by comparison with stroke information registered in advance.

（以下余白）［実施例］以下、添付図面に従って本発明の実施例を詳細に説明す
る。(The following is a margin) [Example] Hereinafter, an example of the present invention will be described in detail with reference to the accompanying drawings.

第１図（ａ）は第１実施例のパターン認識装置の構成図
であり、１はタブレット、２は入力ペン、３は特徴抽出
部、４は文字判別部、５は辞書部である。筆記者が入力
ペン２でタブレット１に文字を筆記すると、ペンの座標
とペンのアップダウン情報が特徴抽出部３に送られ、特
徴抽出部３では１文字分の入力された文字パターンから
特徴を抽出し、文字判別部４へ送る０文字判別部４では
入力された文字パターンから抽出された特徴を辞書部５
に予め登録されている各文字の文字パターンと比較して
、類似度を求め、最も類似度の高い（又は相違度の低い
）文字を認識結果として出力する。尚、文字パターンに
は、記号１図形等も含まれる。第１図（ｂ）は、文字判
別部４の構成図であり、４１は処理をするＣＰＵ、４２
はＣＰＵ４１の処理手順のプログラムを格納するＲＯＭ
、４３は補助用のＲＡＭである。FIG. 1(a) is a block diagram of the pattern recognition device of the first embodiment, in which 1 is a tablet, 2 is an input pen, 3 is a feature extraction section, 4 is a character discrimination section, and 5 is a dictionary section. When a scribe writes a character on the tablet 1 with the input pen 2, the coordinates of the pen and the up-down information of the pen are sent to the feature extraction unit 3, which extracts features from the input character pattern for one character. The zero character discriminator 4 extracts the features extracted from the input character pattern and sends them to the character discriminator 4.
The degree of similarity is determined by comparing the character patterns of each character registered in advance with the character pattern, and the character with the highest degree of similarity (or the lowest degree of difference) is output as a recognition result. Note that the character pattern also includes symbols, figures, and the like. FIG. 1(b) is a block diagram of the character discrimination unit 4, in which 41 is a CPU for processing, 42
is a ROM that stores the program of the processing procedure of the CPU 41
, 43 are auxiliary RAMs.

第２図は第１図（ａ）の辞書部５の構成を示す図であり
、本例では辞書部５は字画順に登録されている。５１は
９画文字の文字ブロックの構成を示す図であり、５２は
１つの文字ブロックが文字コード５３、分割情報５４、
ストローク情報５５から構成されていることを示す図で
ある。FIG. 2 is a diagram showing the configuration of the dictionary section 5 of FIG. 1(a), and in this example, the dictionary section 5 is registered in the order of strokes. 51 is a diagram showing the structure of a character block of nine stroke characters, and 52 is a diagram in which one character block has a character code 53, division information 54,
5 is a diagram showing that it is composed of stroke information 55. FIG.

第３図（ａ）は１つの文字ブロック５２の例であり、「
思」の文字ブロックの構造を示している。ｒ３ｂ５７Ｊ
は「思」の文字コード５３であり、「５」は「思」は始
めの５画と後半の４画に分割して処理することを示す分
割情報５４であり、次の９個のブロックには「思」のス
トローク情報５５が措置の筆順に従って登録されている
。FIG. 3(a) is an example of one character block 52, and "
It shows the structure of the character block for `` Thoughts''. r3b57J
is the character code 53 for "shi", and "5" is the division information 54 indicating that "shi" is divided into the first five strokes and the last four strokes, and is processed in the next nine blocks. The stroke information 55 for "thought" is registered according to the stroke order of the measure.

第３図（ｂ）は分割数として「１」を別に持っている分
割情報５４を持つ文字ブロックを示している。FIG. 3(b) shows a character block having division information 54 which additionally has "1" as the number of divisions.

以下、本実施例の動作を説明する。The operation of this embodiment will be explained below.

第４図はｒ思」の筆記例を示す図であり、「思」では「
田」の部分の筆順変動が忽しく、４０で示す筆順が基本
であるが、４１，４２゜４３で示す筆順のように異なっ
た筆順で筆記されることが多い、ただし、「心」を筆記
してから「田」を筆記したり、「田」の途中で「心」を
筆記し、ざらに「田」の残りを筆記することはなく、「
田」を筆記した後に「心」を筆記することがほとんどで
ある。このように種々な筆順の「思」を認識するには、
筆順を制限したまま、種々な筆順の「思」を辞書部５に
登録するか、あるいは筆順を自由にして入カバターンと
「思」のすべてのパターンとの比較を行うか２通りが考
えられているが、前者では、辞書部５の容量が非常に大
きくなるので、現実的ではなく、後者では不必要な比較
を行うため、認識時間が長くなったり、かえって誤った
対応をすることがあった。Figure 4 is a diagram showing an example of how to write ``r thought'', and ``residence'' is written as ``residence''.
There is a sudden change in the stroke order of the part ``田'', and although the stroke order shown in 40 is the basic stroke order, it is often written in different stroke orders such as the stroke order shown in 41, 42, and 43. However, when writing ``heart''``Ten'' is written down after ``Ten'', or ``Kokoro'' is written in the middle of ``Ten'', and the rest of ``Ten'' is not written in the middle.
In most cases, ``kokoro'' (heart) is written after writing down ``田''. In order to recognize the ``thoughts'' of various stroke orders in this way,
There are two possible ways to do this: register ``shi'' in various stroke orders in the dictionary section 5 while restricting the stroke order, or set the stroke order freely and compare the entered cover patterns with all patterns of ``shi''. However, in the former case, the capacity of the dictionary section 5 becomes extremely large, which is not practical, and in the latter case, unnecessary comparisons are made, which increases the recognition time and may even lead to incorrect responses. .

第５図（ａ）は本実施例による入力された文字のストロ
ーク情報と辞書部５の文字のストローク情報との比較結
果を示す図である。第５図（ｂ）は比較結果に対する重
み付けを示す図である。入力された文字パターンを、分
割情報５４に従って始めの５画と後半の４画に分割し、
始めの５画と辞書部５の始めの５画を比較し、後半の４
画と辞書部５の後半の４画を比較している。比較結果に
は重み付けを行い、本例では許容値以下であれば、同一
文字と判断する。例えば許容値を１００とすると、第５
図（ａ）の例では、入力された文字パターンは第４図の
４２の誤った筆順で筆記されているが、比較によって正
しい対応がとれることを示している。尚、重み付は及び
許容値は本例に限らない、又、同一と判断された文字コ
ードが複数ある場合は、最も小さい値のものを同一文字
と判断する。以後、比較結果の出力値を相違度の値と呼
ぶ。重み付けを逆にすれば、類似度の値となる。FIG. 5(a) is a diagram showing a comparison result between stroke information of input characters and stroke information of characters stored in the dictionary section 5 according to this embodiment. FIG. 5(b) is a diagram showing weighting for comparison results. The input character pattern is divided into the first 5 strokes and the latter 4 strokes according to the division information 54,
Compare the first 5 strokes with the first 5 strokes of dictionary section 5, and compare the last 4 strokes.
This figure is compared with the last four strokes of dictionary section 5. The comparison results are weighted, and in this example, if they are less than a tolerance value, they are determined to be the same character. For example, if the allowable value is 100, the fifth
In the example of FIG. 4A, the input character pattern is written in the incorrect stroke order of 42 in FIG. 4, but comparison shows that the correct correspondence can be achieved. Note that the weighting and tolerance values are not limited to those in this example, and if there are multiple character codes determined to be the same, the one with the smallest value is determined to be the same character. Hereinafter, the output value of the comparison result will be referred to as a dissimilarity value. Reversing the weighting results in a similarity value.

第６図は本実施例の２分割をする場合の処理手順を示す
フローチャートである。ステップＳ６０で特徴抽出部３
より、入力された文字パターンを受信した後、ステップ
Ｓ６１で比較の終了していない文字ブロックがあるかど
うかの判別を行い、無い場合は処理を終了する。まだ比
較していない文字ブロックがある場合は、ステップＳ６
２てその文字ブロック５２を辞書部５から入力し、特に
ステップＳ６３では文字ブロック５２から分割情報５４
を取り出す。ステップＳ６４で分割情報５４に従って入
力された文字パターンを前半と後半に分割し、前半のス
トローク同志の比較を行う。ステップ３６５では後半の
ストローク同志の比較を行い、ステップＳ６６ではステ
ップＳ６４とステップＳ６５の結果から入力された文字
パターンと文字ブロック５２の文字パターンが同一であ
るかどうかの判別を行い、同一パターンと判断できる場
合はステップＳ６７で文字コード５３を出力した後、ス
テップＳ６１へ戻り、同一パターンと判断できない場合
は直ちにステップＳ６１へ戻る。ステップＳ６７で、文
字コードを出力すると同時に、その時の相違度の値を以
後の判定の許容値とすると、最終的に１つの文字を発見
できる。又、最後に許容値が所定値以上である場合は、
筆記者に入力ミスを伝達することも考えてもよい。FIG. 6 is a flowchart showing the processing procedure for dividing into two parts according to this embodiment. In step S60, the feature extraction unit 3
After receiving the input character pattern, it is determined in step S61 whether there is any character block for which comparison has not been completed, and if there is no character block, the process is terminated. If there is a character block that has not been compared yet, step S6
2, input the character block 52 from the dictionary section 5, and in particular, in step S63, divide information 54 is input from the character block 52.
Take out. In step S64, the input character pattern is divided into the first half and the second half according to the division information 54, and the strokes of the first half are compared. In step 365, the strokes in the latter half are compared, and in step S66, it is determined whether the input character pattern and the character pattern of the character block 52 are the same based on the results of step S64 and step S65, and it is determined that they are the same pattern. If possible, the character code 53 is output in step S67, and then the process returns to step S61. If it cannot be determined that the patterns are the same, the process immediately returns to step S61. In step S67, when the character code is output and the difference value at that time is set as the allowable value for subsequent determination, one character can finally be found. Also, if the allowable value is greater than or equal to the predetermined value,
You may also consider communicating input errors to the scribe.

本実施例ではタブレットに対して入力ベンで筆記してい
るが、圧力を感知するタブレットを用い、指で筆記する
ようにしても良い。又、比較結果の判別方法は重み付け
には限らない。更に、入力された文字の画数により、本
例では９画から比較を始めると処理時間が短い。In this embodiment, writing is performed using an input ben on a tablet, but writing may be performed using a finger using a tablet that senses pressure. Further, the method for determining the comparison result is not limited to weighting. Furthermore, depending on the number of strokes of the input character, in this example, if the comparison starts from 9 strokes, the processing time will be shorter.

ところが、筆順の誤り以外に、続は書きをすることも考
えられる。そこで、次に文字が続は書きされた場合の実
施例のパターン認識装置の対応を説明する。However, in addition to errors in stroke order, it is also possible that the continuation is written. Therefore, the response of the pattern recognition device of the embodiment when a continuation of characters is written next will be explained.

第７図は「思」の筆記例を示す図であり、７０は正しい
筆順で、かつ精舎で筆記された「思」を示す一方、７１
は誤った筆順で、かつ一部分が続は書きされた「思」を
示している。Figure 7 is a diagram showing an example of writing ``shi'', and 70 shows ``shi'' written in the correct stroke order and in a Buddhist temple, while 71
is written in the wrong order of strokes, and part of it shows the ``thoughts'' written.

第８図（ａ）は本実施例における続は書きされた「思」
の文字バターーノと辞書部の文字パターンとの比較結果
を示す図である。第８図（ｂ）は比較結果の重み付けを
示す図である。入力された文字パターンの始めの５画と
辞書部の文字パターンの始めの５画との比較を行い、入
力された文字パターンの終りの４画と辞書部の文字パタ
ーンの後半の４画との比較を行うことを示している。Figure 8(a) shows the continuation in this example written as “thought”.
It is a figure which shows the comparison result of the character Batano and the character pattern of a dictionary part. FIG. 8(b) is a diagram showing weighting of comparison results. Compare the first five strokes of the input character pattern with the first five strokes of the character pattern in the dictionary section, and compare the last four strokes of the input character pattern with the last four strokes of the character pattern in the dictionary section. Indicates that a comparison is to be made.

第８図（ａ）の例では、筆順が正しくなく、かつ一部分
が続は書きされていても、比較によって正しい対応がと
れることを示している。The example in FIG. 8(a) shows that even if the order of strokes is incorrect and some parts are continued, a correct correspondence can be taken by comparison.

以上述べた様な２つの分割ばかりでなく、３つ以上の分
割を行った方がより有利な場合がある。It may be more advantageous to perform not only two divisions as described above but three or more divisions.

例えば、「数」という文字は「米」と「女」と「久」に
分割した方が、認識率が高く、認識時間が短い認識が可
能である。そこで、次に文字を任意の数に分割するパタ
ーン認識装置を説明する。For example, if the character ``number'' is divided into ``rice'', ``女'', and ``久'', the recognition rate will be higher and the recognition time will be shorter. Next, a pattern recognition device that divides characters into arbitrary numbers will be described.

第１０図（ａ）は「数」の文字ブロック５２の例であり
、ｒ３Ｆ７４Ｊは数の文字コード５３、「６」および「
３」は始めの６画の「米」次の３画の１女」、そして残
りの「欠」に分割されることを示す分割情報５４である
。次の１３個のブロックには「数」のストローク情報５
５が正しい筆順に従って格納されている。第１０図（ｂ
）は分割数として「２」を別に持っている分割情報５４
を含む文字ブロックを示している。FIG. 10(a) is an example of a "number" character block 52, and r3F74J is a number character code 53, "6" and "
3" is the division information 54 indicating that the image is divided into the first six strokes of "rice," the next three strokes of "1 woman," and the remaining strokes of "missing." The next 13 blocks have “number” stroke information 5
5 are stored in the correct stroke order. Figure 10 (b
) is the division information 54 that separately has “2” as the division number.
It shows a block of characters that contains .

第９図は「数」の筆記例を示す図であり、９０が正しい
筆順の筆記例であるのに対し、９１は誤った筆順で筆記
されている上、「久」の部分は、一部分、続は書きされ
た筆記例である。このようにｒ数」を構成する「米」、
「女」、「欠」はそれぞれ種々な筆順で筆記される。し
かし、「米」「女」、「欠」の順で筆記されることがほ
とんどである。Figure 9 is a diagram showing an example of writing the word ``number''. 90 is an example of writing in the correct stroke order, while 91 is written in the wrong stroke order, and the part for ``ku'' is partially written in the correct stroke order. What follows is a written example. In this way, "rice" constitutes "r number",
``Onna'' and ``miss'' are each written in various stroke orders. However, in most cases, it is written in the order of ``rice'', ``woman'', and ``missing''.

第１１図は「数」の辞書部の文字パターンと入力された
文字パターンの比較結果を示す図である。ｒ数」の分割
情報５４は「６」と「３」であるから、文字パターンを
「米」　（６画）とｒ女」（３画）と「欠」に分割し、
それぞれを入力された文字パターンの始めの６画、６画
〜９画、終りの４画と比較し、正しいパターンの対応を
得ている。このように本実施例では少ない処理量で誤筆
順文字や続は文字も正しいパターンに対応させており、
実用上の筆順自由や続は書きを可能にしている。FIG. 11 is a diagram showing a comparison result between the character pattern of the dictionary section for "number" and the input character pattern. Since the division information 54 for "r number" is "6" and "3", the character pattern is divided into "rice" (6 strokes), "r woman" (3 strokes), and "missing".
Each of these is compared with the first six strokes, six to nine strokes, and the last four strokes of the input character pattern to obtain correct pattern correspondence. In this way, in this embodiment, the incorrect stroke order characters and the continuation characters are made to correspond to the correct pattern with a small amount of processing.
In practical terms, free stroke order and continuous writing are possible.

第１２図は本実施例の任意の分割の場合の処理手順を示
すフローチャートである。ステップ５１２０で、特徴抽
出部３より入力された文字パターンを受信し、その画数
から比較する文字の画数を決定する。各文字ブロック５
２の比較ではステップ５１２１で終りかどうかの判別を
行い、終りの場合は処理を終了する。終りでない場合は
ステップ５１２２でステップ５１２０で決定された画数
の文字ブロックを読み込み、ステップ５１２３でその中
から特に分割情報５４を取り出す。ステップ５１２４で
は分割情報５４に従って文字パターンを分割し、ステッ
プ５１２５で入力された文字パターンと予め登録されて
いる文字パターンのストロークの比較を行う。ステップ
３１２６では全ストロークの比較が終了しかを判断して
、終了していなければステップ３１２５〜１２６を繰り
返す。ステップ５１２７ではその比較の結果、入力され
た文字パターンと文字ブロックが同一文字であるかどう
かの判別を行い、同一文字と判断された場合は、ステッ
プ５１２８で文字コードを出力し、ステップ５１２１に
戻って次の文字ブロックの比較に移る。そうでない場合
は直接ステップ５１２１へ戻る。尚、前述したように同
一と判断された文字コードが複数ある場合は、最も小さ
い値のものを同一文字と判断する。FIG. 12 is a flowchart showing the processing procedure in the case of arbitrary division in this embodiment. In step 5120, the character pattern inputted from the feature extraction unit 3 is received, and the number of strokes of the character to be compared is determined from the number of strokes. Each character block 5
In the second comparison, it is determined in step 5121 whether or not it is the end, and if it is the end, the process ends. If it is not the end, in step 5122, the character block with the number of strokes determined in step 5120 is read, and in step 5123, particularly the division information 54 is extracted from it. In step 5124, the character pattern is divided according to the division information 54, and in step 5125, the strokes of the input character pattern and the previously registered character pattern are compared. In step 3126, it is determined whether the comparison of all strokes has been completed, and if not, steps 3125 to 126 are repeated. In step 5127, as a result of the comparison, it is determined whether the input character pattern and the character block are the same character. If they are determined to be the same character, the character code is output in step 5128, and the process returns to step 5121. to move on to comparing the next block of characters. Otherwise, the process returns directly to step 5121. Note that, as described above, if there are multiple character codes determined to be the same, the one with the smallest value is determined to be the same character.

又、ステップ３１２８で、文字コードを出力すると同時
に、その時の相違度の値を以後の判定の許容値とすると
、最終的に１つの文字を発見できるし、最後に許容値が
所定値以上である場合は、筆記者に入力ミスを伝達する
ことも考えてもよい。In addition, in step 3128, when the character code is output, and at the same time, the value of the degree of difference at that time is set as the tolerance value for subsequent determination, one character can finally be found, and finally the tolerance value is greater than or equal to the predetermined value. If so, you may consider communicating the input error to the scribe.

本例では３つに分割した例を説明したが、分割数はいく
らでも良い。又、その分割方法もそれぞれの文字につい
て最良の方法を考えれば良い。In this example, an example in which the image is divided into three parts has been explained, but the number of divisions may be any number. Also, as for the division method, it is sufficient to consider the best method for each character.

更に、すべてのストロークの比較を終了するのを待たず
に相違度が所定値を越えると、判断を中止するようにす
ると、次の認識に早く移ることが出来て認識時間を短く
できる。そこで、次に相違度が所定値を越えると判断を
中止するパターン認識製蓋を説明する。Furthermore, if the determination is stopped when the degree of difference exceeds a predetermined value without waiting for the comparison of all strokes to be completed, the next recognition can be started quickly, and the recognition time can be shortened. Therefore, next, a pattern recognition lid making method will be described in which the determination is stopped when the degree of difference exceeds a predetermined value.

第１３図の１３０は辞書部５に登録されている「思」の
文字パターンであり、正しい筆順に従ったストローク情
報である。１３１は「計」の筆記例と正しい筆順に従っ
たストローク情報である。Reference numeral 130 in FIG. 13 is a character pattern for "shi" registered in the dictionary section 5, which is stroke information that follows the correct stroke order. 131 is a handwriting example of "to" and stroke information according to the correct stroke order.

この２文字の比較では「思」の始めの５画である「田」
と筆記例である「計」の始めの５画である「芦Ｊの比較
を行う。第１４図は「思」と「計」のストローク対応を
示す図であるが、「田」の各ストロークは文字枠の上半
分に存在するのに対し、「ア」の各ストロークは文字枠
の左半分に存在し、始めの５画同志の相違度は大きく、
許容値より大きくなる。相違度が許容値より大きくなっ
たので、残りの「心」と「コ↑」の比較は行わない。In comparing these two characters, ``田'' is the first five strokes of ``し''.
Compare the first five strokes of the writing example ``ke'' with ``芦J''. Figure 14 shows the stroke correspondence between ``shi'' and ``ke'', and each stroke of ``田''. exists in the upper half of the character frame, whereas each stroke of "A" exists in the left half of the character frame, and the degree of difference between the first five strokes is large.
becomes larger than the allowable value. Since the degree of difference has become larger than the allowable value, the remaining "heart" and "ko↑" are not compared.

第１５図は相違度により処理を中断する場合の処理手順
を示すフローチャートである。ステップ５１５０で、特
徴抽出部３より入力された文字パターンを受信し、その
画数から比較する文字の画数を決定する。各文字ブロッ
ク５２の比較ではステップ５１５１で終りかどうかの判
別を行い、終りの場合は処理を終了する。終りでない場
合はステップ５１５２でステップ５１５０で決定された
画数の文字ブロックを読み込み、ステップ５１５３でそ
の中から特に分割情報５４を取り出す。ステップ５１５
４では分割情報５４に従って文字パターンを分割し、ス
テップ５１５５で入力された文字パターンと予め登録さ
れている文字パターンのストロークの比較を行う、ステ
ップ５１５６では、ステップ５１５５での比較結果の相
違度を加算する。ステップ５１５７では、ステップ５１
５６での加算結果が許容値以下かが判定され、許容値を
越えた場合はその文字ブロックに対する比較を中断して
、ステップ５１５１に戻り次の文字ブロックとの比較に
穆る。ステップ５１５８では全ストロークの比較が終了
したかを判断して、終了していなければステップ５１５
５〜１５８を繰り返す。ステップ５１５９ではその比較
の結果、入力された文字パターンと文字ブロックが同一
文字であるかどうかの判別を行い、同一文字と判断され
た場合は、ステップ３１６０で文字コードを出力し、ス
テップ５１５１に戻って次の文字ブロックの比較に穆る
。そうでない場合は直接ステップ５１５１へ戻る。尚、
前述したように同一と判断された文字コードが複数ある
場合は、最も小さい値のものを同一文字と判断する。FIG. 15 is a flowchart showing the processing procedure when processing is interrupted due to the degree of difference. In step 5150, the character pattern inputted from the feature extraction unit 3 is received, and the number of strokes of the character to be compared is determined from the number of strokes. In comparing each character block 52, it is determined in step 5151 whether it is the end or not, and if it is the end, the process ends. If it is not the end, in step 5152, the character block with the number of strokes determined in step 5150 is read, and in step 5153, particularly the division information 54 is extracted from it. Step 515
In Step 4, the character pattern is divided according to the division information 54, and in Step 5155, the strokes of the input character pattern and the previously registered character pattern are compared. In Step 5156, the degree of difference of the comparison result in Step 5155 is added. do. In step 5157, step 51
It is determined whether the addition result in step 5156 is less than or equal to the allowable value, and if it exceeds the allowable value, the comparison for that character block is interrupted, and the process returns to step 5151 to start comparison with the next character block. In step 5158, it is determined whether the comparison of all strokes has been completed, and if not, step 515
Repeat steps 5 to 158. In step 5159, as a result of the comparison, it is determined whether the input character pattern and the character block are the same character. If they are determined to be the same character, the character code is output in step 3160, and the process returns to step 5151. to start comparing the next block of characters. Otherwise, the process returns directly to step 5151. still,
As described above, if there are multiple character codes determined to be the same, the one with the smallest value is determined to be the same character.

又、ステップ５１６０で、文字コードを出力すると同時
に、その時の相違度の値を以後の判定の許容値とすると
、最終的に１つの文字を発見できるし、最後に許容値が
所定値以上である場合は、筆記者に入力ミスを伝達する
ことも考えてもよい。Also, in step 5160, if the character code is output and the difference value at that time is set as the tolerance value for subsequent determination, one character can be finally found, and finally the tolerance value is greater than or equal to the predetermined value. If so, you may consider communicating the input error to the scribe.

更に、辞書部の文字の順序を考えて、ステップ５１５７
で相違度が許容値を越えた場合は、比較の終了した部分
パターンとおなしパターンを持つ文字との比較をスキッ
プすることも出来る。Furthermore, considering the order of characters in the dictionary section, step 5157
If the degree of dissimilarity exceeds the allowable value, it is possible to skip the comparison between the partially compared partial pattern and the character with the blank pattern.

相違度の許容値は入力された文字パターンの画数によっ
て定められていても、辞書部の文字パターンの画数によ
って定められていたり、各文字毎、あるいは各部分の画
数によって定められていてもよい。また、入力された文
字パターンの特徴と予め登録されている文字パターンと
の比較において、その入力された文字パターンの特徴と
それ以前に比較した文字パターンとの相違度のうち、最
も小さい相違度を許容値にしてもよい。又、本例では「
田」の方から比較したが、「心」から比較しても、その
文字によっては、文字の特徴が最も現れたパターンから
比較した方が認識時間が短い場合や認識率が高い場合も
ある。The allowable value of the degree of difference may be determined by the number of strokes of the input character pattern, the number of strokes of the character pattern in the dictionary, or the number of strokes of each character or part. In addition, when comparing the characteristics of the input character pattern with character patterns registered in advance, the smallest difference between the characteristics of the input character pattern and the previously compared character pattern is selected. It may be a permissible value. Also, in this example, “
Although the comparison was made starting with ``田'', even when comparing starting with ``心'', depending on the character, the recognition time may be shorter or the recognition rate may be higher if the pattern that shows the most character features is compared first.

以上説明したように、入力された文字パターンの特徴と
辞書部の文字パターンとの比較において、この一方ある
いは両方をいくつかの部分パターンに分割し、対応する
部分パターン同志の比較の結果、相違度がある値以上で
ある場合、あるいは逆に類似度がある値以下である場合
、入力された文字パターンと辞書部の文字パターンとの
実行中の比較を終了して次の処理に進む、あるいは類似
パターンの文字の比較をスキップすることで認識時間の
短縮化が可能になる。As explained above, when comparing the characteristics of the input character pattern and the character pattern in the dictionary, one or both of them are divided into several partial patterns, and as a result of comparing the corresponding partial patterns, the degree of dissimilarity is determined. If the similarity is greater than or equal to a certain value, or conversely, if the similarity is less than or equal to a certain value, the ongoing comparison between the input character pattern and the character pattern in the dictionary section is terminated and the process proceeds to the next step, or if the similarity is By skipping the comparison of characters in the pattern, recognition time can be shortened.

次に、ストローク情報例及びストローク情報による比較
例を説明する。Next, an example of stroke information and a comparative example using stroke information will be described.

第１６図はストローク情報の作成に使用する基本ストロ
ークの一例を示す図であり、代表的なストロークの形と
その番号が登録されている０入力された文字パターンの
各ストロークはこれらの基本ストローク１６のいずれか
に分類される。FIG. 16 is a diagram showing an example of basic strokes used to create stroke information, in which typical stroke shapes and their numbers are registered. Each stroke of the input character pattern is one of these basic strokes 16. It is classified as one of the following.

第１７図は辞書部５に登録されている文字ブロック５２
の一例を示す図であり、「何」の文字パターンである。FIG. 17 shows a character block 52 registered in the dictionary section 5.
is a diagram showing an example of the character pattern of "What".

ｒ３２３ＦＪは「何」の文字コード５３である。「６」
、ｒ３６，９２Ｊ、ｒ８，４７Ｊは「何」の第１画であ
るｒノ」の基本ストローク番号、始点の座標、終点の座
標である。始点及び終点の座標は「何」の標準パターン
に位置と大きさに正規化を施した後の各端点のｘ、ｙ座
標値である。さらに、「７」、「２２゜６０」、ｒ２４
，８Ｊは「何」の第２画である「１」の基本ストローク
番号、始点の座標、終点の座標である。以下同様にスト
ローク情報５５が登録されている。前述のように分割す
る場合は、文字コード５３とストローク情報５５の間に
分割情報５４が登録されている。r323FJ is the character code 53 for "What". "6"
, r36, 92J, r8, 47J are the basic stroke number, the coordinates of the starting point, and the coordinates of the ending point of "rノ" which is the first stroke of "What". The coordinates of the starting point and the ending point are the x and y coordinate values of each end point after normalizing the position and size of the "what" standard pattern. Furthermore, "7", "22°60", r24
, 8J are the basic stroke number, starting point coordinates, and ending point coordinates of "1", which is the second stroke of "what". Stroke information 55 is similarly registered below. When dividing as described above, division information 54 is registered between character code 53 and stroke information 55.

第１８図は「何」の筆記例を示した図である。FIG. 18 is a diagram showing an example of writing "What".

１８０は措置かつ正しい筆順で筆記された「何」の筆記
例であるが、１８１，１８２は措置ではあるが、誤った
筆順で筆記された「何」の筆記例である。１８３は一部
分が続は書きされ、かつ誤った筆順で筆記された「何」
の例である。基本ストロークを利用した従来のオンライ
ン文字認識では、このような誤筆順文字や続は字の認識
は困スａであった。180 is an example of "what" written as a measure and in the correct stroke order, while 181 and 182 are examples of "what" written as a measure but in the wrong stroke order. 183 is a part of the word ``what'' written in the wrong stroke order.
This is an example. In conventional online character recognition using basic strokes, it is difficult to recognize such incorrect stroke order characters and continuation characters.

第１９図と第２０図は本実施例による認識方法の一例の
フローチャートであり、第２１図は認識の際に利用する
基本ストローク間相似皮表の一部を示す図である。FIGS. 19 and 20 are flowcharts of an example of the recognition method according to this embodiment, and FIG. 21 is a diagram showing a part of the basic stroke similarity surface used for recognition.

第１９図はストローク情報による入力された文字パター
ンと辞書部の文字パターンとの比較手順を示すフローチ
ャートであり、ステップ５１９１で辞書部の文字パター
ンの各ストロークと、入力された文字パターンの各スト
ロークとを比較して対応するストロークを求め、次にス
テップ５１９２で対応するストローク間の距離の和を求
め、入力された文字パターンと文字パターンとの相違度
にしている。このようにして辞書部に登録されている各
文字パターンと入力された文字パターンとの相違度を求
め、最も相違度の小さな文字を認識結果として出力して
いる。FIG. 19 is a flowchart showing a procedure for comparing the input character pattern with the character pattern of the dictionary section based on stroke information. In step 5191, each stroke of the character pattern of the dictionary section and each stroke of the input character pattern are compared. are compared to find corresponding strokes, and then in step 5192 the sum of the distances between the corresponding strokes is found, which is used as the degree of difference between the input character pattern and the character pattern. In this way, the degree of difference between each character pattern registered in the dictionary section and the input character pattern is determined, and the character with the smallest degree of difference is output as the recognition result.

第２０図は辞書部の文字パターンの各ストロークと入力
された文字パターンの各ストロークとの対応を求める手
順を示すフローチャートであり、ステップ５２０１で両
ストロークの始点同志の距離を求め、ステップ５２０２
で終点同志の距離を加え、さらにステップ５２０３で第
２１図に示す基本ストローク間相違度表に従い、基本ス
トローク間相違度を加え、この値をストローク間距離と
し、辞書部の文字パターンのストロークに対し、入力さ
れた文字パターンのストロークの中で最も距離の小さい
ストロークを対応ストロークとする。FIG. 20 is a flowchart showing the procedure for determining the correspondence between each stroke of the character pattern in the dictionary section and each stroke of the input character pattern. In step 5201, the distance between the starting points of both strokes is determined, and in step 5202
In step 5203, the distance between the end points is added, and in step 5203, the degree of difference between basic strokes is added according to the degree of difference between basic strokes table shown in FIG. 21, and this value is defined as the distance between strokes. , the stroke with the shortest distance among the strokes of the input character pattern is set as the corresponding stroke.

第２２図は「何」の筆記例であり、第１８図の１８３の
筆記例である。第２３図は第２２図の筆記例に対し、筆
順に従って各ストロークの基本ストローク番号、始点の
座標、終点の座標を抽出したものである。尚、基本スト
ロークへの分解は、各ストロークと基本ストロークの始
点を同じ座標にＢ勤して、終点間の距離の最も短い基本
ストロークを選ぶ等が考えられる。FIG. 22 is an example of writing "What", which is an example of writing 183 in FIG. FIG. 23 shows the basic stroke number, starting point coordinates, and ending point coordinates of each stroke extracted from the writing example shown in FIG. 22 according to the stroke order. Note that for decomposition into basic strokes, it is possible to set the starting point of each stroke and the basic stroke to the same coordinates and select the basic stroke with the shortest distance between the end points.

第２４図は第２２図の入力された筆記例と第１７図の文
字パターンの各ストロークの対応を示す図である。入力
された文字パターンに従い、辞書部の文字パターンの第
１ストロークと入力された文字パターンの各ストローク
との比較を行い、辞書部の文字パターンの第１ストロー
クには入力された文字パターンの第１ストロークが対応
したことを示している。以下同様である。入力された文
字パターンの第４ストロークは、辞書部の文字パターン
の第５ストロークと第６ストロークに対応している。ス
トローク情報が正しく対応しているため、入力された文
字パターンの「何」と辞書部の文字パターンの「何」の
相違度は比較的小さな値となり正しく認識される。FIG. 24 is a diagram showing the correspondence between the input handwriting example of FIG. 22 and each stroke of the character pattern of FIG. 17. According to the input character pattern, the first stroke of the character pattern in the dictionary section is compared with each stroke of the input character pattern, and the first stroke of the character pattern in the dictionary section is compared with the first stroke of the input character pattern. This indicates that the strokes corresponded. The same applies below. The fourth stroke of the input character pattern corresponds to the fifth and sixth strokes of the character pattern in the dictionary section. Since the stroke information corresponds correctly, the degree of difference between the input character pattern "what" and the dictionary part character pattern "what" becomes a relatively small value and is correctly recognized.

以上のように、基本ストロークのストロークの端点の座
標を利用することで、誤筆順文字や続は字でも正しいス
トローク対応を得ることが多く、高い認識率が得られる
。尚、本例は基本ストロークに分解して比較したが、文
字ストローク自身を比較する方法等も考えられる。As described above, by using the coordinates of the stroke end points of the basic strokes, correct stroke correspondence is often obtained even for incorrect stroke order characters and continuation characters, resulting in a high recognition rate. In this example, the basic strokes are broken down and compared, but a method of comparing the character strokes themselves may also be considered.

さて、文字をグループに分割するだけでなく、特定の出
現率の多い、例えば、部首等を予め比較して見付けると
認識時間が短くなる。以下、上記のパターン認識を実現
するパターン認識装置を説明する。Now, in addition to dividing characters into groups, the recognition time can be shortened by comparing and finding specific occurrence rates, such as radicals, in advance. A pattern recognition device that realizes the above pattern recognition will be described below.

第２５図は第２実施例のパターン認識装置の構成図であ
り、２５１はタブレット、２５２は入力ベン、２５３は
特徴抽出部、２５４は部分パターン判別部、２５５は文
字判別部、２５６は辞書部、２５７は辞書部２５７の内
の部分パターン辞書部、２５８は辞書部２５７の内の文
字辞書部である。筆記者が入力ベン２５２でタブレット
２５１に文字を筆記すると、ペンの座標とペンのアップ
ダウン情報が特徴抽出部２５３に送られ、特徴抽出部２
５３では１文字分の入力された文字パターンに対して位
置と大きさの正規化を施し、ストロークの数、各ストロ
ークの端点の座標、各ストロークの概形等の特徴を抽出
し、部分パターン判別部２５４へ送る。部分パターン判
別部２５４では特徴抽出部２５３から送られた特徴を部
分パターン辞書部２５７に登録されている各部分パター
ンの特徴と比較して部分パターンの候補を求め、特徴抽
出部２５３から送られた入力された文字パターンの特徴
とともに、候補部分パターンを文字判別部２５５へ送る
。文字判別部２５５では、部分パターン判別部２５４か
ら送られる入力された文字パターンの特徴と候補部分パ
ターンに従い、候補部分パターンによって文字辞書部２
５８に登録されている文字パターンの中から比較すべき
文字パターンを選別して入力された文字パターンの特徴
と文字パターンを比較し、最も相違度の小さい、あるい
は類似度の高い文字を認識結果として出力する。尚、部
分パターン判別部２５４及び文字判別部２５５は、第１
図（ｂ）と同様にＣＰＵ、ＲＯＭ、ＲＡＭから構成され
ており、第２５図に示したＲＯＭ２５４ａ、２５５ａに
は、処理プログラムが格納されている。FIG. 25 is a block diagram of the pattern recognition device of the second embodiment, in which 251 is a tablet, 252 is an input bezel, 253 is a feature extraction section, 254 is a partial pattern discrimination section, 255 is a character discrimination section, and 256 is a dictionary section. , 257 is a partial pattern dictionary section of the dictionary section 257, and 258 is a character dictionary section of the dictionary section 257. When a scribe writes characters on the tablet 251 using the input pen 252, the coordinates of the pen and the up-down information of the pen are sent to the feature extraction unit 253.
53 normalizes the position and size of the input character pattern for one character, extracts features such as the number of strokes, the coordinates of the end points of each stroke, and the outline of each stroke, and discriminates partial patterns. 254. The partial pattern discrimination unit 254 compares the features sent from the feature extraction unit 253 with the features of each partial pattern registered in the partial pattern dictionary unit 257 to obtain partial pattern candidates, and compares the features sent from the feature extraction unit 253 with the features of each partial pattern registered in the partial pattern dictionary unit 257. The candidate partial pattern is sent to the character discrimination unit 255 along with the characteristics of the input character pattern. The character discrimination section 255 selects the character dictionary section 2 according to the candidate partial patterns according to the characteristics of the input character pattern and the candidate partial patterns sent from the partial pattern discrimination section 254.
Select character patterns to be compared from among the character patterns registered in 58, compare the characteristics of the input character pattern with the character pattern, and select the characters with the smallest degree of difference or highest degree of similarity as the recognition result. Output. Note that the partial pattern discrimination unit 254 and the character discrimination unit 255 are
As in FIG. 25, it is composed of a CPU, ROM, and RAM, and processing programs are stored in the ROMs 254a and 255a shown in FIG.

第２６図は部分パターンの例を示す図であり、偏や帝、
冠等の部首や多くの漢字に出現する部分パターンに番号
を対応させている。第２７図は多くの文字に出現するス
トロークの例を示す図であり、各ストロークに対して番
号を対応させている。これらのストロークは文字を構成
している基本的なストロークなので、以後、基本ストロ
ークと呼ぶ。Figure 26 is a diagram showing an example of partial patterns.
Numbers correspond to radicals such as crowns and partial patterns that appear in many kanji. FIG. 27 is a diagram showing an example of strokes that appear in many characters, and a number is associated with each stroke. Since these strokes are the basic strokes that make up characters, they will be referred to as basic strokes hereinafter.

第２８図は第２５図の部分パターン辞書部２５７の構造
を示す図である０部分パターン辞書部２５７は多数の部
分パターンブロック２５７ａの集合であり、各部分パタ
ーンブロックは部分パターン番号、部分パターンの画数
、その部分パターンの措置体の部分パターン番号、部分
２５７ｂ、切り出し情報から構成されている０部分には
、その部分パターンを構成している各ストロークの端点
の座標と第２７図で示した基本ストローク２７を表わす
番号が登録されている。FIG. 28 is a diagram showing the structure of the partial pattern dictionary section 257 in FIG. The 0 part consisting of the number of strokes, the partial pattern number of the measure body of the partial pattern, the part 257b, and the cutting information contains the coordinates of the end points of each stroke making up the partial pattern and the basic information shown in FIG. A number representing stroke 27 is registered.

第２９図は部分パターンブロック２５７ａの一例であり
、「糸」に対応する部分パターンブロック２５７ａであ
る。「１２」は「糸」の部分パターン番号であり、「５
」は「糸」の画数、「１１」は「糸」の措置体「糸」の
部分パターン番号であり、次の５個の情報には各ストロ
ークの始点の座標、終点の座標、概形を表わす番号を含
む部分２５７ｂが登録されている。最後の「０」は切り
出し情報であり、入力された文字パターンとこの部分パ
ターンとの比較判別では、入力された文字パターンから
筆順において「始め」の５画を切り出して比較すること
を示している。もし、切り出し情報が「１」であれば、
入力された文字パターンから筆順において「終り」の５
画を切り出して比較する。第３０図は第２９図の部分２
５７ｂの基になった部分パターンを図示したものであり
、すべてのストロークが文字枠の左半分にある。FIG. 29 is an example of a partial pattern block 257a, which corresponds to "thread". "12" is the partial pattern number of "thread", and "5" is the partial pattern number of "thread".
" is the number of strokes of "thread", "11" is the partial pattern number of the measure body "thread" of "thread", and the next five pieces of information contain the coordinates of the starting point, coordinates of the ending point, and outline of each stroke. A portion 257b including the representing number is registered. The last "0" is cutting information, and indicates that when comparing and determining the input character pattern and this partial pattern, the "first" 5 strokes in the stroke order are cut out from the input character pattern and compared. . If the extraction information is “1”,
5 of "end" in the stroke order from the input character pattern
Cut out the images and compare them. Figure 30 is part 2 of Figure 29.
This diagram illustrates the partial pattern on which 57b is based, and all strokes are in the left half of the character frame.

第３１図は部分パターン判別部の処理手順を示すフロー
チャートであり、第３２図は部分パターンの判別におい
て利用する各基本ストローク間の相違度表の一部を示す
図である。以下、第３１図に従って動作を説明する。部
分パターン判別部２５４はステップ５３１０で特徴抽出
部２５３から入力された文字パターンの特徴を入力する
と、辞書部２５６の部分パターン辞書部２５７をアクセ
スし、ステップ５３１１へ進む。ステップ５３１１では
比較していない部分パターンがまだあるかどうかを判別
し、ある場合はステップ５３１２へ進む。ステップ５３
１２では比較すべき部分パターンの各情報を入力し、ス
テップ５３１３でその画数を切り出し情報により入力さ
れた文字パターンから対応する情報を切り出し、ステッ
プ５３１４で入力された文字パターンと部分の各ストロ
ークの端点間の距離の和、およびストローク間相違度の
和を求め、この２つの値の和を部分ストロークの画数で
割った値をこの部分パターンの入力された文字パターン
に対する相違度とする。このようにして、１つの部分パ
ターンの相違度が求まるとステップ５３１０へ戻る。ス
テップ５３１１で入力された文字パターンとすべての部
分パターンとの比較が終了したと判断された場合はステ
ップ５３１４へ進み、最も相違度の小さな部分パターン
３個を候補部分パターンとして出力する。FIG. 31 is a flowchart showing the processing procedure of the partial pattern discrimination section, and FIG. 32 is a diagram showing a part of the difference table between basic strokes used in discrimination of partial patterns. The operation will be explained below according to FIG. 31. When the partial pattern discrimination section 254 receives the characteristics of the character pattern input from the feature extraction section 253 in step 5310, it accesses the partial pattern dictionary section 257 of the dictionary section 256, and proceeds to step 5311. In step 5311, it is determined whether there are any partial patterns that have not been compared, and if there are, the process advances to step 5312. Step 53
In step 12, each piece of information on the partial pattern to be compared is input, in step 5313 the corresponding information is cut out from the input character pattern using the cutting information, and in step 5314, the input character pattern and the end point of each stroke of the part are The sum of the distances between the strokes and the sum of the degrees of dissimilarity between strokes are calculated, and the sum of these two values divided by the number of partial strokes is determined as the degree of dissimilarity of this partial pattern with respect to the input character pattern. In this way, once the degree of difference of one partial pattern is determined, the process returns to step 5310. If it is determined in step 5311 that the comparison between the input character pattern and all partial patterns has been completed, the process proceeds to step 5314, and the three partial patterns with the smallest degree of difference are output as candidate partial patterns.

第３３図は本実施例の文字辞書部２５８の一例の構成を
示す図であり、文字辞書部２５８は画数毎に分割され、
さらに、各画数辞書部２５８ａは部分パターン毎にまと
まっている。例えば、第３３図に示すように、「シ」を
含む文字は１つのグループを形成し、「糸」を含む文字
は別の１つのグループを形成している。各グループの先
頭にはその部分パターン番号が格納されている。FIG. 33 is a diagram showing the configuration of an example of the character dictionary section 258 of this embodiment, and the character dictionary section 258 is divided by the number of strokes,
Furthermore, each stroke number dictionary section 258a is organized for each partial pattern. For example, as shown in FIG. 33, characters including "shi" form one group, and characters including "thread" form another group. At the beginning of each group, its partial pattern number is stored.

第３４図は文字判別部の処理手順を示すフローチャート
である０文字判別部２５５はステップ５３４０で部分パ
ターン判別部２５４から入力された文字パターンの特徴
と候補部分パターンを入力する。ステップ５３４１では
まだ判別していない部分パターンのグループがあるかど
うかの判別を行い、ある場合はステップ５３４２へ進む
、ステップ５３４２では、文字辞書部２５８をアクセス
し、部分パターンを入力し、ステップ５３４３でその部
分パターンが候補部分パターンに含まれているかどうか
を判別し、含まれている場合はステップ５３４４へ進み
、この部分パターンのグループに属する文字パターンと
入力された文字パターンとの比較を行いステップ５３４
１へ戻る。FIG. 34 is a flowchart showing the processing procedure of the character discriminator. In step 5340, the character discriminator 255 receives the characteristics of the character pattern and candidate partial patterns input from the partial pattern discriminator 254. In step 5341, it is determined whether there is a group of partial patterns that have not yet been determined, and if there is, the process proceeds to step 5342.In step 5342, the character dictionary section 258 is accessed, the partial pattern is input, and in step 5343, the process proceeds to step 5342. It is determined whether the partial pattern is included in the candidate partial patterns, and if so, the process proceeds to step 5344, where the character pattern belonging to the group of this partial pattern is compared with the input character pattern, and step 534
Return to 1.

ステップ５３４３で部分パターンが候補部分パターンに
含まれていなければステップ５３４５へ進み、その部分
パターンのグループに含まれる文字パターンをスキップ
し、ステップ５３４１へ戻る。ステップ５３４４で比較
を行うか、ステップ５３４５で文字パターンをスキップ
することにより、次の部分パターンのグループに到達す
る。画数辞書部２５８ａの終りに到達すると、ステップ
５３４１で画数辞書部２５８ａの終りを判断して終了す
る。他の画数辞書部２５８ａについて比較する場合は、
再びステップ５３４０より開始する。If the partial pattern is not included in the candidate partial patterns in step 5343, the process advances to step 5345, where character patterns included in the group of partial patterns are skipped, and the process returns to step 5341. The next group of subpatterns is reached by performing a comparison in step 5344 or by skipping character patterns in step 5345. When the end of the stroke number dictionary section 258a is reached, the end of the stroke number dictionary section 258a is determined in step 5341, and the process ends. When comparing other stroke number dictionary sections 258a,
The process starts again at step 5340.

尚、本実施例において、部分パターン判別部は３個の候
補部分パターンを出力しているが、必ずしも３個である
必要はなく、１個あるいは５個というように少数個であ
れば良い、又、多くとも３個以内というようにある一定
個以内の候補部分パターンを出力するようにしてもよい
、さらに、矢力された文字パターンの特徴と、各部分パ
ターン辞書部との比較において、入力された文字パター
ンがすべての部分パターンと大きく異なる、すなわち、
相違度が非常に大きい場合は候補部分パターンが無いこ
とを出力し、そのような場合、文字判別部は入力された
文字パターンをすべての文字パターンと比較するように
してもよい。又、第１実施例と第２実施例とを組み合わ
せて、それぞれの文字によって、より認識率の高く、認
識時間の短い方を選ぶような構成も考えられる。この場
合は候補部分パターンとして、相違度が所定のしきい値
より小さいものを還び、小さいものがない場合は、第１
実施例の比較をする等の組み合わせが考えられる。In this embodiment, the partial pattern discriminator outputs three candidate partial patterns, but the number does not necessarily have to be three, and may be as few as one or five. , it may be possible to output a certain number of candidate partial patterns, such as at most three candidate partial patterns.Furthermore, in comparing the characteristics of the input character pattern with each partial pattern dictionary section, the input If the character pattern is significantly different from all subpatterns, i.e.
If the degree of difference is very large, it may be output that there is no candidate partial pattern, and in such a case, the character discrimination unit may compare the input character pattern with all character patterns. It is also conceivable to combine the first embodiment and the second embodiment and select the one with a higher recognition rate and shorter recognition time depending on each character. In this case, those whose dissimilarity is smaller than a predetermined threshold are returned as candidate partial patterns, and if there is no smaller dissimilarity, the first partial pattern is returned.
Combinations such as comparing examples are possible.

以上説明したように、辞書部に文字辞書部とともに部分
パターン辞書部を設け、部分パターン辞書部に各部分パ
ターンの少なくともストローク数と、各ストロークの端
点の座標と、ストロークの形を表わす情報と、１文字の
情報の中からその部分パターンを切り出すための情報を
登録し、入力された文字パターンに対して部分パターン
辞書部を用いて一部分パターン判別を行い、その結果に
よって比較すべき文字パターンを選別し、入力された文
字パターンの特徴と文字パターンとの比較を行うことに
より、入力された文字パターンの特徴と文字パターンと
の比較回数が減少し、認識時間の短いパターン認識装置
が実現される。ひいては、認識文字種の拡大が可能にな
る一方、多くの文字パターンを登録可能となる他、１文
字パターンとの比較にかかる処理時間が問題にならなく
なるため、画数や筆順などの制限の緩和された情報処理
装置が可能になる。As explained above, the dictionary section is provided with a partial pattern dictionary section together with a character dictionary section, and the partial pattern dictionary section includes at least the number of strokes of each partial pattern, the coordinates of the end points of each stroke, and information representing the shape of the stroke. Register information for cutting out a partial pattern from one character's information, use the partial pattern dictionary section to identify partial patterns for the input character pattern, and select character patterns to be compared based on the results. However, by comparing the characteristics of the input character pattern with the character pattern, the number of comparisons between the characteristics of the input character pattern and the character pattern is reduced, and a pattern recognition device with short recognition time can be realized. As a result, it becomes possible to expand the number of recognized characters, register many character patterns, and eliminate the processing time required for comparison with single character patterns, which eases restrictions such as the number of strokes and stroke order. Information processing equipment becomes possible.

尚、本実施例においては、説明を記号１図形を含む文字
パターンについて行ったが、音声１画像等のパターン認
識においても、まとまりを持った部分パターンへの分割
や出現率の高い部分パターンの予備判断等の技術思想は
広く利用できる。In this example, the explanation was given for a character pattern that includes one symbol, one figure, but in pattern recognition such as audio and one image, division into cohesive subpatterns and backup of subpatterns with a high occurrence rate are also possible. Technical ideas such as judgment can be widely used.

［発明の効果］本発明により、パターン作成時の筆順変動を十分に考慮
した上で、パターンを基本ストロークに分解して、予め
登録されているパターンのストロークと比較することに
より、続は字を認識でき、かつ認識時間の短いパターン
認識装置を提供できる。[Effects of the Invention] According to the present invention, the pattern can be broken down into basic strokes, and the strokes can be compared with the strokes of a pattern registered in advance, with full consideration given to variations in stroke order during pattern creation. It is possible to provide a pattern recognition device that can be recognized and has a short recognition time.

[Brief explanation of the drawing]

第１図（ａ）は第１実施例のパターン認識装置の構成図
、第１図（ｂ）は文字判別部の構成図、第２図は辞書部の構造を示す図、第３図（ａ）、（ｂ）は「思」の文字ブロックの構造を
示す図、第４図は措置の種々な筆順の「思」を示す図、第５図（
ａ）は措置の「思」の比較結果を示す図、第５図（ｂ）は相違度を表わす重み付けを示す図、第６図は２分割の場合の処理手順を示すフローチャート
、第７図は続は書きされた「思」の筆記例を示す図、第８図（ａ）は続は書きされた「思」の入力された文字
パターンと辞書部の文字パターンとの比較結果を示す図
、第８図（ｂ）は相違度を表わす重み付けを示す図、第９図は「数」の筆記例を示す図、第１０図（ａ）、（ｂ）は「数」の文字ブロックの構成
図、第１１図はｒ数」の入力された文字パターンと辞書部の
文字パターンとの比較結果を示す図、第１２図は任意の
分割の場合の処理手順のフローチャート、第１３図は辞書部に登録されている「思」と「計」の筆
記例を示す図、第１４図は「思」と「計」のストローク対応を示す図、第１５図は相違度により処理を中断する場合の入力され
た文字パターンと辞書部の文字パターンとの処理手順を
示すフローチャート、第１６図は基本ストローク表の一例を示す図、ＴＳ１７
図は「何」の文字パターンの一例を示す図、第１８図は「何」の筆記例を示す図、第１９図、第２０図は認識手順を示すフローチャート、第２１図は基本ストローク間相違度表の一部を示す図、第２２図は「何」の筆記例を示す図、第２３図は第ｎ図の「何」の筆記例の各ストロークの基
本ストローク番号と始終点の座標を示す図、第２４図は基本ストロークに分解した場合の入力された
文字パターンと辞書部の文字パターンのストロークの対
応関係を示す図、第２５図は第２実施例のパターン認識装置の構成図、第２６図は部分パターンの側口、第２７図は基本ストロークの側口、第２８図は部分パターン辞書部の構成図、第２９図は部
分パターン辞書部に登録されている「糸」の部分パター
ンの一例図、 ′ｆＳ３ｏ図は「糸」の部分パターンを示、した図、第
３１図は部分パターン判別部の処理手順を示すフローチ
ャート、第３２図は基本ストローク間の相違度表の一部を示す図
、第３３図は文字辞書部の構成を示す図、第３４図は文字
判別部の処理手順を示すフローチャートである。図中、１・・・タブレット、２・・・入力ペン、３・・
・特徴抽出部、４・・・文字判別部、５・・・辞書部、
４１・・・ＣＰＵ、４２・・・ＲＯＭ、４３・・・ＲＡ
Ｍ、２５１・・・情報入力部、２５２・・・入力ペン、
２５３・・・特徴抽出部、２５４・・・部分パターン判
別部、２５５・・・文字判別部、２５６・・・辞書部、
２５７・・・部分パターン辞書部、２５８・・・文字辞
書部、２５４ａ。２５５ａ・・・ＲＯＭである。特許出願人　　キャノン株式会社第１図（ａ）第２図第３図（ａ）　　　　第３図（ｂ）第４図１）　　　　　　Ｓｕ゛第５図（ａ）第５図（ｂ）第８図（Ｑ）第８図（ｂ）第１１図第１６図第１７図第１８図第１９図第２０図第２１図第２２図第２３図第２５図第２６図第２８図第２９図第３０図第３１図第３２図第３３図FIG. 1(a) is a block diagram of the pattern recognition device of the first embodiment, FIG. 1(b) is a block diagram of the character discrimination section, FIG. 2 is a diagram showing the structure of the dictionary section, and FIG. 3(a) is a block diagram of the pattern recognition device of the first embodiment. ), (b) is a diagram showing the structure of the character block for ``shi'', Figure 4 is a diagram showing ``shi'' in various stroke orders of measures, and Figure 5 (
Figure 5 (b) is a diagram showing the weighting that represents the degree of difference, Figure 6 is a flowchart showing the processing procedure in the case of two divisions, and Figure 7 is a diagram showing the comparison results of the "thoughts" of the measures. The following is a diagram showing a handwritten example of the written ``shi'', and Figure 8 (a) is a diagram showing the comparison result between the input character pattern of the written ``shi'' and the character pattern in the dictionary section. Figure 8 (b) is a diagram showing the weighting representing the degree of difference, Figure 9 is a diagram showing an example of writing "number", and Figures 10 (a) and (b) are configuration diagrams of the character block of "number". , Fig. 11 is a diagram showing the comparison result of the input character pattern of "r number" and the character pattern of the dictionary section, Fig. 12 is a flowchart of the processing procedure in case of arbitrary division, and Fig. 13 is a diagram showing the comparison result of the input character pattern of "r number" and the character pattern of the dictionary section. Figure 14 shows an example of the registered writing of "shi" and "ke", Figure 14 shows the stroke correspondence between "shi" and "ke", Figure 15 shows the input when interrupting the process due to the degree of difference. Flowchart showing the processing procedure of the character pattern and the character pattern of the dictionary part, FIG. 16 is a diagram showing an example of the basic stroke table, TS17
The figure shows an example of the character pattern for ``What''. Figure 18 shows an example of how the word ``What'' is written. Figures 19 and 20 are flowcharts showing the recognition procedure. Figure 21 shows the differences between basic strokes. Figure 22 shows an example of how to write "What"; Figure 23 shows the basic stroke number and coordinates of the start and end points of each stroke in the example of how to write "What" in Figure n. FIG. 24 is a diagram showing the correspondence between the strokes of the input character pattern and the character pattern in the dictionary section when it is decomposed into basic strokes; FIG. 25 is a configuration diagram of the pattern recognition device of the second embodiment; Figure 26 is the side opening of the partial pattern, Figure 27 is the side opening of the basic stroke, Figure 28 is a configuration diagram of the partial pattern dictionary, and Figure 29 is the "thread" part registered in the partial pattern dictionary. Figure 31 is a flowchart showing the processing procedure of the partial pattern discriminator, Figure 32 is part of the difference table between basic strokes. FIG. 33 is a diagram showing the configuration of the character dictionary section, and FIG. 34 is a flowchart showing the processing procedure of the character discrimination section. In the figure, 1...tablet, 2...input pen, 3...
・Feature extraction unit, 4...Character discrimination unit, 5...Dictionary unit,
41...CPU, 42...ROM, 43...RA
M, 251... Information input section, 252... Input pen,
253...Feature extraction unit, 254...Partial pattern discrimination unit, 255...Character discrimination unit, 256...Dictionary unit,
257...Partial pattern dictionary section, 258...Character dictionary section, 254a. 255a...ROM. Patent applicant: Canon Co., Ltd. Figure 1 (a) Figure 2 Figure 3 (a) Figure 3 (b) Figure 4 (1) Figure 5 (a) Figure 5 (b) Figure 8 ( Q) Figure 8(b) Figure 11 Figure 16 Figure 17 Figure 18 Figure 19 Figure 20 Figure 21 Figure 22 Figure 23 Figure 25 Figure 26 Figure 28 Figure 29 Figure 30 Figure 31 Figure 32 Figure 33

Claims

[Claims]

(1) In a pattern recognition device that recognizes the input pattern by comparing the input pattern with a pre-registered pattern, the input pattern is decomposed into basic strokes. It is characterized by comprising a comparison means for comparing stroke information and stroke information including stroke end point coordinates registered in advance, and a recognition means for recognizing the input pattern based on the comparison result of the comparison means. pattern recognition device.

(2) The pattern recognition device according to claim 1, wherein the pattern is a character pattern including symbols and figures.

(3) The comparison means compares the difference between the strokes based on the sum of the value representing the difference in the coordinates of the end points and the value representing the difference in the basic stroke, and a predetermined tolerance value. The pattern recognition device according to scope 1.

(4) The recognition means recognizes the difference between the patterns based on the magnitude of a value representing the difference in stroke between the input pattern and the pre-registered pattern and a predetermined tolerance value. The pattern recognition device according to scope 1.

(5) The pattern recognition device according to claim 4, wherein the pre-registered pattern includes a code for identifying the pattern and a stroke representing a stroke of the pattern.