JP3290110B2

JP3290110B2 - Handwritten character recognition device

Info

Publication number: JP3290110B2
Application number: JP27334797A
Authority: JP
Inventors: 良英馬籠; 匡紀戸田; 直人大内; 金玲胡
Original assignee: Kenwood KK
Current assignee: Kenwood KK
Priority date: 1997-09-22
Filing date: 1997-09-22
Publication date: 2002-06-10
Anticipated expiration: 2017-09-22
Also published as: JPH1196303A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は手書き文字認識装置
に関し、さらに詳細には、文字の部首偏旁冠脚などに基
づきオンライン手書き文字を認識する手書き文字認識装
置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a handwritten character recognition device, and more particularly, to a handwritten character recognition device for recognizing an online handwritten character on the basis of a non-radical part of a character.

【０００２】[0002]

【従来の技術】従来のオンライン手書き文字認識装置と
して、例えば特公昭６０−３２３８号公報、特開平２−
６９８８７号公報などに開示されているものが知られて
いる。2. Description of the Related Art As a conventional online handwritten character recognition device, for example, Japanese Patent Publication No. 60-3238,
Japanese Patent Application Laid-Open No. 69887 and the like are known.

【０００３】特公昭６０−３２３８号公報に開示された
オンライン手書き文字認識方式は、漢字などの文字の字
形のトップ・ダウン記述を利用している。つまり、一個
の漢字は１個または複数個のコンポーネントの集合とし
て定義され、コンポーネントは1個または複数個のエレ
メントの集合として定義され、エレメントは同定された
1個または複数個のストロークの集合として定義され
る。The on-line handwritten character recognition system disclosed in Japanese Patent Publication No. 60-3238 uses a top-down description of the character shape of a character such as a kanji. That is, one kanji is defined as a set of one or more components, a component is defined as a set of one or more elements, and the elements are identified.
Defined as a set of one or more strokes.

【０００４】このような文字の字形のトップ・ダウン記
述に基づく認識方式では、識別文字の字種の増加に対処
し易いことおよび変形した文字も認識できることなどを
意図している。[0004] Such a recognition method based on the top-down description of the character shape of a character is intended to easily cope with an increase in the character type of the identification character and to be able to recognize a deformed character.

【０００５】また、特開平２−６９８８７号公報に開示
されたオンライン手書き文字認識装置においては、文字
を比較判定する際、入力文字の特徴データと、特徴辞書
に記憶された文字の特徴データとを１文字ずつ比較判定
して、辞書に記憶された文字の偏や旁などの各部分毎に
その合否の判定を行うことにより、文字認識の高速化を
意図している。In the online handwritten character recognition device disclosed in Japanese Patent Laid-Open No. 2-69888, when comparing and judging a character, the characteristic data of the input character and the characteristic data of the character stored in the characteristic dictionary are compared. It is intended to speed up character recognition by comparing and judging one character at a time and judging the pass / fail of each part of the characters stored in the dictionary, such as bias or side.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、上記し
た従来の技術の前者によるときは、続け字などの文字の
変形に対処するために、より多種の基本ストロークとエ
レメントの準備が必要であり、認識処理時間の増大を招
くという問題点があった。また、上記した従来の技術の
後者によるときは、同じ偏や旁などの部分を持つ文字で
も、毎回最初から判定を行うために、認識処理の効率が
よいと言えないという問題点があった。However, in the case of the former of the prior art described above, it is necessary to prepare more basic strokes and elements in order to deal with deformation of characters such as continuation characters. There is a problem that the processing time is increased. Also, in the latter case of the above-mentioned conventional technique, there is a problem that even if a character has the same partiality, such as a partiality, a determination is made from the beginning every time, and the efficiency of the recognition processing cannot be said to be high.

【０００７】本発明は、続け字などの文字の変形にも対
処でき、かつ高速に文字認識を行うことができる手書き
文字認識装置を提供することを目的とする。SUMMARY OF THE INVENTION It is an object of the present invention to provide a handwritten character recognition device capable of coping with deformation of characters such as continuous characters and performing high-speed character recognition.

【０００８】[0008]

【課題を解決するための手段】本発明の手書き文字認識
装置は、文字を大分類するための大分類情報と同一の大
分類情報を有する文字を形成する部分パターンを複数の
文字に対して記憶させた記憶手段と、筆順の順に読み込
んだ入力文字の大分類情報と同一の大分類情報を有する
文字を記憶手段に記憶の文字中から候補文字として選択
する選択手段と、入力文字を形成する部分パターンと記
憶手段に記憶されている候補文字を形成する部分パター
ン中の対応する部分パターンとを順次１回づつのみ照合
する照合手段と、照合毎に入力文字を形成する部分パタ
ーンに対する被照合部分パターンの差異を求め、該差異
が予め定めた閾値を超える被照合部分パターンを有する
候補文字を選択手段によって選択された候補文字中から
除去していく候補文字選別手段と、を備え、候補文字選
別手段によって残った候補文字に基づいて入力文字を認
識することを特徴とする。SUMMARY OF THE INVENTION A handwritten character recognition apparatus of the present invention stores a partial pattern forming a character having the same large classification information as the large classification information for large classification of a character for a plurality of characters. Storage means, a selection means for selecting a character having the same large classification information as the large classification information of the input character read in the stroke order as a candidate character from the characters stored in the storage means, and a part for forming the input character Collating means for sequentially and only once collating a pattern and a corresponding partial pattern in a partial pattern forming a candidate character stored in a storage means; and a collated partial pattern for a partial pattern forming an input character for each collation Of the candidate character having the partial pattern to be compared whose difference exceeds a predetermined threshold from the candidate characters selected by the selection means. It includes a character selection means, the, and recognizes an input character based on the remaining candidate characters by the candidate character selection means.

【０００９】本発明の手書き文字認識装置は、筆順の順
に読み込んだ入力文字の大分類情報と同一の大分類情報
を有する文字が選択手段によって記憶手段に記憶の文字
中から候補文字として選択される。入力文字を形成する
部分パターンと記憶手段に記憶されている候補文字を形
成する部分パターン中の対応する部分パターンとが照合
手段によって順次１回づつのみ照合され、照合毎に入力
文字を形成する部分パターンに対する被照合部分パター
ンの差異が求められて、該差異が予め定めた閾値を超え
る被照合部分パターンを有する候補文字が、選択手段に
よって選択された候補文字中から候補文字選別手段によ
って除去されていく。したがって、大分類情報による選
択が選択手段によってなされて、認識文字が絞られ、絞
られた認識文字がさらに部分パターンの照合によって絞
られることになって、文字認識が速く行われることにな
る。In the handwritten character recognition apparatus of the present invention, a character having the same large classification information as the large classification information of the input character read in the stroke order is selected as a candidate character from the characters stored in the storage means by the selection means. . The partial pattern forming the input character and the corresponding partial pattern in the partial pattern forming the candidate character stored in the storage means are sequentially and only once collated by the collation means, and the part forming the input character for each collation A difference between the pattern to be compared with the pattern is determined, and candidate characters having the pattern to be compared with the difference exceeding a predetermined threshold are removed from candidate characters selected by the selection unit by the candidate character selection unit. Go. Therefore, the selection based on the large classification information is made by the selection means, the recognition characters are narrowed down, and the narrowed down recognition characters are further narrowed down by collation of the partial patterns, so that the character recognition is performed quickly.

【００１０】手書き文字認識装置において、記憶手段に
記憶する部分パターンはセグメントからなり、かつ文字
構成上必ず存在するセグメントのほかに、続け字や走り
書きの曲線部として現れる可能性のあるセグメントも含
み、入力文字を形成する部分パターンと、記憶手段に記
憶されている文字構成上必ず存在するセグメントと続け
字や走り書きの曲線部として現れる可能性のあるセグメ
ントとからなる部分パターンとを照合し、文字構成上必
ず存在するセグメントと入力文字を形成する部分パター
ン中の対応する部分パターンとの差異を求めることを特
徴とする。[0010] In the handwritten character recognition device, the partial pattern stored in the storage means is composed of segments. In addition to the segments which are always present in the character configuration, the partial patterns include segments which may appear as curved portions of continuous characters or scribbles. The partial pattern forming the input character is compared with the partial pattern consisting of the segment which is always present in the character configuration stored in the storage means and the segment which may appear as a continuation character or a scribble curve portion, and character configuration is performed. It is characterized in that a difference between a segment always present above and a corresponding partial pattern in a partial pattern forming an input character is obtained.

【００１１】手書き文字認識装置において、記憶手段に
文字構成上必ず存在するセグメントのほかに、続け字や
走り書きの曲線部として現れる可能性のあるセグメント
も含む部分パターンが部分パターンとして記憶されてい
て、入力文字を形成する部分パターンとの照合に際して
は、文字構成上必ず存在するセグメントと続け字や走り
書きの曲線部として現れる可能性のあるセグメントとか
らなる部分パターンと照合し、入力文字の部分パターン
の文字構成上必ず存在するセグメントと、記憶手段に記
憶されている文字構成上必ず存在するセグメントとの差
異が求められる。したがって、入力文字が楷書のときに
も続け字のときにも対応することができる。In the handwritten character recognition device, a partial pattern including a segment which may appear as a continuation character or a scribble curve portion in addition to a segment which always exists in the character configuration is stored in the storage means as a partial pattern. When matching with a partial pattern forming an input character, the partial pattern of the input character is compared with a partial pattern consisting of a segment that always exists in the character configuration and a segment that may appear as a continuation character or a scribble curve part. The difference between the segment that is always present in the character configuration and the segment that is always present in the character configuration stored in the storage means is required. Therefore, it is possible to cope with both the case where the input characters are in the regular style and the case where the input characters are the continuous characters.

【００１２】[0012]

【発明の実施の形態】以下、本発明にかかる手書き文字
認識装置を実施の形態によって説明する。図１は本発明
の実施の一形態にかかる手書き文字認識装置の構成を機
能的に示したブロック図である。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a handwritten character recognition device according to the present invention will be described with reference to embodiments. FIG. 1 is a block diagram functionally showing the configuration of a handwritten character recognition device according to one embodiment of the present invention.

【００１３】本明細書において、以下、部分パターンの
語は文字の偏旁冠脚を含む部首の全部のパターンまたは
一部のパターンの意味で使用し、文字を形成する部首の
パターン、偏旁のパターン、部首の一部を形成するパタ
ーンおよび偏旁の一部を形成するパターンを含み、かつ
それぞれのパターンはセグメントから構成されている。In the present specification, the term "partial pattern" is used hereinafter to mean the entire pattern or a part of the radical including the paraleg of the character, and the pattern of the radical forming the character, It includes a pattern, a pattern forming a part of a radical, and a pattern forming a part of a side, and each pattern is composed of a segment.

【００１４】タブレットなどオンライン文字入力機器か
ら、手書きの文字を筆順の順に文字入力部１にて読み取
り、一旦記憶装置に記憶する。文字入力部１へ入力され
た文字に対して、同一ストローク内に同一方向のセグメ
ントが続いているときは直線とし、また予め定めた範囲
内の湾曲部を含む範囲のセグメントを直線とするなどの
直線近似、正規化、ノイズ除去処理などの前処理を前処
理部２において行う。From an online character input device such as a tablet, handwritten characters are read by the character input unit 1 in the order of strokes and are temporarily stored in a storage device. For a character input to the character input unit 1, when a segment in the same direction continues in the same stroke, a straight line is used, and a segment in a range including a curved portion within a predetermined range is used as a straight line. The pre-processing unit 2 performs pre-processing such as linear approximation, normalization, and noise removal processing.

【００１５】前処理部２において処理されて近似された
直線を、それが引かれた方向によって表１に示した大分
類コードに基づいて大分類コード化し、表１において示
したセグメントコードに基づいてセグメントコード化す
る。ここで、図２は表１と協同して大分類コード化およ
びセグメントコード化の説明に供する図であって、前処
理部２において処理されて近似された直線の方向に基づ
いて、セグメント化処理部３においてセグメントコード
化処理される。図２において大分類コードとセグメント
コードを重畳して示してある。なお、表１における右払
いは左から右へ斜めに書かれた文字の部分を意味し、左
払いは右から左へ斜めに書かれた文字の部分を意味して
いる。The straight line processed and approximated by the preprocessing unit 2 is converted into a large classification code based on the large classification code shown in Table 1 according to the direction in which the straight line is drawn, and based on the segment code shown in Table 1. Segment coding. Here, FIG. 2 is a diagram provided for explanation of the large classification coding and the segment coding in cooperation with Table 1, and based on the direction of the approximated straight line processed and approximated by the preprocessing unit 2, the segmentation processing is performed. The unit 3 performs a segment coding process. In FIG. 2, the large classification code and the segment code are shown superimposed. In Table 1, right payment means a character portion written diagonally from left to right, and left payment means a character portion written diagonally from right to left.

【００１６】[0016]

【表１】 [Table 1]

【００１７】例えば前処理された結果、横に引かれた直
線の方向が、３５０°〜７０°の範囲に入っているとき
は、大分類コード化によって大分類コード〃Ｉ〃にコー
ド化され、セグメントコード化によってセグメントコー
ド〃０〃にコード化される。For example, as a result of preprocessing, when the direction of the straight line drawn in the horizontal direction falls within the range of 350 ° to 70 °, the line is coded into the large classification code {I} by the large classification coding, It is coded into segment code {0} by segment coding.

【００１８】セグメント化処理部３において処理された
入力文字のセグメント数および最初と最後のセグメント
の大分類コードが同一であるＪＩＳコードなどの文字コ
ードが、文字辞書９内の文字コード群内から、候補文字
として大分類部４にて大雑把に大分類して抽出される。
ここでの最初と最後のセグメントの大分類コードは、セ
グメントコードが〃０〃、或は〃１〃のとき、大分類コ
ードは〃Ｉ〃に分類され、セグメントコードが〃２〃、
或は〃３〃のとき、大分類コードは〃ＩＩ〃に分類さ
れ、セグメントコードが〃４〃のとき、大分類コードは
〃Ｉ〃に分類され、セグメントコードが〃５〃のとき、
大分類コードは〃Ｉ〃または〃ＩＩ〃に分類され、セグ
メントコードが〃６〃のとき、大分類コードは〃ＩＩ〃
に分類される。A character code such as a JIS code in which the number of input characters processed in the segmentation processing unit 3 and the large classification code of the first and last segments are the same is extracted from the character code group in the character dictionary 9. The large characters are roughly classified and extracted by the large classification unit 4 as candidate characters.
Here, the large classification code of the first and last segments is classified into {I} when the segment code is {0} or {1}, and the segment code is {2},
Alternatively, when {3}, the large classification code is classified as {II}, when the segment code is {4}, the large classification code is classified as {I}, and when the segment code is {5},
The major classification code is classified into {I} or {II}. When the segment code is {6}, the major classification code is {II}.
are categorized.

【００１９】ここで、文字辞書９には、（ａ）入力文字のＪＩＳコードなどの文字コード。（ｂ）入力文字に対して現われる可能性のある最小セグ
メント数、；大分類に利用される。（ｃ）入力文字に対して現われる可能性のある最大セグ
メント数、；大分類に利用される。（ｄ）入力文字の最初のセグメントの方向の種別、；大
分類に利用される。（ｅ）入力文字の最後のセグメントの方向の種別、；大
分類に利用される。（ｆ）入力文字を構成する部分パターン数、；部分パタ
ーンの照合に利用される。（ｇ）書かれた順序で格納された部分パターン辞書の番
号、；部分パターンの位置照合に利用される。（ｈ）部分パターンの位置関係情報、；部分パターン照
合の後、例えば文字〃員〃と文字〃唄〃等部分パターン
位置が異なる文字を区別するのために、文字を構成する
部首、偏旁位置の照合に利用される。などが格納されている。Here, the character dictionary 9 includes (a) a character code such as a JIS code of an input character. (B) The minimum number of segments that may appear for an input character; used for major classification. (C) the maximum number of segments that may appear for input characters; used for major classification. (D) Type of direction of the first segment of the input character; used for major classification. (E) Type of the direction of the last segment of the input character; used for major classification. (F) The number of partial patterns constituting the input character; used for collation of the partial patterns. (G) Partial pattern dictionary numbers stored in the order in which they were written; used for position matching of partial patterns. (H) positional relationship information of the partial patterns; after the partial pattern matching, radicals constituting the characters and side by side positions to distinguish characters having different partial pattern positions such as, for example, character {member} and character {song} Used for matching. Are stored.

【００２０】ここで、入力文字に対して現われる可能性
のある最小セグメント数と、入力文字に対して現われる
可能性のある最大セグメント数とが文字辞書９に格納し
てあるのは、楷書の場合と続け字の場合とによってセグ
メント数が異なり、幅があるためである。Here, the minimum number of segments that may appear for an input character and the maximum number of segments that may appear for an input character are stored in the character dictionary 9 in the case of a block style. This is because the number of segments differs depending on the case of the continuation character, and there is a width.

【００２１】入力文字におけるセグメント数が、入力文
字に対して現われる可能性のある最小セグメント数と入
力文字に対して現われる可能性のある最大セグメント数
との間にあり、かつ、入力文字の最初のセグメントの大
分類コードが文字辞書９に格納されている文字の最初の
セグメントの大分類コードと同一であり、入力文字の最
後のセグメントの方向が文字辞書９に格納されている文
字の最後のセグメントの大分類コードと同一である文字
が、大分類部４によって文字辞書９から候補文字として
大雑把に絞り込まれる。ここで、大分類部４は、入力文
字に基づいて候補文字を選択する選択手段に対応してい
る。The number of segments in the input character is between the minimum number of segments that can appear for the input character and the maximum number of segments that can appear for the input character, and The major classification code of the segment is the same as the major classification code of the first segment of the character stored in the character dictionary 9, and the direction of the last segment of the input character is the last segment of the character stored in the character dictionary 9. The character that is the same as the large classification code is roughly narrowed down from the character dictionary 9 by the large classification unit 4 as a candidate character. Here, the large classification unit 4 corresponds to a selection unit that selects a candidate character based on an input character.

【００２２】大分類部４によって文字辞書９内から大雑
把に検出された候補文字を構成する部分パターンが部分
パターン照合部５によって照合される。部分パターンの
照合部５における照合は、大分類部４によって絞り込ま
れた候補文字について格納されている部分パターンの辞
書８の番号が文字辞書９中から検索することによりなさ
れる。The partial pattern constituting the candidate character roughly detected from the character dictionary 9 by the large classification unit 4 is collated by the partial pattern collation unit 5. The matching of the partial pattern in the matching unit 5 is performed by searching the character dictionary 9 for the number of the dictionary 8 of the partial pattern stored for the candidate character narrowed down by the large classification unit 4.

【００２３】部分パターン辞書８には、（ａ）部分パタ
ーン辞書の番号、（ｂ）部分パターンに対して現われる
可能性のある筆順数（Ｏｎ）、（ｃ）筆順１のセグメン
トの方向コード列および各セグメント間位置関係情報、
（ｄ）筆順２のセグメントの方向コード列および各セグ
メント間位置関係情報、（ｅ）筆順３のセグメントの方
向コード列および各セグメント間位置関係情報、……、
などが格納されている。ここで、部分パターン辞書８と
文字辞書９とは記憶手段に対応している。また、部分パ
ターンは部首、部首の一部を構成するセグメントから形
成されていることは前記のとおりである。The partial pattern dictionary 8 includes (a) the number of the partial pattern dictionary, (b) the number of stroke orders (On) that may appear for the partial pattern, (c) the direction code sequence of the segment of the stroke order 1, and Information on the positional relationship between segments,
(D) the direction code sequence of the segment in the stroke order 2 and the positional relationship information between each segment; (e) the direction code sequence of the segment in the stroke order 3 and the positional relationship information between each segment.
Are stored. Here, the partial pattern dictionary 8 and the character dictionary 9 correspond to storage means. Further, as described above, the partial pattern is formed from a radical and a segment constituting a part of the radical.

【００２４】文字辞書９中から絞り込まれた候補文字を
構成する部分パターン辞書の番号が参照されて、該番号
と同一番号の部分パターン辞書８が検索されて、検索さ
れた部分パターン辞書８に格納されている各セグメント
の方向と入力文字のセグメントの方向との照合が行わ
れ、かつ検索された部分パターン辞書８に格納されてい
るセグメント間相対位置関係と入力文字のセグメント間
相対位置関係との比較が行われ、照合と比較結果に重み
付けされて、この重み付けされた照合と比較結果に基づ
いてペナルティが算出される。照合処理済みの入力文字
のセグメント数は文字辞書内の部分パターン毎に一旦保
存される。By referring to the number of the partial pattern dictionary constituting the candidate character narrowed down from the character dictionary 9, the partial pattern dictionary 8 having the same number as the number is searched and stored in the searched partial pattern dictionary 8. The direction of each segment is compared with the direction of the segment of the input character, and the relative positional relationship between segments stored in the searched partial pattern dictionary 8 and the relative positional relationship between input characters are compared. The comparison is performed, the collation and the comparison result are weighted, and a penalty is calculated based on the weighted collation and the comparison result. The number of segments of the input character having undergone the collation processing is temporarily stored for each partial pattern in the character dictionary.

【００２５】算出されたペナルティが予め定めた閾値と
比較されて閾値を超えているか否かを判定する閾値処理
が行われて、閾値を超えているときは不合格と判定さ
れ、不合格と判定された部分パターンがあれば、その部
分パターンを持つすべての文字が候補文字から除去され
る。次いで、残された候補文字を構成する次の部分パタ
ーン辞書８が検索され、該部分パターン辞書８に対して
閾値処理が繰り返される。ここで、部分パターン照合部
５は、入力文字の部分パターンと候補文字の部分パター
ンを照合する照合手段および選択手段によって選択され
た候補文字をさらに選別する候補文字選別手段に対応し
ている。The calculated penalty is compared with a predetermined threshold value to perform a threshold process for determining whether or not the threshold value exceeds the threshold value. If there is a partial pattern, all characters having that partial pattern are removed from the candidate characters. Next, the next partial pattern dictionary 8 constituting the remaining candidate characters is searched, and the threshold processing is repeated on the partial pattern dictionary 8. Here, the partial pattern matching unit 5 corresponds to a matching unit that matches the partial pattern of the input character with the partial pattern of the candidate character and a candidate character selecting unit that further selects the candidate character selected by the selecting unit.

【００２６】部分パターン照合部５の処理で残った候補
文字に対して、部分パターン位置照合部６で入力文字の
部分パターンの位置関係と候補文字の部分パターンの位
置関係との照合が行われ、部分パターンの位置関係が入
力文字の部分パターンの位置関係候補と同じ候補文字が
選択されて、部分パターン照合部５の処理で得られたペ
ナルティと総合して認識文字候補が最終的に決定され
る。すなわち、部分パターン位置照合部６の処理で残っ
た候補文字中のペナルティが一番小さいものが認識結果
出力部７から認識結果として出力される。With respect to the candidate characters remaining in the process of the partial pattern matching unit 5, the partial pattern position matching unit 6 checks the positional relationship between the partial patterns of the input characters and the partial pattern of the candidate characters, A candidate character whose positional relationship is the same as the positional relationship candidate of the partial pattern of the input character is selected, and the recognized character candidate is finally determined in combination with the penalty obtained by the processing of the partial pattern matching unit 5. . That is, the candidate character remaining in the processing of the partial pattern position matching unit 6 with the smallest penalty is output from the recognition result output unit 7 as the recognition result.

【００２７】次ぎに、表２に、文字〃員〃、〃唄〃、〃
消〃、〃浸〃、〃粋〃、〃粉〃を定義する部分パターン
とそれらの部分パターン辞書の番号の例を示す。文字〃
員〃、〃唄〃、〃消〃、〃浸〃、〃粋〃、〃粉〃は部分
パターン辞書の番号ｎ１〜ｎ１３中の部分パターンを組
合せることによって表わされる。Next, Table 2 shows the characters {member}, {song},
Here are examples of partial patterns that define consumption, immersion, refinement, and powdering and numbers of those partial pattern dictionaries. letter〃
The members {song}, {song}, {soak}, {sin}, {powder} are represented by combining the partial patterns in the partial pattern dictionary numbers n1 to n13.

【００２８】[0028]

【表２】 [Table 2]

【００２９】表３に文字〃員〃に対する文字辞書９の記
憶内容を示す。文字、〃員〃は表２から明らかなように
部分パターン、口、目、八に対応する部分パターンによ
って構成され、部分パターン数は〃３〃であり、楷書に
よるときのセグメント数は〃１２〃であるが、続け字に
よる場合に対して余裕をみて最小セグメント数は〃１０
〃、最大セグメント数は〃２４〃とされ、最初のセグメ
ントの大分類コード〃ＩＩ〃であり、最後のセグメント
の大分類コードは〃Ｉ〃であり、文字を構成する部分パ
ターン辞書の番号はｎ１、ｎ２、ｎ３である。Table 3 shows the stored contents of the character dictionary 9 for the characters {member}. Characters and {members} are composed of partial patterns, mouths, eyes, and eight corresponding partial patterns, as is clear from Table 2. The number of partial patterns is {3}, and the number of segments in square writing is {12}. However, the minimum number of segments is $ 10 to allow for the continuation character case.
{, The maximum number of segments is assumed to be {24}, the major classification code of the first segment is {II}, the major classification code of the last segment is {I}, and the number of the partial pattern dictionary constituting the character is n1 , N2 and n3.

【００３０】[0030]

【表３】 [Table 3]

【００３１】表４に文字〃員〃、〃唄〃、〃消〃、〃浸
〃、〃粋〃、〃粉〃に対する部分パターン辞書の番号の
例を示し、部分パターン辞書の番号順によって、対応す
る文字が構成できることは容易に理解できよう。Table 4 shows an example of partial pattern dictionary numbers for the characters {member}, {song}, {consequence}, {soak}, [genki], and {powder}. It can be easily understood that the characters can be composed.

【００３２】[0032]

【表４】 [Table 4]

【００３３】部分パターン辞書８に格納されているセグ
メントコード列は、筆順のセグメントコード列と、文字
の構造上必ず存在するセグメントコード列と、続け字や
走り書きによる曲線部分として現れる可能性のあるセグ
メントコード列とである。セグメント間相対位置情報
は、二つの必ず存在するセグメント間の位置情報（セグ
メントの端点の位置関係、交差、Ｔ型接続など）であ
る。表２に例示した部分パターンに対する辞書番号ｎ５
についてみれば筆順は筆順１、２、３の第３図（ａ）、
（ｂ）および（ｃ）に示すように３種類がある。この部
分パターンの辞書に格納されている筆順数、筆順のデー
タおよび位置関係情報は表５に例示する如くである。The segment code sequence stored in the partial pattern dictionary 8 includes a segment code sequence in a stroke order, a segment code sequence which always exists in the character structure, and a segment which may appear as a curved portion formed by continuation characters or scribbles. And a code sequence. The inter-segment relative position information is position information between two indispensable segments (positional relationship of end points of segments, intersection, T-type connection, etc.). Dictionary number n5 for the partial pattern illustrated in Table 2
As for the stroke order, the stroke order is as shown in FIG.
There are three types as shown in (b) and (c). Table 5 shows the number of stroke orders, stroke order data and positional relationship information stored in the dictionary of the partial patterns.

【００３４】[0034]

【表５】 [Table 5]

【００３５】第３図（ａ）（ｂ）および（ｃ）における
（ィ）、（ロ）、（ハ）の符号はセグメントを示すと共
に書かれた順番を示し、破線は続けて書く場合に増える
可能性のあるセグメントである。表５の筆順１のデータ
の中における、右欄左側の〃２１３〃は筆順が（イ）、
（ロ）、（ハ）の順序であることを示し、かつ方向コー
ドもセグメント（イ）、（ロ）、（ハ）の方向コードで
あることを示している。In FIGS. 3 (a), (b) and (c), the symbols (a), (b) and (c) indicate segments and the order of writing, and the broken lines increase when writing is continued. A possible segment. In the data of stroke order 1 in Table 5, {213} on the left side of the right column indicates that the stroke order is (a),
(B) and (c), and the direction codes are also the direction codes of the segments (a), (b) and (c).

【００３６】表５における筆順２のデータ中における、
右欄左側の〃１０３２〃および筆順３のデータ中におけ
る、右欄左側の〃１０２０３〃も方向コードである。図
３において破線で示したように、〃０〃は方向コード０
のセグメントが増える可能性があるということを意味し
ており、表５においてアンダーラインをも付して示して
ある。しかし、図３（ａ）の筆順１において増える可能
性のある破線で示すセグメントは方向が７０°〜１８５
°の間にあるため、セグメント化によって除去されてい
る。In the data of stroke order 2 in Table 5,
{ 1 0 32} on the left side of the right column and { 1 0 0 2 0 3} on the left side of the right column in the data of the stroke order 3 are also direction codes. As indicated by the broken line in FIG. 3, {0} is the direction code 0.
This means that there is a possibility that the number of segments will increase, which is also shown in Table 5 with an underline. However, the segment indicated by a broken line that may increase in the stroke order 1 in FIG.
° and therefore have been removed by segmentation.

【００３７】これらの関係の一部を図４に示す。図４に
示す例は、部分パターン辞書の番号ｎ５において、
（イ）、（ロ）および（ハ）に示す形状のセグメントの
方向コードは破線部分も含めて〃１０２０３〃である。
これに対して入力文字は破線部分がない場合の方向コー
ドは〃１２３〃となって、対応することになる。FIG. 4 shows a part of these relationships. In the example shown in FIG. 4, at the number n5 of the partial pattern dictionary,
(A), (b) and the direction code of the segment of the shape shown in (c) is 〃1 0 2 0 3〃, including broken line portion.
On the other hand, when the input character has no broken line portion, the direction code is {123}, which corresponds to the input character.

【００３８】表５における右側欄の最左の方向コードが
部分パターンを構成するセグメントのコードを示してお
り、同右側欄の最左の方向コードに続く位置情報では１
番目の数字がセグメント番号であり、２番目の数字はセ
グメントの始点か終点情報であり、３番目のアルファベ
ットは位置関係情報であり、４番目の数字がセグメント
番号であり、５番目の数字がセグメントの始点か終点情
報である。３番目の位置関係を表すアルファベットは〃
Ｌ、Ｒ、Ｈ、Ｂ〃などがあり、それぞれ〃左、右、上、
下〃などの意味である。例えば、位置情報〃２１Ｌ１１
〃の意味は、セグメント（ロ）の始点がセグメント
（イ）の始点の左側にあることを示している。The leftmost direction code in the right column of Table 5 indicates the code of the segment constituting the partial pattern, and the position information following the leftmost direction code in the right column is 1
The second digit is the segment number, the second digit is the start or end point information of the segment, the third alphabet is the positional relationship information, the fourth digit is the segment number, and the fifth digit is the segment Is the start or end point information. The alphabet representing the third positional relationship is 〃
L, R, H, B}, etc., {left, right, top,
It means the following. For example, position information $ 21L11
The meaning of 〃 indicates that the starting point of the segment (b) is on the left side of the starting point of the segment (a).

【００３９】図５は本発明の実施の一形態にかかる手書
き文字認識装置における部分パターン照合部５の処理を
詳細に示したフローチャートである。FIG. 5 is a flowchart showing in detail the processing of the partial pattern collating unit 5 in the handwritten character recognition apparatus according to one embodiment of the present invention.

【００４０】以下、入力文字を〃員〃（楷書で１２セグ
メント）として、図１、図２、表１乃至表４を用いてフ
ローチャートの動作を説明する。Hereinafter, the operation of the flowchart will be described with reference to FIGS. 1 and 2 and Tables 1 to 4 assuming that the input character is {member} (12 segments in square writing).

【００４１】文字入力部１によって入力文字〃員〃が読
み取られ、前処理部２において前処理されて、セグメン
ト化処理部３において図２および表１に基づいてセグメ
ント化される。セグメント化の結果に基づく大分類部４
の処理で表４に示す候補文字に絞り込まれる。これらの
候補文字のペナルテイを記憶しているペナルテイバッフ
ァの内容がクリアされる。The input character {member} is read by the character input unit 1, preprocessed by the preprocessing unit 2, and segmented by the segmentation processing unit 3 based on FIG. Major classification unit 4 based on the segmentation result
Are narrowed down to the candidate characters shown in Table 4. The contents of the penalty buffer storing the penalties of these candidate characters are cleared.

【００４２】ついで、各候補文字の同じ順位にある部分
パターンの辞書番号がバッファに読み込まれる（ステッ
プ５１）。第一回目の場合、表４の最初の第１列の部分
パターン辞書番号〃ｎ１、ｎ１、ｎ４、ｎ４、ｎ１０、
ｎ１０〃が読み込まれる（ステップ５１）。ステップ５
２において読み込まれた部分パターン中に重複するもの
が除去されて、照合する部分パターン表が作成される
（ステップ５２）。この例の場合、第一回目の照合する
部分パターンが〃ｎ１、ｎ４、ｎ１０〃になる。Next, the dictionary numbers of the partial patterns in the same order of the candidate characters are read into the buffer (step 51). In the first case, the partial pattern dictionary numbers {n1, n1, n4, n4, n10,
n10} is read (step 51). Step 5
Duplicate ones of the partial patterns read in 2 are removed, and a partial pattern table to be collated is created (step 52). In this example, the first partial pattern to be collated is {n1, n4, n10}.

【００４３】照合する部分パターン表から順番に部分パ
ターンの辞書データが読み込まれる（ステップ５３）。
この例の場合には第一回目は〃ｎ１〃の辞書データが読
み込まれる。続いて部分パターン辞書のセグメントの方
向、位置と、入力文字のセグメントの方向、位置との照
合が行われて、ペナルティが算出される（ステップ５
４）。ついで、部分パターン辞書のセグメントのすべて
との照合が終了したか否かがチェックされる（ステップ
５５）。ステップ５５において終了していないと判別さ
れたときは、ステップ５４の処理が繰り返される。The dictionary data of the partial patterns is read in order from the partial pattern table to be collated (step 53).
In the case of this example, the dictionary data of {n1} is read for the first time. Subsequently, the direction and position of the segment of the partial pattern dictionary are compared with the direction and position of the segment of the input character, and a penalty is calculated (step 5).
4). Next, it is checked whether or not the comparison with all the segments of the partial pattern dictionary has been completed (step 55). If it is determined in step 55 that the process has not been completed, the process of step 54 is repeated.

【００４４】ステップ５５において終了したと判定され
た場合は、これまで照合処理済み入力文字のセグメント
数が、照合する部分パターンを持つ候補文字ごとに保存
される（ステップ５６）。ついで、照合する部分パター
ン表に未照合の部分パターンがあるか否かがチェックさ
れる（ステップ５７）。ステップ５７において未照合の
部分パターンがあると判別された場合、ステップ５３〜
ステップ５６の処理が繰り返して実行される。ステップ
５７において未照合の部分パターンがないと判別された
場合、ペナルティに基づき候補文字の再絞り込みの処理
が行われる（ステップ５８）。If it is determined in step 55 that the process has been completed, the number of segments of the input characters that have been subjected to the collation processing is stored for each candidate character having a partial pattern to be collated (step 56). Then, it is checked whether or not there is an unmatched partial pattern in the partial pattern table to be matched (step 57). If it is determined in step 57 that there is an unmatched partial pattern, steps 53 to 53
The process of step 56 is repeatedly executed. If it is determined in step 57 that there is no unmatched partial pattern, the process of re-selecting candidate characters is performed based on the penalty (step 58).

【００４５】この例の場合には、入力文字が〃員〃であ
るため、部分パターン〃口〃を持つ候補文字のペナルテ
ィが一番小さく、部分パターン〃米〃を持つ候補文字の
ペナルティが一番大きいと考えられる。ペナルティの閾
値処理で、部分パターン〃米〃を持つ候補文字が候補か
ら除去される。この時点で残された候補文字が〃員、
唄、消、浸〃になる。In this example, since the input character is {member}, the penalty of the candidate character having the partial pattern {mouth} is the smallest, and the penalty of the candidate character having the partial pattern {US} is the least. Considered large. In the penalty threshold processing, candidate characters having the partial pattern {US} are removed from the candidates. The candidate characters left at this point are members,
Singing, disappearing, immersing.

【００４６】残された候補文字に照合していない部分パ
ターンがあるか否かがチェックされる（ステップ５
９）。ステップ５９において照合していない部分パター
ンがあると判別された場合、ステップ５１〜ステップ５
８までの処理が繰り返される。この例の場合、２回目候
補文字から読み込まれる部分パターン辞書番号は表４の
第２列の〃ｎ２、ｎ２、ｎ５、ｎ７〃になる。この場合
重複するものは〃ｎ２〃であって、１つの〃ｎ２〃が除
去されて照合される部分パターンは〃ｎ２、ｎ５、ｎ７
〃となる。ステップ５９において照合していない部分パ
ターンがないと判別されたときは、部分パターン位置照
合部６の処理が実行される。It is checked whether or not there is a partial pattern that has not been collated with the remaining candidate characters (step 5).
9). If it is determined in step 59 that there is a partial pattern that has not been collated, steps 51 to 5
The processing up to 8 is repeated. In this example, the partial pattern dictionary number read from the second candidate character is {n2, n2, n5, n7} in the second column of Table 4. In this case, the overlapping pattern is {n2}, and partial patterns to be collated by removing one {n2} are {n2, n5, n7}
It becomes 〃. If it is determined in step 59 that there is no unmatched partial pattern, the process of the partial pattern position matching unit 6 is executed.

【００４７】表６は部分パターンの辞書セグメント（入
力文字のセグメントに対し辞書側のセグメントを辞書セ
グメントとも記す）と入力文字のセグメントの方向を照
合するとき利用するペナルティ表の一例である。Table 6 is an example of a penalty table used when collating the dictionary segment of the partial pattern (the dictionary side segment is also referred to as a dictionary segment with respect to the input character segment) and the direction of the input character segment.

【００４８】[0048]

【表６】 [Table 6]

【００４９】現在照合している辞書セグメントｉのコー
ドはｄｃｉで、入力文字セグメントｊのコードはｉｃｊ
とする。照合している部分パターンを持つ候補文字の現
在までの累積ペナルティをＰｍとする。ｍが候補文字の
番号を表す。コードｄｃｉには〃必ず存在する〃と、〃
現れる可能性がある〃２つのタイプがある。コードｄｃ
ｉのタイプによって照合の方法が異ならせてある。次に
説明する。The code of the dictionary segment i currently being collated is dci, and the code of the input character segment j is icj
And The accumulated penalty up to the present of the candidate character having the collating partial pattern is defined as Pm. m represents the number of the candidate character. In code dci, {it always exists},
There are two types that can appear: Code dc
The matching method differs depending on the type of i. Next, a description will be given.

【００５０】（ａ）コードｄｃｉが〃必ず存在する〃場
合で、（ａ１）ｄｃｉ＝ｉｃｊのときについて説明す
る。（イ）セグメントｉとセグメントｊの位置が同じと
き、セグメントｉとｊと対応させ、Ｐｍを変更しない。
（ロ）セグメントｉとセグメントｊの位置が同じではな
く、ｄｃｉ＝ｉｃｊ＋１、かつセグメントｉとセグメン
ト(ｊ＋１)の位置が同じとき、ｉとｊ＋１とを対応さ
せ、ｉｃｊを飛ばす。但し、ｉｃｊを飛ばしたペナルテ
ィとして、Ｐｍに１を加算する。（ハ）セグメントｉと
セグメントｊの位置が同じではなく、ｄｃｉ＋１＝ｉｃ
ｊ、かつセグメントｉ＋１とセグメントｊの位置が同じ
とき、ｉ＋１とｊとを対応させ、ｉｃｉを飛ばす。但
し、ｉｃｉを飛ばしたペナルティとして、Ｐｍに１を加
算する。（ニ）以上のほか、セグメントｉとセグメント
ｊとを対応させ、位置が合わないときペナルティとし
て、Ｐｍに１を加算する。The case where (a) the code dci is {always present} and (a1) dci = icj will be described. (A) When the positions of the segment i and the segment j are the same, the segments i and j are made to correspond to each other, and Pm is not changed.
(B) When the positions of the segment i and the segment j are not the same, dci = icj + 1, and the position of the segment i is the same as the position of the segment (j + 1), i and j + 1 are made to correspond and icj is skipped. However, 1 is added to Pm as a penalty for skipping icj. (C) The positions of the segment i and the segment j are not the same, and dci + 1 = ic
j, and when the position of the segment i + 1 is the same as that of the segment j, the i + 1 is made to correspond to the j and the ici is skipped. However, 1 is added to Pm as a penalty for skipping ici. (D) In addition to the above, the segment i and the segment j are associated with each other, and when the positions do not match, 1 is added to Pm as a penalty.

【００５１】次に、（ａ）ｄｃｉが〃必ず存在する〃場
合で、（ａ２）ｄｃｉ≠ｉｃｊのときについて説明す
る。（イ）ｄｃｉ＝ｉｃｊ＋１、かつ、セグメントｉと
セグメントｊ＋１の位置が同じとき、ｉとｊ＋１とを対
応させ、ｉｃｊを飛ばす。但し、ｉｃｊを飛ばしたペナ
ルティとして、Ｐｍに１を加算する。（ロ）ｄｃｉ＋１
＝ｉｃｊ、かつ、セグメントｉ＋１とセグメントｊの位
置が同じとき、ｉ＋１とｊとを対応させ、ｉｃｉを飛ば
す。但し、ｉｃｉを飛ばしたペナルティとして、Ｐｍに
１を加算する。（ハ）以上のほか、ｄｃｉとｉｃｊを対
応させ、表６に基づいてＰｍにペナルティを加算する。Next, the case where (a) dci is {always present} and (a2) dci @ icj will be described. (A) When dci = icj + 1 and the position of segment i is the same as that of segment j + 1, i and j + 1 are made to correspond and icj is skipped. However, 1 is added to Pm as a penalty for skipping icj. (B) dci + 1
= Icj, and when the position of segment i + 1 is the same as that of segment j, i + 1 is made to correspond to j, and ici is skipped. However, 1 is added to Pm as a penalty for skipping ici. (C) In addition to the above, dci and icj are associated with each other, and a penalty is added to Pm based on Table 6.

【００５２】次に（ｂ）コードｄｃｉが〃現れる可能性
がある〃場合で、（ｂ１）ｄｃｉ＝ｉｃｊ、ｄｃｉ＋１
＝ｉｃｊ＋１、かつ、セグメントｉ＋１とセグメントｊ
＋１の位置が同じとき、ｄｃｉとｉｃｊとを対応させ
る。（ｂ２）以上のほか、入力文字にｄｃｉと対応する
セグメントが増えていないとして、ｄｃｉを飛ばし、Ｐ
ｍを変更しない。Next, in the case (b) where the code dci is {possible to appear}, (b1) dci = icj, dci + 1
= Icj + 1 and segment i + 1 and segment j
When the position of +1 is the same, dci and icj are associated. (B2) In addition to the above, it is determined that the number of segments corresponding to dci in the input characters has not increased, and
Do not change m.

【００５３】次に、図４によって部分パターン辞書と入
力文字とを照合する例をペナルテイと関連して説明す
る。Next, an example of collating a partial pattern dictionary with input characters will be described with reference to FIG. 4 in relation to penalties.

【００５４】例えば、入力文字〃消〃の部分パターンｎ
５が楷書で書かれた場合と続けて書かれた場合を説明す
る。筆順は共に図３（ｃ）と同じであるとする。ｎ５が
楷書で書かれた場合のセグメント方向コードは〃１２３
〃であり、続けて書かれた場合のセグメント方向コード
は〃１０２０３〃である。For example, the partial pattern n of the input character {OFF}
The case where 5 is written in square style and the case where it is written continuously will be described. The stroke order is assumed to be the same as that of FIG. The segment direction code when n5 is written in square style is $ 123
}, And the segment direction code when written continuously is {10203}.

【００５５】先ず、楷書で書かれた部分パターンｎ５と
部分パターン辞書８の部分パターンｎ５との照合を説明
する。部分パターン辞書８の部分パターンｎ５の筆順数
は３つあるので、入力文字の部分パターンｎ５は、部分
パターン辞書における筆順１、筆順２、筆順３と順番に
照合していくが、筆順１と筆順２とは合わないので、照
合の結果のペナルテイは０ではないと考えられる。First, the collation between the partial pattern n5 written in the standard style and the partial pattern n5 of the partial pattern dictionary 8 will be described. Since the number of stroke orders of the partial pattern n5 of the partial pattern dictionary 8 is three, the partial pattern n5 of the input character is collated in order with the stroke order 1, the stroke order 2, and the stroke order 3 in the partial pattern dictionary. Since it does not match 2, it is considered that the penalty of the collation result is not 0.

【００５６】ここで、筆順３との照合について詳細に説
明する。部分パターンｎ５の部分パターン辞書筆順３の
セグメント方向コードは〃１０２０３〃であるため、ｄ
ｃ１＝１、ｄｃ２＝０、ｄｃ３＝２、ｄｃ４＝０、ｄｃ
５＝３である。一方、入力部分パターンｎ５のセグメン
ト方向コードは〃１２３〃であるため、ｉｃ１＝１、ｉ
ｃ２＝２、ｉｃ３＝３である。Here, the collation with the stroke order 3 will be described in detail. Since the segment direction code of the partial pattern dictionary stroke order 3 of the partial pattern n5 is { 1 0 2 0 3 }, d
c1 = 1, dc2 = 0 , dc3 = 2, dc4 = 0 , dc
5 = 3. On the other hand, since the segment direction code of the input partial pattern n5 is {123}, ic1 = 1, i
c2 = 2 and ic3 = 3.

【００５７】部分パターン辞書のｄｃ１＝１は必ず存在
するセグメントであり、しかもｄｃ１＝ｉｃ１、両セグ
メント位置も同じとして、図４の矢印に示すように辞書
セグメント１は入力セグメント１と対応させ、ペナルテ
イを付けない。Assuming that dc1 = 1 in the partial pattern dictionary is a segment that always exists, dc1 = ic1, and that both segment positions are the same, dictionary segment 1 is made to correspond to input segment 1 as shown by the arrow in FIG. Do not attach.

【００５８】部分パターン辞書のｄｃ２＝０は現れる可
能性があるセグメントであり、しかもｄｃ２≠ｉｃ２で
あり、入力文字にｄｃ２と対応するセグメントが現れて
いないとして、ｄｃ２を飛ばす。以上と同様に、ｄｃ３
とｉｃ２と対応させ、ｄｃ４＝０を飛ばしてｄｃ５とｉ
ｃ３とを対応させることになる。結果として、ペナルテ
イは０になる。Dc2 = 0 in the partial pattern dictionary is a segment that may appear, and dc2 ≠ ic2, and dc2 is skipped on the assumption that no segment corresponding to dc2 appears in the input character. As above, dc3
And ic2, skip dc4 = 0 , and dc5 and i
and c3. As a result, the penalty is zero.

【００５９】次に、続けて書かれた部分パターンｎ５と
部分パターン辞書の部分パターンｎ５との照合を説明す
る。この場合も、部分パターン辞書の部分パターンｎ５
の筆順１と筆順２との照合の説明を省略し、筆順３との
照合だけについて説明する。続けて書かれた入力文字の
部分パターンｎ５のセグメント方向コードは〃１０２０
３〃であるため、ｉｃ１＝１、ｉｃ２＝０、ｉｃ３＝
２、ｉｃ４＝０、ｉｃ５＝３である。Next, the collation of the subsequently written partial pattern n5 with the partial pattern n5 of the partial pattern dictionary will be described. Also in this case, the partial pattern n5 of the partial pattern dictionary
The description of the comparison between the stroke order 1 and the stroke order 2 will be omitted, and only the comparison with the stroke order 3 will be described. The segment direction code of the input character partial pattern n5 is $ 1020
3〃, ic1 = 1, ic2 = 0, ic3 =
2, ic4 = 0 and ic5 = 3.

【００６０】部分パターン辞書のｄｃ１＝１は必ず存在
するセグメントであり、しかもｄｃ１＝ｉｃ１、両セグ
メント位置も同じとして、図４の矢印に示すように、辞
書セグメント１は入力セグメント１と対応させ、ペナル
テイを付けない。Assuming that dc1 = 1 in the partial pattern dictionary is a segment that always exists, dc1 = ic1, and that both segment positions are the same, dictionary segment 1 is made to correspond to input segment 1 as shown by the arrow in FIG. Do not penalize.

【００６１】部分パターン辞書のｄｃ２＝０は現れる可
能性があるセグメントであり、しかもｄｃ２＝ｉｃ２、
ｄｃ３＝ｉｃ３、かつ部分パターン辞書ｎ５のセグメン
ト３と入力されたｎ５のセグメント３の位置も同じであ
るため、ｉｃ２が続け字によって現れたセグメントとし
て、ｄｃ２と対応させる。ペナルテイは付けない。以上
と同様にして、ｄｃ３とｉｃ３、ｄｃ４とｉｃ４、ｄｃ
５とｉｃ５とを対応させることになる。結果としてペナ
ルテイは０となる。Dc2 = 0 in the partial pattern dictionary is a segment that may appear, and dc2 = ic2,
Since dc3 = ic3 and the segment 3 of the partial pattern dictionary n5 and the position of the input segment 3 of n5 are also the same, ic2 is made to correspond to dc2 as a segment that appears as a continuation character. No penalty will be charged. Similarly, dc3 and ic3, dc4 and ic4, dc
5 and ic5 correspond to each other. As a result, the penalty becomes zero.

【００６２】上記の処理は、入力文字を形成する部分パ
ターンと、部分パターン辞書８に記憶されている文字構
成上必ず存在するセグメントと続け字や走り書きの曲線
部として現れる可能性のあるセグメントとからなる部分
パターンとを照合し、文字構成上必ず存在するセグメン
トと入力文字を形成する部分パターン中の対応する部分
パターンとの差異を求めることと等価である。The above-described processing is carried out on the basis of a partial pattern forming an input character, a segment which always exists in the character configuration stored in the partial pattern dictionary 8 and a segment which may appear as a continuous character or a scribble curve. This is equivalent to collating a partial pattern with a certain partial pattern and determining a difference between a segment that always exists in the character configuration and a corresponding partial pattern in a partial pattern forming an input character.

【００６３】以上のようにしてステップ５８が実行され
てペナルテイに基づく候補文字の絞りこみが行われて、
ステップ５９において残った候補文字に部分パターンが
残っていないと判別されたときは、部分パターン位置照
合部６によって部分パターン位置が照合されて、ペナル
テイの値と総合された入力文字の認識文字が最終的に決
定される。As described above, the step 58 is executed to narrow down the candidate characters based on the penalty.
If it is determined in step 59 that there is no partial pattern remaining in the remaining candidate characters, the partial pattern position matching unit 6 checks the partial pattern position, and the final recognized character of the input character integrated with the penalty value is determined. Is determined.

【００６４】[0064]

【発明の効果】以上説明したように本発明にかかる手書
き文字認識装置によれば、手書き文字認識処理におい
て、大分類情報によって候補文字を絞り、さらに該候補
文字を部分パターンとの照合に基づく結果によってさら
に絞って文字認識を行うために、文字認識が高速に行わ
れるという効果が得られる。As described above, according to the handwritten character recognition apparatus according to the present invention, in the handwritten character recognition processing, the candidate characters are narrowed down by the large classification information, and the candidate characters are compared with the partial patterns. As a result, character recognition is performed more narrowly, so that the effect that character recognition is performed at high speed is obtained.

【００６５】また本発明にかかる手書き文字認識装置に
よれば、部分パターンをセグメントで形成し、かつ文字
の構造上必ず存在するセグメント以外に、続け字や走り
書きの曲線部として現れる可能性のあるセグメントも含
め、入力文字の部分パターンと照合して文字認識を行う
ために、続け字等にも対応することができるという効果
が得られる。Further, according to the handwritten character recognition device of the present invention, a segment which forms a partial pattern by a segment and which may appear as a curved portion of a continuation character or a scribble in addition to a segment which is always present in the structure of a character. Since the character recognition is performed by collating with the partial pattern of the input character, it is possible to obtain the effect of being able to cope with continuous characters and the like.

[Brief description of the drawings]

【図１】本発明の実施の一形態にかかる手書き文字認識
装置の構成を機能的に示すブロック図である。FIG. 1 is a block diagram functionally showing a configuration of a handwritten character recognition device according to an embodiment of the present invention.

【図２】本発明の実施の一形態にかかる手書き文字認識
装置におけるセグメントの方向とセグメントコードとの
説明図である。FIG. 2 is an explanatory diagram of a segment direction and a segment code in the handwritten character recognition device according to one embodiment of the present invention.

【図３】本発明の実施の一形態にかかる手書き文字認識
装置における部分パターンの筆順例の説明図である。FIG. 3 is an explanatory diagram of a stroke order example of a partial pattern in the handwritten character recognition device according to the embodiment of the present invention;

【図４】本発明の実施の一形態にかかる手書き文字認識
装置における部分パターンの辞書セグメントと入力文字
のセグメントとの照合の説明図である。FIG. 4 is an explanatory diagram of collation between a dictionary segment of a partial pattern and a segment of an input character in the handwritten character recognition device according to the embodiment of the present invention;

【図５】本発明の実施の一形態にかかる手書き文字認識
装置の作用の説明に供するフローチャートである。FIG. 5 is a flowchart for explaining the operation of the handwritten character recognition device according to the embodiment of the present invention;

[Explanation of symbols]

１文字入力部２前処理部３セグメント化処理部４大分類部５部分パターン照合部６部分パターン位置照合部７認識結果出力部８部分パターン辞書９文字辞書 Reference Signs List 1 character input unit 2 preprocessing unit 3 segmentation processing unit 4 major classification unit 5 partial pattern matching unit 6 partial pattern position matching unit 7 recognition result output unit 8 partial pattern dictionary 9 character dictionary

───────────────────────────────────────────────────── フロントページの続き (72)発明者大内直人東京都荒川区西日暮里１−62−24 ディアハイム西日暮里401 (72)発明者胡金玲東京都渋谷区道玄坂１丁目14番６号株式会社ケンウッド内 (56)参考文献特開昭62−229384（ＪＰ，Ａ) 特開平９−198466（ＪＰ，Ａ) 特開平２−266485（ＪＰ，Ａ) 特開昭62−271087（ＪＰ，Ａ) 戸田匡紀大内直人胡金玲馬籠良英窪田忠弘，セグメント分析によるオンライン文字認識，情報処理学会第55 回（平成９年後期）全国大会講演論文集（２），日本，情報処理学会，1997年９月24日，第55回（２），ｐ．２− 194〜195 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/62 ＪＩＣＳＴファイル（ＪＯＩＳ)──────────────────────────────────────────────────続き Continued on the front page (72) Naoto Ouchi, Inventor 401-62-24 Nishi-Nippori, Arakawa-ku, Tokyo 401 Di Aheim Nishi-Nippori 401 (72) Inventor Hu Jin-ling 1-14-6 Dogenzaka, Shibuya-ku, Tokyo (56) References JP-A-62-2229384 (JP, A) JP-A-9-198466 (JP, A) JP-A-2-266485 (JP, A) JP-A-62-271087 (JP) , A) Masanori Toda, Naoto Ouchi, Rei Hukin, Yoshihide Magago, Tadahiro Kubota, Online Character Recognition by Segment Analysis, IPSJ 55th Annual Meeting (2nd half of 1997) Proceedings (2), Japan, IPSJ , September 24, 1997, 55th (2), p. 2-194 to 195 (58) Field surveyed (Int. Cl. ⁷ , DB name) G06K 9/62 JICST file (JOIS)

Claims

(57) [Claims]

1. A storage means for storing, for a plurality of characters, a partial pattern forming a character having the same large classification information as the large classification information for performing a large classification of characters, and input characters read in a stroke order. Selection means for selecting a character having the same large classification information as the large classification information from the characters stored in the storage means as a candidate character; forming a partial pattern forming an input character and forming a candidate character stored in the storage means A matching means for sequentially and only once matching a corresponding partial pattern in a partial pattern to be compared, and a difference between a partial pattern to be matched and a partial pattern forming an input character for each matching, and the difference is set to a predetermined threshold value. Candidate character selecting means for removing candidate characters having a partial pattern to be compared exceeding from the candidate characters selected by the selecting means. Recognizing the input character on the basis of the remaining candidate characters Te handwritten character recognition apparatus according to claim.

2. The handwritten character recognition apparatus according to claim 1, wherein the large classification information includes information on the number of segments forming the character and information on a type of a predetermined segment direction.

3. A handwritten character recognition apparatus according to claim 1, wherein the partial pattern is composed of a segment, a radical pattern forming a character, a bystander pattern, a pattern forming part of a radical, and a part of bystander. A handwritten character recognition device characterized by including a pattern forming a character.

4. A handwritten character recognizing device according to claim 1, wherein the collated partial patterns collated by the collating means are stored in the storage means in the same order as the partial patterns in the stroke order forming the input characters. A handwritten character recognition device, which is a character partial pattern.

5. The handwritten character recognition apparatus according to claim 1, wherein the partial pattern stored in the storage means is composed of segments, and may appear as a continuous character or a scribble curve in addition to the segments which are always present in the character configuration. Pattern, which includes a segment that can be input, and a partial pattern consisting of a segment that always exists in the character configuration stored in the storage means and a segment that may appear as a continuation character or a scribble curve portion A handwritten character recognition apparatus for determining a difference between a segment necessarily present in a character configuration and a corresponding partial pattern in a partial pattern forming an input character.