JPS61117680A

JPS61117680A - Generating system of dictionary for recognition of hand-written character

Info

Publication number: JPS61117680A
Application number: JP59239693A
Authority: JP
Inventors: Tatsuo Kasahara; 笠原　龍夫
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1984-11-14
Filing date: 1984-11-14
Publication date: 1986-06-05

Abstract

PURPOSE:To obtain a dictionary for recognition of hand-written characters which is suitable for a user, by synthesizing deformed patterns of characters and selecting the character pattern resembling hand-written characters of the user from them the registering its feature vectors in the dictionary for recognition of hand-written characters. CONSTITUTION:In the learning mode operation, a dictionary retrieving part 11 retrieves the dictionary similarly to the normal mode. If the recognition result is rejected, the operation is performed as follows; a control part 7 displays an input character pattern and a message to an operator on a display device 6, and the operator discriminates a character in accordance with the displayed input pattern and inputs its character code from a key input device 3. This character code is sent to a recognizing part 4 and is given to a dictionary register part 13 through the dictionary retrieving part 11. If there is one candidate character for the input character pattern and the recognition result is rejected because the distance between its feature vectors exceeds the allowable error range of the candidate character, the dictionary register part 13 combines feature vectors of the input character pattern with the key-inputted character code, etc. and registers additionally the result in a dictionary memory 14.

Description

【発明の詳細な説明】〔技術分野〕本発明は下書き文字を認識可能な文学誌５１！装置に関
し１．詳しくは、そのような文字認識装置における個々
のユーザに最適な手７き文字認識用辞書を作成する方式
に関する。[Detailed Description of the Invention] [Technical Field] The present invention provides a literary magazine 51 that can recognize draft characters! Regarding the device 1. Specifically, the present invention relates to a method for creating a manual character recognition dictionary optimal for each user in such a character recognition device.

[Prior art]

手書き文字を認識するための文字認識装置においては、
ユーザに最適な辞書を辞書メモリにロードする必要があ
る。そこで従来は、（１）メーカ側で標準辞書から特定
のユーザに最適と思われる標準を作成し１、ニードはそ
の辞書を辞書メモリにロードして用いる方法、（２）多
様な変形文字を印刷したシートを容易しておき、ユーザ
はその中から類似していると思われるシートを選択し１
、選択したシートに対応する辞書を振巾辞書から選択し
て辞書メモリにロードする方法、または（３）ユーザが
学習シートに手書き文字を記入し１．それを文字認、１
！装置に読み取１）せて学習させ、標へ１（辞１１トか
Ｃ）ユーザに適した必要最少限の辞詩を辞非メモリにロ
ードする方法が採用されている。In character recognition devices for recognizing handwritten characters,
It is necessary to load the most suitable dictionary for the user into the dictionary memory. Conventionally, the following methods were used: (1) the manufacturer created a standard that was considered optimal for a specific user from a standard dictionary (1) and then the needs loaded the dictionary into dictionary memory and used it, (2) printed a variety of modified characters. The user selects a sheet that seems similar from among them.
(3) A method in which the user writes handwritten characters on the learning sheet and (1) selects a dictionary corresponding to the selected sheet from the dictionary and loads it into the dictionary memory; Recognize it literally, 1
! A method is adopted in which the device is made to read (1) and learn, and the minimum necessary poems suitable for the user are loaded into the reading memory.

し、かじ、阿、ｔＬの方法であっても、多様な変形文字
にケ、を応できるような大規模な標べＩτ辞１；・を容
５する必要がある。そして、標準辞書を作成するには。However, even with the ``Kaji'', ``A'', and ``tL'' methods, it is necessary to create a large-scale marking Iτ dictionary 1;・ that can accommodate a variety of modified characters. And to create a standard dictionary.

多数の人の手書き文字を集め、それを計算機に入力し、
て処理する必要があり、草大な時間と人手がかかってい
る。このため、辞書作成コストがＬ昇し、またユーザの
ニーズに迅速に吋応できなかった。また、必ずし、もユ
ーザに最適な辞書を得られなかった。Collect handwritten letters from many people and enter them into a calculator.
This requires a huge amount of time and manpower. For this reason, the cost of creating a dictionary increased by L, and it was not possible to respond quickly to user needs. Furthermore, it is not always possible to obtain a dictionary that is most suitable for the user.

〔the purpose〕

本発明は上記実情に鑑みてなされたもので、その目的は
、大規模な標準辞書を用意し、なくても、個々のユーザ
に適した手書き文字認識用辞書を迅速かつ筒中に作成す
る方式を提供することにある。The present invention has been made in view of the above circumstances, and its purpose is to provide a method for quickly creating a dictionary for handwritten character recognition suitable for each user without having to prepare a large-scale standard dictionary. It is about providing.

〔composition〕

上記目的を達成するために１本発明は、ｆ、ＩＦき文字
を認識可能で、かつ入力文字パターンの特徴ベクトルを
抽出して手書き文字！！識用辞書に登録する装置を備え
る文字認識装置において２文字合成装置を用いて文字の
変形したパターンを合成し５その中からユーザの手’ｔ
ｅ！文字に似た文字パターンを選んで入力することによ
り、その文字パターンの特徴ベクトルを該手書き文字！
！識用辞書に登録させることにより１個々のユーザに最
適な手書き文字認識用辞書を作成することを特徴とする
ものである。In order to achieve the above object, the present invention is capable of recognizing f, IF characters, and extracting feature vectors of input character patterns to create handwritten characters! ! A character recognition device equipped with a device for registering characters in a common dictionary uses a two-character synthesizer to synthesize transformed patterns of characters.
e! By selecting and inputting a character pattern similar to a character, the feature vector of that character pattern can be applied to the handwritten character!
! This feature is characterized in that a handwritten character recognition dictionary optimal for each user is created by registering the handwritten character in a common dictionary.

以下２図面を参照し１本発明の実施例について説明する
。An embodiment of the present invention will be described below with reference to two drawings.

第１図は１本発明の一実施例に係るｆ、ｖき文字認識装
置の概略ブロック図である。ここに示す手書き文字認識
装置は、文字パターンの入力手段とし、て、原稿などを
光学的に走査して文字パターンを読み取り、その２値化
１文字切り出し、正規化。FIG. 1 is a schematic block diagram of an f, v character recognition device according to an embodiment of the present invention. The handwritten character recognition device shown here uses a character pattern input means to optically scan a document, read the character pattern, binarize it, cut out one character, and normalize it.

ノイズ除去などの前処理を行う文字読取部１の他に１手
書き文字認識用辞書を作成する場合に文字パターンを合
成して入力する文字合成部２を備えている。さらに、オ
ペレータ入力に用いられるキー人力装置！３も備えてい
る。４は認識部であり。In addition to a character reading section 1 that performs preprocessing such as noise removal, it is provided with a character synthesis section 2 that synthesizes and inputs character patterns when creating a dictionary for handwritten character recognition. Additionally, key human power devices used for operator input! It also has 3. 4 is a recognition part.

人力文字パターンから特徴ベクトルを抽出し、手書き文
字認識用辞書の特徴ベクトルと比較して文字認識を行っ
たり、辞書への文字追加登録（後述するように手書き文
字！！識用辞書の新規作成および更新を含む）を行う部
分である。５は各種辞書や１文字認識結果を格納するた
めの外部記憶装置である。６は文字認識結果１文字パタ
ーン、オペレータへのメツセージなどを表示するための
表示装置である。７は前記各部のシーケンス制御やデー
タ転送制御などを行う制御部であり、これはシステムバ
ス８を介して他の部分と接続されている。Extract feature vectors from human character patterns and compare them with feature vectors in handwritten character recognition dictionaries to perform character recognition, register additional characters in the dictionary (as described later, create new handwritten character dictionaries, and (including updates). 5 is an external storage device for storing various dictionaries and single character recognition results. Reference numeral 6 denotes a display device for displaying character recognition results, single character patterns, messages to the operator, and the like. Reference numeral 7 denotes a control section that performs sequence control and data transfer control of each section, and is connected to other sections via a system bus 8.

認識部１１の構成を第２図より説明する。図示のように
認識部４は、特徴抽出部１０、辞書検索部１１、辞１！
登録部１３および辞書メモリ１４からなる。本実施例の
手書き文字認識装置は１通常モード、学習モード、また
は辞書作成モードのいずれかで動作可能であり、キー人
力装置３を通じてオペレータが指定できる。The configuration of the recognition unit 11 will be explained with reference to FIG. As shown in the figure, the recognition unit 4 includes a feature extraction unit 10, a dictionary search unit 11, and a dictionary 1!
It consists of a registration section 13 and a dictionary memory 14. The handwritten character recognition device of this embodiment can operate in one of the normal mode, learning mode, and dictionary creation mode, which can be designated by the operator through the key manual device 3.

まず、通常モード時における認識部４の動作を説明する
。特徴抽出部１０は、入力文字パターンの特徴ベクトル
ｖｐを抽出する。辞書メモリ１４には、制御部７の制御
により、外部記憶装置５から手書き文字認識用辞書がロ
ードされている。辞書検索部１１は、入力文字パターン
の特徴ベクトルＶｐと、辞７Ｆの特徴ベクトルＶｄとを
順次比較し、特徴ベクトル間距離が最小の候補文字を検
索し、その文字コードを認識結果とし、てシステムバス
８へ出力する。ただし、最小の特徴ベクトル間距離がそ
の候補文字の許容誤差範囲（特徴ベクトルに付加されて
いる）を越えている場合、または距離が許容誤差範囲内
となる候補文字が２つ以ヒある場合、入力文字パターン
はリジェクトされ、辞書検索部１１はリジェクト・コー
ドを出力する。First, the operation of the recognition unit 4 in the normal mode will be explained. The feature extraction unit 10 extracts a feature vector vp of the input character pattern. A handwritten character recognition dictionary is loaded into the dictionary memory 14 from the external storage device 5 under the control of the control unit 7 . The dictionary search unit 11 sequentially compares the feature vector Vp of the input character pattern with the feature vector Vd of the dictionary 7F, searches for a candidate character with the minimum distance between the feature vectors, uses the character code as a recognition result, and uses the character code in the system. Output to bus 8. However, if the minimum distance between feature vectors exceeds the tolerance range (added to the feature vector) for that candidate character, or if there are two or more candidate characters whose distances are within the tolerance range, The input character pattern is rejected, and the dictionary search section 11 outputs a reject code.

認識結果は制御部７の制御により外部記憶装Ｐ５へ格納
される。The recognition result is stored in the external storage device P5 under the control of the control section 7.

学習モード動作においては、辞書検索部１１は通常モー
ド時に同様に辞書検索を行い、候補文字を検索し、その
結果をシステムバス８へ出力する。In the learning mode operation, the dictionary search unit 11 performs a dictionary search in the same way as in the normal mode, searches for candidate characters, and outputs the results to the system bus 8.

この認識結果がリジェクトとなった場合は１通常モード
と異なり次のように動作する。まず制御部７は、入力文
字パターンとオペレータへのメツセージを表示装置ｉ！
６に表示させる。オペレータは。If the recognition result is rejected, the operation is as follows, unlike the 1 normal mode. First, the control unit 7 displays the input character pattern and a message to the operator on the display device i!
6. The operator is.

表示された入カバターンから文字を判断し、その文字コ
ードをキー人力装置３から入力する。この文字コードは
制御部７の制御により認識部４へ送られ、辞書検索部１
１を介して辞Ｖ登録部１３へ与えられる。入力文字パタ
ーンに対する候補文字が１つで、その特徴ベクトル間距
離が、その候補文字の許容誤差範囲（候補文字の特徴ベ
クトルに付加されている）を越えているためにリジェク
トさ九た資金（これは辞書検索部１１から辞ＩＦ登録部
１３へ報告される）、辞ＩＦ登録部１３は、入力文字パ
ターンの特徴ベクトルを、キー人力された文字コード、
および標準の許容誤差範囲Δ０と組合わせ、辞書（辞書
メモリ１４内）に追加登録し。The character is determined from the displayed input cover pattern, and the character code is inputted from the key input device 3. This character code is sent to the recognition unit 4 under the control of the control unit 7, and is sent to the dictionary search unit 1.
1 to the letter V registration unit 13. There is one candidate character for the input character pattern, and the distance between its feature vectors exceeds the tolerance range (added to the feature vector of the candidate character) for that candidate character, so it is rejected. is reported from the dictionary search unit 11 to the dictionary IF registration unit 13), and the dictionary IF registration unit 13 converts the feature vector of the input character pattern into the character code manually entered by the key,
and the standard tolerance range Δ0, and are additionally registered in the dictionary (in the dictionary memory 14).

その旨を辞書検索部１１を経由してシステムバス８へ送
出する。この登録通知は、制御部７の制御により表示装
置６に表示される。一方、入力文字パターンとの距離が
それぞれの許容誤差範囲内の候補文字が２つ以上あるた
めにリジェクトされた場合、辞書登録部１３は、候補文
字中の入力文字パターンとの距離が最小の候補文字を選
び、その距離Δ′を許容最小値ΔＬと比較する。そして
。A message to that effect is sent to the system bus 8 via the dictionary search section 11. This registration notification is displayed on the display device 6 under the control of the control unit 7. On the other hand, if the candidate character is rejected because there are two or more candidate characters whose distances from the input character pattern are within the respective allowable error ranges, the dictionary registration unit 13 selects the candidate character whose distance from the input character pattern is the shortest among the candidate characters. Select a character and compare its distance Δ' with the minimum allowable value ΔL. and.

Δ′〉ΔＬならば、距離が近すぎるため、その入力文字
パターンの特徴ベクトルは辞書に追加せず。If Δ′>ΔL, the distance is too short, so the feature vector of that input character pattern is not added to the dictionary.

その旨を制御部７へ通知し、表示させる。一方。This is notified to the control unit 7 and displayed. on the other hand.

Δ′≧ΔＬの場合、入力文字パターンの特徴ベクトルを
辞書に追加登録し、その旨を表示させる。If Δ'≧ΔL, the feature vector of the input character pattern is additionally registered in the dictionary, and a message to that effect is displayed.

その際、その許容誤差範囲Δは第３図に示すような関数
Ｔ（Δ′）により計算する。At this time, the permissible error range Δ is calculated using a function T(Δ') as shown in FIG.

二のように、追加登録の可盃と判定と、追ＩＪＩＩ文字
の特徴ベクトル間許容誤差範囲の決定を行うのは、辞書
への追加登録に上り類似文字の増加、それによる誤認識
やりジェクトの増加を避けるためである。As shown in 2, determining whether additional registration is possible and determining the allowable error range between the feature vectors of additional IJII characters is necessary to avoid the increase in similar characters due to additional registration in the dictionary and the resulting misrecognition. This is to avoid an increase.

辞書作成モードの場合１文字パターンを文字合成部２よ
り入力するが、認識部４の動作は学習モードの場合と同
様である。In the dictionary creation mode, a single character pattern is input from the character synthesis section 2, but the operation of the recognition section 4 is the same as in the learning mode.

次に文字合成部２について説明する。この文字合成部２
は、ストローク合成により文字を合成する装置であり、
第４図に示す構成となっている。Next, the character synthesis section 2 will be explained. This character synthesis part 2
is a device that synthesizes characters by stroke composition,
The configuration is shown in FIG.

この図において、２０はシステムバス８から文字コード
などをバス２１に取り込む入力ポートである。２２は装
置全体の制御や演算処理を行う主処理部である。２３は
文字辞書メモリ、２４はストローク辞書メモリ、２５は
飾り辞書メモリであり。In this figure, reference numeral 20 is an input port that takes in character codes and the like from the system bus 8 to the bus 21. 22 is a main processing section that controls the entire device and performs arithmetic processing. 23 is a character dictionary memory, 24 is a stroke dictionary memory, and 25 is a decoration dictionary memory.

そｈぞれに制御部７および主処理部２２の制御により、
外部記憶装置５から文字辞書、ストローク辞書、飾り辞
書がロードされる。Under the control of the control section 7 and the main processing section 22, respectively,
A character dictionary, a stroke dictionary, and a decoration dictionary are loaded from the external storage device 5.

文字辞書には、漢字を含む文字や記号のパターン（漂帛
パターンと変形パターン）を合成するための基本的なデ
ータが登録される８文字辞書のデータは、第５図に概念
図に示すように２合成すべ！！文字を示す文字コード（
標準パターンと各変形パターンの識別番号情報が付加さ
れている）１文字を構成するストロークの番号、ストロ
ーク辞書的データを指定するストロークコード、飾り辞
害内データを指定する飾りコード、ストロークの始点度
標、ストロークの長さを示すプロポーションからなる。The character dictionary contains the basic data for synthesizing character and symbol patterns (drifting patterns and deformed patterns) including kanji. 2 should be combined! ! Character code indicating the character (
(Identification number information for the standard pattern and each modified pattern is added) Number of strokes that make up one character, Stroke code that specifies stroke dictionary data, Decoration code that specifies data within the decoration, Stroke starting point degree It consists of proportions that indicate the length of the stroke.

文字の心線を構成するストロークは例えば８０種１！ａ
程度に標準化され、各ストロークに関するデータがスト
ローク辞書に登録されている。このストローク辞書のデ
ータは、第６図に概念的に示すように、ストロークを指
定するストロークコート、ストロークの始点座標、スト
ロークの長さを示すプロポーション、ストロークに肉付
けするための太線化情報、ストロークをビットパターン
に展開するためのチェーンコートからなる。二のチェー
ンコードは、ストロークをビット展開する際に。For example, there are 80 types of strokes that make up the core of a character! a
It has been standardized to a certain degree, and data regarding each stroke is registered in a stroke dictionary. As conceptually shown in Figure 6, the data in this stroke dictionary includes a stroke coat that specifies a stroke, coordinates of the starting point of the stroke, proportions that indicate the length of the stroke, thickening information for fleshing out the stroke, and stroke code that specifies the stroke. Consists of a chain coat for development into a bit pattern. The second chain code expands the stroke a bit.

あるドツトから次のドツトへ移行するための方向を、第
１３図に示す０〜７のコートな表現し、たちのであり、
８はストップコードである、太線ｆヒ情報は、第８図に
示すように、１つのストロークコートについて、特徴コ
ート、節点座環、太さデータ（Ｌ、Ｒ）からなる。特徴
コードは、スＩ・ローフの肉付けする部分を規定するコ
ードである５第９図に例示するように（図は文字ｒ大」
の／Ｅ下にはらう部分などのストロークを示している）
１節点座標はストロークの太さが変化するＸ方向座標を
示す。太さデータは、ストロークの心線すなわちベアボ
ーン】００にＹ方向に左右に肉付けする幅を示し、でい
る。The direction of transition from one dot to the next is expressed as a code from 0 to 7 as shown in Figure 13.
8 is a stop code. As shown in FIG. 8, the information consists of a feature coat, a nodal seat ring, and thickness data (L, R) for one stroke coat. The feature code is a code that specifies the fleshed out part of the loaf.
/E indicates the stroke of the part to be drawn etc.)
The 1-node coordinate indicates the X-direction coordinate at which the thickness of the stroke changes. The thickness data indicates the width of the center line of the stroke, that is, the bare bone 00, to be filled left and right in the Y direction.

飾り辞書には、明朝体漢字の横棒や縦棒の始端と終端に
ある突起、カギの角部分の突起なと゛の飾りを示すデー
タが、８（Ｎ１ｍ程度１２Ｂさ↑（でいる。In the decoration dictionary, there are 8 data showing the protrusions at the beginning and end of the horizontal bar and vertical bar of Mincho typeface kanji, the protrusions on the corner of a key, and the decorations such as ゛.

二の飾り辞；ＩＦのデータは、第７図に示すように。Second decoration: IF data is as shown in Figure 7.

飾りコート、飾りの始点Ｓの片漂、飾りのＩ−ソトパタ
ーンからなる。その四を第１０図に示す５第１１図に示
す横棒のストロークの右端に、基準点Ｓｔと始点Ｓを一
致させて第１Ｏ図の飾りを付加す九ば、第１２図のスト
ロークパターンが得られる。It consists of a decorative coat, a one-sided drift of the starting point S of the decoration, and an I-soto pattern of the decoration. Part 4 is shown in Fig. 10. 5. At the right end of the stroke of the horizontal bar shown in Fig. 11, the decoration shown in Fig. 1O is added by aligning the reference point St and the starting point S. The stroke pattern shown in Fig. 12 is can get.

ｉｆ￥び？ＰＪ／１図を参照し５．ストローク発生部２
６は。If ¥bi? 5. Refer to PJ/1 diagram. Stroke generator 2
6 is.

ストロークデータを入力として、ストロークのドツトパ
ターンを出力バッファ３６上に展開するものである。飾
り発生部２８は、飾りのデータを入力とし２て、飾りパ
ターンを出カバソファ３６上のストロークパターンに合
成する装置である。ストローク発生部２６および飾り発
生部２８への入力データは、主処理部２２により各辞書
メモリ２３゜２４．２５から読み出される。出力バッフ
ァ３Ｇの内容は、出力制御部３０の制御により出力され
、平滑処理部３８で斜線のエツジなどの平滑（ヒを施さ
れた後、システムバス８へ送出される、次に、第１４図
のフローチャートを参照し、て。The dot pattern of the stroke is developed on the output buffer 36 by inputting stroke data. The decoration generating section 28 is a device that receives the decoration data as input and synthesizes the decoration pattern with the stroke pattern on the cover sofa 36. Input data to the stroke generating section 26 and the decoration generating section 28 are read out from each dictionary memory 23°, 24, and 25 by the main processing section 22. The contents of the output buffer 3G are outputted under the control of the output control section 30, smoothed by the smoothing section 38 to remove diagonal edges, etc., and then sent to the system bus 8. Please refer to the flowchart.

文字合成部２により文字合成処理を説明する。まず、オ
ペレータはキー人力装置３によって評言作成モードを指
定する。そうすると、制御部７は文字合成部２を作動状
態にする。ついで、オペレータは発生すべき文字のコー
ドのキー人力装置３から入力する５この文字コードはシ
ステムバス８を通じて文字合成部２に送られ、入力ボー
ト２０およびバス２１を介し、て主処理部２２に入力さ
れる（ブロックｌｏｔ＞。Character synthesis processing by the character synthesis section 2 will be explained. First, the operator specifies the review creation mode using the key manual device 3. Then, the control section 7 puts the character composition section 2 into operation. Next, the operator inputs the code of the character to be generated from the key-powered device 3 (5). Input (block lot>.

主処理部２２は、入力された文字フードを用いて文字辞
！Ｆ（２３）を検索し、該当する文字の１つのパターン
（ｉ初は標準パターン、それ以降は各変形パターン）に
対応するデータを読み出す（プロソゲ１０２）。主処理
部２２は、読み出したデータ中の最初のストローク番号
から、ストロークコードによってストローク辞ｖ（２１
１）を検索し、該当する１つのストロークに関するスト
ロークデータを読み出し、ストローク発生部２Ｇへ入力
する（ブロックｌ’０４）、また主処理部２２は、その
ストロークに飾りコードがあるか調べ（ブロック１０７
ｍあるならば、その飾りコードを用いて飾り辞１Ｆ（２
５）を検索し、該当する飾りデータを読み出して飾り発
生部２８へ入力する（ブロック１０８）。The main processing unit 22 uses the input character food to create a character word! F(23) is searched and data corresponding to one pattern of the corresponding character (standard pattern for i first, each modified pattern thereafter) is read out (Prosogame 102). The main processing unit 22 generates a stroke dictionary v(21
1), reads the stroke data related to the corresponding stroke, and inputs it to the stroke generation section 2G (block l'04).The main processing section 22 also checks whether the stroke has a decoration code (block 107).
If there is m, use that decoration code to create decoration 1F (2
5) and reads out the corresponding decoration data and inputs it to the decoration generation section 28 (block 108).

ストローク発生部２６は、入力されたストロークデータ
（始点座標、プロポーション、チェーンコード）に基づ
き、ストロークのドツトパターンを出カバソファ３６上
に展開する。すなわち、指定始点から指定プロポーショ
ンだけチェーンコードで指定される方向にストロークの
ベアボーンを描く（ブロック１０５）。例えば、第１１
図に示す横棒の例では、心線４１が描かれる。つぎにス
トローク発生部２６は、作成したストロークのベアボー
ンに、太線化情報にし、たがって肉付けを行う（ブロッ
ク１０６）。第１１図の例では、肉付は部分的４２．４
３がベアボーン４１に付加される。飾り発生部２８は、
飾りデータに基づいて。The stroke generator 26 develops a stroke dot pattern on the output cover sofa 36 based on the input stroke data (starting point coordinates, proportion, chain code). That is, the bare bones of the stroke are drawn in the direction specified by the chain code by the specified proportion from the specified starting point (block 105). For example, the 11th
In the example of the horizontal bar shown in the figure, a core wire 41 is drawn. Next, the stroke generation unit 26 adds thick line information to the bare bones of the created stroke, and accordingly adds flesh to the bare bones (block 106). In the example in Figure 11, the fleshing is partially 42.4
3 is added to the bare bone 41. The decoration generating part 28 is
Based on ornament data.

出力バソ７７３６．）：に展開されたストロークパター
ンに飾りパターンを付加する（ブロック１０９）、。Output basso 7736. ): adds a decorative pattern to the stroke pattern developed in (block 109).

例えば第１１図の例では、第１Ｏ図の飾りパターンを、
その始点Ｓとベアボーン４Ｉの基準位ＩＳｌに一致させ
て付加し５、これにより第１２図に示す横棒のパターン
が合成される。For example, in the example of Fig. 11, the decorative pattern of Fig. 1O is
The starting point S is added 5 so as to match the reference position ISl of the bare bone 4I, thereby synthesizing the horizontal bar pattern shown in FIG. 12.

１つのストロークパターンの合成が終了すると。When the composition of one stroke pattern is completed.

主処理部２２は合成すべぎストロークが残っているか調
べ（ブロック１０３）、残っているならば次の１のスト
ロークについて同様の処理を実行する。すべてのストロ
ークの合成を終了すると、主処理部２２は出力制御部３
０に指令を与え、出カバソファ３６に合成された文字パ
ターンをシステムバス８へ出力させる（ブロック１１０
）。制御部７はシステムバス８に乗せられた文字パター
ンデータを表示装置！！６八入へさせ９表示させる。The main processing unit 22 checks whether there are any remaining strokes to be synthesized (block 103), and if there are any remaining strokes, the same process is executed for the next stroke. After completing the synthesis of all strokes, the main processing section 22 outputs the output control section 3.
0 and causes the output cover sofa 36 to output the synthesized character pattern to the system bus 8 (block 110
). The control unit 7 displays the character pattern data carried on the system bus 8! ! Enter 68 and display 9.

次に主処理部２２は、指定文字の合成すべぎパターンが
残っているか調べ（ブロックｉｌＮ。Next, the main processing unit 22 checks whether a combination pattern of the specified character remains (block ilN).

残っているならば、そのデータを文字辞書２３から読み
出しくブロック１０２）、以下同条の処理を行って、そ
の文字パターンを合成する１合成された文字パターンは
、同様に表示装置６に表示される。全パターンの合成を
終了すると、主処理部２２は出力制御部３０を通し、て
制御部７に終了報告を送る。If the remaining data remains, the data is read from the character dictionary 23 (block 102), and the processing described in the same article is performed to synthesize the character pattern.1 The synthesized character pattern is similarly displayed on the display device 6. Ru. When the synthesis of all patterns is completed, the main processing section 22 sends a completion report to the control section 7 through the output control section 30.

二のようにし２て、１つの文字の標準パターンと。2 and 2 with a standard pattern of one character.

その１つ以）二の変形パターンが文字合成部２で順次合
成され１表示装置６に表示される。なお、制御部７は、
同一文字の各パターンに識別番号を付加して表示させる
。また、各パターンは表示装置６内部のメモリに記憶さ
れている。制御部７は、終了報告を受けると、オペレー
タに対する文字パターン選択増水を表示装置１６に表示
させる。The one and two modified patterns are sequentially synthesized by the character synthesis section 2 and displayed on the display device 6. Note that the control unit 7
Add an identification number to each pattern of the same character and display it. Further, each pattern is stored in a memory inside the display device 6. Upon receiving the completion report, the control section 7 causes the display device 16 to display a text pattern selection water increase for the operator.

次に９手書き文字認識用辞書の作成について、第１５図
のフローチャートを参照し説明する。この場合、オペレ
ータはキー人力装置３より辞書作成モードを指定する。Next, the creation of a dictionary for 9 handwritten character recognition will be explained with reference to the flowchart of FIG. In this case, the operator specifies the dictionary creation mode using the key manual device 3.

次にオペレータは、辞書に登録したい文字のコードをキ
ー人力装ｗ３から入力する（ブロック２０１）。Next, the operator inputs the code of the character to be registered in the dictionary using the keypad w3 (block 201).

二の文字コードが文字合成部２に入力さ↑Ｌると、その
文字の標準パターンと１つ以上の変形パターンが文字合
成部２により順次合成され１表示装置６に表示される（
ブロック２０２）。制御部７は、全パターンの合成終了
報告を文字合成部２から受けると、パターン選択メツセ
ージを表示装置６に表示させる。オペレータは１表示さ
れているパターンの中から、自分のｆ、ｉｌＦき文字に
似ている１つまたは複数のパターンの番号（前述のよう
に、各パターンには識別番号が一緒に表示されている）
をキー人力装［３から入力し、制御部７は、オペレータ
により選択されたパターンを表示装！！６の内部メモリ
から読み出し、認識部４に入力する（ブロック２０３）
。When the second character code is input to the character synthesis section 2, the standard pattern and one or more modified patterns of that character are sequentially synthesized by the character synthesis section 2 and displayed on the first display device 6 (
block 202). When the control section 7 receives the report of completion of synthesis of all patterns from the character synthesis section 2, it causes the display device 6 to display a pattern selection message. The operator selects the number of one or more patterns from among the displayed patterns that are similar to his f, ilF character (as mentioned above, each pattern is accompanied by an identification number). )
is input from the key operator [3], and the control unit 7 displays the pattern selected by the operator on the display! ! 6 from the internal memory and input to the recognition unit 4 (block 203)
.

認識部４は前記学習モードで動作する（ブロック２０４
）、すなわち、入力文字パターンの特徴ベクトルが特徴
抽出部ｌＯによって抽出され、その特徴ベクトルとの距
離が最小の特徴ベクトルが辞書検索部１１により検索さ
れる。入力文字パターンを認識できた場合、その時に入
力した文字の変形パターンの特徴ベクトルは手書き文字
認識用辞書（メモリ１４）に追加登録する必要はないし
５゜もし登録すると認識率が低下する恐れがある。した
がって、この場合、その旨が制御部７の制御により表示
装置６に表示されるだけで、その入力文字パターンに対
する処理は終了する。リジェクトとなった場合、辞１１
Ｆ登録部１２により追加登録がなされる。まず、類似文
字が辞書に登録さ才１ていないためにリジェクトとなっ
た場合は、入力文字パターンの特徴ベクトルが、標電の
許容誤差範囲△０と文字コード（制御部７より入力され
る）が組み合わされて辞ＩＦ（１／ｌ）に登録される。The recognition unit 4 operates in the learning mode (block 204
), that is, the feature vector of the input character pattern is extracted by the feature extraction unit 1O, and the dictionary search unit 11 searches for the feature vector having the minimum distance from the feature vector. If the input character pattern can be recognized, there is no need to additionally register the feature vector of the deformed pattern of the character input at that time in the handwritten character recognition dictionary (memory 14), and if it is registered, the recognition rate may decrease. . Therefore, in this case, only a message to that effect is displayed on the display device 6 under the control of the control section 7, and the processing for the input character pattern ends. If rejected, 11
Additional registration is performed by the F registration unit 12. First, if a similar character is rejected because it is not registered in the dictionary, the feature vector of the input character pattern is the permissible error range △0 of the signboard and the character code (input from the control unit 7). are combined and registered in the dictionary IF (1/l).

類似候補文字が２つ以上あるためにリジェクトされた場
合、最も類似し、た候補文字と入力文字との特徴ベクト
ル間距離が調べらオシ、それが許容最小値を越えている
ならば、辞書への追加登録は行われず。If the character is rejected because there are two or more similar candidate characters, the distance between the feature vectors between the most similar candidate character and the input character is checked, and if it exceeds the minimum allowable value, it is added to the dictionary. No additional registration was made.

許容最小値以下ならば、それを用いて前述のように計算
される許容誤差範囲および文字コードととに、入力文字
パターンの特徴ベクトルが辞書に登録される。登録の旨
のメツセージが表示装置６に表示される。If it is less than or equal to the minimum allowable value, the feature vector of the input character pattern is registered in the dictionary along with the allowable error range and character code calculated as described above using it. A message to the effect of registration is displayed on the display device 6.

選択されたすべてのパターンについて以上の処理が終る
と、続行確認メツセージが表示される（ブロック２０５
）。オペレータは、別に登録し。When the above processing is completed for all selected patterns, a confirmation message to continue is displayed (block 205).
). Operators must be registered separately.

たい文字があるならば続行指示をキー人力し、終了なら
ば終了指示キー人力する。制御部７は入力された指示が
終了指示か否かを調へ（ブロック２０６）、終了ならば
辞書作成モードから通常モードに移行し７．続行指示な
らば文字コード入力を待つ、なお、辞書作成モードは、全く新し５くニーぜ用の辞書
を作成するためにも、既に作成されているユーザ用辞書
を更新（追加登録）するためにも用いられる。If there is a character you want, press the continue instruction key, and if you want to finish, press the end instruction key. The control unit 7 determines whether the input instruction is an end instruction (block 206), and if it is an end instruction, shifts from the dictionary creation mode to the normal mode7. If the instruction is to continue, it will wait for the character code input. In addition, the dictionary creation mode is used to update (additionally register) an already created user dictionary, as well as to create a completely new dictionary for the 5th Kneeze. Also used for

以上のようにし、て、ユーザの手書き文字を記入し、た
学習シードなどを文字読取部ｌに読ませるなどの方法よ
りも迅速に、崎々のユーザの手書き文字認識にｉ＆適な
辞書を効率よく作成できる３な、お、作成し、た辞書を
外部記憶′４Ａ置５に格納し傑ｌｔできることは勿論で
ある。As described above, it is possible to efficiently create a dictionary that is suitable for recognizing the user's handwritten characters more quickly than the method of writing the user's handwritten characters and having the character reading unit read the learning seeds. Of course, you can easily create a dictionary and store it in the external storage space 5.

以上、−・実施例について説明したが１本発明はそれだ
けに限定されるものではない。Although the embodiments have been described above, the present invention is not limited thereto.

例えば、辞書作成処理において１文字合成部２より合成
したパターンを、オペレータが表示画面をみながら修正
し、その修正パターンを認識部に入力し　辞ＩＩ　’Ｊ
　録処理を実ｔうさせてもよい、二のようなパターン修
正は１表示画面上で直接的に実行してもよいし１、ある
いは、例えば文字辞書から読み出されたデータを表示さ
せ、特定のストロークの始点座環やプロポーションの修
正をキー人力装置から入力し５、修正後のデータを文字
合成部のト処理部に送り、その修ＩＦデータに１＆づき
文字パターンを１１＃ｊσ合成させるなどの方法によっ
て？１つでもよい。For example, in the dictionary creation process, an operator corrects the pattern synthesized by the single character synthesis section 2 while looking at the display screen, and inputs the corrected pattern into the recognition section.
The pattern modification described in (2) may be performed directly on the display screen, or, for example, by displaying data read out from a character dictionary and Input the correction of the starting point ring and proportion of the stroke from the key human power device 5, send the corrected data to the g processing section of the character synthesis section, and synthesize the 1 & character pattern 11#jσ with the modified IF data, etc. By the way? One is fine.

〔effect〕

以上説明したように１本発明によれば、大規模な＋１辞
書を用意することなく、個々のユーザに最適な手書き文
字認識用辞書を従来より効率的に１ヤ成（更新）するこ
とができる。As explained above, according to the present invention, it is possible to create (update) a dictionary for handwritten character recognition that is optimal for each user more efficiently than before, without preparing a large-scale +1 dictionary. .

[Brief explanation of the drawing]

第１図は本発明の一実施例に係る手書き文字認識装置の
概略ブロック図、第２図は認識部のブロック図、第３図
は特徴ベクトルの許容誤差範囲を決定するための関数の
グラフ、第４図は文字合成部のブロック図、第５図は文
字辞書データの構成を示す概念図、第６図はストローク
辞書データの構成を示す概念図、第７図は飾り辞書デー
タの構成を示す概念図、第８図は太線化情報の内容を示
す概念図、第９国は太線化情報の意：味を説明するため
のストロークパターンの・例を示す図、第１０図は飾り
データの〜・例の説明図、第１１図および第１２図はス
トロークのベアボーン、肉付け。飾り（、ｒ加の説明図、第１：１図はチェーンニ１−ド
の説明図、第１４図は文字合成部によるパターン合成処
理を示すフローチャート、第１５図は干書き文字認識用
辞書の作成（更新）処理を示すフローチャートである。ｌ・・・文字読取部、　　２・・・文字合成部。３・・・キー人力装置、　４・・・認識部、　５・・・
外部記憶装置、　６・・・表示装置、　７・・・制御部
。ＩＯ・・・特徴抽出部、　　　１１・・・辞書検索部。１３・・・辞ＩＦ登録部、　　　１４・・・辞書メモリ
。２０・・・入力ポート、　　２２・・・主処理部、２３
・・・文字辞書メモリ、　　２４・・・ストローク辞書
メモリ、　　２５・・・飾り辞書メモリ。２６・・・ストローク発生部、　２８・・飾り発生部。、３０・・出力制御部、３６・・出カバソファ。３８　・平滑処理部。第３図第　４　図第　５　図第　６　図第　７　図第１１図第１２図第１３図第１４図第１５図FIG. 1 is a schematic block diagram of a handwritten character recognition device according to an embodiment of the present invention, FIG. 2 is a block diagram of a recognition unit, and FIG. 3 is a graph of a function for determining an allowable error range of a feature vector. Fig. 4 is a block diagram of the character synthesis section, Fig. 5 is a conceptual diagram showing the structure of character dictionary data, Fig. 6 is a conceptual diagram showing the structure of stroke dictionary data, and Fig. 7 is a conceptual diagram showing the structure of decoration dictionary data. Conceptual diagram, Figure 8 is a conceptual diagram showing the contents of the thick line information, Country 9 is a diagram showing an example of a stroke pattern to explain the meaning of the thick line information, and Figure 10 is a diagram showing the meaning of the thick line information. - Examples of explanatory diagrams, Figures 11 and 12, are bare bones and fleshed-out strokes. Figure 1:1 is an explanatory diagram of the chain needle, Figure 14 is a flowchart showing pattern synthesis processing by the character composition unit, Figure 15 is the creation of a dictionary for recognition of handwritten characters. It is a flowchart showing (update) processing. 1... Character reading section, 2... Character synthesis section. 3... Key human power device, 4... Recognition section, 5...
External storage device, 6...Display device, 7...Control unit. IO: Feature extraction unit; 11: Dictionary search unit. 13...Dictionary IF registration section, 14...Dictionary memory. 20... Input port, 22... Main processing unit, 23
... Character dictionary memory, 24... Stroke dictionary memory, 25... Decoration dictionary memory. 26... Stroke generation part, 28... Decoration generation part. , 30... Output control unit, 36... Output cover sofa. 38 - Smoothing processing section. Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Figure 11 Figure 12 Figure 13 Figure 14 Figure 15

Claims

[Claims]

(1) In a character recognition device that can recognize handwritten characters and is equipped with a device that extracts feature vectors of input character patterns and registers them in a handwritten character recognition dictionary, a character synthesis device is used to synthesize transformed patterns of characters. A method for creating a dictionary for handwritten character recognition, characterized in that by selecting and inputting a character pattern similar to a user's handwritten character from among them, the feature vector of that character pattern is registered in the dictionary for handwritten character recognition. .