JPS58214972A

JPS58214972A - On-line handwritten character recognizer

Info

Publication number: JPS58214972A
Application number: JP57097606A
Authority: JP
Inventors: Akihiro Asada; 昭広浅田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1982-06-09
Filing date: 1982-06-09
Publication date: 1983-12-14
Also published as: JPH0150954B2

Abstract

PURPOSE: To eliminate the operation of indicating uppercase or lowercase by judging an input character to be uppercase when its detected size is larger than a fixed value and lowercase when it is smaller, and then outputting the character code corresponding to the judged character. CONSTITUTION: A written character is fed to a preprocessing part 4, where its size is detected. A comparator 10 compares the detected size with a set value and judges the character to be uppercase when the size is larger than the set value and lowercase when it is smaller. A code converting circuit 8 receives as inputs the size decision result signal and a character code obtained by pattern recognition of the written character independently of its size. When the size decision is uppercase, the character code corresponding to the written character is output as it is; when it is lowercase, a prescribed conversion based on the received character code is applied before output.

Description

[Detailed description of the invention]

The present invention relates to an online handwritten character recognition device, and more particularly to one that reduces the burden on the writer and prevents a drop in character entry speed.

The prior art and its problems are explained with reference to FIGS. 1 and 2. In FIG. 1, 1 is a tablet (coordinate input device), 2 is a character entry sheet, 3 is an input pen, 4 is a preprocessing unit, 5 is a feature extraction circuit, 6 is a matching circuit, 7 is a minimum value selection circuit, 8 is a code conversion circuit, 9 is an output terminal, 10 is an UP/DOWN detection circuit, 11 is a stroke number detection circuit, 12 is a standard pattern memory, 13 is a position comparison circuit, 14 is an indication frame position memory, and 15 is an uppercase/lowercase flag memory. The writer uses the input pen 3 to write characters in the character entry frames 2-1 of the character entry sheet 2 placed on the tablet 1.

At this time, the tablet 1 outputs the XY coordinate position of the tip of the input pen 3 on output line 1-3 at fixed time intervals (the sampling period).

The input pen 3 also contains a switch that detects whether the pen is pressed against the character entry sheet 2. The output of this switch is output on output line 1-3 as Z-axis information, together with the XY position information, at every sampling period.

The X, Y, and Z-axis information is supplied to the preprocessing unit 4 as handwriting information. The preprocessing unit 4 first examines the Z-axis information, takes in only the data sampled while the input pen 3 is pressed against the character entry sheet 2 (hereinafter called handwriting points), and applies the following normalization processing.

The sequence of handwriting points contains redundant points, because the pen tip does not move at constant speed during writing and some adjacent handwriting points are spatially very close. Redundant handwriting points are therefore removed. A stroke is the single line, that is, the sequence of handwriting points, drawn from when the input pen 3 touches the character entry sheet 2 until it leaves it. For each stroke, the handwriting point a fixed distance away from the start point is taken as a resampling point, then the handwriting point the same fixed distance away from that resampling point is taken as the next resampling point, and so on to the end of the stroke. In other words, the handwriting point sequence of each stroke is converted from a sequence sampled in time to a sequence sampled in distance (hereinafter, this processing is called resampling).
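The resampling step described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `resample_stroke`, the representation of a stroke as a list of (x, y) tuples, and the use of straight-line (Euclidean) distance between samples are not given in the source.

```python
import math

def resample_stroke(points, spacing):
    """Thin one stroke (a list of (x, y) pen samples) to roughly fixed spacing.

    Hypothetical helper: keeps the start point, then keeps each sample that
    lies at least `spacing` away from the most recently kept sample.
    """
    if not points:
        return []
    resampled = [points[0]]      # the start point of the stroke is always kept
    last = points[0]
    for p in points[1:]:
        # Samples closer than `spacing` to the last kept point are redundant.
        if math.dist(last, p) >= spacing:
            resampled.append(p)
            last = p
    return resampled
```

Because only existing samples are kept, the spacing of the result is at least `spacing` rather than exactly `spacing`; interpolating new points would be a different design choice that the text does not describe.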

Next, the preprocessing unit 4 normalizes the position and size of the resampled data for one character. The XY coordinates of the handwriting points differ depending on which character entry frame 2-1 of the character entry sheet 2 the character is written in, and on where within the frame it is written. Position normalization is therefore a coordinate transformation, applied character by character, that places the center of gravity of the character at a fixed position. Likewise, the size of characters written in the character entry frame 2-1 differs from writer to writer, so size normalization transforms the coordinates of each resampling point so that the size of the written character becomes constant.

This is done by making the average distance of the resampling points from the character's center of gravity constant.

The input character data preprocessed in this way is expressed by the feature extraction circuit 5 in a form with reduced information content, so that subsequent processing is easier. For example, an input character I consisting of M strokes is expressed in stroke writing order as

I = (I_1, I_2, ..., I_M)

where I_m is the m-th stroke written.

Each stroke I_1 to I_M is in turn expressed by a sequence of N+1 polyline approximation points that divide the stroke, from its start point (the first handwriting point) to its end point (the last handwriting point), into N equal parts. That is, the m-th stroke I_m is expressed with the polyline approximation points P_m1, P_m2, ..., P_m(N+1) as

I_m = (P_m1, P_m2, ..., P_m(N+1))

where each polyline approximation point P_mn is an XY coordinate value, P_mn = (X_mn, Y_mn).
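One way to obtain N+1 points that divide a stroke into N equal parts is sketched below in Python. The source does not say how the equal division is computed, so dividing by path length with linear interpolation is an assumption, as are the name `polyline_points` and the list-of-tuples stroke representation. The sketch expects a stroke of at least two points and n >= 1.

```python
import math

def polyline_points(stroke, n):
    """Return n + 1 points that split the stroke's path length into n equal parts.

    Hypothetical helper; equal division by arc length is an assumption.
    """
    # Cumulative path length at each input point.
    cum = [0.0]
    for a, b in zip(stroke, stroke[1:]):
        cum.append(cum[-1] + math.dist(a, b))
    total = cum[-1]
    out = []
    seg = 0
    for k in range(n + 1):
        target = total * k / n
        # Advance to the segment that contains the target path length.
        while seg < len(stroke) - 2 and cum[seg + 1] < target:
            seg += 1
        seg_len = cum[seg + 1] - cum[seg]
        t = 0.0 if seg_len == 0 else (target - cum[seg]) / seg_len
        (x0, y0), (x1, y1) = stroke[seg], stroke[seg + 1]
        # Linear interpolation inside the segment.
        out.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return out
```

Representing every stroke with the same number of points is what lets two strokes be compared point by point in the matching step.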

The input character described in this way by the feature extraction circuit 5 is supplied to one input of the matching circuit 6. The other input of the matching circuit 6 is supplied from the standard pattern memory 12 with a standard pattern for each character to be recognized: the average pattern of input characters written by many writers, prepared in advance with the same preprocessing and feature extraction as the input character.

Here, the standard pattern S^θ for a character θ is written

S^θ = (S^θ_1, S^θ_2, ..., S^θ_M)

where M is the number of strokes of the character θ and S^θ_m is the m-th stroke, expressed as

S^θ_m = (P^θ_m1, P^θ_m2, ..., P^θ_m(N+1)).

P^θ_mn is the n-th polyline approximation point dividing the m-th stroke into N equal parts, expressed as the XY coordinate value P^θ_mn = (x^θ_mn, y^θ_mn).

The matching circuit 6 calculates the distance D(θ) between the input character I and a standard pattern S^θ having the same number of strokes M as I, according to equation (7).

Here, I_m is the m-th stroke of the input character I, S^θ_m is the m-th stroke of the standard pattern S^θ, and ds(S^θ_m, I_m) is the distance between the m-th strokes of the two patterns, given by equation (8) as a summation over the polyline approximation points starting from n = 1. N+1 is the number of polyline approximation points per stroke, P^θ_mn and P_mn are the n-th polyline approximation points of the m-th stroke of the two patterns, and dP(P^θ_mn, P_mn) is the distance between those two points:

dP(P^θ_mn, P_mn) = (x^θ_mn - X_mn)^2 + (y^θ_mn - Y_mn)^2 ... (9)

Combining equations (7) to (9), the matching circuit 6 calculates the inter-pattern distance D(θ), where X_mn and x^θ_mn are the X coordinates of the n-th polyline approximation point of the m-th stroke, and Y_mn and y^θ_mn are likewise the Y coordinates.

If there are L standard patterns whose number of strokes equals the number of strokes M of the input character I, the matching circuit 6 calculates the inter-pattern distance D(θ) between the input character I and each of these L standard patterns in turn, and supplies the results to the minimum value selection circuit 7.

The number of strokes of the input character I is obtained as follows: the UP/DOWN detection circuit 10 detects the UP and DOWN states of the input pen 3 from the Z-axis information supplied by the tablet 1, and the stroke number detection circuit 11 counts the transitions from UP to DOWN over one character.
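Counting strokes from the Z-axis information can be sketched in Python as below. The name `count_strokes`, the encoding of the Z samples as truthy (pen down) or falsy (pen up) values, and the assumption that the pen starts lifted are all illustrative choices, not details from the source.

```python
def count_strokes(z_samples):
    """Count UP-to-DOWN transitions in a sequence of Z-axis (pen switch) samples.

    Hypothetical helper: each sample is truthy while the pen presses the sheet.
    """
    strokes = 0
    pen_down = False             # assumption: the pen is lifted before the first sample
    for z in z_samples:
        if z and not pen_down:   # an UP -> DOWN transition starts a new stroke
            strokes += 1
        pen_down = bool(z)
    return strokes
```

For a two-stroke character such as the "A" used later in the description, a sample sequence like `[0, 1, 1, 0, 0, 1, 1, 0]` yields a count of 2.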

The output of the stroke number detection circuit 11 controls the standard pattern memory 12 so that the standard patterns S^θ whose number of strokes equals the number of strokes M of the input character I are selected and supplied to the matching circuit 6.

The minimum value selection circuit 7 detects the minimum of the L inter-pattern distances D(θ_1) to D(θ_L) supplied in sequence, fetches the character code of the corresponding standard pattern from the standard pattern memory 12, and outputs it to the code conversion circuit 8.
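The matching and minimum-selection steps can be sketched together in Python. The sketch assumes that the inter-pattern distance is the sum, over all strokes and all polyline approximation points, of squared coordinate differences; this is inferred from the surviving fragment of equation (9) and may differ from the original equations (7) and (8). The names `pattern_distance` and `recognize`, and the representation of the standard pattern memory as a list of (code, pattern) pairs, are hypothetical.

```python
def pattern_distance(standard, written):
    """Sum of squared point-to-point differences over corresponding strokes.

    Both arguments are lists of strokes; each stroke is a list of (x, y) points
    with the same number of points (assumed squared-Euclidean point distance).
    """
    total = 0.0
    for s_stroke, i_stroke in zip(standard, written):
        for (sx, sy), (ix, iy) in zip(s_stroke, i_stroke):
            total += (sx - ix) ** 2 + (sy - iy) ** 2
    return total

def recognize(written, standard_patterns):
    """Return the code of the nearest standard pattern with the same stroke count."""
    candidates = [
        (pattern_distance(pattern, written), code)
        for code, pattern in standard_patterns
        if len(pattern) == len(written)     # selection by number of strokes
    ]
    if not candidates:
        return None                         # no pattern with this stroke count
    return min(candidates)[1]               # code of the minimum-distance pattern
```

Filtering by stroke count before computing distances mirrors the role of the stroke number detection circuit 11, which limits matching to the L patterns with M strokes.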

The problem with the conventional online handwritten character recognition method described above lies in how uppercase and lowercase characters are input and recognized, for example the full-size "キ" and the small "ャ", "ュ", "ョ" in the kana "キャ", "キュ", "キョ". Such lowercase characters have exactly the same shape as the corresponding uppercase characters and differ only in size. In the prior art, normalization in the preprocessing unit removes the difference in size, so the recognition unit sees exactly the same character and cannot tell whether it is uppercase or lowercase.

For example, when the kana "ツ" is written filling the character entry frame 2-1 as in FIG. 2(a) and when it is written small in a corner of the frame as in FIG. 2(c), the preprocessing results, shown in (b) and (d), are exactly the same pattern centered on the center of gravity W0. Characters whose uppercase and lowercase patterns have the same shape exist not only in Japanese katakana and hiragana but also in the Latin alphabet.

To deal with this, a method has conventionally been adopted in which the writer supplies, through the tablet 1 at the time of writing, the information on whether a character is uppercase or lowercase.

For example, as shown in FIG. 1, the tablet 1 is provided with an uppercase indication frame 1-1 and a lowercase indication frame 1-2. By pressing one of these frames with the input pen 3, the writer declares that the characters input from then on are uppercase or lowercase.

In the recognition unit, the position comparison circuit 13 uses the coordinate values corresponding to the indication frames 1-1 and 1-2 (stored in the indication frame position memory 14) as the reference for comparison, detects which indication frame the input pen 3 has pressed, or that it has pressed neither, and sets the uppercase/lowercase flag memory 15 according to the detection result.

Here, let the output F of the uppercase/lowercase flag memory 15 be F = 1 for uppercase and F = 0 for lowercase. Also let the character code of the uppercase "ツ" be A3C4 and that of the lowercase "ッ" be A3C5, and suppose that the standard pattern for "ツ" is given the uppercase character code A3C4.

When "ツ" is written with the input pen 3, the minimum value selection circuit 7 outputs the character code A3C4, which is supplied to the code conversion circuit 8. When the indication from the uppercase/lowercase flag memory 15 is F = 1 (uppercase), the code conversion circuit 8 outputs the character code A3C4 as it is; when F = 0 (lowercase), it outputs the corresponding lowercase character code A3C5. That is, the code conversion circuit 8 holds an internal table mapping uppercase character codes to lowercase character codes and uses it to convert the uppercase code to the lowercase code before output when F = 0.
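The table lookup performed by the code conversion circuit 8 can be sketched in Python as follows. The names `UPPER_TO_LOWER` and `convert_code` are hypothetical, the table holds only the one pair of codes given in the text (A3C4 and A3C5), and returning the code unchanged when no lowercase counterpart is listed is an assumption.

```python
# Uppercase-to-lowercase code table; only the pair named in the text is listed.
UPPER_TO_LOWER = {"A3C4": "A3C5"}

def convert_code(code, flag):
    """Return the output code for a recognized uppercase code and flag F.

    flag == 1 means uppercase (pass through); flag == 0 means lowercase (convert).
    """
    if flag == 1:
        return code
    # Assumption: codes without a lowercase counterpart pass through unchanged.
    return UPPER_TO_LOWER.get(code, code)
```

With this table, `convert_code("A3C4", 1)` passes the code through and `convert_code("A3C4", 0)` maps it to the lowercase code.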

However, in the conventional method described above, the writer must indicate whether each character is uppercase or lowercase in addition to writing the characters. This not only burdens the writer but also lowers the input speed.

An object of the present invention is to solve the above problems of the prior art and to provide an online handwritten character recognition device that reduces the burden on the writer, prevents a drop in character entry speed, and improves recognition efficiency for Latin letters.

To achieve this object, the present invention is characterized by comprising: character size detection means for detecting the size of a written character; character size determination means for comparing the detected value with a set value and judging the character to be uppercase when the value is larger than the set value and lowercase when it is smaller; and code conversion means that receives as inputs the size determination result signal and a character code corresponding to the result of pattern recognition of the written character performed independently of its size, outputs that character code as it is when the size determination result is uppercase, and applies a predetermined code conversion based on the received character code before output when the result is lowercase.

An embodiment of the present invention is described below with reference to FIG. 3. In FIG. 3, 4-1 is a resampling circuit, 4-2 is a position normalization circuit, 4-3 is a center-of-gravity extraction circuit, 4-4 is a size normalization circuit, 4-5 is an average radius extraction circuit, and 16 is a comparison circuit; the other elements are the same as in FIG. 1. The handwriting information of the input character is fed from the tablet output line 1-3 to the resampling circuit 4-1. As described above, this circuit removes the redundant points among the handwriting points of each stroke and converts the handwriting point sequence of each stroke from a sequence sampled in time to a sequence sampled in distance. That is, the line from the start point to the end point of each stroke is resampled at fixed distance intervals.

The m-th stroke I_m of the resampled input character I is expressed by the sequence of resampling points Q_m1, Q_m2, ..., Q_mE(m) as

I_m = (Q_m1, Q_m2, ..., Q_mE(m))

where Q_me is the e-th resampling point of the m-th stroke and E(m) is the number of resampling points of the m-th stroke. Each resampling point Q_me is an XY coordinate value, Q_me = (X_me, Y_me).

The input character data resampled in this way is supplied to the center-of-gravity extraction circuit 4-3, which extracts the center of gravity W0 of the input character. W0 takes as its XY coordinates the mean X0 of the X coordinates and the mean Y0 of the Y coordinates of all resampling points Q_me (m = 1 to M, e = 1 to E(m)) of the character, and is expressed as W0 = (X0, Y0).

Next, the position normalization circuit 4-2 transforms the coordinates of each resampling point Q_me so that the center of gravity W0 becomes the origin of a new XY coordinate system.

That is, X0 and Y0 are subtracted from the XY coordinates X_me, Y_me of each resampling point Q_me:

Q_me = (X_me - X0, Y_me - Y0)

Writing x_me = X_me - X0 and y_me = Y_me - Y0, the center of gravity is at x = 0, y = 0.
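The center-of-gravity extraction and position normalization just described can be sketched in Python. The function name `center_on_centroid` and the representation of a character as a list of strokes, each a list of (x, y) tuples, are assumptions made for illustration.

```python
def center_on_centroid(strokes):
    """Translate all resampling points so the character's centroid is the origin.

    Returns the translated strokes and the centroid W0 = (X0, Y0).
    """
    points = [p for stroke in strokes for p in stroke]
    x0 = sum(x for x, _ in points) / len(points)   # mean X over all resampling points
    y0 = sum(y for _, y in points) / len(points)   # mean Y over all resampling points
    centered = [[(x - x0, y - y0) for x, y in stroke] for stroke in strokes]
    return centered, (x0, y0)
```

After this step, a character written in any frame, and anywhere within the frame, has the same coordinates as long as its shape and size are the same.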

Next, for the input character data whose position has been normalized as above, the average radius extraction circuit 4-5 obtains the size of the input character, here its average radius R. In the expression for R, U = Σ_{m=1}^{M} E(m) is the total number of resampling points of the input character, M is the number of strokes of the input character, and |x_me| and |y_me| are the absolute values of the X and Y coordinates of the e-th resampling point of the m-th stroke, taken with the center of gravity W0 of the input character as the origin. In other words, the average radius R is the mean distance of the resampling points Q_me from the center of gravity W0. R grows as the character is written larger, so it is a parameter corresponding to the size of the input character.

The size normalization circuit 4-4 transforms the coordinates of each resampling point Q_me so that this average radius R becomes the set value R0. With X'_me and Y'_me denoting the X and Y coordinates of the resampling point Q_me after the transformation, size normalization amounts to normalizing (dividing) each XY coordinate by the average radius R of the input character.

The average radius R of the input character is also fed to one input of the comparison circuit 16, where it is compared with the set value R_th fed to the other input. That is, the comparison circuit 16 determines whether the input character is larger than the set value. The determination result is supplied to the code conversion circuit 8 and controls its operation.
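The average radius, the size normalization, and the comparison against the set value can be sketched in Python as below. The expression for R is not reproduced in this text, so the use of Euclidean distance from the origin is an assumption; the function names are hypothetical, and treating a radius exactly equal to the threshold as lowercase is also an assumption.

```python
import math

def average_radius(centered_strokes):
    """Mean distance of the resampling points from the origin (the centroid).

    Assumption: distance is Euclidean.
    """
    points = [p for stroke in centered_strokes for p in stroke]
    return sum(math.hypot(x, y) for x, y in points) / len(points)

def scale_to_radius(centered_strokes, r, r0):
    """Rescale centered points so that an average radius r becomes r0."""
    k = r0 / r
    return [[(x * k, y * k) for x, y in stroke] for stroke in centered_strokes]

def is_uppercase(r, r_th):
    """Comparison step: True when the average radius exceeds the set value."""
    return r > r_th
```

The key design point of the embodiment is that R is measured before the scaling step discards it: the normalized pattern goes to matching, while R itself goes to the comparison that later steers the code conversion.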

The output signal of the size normalization circuit 4-4 is supplied to the feature extraction circuit 5, where, as described for the prior art, the input character is expressed in a form with compressed information content. The matching circuit 6 then performs the matching calculation (calculation of inter-pattern distance) against the standard patterns.

The standard pattern memory 12 stores, for uppercase Latin letters such as A, B, C, ... and full-size kana such as ア, イ, ウ, ..., ツ, the average of the patterns written by many writers and subjected to the preprocessing and feature extraction described above, together with the character code corresponding to each character. The patterns are stored classified by number of strokes.

Suppose that, as in FIG. 4 (1) and (2), the uppercase letter "A" is written in the character entry frame 2-1 in two strokes, a first stroke I_1 "∧" and a second stroke I_2 "-". The handwriting information on output line 1-3 undergoes preprocessing and feature extraction, the matching circuit 6 performs the matching calculation (calculation of inter-pattern distance) against the two-stroke standard patterns in the standard pattern memory, and the results are supplied in sequence to the minimum value selection circuit 7.

The minimum value selection circuit 7 then detects the minimum of the inter-pattern distances D(θ) and supplies the character code of the standard pattern corresponding to that minimum to the code conversion circuit 8.

Here, suppose that the character codes for the standard patterns and their (uppercase) characters are set as in the left part of Table 1, and the character codes of the lowercase characters corresponding to those uppercase characters are set as in the right part of Table 1.

When the inter-pattern distance for the uppercase "A" is the minimum, the minimum value selection circuit 7 supplies the character code A3C1 assigned to the uppercase "A" to the code conversion circuit 8. For the two writings of "A" in FIG. 4 (1) and (2), the comparison circuit 16 then compares the average radii R(1) and R(2) extracted by the average radius extraction circuit 4-5 with the set value R_th.

Suppose the results are

R_th < R(1) ... case of FIG. 4 (1)
R_th > R(2) ... case of FIG. 4 (2)

In case (1), the code conversion circuit 8 outputs the input character code A3C1 (the character code for "A") as it is. In case (2), it treats "A" as having been written small and, based on the input character code A3C1, outputs the character code A3B1 of the corresponding lowercase letter "a" by referring to the uppercase-to-lowercase character code correspondence table of Table 1 (built into the code conversion circuit 8).

The same applies when the katakana "ツ" is written as in FIG. 4 (3) and (4). These cases are summarized in Table 2.

Table 2 (Note) "Large" means written large, filling the character entry frame; "small" means written small relative to the character entry frame.

In the embodiment above, position is normalized by taking the center of gravity of the input character as the origin, the size of the input character is taken to be the average distance (average radius) between the center of gravity and the resampling points, and size is normalized on the basis of this average radius. The present invention is not limited to this. Instead, position may be normalized by taking the center of the circumscribed rectangle of the input character as the origin, the size of the input character may be taken to be the diagonal length of that circumscribed rectangle, and size may be normalized so that this diagonal length becomes a fixed value.
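The alternative based on the circumscribed rectangle can be sketched in Python as follows. The name `bounding_box_normalize`, the axis-aligned rectangle, and the stroke representation are assumptions; the sketch also assumes the character is not a single point, so that the diagonal is non-zero.

```python
import math

def bounding_box_normalize(strokes, target_diagonal):
    """Center the character on its bounding-box center and fix its diagonal length.

    Returns the normalized strokes and the original diagonal, which serves as
    the character size for the uppercase/lowercase decision.
    """
    points = [p for stroke in strokes for p in stroke]
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    cx = (min(xs) + max(xs)) / 2            # center of the circumscribed rectangle
    cy = (min(ys) + max(ys)) / 2
    diagonal = math.hypot(max(xs) - min(xs), max(ys) - min(ys))
    k = target_diagonal / diagonal          # assumes diagonal > 0
    normalized = [[((x - cx) * k, (y - cy) * k) for x, y in stroke]
                  for stroke in strokes]
    return normalized, diagonal
```

Compared with the average radius, the diagonal depends only on the extreme points of the character, so it is cheaper to compute but more sensitive to a single stray point.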

As described above, the present invention detects the size of an input character, judges it to be uppercase when the detected size is at or above a fixed value and lowercase when it is below that value, and outputs the character code corresponding to the judged character. This makes the conventional operation of indicating uppercase or lowercase by pressing the input pen in the uppercase and lowercase indication frames unnecessary, removes that burden from the writer, and prevents a drop in character input speed. In addition, because lowercase Latin letters, which consist mainly of curves, are excluded from the recognition targets, the invention prevents the drop in recognition rate that conventionally arose because the curved lowercase letters are deformed in many different ways from writer to writer.

[Brief explanation of drawings]

FIG. 1 is a block diagram of a conventional example, FIG. 2 shows input characters and their preprocessing results, FIG. 3 is a block diagram of an embodiment of the present invention, and FIG. 4 shows examples of written characters according to the present invention.

Explanation of reference numerals: 1: tablet; 2: character entry sheet; 2-1: character entry frame; 3: input pen; 4: preprocessing unit; 4-4: size normalization circuit; 4-5: average radius extraction circuit; 5: feature extraction circuit; 7: minimum value selection circuit; 8: code conversion circuit; 12: standard pattern memory; 16: comparison circuit.

Agent: Patent Attorney Toshiyuki Usuda

Claims

[Scope of Claims]

(1) An online handwritten character recognition device in which characters are written on a tablet with an input pen and the written characters are recognized on the basis of handwriting information of the written characters output from the tablet, characterized by comprising: character size detection means for detecting the size of a written character; character size determination means for comparing the detected value with a set value and judging the character to be uppercase when the value is larger than the set value and lowercase when it is smaller; and code conversion means that receives as inputs the size determination result signal and a character code corresponding to the result of pattern recognition of the written character performed independently of its size, outputs the character code corresponding to the written character as it is when the size determination result is uppercase, and performs a predetermined code conversion based on the received character code and outputs the result when the size determination result is lowercase.

(2) The online handwritten character recognition device according to claim 1, wherein the character size detection means takes as the size of the written character the average distance between the center of gravity of the written character, obtained from resampling points extracted at fixed distance intervals from the line of each stroke of the written character, and those resampling points.

(3) The online handwritten character recognition device according to claim 1, wherein the character size detection means takes as the size of the written character the diagonal length of the rectangle circumscribing the written character.