JPH0863552A

JPH0863552A - Device for discriminating capital or small letter in handwritten character string

Info

Publication number: JPH0863552A
Application number: JP6192964A
Authority: JP
Inventors: Koji Matsumoto; 浩司松本; Kenji Okano; 健治岡野; Hidesato Ichii; 英里一井
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1994-08-17
Filing date: 1994-08-17
Publication date: 1996-03-08

Abstract

PURPOSE: To discriminate a capital letter or a small letter in character input without a character frame. CONSTITUTION: A character string segmenting means 2 segments a character string from a stroke code string inputted from a table 1. A recognition means 3 recognizes the character based on stroke information from a starting stroke number to a finishing stroke number. A capital or small letter discriminating means 40 detects whether the character recognized by the recognition means 3 is the same-shaped and duplicated character or not. A threshold value deciding means 42 obtains the threshold of the same-shaped and duplicated character detected by the capital or small letter discriminating means 40. A character size comparing and deciding means 41 obtains the ratio of the featured value of the detected character to the featured value of a capital letter other than the detected character, then it is discriminated that the detected character is small letter when the ratio is smaller than the threshold value and is capital letter when the ratio is larger than the threshold value.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、手書き文字列の大文字
と小文字を判定する手書き文字列大文字小文字判定装置
に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a handwritten character string uppercase / lowercase determination device for determining the uppercase / lowercase of a handwritten character string.

【０００２】[0002]

【従来の技術】従来、このような分野の技術としては、
例えば、次のような文献に記載されるものがあった。文献；特公平１−５０９５４号公報従来、文字枠の中に１文字ずつ筆記する手書き漢字入力
装置では、入力タブレットに文字が手書き入力される
と、１文字毎に文字認識処理が実行され、その認識結果
が表示装置等へ出力される。ところで、この入力文字の
中には、例えば仮名文字「や」と「ゃ」、「ゆ」と
「ゅ」、「よ」と「ょ」、「つ」と「っ」など、同じ文
字であって大文字と小文字の区別の存在するものが含ま
れており、この種の文字入力があると、装置側で両者を
明確に区別する必要がある。これを解決する方法とし
て、前記文献に記載されているように文字枠の大きさに
対する入力文字の大きさによって、大文字と小文字と判
定する方法が考案されている。2. Description of the Related Art Conventionally, techniques in such a field include:
For example, some documents were described in the following documents. Reference: Japanese Examined Patent Publication No. 1-50954 Conventionally, in a handwritten kanji input device that writes characters one by one in a character frame, when a character is handwritten on an input tablet, character recognition processing is executed for each character, and The recognition result is output to a display device or the like. By the way, some of the input characters are the same characters such as kana characters “ya” and “ya”, “yu” and “yu”, “yo” and “yo”, and “tsu” and “tsu”. There are some cases in which there is a distinction between uppercase and lowercase letters, and if there is a character input of this kind, it is necessary for the device side to clearly distinguish the two. As a method for solving this, a method has been devised as described in the above-mentioned document, in which uppercase and lowercase letters are determined according to the size of an input character with respect to the size of a character frame.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、従来の
手書き文字列大文字小文字判定装置においては、次のよ
うな課題があった。手書き漢字入力による良好なマン・
マシンインターフェースを実現する方法として、筆記者
に負担をかけないために文字枠のない白紙上に記入され
た文字列を入力することが望まれており、前記文献に記
載された方法は文字枠が存在する場合のみ有効である。
つまり、文字枠のない自由に筆記された文字列において
入力される文字の大きさは、筆記者が変わったり、同じ
筆記者でも時と場所、さらに同じ画面上でも行が変われ
ば相当なばらつきを生じるため文字枠の大きさを一義に
決定することができず、大文字と小文字を区別すること
は不可能である。However, the conventional handwritten character string case determination device has the following problems. Good man by handwritten kanji input
As a method of realizing a machine interface, it is desired to input a character string written on a blank sheet without a character frame so as not to burden the writer. Only valid if present.
In other words, the size of the characters entered in a freely written character string without a character frame varies considerably if the writer changes, the time and place of the same writer, and even the line changes on the same screen. Since it occurs, the size of the character frame cannot be uniquely determined, and it is impossible to distinguish between uppercase and lowercase letters.

【０００４】[0004]

【課題を解決するための手段】第１の発明は、前記課題
を解決するために、座標入力装置から得られた座標デー
タ列から文字を切り出す文字切り出し手段と、前記文字
切り出し手段により１文字として切り出された座標デー
タ列から文字を認識する認識手段とを、備えた手書き文
字列大文字小文字判定装置において、以下の手段を設け
ている。すなわち、前記認識手段により認識された文字
が小文字となる可能性を判断し、小文字となる可能性が
あればその文字の大きさと前記認識手段により認識され
大文字であるか小文字であるかが既に判定された他の文
字の大きさを前記文字切り出し手段により切り出された
座標データ列に基づいて比較することにより、前記文字
が小文字であるか大文字であるかを判定する大文字小文
字判定手段を設けている。第２の発明は、第１の発明の
大文字小文字判定手段は、以下の手段を備えている。す
なわち、前記認識手段により認識された文字のうち、大
文字と小文字の字体が同じあるいは類似する対象文字を
検出する大文字小文字検出手段と、前記大文字小文字検
出手段により検出された対象文字が大文字であるか小文
字であるかを判別するしきい値を決定するしきい値決定
手段と、前記文字切り出し手段により切り出された座標
データ列に基づき、前記大文字小文字検出手段により検
出された対象文字の大きさを表す特徴量と前記認識手段
により認識され大文字であるか小文字であるかが既に判
定された他の文字の大きさを表す特徴量の比を求め、こ
の比と前記しきい値決定手段によって決定された前記文
字のしきい値を比較し、前記対象文字が大文字であるか
小文字であるかを決定する文字の大きさ比較決定手段と
を、備えている。第３の発明は、第２の発明のしきい値
決定手段は、各対象文字毎にしきい値を決定する。第４
の発明は、第２または第３の発明において、さらに認識
手段により認識された文字と該文字の直前の文字の組み
合わせにより、前記文字が小文字である必要条件を満た
すかどうかを判定する入力規則判定手段を設けている。In order to solve the above-mentioned problems, a first aspect of the present invention provides a character cutting-out means for cutting out a character from a coordinate data string obtained from a coordinate input device, and one character by the character cutting-out means. A handwritten character string upper / lower case determination device provided with a recognition means for recognizing a character from a cut out coordinate data string is provided with the following means. That is, the possibility that the character recognized by the recognizing means is lowercase is judged, and if there is the possibility that the character is lowercase, it is already judged whether the size of the character and whether the character is recognized by the recognizing means is uppercase or lowercase. By comparing the size of the other character based on the coordinate data string cut out by the character cutting means, there is provided an upper / lower case determining means for determining whether the character is lowercase or uppercase. . In the second invention, the upper / lower case determining means of the first invention comprises the following means. That is, among the characters recognized by the recognizing means, an uppercase / lowercase detecting means for detecting a target character having the same or similar uppercase and lowercase fonts, and whether the target character detected by the uppercase / lowercase detecting means is an uppercase character. Threshold value determining means for determining a threshold value for determining whether the character is lowercase, and the size of the target character detected by the uppercase / lowercase detecting means based on the coordinate data string cut out by the character cutting means. The ratio between the feature amount and the feature amount which is recognized by the recognition means and which has already been determined to be uppercase or lowercase and which represents the size of another character is obtained, and this ratio is determined by the threshold value determination means. And a character size comparison / determination means for comparing the threshold values of the characters and determining whether the target character is uppercase or lowercase. In a third invention, the threshold value determining means of the second invention determines a threshold value for each target character. Fourth
In the second or third aspect of the invention, the input rule determination for determining whether or not the character recognized by the recognition means and the character immediately preceding the character satisfy the necessary condition that the character is lowercase. Means are provided.

【０００５】[0005]

【作用】第１の発明によれば、以上のように手書き文字
列大文字小文字判定装置を構成したので、大文字小文字
判定手段は、認識手段により認識された文字のうち、そ
の文字が小文字となる可能性を判断する。そして、小文
字となる可能性があればその文字と認識手段により認識
された他の大文字とを文字切り出し手段により切り出さ
れた座標データ列に基づいて、その大きさを比較するこ
とにより、文字が小文字であるか大文字であるかを判定
する。第２の発明によれば、大文字小文字検出手段は、
認識手段により認識された文字のうち、大文字と小文字
の字体が同じあるいは類似する対象文字を検出する。し
きい値決定手段は、大文字小文字検出手段により検出さ
れた対象文字が小文字であるか大文字であるかを判別す
るしきい値を決定する。文字の大きさ比較決定手段は、
大文字小文字検出手段により検出された対象文字の大き
さを表す特徴量と認識手段により認識され大文字である
か小文字であるかが既に判定された文字の大きさを表す
特徴量との比を求め、この比としきい値決定手段によっ
て決定された文字のしきい値とを比較し、文字が大文字
であるか小文字であるかを決定する。第４の発明によれ
ば、入力規則判定手段は、認識手段により認識された文
字と該文字の直前の文字の組み合わせにより、その文字
が小文字である必要条件を満たすかどうかを判定する。
従って、前記課題を解決できるのである。According to the first aspect of the invention, since the handwritten character string upper / lower case determining device is configured as described above, the upper / lower case determining means can make the character lowercase among the characters recognized by the recognizing means. Judge sex. Then, if there is a possibility that it will be a lowercase letter, the character and another uppercase letter recognized by the recognizing means are compared based on the coordinate data string cut out by the character slicing means, and the size is compared. , Or upper case. According to the second invention, the uppercase / lowercase detection means is
Among the characters recognized by the recognition means, a target character having the same or similar uppercase and lowercase fonts is detected. The threshold value determining means determines a threshold value for determining whether the target character detected by the uppercase / lowercase detecting means is lowercase or uppercase. Character size comparison and determination means
The ratio between the feature amount representing the size of the target character detected by the uppercase / lowercase detection unit and the feature amount representing the size of the character recognized by the recognition unit and already determined to be uppercase or lowercase, is obtained, This ratio is compared with the threshold value of the character determined by the threshold value determining means to determine whether the character is uppercase or lowercase. According to the fourth aspect of the invention, the input rule determination means determines whether or not the required condition that the character is a lower case is satisfied by the combination of the character recognized by the recognition means and the character immediately preceding the character.
Therefore, the above problem can be solved.

【０００６】[0006]

【実施例】第１の実施例図１は、本発明の第１の実施例を示す手書き文字列大文
字小文字判定装置の構成図である。この手書き文字列大
文字小文字判定装置では、手書き文字を入力するタブレ
ット１を有している。タブレット１の出力側には、文字
列切り出し手段２が接続されて、さらに文字列切り出し
手段２の出力側には、認識手段３及び大文字小文字判定
手段４が接続されている。大文字小文字判定手段４は、
認識手段３の出力側に接続された大文字小文字検出手段
４０と大文字小文字検出手段４０の出力側に接続された
文字の大きさ比較決定手段４１及びしきい値決定手段４
２とを有している。文字の大きさ比較決定手段４１は、
文字列切り出し手段２及びしきい値決定手段４２の出力
側にも接続されている。座標入力装置であるタブレット
１は、文字列を筆記入力し、ペン先がタブレット１に接
触してからペン先がタブレット１を離れるまでの各スト
ロークごとに一定時間一定時間間隔で筆点の座標値を取
り入れ、文字列切り出し手段２へ送出する手段である。
文字列切り出し手段２は、各ストロークの筆点の座標値
から文字を切り出し、その結果を認識手段３及び大文字
小文字判定手段４へ送出する手段である。認識手段３
は、文字列切り出し手段２により切り出された文字を認
識する手段である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS First Embodiment FIG. 1 is a block diagram of a handwritten character string upper / lower case determination device showing a first embodiment of the present invention. This handwritten character string upper / lower case determination device has a tablet 1 for inputting handwritten characters. The character string cutout unit 2 is connected to the output side of the tablet 1, and the recognition unit 3 and the upper / lower case determination unit 4 are connected to the output side of the character string cutout unit 2. The case determination means 4 is
The case detection means 40 connected to the output side of the recognition means 3 and the character size comparison determination means 41 and the threshold value determination means 4 connected to the output side of the case detection means 40.
2 and. The character size comparison and determination means 41 is
It is also connected to the output side of the character string cutout means 2 and the threshold value determination means 42. The tablet 1, which is the coordinate input device, inputs a character string by handwriting, and the coordinate value of the writing point is fixed at a fixed time interval for each stroke from when the pen tip contacts the tablet 1 until the pen tip leaves the tablet 1. Is taken in and sent to the character string cutout means 2.
The character string cutout unit 2 is a unit that cuts out a character from the coordinate value of the writing point of each stroke and sends the result to the recognition unit 3 and the uppercase / lowercase determination unit 4. Recognition means 3
Is a means for recognizing the characters cut out by the character string cutout means 2.

【０００７】大文字小文字判定手段４は、認識手段３に
より認識された文字のうち、その文字が「や」と
「ゃ」、「ゆ」と「ゅ」、「よ」と「ょ」、「つ」と
「っ」などの大文字と小文字を判定する手段である。大
文字小文字検出手段４０は、認識手段３により認識され
た結果が「や」、「ゆ」、「よ」、「つ」などの大文字
と小文字の字体が同じ文字（以下、同形重複文字と呼
ぶ）あるいは類似する対象文字を検出する。しきい値決
定手段４１は、大文字小文字検出手段４０により検出さ
れた対象文字が大文字であるか小文字であるかを判別す
るしきい値を決定する手段である。文字の大きさ比較決
定手段４２は、文字列切り出し手段２より切り出された
文字の座標値から、大文字小文字検出手段４０により検
出された対象文字の大きさを表す特徴量及び大文字であ
るか小文字であるかが既に判定された他の文字の大きさ
を表す特徴量を得、それらの特徴量の比を求め、この比
としきい値決定手段４２により決定された対象文字に対
するしきい値とを比較し、対象文字が大文字であるか小
文字であるかを決定する手段である。Among the characters recognized by the recognition means 3, the upper / lower case determination means 4 includes the characters "ya" and "ya", "yu" and "yu", "yo" and "yo", and "tsu". It is a means for determining uppercase and lowercase letters such as "and". The upper-case and lower-case detection means 40 has a character recognized in the recognition result by the recognition means 3 such as "ya", "yu", "yo", and "tsu" in the same upper and lower case letters (hereinafter referred to as an isomorphic duplicate character). Alternatively, a similar target character is detected. The threshold value determining means 41 is means for determining a threshold value for determining whether the target character detected by the uppercase / lowercase detecting means 40 is uppercase or lowercase. The character size comparison / determination means 42 uses the feature amount representing the size of the target character detected by the upper / lower case detection means 40 from the coordinate value of the character cut out by the character string cutout means 2 and whether it is uppercase or lowercase. A feature amount representing the size of another character, which is already determined, is obtained, the ratio of the feature amounts is obtained, and this ratio is compared with the threshold value for the target character determined by the threshold value determining means 42. However, it is a means for determining whether the target character is uppercase or lowercase.

【０００８】次に、図１の手書き文字列大文字小文字判
定装置の動作の説明をする。まず、タブレット１から入
力されたストロークコード列は、文字列切り出し手段２
へ送出される。図２は、入力文字の一例を示す図であ
り、図３は、文字列切り出し手段２により切りだされた
結果（イメージ）を示す図であり、図４は、文字列切り
出し手段２により切りだされた結果（特徴）を示す図で
あり、図５は、認識手段３により認識された結果を示す
図である。例えば、図２のような文字列が入力されたと
すると、文字列切り出し手段２では、図３に示すように
文字が切り出され、図４のように、各ストローク毎に文
字の開始ストローク番号、終了ストローク番号、文字と
して切り出された外接矩形を表すＸ座標の最小値、Ｘ座
標の最大値、Ｙ座標の最小値、Ｙ座標の最大値を出力す
る。認識手段３では、開始ストローク番号から終了スト
ローク番号までのストローク情報をもとに認識し、図５
のような認識結果を出力する。Next, the operation of the handwritten character string upper / lower case determination device of FIG. 1 will be described. First, the stroke code string input from the tablet 1 is a character string cutout unit 2
Sent to FIG. 2 is a diagram showing an example of input characters, FIG. 3 is a diagram showing a result (image) cut out by the character string cutout unit 2, and FIG. 4 is cut out by the character string cutout unit 2. It is a figure which shows the result (feature) which was recognized, and FIG. 5 is a figure which shows the result recognized by the recognition means 3. For example, if a character string as shown in FIG. 2 is input, the character string cutout unit 2 cuts out the character as shown in FIG. 3, and as shown in FIG. 4, the start stroke number and end of the character for each stroke. The stroke number, the minimum value of the X coordinate, the maximum value of the X coordinate, the minimum value of the Y coordinate, and the maximum value of the Y coordinate that represent the circumscribed rectangle cut out as a character are output. The recognition means 3 recognizes the stroke information from the start stroke number to the end stroke number, and
The recognition result like is output.

【０００９】図６は、図１中の大文字子文字判定手段４
の処理内容を示すフローチャートである。図１中の大文
字子文字判定手段４０では、ステップＳ１において、ま
ず認識された結果より１文字取り出し、ステップＳ２へ
進む。ステップＳ２において、取り出した文字が、同形
重複文字であるかどうかを検出し、同形重複文字であれ
ば、ステップＳ３に進む。図５の場合は、２番目の
「ゆ」、３番目の「う」、５番目の「つ」、９番目の
「よ」、１０番目の「つ」、１３番目の「ツ」が同形重
複文字として検出される。それ以外の文字に対しては何
も処理しないので、ステップＳ８へ進む。ステップＳ３
において、文字の大きさ比較決定手段４１で文字切り出
し結果より該当する特徴量を得る。ステップＳ４におい
て、しきい値決定手段４２で大文字小文字検出手段４０
により検出された同形重複文字のしきい値を得る。図７
は、各文字毎のしきい値を示す図であり、上の欄が大文
字の対象文字であり、下の欄が小文字の対象文字であ
る。図７中では、「ラ」と「ぅ」、「フ」と「ッ」等の
大文字と小文字の字体が類似する組み合わせが存在する
が、これは小さく「ぅ」と筆記したにもかかわらず
「ラ」と認識されたしまった場合、「う」と「ぅ」の組
み合わせしか認めないと、大文字小文字検出手段４０で
検出できないからである。FIG. 6 shows the uppercase child character determination means 4 in FIG.
5 is a flowchart showing the processing contents of FIG. In step S1, the uppercase child character determination means 40 in FIG. 1 first extracts one character from the recognized result, and proceeds to step S2. In step S2, it is detected whether the extracted character is an isomorphic duplicate character, and if it is an isomorphic duplicate character, the process proceeds to step S3. In the case of FIG. 5, the second "yu", the third "u", the fifth "tsu", the ninth "yo", the tenth "tsu", and the thirteenth "tsu" have the same shape duplication. It is detected as a character. No processing is performed on the other characters, so the process proceeds to step S8. Step S3
In, the character size comparison and determination means 41 obtains the corresponding feature amount from the character cutout result. In step S4, the threshold value determining means 42 detects the uppercase / lowercase detection means 40.
To obtain the threshold value of the isomorphic duplicate characters detected by. Figure 7
FIG. 4 is a diagram showing a threshold value for each character, in which an upper column is an uppercase target character and a lower column is a lowercase target character. In Fig. 7, there are similar combinations of uppercase and lowercase fonts, such as "la" and "u", and "hu" and "tsu", but this is small despite being written as "u". This is because, in the case where it has been recognized as "La", the uppercase / lowercase detection means 40 cannot detect it unless only the combination of "u" and "u" is recognized.

【００１０】図８は、いきい値決定手段より得た値ＴＨ
を示す図である。検出された文字（以下、検出文字と呼
ぶ）がｎ番目の文字の場合、文字切り出し手段２で出力
された結果より検出文字の座標値（Ｘmin _n，Ｘmax
_n，Ｙmax_n，Ｙmin _n）を得て、次式（１）で示す特
徴量Ｌ_nを計算する。Ｌ_n＝（Ｙmax _n−Ｙmin _n）・・・（１）検出文字の直前が小文字でない場合（図５中の２番目、
５番目、９番目、１３番目）は、検出文字の直前の特徴
量Ｌ_n-1を計算し、小文字の条件Ｌ_n／Ｌ_n-1＜ＴＨを
満たしているかを判別し、この条件を満たしていればス
テップＳ６において小文字と判定しステップＳ８へ進
み、この条件を満たしていなければステップＳ７におい
て大文字と判定しステップＳ８へ進む。FIG. 8 shows the value TH obtained by the threshold value determining means.
FIG. When the detected character (hereinafter referred to as the detected character) is the nth character, the coordinate value (Xmin _n , Xmax of the detected character is obtained from the result output by the character slicing means 2.
_n , Ymax _n , Ymin _n ) is obtained, and the feature amount L _n shown in the following equation (1) is calculated. If _{_{_{L n = (Ymax n -Ymin n}}} ) ··· (1) immediately before the detection character is not a lower case (second in FIG. 5,
The fifth, ninth, and thirteenth) calculate the feature amount L _n-1 immediately before the detected character, determine whether or not the condition L _n / L _n-1 <TH for small letters is satisfied, and satisfy this condition. If so, it is determined to be lowercase in step S6 and the process proceeds to step S8. If this condition is not satisfied, it is determined to be uppercase in step S7 and the process proceeds to step S8.

【００１１】また、検出文字の直前が小文字の場合は、
検出文字の前の小文字をすべて除いた直前の大文字の特
徴量Ｌ_n-mを計算し、小文字の条件Ｌ_n／Ｌ_n-m＜ＴＨ
を満たしているかを判別し、この条件を満たしていれば
ステップＳ６において小文字と判定しステップＳ８へ進
み、この条件を満たしていなければステップＳ７におい
て大文字と判定しステップＳ８へ進む。また、もし検出
文字が最初の文字の場合は、検出文字直後の大文字の特
徴量Ｌ_n+mを計算し、小文字の条件Ｌ_n／Ｌ_n+m＜ＴＨ
を満たしているかを判別し、この条件を満たしていれば
ステップＳ６において小文字と判定しステップＳ８へ進
み、この条件を満たしていなければステップＳ７におい
て大文字と判定しステップＳ８へ進む。ステップＳ８に
おいて、認識結果の文字全てについて上記ステップＳ１
〜ステップＳ７までの処理を行なったかを判定し、全て
の文字について行なっていればステップＳ１に戻り、上
記処理を繰り返し、全ての文字について行なっていれば
処理を終了する。図９は、大文字小文字判定結果を示す
図である。If the character just before the detected character is a small letter,
The upper-case feature amount L _nm immediately before the detected lower-case letters are removed, and the lower-case condition L _n / L _nm <TH
Is satisfied. If this condition is satisfied, it is determined to be a lowercase character in step S6 and the process proceeds to step S8. If this condition is not satisfied, it is determined to be an uppercase character in step S7 and the process proceeds to step S8. If the detected character is the first character, the characteristic amount L _{n + m} of the upper case immediately after the detected character is calculated, and the condition of the lower case L _n / L _{n + m} <TH
Is satisfied. If this condition is satisfied, it is determined to be a lowercase character in step S6 and the process proceeds to step S8. If this condition is not satisfied, it is determined to be an uppercase character in step S7 and the process proceeds to step S8. In step S8, the above step S1 is performed for all the characters of the recognition result.
~ It is determined whether the process up to step S7 has been performed. If all the characters have been performed, the process returns to step S1, the above process is repeated, and if all the characters have been performed, the process ends. FIG. 9 is a diagram showing the upper / lower case determination result.

【００１２】以上説明したように、本第１の実施例で
は、以下のような利点がある。文字枠のない白紙上に記
入された文字列のなかに「や」と「ゃ」、「ゆ」と
「ゅ」、「よ」と「ょ」、「つ」と「っ」等の同形重複
文字が存在していても、その文字とその文字の直前の狭
い範囲に絞って大きさを比較し、かつ文字毎にしきい値
をもつことで、その範囲以外の文字の大きさの影響を排
除し、注目している文字の大文字、小文字が区別できる
ため効果的に日本語を入力することを可能とする。例え
ば、検出されたのが２番目の文字「ゆ」の場合、検出文
字の直前の外接矩形座標値は、（Ｘmin ₁，Ｘmax ₁，Ｙmin ₁，Ｙmax ₁）＝（０，
１００，２０，１５０）、検出文字の外接矩形座標値は、（Ｘmin ₂，Ｘmax ₂，Ｙmin ₂，Ｙmax ₂）＝（１１
４，１７８，１０，１００）、となり、Ｌ₁＝（１００−０）×（１５０−２０）＝１３０００Ｌ₂＝（１７８−１１４）×（１００−１０）＝５６７
０Ｌ₂／Ｌ₁＝０．４４＜ＴＨ（０．６）を得る。従って、小文字「ゅ」と決定される。As described above, the first embodiment has the following advantages. In a character string written on a blank sheet without a character frame, "ya" and "ya", "yu" and "yu", "yo" and "yo", "tsu" and "tsu", etc. Even if a character exists, it is possible to eliminate the influence of the character size outside the range by comparing the size of the character with the narrow range immediately before the character and comparing the size, and by setting a threshold for each character. However, it is possible to effectively input Japanese because the letters of interest can be case sensitive. For example, when the second character "Yu" is detected, the circumscribed rectangle coordinate value immediately before the detected character is (Xmin ₁ , Xmax ₁ , Ymin ₁ , Ymax ₁ ) = (0,
100, 20, 150), and the circumscribed rectangle coordinate value of the detected character is (Xmin ₂ , Xmax ₂ , Ymin ₂ , Ymax ₂ ) = (11
4,178,10,100), and L ₁ = (100-0) × (150-20) = 13000 L ₂ = (178-114) × (100-10) = 567
We obtain 0 L ₂ / L ₁ = 0.44 <TH (0.6). Therefore, it is determined to be a small letter "yu".

【００１３】第２の実施例図１０は、本発明の第２の実施例を示す手書き文字列大
文字小文字判定装置の構成図であり、図１中の要素と同
様の要素には同一の符号を付してある。本第２の実施例
が、第１の実施例と異なる点は、大文字小文字判定手段
４の中の大文字小文字検出手段４０の入力側に、認識手
段３により認識された文字とこの文字の直前の文字の組
み合わせにより、その文字が小文字である必要条件を満
たすかどうかを判定する入力規則判定手段４３を設けた
ことである。図に示すように大文字小文字判定手段４
は、文字切り出し手段２、及び認識手段３の出力側に設
けられている。大文字小文字判定手段４には、入力規則
判定手段４３が設けられている。入力規則判定手段４３
の出力側には、大文字小文字検出手段４０が接続されて
いる。それ以外の構成は、図１と同様である。入力規則
判定手段４３は、検出された文字とその直前の文字が入
力規則（検出された文字が小文字である必要条件）に合
致しているかどうかを判定する手段である。 Second Embodiment FIG. 10 is a block diagram of a handwritten character string upper / lower case determining apparatus according to a second embodiment of the present invention, in which elements similar to those in FIG. It is attached. The second embodiment is different from the first embodiment in that the character recognized by the recognition means 3 and the character immediately before this character are provided on the input side of the case detection means 40 in the case determination means 4. That is, the input rule determination means 43 is provided for determining whether or not the character satisfies the required condition that the character is a lowercase character by combining the characters. As shown in FIG.
Is provided on the output side of the character cutting means 2 and the recognition means 3. The upper case / lower case determining means 4 is provided with an input rule determining means 43. Input rule determination means 43
An uppercase / lowercase detection means 40 is connected to the output side of the. The other configuration is the same as that of FIG. The input rule determination means 43 is a means for determining whether or not the detected character and the character immediately before it match the input rule (required condition that the detected character is lowercase).

【００１４】以下、図１０の手書き文字列大文字小文字
判定装置の動作の説明をする。まず、タブレット１から
入力されたストロークコード列は、文字列切り出し手２
へ送出される。図１１は、入力文字の一例を示す図であ
り、図１２は、文字列切り出し手段２により切りだされ
た結果（イメージ）を示す図であり、図１３、認識手段
３により認識された結果を示す図である。例えば、図１
１のような文字列が入力されたとすると、文字列切り出
し手段２により、図１２に示すように文字が切り出さ
れ、第１の実施例と同様に、文字の開始ストローク番
号、終了ストローク番号、文字として切り出された外接
矩形を表すＸ座標の最小値、Ｘ座標の最大値、Ｙ座標の
最小値、Ｙ座標の最大値が出力される。認識手段３で
は、開始ストローク番号から終了ストローク番号までの
ストローク情報をもとに認識し、図１３のような認識結
果を出力する。図１４は、図１中の大文字子文字判定手
段４の動作を示す概略フローチャートである。図１５
は、入力規則の一例を示す図である。図１６は、入力規
則判定結果と最終結果を示す図である。以下、これらの
図を参照しつつ大文字子文字判定手段４の動作を説明す
る。The operation of the handwritten character string upper / lower case determination device of FIG. 10 will be described below. First, the stroke code string input from the tablet 1 is used by the character string slicing device 2.
Sent to 11 is a diagram showing an example of an input character, FIG. 12 is a diagram showing a result (image) cut out by the character string cutout unit 2, and FIG. 13 is a diagram showing a result recognized by the recognition unit 3. FIG. For example, FIG.
If a character string such as 1 is input, the character string cutout unit 2 cuts out the character as shown in FIG. 12, and the start stroke number, end stroke number, and character of the character are cut as in the first embodiment. The minimum value of the X coordinate, the maximum value of the X coordinate, the minimum value of the Y coordinate, and the maximum value of the Y coordinate that represent the circumscribed rectangle cut out as are output. The recognition means 3 recognizes the stroke information from the start stroke number to the end stroke number and outputs the recognition result as shown in FIG. FIG. 14 is a schematic flowchart showing the operation of the upper case child character determination means 4 in FIG. FIG.
FIG. 6 is a diagram showing an example of an input rule. FIG. 16 is a diagram showing an input rule determination result and a final result. Hereinafter, the operation of the upper case child character determination means 4 will be described with reference to these figures.

【００１５】図１０中の大文字子文字判定手段４では、
ステップＳ１１において、まず認識された結果より１文
字取り出し、ステップＳ１２へ進む。ステップＳ１２に
おいて、取り出した文字が１番最初の文字であるかどう
かを判別し、１番最初の文字であれば、該文字は同形重
複文字でないのでステップＳ１１に戻り、１番最初の文
字でなければ、ステップＳ１３へ進む。ステップＳ１３
において、取り出した文字が、同形重複文字であるかど
うかを検出し、同形重複文字であれば、ステップＳ１４
に進む。図１３の場合は、３番目の「や」、７番目の
「や」が同形重複文字として検出される。それ以外の文
字に対しては何も処理しないので、ステップＳ１１に戻
る。ステップＳ１４において、ステップＳ１３で同形重
複文字が検出された時、検出した同形重複文字の直前の
文字の１文字を取り出す。図１３の場合は、３番目の
「や」については２番目の「字」、７番目の「や」につ
いては６番目の「ち」が取り出される。ステップＳ１５
において、これら前後の文字の組み合わせが図１５に示
す入力規則に存在する文字であるかどうかを判別し、存
在する文字であれば、該同形重複文字が小文字である可
能性があるのでステップＳ１６へ進み、存在しない文字
であれば、該同形重複文字が小文字ではないのでステッ
プＳ２０へ進む。図１３の場合、「字」と「や」は入力
規則を満たさないので、「や」については大文字と判定
され、また「ち」と「や」は入力規則を満たすので、
「や」について大文字か小文字かを判定される。In the uppercase child character determination means 4 in FIG.
In step S11, one character is first extracted from the recognized result, and the process proceeds to step S12. In step S12, it is determined whether or not the extracted character is the first character, and if it is the first character, the character is not an isomorphic duplicate character and the process returns to step S11. If so, the process proceeds to step S13. Step S13
In step S14, it is detected whether the extracted character is an isomorphic duplicate character.
Proceed to. In the case of FIG. 13, the third “ya” and the seventh “ya” are detected as isomorphic duplicate characters. No processing is performed on the other characters, and the process returns to step S11. In step S14, when the isomorphic duplicate character is detected in step S13, one character immediately before the detected isomorphic duplicate character is extracted. In the case of FIG. 13, the second “letter” is extracted for the third “ya” and the sixth “chi” is extracted for the seventh “ya”. Step S15
In step S16, it is determined whether the combination of these preceding and following characters is a character existing in the input rule shown in FIG. If it is a nonexistent character, the isomorphic duplicate character is not a lower case character, and therefore the process proceeds to step S20. In the case of FIG. 13, since “letter” and “ya” do not satisfy the input rule, it is determined that “ya” is uppercase, and “chi” and “ya” satisfy the input rule.
It is determined whether "ya" is uppercase or lowercase.

【００１６】ステップＳ１６において、文字の大きさ比
較決定手段４１で文字切り出し結果より該当する特徴量
を得る。ステップＳ１７において、しきい値決定手段４
２で大文字小文字検出手段４０により検出された同形重
複文字のしきい値を得る。ステップＳ１８、ステップＳ
１９、ステップＳ２０において、図６中のステップＳ
５、ステップＳ６、ステップＳ７と同様の処理を行う。
ステップＳ２１において、認識結果の文字全てについて
上記ステップＳ１１〜ステップＳ２０までの処理を行な
ったかを判定し、全ての文字について行なっていればス
テップＳ１１に戻り、上記処理を繰り返し、全ての文字
について行なっていなければ処理を終了する。図１６
に、入力規則判定結果と最終結果を示す。以上説明した
ように、本第２の実施例では、以下のような利点があ
る。第１の実施例と同様の利点があるうえに、第１の実
施例では大文字子文字検出手段４で大文字と小文字の可
能性のある文字に対して全て検出して、大文字であるか
小文字であるかを判定していたが、入力規則判定手段４
３を設けることにより一般の単語や文章を入力する場合
にほとんど有り得ない状態を除去することができ、さら
に効果的に日本語を入力することを可能とする。In step S16, the character size comparison / determination means 41 obtains the corresponding feature amount from the result of character extraction. In step S17, the threshold value determining means 4
In 2, the threshold value of the isomorphic duplicate characters detected by the uppercase / lowercase detecting means 40 is obtained. Step S18, Step S
19, in step S20, step S in FIG.
5, the same processing as step S6 and step S7 is performed.
In step S21, it is determined whether or not the processes of steps S11 to S20 have been performed for all the characters of the recognition result. If all the characters have been processed, the process returns to step S11 and the above process is repeated to perform all the characters. If not, the process ends. FIG.
Shows the input rule judgment result and the final result. As described above, the second embodiment has the following advantages. In addition to the same advantages as those of the first embodiment, in the first embodiment, the uppercase child character detection means 4 detects all characters that may be uppercase letters and lowercase letters, and detects uppercase letters or lowercase letters. It was determined whether there is any, but the input rule determination means 4
By providing 3, it is possible to eliminate a state that is almost impossible when a general word or sentence is input, and it is possible to input Japanese effectively.

【００１７】第３の実施例図１７は、本発明の第３の実施例を示す手書き文字列大
文字小文字判定装置の構成図であり、図１中の要素と同
様の要素には同一の符号を付してある。本第３の実施例
が、第２の実施例と異なる点は、入力規則判定手段４３
と大文字小文字検出手段４０との間にスイッチ手段４４
を設けたことである。スイッチ手段４４は、筆記者が入
力規則判定手段４３を使用するかどうかを決めるもので
ある。次に、図１７の手書き文字列大文字小文字判定装
置の動作の説明をする。筆記者によって入力規則判定手
段４３を使用するかどうかが決められ、入力規則判定手
段４３を使用する場合には、入力規則判定手段４３と大
文字小文字検出手段４０とがステッチ手段４４によって
接続し、入力規則判定手段４３を使用しない場合には、
入力規則判定手段４３と大文字小文字検出手段４０とを
ステッチ手段４４によって切り離される。以上説明した
ように、本第３の実施例では、入力規則判定手段４３を
使用するかどうかを決めることができるので、より弾力
性に富んだ大文字と小文字の判定をすることができる。 Third Embodiment FIG. 17 is a block diagram of a handwritten character string upper / lower case determination device according to a third embodiment of the present invention, in which elements similar to those in FIG. It is attached. The third embodiment differs from the second embodiment in that the input rule determining means 43 is
Switch means 44 between the upper case and lower case detection means 40
Is provided. The switch means 44 determines whether or not the writer uses the input rule determination means 43. Next, the operation of the handwritten character string upper / lower case determination device of FIG. 17 will be described. The writer decides whether or not to use the input rule determining means 43. When using the input rule determining means 43, the input rule determining means 43 and the uppercase / lowercase detecting means 40 are connected by the stitching means 44 and input. When the rule determination means 43 is not used,
The input rule determination means 43 and the upper / lower case detection means 40 are separated by the stitch means 44. As described above, in the third embodiment, it is possible to determine whether or not to use the input rule determining means 43, so that it is possible to determine the uppercase and lowercase letters that are more elastic.

【００１８】なお、本発明は、上記実施例に限定されず
種々の変形が可能である。その変形例としては、例えば
次のようなものがある。（ａ）本実施例では、外接矩形の高さの比率を用いて
大文字子文字を判定したが、他の手法、例えば、外接矩
形の対角線の距離の比率、横方向の距離の比率、面積の
比率、重心等を用いてもよい。（ｂ）本実施例では、各文字毎のしきい値を固定して
用いたが、筆記者が自由に設定できるような構成にして
さらにきめ細かくしきい値を決定することも可能であ
る。（ｃ）本実施例では、オンライン手書き装置に適用し
た例を説明したが、ＯＣＲ装置にも適用可能である。The present invention is not limited to the above embodiment, and various modifications can be made. The following are examples of such modifications. (A) In the present embodiment, uppercase child characters are determined using the height ratio of the circumscribing rectangle, but other methods such as the ratio of the diagonal distance of the circumscribing rectangle, the ratio of the lateral distance, and the area You may use a ratio, a gravity center, etc. (B) In this embodiment, the threshold value for each character is fixed and used, but it is also possible to set the threshold value more finely with a configuration that can be freely set by the writer. (C) In the present embodiment, the example applied to the online handwriting device has been described, but it is also applicable to the OCR device.

【００１９】[0019]

【発明の効果】以上詳細に説明したように、第１〜第４
の本発明によれば、大文字小文字判定手段を設けたの
で、文字枠のない白紙上に記入された文字列のなかに小
文字が存在していても、大文字と小文字を区別できるた
め効果的に日本語等を入力することができる。As described above in detail, the first to the fourth
According to the present invention, since the upper / lower case determination means is provided, even if there is a lowercase character in a character string written on a blank sheet without a character frame, the uppercase and lowercase letters can be distinguished, so that it is effective in Japan. You can enter words.

[Brief description of drawings]

【図１】本発明の第１の実施例を示す手書き文字列大文
字小文字認識装置の構成図である。FIG. 1 is a configuration diagram of a handwritten character string case recognizing device according to a first embodiment of the present invention.

【図２】入力文字列の一例を示す図である。FIG. 2 is a diagram showing an example of an input character string.

【図３】文字列切り出し手段により切り出された結果
（イメージ）を示す図である。FIG. 3 is a diagram showing a result (image) cut out by a character string cutout unit.

【図４】文字列切り出し手段により切り出された結果
（特徴量）を示す図である。FIG. 4 is a diagram showing a result (feature amount) cut out by a character string cutout unit.

【図５】認識手段により認識された結果を示す図であ
る。FIG. 5 is a diagram showing a result recognized by a recognition means.

【図６】図１中の大文字小文字判定手段の処理内容を示
すフローチャートである。FIG. 6 is a flowchart showing the processing contents of upper / lower case determination means in FIG.

【図７】各文字毎のしきい値を示す図である。FIG. 7 is a diagram showing a threshold value for each character.

【図８】しきい値決定手段より得た値を示す図である。FIG. 8 is a diagram showing values obtained by threshold value determining means.

【図９】大文字小文字判定結果を示す図である。FIG. 9 is a diagram showing a case determination result.

【図１０】本発明の第２の実施例を示す手書き文字列大
文字小文字認識装置の構成図である。FIG. 10 is a configuration diagram of a handwritten character string case recognizing device according to a second embodiment of the present invention.

【図１１】入力文字列の一例を示す図である。FIG. 11 is a diagram showing an example of an input character string.

【図１２】文字列切り出し手段により切り出された結果
（イメージ）を示す図である。FIG. 12 is a diagram showing a result (image) cut out by a character string cutout unit.

【図１３】認識手段により認識された結果を示す図であ
る。FIG. 13 is a diagram showing a result recognized by a recognition means.

【図１４】図１０中の大文字小文字判定手段の処理内容
を示すフローチャートである。FIG. 14 is a flowchart showing the processing contents of upper / lower case determination means in FIG.

【図１５】入力規則の一例を示す図である。FIG. 15 is a diagram showing an example of an input rule.

【図１６】入力規則判定結果と最終結果を示す図であ
る。FIG. 16 is a diagram showing an input rule determination result and a final result.

【図１７】本発明の第３の実施例を示す手書き文字列大
文字小文字認識装置の構成図である。FIG. 17 is a configuration diagram of a handwritten character string case recognizing device according to a third embodiment of the present invention.

[Explanation of symbols]

１タブレット２文字列切り出し手段３認識手段４大文字小文字判定手段４０大文字小文字検出手段４１文字の大きさ比較決定手段４２しきい値決定手段４３入力規則判定手段４４スイッチ手段 DESCRIPTION OF SYMBOLS 1 tablet 2 character string cutout means 3 recognition means 4 upper / lower case determination means 40 upper / lower case detection means 41 character size comparison determination means 42 threshold value determination means 43 input rule determination means 44 switch means

Claims

[Claims]

1. A character cutting means for cutting a character from a coordinate data string obtained from a coordinate input device, and a recognition means for recognizing a character from a coordinate data string cut out as one character by the character cutting means. In the handwritten character string case determination device, the possibility that the character recognized by the recognition means becomes a lowercase character is determined, and if there is a possibility that it becomes a lowercase character, the size of the character and whether the character is recognized by the recognition means is an uppercase character. By comparing the sizes of other characters, which have already been determined to be lowercase, based on the coordinate data string cut out by the character cutting means, it is determined whether the character is lowercase or uppercase. An upper / lower case determination device for handwritten character strings, which is provided with a lower case determination means.

2. The uppercase / lowercase determination means detects uppercase / lowercase detection means for detecting target characters having the same or similar uppercase and lowercase fonts among the characters recognized by the recognition means, and the uppercase / lowercase detection means. A threshold value determining means for determining a threshold value for determining whether the detected target character is uppercase or lowercase, and based on the coordinate data string cut out by the character cutout means, by the uppercase / lowercase detection means. The ratio of the feature amount representing the size of the detected target character and the feature amount representing the size of another character, which is recognized by the recognition means and has already been determined to be uppercase or lowercase, is obtained, and this ratio Comparing the threshold values of the characters determined by the threshold value determining means, the character size for determining whether the target character is uppercase or lowercase. A handwritten character string upper / lower case determination device, characterized in that it comprises a size comparison determination means.

3. The handwritten character string uppercase / lowercase determination means according to claim 2, wherein the threshold value determination means determines a threshold value for each target character.

4. An input rule determining means is provided for determining whether or not the character recognized by the recognizing means and a character immediately preceding the character meet a requirement that the character is lowercase. The handwritten character string upper / lower case determination device according to claim 2 or 3.