JPS6022793B2 - character identification device - Google Patents

character identification device

Info

Publication number
JPS6022793B2
JPS6022793B2 JP55007769A JP776980A JPS6022793B2 JP S6022793 B2 JPS6022793 B2 JP S6022793B2 JP 55007769 A JP55007769 A JP 55007769A JP 776980 A JP776980 A JP 776980A JP S6022793 B2 JPS6022793 B2 JP S6022793B2
Authority
JP
Japan
Prior art keywords
character
input
pattern
standard
identification device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP55007769A
Other languages
Japanese (ja)
Other versions
JPS56105587A (en
Inventor
彰一 平井
貞一 渡辺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Tokyo Shibaura Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tokyo Shibaura Electric Co Ltd filed Critical Tokyo Shibaura Electric Co Ltd
Priority to JP55007769A priority Critical patent/JPS6022793B2/en
Publication of JPS56105587A publication Critical patent/JPS56105587A/en
Publication of JPS6022793B2 publication Critical patent/JPS6022793B2/en
Expired legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/24Character recognition characterised by the processing or recognition method
    • G06V30/248Character recognition characterised by the processing or recognition method involving plural approaches, e.g. verification by template match; Resolving confusion among similar patterns, e.g. "O" versus "Q"

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Description

【発明の詳細な説明】 本発明は手書き文字をオンラインで識別する文字識別装
置に関する。
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character identification device for identifying handwritten characters online.

文字識別装置には、帳票等にあらかじめ記入された文字
を光電変換して識別を行なういわゆるOCRと、文字を
書きながらそのペンの動きの過程を入力して識別するオ
ンライン識別装贋とがある。
Character identification devices include so-called OCR, which performs identification by photoelectrically converting characters written in advance on a form, etc., and online identification, which performs identification by inputting the process of pen movement while writing characters.

前者は既に実用化されて広く使用されているが、後者は
データ発生源ですぐにコード化できる利点があるにもか
かわらず禾だ実用化に到っていない。この原因としては
、後者の場合、ストロークの有無や筆順等で識別のため
の候補となる文字を選ぶ分類効率がわろく、謙取文字数
の増加にしたがって辞書設計に要する人手や時間が増大
することにあると考えられる。本発明は上記欠点を除い
た文字識別装置を供V給することを目的とする。
The former has already been put to practical use and is widely used, but the latter has not yet been put into practical use, despite its advantage of being able to be encoded immediately at the data source. The reason for this is that in the latter case, the classification efficiency for selecting candidate characters for identification based on the presence or absence of strokes, stroke order, etc. is low, and as the number of characters increases, the manpower and time required for dictionary design increases. It is thought that there is. The object of the present invention is to provide a character identification device which eliminates the above-mentioned drawbacks.

第1図に本発明の実施例のブロック図を示す。FIG. 1 shows a block diagram of an embodiment of the present invention.

1はデータブレット、2は文字入力用のペン、3は文字
入力部、4は文字合成部、5は前処理部、6は照合部、
7は辞書、8は判定部である。
1 is a data tablet, 2 is a pen for character input, 3 is a character input section, 4 is a character synthesis section, 5 is a preprocessing section, 6 is a collation section,
7 is a dictionary, and 8 is a determination section.

これらのうち、データタブレット1、ベン2及び文字入
力部3は通常のオンライン識別装置の入力部の構成と同
じであり、ベン2データタブレット1上に入力したい文
字を描くことにより、文字入力部3には、描かれている
文字のデータとともに筆順、ストローク数、ベン2の運
動の方向及び速さ等の情報(以下ストローク情報と呼ぶ
)が得られる。ここでは入力データとしては、第2図に
示すように、一定周期毎にサンプリングされた時刻mで
のペン2の位置座標(X,Y)及び時間tn−,におけ
る座標との差分△X,△Yが得られている。このデータ
のサンプリングはペンがデータ夕ブレット1上に接触し
ている間に行なわれ、ベン2がデータタブレットーに接
触していないときにはデータの変化はないものとする。
上述したストローク情報はこれらのデータから得ること
もできる。文字合成部4は、文字入力部3で得られたデ
ータから文字パターンを合成する。
Among these, the data tablet 1, Ben 2, and character input section 3 have the same configuration as the input section of a normal online identification device, and by drawing the character you want to input on the Ben 2 data tablet 1, you can In addition to data on the drawn characters, information such as stroke order, number of strokes, direction and speed of Ben 2's movement (hereinafter referred to as stroke information) is obtained. Here, as shown in FIG. 2, the input data includes the position coordinates (X, Y) of the pen 2 at time m sampled at regular intervals and the difference ΔX, Δ between the coordinates at time tn-. Y is obtained. This data sampling is performed while the pen is in contact with the data tablet 1, and it is assumed that there is no change in the data when the pen 2 is not in contact with the data tablet.
The above-mentioned stroke information can also be obtained from these data. The character synthesis section 4 synthesizes a character pattern from the data obtained by the character input section 3.

すなわち、文字合成部4内には図示しないパターンメモ
リを有し、このパターンメモリは予じめて0にされてい
る。次に第2図に示される入力順にサンプルされた各位
贋座標(×,Y)を読み出し、パターンメモリ上の対応
する点(X.Y)を1にする。このようにしてすべての
入力座標に対応してパターンメモリ上に1を書き込むこ
とにより、パターンメモリ上には切れにサンプルされた
入力文字パターンが形成される。この文字パターンを上
述のストローク情報を用いてつなぎ処理することにより
各サンプル点が連結された入力文字パターンが得られる
。この入力文字パターンは前処理部5に送られ、周知の
大きさの正規化及び線分の太め処理が為された後照合部
6に供V給される。照合部6は周知のパターンマッチン
グ法によって文字入力パターンの類似度を算出する。
That is, the character synthesis section 4 has a pattern memory (not shown), and this pattern memory is set to 0 in advance. Next, each false coordinate (x, Y) sampled in the input order shown in FIG. 2 is read out, and the corresponding point (X, Y) on the pattern memory is set to 1. In this way, by writing 1 on the pattern memory corresponding to all the input coordinates, an input character pattern sampled in pieces is formed on the pattern memory. By subjecting this character pattern to connection processing using the above-mentioned stroke information, an input character pattern in which each sample point is connected is obtained. This input character pattern is sent to the preprocessing section 5, subjected to known size normalization and line thickening processing, and then supplied to the collation section 6. The matching unit 6 calculates the similarity of character input patterns using a well-known pattern matching method.

すなわち、辞書7には多数の標準パターンが収容されて
おり、その夫々と文字入力パタ−ンとの類似度が求めら
れ、類似度が予じめ定めた値以上となる標準パターンを
文字入力パターンの候補文字として決定し、判定部8に
知らせる。判定部8ではこれら候補文字のなかから上記
ストローク情報によつて文字入力の属する文字の決定を
行なう。例えば、第3図a,bに示すような「ソ」及び
「ン」とが照合部6で候補文字として決定されたとする
That is, the dictionary 7 stores a large number of standard patterns, and the degree of similarity between each of them and the character input pattern is calculated, and the standard pattern whose degree of similarity is equal to or greater than a predetermined value is selected as the character input pattern. is determined as a candidate character, and is notified to the determination unit 8. The determination unit 8 determines the character to which the input character belongs from among these candidate characters based on the stroke information. For example, assume that the matching unit 6 determines "so" and "n" as candidate characters as shown in FIGS. 3a and 3b.

パターンマッチング法では両者の区別はつけにくいが、
ストローク情報によれば第1ストローク10はほぼ同じ
であるが、第2ストローク11では逆方向となるので容
易に区別することができる。以上のように本発明によれ
ば、漢字から英数字までの手書き文字をオンラインで高
精度に識別することができる。
Although it is difficult to distinguish between the two using pattern matching methods,
According to the stroke information, the first stroke 10 is almost the same, but the second stroke 11 is in the opposite direction, so it can be easily distinguished. As described above, according to the present invention, handwritten characters ranging from Chinese characters to alphanumeric characters can be identified online with high accuracy.

この場合認識に要する時間は、識別するカテゴリー数に
も依存するが、OCRでは100宇/秒以上の性能を有
する装置が比較的簡単に実現されており、オンライン認
識の場合は、文字の再合成から識別まで1/1の砂程度
であれば実用上問題ない。したがって、データタブレッ
ト等からなる文字入力装置を複数台有するシステムを実
現することもできる。また、上記実施例ではパターンマ
ッチング法による照合部で候補文字を決定していたが、
逆に、ストローク情報から候補文字を決定し、判定部で
パターンマッチング法による判定を行なうように構成し
てもよい。
In this case, the time required for recognition depends on the number of categories to be identified, but in OCR, devices with performance of 100 U/sec or more can be realized relatively easily, and in the case of online recognition, character resynthesis is required. There is no problem in practical terms if the sand is about 1/1 of the difference between the process and the identification. Therefore, it is also possible to realize a system having a plurality of character input devices such as data tablets. In addition, in the above embodiment, candidate characters were determined by the matching section using the pattern matching method.
Conversely, a configuration may be adopted in which candidate characters are determined from stroke information and the determination section performs determination using a pattern matching method.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例を示すブロック図、第2図及
び第3図a,bは本発明の一実施例の動作を説明するた
めの図である。 1……データタブレット、2……ペン、3……文字入力
部、4・・・・・・文字合成部、5・・・・・・前処理
部、6・・・・・・照合部、7・・・・・・辞書、8・
・・・・・判定部。 第1図第2図 第3図
FIG. 1 is a block diagram showing an embodiment of the invention, and FIGS. 2 and 3 a and 3b are diagrams for explaining the operation of the embodiment of the invention. 1... Data tablet, 2... Pen, 3... Character input unit, 4... Character synthesis unit, 5... Preprocessing unit, 6... Collation unit, 7...Dictionary, 8.
... Judgment department. Figure 1 Figure 2 Figure 3

Claims (1)

【特許請求の範囲】[Claims] 1 入力される文字の筆点運動を観測しその筆点位置を
座標値の時系列として得ると共にそのストローク情報を
抽出する文字入力手段と、この文字入力手段より得られ
た座標値列を用いて幾何学的な文字パターンを形成する
文字処理手段と、多数の幾何学的な標準文字パターンを
予め記憶した記憶手段と、この記憶手段に記載された標
準文字パターンと前記文字処理手段により形成された文
字パターンとを照合して類似度を計算しこの類似度に基
いて標準文字パターンを入力文字候補として出力する照
合手段と、この照合手段より出力された標準文字パター
ンが複数有る場合前記文字入力手段により抽出されたス
トローク情報を用いて入力文字に該当する標準文字パタ
ーンを判定する判定手段とを具備したことを特徴とする
文字識別装置。
1. Character input means that observes the movement of the pen point of an input character, obtains the position of the pen point as a time series of coordinate values, and extracts its stroke information, and uses the coordinate value string obtained from this character input means. a character processing means for forming a geometric character pattern; a storage means for storing a large number of geometric standard character patterns in advance; a collation means that calculates the degree of similarity by comparing the character pattern with the character pattern and outputs the standard character pattern as an input character candidate based on this degree of similarity; and if there are multiple standard character patterns output from the collation means, the character input means; 1. A character identification device comprising: determination means for determining a standard character pattern corresponding to an input character using stroke information extracted by the method.
JP55007769A 1980-01-28 1980-01-28 character identification device Expired JPS6022793B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP55007769A JPS6022793B2 (en) 1980-01-28 1980-01-28 character identification device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP55007769A JPS6022793B2 (en) 1980-01-28 1980-01-28 character identification device

Publications (2)

Publication Number Publication Date
JPS56105587A JPS56105587A (en) 1981-08-22
JPS6022793B2 true JPS6022793B2 (en) 1985-06-04

Family

ID=11674881

Family Applications (1)

Application Number Title Priority Date Filing Date
JP55007769A Expired JPS6022793B2 (en) 1980-01-28 1980-01-28 character identification device

Country Status (1)

Country Link
JP (1) JPS6022793B2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6068484A (en) * 1983-09-22 1985-04-19 Hitachi Ltd On-line handwritten character recognizing system
JPS6458072A (en) * 1987-08-29 1989-03-06 Nec Corp Character recognizing device

Also Published As

Publication number Publication date
JPS56105587A (en) 1981-08-22

Similar Documents

Publication Publication Date Title
JPH0562391B2 (en)
EP0889433A2 (en) Method and apparatus for on-line handwritten input character recognition and recording medium for executing the method
CN101520783A (en) Method and device for searching keywords based on image content
CA2091997C (en) Character recognition methods including locating and extracting predetermined and apparatus data from a document
JPS6022793B2 (en) character identification device
JPH0520794B2 (en)
Abuzaraida et al. The detection of the suitable reduction value of Douglas-Peucker algorithm in online handwritten recognition systems
JPH09319828A (en) On-line character recognition device
JP3066530B2 (en) Online handwriting recognition device
JP2580976B2 (en) Character extraction device
Bhokse et al. Devnagari handwriting recognition system using dynamic time warping algorithm
Yang et al. Online recognition of handwritten characters using differential angles and structural descriptors
JPS5835674A (en) Extracting method for feature of online hand-written character
JP2671984B2 (en) Information recognition device
KR100414051B1 (en) Method for recognizing stroke of character
JP3209197B2 (en) Character recognition device and recording medium storing character recognition program
JPS6239461B2 (en)
Eldin et al. Arabic character recognition: a survey
KR900005141B1 (en) Handwritter character recognizing device
JPH09114927A (en) Method and device for rough classifying input characters in on-line character recognition
KR940001739B1 (en) On-line hand-written korean character recognition method by recognizing stroke
KR950012279A (en) Probabilistic Stroke Recognition Method by Region Segmentation
JPH0365585B2 (en)
JPS5840787B2 (en) Character recognition processing method
JPH01201789A (en) Character reader