JPS5949675A - Character recognition device - Google Patents

Character recognition device

Info

Publication number
JPS5949675A
Authority
JP
Japan
Prior art keywords
character
recognition
erroneous recognition
erroneous
input character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP57161315A
Other languages
Japanese (ja)
Inventor
Yasuhiko Yoshinaga
吉永 泰彦
Tetsuji Morishita
森下 哲次
Koya Fujita
藤田 孝弥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd
Priority to JP57161315A
Publication of JPS5949675A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/21: Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation

Abstract

PURPOSE: To generate a similar character discrimination dictionary from erroneous recognition data, and thereby improve the precision and speed of character recognition, by storing the correspondence between character categories obtained by erroneous recognition and the input character patterns that were the objects of the erroneous recognition.

CONSTITUTION: An input character is converted into binary data by a tablet 1, and a recognition part 4 refers to the first and second dictionaries 2 and 3 to find recognition candidate characters according to similarity. These recognition candidate characters are stored temporarily in a storage part 5 in order of similarity, and the contents of the storage part 5 are displayed on a display part 6 according to the operation of an operation part 7. A storage part 8 is provided corresponding to the recognition part 4 and the storage part 5, and it stores the correspondence between erroneously recognized candidate characters and the input character categories that were the objects of the erroneous recognition. Storing the erroneously obtained character categories and the corresponding input character patterns in this way improves the precision and speed of character recognition.

Description

DETAILED DESCRIPTION OF THE INVENTION

(A) Technical Field of the Invention

The present invention relates to a character recognition device, and in particular to a character recognition device for handwritten characters.

(B) Technical Background

In handwritten character readers, methods such as imposing restrictions on writing style have long been used to improve recognition accuracy and recognition speed. As the user base has expanded, however, such restrictions are no longer observed and misrecognitions have increased sharply. In response, character recognition devices are now used that have, in addition to a dictionary storing the features of standard character patterns, a dictionary for identifying similar characters, provided specifically to distinguish characters that are especially prone to misrecognition.

(C) Prior Art and Problems

The dictionary for identifying similar characters is provided to distinguish similar characters that are easily mistaken for one another, for example the hiragana 「ね」 (ne), 「れ」 (re), and 「わ」 (wa). Conventionally, this dictionary was prepared in advance for each group of similar characters, based on the relative features that distinguish each standard character pattern, or the character patterns of a specific writer, from the others.

In general, however, handwritten character patterns strongly reflect the writing habits peculiar to each writer. The conventional dictionary for identifying similar characters therefore had the drawback that it could not adequately handle character patterns written by unspecified writers.

(D) Object of the Invention

The object of the present invention is to eliminate the drawback of the conventional example described above and to obtain a character recognition device that can handle the recognition of character patterns written by unspecified writers.

(E) Structure of the Invention

The character recognition device according to the present invention is a character recognition device that displays the recognition candidate character categories obtained by recognizing an input character pattern and in which misrecognitions are corrected by visual inspection. It is provided with a misrecognition data recording unit, which records each character category obtained through misrecognition in association with the input character pattern that was the object of the misrecognition, and a similar character identification dictionary generated from the misrecognition data recorded in the misrecognition data recording unit.

(F) Embodiment of the Invention

The gist of the present invention is explained concretely below by way of an embodiment.

FIG. 1 shows an embodiment in which the present invention is applied to an online handwritten character recognition device. The reference numerals denote the following:

    • 1: a tablet on which handwritten characters are input and converted into binary data.
    • 2: a first dictionary that stores the features of a standard character pattern for each character category.
    • 3: a second dictionary used as the similar character identification dictionary.
    • 4: a recognition unit that collates the features of the input character pattern obtained by the tablet 1 with at least one of the first dictionary 2 and the second dictionary 3, and obtains the character categories up to fifth place in order of similarity as recognition candidate characters.
    • 5: a first storage unit that temporarily stores the recognition candidate characters obtained by the recognition unit 4, ranked by similarity, in association with the input character pattern.
    • 6: a display unit that displays the recognition candidate characters temporarily stored in the first storage unit 5 one character at a time, in order of similarity, for each input character pattern, in accordance with operation of the operation unit 7 described below, and that also presents the display illustrated in FIG. 2.
    • 7: an operation unit with which the operator visually judges whether the recognition candidate character displayed on the display unit 6 is correct. If the candidate is correct, it is fixed as the recognition result. If it is wrong, the operation unit causes the display unit 6 to show the candidate with the next highest similarity, and causes the second storage unit 8 to record the misrecognized candidate character together with the input character category that was the object of the misrecognition.
    • 8: a second storage unit used as the misrecognition data recording unit, which records each misrecognized candidate character in association with the input character category that was the object of the misrecognition.

Input character patterns are recognized with the configuration described above. The generation of the second dictionary 3, which is used as the similar character identification dictionary, is described next.
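
The interplay of the recognition unit 4 and the operation unit 7 can be illustrated with a short Python sketch. It is a minimal illustration under stated assumptions, not the patented implementation: the feature vectors, the `similarity` measure (negative squared Euclidean distance), and the names `top_candidates` and `confirm` are all hypothetical, since the source does not specify how features are represented or how similarity is computed.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Feature = Sequence[float]


def similarity(a: Feature, b: Feature) -> float:
    """Hypothetical similarity measure: negative squared Euclidean distance."""
    return -sum((x - y) ** 2 for x, y in zip(a, b))


def top_candidates(pattern: Feature,
                   dictionary: Dict[str, Feature],
                   k: int = 5) -> List[str]:
    """Return up to k character categories, most similar first (recognition unit 4)."""
    ranked = sorted(dictionary,
                    key=lambda category: similarity(pattern, dictionary[category]),
                    reverse=True)
    return ranked[:k]


def confirm(candidates: List[str],
            is_correct: Callable[[str], bool]) -> Tuple[Optional[str], List[str]]:
    """Step through the candidates as the operator would (operation unit 7).

    Returns the accepted candidate (or None) and the list of rejected,
    i.e. misrecognized, candidates shown before it.
    """
    rejected: List[str] = []
    for candidate in candidates:
        if is_correct(candidate):
            return candidate, rejected
        rejected.append(candidate)
    return None, rejected


# Illustrative two-dimensional features; the values are made up.
first_dictionary = {"わ": (1.0, 0.0), "ね": (1.0, 1.0), "れ": (0.0, 1.0)}
candidates = top_candidates((0.9, 0.4), first_dictionary)
accepted, rejected = confirm(candidates, lambda c: c == "ね")
print(candidates)          # ['わ', 'ね', 'れ']
print(accepted, rejected)  # ね ['わ']
```

In this sketch the rejected candidates are exactly the items that would be passed on to the second storage unit 8 together with the input character category.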

For example, suppose that the input character pattern 「ね」 is recognized, that the character category 「わ」 is obtained as the first-ranked recognition candidate character, and that this is a misrecognition. The character category 「わ」 is then associated with the input character category 「ね」 and recorded in the second storage unit 8 as (わ-ね). Next, suppose that the input character pattern 「れ」 is recognized, that the character category 「わ」 is again obtained as the first-ranked recognition candidate character, and that this too is a misrecognition. The misrecognition data (わ-ね) already recorded in the second storage unit 8 is then updated to (わ-ね, れ).
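
The update from (わ-ね) to (わ-ね, れ) can be sketched as a small data structure. The class name `MisrecognitionLog` and its methods are hypothetical names chosen for illustration; the source describes only the stored correspondence, not how the second storage unit 8 is organized internally.

```python
from collections import defaultdict
from typing import Dict, List


class MisrecognitionLog:
    """Sketch of the second storage unit 8.

    Maps each misrecognized candidate category to the input categories
    that were mistaken for it, in the order they were first observed.
    """

    def __init__(self) -> None:
        self._records: Dict[str, List[str]] = defaultdict(list)

    def record(self, wrong_candidate: str, input_category: str) -> None:
        """Add input_category to the entry for wrong_candidate (no duplicates)."""
        inputs = self._records[wrong_candidate]
        if input_category not in inputs:
            inputs.append(input_category)

    def entries(self) -> Dict[str, List[str]]:
        """Return a copy of the accumulated misrecognition data."""
        return {candidate: list(inputs) for candidate, inputs in self._records.items()}


log = MisrecognitionLog()
log.record("わ", "ね")   # stored as (わ-ね)
log.record("わ", "れ")   # updated to (わ-ね, れ)
print(log.entries())     # {'わ': ['ね', 'れ']}
```

Suppressing duplicates is a design choice of this sketch; the source does not say whether repeated misrecognitions of the same pair are counted.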

Once misrecognition data has been obtained for several character categories in this way, the device shifts to the generation mode for the second dictionary 3.

FIG. 2 shows what the display unit 6 displays while the second dictionary 3 is being generated:

    • 61: a misrecognition data display section, in which the misrecognition data recorded in the second storage unit 8 is displayed for each character category.
    • 62: a character number display section, which displays the character numbers assigned to the feature menu described below.
    • 63: a feature menu display section, which displays the feature menu, that is, the features of handwritten strokes to focus on when distinguishing easily misrecognized characters, such as "loop at the end of a stroke" and "intersection of two strokes".
    • 64 and 65: together these form the registration table display section.
    • 64: a feature display section. For the first-ranked recognition candidate character shown in the misrecognition data display section 61 (「わ」 in the illustrated example), the operator selects from the feature menu display section 63 a feature that allows it to be contrasted with the input character categories that were the direct objects of the misrecognition (「ね」 and 「れ」 in the illustrated example). The selected feature is shown here by the character number displayed in the character number display section 62.
    • 65: a stroke display section, which displays, at the operator's instruction, the stroke number of the input character pattern in which the feature shown in the feature display section 64 appears.

The feature menu displayed in the feature menu display section 63 applies in common to all kinds of character categories, and is therefore prepared in advance and used in fixed form. The registration table, that is, the contents of the feature display section 64 and the stroke display section 65, is by contrast displayed as the operator selects and specifies entries based on what is shown in the misrecognition data display section 61 and the feature menu display section 63. The second dictionary 3 is generated at the same time.
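
One way to picture the registration table is as a mapping from a misrecognition pair to the feature numbers and stroke numbers the operator specified. The following Python sketch assumes that layout; the names `FEATURE_MENU`, `Registration`, and `register`, the character numbers 1 and 2, and the stroke numbers in the usage lines are all hypothetical. Only the two feature descriptions come from the source.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Fixed feature menu (section 63), keyed by character number (section 62).
# The numbering is illustrative.
FEATURE_MENU: Dict[int, str] = {
    1: "loop at the end of a stroke",
    2: "intersection of two strokes",
}


@dataclass(frozen=True)
class Registration:
    """One row of the registration table (sections 64 and 65)."""
    feature_number: int   # character number chosen from the feature menu
    stroke_number: int    # stroke of the input pattern where the feature appears


# Key: (misrecognized candidate, input category that was mistaken for it).
SecondDictionary = Dict[Tuple[str, str], List[Registration]]


def register(second_dictionary: SecondDictionary,
             candidate: str,
             input_category: str,
             feature_number: int,
             stroke_number: int) -> None:
    """Record the operator's selection and extend the second dictionary."""
    if feature_number not in FEATURE_MENU:
        raise ValueError(f"unknown feature number: {feature_number}")
    key = (candidate, input_category)
    second_dictionary.setdefault(key, []).append(
        Registration(feature_number, stroke_number))


second_dictionary: SecondDictionary = {}
# Illustrative operator choices; the stroke numbers are made up.
register(second_dictionary, "わ", "ね", feature_number=1, stroke_number=2)
register(second_dictionary, "わ", "れ", feature_number=2, stroke_number=1)
for (candidate, input_category), rows in second_dictionary.items():
    for row in rows:
        print(candidate, input_category, FEATURE_MENU[row.feature_number], row.stroke_number)
```

Because the feature menu is fixed and shared by all character categories, only the small integers need to be stored per entry, which matches the way the feature display section 64 shows a character number rather than the feature text.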

In this way, the second dictionary 3 is generated for each character category from the misrecognition data accumulated each time a misrecognition occurred during recognition of the input character patterns so far. When generation is complete, the device returns to the normal character recognition mode.

Each time this cycle is repeated, the contents of the second dictionary 3 adapt further to the writer's individual writing habits, and the accuracy and speed of recognition improve accordingly.
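
The alternation between recognition mode and generation mode can be sketched as a function that consumes the accumulated misrecognition data and extends the second dictionary. This is a rough illustration only: the function name `generation_mode`, the `choose` callback standing in for the operator's selections, and the clearing of the log once generation finishes are assumptions that the source does not state.

```python
from typing import Callable, Dict, List, Tuple

Pair = Tuple[str, str]        # (misrecognized candidate, input category)
Choice = Tuple[int, int]      # (feature number, stroke number)


def generation_mode(log: Dict[str, List[str]],
                    choose: Callable[[str, str], Choice],
                    second_dictionary: Dict[Pair, Choice]) -> int:
    """Turn accumulated misrecognition data into second-dictionary entries.

    `choose` stands in for the operator's selection on the FIG. 2 screen.
    Returns the number of entries generated. Clearing the log afterwards
    is an assumption of this sketch.
    """
    generated = 0
    for candidate, inputs in log.items():
        for input_category in inputs:
            second_dictionary[(candidate, input_category)] = choose(candidate, input_category)
            generated += 1
    log.clear()
    return generated


# One cycle: data gathered in recognition mode, then one pass of generation mode.
accumulated = {"わ": ["ね", "れ"]}
second_dictionary: Dict[Pair, Choice] = {}
count = generation_mode(accumulated, lambda c, i: (1, 2), second_dictionary)
print(count)                      # 2
print(sorted(second_dictionary))  # [('わ', 'ね'), ('わ', 'れ')]
```

Repeating this function after each batch of recognition work would let the dictionary grow with each cycle, which is the adaptation effect described above.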

(G) Effect of the Invention

As explained above, according to the present invention the similar character identification dictionary is generated and enriched each time the device is used, so that character patterns written by unspecified writers can be handled.

[Brief Description of the Drawings]

FIG. 1 is a system block diagram of an embodiment of the present invention, in which 3 is the second dictionary used as the similar character identification dictionary and 8 is the second storage unit used as the misrecognition data recording unit. FIG. 2 is an explanatory diagram of the generation of the second dictionary 3.

Claims (1)

[Claims]

A character recognition device that displays recognition candidate character categories obtained by recognizing an input character pattern and in which misrecognitions are corrected by visual inspection, characterized by comprising: a misrecognition data recording unit that records each character category obtained through misrecognition in association with the input character pattern that was the object of the misrecognition; and a similar character identification dictionary generated from the misrecognition data recorded in the misrecognition data recording unit.
JP57161315A 1982-09-16 1982-09-16 Character recognition device Pending JPS5949675A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57161315A JPS5949675A (en) 1982-09-16 1982-09-16 Character recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57161315A JPS5949675A (en) 1982-09-16 1982-09-16 Character recognition device

Publications (1)

Publication Number Publication Date
JPS5949675A true JPS5949675A (en) 1984-03-22

Family

ID=15732757

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57161315A Pending JPS5949675A (en) 1982-09-16 1982-09-16 Character recognition device

Country Status (1)

Country Link
JP (1) JPS5949675A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6174081A (en) * 1984-09-18 1986-04-16 Fujitsu Ltd Character recognizing device
EP0389541A1 (en) * 1987-11-24 1990-10-03 DAVIS, Elliot Pattern recognition error reduction system
EP2747029A4 (en) * 2011-09-15 2016-09-14 Omron Tateisi Electronics Co Image processing device, image processing method, control program, and recording medium

Similar Documents

Publication Publication Date Title
US5212769A (en) Method and apparatus for encoding and decoding chinese characters
US6513005B1 (en) Method for correcting error characters in results of speech recognition and speech recognition system using the same
JPH03201166A (en) Display system at the time of correcting japanese document reading translation system
JPH04326488A (en) Hand written recognition system and method by character template
US6567548B2 (en) Handwriting recognition system and method using compound characters for improved recognition accuracy
JPS5949675A (en) Character recognition device
JP2738383B2 (en) Address reading device
CN109960707A (en) A kind of colleges and universities' enrollment data acquisition method and system based on artificial intelligence
EP0587163A1 (en) Method and apparatus for evaluating an individual using character recognition processing of input handwritten responses
JPS61272882A (en) Information recognizing device
JPH08272813A (en) Filing device
JPS59123082A (en) Correcting system of character recognizing device
JP2851865B2 (en) Character recognition device
JPH04138583A (en) Character recognizing device
JPS5985570A (en) Information input system
JP2731394B2 (en) Character input device
JPH0434655A (en) Drawing reader
JPS6072089A (en) Recognizing device
JPH0363882A (en) Image processing device
JP2907947B2 (en) Optical character reading system
JP2874815B2 (en) Japanese character reader
JPH03161866A (en) Device for recognizing contents
JPS61163472A (en) Character recognizing device
JP3273778B2 (en) Kana-kanji conversion device and kana-kanji conversion method
JPH11143993A (en) Recognized character correction device and its method