JP2639314B2 - Character recognition method - Google Patents

Character recognition method

Info

Publication number
JP2639314B2
JP2639314B2 JP5207123A JP20712393A JP2639314B2 JP 2639314 B2 JP2639314 B2 JP 2639314B2 JP 5207123 A JP5207123 A JP 5207123A JP 20712393 A JP20712393 A JP 20712393A JP 2639314 B2 JP2639314 B2 JP 2639314B2
Authority
JP
Japan
Prior art keywords
character
weighting
type
order
area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP5207123A
Other languages
Japanese (ja)
Other versions
JPH0744551A (en
Inventor
泰彦 浅川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Electric Co Ltd filed Critical Nippon Electric Co Ltd
Priority to JP5207123A priority Critical patent/JP2639314B2/en
Publication of JPH0744551A publication Critical patent/JPH0744551A/en
Application granted granted Critical
Publication of JP2639314B2 publication Critical patent/JP2639314B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)
  • Character Discrimination (AREA)

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【産業上の利用分野】本発明は文字認識方式に関し、特
に文字種の重み付けを行う重み付け規則により異なる文
字種で同一または類似の形態を有する文字(以下、類似
文字という)を判別することを可能にする文字認識方式
に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition system, and more particularly, to a method for determining different character types having the same or similar form (hereinafter referred to as similar characters) by weighting rules for weighting the character types. Related to character recognition method.

【0002】[0002]

【従来の技術】従来の文字認識方式では、類似文字、例
えば口(漢字)とロ(カタカナ)や、1(数字)とl
(英字)のような類似文字を判別するのは困難であり、
類似文字を判別するために様々な方法がとられている。
2. Description of the Related Art In a conventional character recognition system, similar characters, for example, mouth (kanji) and b (katakana) or 1 (number) and l
It is difficult to distinguish similar characters such as (English letters)
Various methods are used to determine similar characters.

【0003】例えば、文字種判定テーブルを用意してお
き、入力された文字が類似文字として登録されているか
どうかをチェックし、類似文字として登録されている場
合には文字種判定テーブルを用いて文字を特定するもの
がある(特開昭61−39175号公報等参照)。
For example, a character type determination table is prepared, and it is checked whether or not the input character is registered as a similar character. If the input character is registered as a similar character, the character is identified using the character type determination table. (See JP-A-61-39175).

【0004】[0004]

【発明が解決しようとする課題】上述した従来の文字認
識方式では、類似文字を判別するために文字種判定テー
ブルを用意するようになっているので、類似文字データ
をあらかじめ登録しておかなかればならず、登録したデ
ータ以外の類似文字は判別できないという問題点があっ
た。
In the above-described conventional character recognition method, a character type determination table is prepared for determining similar characters. Therefore, if similar character data must be registered in advance. In addition, there is a problem that similar characters other than the registered data cannot be determined.

【0005】本発明の目的は、上述の点に鑑み、文字種
判定テーブルを用意して類似文字データを登録しておく
ことなしに、類似文字を判別できるようにした文字認識
方式を提供することにある。
SUMMARY OF THE INVENTION In view of the above, it is an object of the present invention to provide a character recognition system which can distinguish similar characters without preparing a character type determination table and registering similar character data. is there.

【0006】[0006]

【課題を解決するための手段】本発明の文字認識方式
は、入力された文字を文字認識し漢字仮名混じり文に変
換する文字認識方式において、文字を入力する入力手段
と、この入力手段により入力された文字を文字認識辞書
を参照して変換文字候補コードに変換する文字認識部
と、この文字認識部により変換された変換文字候補コー
ドに重み付け規則に基づいてカタカナ種,ひらがな種,
漢字種,数字種および英字種の文字種重み付け属性を付
与し、三文字ごとの組合せ重み付け属性を付与し、文字
種重み付け属性と組合せ重み付け属性との和である重み
付け属性合計により類似文字を判別する類似文字判別部
とを有する。
SUMMARY OF THE INVENTION A character recognition system of the present invention is a character recognition system for recognizing an input character and converting it into a sentence mixed with kanji and kana, and an input means for inputting a character and an input by the input means. A character recognition unit that converts the converted character into a conversion character candidate code with reference to a character recognition dictionary, and converts the converted character candidate code converted by the character recognition unit into katakana, hiragana,
Similar characters that assign character type weighting attributes of Kanji type, number type and English character type, assign a combination weighting attribute for each three characters, and determine similar characters based on the sum of the weighting attribute that is the sum of the character type weighting attribute and the combination weighting attribute A determination unit.

【0007】[0007]

【実施例】次に、本発明について図面を参照して詳細に
説明する。
Next, the present invention will be described in detail with reference to the drawings.

【0008】図1は、本発明の一実施例に係る文字認識
方式の構成を示すブロック図である。本実施例の文字認
識方式は、タブレット1と、文字認識辞書2と、文字認
識部3と、重み付け展開テーブル4と、重み付け規則フ
ァイル5と、類似文字判別部6とから構成されている。
FIG. 1 is a block diagram showing a configuration of a character recognition system according to one embodiment of the present invention. The character recognition method according to the present embodiment includes a tablet 1, a character recognition dictionary 2, a character recognition unit 3, a weight development table 4, a weight rule file 5, and a similar character determination unit 6.

【0009】タブレット1は、文字を手書き入力する入
力手段である。
The tablet 1 is input means for inputting characters by handwriting.

【0010】文字認識辞書2は、個々の文字の特徴情報
が格納されている辞書である。
The character recognition dictionary 2 is a dictionary in which characteristic information of individual characters is stored.

【0011】文字認識部3は、タブレット1からの文字
の形態情報と文字認識辞書2の文字の特徴情報とを比較
することにより、文字の認識を行い単数または複数の変
換文字候補コードに変換し重み付け展開テーブル4に格
納する手段である。
The character recognizing section 3 recognizes the character by comparing the character form information from the tablet 1 with the character feature information of the character recognition dictionary 2 and converts the character into one or more converted character candidate codes. This is a means for storing in the weighting development table 4.

【0012】重み付け規則ファイル5は、図2に示すよ
うに、変換文字候補コードから文字種の重み付けを行う
重み付け規則を格納するファイルである。図2におい
て、領域aは規則番号を登録する領域、領域bは重み付
け規則の内容を記載する領域である。
As shown in FIG. 2, the weighting rule file 5 is a file for storing weighting rules for weighting character types from converted character candidate codes. In FIG. 2, an area a is an area for registering a rule number, and an area b is an area for describing the contents of a weighting rule.

【0013】重み付け展開テーブル4は、図3に示すよ
うに、重み付け属性を展開するメモリ上のテーブルであ
る。図3において、領域イはタブレット1で手書き入力
した文字の入力文字順番を登録する領域、領域ロは類似
文字判別部6で三文字ごとの組合せを処理する際に処理
文字順番を登録する領域、領域ハは文字認識部3で変換
された変換文字候補コードの変換候補順番を登録する領
域、領域ニは変換文字候補コードを登録する領域、領域
ヘは類似文字判別部5で処理文字順番の変換文字候補コ
ードの重み付け属性を三文字の組合せで処理した場合に
生じる組合せ重み付け属性を登録する領域、領域トは各
変換文字候補コードの文字種重み付け属性と組合せ重み
付け属性との合計である重み付け属性合計を登録する領
域、領域チは各変換文字候補コードの順番を重み付け属
性合計の値の大きい順番に1,2,3…と重み付け順番
として登録する領域である。
As shown in FIG. 3, the weighting development table 4 is a table on a memory for developing weighting attributes. In FIG. 3, an area A is an area for registering the input character order of the characters input by handwriting on the tablet 1, an area B is an area for registering the processing character order when the similar character determination unit 6 processes a combination for every three characters, The area C is an area for registering the conversion candidate order of the converted character candidate code converted by the character recognizing unit 3, the area D is an area for registering the converted character candidate code, and the area F is a similar character discriminating unit 5 for converting the processing character order. The area for registering the combination weighting attribute generated when the weighting attribute of the character candidate code is processed by a combination of three characters, the area is the sum of the weighting attribute that is the sum of the character type weighting attribute and the combination weighting attribute of each converted character candidate code. The area to be registered is an area in which the order of each converted character candidate code is registered as a weighting order of 1, 2, 3,... That.

【0014】類似文字判別部6は、文字認識部3により
変換された変換文字候補コードに重み付け規則ファイル
5の重み付け規則に基づいてカタカナ種,ひらがな種,
漢字種,数字種および英字種の文字種重み付け属性を付
与し、三文字ごとの組合せ重み付け属性を付与し、文字
種重み付け属性と組合せ重み付け属性との和である重み
付け属性合計により類似文字を判別する手段である。
The similar character discriminating unit 6 converts the converted character candidate codes converted by the character recognizing unit 3 into katakana and hiragana based on the weighting rules of the weighting rule file 5.
A means for assigning character type weighting attributes of kanji type, number type and alphabetic type, assigning a combination weighting attribute for each of the three characters, and determining similar characters based on the sum of the weighting attribute which is the sum of the character type weighting attribute and the combination weighting attribute. is there.

【0015】図6を参照すると、類似文字判別部6の処
理は、文字種重み付け属性付与ステップ61と、入力文
字順番nの3未満判定ステップ62と、処理文字順番付
与ステップ63と、文字種重み付け属性比較ステップS
64と、同一文字種重み付け属性判断ステップS65
と、組合せ重み付け属性付与ステップS66と、重み付
属性合計付与ステップ67と、重み付け順番付与ステッ
プS68と、処理文字順番消去ステップ69とからな
る。
Referring to FIG. 6, the processing performed by the similar character discriminating unit 6 includes a character type weighting attribute assigning step 61, an input character order n less than 3 determination step 62, a processing character order assigning step 63, and a character type weighting attribute comparison. Step S
64 and the same character type weighting attribute determining step S65
, A combination weighting attribute assigning step S66, a weight appending total assigning step 67, a weighting order assigning step S68, and a processing character order erasing step 69.

【0016】次に、このように構成された本実施例の文
字認識方式の動作について説明する。
Next, the operation of the character recognition system according to the present embodiment configured as described above will be described.

【0017】タブレット1から手書き文字が入力される
たびに、文字認識部3は、入力文字順番n(nは1以上
の整数)を1から昇順に付与し、入力された文字を文字
認識辞書2を参照し特徴マッチング法等の公知の文字認
識アルゴリズムを用いて単数または複数の変換文字候補
コードに変換し、各変換文字候補コードに変換候補順番
を付して重み付け展開テーブル4に格納する。
Each time a handwritten character is input from the tablet 1, the character recognition unit 3 assigns an input character sequence n (n is an integer of 1 or more) in ascending order from 1, and assigns the input character to the character recognition dictionary 2 Is converted to one or more converted character candidate codes using a known character recognition algorithm such as a feature matching method, and the converted character candidate codes are assigned the conversion candidate order and stored in the weighting expansion table 4.

【0018】類似文字判別部6は、文字認識部3により
文字が変換文字候補コードに変換されて重み付け展開テ
ーブル4に格納されるたびに入力文字順番nを引数とし
て起動される。
Each time the character recognizing unit 3 converts a character into a converted character candidate code and stores it in the weighting expansion table 4, the similar character discriminating unit 6 is activated with the input character order n as an argument.

【0019】まず、類似文字判別部6は、重み付け規則
ファイル5の重み付け規則に基づいて重み付け展開テー
ブル4の入力文字順番nの各変換文字候補コードに文字
種重み付け属性を付与する(ステップ61)。
First, the similar character discriminating unit 6 assigns a character type weighting attribute to each converted character candidate code in the input character order n of the weighting development table 4 based on the weighting rule of the weighting rule file 5 (step 61).

【0020】次に、類似文字判別部6は、入力文字順番
nが3未満かどうかを判断し(ステップ62)、3未満
であればそのまま処理を終了する。
Next, the similar character discriminating section 6 judges whether or not the input character order n is less than 3 (step 62).

【0021】入力文字順番nが3以上であれば、類似文
字判別部6は、重み付け展開テーブル4の入力文字順番
(n−2),(n−1)およびnに対して処理文字順番
1,2および3を付与する(ステップ63)。
If the input character sequence n is 3 or more, the similar character discriminating unit 6 processes the input character sequence (n−2), (n−1) and n in the weighted development table 4 into the processing character sequence 1, 2 and 3 are given (step 63).

【0022】次に、類似文字判別部6は、処理文字順番
3の変換文字候補コードの文字種重み付け属性と、処理
文字順番1および2の変換文字候補コードの文字種重み
付け属性との比較を行い(ステップ64)、処理文字順
番1,2および3のそれぞれの変換文字候補コードの文
字種重み付け属性が同じ組合せがあるかどうかを判断す
る(ステップ65)。同じ文字種重み付け属性の組合せ
がなければ、類似文字判別部6は、ステップ67に制御
を移す。
Next, the similar character discriminating unit 6 compares the character type weighting attribute of the converted character candidate code of the processing character order 3 with the character type weighting attribute of the converted character candidate code of the processing character order 1 and 2 (step S1). 64) It is determined whether or not there is a combination having the same character type weighting attribute of the converted character candidate codes in the processing character order 1, 2, and 3 (step 65). If there is no combination of the same character type weighting attributes, the similar character determination unit 6 shifts the control to step 67.

【0023】処理文字順番1,2および3のそれぞれの
変換文字候補コードの文字種重み付け属性が同じ組合せ
があれば、類似文字判別部6は、同じ文字種重み付け属
性の組合せを持つ変換文字候補コードの組合せ重み付け
属性に各々1を付与する(ステップ66)。
If there is the same combination of the character type weighting attributes of the converted character candidate codes in the processing character order 1, 2, and 3, the similar character discriminating unit 6 determines the combination of the converted character candidate codes having the same combination of the character type weighting attributes. One is assigned to each weighting attribute (step 66).

【0024】次に、類似文字判別部6は、各々の変換文
字候補コードの文字種重み付け属性と組合せ重み付け属
性との和を重み付け属性合計に付与する(ステップ6
7)。
Next, the similar character discriminating unit 6 gives the sum of the character type weighting attribute and the combination weighting attribute of each converted character candidate code to the total weighting attribute (step 6).
7).

【0025】続いて、類似文字判別部6は、各々の変換
文字候補コードに対して重み付け属性合計の値の大きい
順に重み付け順番1,2,3,…を付与する(ステップ
68)。
Subsequently, the similar character discriminating unit 6 assigns weighting orders 1, 2, 3,... To the converted character candidate codes in descending order of the value of the total weighting attribute (step 68).

【0026】最後に、類似文字判別部6は、処理文字順
番1,2,3を消去し(ステップ69)、処理を終了す
る。
Finally, the similar character discriminating unit 6 deletes the processing character order 1, 2, 3 (step 69), and ends the processing.

【0027】次に、本実施例の文字認識方式の動作を、
図4および図5を参照しながら、「ア」,「メ」,
「リ」,「カ」および「で」と入力する例を用いて具体
的に説明する。
Next, the operation of the character recognition system of this embodiment will be described.
Referring to FIG. 4 and FIG.
A specific description will be given using an example of inputting “ri”, “f”, and “de”.

【0028】まず、文字「ア」が入力されると、文字認
識部3により、「ア」の変換文字候補コードが生成され
て重み付け展開テーブル4の入力文字順番に1、変換候
補順番に1、変換文字候補コードに「ア」のコードがそ
れぞれ登録され、類似文字識別部6により、重み付け展
開テーブル4の文字種重み付け属性のカタカナ種に3が
登録される(図4の入力文字順番1の列参照)。
First, when the character "A" is input, the character recognizing unit 3 generates a conversion character candidate code of "A" and assigns 1 to the input character sequence in the weighting expansion table 4 and 1 to the conversion candidate sequence. The code of “A” is registered as the conversion character candidate code, and 3 is registered as the katakana type of the character type weighting attribute of the character type weighting attribute of the weighting expansion table 4 by the similar character identification unit 6 (see the column of input character order 1 in FIG. 4). ).

【0029】次に、文字「メ」が入力されると、文字認
識部3により、「メ(カタカナ)」,「x(英字)」お
よび「×(記号)」の変換文字候補コードが生成されて
重み付け展開テーブル4の入力文字順番に2、変換候補
順番に1,2および3、変換文字候補コードに「メ(カ
タカナ)」,「x(英字)」および「×(記号)」のコ
ードがそれぞれ登録され、類似文字判別部6により、重
み付け展開テーブル4の文字種重み付け属性のカタカナ
種に3、英字種に3、その他すべてに0がそれぞれ登録
される(図4の入力文字順番2の列参照)。
Next, when the character "me" is input, the character recognizing section 3 generates converted character candidate codes of "me (katakana)", "x (alphabet)" and "x (symbol)". In the weighting expansion table 4, the input character order is 2, the conversion candidate order is 1, 2 and 3, and the conversion character candidate codes are "me (katakana)", "x (alphabet)" and "x (symbol)". Each is registered, and the similar character discriminating unit 6 registers 3 for the katakana type, 3 for the alphabet type, and 0 for all others in the character type weighting attribute of the weight development table 4 (see the column of input character order 2 in FIG. 4). ).

【0030】続いて、文字「リ」が入力されると、文字
認識部3により、「り(ひらがな)」および「リ(カタ
カナ)」の変換文字候補コードが生成されて重み付け展
開テーブル4の入力文字順番に3、変換候補順番に1お
よび2、変換文字候補コードに「り(ひらがな)」およ
び「リ(カタカナ)」のコードがそれぞれ登録され、類
似文字判別部6により、重み付け展開テーブル4の文字
種重み付け属性のひらがな種に3、カタカナ種に3がそ
れぞれ登録される(図4の入力文字順番3の列参照)。
Subsequently, when the character “R” is input, the character recognition unit 3 generates converted character candidate codes of “Ri (Hiragana)” and “Ri (Katakana)” and inputs the converted character candidate code to the weighting expansion table 4. The character order is 3, the conversion candidate order is 1 and 2, and the conversion character candidate code is “Ri (Hiragana)” and “Ri (Katakana)” are registered. As the character type weighting attribute, 3 is registered as the hiragana type, and 3 is registered as the katakana type (see the column of input character order 3 in FIG. 4).

【0031】次に、類似文字判別部6により、処理文字
順番3の変換文字候補コード「リ(カタカナ)」および
「り(ひらがな)」の文字種重み付け属性と、処理文字
順番1の変換文字候補コード「ア(カタカナ)」の文字
種重み付け属性ならびに処理文字順番2の変換文字候補
コード「メ(カタカナ)」,「x(英字)」および「×
(記号)」の文字種重み付け属性との比較が行われ、同
じ文字種重み付け属性を持つ三文字の組合せがカタカナ
種なので、変換文字候補コード「ア(カタカナ)」,
「メ(カタカナ)」および「リ(カタカナ)」の組合せ
重み付け属性にそれぞれ1が付与される(図5の組合せ
重み付け属性の第1行参照)。
Next, the similar character discriminating unit 6 converts the character type weighting attributes of the conversion character candidate codes “ri (katakana)” and “ri (hiragana)” in the processing character order 3 and the conversion character candidate code in the processing character order 1 Character type weighting attribute of "A (Katakana)" and converted character candidate codes "Me (Katakana)", "x (English character)" and "X"
(Symbol) ”is compared with the character type weighting attribute, and the combination of three characters having the same character type weighting attribute is the katakana type, so the conversion character candidate codes“ a (katakana) ”,
“1” is assigned to each of the combination weighting attributes “me (Katakana)” and “ri (Katakana)” (see the first row of the combination weighting attribute in FIG. 5).

【0032】次に、文字「カ」が入力されると、文字認
識部3および類似文字判別部6により同様な処理が行わ
れ、変換文字候補コード「メ(カタカナ)」,「リ(カ
タカナ)」および「カ(カタカナ)」の組合せ重み付け
属性にそれぞれ1が付与される(図5の組合せ重み付け
属性の第2行参照)。これにより、入力文字順番2の変
換文字候補コード「メ(カタカナ)」,「x(英字)」
および「×(記号)」の重み付け属性合計の値がこの順
番に大きくなり、それぞれ重み付け順番1,2および3
が付与される(図5の入力文字順番2の列参照)。
Next, when the character "f" is input, similar processing is performed by the character recognizing unit 3 and the similar character discriminating unit 6, and the converted character candidate codes "me (katakana)" and "ri (katakana)""1" is assigned to each of the combination weighting attributes "" and "ka" (see the second row of the combination weighting attribute in FIG. 5). Thereby, the conversion character candidate codes “me (Katakana)” and “x (English character)” in the input character sequence 2
And the value of the sum of the weighting attributes of “× (symbol)” increases in this order, and the weighting order is 1, 2, and 3, respectively.
(See the column of input character sequence 2 in FIG. 5).

【0033】続いて、文字「で」が入力されると、文字
認識部3および類似文字判別部6により同様な処理が行
われ、入力文字順番3の変換文字候補コード「り(ひら
がな)」および「リ(カタカナ)」の重み付け属性合計
の値がこの順番に小さくなり、それぞれ重み付け順番2
および1が付与される(図5の入力文字順番3の列参
照)。
Subsequently, when the character "de" is input, similar processing is performed by the character recognizing unit 3 and the similar character discriminating unit 6, and the conversion character candidate codes "ri (hiragana)" and "3" in the input character order 3 are input. The value of the total weighting attribute of “ri (Katakana)” decreases in this order, and the weighting order 2
And 1 are given (see the column of input character order 3 in FIG. 5).

【0034】また、入力文字順番4の変換文字候補コー
ドの「カ(カタカナ)」および「力(漢字)」の重み付
け属性合計の値がこの順番に大きくなり、それぞれ重み
付け順番1および2が付与される(図5の入力文字順番
4の列参照)。
Further, the total value of the weighting attributes of “ka (katakana)” and “power (kanji)” of the conversion character candidate code in the input character order 4 becomes larger in this order, and weighting orders 1 and 2 are given, respectively. (See the column of input character sequence 4 in FIG. 5).

【0035】この結果、複数の変換文字候補コードが発
生した文字「メ」,「リ」および「カ」についてカタカ
ナ種の重み付け順番がそれぞれ1になり、「ア」,
「メ」,「リ」,「カ」および「で」と類似文字が正し
く判別されることになる。
As a result, the weighting order of the katakana type for the characters "me", "ri" and "ka" in which a plurality of converted character candidate codes have been generated becomes 1, and "a", "a",
Similar characters such as "me", "ri", "ka" and "de" are correctly determined.

【0036】なお、上記実施例では、入力手段1として
タブレットを用いた手書き入力の場合を例にとって説明
したが、図6に示す類似文字判別部6の類似文字判別処
理が文字のストローク情報を使用していないので、入力
手段としてOCR(Optical Characte
r Reader)を用いる場合にも本発明が同様に適
用できることはあきらかであろう。
In the above-described embodiment, the case of handwriting input using a tablet as the input means 1 has been described as an example. However, the similar character determination process of the similar character determination unit 6 shown in FIG. As an input means, OCR (Optical Character
It will be apparent that the present invention can be similarly applied to the case where (r Reader) is used.

【0037】[0037]

【発明の効果】以上説明したように本発明は、変換文字
候補コードに重み付け規則に基づいてカタカナ種,ひら
がな種,漢字種,数字種および英字種の文字種重み付け
属性を付与し、三文字ごとの組合せ重み付け属性を付与
して、重み付け属性合計により類似文字を判別するよう
にしたことにより、類似文字を判別するために文字種判
定テーブルに類似文字データを登録しておくことなしに
類似文字を判別することができるという効果がある。
As described above, the present invention assigns character type weighting attributes of katakana type, hiragana type, kanji type, numeric type and alphabet type to the converted character candidate code based on the weighting rule, and provides By assigning the combination weighting attribute and determining the similar character by the total weighting attribute, the similar character is determined without registering the similar character data in the character type determination table in order to determine the similar character. There is an effect that can be.

【図面の簡単な説明】[Brief description of the drawings]

【図1】本発明の一実施例に係る文字認識方式の構成を
示すブロック図である。
FIG. 1 is a block diagram showing a configuration of a character recognition system according to one embodiment of the present invention.

【図2】図1中の重み付け規則ファイルの内容を例示す
る図である。
FIG. 2 is a diagram illustrating the contents of a weighting rule file in FIG. 1;

【図3】図1中の重み付け展開テーブルの内容を示す図
である。
FIG. 3 is a diagram showing the contents of a weighting development table in FIG. 1;

【図4】図3の重み付け展開テーブルの状態遷移図であ
る。
FIG. 4 is a state transition diagram of a weight development table of FIG. 3;

【図5】図4の重み付け展開テーブルの状態遷移図であ
る。
FIG. 5 is a state transition diagram of the weighting development table of FIG. 4;

【図6】図1中の類似文字判別部の処理を示す流れ図で
ある。
FIG. 6 is a flowchart showing processing of a similar character discriminating unit in FIG. 1;

【符号の説明】[Explanation of symbols]

1 タブレット 2 文字認識辞書 3 文字認識部 4 重み付け展開テーブル 5 重み付け規則ファイル 6 類似文字判別部 1 Tablet 2 Character Recognition Dictionary 3 Character Recognition Unit 4 Weight Expansion Table 5 Weighting Rule File 6 Similar Character Discrimination Unit

Claims (4)

(57)【特許請求の範囲】(57) [Claims] 【請求項1】 入力された文字を文字認識し漢字仮名混
じり文に変換する文字認識方式において、 文字を入力する入力手段と、 この入力手段により入力された文字を文字認識辞書を参
照して変換文字候補コードに変換する文字認識部と、 この文字認識部により変換された変換文字候補コードに
重み付け規則に基づいてカタカナ種,ひらがな種,漢字
種,数字種および英字種の文字種重み付け属性を付与
し、三文字ごとの組合せ重み付け属性を付与し、文字種
重み付け属性と組合せ重み付け属性との和である重み付
け属性合計により類似文字を判別する類似文字判別部と
を有することを特徴とする文字認識方式。
1. A character recognition system for recognizing an input character and converting the character into a sentence mixed with kanji and kana, comprising: input means for inputting a character; and converting the character input by the input means with reference to a character recognition dictionary. A character recognition unit for converting to a character candidate code, and character type weighting attributes of katakana type, hiragana type, kanji type, numeric type and alphabet type are assigned to the converted character candidate code converted by the character recognition unit based on a weighting rule. And a similar character discriminating unit for assigning a combination weighting attribute for each of the three characters and discriminating similar characters based on a weighting attribute sum that is a sum of the character type weighting attribute and the combination weighting attribute.
【請求項2】 前記重み付け展開テーブルが、入力文字
順番を登録する領域と、処理文字順番を登録する領域
と、変換候補順番を登録する領域と、変換文字候補コー
ドを登録する領域と、組合せ重み付け属性を登録する領
域と、重み付け属性合計を登録する領域と、重み付け順
番を登録する領域とからなる請求項1記載の文字認識方
式。
2. An area for registering an input character order, an area for registering a processing character order, an area for registering a conversion candidate order, an area for registering a conversion character candidate code, and a combination weighting table. 2. The character recognition method according to claim 1, comprising an area for registering an attribute, an area for registering a total of weighted attributes, and an area for registering a weighting order.
【請求項3】 前記入力手段が、タブレットでなる請求
項1記載の文字認識方式。
3. The character recognition system according to claim 1, wherein said input means is a tablet.
【請求項4】 前記入力手段が、OCRでなる請求項1
記載の文字認識方式。
4. The input means comprises an OCR.
The character recognition method described.
JP5207123A 1993-07-29 1993-07-29 Character recognition method Expired - Fee Related JP2639314B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5207123A JP2639314B2 (en) 1993-07-29 1993-07-29 Character recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5207123A JP2639314B2 (en) 1993-07-29 1993-07-29 Character recognition method

Publications (2)

Publication Number Publication Date
JPH0744551A JPH0744551A (en) 1995-02-14
JP2639314B2 true JP2639314B2 (en) 1997-08-13

Family

ID=16534579

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5207123A Expired - Fee Related JP2639314B2 (en) 1993-07-29 1993-07-29 Character recognition method

Country Status (1)

Country Link
JP (1) JP2639314B2 (en)

Also Published As

Publication number Publication date
JPH0744551A (en) 1995-02-14

Similar Documents

Publication Publication Date Title
US5982929A (en) Pattern recognition method and system
US4991094A (en) Method for language-independent text tokenization using a character categorization
JP2009193603A (en) Character recognition system for identification of scanned and real time handwritten characters
US6035062A (en) Character recognition method and apparatus
JP2639314B2 (en) Character recognition method
US20020126903A1 (en) Word recognizing apparatus for dynamically generating feature amount of word and method thereof
JP2903779B2 (en) Character string recognition method and apparatus
JPH10302025A (en) Handwritten character recognizing device and its program recording medium
JP3763262B2 (en) Handwritten character recognition device
JPS6224382A (en) Method for recognizing handwritten character
JP2827066B2 (en) Post-processing method for character recognition of documents with mixed digit strings
JPH11120294A (en) Character recognition device and medium
KR940007933B1 (en) User independent type on-line korean character recognition method
JP3022790B2 (en) Handwritten character input device
JP3595081B2 (en) Character recognition method
JP2875678B2 (en) Post-processing method of character recognition result
JPH0436885A (en) Optical character reader
JP2851865B2 (en) Character recognition device
JP2851102B2 (en) Character extraction method
JPH0462692A (en) Method for recognizing character
JPH05298489A (en) System for recognizing character
JPH01166187A (en) Method for recognizing character
JPH06187500A (en) On-line character recognizing device
JPH0337231B2 (en)
JPS6055481A (en) Pattern recognizing device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080425

Year of fee payment: 11

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090425

Year of fee payment: 12

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20100425

Year of fee payment: 13

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110425

Year of fee payment: 14

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120425

Year of fee payment: 15

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120425

Year of fee payment: 15

S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

R371 Transfer withdrawn

Free format text: JAPANESE INTERMEDIATE CODE: R371

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120425

Year of fee payment: 15

S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120425

Year of fee payment: 15

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120425

Year of fee payment: 15

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120425

Year of fee payment: 15

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130425

Year of fee payment: 16

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130425

Year of fee payment: 16

LAPS Cancellation because of no payment of annual fees