JP2639314B2

JP2639314B2 - Character recognition method

Info

Publication number: JP2639314B2
Application number: JP5207123A
Authority: JP
Inventors: 泰彦浅川
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1993-07-29
Filing date: 1993-07-29
Publication date: 1997-08-13
Anticipated expiration: 2012-08-13
Also published as: JPH0744551A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は文字認識方式に関し、特
に文字種の重み付けを行う重み付け規則により異なる文
字種で同一または類似の形態を有する文字（以下、類似
文字という）を判別することを可能にする文字認識方式
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition system, and more particularly, to a method for determining different character types having the same or similar form (hereinafter referred to as similar characters) by weighting rules for weighting the character types. Related to character recognition method.

【０００２】[0002]

【従来の技術】従来の文字認識方式では、類似文字、例
えば口（漢字）とロ（カタカナ）や、１（数字）とｌ
（英字）のような類似文字を判別するのは困難であり、
類似文字を判別するために様々な方法がとられている。2. Description of the Related Art In a conventional character recognition system, similar characters, for example, mouth (kanji) and b (katakana) or 1 (number) and l
It is difficult to distinguish similar characters such as (English letters)
Various methods are used to determine similar characters.

【０００３】例えば、文字種判定テーブルを用意してお
き、入力された文字が類似文字として登録されているか
どうかをチェックし、類似文字として登録されている場
合には文字種判定テーブルを用いて文字を特定するもの
がある（特開昭６１−３９１７５号公報等参照）。For example, a character type determination table is prepared, and it is checked whether or not the input character is registered as a similar character. If the input character is registered as a similar character, the character is identified using the character type determination table. (See JP-A-61-39175).

【０００４】[0004]

【発明が解決しようとする課題】上述した従来の文字認
識方式では、類似文字を判別するために文字種判定テー
ブルを用意するようになっているので、類似文字データ
をあらかじめ登録しておかなかればならず、登録したデ
ータ以外の類似文字は判別できないという問題点があっ
た。In the above-described conventional character recognition method, a character type determination table is prepared for determining similar characters. Therefore, if similar character data must be registered in advance. In addition, there is a problem that similar characters other than the registered data cannot be determined.

【０００５】本発明の目的は、上述の点に鑑み、文字種
判定テーブルを用意して類似文字データを登録しておく
ことなしに、類似文字を判別できるようにした文字認識
方式を提供することにある。SUMMARY OF THE INVENTION In view of the above, it is an object of the present invention to provide a character recognition system which can distinguish similar characters without preparing a character type determination table and registering similar character data. is there.

【０００６】[0006]

【課題を解決するための手段】本発明の文字認識方式
は、入力された文字を文字認識し漢字仮名混じり文に変
換する文字認識方式において、文字を入力する入力手段
と、この入力手段により入力された文字を文字認識辞書
を参照して変換文字候補コードに変換する文字認識部
と、この文字認識部により変換された変換文字候補コー
ドに重み付け規則に基づいてカタカナ種，ひらがな種，
漢字種，数字種および英字種の文字種重み付け属性を付
与し、三文字ごとの組合せ重み付け属性を付与し、文字
種重み付け属性と組合せ重み付け属性との和である重み
付け属性合計により類似文字を判別する類似文字判別部
とを有する。SUMMARY OF THE INVENTION A character recognition system of the present invention is a character recognition system for recognizing an input character and converting it into a sentence mixed with kanji and kana, and an input means for inputting a character and an input by the input means. A character recognition unit that converts the converted character into a conversion character candidate code with reference to a character recognition dictionary, and converts the converted character candidate code converted by the character recognition unit into katakana, hiragana,
Similar characters that assign character type weighting attributes of Kanji type, number type and English character type, assign a combination weighting attribute for each three characters, and determine similar characters based on the sum of the weighting attribute that is the sum of the character type weighting attribute and the combination weighting attribute A determination unit.

【０００７】[0007]

【実施例】次に、本発明について図面を参照して詳細に
説明する。Next, the present invention will be described in detail with reference to the drawings.

【０００８】図１は、本発明の一実施例に係る文字認識
方式の構成を示すブロック図である。本実施例の文字認
識方式は、タブレット１と、文字認識辞書２と、文字認
識部３と、重み付け展開テーブル４と、重み付け規則フ
ァイル５と、類似文字判別部６とから構成されている。FIG. 1 is a block diagram showing a configuration of a character recognition system according to one embodiment of the present invention. The character recognition method according to the present embodiment includes a tablet 1, a character recognition dictionary 2, a character recognition unit 3, a weight development table 4, a weight rule file 5, and a similar character determination unit 6.

【０００９】タブレット１は、文字を手書き入力する入
力手段である。The tablet 1 is input means for inputting characters by handwriting.

【００１０】文字認識辞書２は、個々の文字の特徴情報
が格納されている辞書である。The character recognition dictionary 2 is a dictionary in which characteristic information of individual characters is stored.

【００１１】文字認識部３は、タブレット１からの文字
の形態情報と文字認識辞書２の文字の特徴情報とを比較
することにより、文字の認識を行い単数または複数の変
換文字候補コードに変換し重み付け展開テーブル４に格
納する手段である。The character recognizing section 3 recognizes the character by comparing the character form information from the tablet 1 with the character feature information of the character recognition dictionary 2 and converts the character into one or more converted character candidate codes. This is a means for storing in the weighting development table 4.

【００１２】重み付け規則ファイル５は、図２に示すよ
うに、変換文字候補コードから文字種の重み付けを行う
重み付け規則を格納するファイルである。図２におい
て、領域ａは規則番号を登録する領域、領域ｂは重み付
け規則の内容を記載する領域である。As shown in FIG. 2, the weighting rule file 5 is a file for storing weighting rules for weighting character types from converted character candidate codes. In FIG. 2, an area a is an area for registering a rule number, and an area b is an area for describing the contents of a weighting rule.

【００１３】重み付け展開テーブル４は、図３に示すよ
うに、重み付け属性を展開するメモリ上のテーブルであ
る。図３において、領域イはタブレット１で手書き入力
した文字の入力文字順番を登録する領域、領域ロは類似
文字判別部６で三文字ごとの組合せを処理する際に処理
文字順番を登録する領域、領域ハは文字認識部３で変換
された変換文字候補コードの変換候補順番を登録する領
域、領域ニは変換文字候補コードを登録する領域、領域
ヘは類似文字判別部５で処理文字順番の変換文字候補コ
ードの重み付け属性を三文字の組合せで処理した場合に
生じる組合せ重み付け属性を登録する領域、領域トは各
変換文字候補コードの文字種重み付け属性と組合せ重み
付け属性との合計である重み付け属性合計を登録する領
域、領域チは各変換文字候補コードの順番を重み付け属
性合計の値の大きい順番に１，２，３…と重み付け順番
として登録する領域である。As shown in FIG. 3, the weighting development table 4 is a table on a memory for developing weighting attributes. In FIG. 3, an area A is an area for registering the input character order of the characters input by handwriting on the tablet 1, an area B is an area for registering the processing character order when the similar character determination unit 6 processes a combination for every three characters, The area C is an area for registering the conversion candidate order of the converted character candidate code converted by the character recognizing unit 3, the area D is an area for registering the converted character candidate code, and the area F is a similar character discriminating unit 5 for converting the processing character order. The area for registering the combination weighting attribute generated when the weighting attribute of the character candidate code is processed by a combination of three characters, the area is the sum of the weighting attribute that is the sum of the character type weighting attribute and the combination weighting attribute of each converted character candidate code. The area to be registered is an area in which the order of each converted character candidate code is registered as a weighting order of 1, 2, 3,... That.

【００１４】類似文字判別部６は、文字認識部３により
変換された変換文字候補コードに重み付け規則ファイル
５の重み付け規則に基づいてカタカナ種，ひらがな種，
漢字種，数字種および英字種の文字種重み付け属性を付
与し、三文字ごとの組合せ重み付け属性を付与し、文字
種重み付け属性と組合せ重み付け属性との和である重み
付け属性合計により類似文字を判別する手段である。The similar character discriminating unit 6 converts the converted character candidate codes converted by the character recognizing unit 3 into katakana and hiragana based on the weighting rules of the weighting rule file 5.
A means for assigning character type weighting attributes of kanji type, number type and alphabetic type, assigning a combination weighting attribute for each of the three characters, and determining similar characters based on the sum of the weighting attribute which is the sum of the character type weighting attribute and the combination weighting attribute. is there.

【００１５】図６を参照すると、類似文字判別部６の処
理は、文字種重み付け属性付与ステップ６１と、入力文
字順番ｎの３未満判定ステップ６２と、処理文字順番付
与ステップ６３と、文字種重み付け属性比較ステップＳ
６４と、同一文字種重み付け属性判断ステップＳ６５
と、組合せ重み付け属性付与ステップＳ６６と、重み付
属性合計付与ステップ６７と、重み付け順番付与ステッ
プＳ６８と、処理文字順番消去ステップ６９とからな
る。Referring to FIG. 6, the processing performed by the similar character discriminating unit 6 includes a character type weighting attribute assigning step 61, an input character order n less than 3 determination step 62, a processing character order assigning step 63, and a character type weighting attribute comparison. Step S
64 and the same character type weighting attribute determining step S65
, A combination weighting attribute assigning step S66, a weight appending total assigning step 67, a weighting order assigning step S68, and a processing character order erasing step 69.

【００１６】次に、このように構成された本実施例の文
字認識方式の動作について説明する。Next, the operation of the character recognition system according to the present embodiment configured as described above will be described.

【００１７】タブレット１から手書き文字が入力される
たびに、文字認識部３は、入力文字順番ｎ（ｎは１以上
の整数）を１から昇順に付与し、入力された文字を文字
認識辞書２を参照し特徴マッチング法等の公知の文字認
識アルゴリズムを用いて単数または複数の変換文字候補
コードに変換し、各変換文字候補コードに変換候補順番
を付して重み付け展開テーブル４に格納する。Each time a handwritten character is input from the tablet 1, the character recognition unit 3 assigns an input character sequence n (n is an integer of 1 or more) in ascending order from 1, and assigns the input character to the character recognition dictionary 2 Is converted to one or more converted character candidate codes using a known character recognition algorithm such as a feature matching method, and the converted character candidate codes are assigned the conversion candidate order and stored in the weighting expansion table 4.

【００１８】類似文字判別部６は、文字認識部３により
文字が変換文字候補コードに変換されて重み付け展開テ
ーブル４に格納されるたびに入力文字順番ｎを引数とし
て起動される。Each time the character recognizing unit 3 converts a character into a converted character candidate code and stores it in the weighting expansion table 4, the similar character discriminating unit 6 is activated with the input character order n as an argument.

【００１９】まず、類似文字判別部６は、重み付け規則
ファイル５の重み付け規則に基づいて重み付け展開テー
ブル４の入力文字順番ｎの各変換文字候補コードに文字
種重み付け属性を付与する（ステップ６１）。First, the similar character discriminating unit 6 assigns a character type weighting attribute to each converted character candidate code in the input character order n of the weighting development table 4 based on the weighting rule of the weighting rule file 5 (step 61).

【００２０】次に、類似文字判別部６は、入力文字順番
ｎが３未満かどうかを判断し（ステップ６２）、３未満
であればそのまま処理を終了する。Next, the similar character discriminating section 6 judges whether or not the input character order n is less than 3 (step 62).

【００２１】入力文字順番ｎが３以上であれば、類似文
字判別部６は、重み付け展開テーブル４の入力文字順番
（ｎ−２），（ｎ−１）およびｎに対して処理文字順番
１，２および３を付与する（ステップ６３）。If the input character sequence n is 3 or more, the similar character discriminating unit 6 processes the input character sequence (n−2), (n−1) and n in the weighted development table 4 into the processing character sequence 1, 2 and 3 are given (step 63).

【００２２】次に、類似文字判別部６は、処理文字順番
３の変換文字候補コードの文字種重み付け属性と、処理
文字順番１および２の変換文字候補コードの文字種重み
付け属性との比較を行い（ステップ６４）、処理文字順
番１，２および３のそれぞれの変換文字候補コードの文
字種重み付け属性が同じ組合せがあるかどうかを判断す
る（ステップ６５）。同じ文字種重み付け属性の組合せ
がなければ、類似文字判別部６は、ステップ６７に制御
を移す。Next, the similar character discriminating unit 6 compares the character type weighting attribute of the converted character candidate code of the processing character order 3 with the character type weighting attribute of the converted character candidate code of the processing character order 1 and 2 (step S1). 64) It is determined whether or not there is a combination having the same character type weighting attribute of the converted character candidate codes in the processing character order 1, 2, and 3 (step 65). If there is no combination of the same character type weighting attributes, the similar character determination unit 6 shifts the control to step 67.

【００２３】処理文字順番１，２および３のそれぞれの
変換文字候補コードの文字種重み付け属性が同じ組合せ
があれば、類似文字判別部６は、同じ文字種重み付け属
性の組合せを持つ変換文字候補コードの組合せ重み付け
属性に各々１を付与する（ステップ６６）。If there is the same combination of the character type weighting attributes of the converted character candidate codes in the processing character order 1, 2, and 3, the similar character discriminating unit 6 determines the combination of the converted character candidate codes having the same combination of the character type weighting attributes. One is assigned to each weighting attribute (step 66).

【００２４】次に、類似文字判別部６は、各々の変換文
字候補コードの文字種重み付け属性と組合せ重み付け属
性との和を重み付け属性合計に付与する（ステップ６
７）。Next, the similar character discriminating unit 6 gives the sum of the character type weighting attribute and the combination weighting attribute of each converted character candidate code to the total weighting attribute (step 6).
7).

【００２５】続いて、類似文字判別部６は、各々の変換
文字候補コードに対して重み付け属性合計の値の大きい
順に重み付け順番１，２，３，…を付与する（ステップ
６８）。Subsequently, the similar character discriminating unit 6 assigns weighting orders 1, 2, 3,... To the converted character candidate codes in descending order of the value of the total weighting attribute (step 68).

【００２６】最後に、類似文字判別部６は、処理文字順
番１，２，３を消去し（ステップ６９）、処理を終了す
る。Finally, the similar character discriminating unit 6 deletes the processing character order 1, 2, 3 (step 69), and ends the processing.

【００２７】次に、本実施例の文字認識方式の動作を、
図４および図５を参照しながら、「ア」，「メ」，
「リ」，「カ」および「で」と入力する例を用いて具体
的に説明する。Next, the operation of the character recognition system of this embodiment will be described.
Referring to FIG. 4 and FIG.
A specific description will be given using an example of inputting “ri”, “f”, and “de”.

【００２８】まず、文字「ア」が入力されると、文字認
識部３により、「ア」の変換文字候補コードが生成され
て重み付け展開テーブル４の入力文字順番に１、変換候
補順番に１、変換文字候補コードに「ア」のコードがそ
れぞれ登録され、類似文字識別部６により、重み付け展
開テーブル４の文字種重み付け属性のカタカナ種に３が
登録される（図４の入力文字順番１の列参照）。First, when the character "A" is input, the character recognizing unit 3 generates a conversion character candidate code of "A" and assigns 1 to the input character sequence in the weighting expansion table 4 and 1 to the conversion candidate sequence. The code of “A” is registered as the conversion character candidate code, and 3 is registered as the katakana type of the character type weighting attribute of the character type weighting attribute of the weighting expansion table 4 by the similar character identification unit 6 (see the column of input character order 1 in FIG. 4). ).

【００２９】次に、文字「メ」が入力されると、文字認
識部３により、「メ（カタカナ）」，「ｘ（英字）」お
よび「×（記号）」の変換文字候補コードが生成されて
重み付け展開テーブル４の入力文字順番に２、変換候補
順番に１，２および３、変換文字候補コードに「メ（カ
タカナ）」，「ｘ（英字）」および「×（記号）」のコ
ードがそれぞれ登録され、類似文字判別部６により、重
み付け展開テーブル４の文字種重み付け属性のカタカナ
種に３、英字種に３、その他すべてに０がそれぞれ登録
される（図４の入力文字順番２の列参照）。Next, when the character "me" is input, the character recognizing section 3 generates converted character candidate codes of "me (katakana)", "x (alphabet)" and "x (symbol)". In the weighting expansion table 4, the input character order is 2, the conversion candidate order is 1, 2 and 3, and the conversion character candidate codes are "me (katakana)", "x (alphabet)" and "x (symbol)". Each is registered, and the similar character discriminating unit 6 registers 3 for the katakana type, 3 for the alphabet type, and 0 for all others in the character type weighting attribute of the weight development table 4 (see the column of input character order 2 in FIG. 4). ).

【００３０】続いて、文字「リ」が入力されると、文字
認識部３により、「り（ひらがな）」および「リ（カタ
カナ）」の変換文字候補コードが生成されて重み付け展
開テーブル４の入力文字順番に３、変換候補順番に１お
よび２、変換文字候補コードに「り（ひらがな）」およ
び「リ（カタカナ）」のコードがそれぞれ登録され、類
似文字判別部６により、重み付け展開テーブル４の文字
種重み付け属性のひらがな種に３、カタカナ種に３がそ
れぞれ登録される（図４の入力文字順番３の列参照）。Subsequently, when the character “R” is input, the character recognition unit 3 generates converted character candidate codes of “Ri (Hiragana)” and “Ri (Katakana)” and inputs the converted character candidate code to the weighting expansion table 4. The character order is 3, the conversion candidate order is 1 and 2, and the conversion character candidate code is “Ri (Hiragana)” and “Ri (Katakana)” are registered. As the character type weighting attribute, 3 is registered as the hiragana type, and 3 is registered as the katakana type (see the column of input character order 3 in FIG. 4).

【００３１】次に、類似文字判別部６により、処理文字
順番３の変換文字候補コード「リ（カタカナ）」および
「り（ひらがな）」の文字種重み付け属性と、処理文字
順番１の変換文字候補コード「ア（カタカナ）」の文字
種重み付け属性ならびに処理文字順番２の変換文字候補
コード「メ（カタカナ）」，「ｘ（英字）」および「×
（記号）」の文字種重み付け属性との比較が行われ、同
じ文字種重み付け属性を持つ三文字の組合せがカタカナ
種なので、変換文字候補コード「ア（カタカナ）」，
「メ（カタカナ）」および「リ（カタカナ）」の組合せ
重み付け属性にそれぞれ１が付与される（図５の組合せ
重み付け属性の第１行参照）。Next, the similar character discriminating unit 6 converts the character type weighting attributes of the conversion character candidate codes “ri (katakana)” and “ri (hiragana)” in the processing character order 3 and the conversion character candidate code in the processing character order 1 Character type weighting attribute of "A (Katakana)" and converted character candidate codes "Me (Katakana)", "x (English character)" and "X"
(Symbol) ”is compared with the character type weighting attribute, and the combination of three characters having the same character type weighting attribute is the katakana type, so the conversion character candidate codes“ a (katakana) ”,
“1” is assigned to each of the combination weighting attributes “me (Katakana)” and “ri (Katakana)” (see the first row of the combination weighting attribute in FIG. 5).

【００３２】次に、文字「カ」が入力されると、文字認
識部３および類似文字判別部６により同様な処理が行わ
れ、変換文字候補コード「メ（カタカナ）」，「リ（カ
タカナ）」および「カ（カタカナ）」の組合せ重み付け
属性にそれぞれ１が付与される（図５の組合せ重み付け
属性の第２行参照）。これにより、入力文字順番２の変
換文字候補コード「メ（カタカナ）」，「ｘ（英字）」
および「×（記号）」の重み付け属性合計の値がこの順
番に大きくなり、それぞれ重み付け順番１，２および３
が付与される（図５の入力文字順番２の列参照）。Next, when the character "f" is input, similar processing is performed by the character recognizing unit 3 and the similar character discriminating unit 6, and the converted character candidate codes "me (katakana)" and "ri (katakana)""1" is assigned to each of the combination weighting attributes "" and "ka" (see the second row of the combination weighting attribute in FIG. 5). Thereby, the conversion character candidate codes “me (Katakana)” and “x (English character)” in the input character sequence 2
And the value of the sum of the weighting attributes of “× (symbol)” increases in this order, and the weighting order is 1, 2, and 3, respectively.
(See the column of input character sequence 2 in FIG. 5).

【００３３】続いて、文字「で」が入力されると、文字
認識部３および類似文字判別部６により同様な処理が行
われ、入力文字順番３の変換文字候補コード「り（ひら
がな）」および「リ（カタカナ）」の重み付け属性合計
の値がこの順番に小さくなり、それぞれ重み付け順番２
および１が付与される（図５の入力文字順番３の列参
照）。Subsequently, when the character "de" is input, similar processing is performed by the character recognizing unit 3 and the similar character discriminating unit 6, and the conversion character candidate codes "ri (hiragana)" and "3" in the input character order 3 are input. The value of the total weighting attribute of “ri (Katakana)” decreases in this order, and the weighting order 2
And 1 are given (see the column of input character order 3 in FIG. 5).

【００３４】また、入力文字順番４の変換文字候補コー
ドの「カ（カタカナ）」および「力（漢字）」の重み付
け属性合計の値がこの順番に大きくなり、それぞれ重み
付け順番１および２が付与される（図５の入力文字順番
４の列参照）。Further, the total value of the weighting attributes of “ka (katakana)” and “power (kanji)” of the conversion character candidate code in the input character order 4 becomes larger in this order, and weighting orders 1 and 2 are given, respectively. (See the column of input character sequence 4 in FIG. 5).

【００３５】この結果、複数の変換文字候補コードが発
生した文字「メ」，「リ」および「カ」についてカタカ
ナ種の重み付け順番がそれぞれ１になり、「ア」，
「メ」，「リ」，「カ」および「で」と類似文字が正し
く判別されることになる。As a result, the weighting order of the katakana type for the characters "me", "ri" and "ka" in which a plurality of converted character candidate codes have been generated becomes 1, and "a", "a",
Similar characters such as "me", "ri", "ka" and "de" are correctly determined.

【００３６】なお、上記実施例では、入力手段１として
タブレットを用いた手書き入力の場合を例にとって説明
したが、図６に示す類似文字判別部６の類似文字判別処
理が文字のストローク情報を使用していないので、入力
手段としてＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅ
ｒＲｅａｄｅｒ）を用いる場合にも本発明が同様に適
用できることはあきらかであろう。In the above-described embodiment, the case of handwriting input using a tablet as the input means 1 has been described as an example. However, the similar character determination process of the similar character determination unit 6 shown in FIG. As an input means, OCR (Optical Character
It will be apparent that the present invention can be similarly applied to the case where (r Reader) is used.

【００３７】[0037]

【発明の効果】以上説明したように本発明は、変換文字
候補コードに重み付け規則に基づいてカタカナ種，ひら
がな種，漢字種，数字種および英字種の文字種重み付け
属性を付与し、三文字ごとの組合せ重み付け属性を付与
して、重み付け属性合計により類似文字を判別するよう
にしたことにより、類似文字を判別するために文字種判
定テーブルに類似文字データを登録しておくことなしに
類似文字を判別することができるという効果がある。As described above, the present invention assigns character type weighting attributes of katakana type, hiragana type, kanji type, numeric type and alphabet type to the converted character candidate code based on the weighting rule, and provides By assigning the combination weighting attribute and determining the similar character by the total weighting attribute, the similar character is determined without registering the similar character data in the character type determination table in order to determine the similar character. There is an effect that can be.

[Brief description of the drawings]

【図１】本発明の一実施例に係る文字認識方式の構成を
示すブロック図である。FIG. 1 is a block diagram showing a configuration of a character recognition system according to one embodiment of the present invention.

【図２】図１中の重み付け規則ファイルの内容を例示す
る図である。FIG. 2 is a diagram illustrating the contents of a weighting rule file in FIG. 1;

【図３】図１中の重み付け展開テーブルの内容を示す図
である。FIG. 3 is a diagram showing the contents of a weighting development table in FIG. 1;

【図４】図３の重み付け展開テーブルの状態遷移図であ
る。FIG. 4 is a state transition diagram of a weight development table of FIG. 3;

【図５】図４の重み付け展開テーブルの状態遷移図であ
る。FIG. 5 is a state transition diagram of the weighting development table of FIG. 4;

【図６】図１中の類似文字判別部の処理を示す流れ図で
ある。FIG. 6 is a flowchart showing processing of a similar character discriminating unit in FIG. 1;

[Explanation of symbols]

１タブレット２文字認識辞書３文字認識部４重み付け展開テーブル５重み付け規則ファイル６類似文字判別部 1 Tablet 2 Character Recognition Dictionary 3 Character Recognition Unit 4 Weight Expansion Table 5 Weighting Rule File 6 Similar Character Discrimination Unit

Claims

(57) [Claims]

1. A character recognition system for recognizing an input character and converting the character into a sentence mixed with kanji and kana, comprising: input means for inputting a character; and converting the character input by the input means with reference to a character recognition dictionary. A character recognition unit for converting to a character candidate code, and character type weighting attributes of katakana type, hiragana type, kanji type, numeric type and alphabet type are assigned to the converted character candidate code converted by the character recognition unit based on a weighting rule. And a similar character discriminating unit for assigning a combination weighting attribute for each of the three characters and discriminating similar characters based on a weighting attribute sum that is a sum of the character type weighting attribute and the combination weighting attribute.

2. An area for registering an input character order, an area for registering a processing character order, an area for registering a conversion candidate order, an area for registering a conversion character candidate code, and a combination weighting table. 2. The character recognition method according to claim 1, comprising an area for registering an attribute, an area for registering a total of weighted attributes, and an area for registering a weighting order.

3. The character recognition system according to claim 1, wherein said input means is a tablet.

4. The input means comprises an OCR.
The character recognition method described.