JPS6095688A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPS6095688A
JPS6095688A JP58203340A JP20334083A JPS6095688A JP S6095688 A JPS6095688 A JP S6095688A JP 58203340 A JP58203340 A JP 58203340A JP 20334083 A JP20334083 A JP 20334083A JP S6095688 A JPS6095688 A JP S6095688A
Authority
JP
Japan
Prior art keywords
character
recognition
memory
characters
pattern
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58203340A
Other languages
Japanese (ja)
Inventor
Toru Suzuki
透 鈴木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to JP58203340A priority Critical patent/JPS6095688A/en
Publication of JPS6095688A publication Critical patent/JPS6095688A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To improve the character recognition factor by reading the corresponding 2nd character code out of a replacing character memory to use it as the final result of recognition in case the result of recognition is equal to the 1st character code. CONSTITUTION:The pattern information is obtained by scnanning characters with photoelectric conversion and then separated every character to be supplied to a character recognizing part 2. A control circuit 5 delivers a recognition start command to the part 2 and a dictionary pattern memory 3. The part 2 calculates the similarity between each standard character pattern in the memory 3 and segmented unknown pattern and delivers the patterns having the highest similarity to the circuit 5 as a result of recognition. The circuit 5 searches a replacing character memory 7 and discriminates whether the codes of characters to be replaced are registered to a corresponding memory area. Then the character code if registered is used as the final result of recognition.

Description

【発明の詳細な説明】 〔発明の枝術分野〕 本発明は元学的文字読1収装置等の文字認識装置に関す
る。
DETAILED DESCRIPTION OF THE INVENTION [Branch Field of the Invention] The present invention relates to a character recognition device such as an original character reading device.

〔発明の技術的背景とその問題点〕[Technical background of the invention and its problems]

近年、文字認識装置の対象とする文字種がますます拡大
してきており、英字、数字、カナ、記号に限定されてい
た対象文字種が1通常の日本語文章にあられれる文字種
、すなわち英字、叡字、カナ、記号、ひらがな、漢字に
拡大−てきている。
In recent years, the range of character types that character recognition devices can target has been expanding more and more. It has been expanded to include kana, symbols, hiragana, and kanji.

しかしながら対象文字種の拡大とともに認1識率が低下
せざるな得ない。これは「天」と「夫」。
However, as the target character types expand, the recognition rate inevitably decreases. These are "Ten" and "Husband".

「大」と「尤」等の字体が似ている文字(例えば「び」
と「び」の増加と、記入や印字のくずれにより他の文字
に似てしまう文字の増加が原因のひとつであり、認識率
を向上させるのが非常に困難であった。
Characters with similar fonts such as ``大'' and ``尤'' (for example, ``bi'')
One of the reasons for this was the increase in the number of ``bi'' characters and the increase in characters that resembled other characters due to errors in writing or printing, making it extremely difficult to improve the recognition rate.

〔発明の目的〕[Purpose of the invention]

本発明は上記事情を考慮してなされたもので、認識率を
高めることのできる文字認iil&装置を提供すること
を目的とする。
The present invention has been made in consideration of the above circumstances, and an object of the present invention is to provide a character recognition device capable of increasing the recognition rate.

〔発明の概要〕[Summary of the invention]

この目的を達成するために本発明による文字認識装置は
、第1の文字コードに対応して第2の文字コードを記憶
している取換え文字メモリと、認識結果が前記第1の文
字コードであったとき前記敗換え文字メモリから読み出
した前記第2の文字コードを最終的な認識結果とする処
理手段とを具備したことを特徴とする。
In order to achieve this object, the character recognition device according to the present invention includes a replacement character memory storing a second character code corresponding to the first character code, and a character recognition device in which the recognition result is based on the first character code. The present invention is characterized by comprising a processing means for making the second character code read from the lost character memory when the character code is lost as the final recognition result.

〔発明の実施例〕[Embodiments of the invention]

本発明の一実施例による文字認識装置の主要部を第1図
に示す。文字認識部2は1図示しない光電変換部から出
力される被読取り文字が記録された帳票を走査し光電変
換して得たパターン情報から1文字ごとに切出された未
知パターン7入力し。
FIG. 1 shows the main parts of a character recognition device according to an embodiment of the present invention. The character recognition section 2 inputs an unknown pattern 7 cut out character by character from the pattern information obtained by scanning and photoelectrically converting a document in which characters to be read are recorded and output from a photoelectric conversion section (not shown).

予め標準文字パターンが登録されている辞淋パターンメ
モリ3の文字パターンと比較して認識する。
It is recognized by comparing it with the character pattern in the dictionary pattern memory 3 in which standard character patterns are registered in advance.

従来はその認識結果をそのまま最終的な;は識結果とし
ていたが1本実施例では嘔換え文字が記1.ばされた取
換え文字メモリ7を備えている。取換え文字メモリ7に
は、本実施例の文字読取装置力1読み嘔ろことができる
全ての文字に対応してメモリ領域を有し、このメモリ領
域にはその文字に対して喉換えるべき文字の文字コード
が記憶されており。
Conventionally, the recognition result was used as the final recognition result, but in this embodiment, the ``transformation'' character is written as ``1.''. A replacement character memory 7 is provided. The replacement character memory 7 has memory areas corresponding to all the characters that can be read by the character reading device of this embodiment, and this memory area contains the characters to be replaced with the characters. The character code of is memorized.

収り換えるべき文字がない場合にはオール10“を記憶
させておく。例えば「尤」と認識されても最終的な認識
結果を「大」としたい場合に&瓜「尤Jの文字コードと
「大」の文字コードを対応させて記憶しておく。取換文
字メモリ7と文字認識部2と辞書パターンメモリ3は制
御回路5により制御される。
If there are no characters to replace, all 10" is memorized. For example, if you want the final recognition result to be "large" even if it is recognized as "Yu", use the character code of Correlate and memorize the character code for "large". The replacement character memory 7, the character recognition unit 2, and the dictionary pattern memory 3 are controlled by a control circuit 5.

この文字認識装置に第2図(a)に示すような未知パタ
ーンが入力したものとする。11 、12はその文字行
のフィールド番号であり1文字行の各フィールドごとに
付されフォーマットコントロールがなされる。第2図(
a)に示す文字が走査され光電変1婆されて得られたパ
ターン情報は、図示しない回路により1文字ずつ切出さ
れ文字認識部2に入力される。その後、制御回路5は文
字認識部2と辞書パターンメモリ3に認識開始コマンド
を出す。すると文字認識部2は辞書パターンメモリ3内
の各標準文字パターンと切出された未知パターンとの類
似変針Nをおこない、最も類似度の高いもの欠認識結果
とし、制御回路5へ出力する。制御回路5は取換え文字
メモリ7乞サーチしこの認識結果の文字コードに対応す
るメモリ領域に敗り換えるべき文字の文字コードが登録
されているか判断し。
Assume that an unknown pattern as shown in FIG. 2(a) is input to this character recognition device. 11 and 12 are field numbers of the character line, which are assigned to each field of one character line to perform format control. Figure 2 (
The pattern information obtained by scanning and photoelectrically converting the characters shown in a) is cut out character by character by a circuit (not shown) and input to the character recognition section 2. Thereafter, the control circuit 5 issues a recognition start command to the character recognition section 2 and dictionary pattern memory 3. Then, the character recognition unit 2 performs a similarity change N between each standard character pattern in the dictionary pattern memory 3 and the cut out unknown pattern, and outputs the one with the highest degree of similarity as the missing recognition result to the control circuit 5. The control circuit 5 searches the replacement character memory 7 and determines whether the character code of the character to be replaced is registered in the memory area corresponding to the character code resulting from this recognition.

登録されている場合にはその文字コードを最終的な認識
結果とし、登録されていない場合には、前記認識結果の
文字コードをそのまま最終的な認識結果とする。
If the character code is registered, the character code is used as the final recognition result, and if it is not registered, the character code of the recognition result is used as the final recognition result.

今、取換え文字メモIJ7に、文字「尤」に対し取り換
えるべき文字「大」が5文字「ビ」に対し収り喚えるべ
き文字「ビ」が登録されているものとすると1文字認識
部2による認識結果が第2図(b)に示すように「ピー
ルビン」 「尤きい湖」と出ても、最終的な認識結果で
は「ピ」が「ビjVc「尤」カ1大」に取り換えられ「
ピールビン」 「大きい湖」と正しい認識結果が得られ
る。
Now, suppose that in the replacement character memo IJ7, there are 5 characters "dai" which should be replaced with the character "尤" and a character "bi" which should be replaced with the character "bi". As shown in Figure 2 (b), even if the recognition result from 2 shows "Peel bottle" and "Yukii Lake", in the final recognition result, "Pi" is replaced with "BijVc'尤'Ka1dai". "
The correct recognition results are obtained as ``peel bottle'' and ``large lake.''

収換え文字メモリ7への登録はどのようlよ方法でおこ
′なってもよいが、例えば帳票のフォーマットコントロ
ールデータな入力すると佐に同時に入力するようにする
方法のように、ユーザが自由に登録できる方法が望まし
い。
Registration in the replacement character memory 7 may be performed in any way, but the user may freely register the data, such as inputting format control data of a form at the same time. A method that allows this is preferable.

〔発明の効果」 以上の通り1本発明によれば、取換え文字メモリにユー
ザが登録することにより、最終的な認識結果を変更する
ことができる。したがって、出現頻度の低い文字で類似
した文字がある場合、頻度の高い文字VC嘔換えること
とすれば、認識率が向上する。また認識対象文字以外の
新たな文字を認識したい場合、対象文字のうち類似して
いる文字ビ新たな文字に敗換えるよう登録しておけば、
認識対象以外の文字の認識も可能である。ユーザがある
文字をまったく別な文字として読ませたい場合に有効で
ある。
[Effects of the Invention] As described above, according to the present invention, the final recognition result can be changed by the user registering in the replacement character memory. Therefore, if there are similar characters that appear less frequently, the recognition rate will improve if the characters VC that appear more frequently are replaced with the characters VC that appear more frequently. Also, if you want to recognize a new character other than the target character, you can register a similar character among the target characters to be replaced with a new character.
It is also possible to recognize characters other than those to be recognized. This is effective when the user wants a certain character to be read as a completely different character.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例による文字認識装置4の主要
部のブロック比、第2図は同装置+7)動rV=ン詣、
明するための図である。 2・・・文字認識部、3・・・辞書パターンメモリ。 5・・・制御回路、7・・・取換文字メモリ。 出願人代理人 猪 股 清
FIG. 1 shows the block ratio of the main parts of a character recognition device 4 according to an embodiment of the present invention, and FIG. 2 shows the block ratio of the main part of the character recognition device 4 according to an embodiment of the present invention.
FIG. 2...Character recognition section, 3...Dictionary pattern memory. 5... Control circuit, 7... Replacement character memory. Applicant's agent Kiyoshi Inomata

Claims (1)

【特許請求の範囲】[Claims] 認識結果を文字コードで出力する文字認識装置において
、第1の文字コードに対応して第2の文字コードを記憶
している取換え文字メモリと、認識結果が前記第1の文
字コードであったとき前記取換え文字メモリから読み出
した前記第2の文字コードχ最終的な認識結果とする処
理手段と欠具備したことケ特徴とする文字認識装置。
In a character recognition device that outputs a recognition result as a character code, a replacement character memory stores a second character code corresponding to the first character code, and the recognition result is the first character code. A character recognition device characterized in that the second character code χ read from the replacement character memory is further provided with processing means for producing a final recognition result.
JP58203340A 1983-10-29 1983-10-29 Character recognizing device Pending JPS6095688A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58203340A JPS6095688A (en) 1983-10-29 1983-10-29 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58203340A JPS6095688A (en) 1983-10-29 1983-10-29 Character recognizing device

Publications (1)

Publication Number Publication Date
JPS6095688A true JPS6095688A (en) 1985-05-29

Family

ID=16472397

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58203340A Pending JPS6095688A (en) 1983-10-29 1983-10-29 Character recognizing device

Country Status (1)

Country Link
JP (1) JPS6095688A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62145387A (en) * 1985-12-19 1987-06-29 Mitsubishi Electric Corp Reader for sequencer ladder diagram

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62145387A (en) * 1985-12-19 1987-06-29 Mitsubishi Electric Corp Reader for sequencer ladder diagram

Similar Documents

Publication Publication Date Title
JPH07121664A (en) Automatic decision apparatus of european language
JPS6095688A (en) Character recognizing device
JPH02280195A (en) Character generator
JPS6336389A (en) Character reader
JPS5842904B2 (en) Handwritten kana/kanji character recognition device
JPS61114388A (en) Character input device
JPS6061875A (en) Generation system of standard pattern
JP2939945B2 (en) Roman character address recognition device
JPS61175870A (en) Optical character reading device
JPS6095689A (en) Optical character reader
JP2875678B2 (en) Post-processing method of character recognition result
JPS60110089A (en) Character recognizer
JPH0338787A (en) Character recognition processor
JPH0573027A (en) Individual penmanship dictionary generation device and character output processor using individual penmanship dictionary
JPS6327992A (en) Generating method for dictionary for individual
JPS62119691A (en) Character clean copy device
JPH01301355A (en) Character interval control system
JPH0554180A (en) Slop format defining system for optical character reader
JPS6356756A (en) Western language preparing device with correcting function
JPS5820075B2 (en) pattern recognition device
JPS61121159A (en) Character input device
JPS62288989A (en) Character recognizing system
JPH07296104A (en) Optical character reader
JPH0546806A (en) Character recognition method
JPS6029883A (en) Optical character reader