JPS62281082A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPS62281082A
JPS62281082A JP61123728A JP12372886A JPS62281082A JP S62281082 A JPS62281082 A JP S62281082A JP 61123728 A JP61123728 A JP 61123728A JP 12372886 A JP12372886 A JP 12372886A JP S62281082 A JPS62281082 A JP S62281082A
Authority
JP
Japan
Prior art keywords
character
recognition
input
candidate
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61123728A
Other languages
Japanese (ja)
Inventor
Kazue Kaneko
和恵 金子
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Priority to JP61123728A priority Critical patent/JPS62281082A/en
Publication of JPS62281082A publication Critical patent/JPS62281082A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To attain the corrector character recognition by outputting the evaluation based upon the difference of the similarity of an input character together with a recognizing candidate by matching with a recognizing result. CONSTITUTION:At a matching part 4, a large classification is executed, the characteristic outputted from a characteristic extracting part 3 is matched with some of the standard patterns stored in a dictionary part 5, and plural sets of the difference or similarity with the information of the character type of a standard pattern which comes to be the object of the matching are set to a character candidate deciding part 6. Next, when it is judged that the mode set earlier is not the estimation output mode, the character candidate deciding part 6 executes the output as the recognizing result based upon the difference of similarity out of the results of plural matchings, selects the candidate of a character code and sends it to a result output part 8. When the candidate cannot be selected, the operation is returned to a pre-pressing part 2 and re- executed, and when re-execution exceeds the constant number of times, the non-recognizable signal is sent to the result output part. When it is judged that the mode is the evaluation output mode, the non-recognizable signal is sent to an evaluation deciding part 9.

Description

【発明の詳細な説明】 3、発明の詳細な説明 [産業上の利用分野] 本発明は入力した文字記号を認識する文字認識装置に関
するものである。
Detailed Description of the Invention 3. Detailed Description of the Invention [Field of Industrial Application] The present invention relates to a character recognition device that recognizes input character symbols.

[従来の技術] 従来、この種の装置は、認識結果として入力文字に対す
る候補として挙げられる文字コード、もしくは認識不能
を意味する信号のみを出力するよう構成されていたので
、入力文字がどのような書ぎ方で、どのような字形であ
れば認識がより正確に行えるかについては暖味な点があ
った。
[Prior Art] Conventionally, this type of device has been configured to output only character codes that are candidates for the input character as a recognition result, or a signal indicating that the input character is unrecognizable. There were some warm points regarding the type of writing style that would allow for more accurate recognition.

また、書き方の見本が掲げられていても、文字認識装置
は、その種類成いは認識方式の違いにより、認識がより
正確となる入力文字の書き方や字形に違いがあり、見本
のように書いたつもりの文字を入力しても正しい結果が
得られない場合や、見本と違う書き方をした文字が正し
く認識される場合もある。
In addition, even if writing samples are posted, character recognition devices differ in the writing style and shape of input characters that can be recognized more accurately depending on the type and recognition method. In some cases, you may not get the correct result even if you enter the characters you intended to write, and in some cases, characters written in a different way than the sample may be recognized correctly.

[発明が解決しようとする問題点] 本発明は、上記従来技術に鑑みなされたものであり、入
力文字と標準パターンの相違度又は類似度に基づいた評
価を認識結果と合わせて出力することで、上述した従来
の暖味さを減少させる文字認識装置を提供することにあ
る。
[Problems to be Solved by the Invention] The present invention has been made in view of the above-mentioned prior art, and is capable of outputting an evaluation based on the degree of difference or similarity between an input character and a standard pattern together with a recognition result. Another object of the present invention is to provide a character recognition device that reduces the conventional warmth described above.

[問題点を解決するための手段] この問題を解決するために本発明は以下の様な構成から
なる。
[Means for solving the problem] In order to solve this problem, the present invention has the following configuration.

すなわち、文字記号を入力する入力手段と、該入力手段
により入力された文字記号を認識して認識候補を出力す
る認識手段と、該認識手段により認識された候補に対し
て前記入力手段により入力した文字記号の類似度或いは
相違度を報知する報知手段とを備える。
That is, an input means for inputting character symbols, a recognition means for recognizing the character symbols input by the input means and outputting recognition candidates, and a recognition means for inputting the characters input by the input means with respect to the candidates recognized by the recognition means. and notifying means for notifying the degree of similarity or difference of character symbols.

[作用] かかる本発明の構成において、入力手段により入力され
た文字記号を認識手段により認識して出力された認識候
補と、入力された文字記号の認識候補に対する類似度或
いは相違度を報知手段により報知する。
[Operation] In the configuration of the present invention, the recognition means recognizes the character symbol inputted by the input means and outputs the recognition candidate, and the degree of similarity or dissimilarity between the recognition candidate of the input character symbol is notified by the notification means. inform.

[実施例] 以下添付図面に従って本発明に係る実施例を詳細に説明
する。
[Examples] Examples according to the present invention will be described in detail below with reference to the accompanying drawings.

第1図は本実施例における文字認識装置のブロック構成
図である。
FIG. 1 is a block diagram of a character recognition device in this embodiment.

図中、1は文字の入力部であり、光学的文字認識装置の
場合はスキャナ或いはオンライン、手書き文字認識の場
合はディジタイザである。2は前処理部であり、入力さ
れたイメージデータが多値であれば2値化し、ノイズ除
去、平滑化或いは細線化等の中で必要な前処理を行う。
In the figure, 1 is a character input unit, which is a scanner or online in the case of an optical character recognition device, and a digitizer in the case of handwritten character recognition. A preprocessing unit 2 binarizes input image data if it is multivalued, and performs necessary preprocessing such as noise removal, smoothing, line thinning, etc.

3は特徴抽出部で、前処理部2により施されたイメージ
データからマツチングのために必要な特徴を抽出する。
Reference numeral 3 denotes a feature extraction section which extracts features necessary for matching from the image data processed by the preprocessing section 2.

特徴は輪郭、背景、芯線等のどれか1つ、又は複数の組
合わせを用いる。4はマツチング部で、入力文字の特徴
と後述する辞書部5の中にある標準パターンとのマツチ
ングを行い、相違度又は類似度を計算する。またマツチ
ングする標準パターンを予め絞る作業(大分類等)もこ
のマツチング部4で行う。5は辞書部で、標準パターン
を納めている。6は文字候補判定部で、マツチングの対
象となった標準パターンの文字種と、相違度、又は類似
度をもとに認識結果として出力する文字コードの候補を
選ぶ。7はモード信号入力部で、入力文字と標準パター
ンの相違度又は類似度に基づいた入力文字の評価も出力
するかしないかの信号を文字候補判定部6に送る。尚、
このモード信号入力部7は装置の外部に設けられたスイ
ッチでもって付勢する様にしてもよいし、外部装置でも
って制御する様にしても構わない。8は結果出力部であ
り、プリンタでもCRTでも良く、電子計算機へのイン
タフェースであっても構わない。9は評価判定部で、入
力文字と標準パターンの相違度、又は類似度に基づいた
入力文字の評価を行い、例えばANDの4段階にランク
1寸けするものとする。IQOは本装置全体を制御する
CP’Uであり、後述する第2図のフローチャートの処
理手順に従って動作するものである。尚、このフローチ
ャートに対応するプログラムは101のROMに格納さ
れているものである。102はCPU100のワークエ
リアとして使用するRAMである。
As the feature, one or a combination of outlines, backgrounds, core lines, etc., is used. Reference numeral 4 denotes a matching section, which performs matching between the characteristics of the input characters and standard patterns in the dictionary section 5, which will be described later, to calculate the degree of difference or similarity. The matching section 4 also performs the work of preliminarily narrowing down the standard patterns to be matched (major classification, etc.). 5 is a dictionary section which stores standard patterns. A character candidate determination unit 6 selects character code candidates to be output as recognition results based on the character type of the standard pattern to be matched and the degree of difference or similarity. Reference numeral 7 denotes a mode signal input section which sends a signal to the character candidate determination section 6 indicating whether or not to output an evaluation of the input character based on the degree of difference or similarity between the input character and the standard pattern. still,
This mode signal input section 7 may be energized by a switch provided outside the device, or may be controlled by an external device. 8 is a result output unit, which may be a printer, a CRT, or an interface to an electronic computer. Reference numeral 9 denotes an evaluation determination unit that evaluates the input character based on the degree of difference or similarity between the input character and the standard pattern, and ranks the input character in 4 stages of AND, for example. The IQO is a CPU'U that controls the entire device, and operates according to the processing procedure of the flowchart in FIG. 2, which will be described later. Note that the program corresponding to this flowchart is stored in the ROM 101. 102 is a RAM used as a work area for the CPU 100.

この例を第3図(a)、(b)に示す。An example of this is shown in FIGS. 3(a) and 3(b).

第3図(a)は、オペ1ノータが文字「A」を意識して
入力したときの認識結果とその評価を示すものであるり
、いずれも認識結果の文字は「A」となっている。
Figure 3 (a) shows the recognition results and evaluations when Operator 1 Nota inputs the letter ``A'' consciously, and in both cases, the recognition result is ``A''. .

入力文字30の場合には文字r A Jを形成する3木
の線分(ベクトル)が標準パターン「A」とほとんど同
じであるから、その評価はA°゛となっている。入力文
字31の場合には文字「A」中の水平線゛°−°°であ
る線分が所定幅内に置かれているが、この線分が傾いて
いるので評価はB′。
In the case of the input character 30, the line segments (vectors) of the three trees forming the character r A J are almost the same as the standard pattern "A", so the evaluation is A°. In the case of the input character 31, the line segment which is the horizontal line ゛°-°° in the character "A" is placed within a predetermined width, but since this line segment is tilted, the evaluation is B'.

となっている。入力文字32に対しては入力文字31で
の水平線が飛出しているものである。通常、認識処理で
はこの飛び出した線を別の線分と判断する場合かあるの
で、認識結果の文字「A」に対しての認識率はあまり良
くない状態を示す。
It becomes. The horizontal line of the input character 31 protrudes from the input character 32. Normally, in recognition processing, this protruding line may be determined to be another line segment, so the recognition rate for the character "A" as a recognition result is not very good.

さて、入力文字33に対しては、3つの線分それぞれに
“跳′°や′飛び出した線分”があり、認識結果の文字
は「A」となっているが、他の認識候補がみつからなか
った状態を示すものであり、認識結果としての信頼度が
ほとんどない状態を示すものである。
Now, for input character 33, each of the three line segments has a "jump" or "jumping line segment", and the recognition result is the character "A", but no other recognition candidates were found. This indicates a state in which there was no recognition result, and a state in which the recognition result has almost no reliability.

以上の様に入力した文字記号に対する評価を出力するが
、上述した認識判定の基準は認識方式によって異なるの
で、この認識方式に限定されるものではない。尚、第3
図(b)に他の入力文字に対する認識結果と認識評価を
出力した例を示す。
As described above, the evaluation of the input characters and symbols is output, but the criteria for recognition determination described above differ depending on the recognition method, so the evaluation is not limited to this recognition method. Furthermore, the third
Figure (b) shows an example in which recognition results and recognition evaluations for other input characters are output.

以上の様な構成からなる本実施例の文字認識装置の動作
を第2図に示すフローチャートに従って説明する。
The operation of the character recognition device of this embodiment having the above-mentioned configuration will be explained with reference to the flowchart shown in FIG.

先ず、ステップS1ではモード設定する。このモードと
は、認識結果出力される認識候補に対して入力部1より
入力した文字記号の類似度又は相違度を出力するかしな
いかを決定するものである。次にステップS2におい′
C入力部1から文字記号のイメージデータな入力し、前
処理部2で前処理を施し、特徴抽出部3にイメージデー
タな送り、認識処理に移る。ステップS3では、特徴抽
出部3に送られてきたイメージデータから特徴を抽出し
、抽出された特徴情報をマツチング部4へ送る。マツチ
ング部4では大分類を行い、特徴抽出部3から出力され
た特徴を辞書部5に納められている標準パターンのいく
つかとマツチングを行い、マツチングの対象となった標
準パターンの文字種の情報と相違度又は類似度を複数組
、文字候補判定部6へ送る。次にステップS4で先に設
定したモードが評価出力モードであるか否かを判断する
。この判断で評価出力モードでないと判断した場合には
ステップS4に移り、文字候補判定部6は複数のマツチ
ングの結果の中から、相違度又は類似度をもとに認識結
果として出力し、文字コードの候補を選び、結果出力部
8へ送る。尚、候補が選べない場合は前処理部2へ戻っ
てやり直すが、やり直しが一定回数を越えると、やり直
しを行わず、認識不能の信号を結果出力部8へ送る。
First, in step S1, a mode is set. This mode determines whether or not to output the similarity or dissimilarity of the character symbol input from the input unit 1 with respect to the recognition candidate output as the recognition result. Next, step S2'
Image data of characters and symbols is inputted from the C input section 1, preprocessed by the preprocessing section 2, and sent to the feature extraction section 3, whereupon the process proceeds to recognition processing. In step S3, features are extracted from the image data sent to the feature extraction section 3, and the extracted feature information is sent to the matching section 4. The matching unit 4 performs broad classification, matches the features output from the feature extraction unit 3 with some of the standard patterns stored in the dictionary unit 5, and compares the character type information of the standard patterns targeted for matching. A plurality of sets of degrees or similarities are sent to the character candidate determination section 6. Next, in step S4, it is determined whether the previously set mode is the evaluation output mode. If it is determined that it is not the evaluation output mode in this determination, the process moves to step S4, and the character candidate determination unit 6 outputs the recognition result based on the degree of dissimilarity or similarity from among the plurality of matching results, and outputs the character code as a recognition result. The candidates are selected and sent to the result output unit 8. If no candidate can be selected, the process returns to the preprocessing unit 2 and tries again, but if the process is repeated a certain number of times, the process does not start again and sends an unrecognizable signal to the result output unit 8.

また、ステップS4で評価出力モードであると判断した
場合(モード信号入力部7から入力文字の評価を出力す
る信号が文字候補判定部6に送られていいる場合)、ス
テップS6において文字候補判定部6は文字コード、又
は認識不能の信号を結果出力部8へ送るとともに文字コ
ードの文字種に対応する相違度又は類似度もしくは認識
不能の信号を評価判定部9に送る。評価判定部9は文字
候補判定部6からの出力をもとに評価を行い、評価結果
を結果出力部8へ送ることになる。
Further, if it is determined in step S4 that the mode is the evaluation output mode (if a signal for outputting the evaluation of the input character is sent from the mode signal input section 7 to the character candidate determination section 6), the character candidate determination section 6 sends the character code or an unrecognizable signal to the result output unit 8, and also sends the degree of difference or similarity corresponding to the character type of the character code or an unrecognizable signal to the evaluation determination unit 9. The evaluation determining section 9 performs evaluation based on the output from the character candidate determining section 6, and sends the evaluation result to the result output section 8.

以上説明した様に本実施例によれば、入力文字と認識候
補との相違度又は類似度に基づいた評価を認識結果と合
わせて出力するため、文字入力に対する筆記者の教育に
役立ち、筆記者に正確に認識できる筆記法を自覚させ、
より正確な文字認識が可能となる。
As explained above, according to the present embodiment, an evaluation based on the degree of difference or similarity between input characters and recognition candidates is output together with the recognition results, which is useful for educating scribes regarding character input. to make students aware of accurate writing methods,
More accurate character recognition becomes possible.

また、本実施例において、認識結果をA−Dの4段階で
認識評価したが、例えば百分率で出力する様にしても構
わない。
Further, in this embodiment, the recognition results are evaluated in four stages A to D, but they may be output as a percentage, for example.

[発明の効果] 以上説明した様に本発明によれば、入力文字と認識候補
との相違度又は類似度に基づいた評価を認識結果と合わ
せて出力するため、文字入力に対する筆記者の教育に役
立ち、筆記者に正確に認識できる筆記法を自覚させ、よ
り正確な文字認識が可能となる。
[Effects of the Invention] As explained above, according to the present invention, an evaluation based on the degree of difference or similarity between an input character and a recognition candidate is output together with a recognition result, which is useful for training scribes on character input. It is useful to make scribes aware of handwriting methods that can be recognized accurately, and enables more accurate character recognition.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本実施例の文字認識装置のブロック構成図、 第2図は本実施例の文字認識装置の動作を説明するため
のフローチャート、 第3図(a)、(b)は本実施例における入力文字に対
する認識結果と認識評価を示した図である。 図中、1・・・文字入力部、2・・・前処理部、3・・
・特徴抽出部、4・・・マツチング部、5・・・辞書部
、6・・・文字候補判定部、7・・・モード信号入力部
、8・・・結果出力部、9・・・評価判定部、30〜3
3・・・入力文字、100・CPU、101・ROM、
102・・・RAMである。 特許出願人   キャノン株式会社 −へ43− (b) 第3図
Fig. 1 is a block diagram of the character recognition device of this embodiment, Fig. 2 is a flowchart for explaining the operation of the character recognition device of this embodiment, and Fig. 3 (a) and (b) are this embodiment. FIG. 3 is a diagram showing recognition results and recognition evaluations for input characters in FIG. In the figure, 1...character input section, 2...preprocessing section, 3...
・Feature extraction unit, 4...Matching unit, 5...Dictionary unit, 6...Character candidate determination unit, 7...Mode signal input unit, 8...Result output unit, 9...Evaluation Judgment section, 30-3
3... Input characters, 100・CPU, 101・ROM,
102...RAM. Patent applicant: Canon Co., Ltd. -He43- (b) Figure 3

Claims (4)

【特許請求の範囲】[Claims] (1)文字記号を入力する入力手段と、該入力手段によ
り入力された文字記号を認識して認識候補を出力する認
識手段と、該認識手段により認識された候補に対して前
記入力手段により入力した文字記号の類似度或いは相違
度を報知する報知手段とを備えることを特徴とする文字
認識装置。
(1) An input means for inputting character symbols, a recognition means for recognizing the character symbols input by the input means and outputting recognition candidates, and an input means for inputting the candidates recognized by the recognition means using the input means. 1. A character recognition device comprising: notification means for notifying the degree of similarity or difference between character symbols.
(2)報知手段は類似度或いは相違度を所定の段階に分
けて報知することを特徴とする特許請求の範囲第1項記
載の文字認識装置。
(2) The character recognition device according to claim 1, wherein the notifying means notifies the degree of similarity or difference in predetermined stages.
(3)入力手段は手書き入力であることを特徴とする特
許請求の範囲第1項記載の文字入力装置。
(3) The character input device according to claim 1, wherein the input means is handwritten input.
(4)入力手段は原稿上に記録された文字記号を光学的
に読取ることを特徴とする特許請求の範囲第1項記載の
文字認識装置。
(4) The character recognition device according to claim 1, wherein the input means optically reads character symbols recorded on the original.
JP61123728A 1986-05-30 1986-05-30 Character recognizing device Pending JPS62281082A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61123728A JPS62281082A (en) 1986-05-30 1986-05-30 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61123728A JPS62281082A (en) 1986-05-30 1986-05-30 Character recognizing device

Publications (1)

Publication Number Publication Date
JPS62281082A true JPS62281082A (en) 1987-12-05

Family

ID=14867885

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61123728A Pending JPS62281082A (en) 1986-05-30 1986-05-30 Character recognizing device

Country Status (1)

Country Link
JP (1) JPS62281082A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5050221A (en) * 1989-02-13 1991-09-17 Ricoh Company, Ltd. Image generating apparatus
US5111514A (en) * 1989-10-05 1992-05-05 Ricoh Company, Ltd. Apparatus for converting handwritten characters onto finely shaped characters of common size and pitch, aligned in an inferred direction
US5195147A (en) * 1989-05-02 1993-03-16 Ricoh Company, Ltd. Image forming apparatus

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5050221A (en) * 1989-02-13 1991-09-17 Ricoh Company, Ltd. Image generating apparatus
US5195147A (en) * 1989-05-02 1993-03-16 Ricoh Company, Ltd. Image forming apparatus
US5111514A (en) * 1989-10-05 1992-05-05 Ricoh Company, Ltd. Apparatus for converting handwritten characters onto finely shaped characters of common size and pitch, aligned in an inferred direction

Similar Documents

Publication Publication Date Title
US5539841A (en) Method for comparing image sections to determine similarity therebetween
JP3155577B2 (en) Character recognition method and device
EP0439743A2 (en) Constraint driven on-line recognition of handwritten characters and symbols
JP2730665B2 (en) Character recognition apparatus and method
US5621818A (en) Document recognition apparatus
EP0432937B1 (en) Hand-written character recognition apparatus
US5659633A (en) Character recognition method utilizing compass directions and torsion points as features
JPS62281082A (en) Character recognizing device
JPH0520794B2 (en)
JPH10302025A (en) Handwritten character recognizing device and its program recording medium
JP3037727B2 (en) OCR system
JPH06162266A (en) Method for recognizing on-line handwritten character and device therefor
KR100204618B1 (en) Method and system for recognition of character or graphic
JP3045086B2 (en) Optical character reading method and apparatus
Saeed et al. Intelligent feature extract system for cursive-script recognition
JP2851865B2 (en) Character recognition device
JPH06251187A (en) Method and device for correcting character recognition error
JPH10334190A (en) Character recognition method and device and recording medium
JP2665488B2 (en) Personal dictionary registration method
JPS61220081A (en) Segmentation and recognition system for pattern
JPH0944593A (en) Character recognition controller
JP2001344567A (en) Device and method for recognizing character and recording medium with program for performing the method recorded thereon
JPS6318483A (en) Character recognizing method for optical information input device
JPH03224079A (en) Character recognizer
JPH0887570A (en) Information recognizing method