JPH01255080A - Optical character reading system - Google Patents

Optical character reading system

Info

Publication number
JPH01255080A
JPH01255080A JP63081255A JP8125588A JPH01255080A JP H01255080 A JPH01255080 A JP H01255080A JP 63081255 A JP63081255 A JP 63081255A JP 8125588 A JP8125588 A JP 8125588A JP H01255080 A JPH01255080 A JP H01255080A
Authority
JP
Japan
Prior art keywords
matching
feature
value
similarity
distance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63081255A
Other languages
Japanese (ja)
Inventor
Kazumi Suzuki
和美 鈴木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd filed Critical Hitachi Ltd
Priority to JP63081255A priority Critical patent/JPH01255080A/en
Publication of JPH01255080A publication Critical patent/JPH01255080A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To reduce reading omission without increasing misreading by dynamically changing a feature value for the dispersion of a pattern and making larger the similarity degree. CONSTITUTION:In a matching retrial part 18 and an increase and decrease propriety table 19, the feature values are successively taken off from the features of an unknown pattern and a dictionary, and the total matching distance is obtained. Next, the feature value for each feature number stored in a table is taken off. Then, the value obtained by adding or subtracting the fine value to or from the feature value is substituted for a function, to matching distances are obtained, and the smaller matching distance is obtained. The feature value stored into the table and the matching distance are subtracted from the whole matching distance, and further, the obtained smaller matching distance is added. These processings are successively executed, the corrected matching distance is obtained, and sent to a decision part 15, and when the distance is smaller than the value set beforehand, it is decided to be a candidate character, and outputted from an answer output part 16.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は光学文字読取方式にかかシ、更に詳しくは書体
が大幅に変形した変形パターンの文字認識に好適な光学
文字読取方式に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to an optical character reading system, and more particularly to an optical character reading system suitable for character recognition of deformed patterns in which the font is significantly deformed.

〔従来の技術〕[Conventional technology]

従来の光学文字読取方式においては、書体が大幅に変形
した変形パターンを文字認識する場合、変形パターンを
辞書に登録することにより対応していた。即ち、変形パ
ターンに対して、各特徴値′・のバラツキを吸収するた
め、各特徴値読み取シ範囲を広げることによって、対応
していた。このよ・うな従来技術に関するものとしては
、例えば路間・昭62−169289号公報に開示され
た発明が存在す・る。               
       ILI〔発明が解決しようとする課題〕 上記した従来技術において、各特徴値のバラツ6キは多
種多様であるため、変形パターンの各特徴値のバラツキ
を全て吸収するためには、辞書が膨大となってしまうと
いう問題点があった。また、1この様な辞書を用いると
、不読は減少するが、誤。
In conventional optical character reading systems, when characters are recognized in a deformed pattern in which the typeface is significantly deformed, the deformed pattern is registered in a dictionary. That is, in order to absorb the variation in each characteristic value '· with respect to the deformed pattern, the reading range of each characteristic value was widened. As related to such prior art, for example, there is an invention disclosed in Rima No. 169289/1989.
ILI [Problem to be Solved by the Invention] In the above-mentioned conventional technology, since the variations in each feature value are diverse, the dictionary must be huge in order to absorb all the variations in each feature value of the deformation pattern. There was a problem with this. Also, 1.Using such a dictionary will reduce the number of non-readers, but it is a mistake.

読が増加するという問題点があった。There was a problem that the number of readers increased.

この発明は、上記した従来技術の問題点に鑑みなされた
もので、膨大な辞書を必要とせず、かつ誤読を増加する
ことなく不読を減少させろことが可能な光学文字読取方
式を提供することにある。
The present invention was made in view of the problems of the prior art described above, and it is an object of the present invention to provide an optical character reading method that does not require a huge dictionary and can reduce misreading without increasing misreading. It is in.

〔課題を解決するための手段〕[Means to solve the problem]

この発明の光学文字読取方式は、入力文字パターンに対
して複数の特徴値を抽出し、あらかじめ準備した辞書に
格納されている文字毎の複数の特・徴値のそれぞれと上
記抽出した各特徴値とにもと・すいて、特徴毎に類似度
を計算し、計算された各・類似度を合計して全体類似度
を求め、候補文字を・定めろ光学文字読取方式に適用さ
れるものであり、特に類似度の小さい特徴のうち少なく
とも1つの14、特徴について、抽出された特徴値を増
減した値を。
The optical character reading method of the present invention extracts a plurality of feature values from an input character pattern, and extracts each of the plurality of features/feature values for each character stored in a dictionary prepared in advance and each of the above extracted feature values. This method is applied to the optical character reading method, which calculates the similarity for each feature, sums up each calculated similarity to obtain the overall similarity, and determines candidate characters. For at least one of the features with particularly low similarity, the extracted feature value is increased or decreased.

用いて再度類似度を計算し、この再度計算された。The similarity was calculated again using this recalculation.

類似度を用いて全体類似度を求め、候補文字を定めるこ
とを特徴としている。
It is characterized by determining the overall similarity using the similarity and determining candidate characters.

〔作用〕                   1・
一般に、1文字に対する辞書は多数の特徴によシ構成さ
れておシ、数個の特徴の類似度がたまたま極端に小さい
場合、不読となってしまう。しかし、同一カテゴリの場
合、微少に特徴値を増減させることで、類似度が大きく
なる。逆に、異カテ、1ゴリの場合は(文字が異なる場
合)類似度が大きくならない。そこで、微少に特徴値を
増減させる゛ことによって、辞書本来の識別能力を劣化
させる。
[Effect] 1.
Generally, a dictionary for one character is made up of many features, and if the degree of similarity between some features happens to be extremely small, the dictionary becomes illegible. However, in the case of the same category, the degree of similarity increases by slightly increasing or decreasing the feature value. On the other hand, in the case of different categories and 1 gori (characters are different), the degree of similarity does not increase. Therefore, by slightly increasing or decreasing the feature values, the inherent discrimination ability of the dictionary is degraded.

ことは、はとんど無い。従って、マツチングの取れない
、類似度の小さい特徴に対して特徴値を少−・量増減さ
せることは、パターンの特徴のノ(ラツキ・を吸収する
ことになるので、誤読を増加させるこ・となく不読を減
少することができる。
That is very unlikely. Therefore, increasing or decreasing the feature value by a small amount for features with low similarity that cannot be matched will absorb the irregularity of the pattern features, which may increase misreading. Unreading can be reduced without any problems.

〔実施例〕〔Example〕

以下添付の図面に示す実施例によシ、更に詳細1.1に
本発明について説明する〇 第1図において、走査部・光電変換部11は、帳。
The present invention will be described below in more detail in 1.1 according to the embodiments shown in the accompanying drawings. In FIG.

票の読取領域に光を照射しその帳票からの反射光を受光
し電気信号に変換する。帳票からの反射光は、帳票上の
文字とか図形とかパターンの濃淡情1゜報を反映したも
のであり、電気信号はそのパター。
Light is irradiated onto the reading area of the form, and the reflected light from the form is received and converted into an electrical signal. The light reflected from a form reflects the shading information of the characters, figures, and patterns on the form, and the electrical signal is the pattern.

ンの形態に応じた信号である。This is a signal that corresponds to the mode of operation.

前処理部12は、電気信号の2値化、及びその結果を内
部のパターンメモリに格納する処理を行う、。
The preprocessing unit 12 performs a process of binarizing the electrical signal and storing the result in an internal pattern memory.

更に、この前処理部12にあっては、パターンメモ11
.。
Furthermore, in this preprocessing section 12, the pattern memo 11
.. .

、6 。, 6.

すから1文字分のパターンを切出し、特徴抽出部13に
送る。
A pattern for one character is cut out from the blank and sent to the feature extraction section 13.

特徴抽出部13は、1文字分の人カバターンに対して内
輪、外輪の数、線分間の位置関係などの特徴を抽出し、
大まかなパターンの分類を行う。次)に、分類されたパ
ターンを読取るのに有効な特徴・を抽出する〇 マツチング部14は、抽出した特徴値と辞書の辞・書特
徴値を照合し、マツチング距離を求める。マ。
The feature extraction unit 13 extracts features such as the number of inner rings and outer rings, and the positional relationship between line segments for one character's human cover pattern,
Perform rough pattern classification. Next), the matching unit 14 extracts features that are effective for reading the classified patterns, and matches the extracted feature values with the dictionary feature values in the dictionary to obtain a matching distance. Ma.

ッチング距離とは、前記した類似度とは逆の性質1.。The etching distance is a property that is opposite to the above-mentioned similarity. .

のものであシ、値が少さい方がより類似している。The smaller the value, the more similar it is.

ことを意味する。辞書に登録されている文字に対。It means that. Pairs with characters registered in the dictionary.

して全てマツチングを行ない、マツチング距離があらか
じめ定めた値よシ小さいものすべてを候補とする。  
                  1゜判定部15
は、候補が複数個出力された場合、ぺ。
All the matching distances are smaller than a predetermined value, and all the matching distances are selected as candidates.
1° determination unit 15
If multiple candidates are output, P.

ア判定によシーつに候補をしばる処理を行い、さらに最
終候補が妥当であるかを判定し、答えを答え出力部16
に渡す。
The output unit 16 performs processing to narrow down the candidates based on a determination, further determines whether the final candidate is valid, and outputs the answer.
give it to

上記、マツチング部14および判定部15で候補が51
,1.4 。
In the above, the matching unit 14 and the determining unit 15 select 51 candidates.
, 1.4.

出力されない場合は、マツチングリトライ部1Bによシ
、モラー度マツチングを行う。これにより今。
If the output is not output, the matching retry section 1B performs moral degree matching. Due to this now.

まで候補として出力されなかった文字が候補とし゛て出
力され、結果として読取が向上される効果がある。
Characters that were not output as candidates until then are output as candidates, and as a result, there is an effect that reading is improved.

次に、マツチングリトライ部1日及び増減可否テ・−プ
ル19の動作について、第2図、第5図を用い・て説明
する。第2図に示すように、未知パターン・の特徴21
から特徴値x i (i=1.2,5.−)−が順次取
り出され、また辞書特徴22から特徴値y川i (i=
1.2,5.・・・)が順次取シ出され、関。
Next, the operation of the matching retry unit 1 day and the increase/decrease table 19 will be explained using FIGS. 2 and 5. As shown in Figure 2, the characteristics of unknown pattern 21
The feature values x i (i=1.2, 5.-)− are sequentially extracted from the dictionary features 22, and the feature values y i (i=
1.2,5. ) are taken out one after another, and the results are displayed.

数f(xi、yi)に代入され、マツチング距離。Substituted into the number f(xi, yi), the matching distance.

Di (i=1.2,3.・・・)が求められる0マツ
チング距離Diは順次加算され、全体マツチング距離り
が求められる。このとき、第2図に示すよ、。
The 0 matching distances Di for which Di (i=1.2, 3, . . . ) are determined are sequentially added, and the total matching distance is determined. At this time, as shown in Figure 2.

うに、マツチング距離Diの大きい順に特徴番号。feature numbers in descending order of matching distance Di.

iとマツチング距離Diをテーブル25にソートしてお
く。この実施例では、最大5個ソートしているが、この
数は任意で良い。
i and matching distance Di are sorted in table 25. In this embodiment, a maximum of five items are sorted, but this number may be arbitrary.

次に、第3図に示すように、テーブル25に記憶2.。Next, as shown in FIG. 3, the table 25 stores 2. .

された各特徴番号lについて、未知パターン特徴21の
特徴値xiを取シ出し、かつ同じくかつ特徴。
For each feature number l, extract the feature value xi of the unknown pattern feature 21, and find the same and feature value xi.

番号iについて辞書特徴22から特徴値yiを取り。For number i, take the feature value yi from the dictionary features 22.

出す。次に、特徴値xiに微少値Δdをプラス、マイナ
スした値(xi±Δd)を用いて関数f □・(xi±
△d、yi)に代入し、2つのマヅチン・グ距離d1.
d2を求める。次に、求めた2つの・マツチング距離d
1.d2の内、小さい方のマツ・チング距離を求める。
put out. Next, the function f □・(xi±
Δd, yi) and the two matching distances d1.
Find d2. Next, the two matching distances d
1. Find the smaller pine cutting distance of d2.

そして、全体マツチング距・離りからテーブル25に記
憶されているものとマーy1,1チング距離Diを減算
し、更に上記の処理によつ。
Then, what is stored in the table 25 and the mark y1,1 matching distance Di are subtracted from the overall matching distance/distance, and the above processing is further performed.

て求めた小さい方のマ・ソチング距離(、d 1又はd
The smaller ma-soching distance (, d 1 or d
.

2)を加算する。上記の処理をテーブル25に記憶され
た特徴番号1について順次実行し、修正された全体特徴
距離D′を求める。こうして求められた1゜修正マツチ
ング距離D′が判定部15に送られ、修正。
2) is added. The above process is executed sequentially for the feature number 1 stored in the table 25 to obtain the corrected overall feature distance D'. The 1° corrected matching distance D' thus obtained is sent to the determination section 15 and corrected.

マツチング距離[)′があらかじめ定められた値よシ小
さいか否かが判定され、小さい場合には候補文字とされ
ろ。
It is determined whether the matching distance [)' is smaller than a predetermined value, and if it is smaller, the character is selected as a candidate character.

ここで、増減可否テーブル19は、特徴値xiv2L△
dだけ増減可能か否かを文字の特徴値毎に記憶している
ものであシ、マツチングリトライ部18け”増減可否テ
ーブル19を参照して処理をおこなう。゛これは、文字
の種類によっては、特徴値のΔdの。
Here, the increase/decrease possibility table 19 has the feature value xiv2L△
Whether or not it is possible to increase or decrease by d is stored for each character feature value, and the matching retry unit 18 performs processing by referring to the increase/decrease table 19. This depends on the type of character. is the feature value Δd.

増減によって、かえって誤判定を招くものがある5から
である。このような文字には、例えば数字の・「1−1
とカタカナの「ノ」のような類似文字かあ・る。この増
減可否テーブル19は、増加可フラグと・減少可フラグ
からなシ、それぞれ増加可、減少可・を 1 で表わし
、否可の場合な 0 で表わす。よILIって増減可否
テーブル19は、1つの特徴番号につ。
This is because an increase or decrease may actually lead to erroneous determination5. Such characters include, for example, the numbers ``1-1''
and similar characters like ``ノ'' in katakana. This increase/decrease permission table 19 has an increase/decrease flag and a ``1'' to indicate that the increase is allowed or a decrease is allowed, respectively, and a 0 to indicate that the value is not allowed. The ILI increase/decrease possibility table 19 is for one feature number.

いて2ビツトによシ構成される。It is composed of 2 bits.

この処理により、従来技術では候補として出力されなか
った文字が候補として出力される様になシ、結果として
読取シ効率が向上される効果かあ、。
Through this processing, characters that were not output as candidates in the conventional technology are now output as candidates, and as a result, the reading efficiency is improved.

る。Ru.

〔発明の効果〕〔Effect of the invention〕

本発明によれば、パターンのバラツキに対してダイナミ
ックに特徴値を変えることが出来るので、誤読を増加さ
せろことなく不読を減少させる効果。11.7 。
According to the present invention, it is possible to dynamically change the feature value in response to pattern variations, thereby reducing the number of misreadings without increasing the number of misreadings. 11.7.

がある。There is.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例を示すブロック図、。 第2図及び第3図は第1図に示す実施例の動作を示す説
明図である。 11・・・走査部・光電変換部、12・・・前処理部・
特徴・抽出部、14・・・マツチング部、15・・・判
定部、16・・・答・え出力部、17・・・辞書部、1
8・・・マツチングトライ部−19・・・増減可否テー
ブル。 、 8
FIG. 1 is a block diagram showing one embodiment of the present invention. FIGS. 2 and 3 are explanatory diagrams showing the operation of the embodiment shown in FIG. 1. 11...Scanning section/photoelectric conversion section, 12...Preprocessing section/
Feature/extraction unit, 14... Matching unit, 15... Judgment unit, 16... Answer/e output unit, 17... Dictionary unit, 1
8...Matching try section-19...Increase/decrease possibility table. , 8

Claims (1)

【特許請求の範囲】[Claims] 1、入力文字パターンに対して複数の特徴値を抽出し、
あらかじめ準備した辞書に格納されている文字毎の特徴
値と上記抽出した特徴値とにもとずいて、特徴毎に類似
度を計算し、各類似度を合計して全体類似度を求め、候
補文字を定める光学文字読取方式において、上記類似度
の小さい特徴のうち少なくとも1つの特徴について、抽
出された特徴値を増減した値を用いて再度類似度を計算
し、この再度計算された類似度を用いて全体類似度を求
め、候補文字を定めることを特徴とする光学文字読取方
式。
1. Extract multiple feature values for the input character pattern,
Based on the feature values for each character stored in a dictionary prepared in advance and the feature values extracted above, the similarity is calculated for each feature, and the overall similarity is calculated by summing each similarity, and the candidate is selected. In the optical character reading method that determines characters, the similarity is calculated again using the extracted feature value increased or decreased for at least one of the features with low similarity, and this recalculated similarity is An optical character reading method characterized by determining candidate characters by determining overall similarity.
JP63081255A 1988-04-04 1988-04-04 Optical character reading system Pending JPH01255080A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63081255A JPH01255080A (en) 1988-04-04 1988-04-04 Optical character reading system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63081255A JPH01255080A (en) 1988-04-04 1988-04-04 Optical character reading system

Publications (1)

Publication Number Publication Date
JPH01255080A true JPH01255080A (en) 1989-10-11

Family

ID=13741275

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63081255A Pending JPH01255080A (en) 1988-04-04 1988-04-04 Optical character reading system

Country Status (1)

Country Link
JP (1) JPH01255080A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012173837A (en) * 2011-02-18 2012-09-10 Denso Wave Inc Non-contact communication medium reader

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012173837A (en) * 2011-02-18 2012-09-10 Denso Wave Inc Non-contact communication medium reader

Similar Documents

Publication Publication Date Title
Chi et al. Separation of single-and double-touching handwritten numeral strings
US4769851A (en) Apparatus for recognizing characters
JPH01255080A (en) Optical character reading system
JP4194020B2 (en) Character recognition method, program used for executing the method, and character recognition apparatus
JP2930996B2 (en) Image recognition method and image recognition device
JP3207566B2 (en) Character recognition method
JP2902097B2 (en) Information processing device and character recognition device
JP2732748B2 (en) Fuzzy pattern recognition device
JPS61114388A (en) Character input device
JPH03224079A (en) Character recognizer
JPS6089290A (en) Pattern recognition method
JPS60138689A (en) Character recognizing method
JPS58165178A (en) Character reader
JP2953162B2 (en) Character recognition device
JPH0212391A (en) Character recognizing system
JP2639314B2 (en) Character recognition method
JPS60138688A (en) Character recognizing method
JPS5822781B2 (en) Character pattern recognition processing method
JPS5914078A (en) Reader of business form
JPS59149569A (en) Optical character reader
JPS59157776A (en) Pattern recognizing system
JPS593793B2 (en) optical character reader
JPH0340186A (en) Character recognizer
JPS61153781A (en) Optical character reading device
JPS63178379A (en) Optical character reader