JPH01125683A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPH01125683A
JPH01125683A JP62169453A JP16945387A JPH01125683A JP H01125683 A JPH01125683 A JP H01125683A JP 62169453 A JP62169453 A JP 62169453A JP 16945387 A JP16945387 A JP 16945387A JP H01125683 A JPH01125683 A JP H01125683A
Authority
JP
Japan
Prior art keywords
character
data
size
image
rom
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP62169453A
Other languages
Japanese (ja)
Inventor
Mikio Aoki
三喜男 青木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Seiko Epson Corp
Original Assignee
Seiko Epson Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Seiko Epson Corp filed Critical Seiko Epson Corp
Priority to JP62169453A priority Critical patent/JPH01125683A/en
Publication of JPH01125683A publication Critical patent/JPH01125683A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To recognize a character of any size with one datum, and to reduce a ROM memory quantity and a recognizing time by normalizing a character pattern and a character image to a constant size. CONSTITUTION:Character image information sent from a character extracting means is once stored in a RAM, thereafter, compared with character data in the ROM, and the character is recognized. A character recognizing means extracts a contour 8 of a character 7, and outputs the size of a circumscribed rectangle 9 with the use of the data of the outline 8. Since the proportion of enlargement and reduction at the time of normalization is obtained by the size, normalized character image data 6 can be obtained. The data are collated with normalized data 5 stored in the ROM. In the collation, the coincidence degree of the character is calculated, and the most coincident data out of the character data in the ROM are selected. Consequently, a single datum can recognize the character with any size.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は文字認識装置に関する。[Detailed description of the invention] [Industrial application field] The present invention relates to a character recognition device.

〔従来の技術〕[Conventional technology]

文字認識の方法としては、非常に多(の種類が存在する
。その中で代表的なものにはパターンマツチング法、特
徴抽出法等がある。パターンマツチング法は、データと
しである大きさのドツトパターンを持っており、該ドツ
トパターンを抽出された文字と重ね合わせることにより
ドツトパターンとの一致度を調べ候補となる文字を選び
出し、最終的に最も一致したドツトパターンを選ぶ方法
である。また、特徴抽出法は文字中の点、腺等の特徴を
拾い出し、これらの特徴をもとに階層的に分類して候補
文字を選び出す方法である。
There are many types of character recognition methods. Among them, the typical ones include pattern matching method and feature extraction method. Pattern matching method uses data with a certain size. The dot pattern is superimposed on the extracted character, the degree of matching with the dot pattern is checked, candidate characters are selected, and the most matching dot pattern is finally selected. Further, the feature extraction method is a method of picking out features such as points and glands in characters, and selecting candidate characters by hierarchically classifying them based on these features.

〔発明が解決しようとする問題点〕[Problem that the invention seeks to solve]

しかしながら、パターンマツチング法においては、アル
ゴリズムが比較的簡単であるものの、一つのバター/で
認識できる文字の種類は非常に限られており、多くの8
1類の文字を認識するためには、膨大な量のデータを必
要とし、その結果、メモリを多く必要とする。また検索
時間も非常にかかるという欠点をもつ。また、特徴抽出
法においては、そのアルゴリズムが非常に複雑で、プロ
グラムが非常に長(なりメモリを多く必要とする。
However, in the pattern matching method, although the algorithm is relatively simple, the types of characters that can be recognized with one butter/ are very limited, and there are many
Recognizing characters of type 1 requires a huge amount of data and, as a result, a large amount of memory. Another disadvantage is that the search time is very long. In addition, in the feature extraction method, the algorithm is very complex and the program is very long (and requires a lot of memory).

そこで本発明は、このような問題点を解決するものでそ
の目的とするところは、パターンマツチング法において
少量のメモリで多(の種類の字を確実に認識することに
ある。
The present invention is intended to solve these problems, and its purpose is to reliably recognize many types of characters using a small amount of memory in a pattern matching method.

〔問題点を解決するための手段〕[Means for solving problems]

文書画像を入力する入力手段と、入力画像を処理する画
像処理手段と、処理画像から入字列を抽出する文字列抽
出手段と、文字列から文字を抽出する文字抽出手段と、
該文字を認識する文字認識手段と、認識文字表示手段と
からなる文字認識装置において、文字認識手段は、文字
パターン及び文字画像を一定の大きさに正規化したもの
を用いることを特徴とする。また正規化の手段として、
文字の輪郭抽出の結果を用いて、文字外接矩形を抽出し
、該外接矩形を一定の大きさに正規化すると共に、内部
文字画像を比例して正規化する方法であることを特徴と
する。
An input means for inputting a document image, an image processing means for processing the input image, a character string extraction means for extracting an input character string from the processed image, a character extraction means for extracting characters from the character string,
A character recognition device comprising a character recognition means for recognizing the characters and a recognized character display means is characterized in that the character recognition means uses character patterns and character images normalized to a constant size. Also, as a means of normalization,
This method is characterized in that a character circumscribing rectangle is extracted using the result of character outline extraction, the circumscribing rectangle is normalized to a constant size, and an internal character image is proportionally normalized.

〔実施例〕〔Example〕

以下本発明について実施例に基づいて詳細に説明する。 The present invention will be described in detail below based on examples.

本発明の文字認!a装置は、第1図のブロック図に示す
ように、画像入力手段A1人手された画像から雑音等を
取り除(画像処理手段B1画像中から文字列を抽出する
文字列抽出手段01文字列から文字を抽出する文字抽出
手段D1抽出された文字を認識する文字認識手段E1認
識結果を表示する認識文字表示手段Fとから13成され
ている。
Character recognition of the present invention! As shown in the block diagram of FIG. It is composed of 13 characters including character extraction means D for extracting characters, character recognition means E for recognizing extracted characters, and recognized character display means F for displaying recognition results.

第2図は、前述の文字認識手段Eのブロック図である。FIG. 2 is a block diagram of the character recognition means E mentioned above.

文字抽出手段りから送られてきた文字画像データを一度
RAM4に納め、その後、ROM3にある文字データと
比較することにより文字を認識する構成になっている。
The character image data sent from the character extraction means is once stored in the RAM 4, and then compared with the character data stored in the ROM 3 to recognize the characters.

ROM 3に格納されているデータは、一定の大きさの
ドツトパターンである。
The data stored in ROM 3 is a dot pattern of a fixed size.

以下文字認識手段Eの動作を、第4図に示すフローチャ
ートに基づいて説明する。
The operation of the character recognition means E will be explained below based on the flowchart shown in FIG.

文字抽出手段りによって抽出された文字7は文字認識手
段Eにわたされる。ここで、文字7の輪郭8を抽出する
。輪郭8のデータを用いて、文字7の外接矩形9を抽出
する。外接矩形9の大きさがわかることにより、正規化
時の適大、縮少の割合を知ることができ、正規化した文
字画像データ6を得ることができる。該文字画像データ
6と、ROi’vl 3に格納されている正規化したデ
ータ5とを照合する。 この照合で文字の一致度を計算
する。次の段階で、 該一致度が今までの照合文字デー
タの中で最高かどうかを判断し、もし最高ならば、該当
文字を書き替え次の文字データへ。最高で無いならばそ
のまま次の文字データとの照合を行う。照合がすべて行
われた時点で、ROM3に格納されている文字データで
最も一致度の大きいものが選び出される。次の段階で、
一致度がある値以上かどうかを調べ、ある値以下のもの
は該当文字で無いと判断し、ある値以上のものは該当文
字と判断し、認m文字表示手段Fにデータを送る。この
ようにして文字の認識が行われる。
The character 7 extracted by the character extraction means is passed to the character recognition means E. Here, the outline 8 of the character 7 is extracted. A circumscribed rectangle 9 of the character 7 is extracted using the data of the outline 8. By knowing the size of the circumscribed rectangle 9, it is possible to know the appropriate size and reduction ratio during normalization, and normalized character image data 6 can be obtained. The character image data 6 is compared with the normalized data 5 stored in the ROi'vl 3. This comparison calculates the degree of matching of characters. In the next step, it is determined whether the matching degree is the highest among the collated character data so far, and if it is the highest, the corresponding character is rewritten and moved on to the next character data. If it is not the highest, it is compared with the next character data. When all the comparisons have been made, the character data stored in the ROM 3 with the highest degree of matching is selected. In the next step,
It is checked whether the degree of matching is greater than a certain value, and if it is less than a certain value, it is determined that it is not the relevant character, and if it is more than a certain value, it is determined that it is the relevant character, and the data is sent to the recognized character display means F. Character recognition is performed in this way.

以上の様に、本発明によれば、一つの文字データであら
ゆる大きさの文字の認…が可能となる。
As described above, according to the present invention, characters of all sizes can be recognized using one character data.

〔発明の効果〕〔Effect of the invention〕

以上述べた様に本発明によれば、文字認識時において、
一つの文字データであらゆる大きさの文字を4顔するこ
とが可能となる。本発明の文字認識方法は、活字文字を
対象としたパターンマツチング方法である。パターンマ
ツチング法においては、一つ一つの文字に対応して−っ
−っのデータが存在し、このデータを重ね合わせること
によりその一致度を調べ、認識する方法である。したが
って、認識文字数及び認識文字体数を多くするほど、デ
ータ量も必然と多くなる。結果として、ROMのメモリ
ー量の増大、 認識時間の増大となる。しかしながら、
本発明のように文字の大きさを一担一定の大きさに変換
して照合することによリ、一つのデータであらゆる大き
さの文字の認識が可能となる。その結果ROMのメモリ
ー量の減少、認識時間の減少とりう効果を得る。
As described above, according to the present invention, during character recognition,
It is possible to make four faces of characters of any size with one character data. The character recognition method of the present invention is a pattern matching method for printed characters. In the pattern matching method, there is data corresponding to each character, and the degree of matching is checked and recognized by overlapping this data. Therefore, as the number of recognized characters and fonts increases, the amount of data inevitably increases. As a result, the amount of ROM memory increases and the recognition time increases. however,
By converting the size of characters to a constant size and comparing them as in the present invention, it becomes possible to recognize characters of any size using one piece of data. As a result, the effect of reducing the amount of memory in the ROM and the recognition time can be obtained.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図に本発明の文字認識装置のブロック図。 第2図に本発明の認識手段のブロック図。 第3図に本発明の文字画像の拡大、縮少を示した図。 第4図に本発明の認識手段のフローチャートを示す。 A・・・画像入力手段 B・・・vi像処理手段 C・・・文字列抽出手段 D・・・文字抽出手段 E・・・文字認識手段 F・・・認識文字表示手段 1・・・文字認識手段 2・・・CPU 3・・・ROM 4 ・・・RA 〜1 5・・・文字データ 6・・・正規化後文字画像 7・・・文字画像 8・・・文字輪郭 9・・・文字外接矩形 以  上 出願人 セイコーエプソン株式会社 代理人 弁理士 最 上  務 他1名第4図 FIG. 1 is a block diagram of a character recognition device of the present invention. FIG. 2 is a block diagram of the recognition means of the present invention. FIG. 3 is a diagram showing enlargement and reduction of a character image according to the present invention. FIG. 4 shows a flowchart of the recognition means of the present invention. A... Image input means B...vi image processing means C...Character string extraction means D...Character extraction means E...Character recognition means F... Recognized character display means 1...Character recognition means 2...CPU 3...ROM 4...RA ~1 5...Character data 6... Character image after normalization 7...Character image 8...Character outline 9...Character circumscribing rectangle that's all Applicant: Seiko Epson Corporation Agent: Patent attorney Mogami and 1 other person Figure 4

Claims (3)

【特許請求の範囲】[Claims] (1) 文書画像を入力する入力手段と、入力画像を処
理する画像処理手段と、処理画像から文字列を抽出する
文字列抽出手段と、文字列から文字を抽出する文字抽出
手段と、該文字を認識する文字認識手段と、認識文字表
示手段とからなる文字認識装置において、文字認識手段
は、文字パターン及び文字画像を一定の大きさに正規化
したものを用いることを特徴とする文字認識装置。
(1) An input means for inputting a document image, an image processing means for processing the input image, a character string extraction means for extracting a character string from the processed image, a character extraction means for extracting a character from the character string, and the character A character recognition device comprising a character recognition means for recognizing a character and a recognized character display means, wherein the character recognition means uses a character pattern and a character image normalized to a certain size. .
(2) 正規化の手段は文字外接矩形を抽出し、該外接
矩形を一定の大きさに正規化すると共に内部文字画像も
比例して正規化する方法であることを特徴とする特許請
求の範囲第一項記載の文字認識装置。
(2) The scope of the claim characterized in that the normalization means is a method of extracting a character circumscribing rectangle, normalizing the circumscribing rectangle to a certain size, and proportionally normalizing the internal character image. The character recognition device according to item 1.
(3) 外接矩形の抽出手段として、文字の輪郭抽出の
結果を用いることを特徴とする特許請求の範囲第二項記
載の文字認識装置。
(3) The character recognition device according to claim 2, characterized in that the result of character outline extraction is used as the circumscribed rectangle extraction means.
JP62169453A 1987-07-07 1987-07-07 Character recognizing device Pending JPH01125683A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP62169453A JPH01125683A (en) 1987-07-07 1987-07-07 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP62169453A JPH01125683A (en) 1987-07-07 1987-07-07 Character recognizing device

Publications (1)

Publication Number Publication Date
JPH01125683A true JPH01125683A (en) 1989-05-18

Family

ID=15886878

Family Applications (1)

Application Number Title Priority Date Filing Date
JP62169453A Pending JPH01125683A (en) 1987-07-07 1987-07-07 Character recognizing device

Country Status (1)

Country Link
JP (1) JPH01125683A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05159104A (en) * 1991-12-04 1993-06-25 Nippon Telegr & Teleph Corp <Ntt> Character recognition communication system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05159104A (en) * 1991-12-04 1993-06-25 Nippon Telegr & Teleph Corp <Ntt> Character recognition communication system

Similar Documents

Publication Publication Date Title
JP3155577B2 (en) Character recognition method and device
JP3105967B2 (en) Character recognition method and device
US5621818A (en) Document recognition apparatus
EP0432937B1 (en) Hand-written character recognition apparatus
JPH01125683A (en) Character recognizing device
JPH05225394A (en) Candidate-character classifying method for character recognizing system
JPH0589190A (en) Drawing information checking system
JP2762472B2 (en) Character recognition method and character recognition device
JP3305367B2 (en) Data entry device for database
JP2851865B2 (en) Character recognition device
JPH07107698B2 (en) Character recognition method
JP2612383B2 (en) Character recognition processing method
JPS63269267A (en) Character recognizing device
JPH09128484A (en) Character recognizing method
JPS62257583A (en) Character recognizing system
JP2001060250A (en) Method and device for character recognition
JPS5929246Y2 (en) Online recognition processing device for handwritten characters
JPH09330377A (en) Device and method for recognizing handwritten character
JP2972443B2 (en) Character recognition device
JP3245241B2 (en) Character recognition apparatus and method
JP2963474B2 (en) Similar character identification method
JP2977244B2 (en) Character recognition method and character recognition device
JP2732753B2 (en) Fuzzy pattern recognition device
JPH1021398A (en) Method for extracting directional characteristic vector
JPH03219384A (en) Character recognizing device