JPH01279385A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPH01279385A
Authority
JP
Japan
Prior art keywords
white
black
character
picture
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP63108737A
Other languages
Japanese (ja)
Other versions
JP2743378B2 (en)
Inventor
Mikio Aoki
三喜男 青木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Seiko Epson Corp
Original Assignee
Seiko Epson Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Seiko Epson Corp
Priority to JP63108737A
Publication of JPH01279385A
Application granted
Publication of JP2743378B2
Anticipated expiration
Expired - Lifetime

Landscapes

  • Character Input (AREA)

Abstract

PURPOSE: To recognize inverted characters written in white on a black ground through ordinary operation, by making both normal character pictures (black on white paper) and inverted character pictures (white on black paper) recognizable. CONSTITUTION: A picture inverting means 7 is provided, and the numbers of white and black picture elements in an inputted picture 12 or 13 are counted. When the inputted picture is black characters on a white ground, white picture elements outnumber black ones in most cases; when it is white characters on a black ground, black picture elements outnumber white ones in most cases. The two counts are therefore compared to discriminate whether the inputted picture is picture 12 or picture 13. When it is picture 12, it is not processed further and is treated as the inversion-processed picture 15. When it is picture 13, the white and black picture elements are inverted and the result is treated as the inversion-processed picture 15. Consequently, both black characters on a white ground and white characters on a black ground can be recognized by an ordinary recognizing means.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Industrial Application Field] The present invention relates to a character recognition device that takes characters written on paper as an input image, locates the character regions in that image, and converts them into code numbers.

[Prior Art] A character recognition device reads characters written on paper into a storage device as an image using an input device such as a scanner, automatically locates the character positions in the captured image data, compares the data at each located character position with the character data that serves as a dictionary to determine which character the image represents, and outputs the result as a character code.

Character positions are generally located in an input character image by the following procedure. An input image normally contains several lines of text at once, so each line must first be cut out of the input image. To do this, the peripheral distribution in the row direction is taken, and each cluster of black pixels in that distribution is judged to be a text line and cut out. Next, the peripheral distribution in the direction perpendicular to the row direction is taken, and, as in the row direction, each cluster of black pixels is judged to be a character position. The data at each character position determined in this way (for example, peripheral features, Transactions of the Institute of Electronics and Communication Engineers, '82, Vol. J65-D, No. 8, pp. 1026-1033; mesh features, Research and Practical Application Report, Vol. 34, No. 1, pp. 47-58) is then compared with the character data held as a dictionary, and the character is recognized by finding the closest entry. All of these methods, from character extraction through recognition, are designed for normal character images with black characters on a white background.
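The line and character cut-out procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the method of any particular device: it assumes a binary image stored as a list of rows with 1 for a black pixel and 0 for a white pixel, treats the "peripheral distribution" as a per-row or per-column count of black pixels, and uses the hypothetical names `runs_of_black` and `segment_characters`.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed encoding: 1 = black pixel, 0 = white pixel


def runs_of_black(profile: List[int]) -> List[Tuple[int, int]]:
    """Return half-open (start, end) ranges where the profile is non-zero."""
    runs = []
    start = None
    for i, count in enumerate(profile):
        if count > 0 and start is None:
            start = i                      # a cluster of black pixels begins
        elif count == 0 and start is not None:
            runs.append((start, i))        # the cluster ended at index i
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def segment_characters(image: Image) -> List[Tuple[int, int, int, int]]:
    """Return (top, bottom, left, right) boxes, one per character candidate."""
    boxes = []
    # Row-direction distribution: black pixels per row gives the text lines.
    row_profile = [sum(row) for row in image]
    for top, bottom in runs_of_black(row_profile):
        line = image[top:bottom]
        # Perpendicular distribution inside the line gives character positions.
        col_profile = [sum(col) for col in zip(*line)]
        for left, right in runs_of_black(col_profile):
            boxes.append((top, bottom, left, right))
    return boxes


sample = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
boxes = segment_characters(sample)
```

Because the sketch only looks for runs of black pixels, running it on the same image with black and white swapped returns one box covering the whole image instead of separate characters.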

[Problems to Be Solved by the Invention] However, the character images one ordinarily encounters are not limited to black characters on a white background. A considerable number have bright characters, such as white, on a black or near-black dark background, and when such a character image is input, the characters can be neither cut out nor recognized. Moreover, to recognize black-and-white inverted characters, character extraction would have to treat clusters of white as character regions and then compare them with character image data prepared specifically for inverted characters, which doubles the size of the dictionary of character image data. In addition, whether or not the characters are inverted would have to be supplied from outside by some means. Recognizing inverted characters in this way is very inefficient and wastes memory. The present invention solves these problems. Its object is to provide a character recognition means and recognition method that can recognize, through ordinary operation, not only normal characters such as black characters on a white background but also black-and-white inverted characters such as white characters on a black background.

[Means for Solving the Problems] The present invention is: (1) a character recognition device comprising an image input means for inputting a document image, a character string extraction means for extracting character strings from the input image, a character extraction means for extracting characters from an extracted character string, a character recognition means for recognizing the characters, and a recognized character display means for displaying the recognition result, characterized in that it can recognize both ordinary character images, such as black characters written on white paper, and inverted character images, such as white characters written on black paper.

Further, (2) the above character recognition device is characterized in that, when the read image has white characters on a black background, the read data is inverted between black and white before the recognition operation is performed.

(3) The determination of whether the image has black characters on a white background or white characters on a black background is characterized by using statistics of the white pixels and black pixels.

[Embodiment] The present invention is described in detail below based on an embodiment.

As shown in the block diagram of FIG. 1, the character recognition device of the present invention consists of a CPU 1, an image input device 2, a recognized character display device 3, a ROM 4, and a RAM 5. As shown in the block diagram of FIG. 2, its operation is carried out by an image input means 6; an image inversion means 7, which determines whether the input image has black characters on a white background or white characters on a black background and inverts the image between black and white in the latter case; a character string extraction means 8, which extracts character strings from the image; a character extraction means 9, which extracts characters from a character string; a character recognition means 10, which recognizes the extracted characters; and a recognition result display means 11, which displays the recognized characters.

The operation of the character recognition device of the present invention, which does not depend on whether the input character image has black characters on a white background or white characters on a black background, is described below with reference to the flowchart in FIG. 4.

In the image input means 6, a character image is read into the RAM 5 by the image input device 2. It is not known whether the data read into the RAM 5 is black-on-white data 12 or white-on-black data 13. If the character string extraction means 8 attempted extraction on the data as it stands, it would compute the peripheral distribution in the row direction and pick up regions of black pixels, so text lines could not be extracted from white-on-black data. The present invention therefore provides the image inversion means 7 (inversion processing means 16) at this point.

The inversion processing means 16 counts the white pixels and the black pixels in the input image 12 (13). If the input image has black characters on a white background, white pixels outnumber black pixels in most cases; if it has white characters on a black background, black pixels outnumber white pixels in most cases. Accordingly, at 14 in FIG. 3 the white pixel count is compared with the black pixel count to determine whether the input image is 12 or 13. If the input image is 12, nothing is done at 14-1 and the image becomes the inversion-processed image 15. If the input image is 13, the white and black pixels are inverted at 14-2 and the result becomes the inversion-processed image 15. Because the image 15 obtained in this way has black characters on a white background, ordinary recognition processing suffices from this point on. The character recognition means 10 does not need to hold additional character data for white-on-black characters in the ROM 4, so more characters can be recognized per unit of memory.

In the present invention, the determination of black-on-white versus white-on-black is reliable if the white and black pixels of the entire input image are counted, but even if the determination is based on counting the pixels in only 1/16 to 1/8 of the input image area, it is almost never wrong.

As described above, the white and black pixels of the input image are counted to determine whether the image has black characters on a white background or white characters on a black background. If it has white characters on a black background, the white and black pixels are inverted to produce a character image with black characters on a white background, so black-and-white inverted characters can be recognized with ordinary recognition.
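The count-compare-invert step can be sketched in a few lines. This is only an illustration under assumptions that the source does not specify: pixels are stored as 1 for black and 0 for white, a tie between the two counts is treated as a normal image, and the partial count uses the top rows of the image (the source gives only the 1/16 to 1/8 fraction, not which region is sampled). The names `is_white_on_black`, `normalize_polarity`, and `sample_fraction` are hypothetical.

```python
from typing import List

Image = List[List[int]]  # assumed encoding: 1 = black pixel, 0 = white pixel


def is_white_on_black(image: Image, sample_fraction: float = 1.0) -> bool:
    """Guess polarity by comparing black and white pixel counts.

    With sample_fraction < 1.0 only the top rows are counted; the choice
    of region is an assumption made for this sketch.
    """
    n_rows = max(1, int(len(image) * sample_fraction))
    region = image[:n_rows]
    black = sum(sum(row) for row in region)
    total = sum(len(row) for row in region)
    white = total - black
    return black > white  # a tie is treated as a normal (black-on-white) image


def normalize_polarity(image: Image, sample_fraction: float = 1.0) -> Image:
    """Return a black-on-white copy of the image, inverting it if needed."""
    if is_white_on_black(image, sample_fraction):
        return [[1 - p for p in row] for row in image]  # swap black and white
    return [row[:] for row in image]                    # leave unchanged


normal = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
inverted = [[1 - p for p in row] for row in normal]
restored = normalize_polarity(inverted)
```

Either input ends up in the same black-on-white form, so any later extraction or dictionary comparison only has to handle one polarity.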

Moreover, because the inversion decision is made automatically, the device is operated exactly like conventional character recognition devices; only its functionality is improved.

This embodiment has been described for a binary black-and-white input image, but the method is not limited to binary images. Even for data with gradation, a character recognition device with the same function can be provided by taking statistics of the gradation and, if the characters are inverted, inverting the gradation (taking the two's complement).
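A possible reading of the gradation case is sketched below. The source does not say which statistic is used or how levels are encoded, so every detail here is an assumption: levels run from 0 (black) to `max_level` (white), the statistic is the mean level compared with the midpoint, and inverting the gradation is written as `max_level - value`. The names `mean_level` and `normalize_gray_polarity` are hypothetical.

```python
from typing import List

GrayImage = List[List[int]]  # assumed encoding: 0 = black ... max_level = white


def mean_level(image: GrayImage) -> float:
    """Average gray level over all pixels (0.0 for an empty image)."""
    pixels = [p for row in image for p in row]
    return sum(pixels) / len(pixels) if pixels else 0.0


def normalize_gray_polarity(image: GrayImage, max_level: int = 255) -> GrayImage:
    """Invert the gradation when the image is dark on average."""
    if mean_level(image) < max_level / 2:
        # Mostly dark: assume bright characters on a dark background.
        return [[max_level - p for p in row] for row in image]
    return [row[:] for row in image]


dark = [
    [10, 10, 10],
    [10, 240, 10],
    [10, 10, 10],
]
light = [
    [250, 250],
    [250, 20],
]
dark_fixed = normalize_gray_polarity(dark)
light_fixed = normalize_gray_polarity(light)
```

Other statistics, such as a histogram split at a threshold, would fit the same description; the mean is used here only because it is the simplest.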

[Effects of the Invention] As described above, according to the present invention, the character recognition device automatically determines, by taking pixel statistics, whether the input image has black characters on a white background or white characters on a black background, and inverts black and white in the latter case. Both white-on-black and black-on-white characters can therefore be recognized with ordinary recognition means. Such a character recognition device has a wider range of uses than conventional ones and is more convenient.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of the character recognition device of the present invention.
FIG. 2 is a block diagram of the recognition means of the present invention.
FIG. 3 is a diagram showing character inversion in the present invention.
FIG. 4 is a flowchart of the character inversion determination of the present invention.

1 ... CPU
2 ... Image input device
3 ... Recognition result display device
4 ... ROM
5 ... RAM
6 ... Image input means
7 ... Image inversion means
8 ... Character string extraction means
9 ... Character extraction means
10 ... Character recognition means
11 ... Recognized character display means
12, 13 ... Input image
14 ... Inversion processing
15 ... Image after inversion processing
16 ... Inversion processing means

Applicant: Seiko Epson Corporation
Agent: Patent attorney Masayoshi Kamiyanagi (and one other)

Claims (3)

[Claims]
(1) A character recognition device comprising an image input means for inputting a document image, a character string extraction means for extracting character strings from the input image, a character extraction means for extracting characters from an extracted character string, a character recognition means for recognizing the characters, and a recognized character display means for displaying the recognition result, characterized in that it can recognize both ordinary character images, such as black characters written on white paper, and inverted character images, such as white characters written on black paper.
(2) The character recognition means according to claim 1, characterized in that, in the above character recognition device, when the read image has white characters on a black background, the read data is inverted between black and white before the recognition operation is performed.
(3) The character recognition means according to claim 1, characterized in that statistics of the white pixels and black pixels are used to determine whether the image has black characters on a white background or white characters on a black background.
JP63108737A 1988-04-30 1988-04-30 Character recognition method Expired - Lifetime JP2743378B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63108737A JP2743378B2 (en) 1988-04-30 1988-04-30 Character recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63108737A JP2743378B2 (en) 1988-04-30 1988-04-30 Character recognition method

Publications (2)

Publication Number Publication Date
JPH01279385A true JPH01279385A (en) 1989-11-09
JP2743378B2 JP2743378B2 (en) 1998-04-22

Family

ID=14492246

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63108737A Expired - Lifetime JP2743378B2 (en) 1988-04-30 1988-04-30 Character recognition method

Country Status (1)

Country Link
JP (1) JP2743378B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004097721A1 (en) * 2003-04-25 2004-11-11 Sharp Kabushiki Kaisha Image processing device, image processing method, image processing program, and computer-readable recording medium containing the program

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS522337A (en) * 1975-06-24 1977-01-10 Nec Corp Slice level deciding equipment
JPS5762466A (en) * 1980-10-03 1982-04-15 Canon Inc Original reader
JPS5960580A (en) * 1982-09-29 1984-04-06 Fujitsu Ltd Picture processing system
JPS62147584A (en) * 1985-12-23 1987-07-01 Toshiba Corp Character reader

Also Published As

Publication number Publication date
JP2743378B2 (en) 1998-04-22

Similar Documents

Publication Publication Date Title
US7949157B2 (en) Interpreting sign language gestures
JP2940936B2 (en) Tablespace identification method
CN101615251A (en) The method and apparatus that is used for identification character in the character recognition device
JP2001060247A (en) Device and method for image processing
US10055668B2 (en) Method for the optical detection of symbols
JPH01279385A (en) Character recognizing device
CN107480648B (en) Method for detecting characters in natural scene
US5361204A (en) Searching for key bit-mapped image patterns
JPH0291789A (en) Character recognizing system
JP3305367B2 (en) Data entry device for database
JP2867531B2 (en) Character size recognition device
JP2649807B2 (en) Character reader
JP2978801B2 (en) Character input method for handwritten character recognition
JPH0877355A (en) Weighed pattern matching method
KR200332373Y1 (en) Mobile communication device with business card recognition processing
JPH02166583A (en) Character recognizing device
JP2612383B2 (en) Character recognition processing method
JPH05324900A (en) Portable character retrieving device
JPS6089290A (en) Pattern recognition method
JPH03219384A (en) Character recognizing device
JP4129320B2 (en) Image processing apparatus and recording medium
JP3243389B2 (en) Document identification method
JPH08263591A (en) Device and method for character recognition
JPH01261794A (en) Display method for character recognizing system
JPH06131496A (en) Pattern normalization processing method

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080206

Year of fee payment: 10

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090206

Year of fee payment: 11

EXPY Cancellation because of completion of term