JP2981902B2 - Character photo automatic recognition processing method - Google Patents

Character photo automatic recognition processing method

Info

Publication number
JP2981902B2
JP2981902B2 JP2002249A JP224990A JP2981902B2 JP 2981902 B2 JP2981902 B2 JP 2981902B2 JP 2002249 A JP2002249 A JP 2002249A JP 224990 A JP224990 A JP 224990A JP 2981902 B2 JP2981902 B2 JP 2981902B2
Authority
JP
Japan
Prior art keywords
pixel
pixels
character
threshold value
density
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP2002249A
Other languages
Japanese (ja)
Other versions
JPH03208184A (en
Inventor
哲士 熊本
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kyocera Corp
Original Assignee
Kyocera Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kyocera Corp filed Critical Kyocera Corp
Priority to JP2002249A priority Critical patent/JP2981902B2/en
Publication of JPH03208184A publication Critical patent/JPH03208184A/en
Application granted granted Critical
Publication of JP2981902B2 publication Critical patent/JP2981902B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Image Analysis (AREA)

Description

【発明の詳細な説明】 [産業上の利用分野] 本発明は文字と写真が混在する原画像を自動的に処理
して文字と写真とを識別する方式の改良に関する。
Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an improvement in a method of automatically processing an original image in which characters and photographs are mixed and distinguishing between characters and photographs.

[発明の概要] 文字と写真が混在する原画像を構成する多数の濃度の
異なる画素から成る入力データに対し、分散処理、メッ
シュ処理及び又はエッジ処理を施して文字と写真とを正
確に認識する方式に関するものである。
SUMMARY OF THE INVENTION Dispersion processing, mesh processing and / or edge processing are performed on input data composed of a large number of pixels having different densities that constitute an original image in which characters and photographs are mixed, so that characters and photographs are accurately recognized. It is about the method.

[従来の技術] 文字画像と写真画像が混在している原稿を光学的に読
み取ってディジタル処理して得られた画像情報を再生し
た場合、その再生画像は大変見ずらいものであった。そ
のため文字と写真の像域分離を自動的に認識し、再生画
像を見やすくする処理方式として、従来からエッジ処理
及びメッシュ処理の方式が提案されている。
2. Description of the Related Art When a document in which a character image and a photographic image are mixed is optically read and image information obtained by digital processing is reproduced, the reproduced image is very difficult to see. For this reason, edge processing and mesh processing have been proposed as processing methods for automatically recognizing the image area separation between characters and photographs and making the reproduced image easier to see.

[発明が解決しようとする課題] しかしながら従来の方式は画素の濃度差に着目するの
みであるため、太い文字の場合、太い文字の非エッジ部
は濃度差がないため文字として認識されず、写真として
認識されてしまう不具合があった。
[Problems to be Solved by the Invention] However, since the conventional method focuses only on the density difference between pixels, in the case of a thick character, the non-edge portion of the thick character has no density difference and is not recognized as a character. There was a problem that was recognized as.

[発明の目的] 従って本発明の目的は文字と写真の像域分離を自動的
に認識し再生された画像を見やすくする文字写真自動認
識処理方式を提供するにある。
Accordingly, an object of the present invention is to provide an automatic character / photograph recognition system for automatically recognizing image area separation between a character and a photograph and making it easy to view a reproduced image.

[課題を解決するための手段] 本発明は上記目的を達成するため、文字と写真が混在
する原画像を濃度の異なる複数の画素に変換して読み取
った入力データを夫々の画素毎にその周囲の画素を3×
3のマトリクス上に順次分割し、この分割した入力デー
タの中である注目の1画素毎にその近傍の8画素を含む
9個の画素の濃度平均値を求め、この平均値と各画素濃
度との2乗和を計算し、その結果をしきい値と順次比較
しながら文字画素か写真画素かを判定する分散処理手段
と、上記原画像の各領域を複数個のn×n(nは自然
数)のメッシュに区切り、夫々のメッシュについて写真
画素と判定された画素を計数してその計数値をしきい値
と比較し、その結果によりメッシュ毎に写真領域が文字
領域かを決定するメッシュ処理手段と、から成る文字写
真自動認識処理方式を提供する。
Means for Solving the Problems In order to achieve the above object, the present invention converts an original image in which characters and photographs are mixed into a plurality of pixels having different densities and reads the input data for each pixel. 3x pixels
3 are sequentially divided on a matrix of 3 and the average value of the density of 9 pixels including 8 pixels in the vicinity of each pixel of interest in the divided input data is obtained. Calculating the sum of the squares of the pixels and sequentially comparing the result with a threshold value to determine whether the pixel is a character pixel or a photographic pixel, and a plurality of n × n (n is a natural number) A) a mesh processing means for counting pixels determined to be photographic pixels for each mesh, comparing the counted value with a threshold value, and determining whether the photographic region is a text region for each mesh based on the result. And a character / photo automatic recognition processing method comprising:

[作用] メッシュ処理手段及び又はエッジ処理手段の他に、分
散処理手段を設けているので、特に太い文字に対する認
識精度が向上する。
[Operation] Since the distributed processing means is provided in addition to the mesh processing means and / or the edge processing means, the recognition accuracy for particularly thick characters is improved.

[実施例] 以下図面に示す実施例を参照して本発明を説明する。
第1図は本発明による文字写真自動認識処理方式の一実
施例の基本的システム概念図を示す。同図において、1
は読取部、2は中央演算部(CPU)、3はメモリ(RA
M)、4はROM、5はメッシュ処理部、6は分散処理部、
7はエッジ抽出処理部である。
EXAMPLES The present invention will be described below with reference to examples shown in the drawings.
FIG. 1 shows a basic system conceptual diagram of an embodiment of an automatic text / photo recognition processing system according to the present invention. In the figure, 1
Is a reading unit, 2 is a central processing unit (CPU), 3 is a memory (RA
M), 4 is ROM, 5 is mesh processing unit, 6 is distributed processing unit,
Reference numeral 7 denotes an edge extraction processing unit.

読取部1は公知の光学的文字読取装置(OCR)等から
成り、文字と写真が混在する原画像を濃度の異なる多数
の画素に変換してディジタルデータとして読み取る。
The reading unit 1 includes a known optical character reading device (OCR) or the like, and converts an original image in which characters and photographs are mixed into a large number of pixels having different densities and reads the digital data as digital data.

CPU2は読み取られたデータを一時的にメモリ3に蓄積
しておき、下記のようにメッシュ処理部5、分散処理部
6及びエッジ抽出処理部7を制御して文字と写真とを認
識するように処理させる。なおROM4は上記制御に必要な
プログラムを格納している。
The CPU 2 temporarily stores the read data in the memory 3 and controls the mesh processing unit 5, the distribution processing unit 6, and the edge extraction processing unit 7 so as to recognize characters and photographs as described below. Let it be processed. The ROM 4 stores a program necessary for the above control.

而して本発明の第1の方式では上記画素データに対し
分散処理とメッシュ処理を施す。この分散処理は入力デ
ータ(1画素=1ドット)である注目の1画素に対して
その近傍の8画素の状況を見て所定の演算を行い、その
結果をしきい値と比較しながらその注目画素が文字画素
か写真画素かを判定する。
Thus, in the first method of the present invention, the pixel data is subjected to a dispersion process and a mesh process. In this distributed processing, a predetermined operation is performed on one pixel of interest, which is input data (1 pixel = 1 dot), while observing the situation of eight pixels in the vicinity thereof, and the result is compared with a threshold to compare the result with the threshold. It is determined whether the pixel is a character pixel or a photo pixel.

上記分散処理の演算は分散処理部6により次のように
して行われる。
The calculation of the distributed processing is performed by the distributed processing unit 6 as follows.

3×3のマトリクス上に格納された注目画素を含む9
個の画素の濃度平均値を求め、この平均値と各画素濃度
との2乗和を計算する。今、この2乗和をSD、平均値を
m、各画素濃度をXi(i=0〜8)とすると、 このSDは9個の画素に比例する。
9 including the pixel of interest stored on a 3 × 3 matrix
An average value of the densities of the pixels is obtained, and a sum of squares of the average value and the density of each pixel is calculated. Assuming that the sum of squares is SD, the average value is m, and each pixel density is X i (i = 0 to 8), This SD is proportional to nine pixels.

文字の入力データではその文字のエッジ濃度分布に対
しSDが大きく、また太い均一濃度の文字領域のSDは小さ
い。写真の画素濃度はほぼSDの分布をもっている。
In the input data of a character, the SD is large with respect to the edge density distribution of the character, and the SD of a character region having a thick uniform density is small. The pixel density of a photograph has a distribution of almost SD.

本発明者の実験(コンピュータシミュレーション)的
検討によればSDのしきい値をSa,Sb,Sa<Sbとすると、文
字領域の画素についてはSD<Sa又はSb<SD、写真領域に
おいては、Sa≦SD≦Sbとして判別できる。
According to the experimental (computer simulation) study of the inventor, when the threshold value of SD is Sa, Sb, Sa <Sb, the pixels in the character region are SD <Sa or Sb <SD, and the pixels in the photographic region are Sa <Sb. ≦ SD ≦ Sb.

このような分散処理により細い文字、太い文字及び画
素濃度変化の穏やかな写真の判別を正確に行うことがで
きる。
By such dispersion processing, thin characters, thick characters, and photographs having a gentle change in pixel density can be accurately determined.

次にメッシュ処理部5は判別情報をより正確にするた
め、入力データの全領域を複数個のメッシュに区切り、
例えばその1つのメッシュを2×2の部分に区分し、各
メッシュについて、その4個の部分の文字画素の数を計
数し、その計数値Smがしきい値Scより大きいと文字領
域、Scより小さいと写真領域と判定し、像域分離を行
う。
Next, the mesh processing unit 5 divides the entire area of the input data into a plurality of meshes in order to make the discrimination information more accurate.
For example, one mesh is divided into 2 × 2 portions, and for each mesh, the number of character pixels in the four portions is counted. If the count value Sm is larger than the threshold value Sc, the number of character pixels is calculated from the character region and Sc. If it is smaller, it is determined to be a photographic area, and image area separation is performed.

第2図は上述した分散処理とメッシュ処理を行うアル
ゴリズムを示すフローチャートである。
FIG. 2 is a flowchart showing an algorithm for performing the above-described distributed processing and mesh processing.

さて本発明の第2の方式は上述した第1の方式の分散
処理に平行してエッジ抽出処理を行うことにより特に細
い文字の場合の認識精度を更に上げるようにしている。
In the second method of the present invention, the edge extraction processing is performed in parallel with the distribution processing of the first method described above, thereby further improving the recognition accuracy in the case of particularly thin characters.

上記エッジ抽出処理部7では前記分散処理における9
個の画素が格納されているマトリクスに対し3×3の加
重マトリクスを乗算として和をとり各画素の濃度を変換
してエッジを抽出する。このようにして抽出された画素
の濃度としきい値Tとを比較してエッジ画素(文字画
素)か非エッジ画素(写真画素)かを判定する。
In the edge extraction processing unit 7, 9
A matrix in which a number of pixels are stored is multiplied by a 3 × 3 weighting matrix to obtain a sum, the density of each pixel is converted, and an edge is extracted. The density of the pixel thus extracted is compared with the threshold value T to determine whether the pixel is an edge pixel (character pixel) or a non-edge pixel (photo pixel).

CPU2はエッジ抽出処理部7及び分散処理部6での判定
結果をとり写真画素か文字画素かを判定し、前記メッシ
ュ処理に移行する。
The CPU 2 obtains the determination results of the edge extraction processing unit 7 and the distribution processing unit 6, determines whether the pixel is a photograph pixel or a character pixel, and shifts to the mesh processing.

第3図は上記第2の方式のアルゴリズムを示すフロー
チャートである。
FIG. 3 is a flowchart showing the algorithm of the second method.

[発明の効果] 以上説明したように本発明によれば、太い文字、細い
文字及び文字と写真の像域分離の精度が従来の方式より
も向上して見やすくなり、不自然さのない画像データが
得られ、特に太い文字に対しても前記アルゴリズムで文
字として判断することが可能となるので実用上の効果顕
著である。
[Effects of the Invention] As described above, according to the present invention, the accuracy of image area separation of thick characters, thin characters, and characters and photographs is improved as compared with the conventional method, and image data without unnaturalness is obtained. Is obtained, and it is possible to determine a character as a character even with a thick character by the above algorithm, so that the practical effect is remarkable.

【図面の簡単な説明】[Brief description of the drawings]

第1図は本発明の一実施例の基本的システム構成を示す
ブロック図、第2図及び第3図は夫々本発明の第1及び
第2の方式のアルゴリズムを示すフローチャートであ
る。 1……画像読取部、2……CPU、3……メモリ、4……R
OM、5……メッシュ処理部、6……分散処理部、7……
エッジ抽出処理部。
FIG. 1 is a block diagram showing a basic system configuration of an embodiment of the present invention, and FIGS. 2 and 3 are flowcharts showing algorithms of the first and second systems of the present invention, respectively. 1 image reading unit, 2 CPU, 3 memory, 4 R
OM, 5: mesh processing unit, 6: distributed processing unit, 7:
Edge extraction processing unit.

Claims (2)

(57)【特許請求の範囲】(57) [Claims] 【請求項1】文字と写真が混在する原画像を濃度の異な
る複数の画素に変換して読み取った入力データを夫々の
画素毎にその周囲の画素を3×3のマトリクス上に順次
分割し、この分割した入力データの中である注目の1画
素毎にその近傍の8画素を含む9個の画素の濃度平均値
を求め、この平均値と各画素濃度との2乗和を計算し、
その結果をしきい値と順次比較しながら文字画素か写真
画素かを判定する分散処理手段と、上記原画像の各領域
を複数個のn×n(nは自然数)のメッシュに区切り、
夫々のメッシュについて写真画素と判定された画素を計
数してその計数値をしきい値と比較し、その結果により
メッシュ毎に写真領域か文字領域かを決定するメッシュ
処理手段と、から成ることを特徴とする文字写真自動認
識処理方式。
An original image in which characters and photographs are mixed is converted into a plurality of pixels having different densities, and read input data is divided into pixels of each pixel sequentially on a 3 × 3 matrix. For each pixel of interest in the divided input data, the average value of the density of nine pixels including eight pixels in the vicinity thereof is calculated, and the sum of squares of the average value and the density of each pixel is calculated.
A distributed processing unit for determining whether a pixel is a character pixel or a photograph pixel by sequentially comparing the result with a threshold value; and dividing each area of the original image into a plurality of n × n (n is a natural number) meshes;
Mesh processing means for counting pixels determined to be photographic pixels for each mesh, comparing the counted value with a threshold value, and determining a photographic area or a text area for each mesh based on the result. Character and text automatic recognition processing method.
【請求項2】文字と写真が混在する原画像を濃度の異な
る複数の画素に変換して読み取った入力データを夫々の
画素毎にその周囲の画素を3×3のマトリクス上に順次
分割し、この分割した入力データの中である注目の1画
素毎にその近傍の8画素を含む9個の画素の濃度平均値
を求め、この平均値と各画素濃度との2乗和を計算し、
その結果をしきい値と順次比較しながら文字画素か写真
画素かを判定する分散処理手段と、上記画素格納マトリ
クスに対し3×3の加重マトリクスを乗算して和をとり
濃度変換された各画素としきい値とを比較してエッジ画
素か非エッジ画素かを判定するエッジ抽出処理手段と、
上記分散処理手段とエッジ抽出処理手段との判定結果に
応じて写真画素か文字画素かを判定する判定手段と、上
記原画像の各領域を複数個のn×n(nは自然数)のメ
ッシュに区切り、夫々のメッシュについて写真画素と判
定された画素を計数してその計数値をしきい値と比較
し、その結果によりメッシュ毎に写真領域か文字領域か
を決定するメッシュ処理手段と、から成ることを特徴と
する文字写真自動認識処理方式。
2. An input image in which an original image in which characters and photographs are mixed is converted into a plurality of pixels having different densities, and read input data is sequentially divided for each pixel into a 3 × 3 matrix. For each pixel of interest in the divided input data, the average value of the density of nine pixels including eight pixels in the vicinity thereof is calculated, and the sum of squares of the average value and the density of each pixel is calculated.
A dispersion processing means for determining whether the pixel is a character pixel or a photographic pixel by sequentially comparing the result with a threshold value; and each pixel subjected to density conversion by multiplying the pixel storage matrix by a 3 × 3 weighting matrix to obtain a sum. Edge extraction processing means for comparing the threshold value with the threshold value to determine whether the pixel is an edge pixel or a non-edge pixel,
A determination unit for determining whether the pixel is a photograph pixel or a character pixel according to the determination result of the distribution processing unit and the edge extraction processing unit, and converting each area of the original image into a plurality of n × n (n is a natural number) meshes And a mesh processing means for counting pixels determined as photo pixels for each of the meshes, comparing the counted value with a threshold value, and determining a photo region or a text region for each mesh based on the result. An automatic text / photo recognition processing method.
JP2002249A 1990-01-09 1990-01-09 Character photo automatic recognition processing method Expired - Lifetime JP2981902B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2002249A JP2981902B2 (en) 1990-01-09 1990-01-09 Character photo automatic recognition processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2002249A JP2981902B2 (en) 1990-01-09 1990-01-09 Character photo automatic recognition processing method

Publications (2)

Publication Number Publication Date
JPH03208184A JPH03208184A (en) 1991-09-11
JP2981902B2 true JP2981902B2 (en) 1999-11-22

Family

ID=11524083

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2002249A Expired - Lifetime JP2981902B2 (en) 1990-01-09 1990-01-09 Character photo automatic recognition processing method

Country Status (1)

Country Link
JP (1) JP2981902B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5659402A (en) * 1994-01-14 1997-08-19 Mita Industrial Co., Ltd. Image processing method and apparatus

Also Published As

Publication number Publication date
JPH03208184A (en) 1991-09-11

Similar Documents

Publication Publication Date Title
US7212668B1 (en) Digital image processing system and method for emphasizing a main subject of an image
US7457432B2 (en) Specified object detection apparatus
JP4559437B2 (en) Sky detection in digital color images
EP1276315B1 (en) A method for processing a digital image to adjust brightness
JPS62137974A (en) Picture processing system
JP2007500967A (en) Generate images with desired aspect ratio
US20090016616A1 (en) Category Classification Apparatus, Category Classification Method, and Storage Medium Storing a Program
CN104182722B (en) Method for text detection and device and text message extracting method and system
JP2005122720A (en) Apparatus for selecting image of specific scene, program, and recording medium having program recorded thereon
JP2981902B2 (en) Character photo automatic recognition processing method
US20080199084A1 (en) Category Classification Apparatus and Category Classification Method
JPH09101579A (en) Face area extraction method and copying condition determination method
JPH06113139A (en) Picture binarization device
JPH05114045A (en) Area separating method and image processor
EP0975146A1 (en) Locating the position and orientation of multiple objects with a smart platen
JPH08251404A (en) Method and device for discriminating attribute of image area
JP2812619B2 (en) Image processing device
US20080199085A1 (en) Category Classification Apparatus, Category Classification Method, and Storage Medium Storing a Program
JP3099435B2 (en) Image area determination device
JP2000067244A (en) Area integrating method for color image, and program storage medium
EP0974931A1 (en) Method and apparatus for identifying a plurality of sub-images in an input image
JP2877548B2 (en) Document image attribute discrimination method
JPH04304776A (en) Picture processor
JPH08251403A (en) Device for discriminating attribute of image area
JP2859603B2 (en) Method and apparatus for binarizing image data