JP2981902B2

JP2981902B2 - Character photo automatic recognition processing method

Info

Publication number: JP2981902B2
Application number: JP2002249A
Authority: JP
Inventors: 哲士熊本
Original assignee: Kyocera Corp
Current assignee: Kyocera Corp
Priority date: 1990-01-09
Filing date: 1990-01-09
Publication date: 1999-11-22
Anticipated expiration: 2014-11-22
Also published as: JPH03208184A

Description

【発明の詳細な説明】［産業上の利用分野］本発明は文字と写真が混在する原画像を自動的に処理
して文字と写真とを識別する方式の改良に関する。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an improvement in a method of automatically processing an original image in which characters and photographs are mixed and distinguishing between characters and photographs.

［発明の概要］文字と写真が混在する原画像を構成する多数の濃度の
異なる画素から成る入力データに対し、分散処理、メッ
シュ処理及び又はエッジ処理を施して文字と写真とを正
確に認識する方式に関するものである。SUMMARY OF THE INVENTION Dispersion processing, mesh processing and / or edge processing are performed on input data composed of a large number of pixels having different densities that constitute an original image in which characters and photographs are mixed, so that characters and photographs are accurately recognized. It is about the method.

［従来の技術］文字画像と写真画像が混在している原稿を光学的に読
み取ってディジタル処理して得られた画像情報を再生し
た場合、その再生画像は大変見ずらいものであった。そ
のため文字と写真の像域分離を自動的に認識し、再生画
像を見やすくする処理方式として、従来からエッジ処理
及びメッシュ処理の方式が提案されている。2. Description of the Related Art When a document in which a character image and a photographic image are mixed is optically read and image information obtained by digital processing is reproduced, the reproduced image is very difficult to see. For this reason, edge processing and mesh processing have been proposed as processing methods for automatically recognizing the image area separation between characters and photographs and making the reproduced image easier to see.

［発明が解決しようとする課題］しかしながら従来の方式は画素の濃度差に着目するの
みであるため、太い文字の場合、太い文字の非エッジ部
は濃度差がないため文字として認識されず、写真として
認識されてしまう不具合があった。[Problems to be Solved by the Invention] However, since the conventional method focuses only on the density difference between pixels, in the case of a thick character, the non-edge portion of the thick character has no density difference and is not recognized as a character. There was a problem that was recognized as.

［発明の目的］従って本発明の目的は文字と写真の像域分離を自動的
に認識し再生された画像を見やすくする文字写真自動認
識処理方式を提供するにある。Accordingly, an object of the present invention is to provide an automatic character / photograph recognition system for automatically recognizing image area separation between a character and a photograph and making it easy to view a reproduced image.

［課題を解決するための手段］本発明は上記目的を達成するため、文字と写真が混在
する原画像を濃度の異なる複数の画素に変換して読み取
った入力データを夫々の画素毎にその周囲の画素を３×
３のマトリクス上に順次分割し、この分割した入力デー
タの中である注目の１画素毎にその近傍の８画素を含む
９個の画素の濃度平均値を求め、この平均値と各画素濃
度との２乗和を計算し、その結果をしきい値と順次比較
しながら文字画素か写真画素かを判定する分散処理手段
と、上記原画像の各領域を複数個のｎ×ｎ（ｎは自然
数）のメッシュに区切り、夫々のメッシュについて写真
画素と判定された画素を計数してその計数値をしきい値
と比較し、その結果によりメッシュ毎に写真領域が文字
領域かを決定するメッシュ処理手段と、から成る文字写
真自動認識処理方式を提供する。Means for Solving the Problems In order to achieve the above object, the present invention converts an original image in which characters and photographs are mixed into a plurality of pixels having different densities and reads the input data for each pixel. 3x pixels
3 are sequentially divided on a matrix of 3 and the average value of the density of 9 pixels including 8 pixels in the vicinity of each pixel of interest in the divided input data is obtained. Calculating the sum of the squares of the pixels and sequentially comparing the result with a threshold value to determine whether the pixel is a character pixel or a photographic pixel, and a plurality of n × n (n is a natural number) A) a mesh processing means for counting pixels determined to be photographic pixels for each mesh, comparing the counted value with a threshold value, and determining whether the photographic region is a text region for each mesh based on the result. And a character / photo automatic recognition processing method comprising:

［作用］メッシュ処理手段及び又はエッジ処理手段の他に、分
散処理手段を設けているので、特に太い文字に対する認
識精度が向上する。[Operation] Since the distributed processing means is provided in addition to the mesh processing means and / or the edge processing means, the recognition accuracy for particularly thick characters is improved.

［実施例］以下図面に示す実施例を参照して本発明を説明する。
第１図は本発明による文字写真自動認識処理方式の一実
施例の基本的システム概念図を示す。同図において、１
は読取部、２は中央演算部（CPU）、３はメモリ（RA
M）、４はROM、５はメッシュ処理部、６は分散処理部、
７はエッジ抽出処理部である。EXAMPLES The present invention will be described below with reference to examples shown in the drawings.
FIG. 1 shows a basic system conceptual diagram of an embodiment of an automatic text / photo recognition processing system according to the present invention. In the figure, 1
Is a reading unit, 2 is a central processing unit (CPU), 3 is a memory (RA
M), 4 is ROM, 5 is mesh processing unit, 6 is distributed processing unit,
Reference numeral 7 denotes an edge extraction processing unit.

読取部１は公知の光学的文字読取装置（OCR）等から
成り、文字と写真が混在する原画像を濃度の異なる多数
の画素に変換してディジタルデータとして読み取る。The reading unit 1 includes a known optical character reading device (OCR) or the like, and converts an original image in which characters and photographs are mixed into a large number of pixels having different densities and reads the digital data as digital data.

CPU2は読み取られたデータを一時的にメモリ３に蓄積
しておき、下記のようにメッシュ処理部５、分散処理部
６及びエッジ抽出処理部７を制御して文字と写真とを認
識するように処理させる。なおROM4は上記制御に必要な
プログラムを格納している。The CPU 2 temporarily stores the read data in the memory 3 and controls the mesh processing unit 5, the distribution processing unit 6, and the edge extraction processing unit 7 so as to recognize characters and photographs as described below. Let it be processed. The ROM 4 stores a program necessary for the above control.

而して本発明の第１の方式では上記画素データに対し
分散処理とメッシュ処理を施す。この分散処理は入力デ
ータ（１画素＝１ドット）である注目の１画素に対して
その近傍の８画素の状況を見て所定の演算を行い、その
結果をしきい値と比較しながらその注目画素が文字画素
か写真画素かを判定する。Thus, in the first method of the present invention, the pixel data is subjected to a dispersion process and a mesh process. In this distributed processing, a predetermined operation is performed on one pixel of interest, which is input data (1 pixel = 1 dot), while observing the situation of eight pixels in the vicinity thereof, and the result is compared with a threshold to compare the result with the threshold. It is determined whether the pixel is a character pixel or a photo pixel.

上記分散処理の演算は分散処理部６により次のように
して行われる。The calculation of the distributed processing is performed by the distributed processing unit 6 as follows.

３×３のマトリクス上に格納された注目画素を含む９
個の画素の濃度平均値を求め、この平均値と各画素濃度
との２乗和を計算する。今、この２乗和をSD、平均値を
ｍ、各画素濃度をX_i（ｉ＝０〜８）とすると、このSDは９個の画素に比例する。9 including the pixel of interest stored on a 3 × 3 matrix
An average value of the densities of the pixels is obtained, and a sum of squares of the average value and the density of each pixel is calculated. Assuming that the sum of squares is SD, the average value is m, and each pixel density is X _i (i = 0 to 8), This SD is proportional to nine pixels.

文字の入力データではその文字のエッジ濃度分布に対
しSDが大きく、また太い均一濃度の文字領域のSDは小さ
い。写真の画素濃度はほぼSDの分布をもっている。In the input data of a character, the SD is large with respect to the edge density distribution of the character, and the SD of a character region having a thick uniform density is small. The pixel density of a photograph has a distribution of almost SD.

本発明者の実験（コンピュータシミュレーション）的
検討によればSDのしきい値をSa,Sb,Sa＜Sbとすると、文
字領域の画素についてはSD＜Sa又はSb＜SD、写真領域に
おいては、Sa≦SD≦Sbとして判別できる。According to the experimental (computer simulation) study of the inventor, when the threshold value of SD is Sa, Sb, Sa <Sb, the pixels in the character region are SD <Sa or Sb <SD, and the pixels in the photographic region are Sa <Sb. ≦ SD ≦ Sb.

このような分散処理により細い文字、太い文字及び画
素濃度変化の穏やかな写真の判別を正確に行うことがで
きる。By such dispersion processing, thin characters, thick characters, and photographs having a gentle change in pixel density can be accurately determined.

次にメッシュ処理部５は判別情報をより正確にするた
め、入力データの全領域を複数個のメッシュに区切り、
例えばその１つのメッシュを２×２の部分に区分し、各
メッシュについて、その４個の部分の文字画素の数を計
数し、その計数値Smがしきい値Scより大きいと文字領
域、Scより小さいと写真領域と判定し、像域分離を行
う。Next, the mesh processing unit 5 divides the entire area of the input data into a plurality of meshes in order to make the discrimination information more accurate.
For example, one mesh is divided into 2 × 2 portions, and for each mesh, the number of character pixels in the four portions is counted. If the count value Sm is larger than the threshold value Sc, the number of character pixels is calculated from the character region and Sc. If it is smaller, it is determined to be a photographic area, and image area separation is performed.

第２図は上述した分散処理とメッシュ処理を行うアル
ゴリズムを示すフローチャートである。FIG. 2 is a flowchart showing an algorithm for performing the above-described distributed processing and mesh processing.

さて本発明の第２の方式は上述した第１の方式の分散
処理に平行してエッジ抽出処理を行うことにより特に細
い文字の場合の認識精度を更に上げるようにしている。In the second method of the present invention, the edge extraction processing is performed in parallel with the distribution processing of the first method described above, thereby further improving the recognition accuracy in the case of particularly thin characters.

上記エッジ抽出処理部７では前記分散処理における９
個の画素が格納されているマトリクスに対し３×３の加
重マトリクスを乗算として和をとり各画素の濃度を変換
してエッジを抽出する。このようにして抽出された画素
の濃度としきい値Ｔとを比較してエッジ画素（文字画
素）か非エッジ画素（写真画素）かを判定する。In the edge extraction processing unit 7, 9
A matrix in which a number of pixels are stored is multiplied by a 3 × 3 weighting matrix to obtain a sum, the density of each pixel is converted, and an edge is extracted. The density of the pixel thus extracted is compared with the threshold value T to determine whether the pixel is an edge pixel (character pixel) or a non-edge pixel (photo pixel).

CPU2はエッジ抽出処理部７及び分散処理部６での判定
結果をとり写真画素か文字画素かを判定し、前記メッシ
ュ処理に移行する。The CPU 2 obtains the determination results of the edge extraction processing unit 7 and the distribution processing unit 6, determines whether the pixel is a photograph pixel or a character pixel, and shifts to the mesh processing.

第３図は上記第２の方式のアルゴリズムを示すフロー
チャートである。FIG. 3 is a flowchart showing the algorithm of the second method.

［発明の効果］以上説明したように本発明によれば、太い文字、細い
文字及び文字と写真の像域分離の精度が従来の方式より
も向上して見やすくなり、不自然さのない画像データが
得られ、特に太い文字に対しても前記アルゴリズムで文
字として判断することが可能となるので実用上の効果顕
著である。[Effects of the Invention] As described above, according to the present invention, the accuracy of image area separation of thick characters, thin characters, and characters and photographs is improved as compared with the conventional method, and image data without unnaturalness is obtained. Is obtained, and it is possible to determine a character as a character even with a thick character by the above algorithm, so that the practical effect is remarkable.

[Brief description of the drawings]

第１図は本発明の一実施例の基本的システム構成を示す
ブロック図、第２図及び第３図は夫々本発明の第１及び
第２の方式のアルゴリズムを示すフローチャートであ
る。１……画像読取部、２……CPU、３……メモリ、４……R
OM、５……メッシュ処理部、６……分散処理部、７……
エッジ抽出処理部。FIG. 1 is a block diagram showing a basic system configuration of an embodiment of the present invention, and FIGS. 2 and 3 are flowcharts showing algorithms of the first and second systems of the present invention, respectively. 1 image reading unit, 2 CPU, 3 memory, 4 R
OM, 5: mesh processing unit, 6: distributed processing unit, 7:
Edge extraction processing unit.

Claims

(57) [Claims]

An original image in which characters and photographs are mixed is converted into a plurality of pixels having different densities, and read input data is divided into pixels of each pixel sequentially on a 3 × 3 matrix. For each pixel of interest in the divided input data, the average value of the density of nine pixels including eight pixels in the vicinity thereof is calculated, and the sum of squares of the average value and the density of each pixel is calculated.
A distributed processing unit for determining whether a pixel is a character pixel or a photograph pixel by sequentially comparing the result with a threshold value; and dividing each area of the original image into a plurality of n × n (n is a natural number) meshes;
Mesh processing means for counting pixels determined to be photographic pixels for each mesh, comparing the counted value with a threshold value, and determining a photographic area or a text area for each mesh based on the result. Character and text automatic recognition processing method.

2. An input image in which an original image in which characters and photographs are mixed is converted into a plurality of pixels having different densities, and read input data is sequentially divided for each pixel into a 3 × 3 matrix. For each pixel of interest in the divided input data, the average value of the density of nine pixels including eight pixels in the vicinity thereof is calculated, and the sum of squares of the average value and the density of each pixel is calculated.
A dispersion processing means for determining whether the pixel is a character pixel or a photographic pixel by sequentially comparing the result with a threshold value; and each pixel subjected to density conversion by multiplying the pixel storage matrix by a 3 × 3 weighting matrix to obtain a sum. Edge extraction processing means for comparing the threshold value with the threshold value to determine whether the pixel is an edge pixel or a non-edge pixel,
A determination unit for determining whether the pixel is a photograph pixel or a character pixel according to the determination result of the distribution processing unit and the edge extraction processing unit, and converting each area of the original image into a plurality of n × n (n is a natural number) meshes And a mesh processing means for counting pixels determined as photo pixels for each of the meshes, comparing the counted value with a threshold value, and determining a photo region or a text region for each mesh based on the result. An automatic text / photo recognition processing method.