JPH02253383A

JPH02253383A - Picture processor

Info

Publication number: JPH02253383A
Application number: JP1075366A
Authority: JP
Inventors: Gakuto Sugimoto; 杉本　学人; Shinji Kondo; 真司近藤; Kouei Riku; 陸　光榮
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1989-03-27
Filing date: 1989-03-27
Publication date: 1990-10-12
Anticipated expiration: 2014-08-25
Also published as: JP2939985B2

Abstract

PURPOSE:To automatically extract and classify the areas of a character string, a graphic, a photograph, a picture, a line segment, and a noise in a picture by providing a rectangular area coordinate extracting part, a rectangular area characteristic extracting part, and a rectangular area classifying part. CONSTITUTION:A rectangular area is extracted from the picture by a rectangular area coordinate extracting part 3, the characteristic of the extracted rectangular area is extracted by a rectangular area characteristic extracting part 4, and the extracted characteristic is compared with the characteristics of the character string, graphic, photograph, line segment and noise prepared beforehand, and to which of the character string, graphic, photograph, line segment or noise the rectangular area corresponds is decided by a rectangular area classifying part 5. By using the method, every one character is segmented and recognized by character recognizing technique, respectively inherent processings are executed for the graphic, photograph, line segment and noise areas, and thus the input picture can be more flexibly processed.

Description

【発明の詳細な説明】産業上の利用分野新聞、雑誌等の不特定な書式の文書から文字列。[Detailed description of the invention] Industrial applications Character strings from documents in unspecified formats such as newspapers and magazines.

図表、写真、線分、ノイズの領域を抽出し、分類する画
像処理装置に関するものである。The present invention relates to an image processing device that extracts and classifies regions of charts, photographs, line segments, and noise.

従来の技術文字１図形の混在する画像から、文字領域７図形領域を
切り分ける画像処理装置には、入力画像を表示しオペレ
ータがマウスなどを使用して指定するものと、オペレー
タの介在なくして自動的に行うものがある。オペレータ
の介在なくして自動的に行うものには、画像全体を文字
と図形に区別することなく所定のフォーマットに基づき
強制的に文字として１文字毎の小領域に切り出し、すで
に知られているパターン認識技術を用いて認識し、その
１文字毎の小領域の認識結果が文字として認識可能であ
るか否かを判定し、その判定結果を用いて１文字毎の小
領域どうしの連続性を調べて文字領域と図形領域を分類
していた（例えば、特開昭６１−１１８８８号公報）。Conventional technical image processing devices that separate character areas and seven graphic areas from an image containing a mixture of one character and one figure include those that display the input image and allow an operator to specify it using a mouse, and those that automatically specify the area without operator intervention. There is something to do. Automatic pattern recognition without operator intervention involves forcibly cutting out the entire image into small areas for each character based on a predetermined format without distinguishing between characters and figures. technology is used to recognize each character, the recognition results of the small areas for each character are determined whether or not they can be recognized as characters, and the continuity of the small areas for each character is examined using the determination results. Character areas and graphic areas were classified (for example, Japanese Patent Laid-Open No. 11888/1988).

発明が解決しようとする課題しかしながら、上記のような従来の技術では、文字認識
処理のための文字領域と文字以外の領域の分類に主眼が
おかれており、画像内の文字以外の領域は図表、写真、
線分、ノイズの領域というように、細かく分類すること
ができないという欠点を有していた。Problems to be Solved by the Invention However, in the above-mentioned conventional techniques, the main focus is on classifying text areas and non-text areas for character recognition processing, and non-text areas in an image are classified into figures, graphs, etc. ,photograph,
This method has the disadvantage of not being able to classify finely into areas such as line segments and noise areas.

本発明はかかる点に鑑みてなされたものであり、画像内
の文字列２図表、写真、線分、ノイズの領域を簡易な方
法で、自動的に抽出し分類する画像処理装置を提供する
ことを目的としている。The present invention has been made in view of the above points, and an object of the present invention is to provide an image processing device that automatically extracts and classifies character strings, graphs, photographs, line segments, and noise regions in an image using a simple method. It is an object.

課題を解決するための手段本発明は上記目的を達成するために、画像から文字列９
図表、写真、線分、ノイズの矩形領域を抽出する矩形領
域座標抽出部と、前記矩形領域座標抽出部で抽出した矩
形領域の特徴”を抽出する矩形領域特徴抽出部と、前記
矩形領域特徴抽出部から抽出した特徴を用いて、矩形領
域を文字列９図表、写真、線分、ノイズに分類する矩形
領域分類部を備えた画像処理装置である。Means for Solving the Problems In order to achieve the above object, the present invention provides character strings 9 from an image.
a rectangular area coordinate extraction unit that extracts rectangular areas of diagrams, photographs, line segments, and noise; a rectangular area feature extraction unit that extracts “features of the rectangular area extracted by the rectangular area coordinate extraction unit”; and the rectangular area feature extraction unit. This image processing device includes a rectangular area classification unit that classifies a rectangular area into character strings, graphs, photographs, line segments, and noise using features extracted from the area.

作　　用本発明は上記の構成により、画像から矩形領域座標抽出
部で矩形領域を抽出し、抽出した矩形領域に対し矩形領
域特徴抽出部で特徴を抽出し、抽出した特徴を矩形領域
分類部で文字列９図表、写真、線分、ノイズそれぞれに
あらかじめ用意した特徴と比較することにより、矩形領
域が文字列。According to the above configuration, the present invention extracts a rectangular area from an image using the rectangular area coordinate extraction unit, extracts features from the extracted rectangular area using the rectangular area feature extraction unit, and uses the extracted features in the rectangular area classification unit. Character String 9 By comparing the features prepared in advance for each diagram, photograph, line segment, and noise, a rectangular area is transformed into a character string.

図表、写真、線分、ノイズのいずれかに該当するかを判
定する。Determine whether it corresponds to a diagram, photograph, line segment, or noise.

実施例以下、本発明の実施例について図面を参照しながら説明
する。EXAMPLES Hereinafter, examples of the present invention will be described with reference to the drawings.

第１図は、本発明による画像処理装置の一実施例の構成
図である。１は画像入力部であシ文字列。FIG. 1 is a block diagram of an embodiment of an image processing apparatus according to the present invention. 1 is a character string in the image input section.

図表、写真、線分、ノイズを含む画像を走査し、２値信
号で画像メモリ部２に格納する。３は矩形領域座標抽出
部であシ文字列１図表、写真、ａ分。Images including charts, photographs, line segments, and noise are scanned and stored in the image memory section 2 as binary signals. 3 is a rectangular area coordinate extraction section with character string 1 chart, photo, a minute.

ノイズを囲む、最小の矩形領域座標を抽出する。Extract the coordinates of the smallest rectangular area surrounding the noise.

４は矩形領域特徴抽出部であり、矩形領域座標抽出部３
で抽出した文字列２図表、写真、線分、ノイズを囲む矩
形領域の特徴を抽出する。５は矩形領域分類部であり、
矩形領域特徴抽出部４で抽出分類する。4 is a rectangular area feature extraction unit, and rectangular area coordinate extraction unit 3
Extract the characteristics of the rectangular area surrounding the extracted character string 2 charts, photographs, line segments, and noise. 5 is a rectangular area classification unit;
The rectangular area feature extraction unit 4 performs extraction and classification.

以上のように構成された画像処理装置について、第２図
に示す入力画像Ｐを例に説明する。The image processing apparatus configured as described above will be explained using an input image P shown in FIG. 2 as an example.

画像入力部１から、入力された画像Ｐは文字列。The image P input from the image input unit 1 is a character string.

図表、写真、線分、ノイズ部の黒画素を１、背景部の白
画素を０の２値データで画像メモリ部２に蓄えられる。Binary data of 1 for black pixels in graphs, photographs, line segments, and noise areas and 0 for white pixels in background areas is stored in the image memory unit 2.

矩形領域座標抽出部３では、画像メモリ部２に蓄えられ
ている入力画像Ｐを横方向に走査して黒画素間の距離が
あらかじめ定めたしきい値Ｒ１以下の場合、その黒画素
どうしは連結しているものとする。同様に画像メモリ部
２に蓄えられている入力画像Ｐを縦方向に走査して黒画
素間の距離があらかじめ定めたしきい値Ｒ２以下の場合
、その黒画素どうしは連結しているものとする。横方向
。The rectangular area coordinate extraction unit 3 scans the input image P stored in the image memory unit 2 in the horizontal direction, and if the distance between black pixels is equal to or less than a predetermined threshold value R1, the black pixels are connected to each other. It is assumed that Similarly, when the input image P stored in the image memory unit 2 is scanned in the vertical direction and the distance between black pixels is less than or equal to a predetermined threshold value R2, the black pixels are considered to be connected. . Lateral direction.

縦方向に走査して得られた黒画素間の連結情報に着目し
文字列２図表、写真、線分、ノイズ部分のいずれかを囲
む最小の矩形領域の左上点座標（ｘｍｉ、ｎ　ｙＹｍｉ
　ｎ　）　ｒ　右下点座標（ｘｍ＆　！　ｊ　Ｙｍ＆　
！　）を抽出する。第３図に文字列の矩形領域を抽出し
た状態を座標を用いて示す。第４図に第２図の入力画像
Ｐから矩形領域座標抽出部３で抽出したすべての矩形領
域を示す。Focusing on the connection information between black pixels obtained by scanning in the vertical direction, we calculate the coordinates of the upper left point of the smallest rectangular area (xmi, n yYmi
n) r Lower right point coordinates (xm&! j Ym&
! ). FIG. 3 shows a state in which a rectangular region of a character string is extracted using coordinates. FIG. 4 shows all rectangular areas extracted by the rectangular area coordinate extraction unit 3 from the input image P of FIG. 2.

矩形領域特徴抽出部４では、矩形領域座標抽出部３で抽
出した文字列９図表、写真、線分、ノイズの矩形領域座
標から、矩形領域の幅Ｗを式（１）によって求める。The rectangular area feature extraction unit 4 calculates the width W of the rectangular area from the rectangular area coordinates of the character strings 9 charts, photographs, line segments, and noise extracted by the rectangular area coordinate extraction unit 3 using equation (1).

Ｗ＝ＸｍａニーＸｍ１ｎ＋１　　　　・・・・・・・・
・・・・（１）同様に矩形領域座標から矩形領域の高さ
Ｈを式＠）によって求める。W=XmanyXm1n+1 ・・・・・・・・・
(1) Similarly, the height H of the rectangular area is determined from the rectangular area coordinates using the formula @).

Ｈ＝Ｙ　　　−Ｙ　　　＋１　　　　・・・・・・・・
・・・・Ｃ２）ｍａｗ　　　ｍｉｎ矩形領域の幅Ｗと高さＨから、矩形領域の文字列方向垂
直高さＶを式（３）により、て求める。H=Y −Y +1 ・・・・・・・・・
...C2) maw min From the width W and height H of the rectangular area, the vertical height V of the rectangular area in the character string direction is determined using equation (3).

矩形領域の幅Ｗと高さＨから、矩形領域サイズＳを式（
イ）によって求める。From the width W and height H of the rectangular area, calculate the rectangular area size S using the formula (
A).

Ｓ＝ｗＨ・・・・・・・・・・・・←）矩形領域の幅Ｗ
と高さＨから、矩形領域縦横比Ｅを式（５）によって求
める。S=wH・・・・・・・・・・・・←) Width W of rectangular area
and the height H, the rectangular area aspect ratio E is determined by equation (5).

矩形領域サイズＳと矩形領域内の黒画素数Ｂから矩形領
域の黒画素密度りを式（６）によって求める。The black pixel density of the rectangular area is calculated from the rectangular area size S and the number B of black pixels in the rectangular area using equation (6).

Ｄ＝丁　　　　　　　・・・・・・・・・・・塵）矩形
領域分類部６では、−数的な文書の文字列。D = ding (dust) In the rectangular area classification unit 6, - a character string of a numerical document.

図表、写真、線分、ノイズは矩形領域特徴抽出部４で抽
出した矩形領域の文字列方向垂直高さＶ。For charts, photographs, line segments, and noise, the vertical height V in the character string direction of the rectangular area extracted by the rectangular area feature extraction unit 4 is used.

矩形領域サイズＳ、矩形領域縦横比Ｅ、矩形領域の黒画
素密度りが特定の性質を持つことを利用して分類を行う
。具体的には、矩形領域の文字列方向垂直高さＶがあら
かじめ定めたしきい値ｖｔｈｒ以上の場合、その矩形領
域は図表、または写真と分類され、Ｖがｖｔｈｒ未溝の
場合は文字列、線分。Classification is performed using the fact that the rectangular area size S, the rectangular area aspect ratio E, and the black pixel density of the rectangular area have specific properties. Specifically, if the vertical height V of a rectangular area in the character string direction is greater than or equal to a predetermined threshold value vthr, the rectangular area is classified as a diagram or a photograph, and if V is not vthr, then the character string, line segment.

ノイズのうちいずれかであると分類される。文字列、線
分、ノイズのうちいずれかであると分類された矩形領域
は、矩形領域サイズＳがあらかじめ定めたしきい値８ｔ
ｈｒ以上の場合は文字列、線分と分類され、Ｓが８ｔｈ
ｒ未満の場合は、ノイズであると分類される。文字列、
線分と分類された矩形領域は、矩形領域縦横比Ｅがあら
かじめ定めたしきい値Ｅｔｈｒ以上の場合は、線分と分
類され、ＥがＥｔｈｒ未滴の場合は文字列と分類される
。図表または写真と分類された矩形領域は、矩形領域の
黒画素密度りがあらかじめ定めたしきい値Ｄｔｈｒ以上
の場合は、写真と分類され、ＤがＤｔｈｒ未満の場合は
図表と分類される。第５図に矩形領域の分類条件の説明
図を示す。It is classified as either noise. A rectangular area classified as a character string, line segment, or noise is determined by a predetermined threshold value of 8t for the rectangular area size S.
If it is hr or more, it is classified as a character string or line segment, and S is 8th
If it is less than r, it is classified as noise. string,
A rectangular area classified as a line segment is classified as a line segment if the rectangular area aspect ratio E is equal to or greater than a predetermined threshold value Ethr, and is classified as a character string if E is not filled with Ethr. A rectangular area classified as a diagram or a photograph is classified as a photograph if the black pixel density of the rectangular area is greater than or equal to a predetermined threshold value Dthr, and is classified as a diagram if D is less than Dthr. FIG. 5 shows an explanatory diagram of the classification conditions for rectangular areas.

以上のように構成された画像処理装置では文字列２図表
、写真、線分、ノイズの混在する画像から文字列９図表
、写真、線分、ノイズを抽出し、分類することができる
。The image processing apparatus configured as described above can extract and classify character strings 9, charts, photographs, line segments, and noise from an image containing character strings 2, charts, photographs, line segments, and noise.

尚、本実施例の画像処理装置を文字認識装置に接続する
ことにより、文字列と分類された矩形領域から文字を切
り出し、認識することができる。Note that by connecting the image processing device of this embodiment to a character recognition device, characters can be extracted and recognized from rectangular areas classified as character strings.

発明の詳細な説明したように、本発明によれば不特定な書式の文書
の入力画像から簡易な方法で自動的に文字列９図表、写
真、線分、ノイズの領域を抽出することができる。この
方法を使用して、文字列の領域はすでに知られている文
字認識技術によって１文字毎に切り出して認識し、図表
、写真、線分、ノイズの領域はそれぞれ固有の処理を行
うことによって入力画像をより柔軟に加工することがで
き、その実用的効果は大きい。As described in detail, according to the present invention, text strings, graphs, photographs, line segments, and noise regions can be automatically extracted from an input image of a document in an unspecified format using a simple method. . Using this method, character string regions are extracted and recognized character by character using already known character recognition technology, and diagrams, photographs, line segments, and noise regions are each input by performing unique processing. Images can be processed more flexibly, which has great practical effects.

[Brief explanation of drawings]

第１図は本発明における一実施例の画像処理装置の構成
図、第２図は入力画像の説明図、第３図は抽出した文字
列の矩形領域座標を示す説明図、第４図は第２図の入力
画像に対して抽出したすべての矩形領域を示す説明図、
第５図は矩形領域の分類条件を示す説明図である。１・・・・・・画像入力部、２・・・・・・画像メモリ
部、３・・・・・・矩形領域座標抽出部、４・・・・・
・矩形領域特徴抽出部、６・・・・・・矩形領域分類部
、６・・・・・・文字列領域、７・・・・・・線分領域
、８・・・・・・写真領域、９・・・・・・図表領域、
Ｐ・・・・・・入力画像。代理人の氏名　弁理士　粟　野　重　孝　ほか１名第１
図／第図FIG. 1 is a configuration diagram of an image processing apparatus according to an embodiment of the present invention, FIG. 2 is an explanatory diagram of an input image, FIG. 3 is an explanatory diagram showing rectangular area coordinates of extracted character strings, and FIG. An explanatory diagram showing all rectangular areas extracted for the input image in Figure 2,
FIG. 5 is an explanatory diagram showing classification conditions for rectangular areas. 1... Image input unit, 2... Image memory unit, 3... Rectangular area coordinate extraction unit, 4...
・Rectangular area feature extraction unit, 6... Rectangular area classification unit, 6... Character string area, 7... Line segment area, 8... Photograph area , 9...Chart area,
P...Input image. Name of agent: Patent attorney Shigetaka Awano and 1 other person 1st
Figure / Diagram

Claims

[Claims]

an image information input section for inputting image information consisting of at least two sets of elements among character strings, diagrams, photographs, line segments, and noise; and an image information memory for storing the image information input to the image information input section. a rectangular area coordinate extraction unit that extracts a rectangular area of a character string, diagram, photograph, line segment, or noise from the image information stored in the image information memory unit; and a rectangular area extracted by the rectangular area coordinate extraction unit. and a rectangular area classification unit that uses the features extracted by the rectangular area feature extraction unit to classify the rectangular area into character strings, diagrams, photographs, line segments, and noise. An image processing device characterized by: