JPH02253383A - Picture processor - Google Patents

Picture processor

Info

Publication number
JPH02253383A
JPH02253383A JP1075366A JP7536689A JPH02253383A JP H02253383 A JPH02253383 A JP H02253383A JP 1075366 A JP1075366 A JP 1075366A JP 7536689 A JP7536689 A JP 7536689A JP H02253383 A JPH02253383 A JP H02253383A
Authority
JP
Japan
Prior art keywords
rectangular area
noise
extracted
character
character string
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP1075366A
Other languages
Japanese (ja)
Other versions
JP2939985B2 (en
Inventor
Gakuto Sugimoto
杉本 学人
Shinji Kondo
真司 近藤
Kouei Riku
陸 光榮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP1075366A priority Critical patent/JP2939985B2/en
Publication of JPH02253383A publication Critical patent/JPH02253383A/en
Application granted granted Critical
Publication of JP2939985B2 publication Critical patent/JP2939985B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)
  • Image Analysis (AREA)

Abstract

PURPOSE:To automatically extract and classify the areas of a character string, a graphic, a photograph, a picture, a line segment, and a noise in a picture by providing a rectangular area coordinate extracting part, a rectangular area characteristic extracting part, and a rectangular area classifying part. CONSTITUTION:A rectangular area is extracted from the picture by a rectangular area coordinate extracting part 3, the characteristic of the extracted rectangular area is extracted by a rectangular area characteristic extracting part 4, and the extracted characteristic is compared with the characteristics of the character string, graphic, photograph, line segment and noise prepared beforehand, and to which of the character string, graphic, photograph, line segment or noise the rectangular area corresponds is decided by a rectangular area classifying part 5. By using the method, every one character is segmented and recognized by character recognizing technique, respectively inherent processings are executed for the graphic, photograph, line segment and noise areas, and thus the input picture can be more flexibly processed.

Description

【発明の詳細な説明】 産業上の利用分野 新聞、雑誌等の不特定な書式の文書から文字列。[Detailed description of the invention] Industrial applications Character strings from documents in unspecified formats such as newspapers and magazines.

図表、写真、線分、ノイズの領域を抽出し、分類する画
像処理装置に関するものである。
The present invention relates to an image processing device that extracts and classifies regions of charts, photographs, line segments, and noise.

従来の技術 文字1図形の混在する画像から、文字領域7図形領域を
切り分ける画像処理装置には、入力画像を表示しオペレ
ータがマウスなどを使用して指定するものと、オペレー
タの介在なくして自動的に行うものがある。オペレータ
の介在なくして自動的に行うものには、画像全体を文字
と図形に区別することなく所定のフォーマットに基づき
強制的に文字として1文字毎の小領域に切り出し、すで
に知られているパターン認識技術を用いて認識し、その
1文字毎の小領域の認識結果が文字として認識可能であ
るか否かを判定し、その判定結果を用いて1文字毎の小
領域どうしの連続性を調べて文字領域と図形領域を分類
していた(例えば、特開昭61−11888号公報)。
Conventional technical image processing devices that separate character areas and seven graphic areas from an image containing a mixture of one character and one figure include those that display the input image and allow an operator to specify it using a mouse, and those that automatically specify the area without operator intervention. There is something to do. Automatic pattern recognition without operator intervention involves forcibly cutting out the entire image into small areas for each character based on a predetermined format without distinguishing between characters and figures. technology is used to recognize each character, the recognition results of the small areas for each character are determined whether or not they can be recognized as characters, and the continuity of the small areas for each character is examined using the determination results. Character areas and graphic areas were classified (for example, Japanese Patent Laid-Open No. 11888/1988).

発明が解決しようとする課題 しかしながら、上記のような従来の技術では、文字認識
処理のための文字領域と文字以外の領域の分類に主眼が
おかれており、画像内の文字以外の領域は図表、写真、
線分、ノイズの領域というように、細かく分類すること
ができないという欠点を有していた。
Problems to be Solved by the Invention However, in the above-mentioned conventional techniques, the main focus is on classifying text areas and non-text areas for character recognition processing, and non-text areas in an image are classified into figures, graphs, etc. ,photograph,
This method has the disadvantage of not being able to classify finely into areas such as line segments and noise areas.

本発明はかかる点に鑑みてなされたものであり、画像内
の文字列2図表、写真、線分、ノイズの領域を簡易な方
法で、自動的に抽出し分類する画像処理装置を提供する
ことを目的としている。
The present invention has been made in view of the above points, and an object of the present invention is to provide an image processing device that automatically extracts and classifies character strings, graphs, photographs, line segments, and noise regions in an image using a simple method. It is an object.

課題を解決するための手段 本発明は上記目的を達成するために、画像から文字列9
図表、写真、線分、ノイズの矩形領域を抽出する矩形領
域座標抽出部と、前記矩形領域座標抽出部で抽出した矩
形領域の特徴”を抽出する矩形領域特徴抽出部と、前記
矩形領域特徴抽出部から抽出した特徴を用いて、矩形領
域を文字列9図表、写真、線分、ノイズに分類する矩形
領域分類部を備えた画像処理装置である。
Means for Solving the Problems In order to achieve the above object, the present invention provides character strings 9 from an image.
a rectangular area coordinate extraction unit that extracts rectangular areas of diagrams, photographs, line segments, and noise; a rectangular area feature extraction unit that extracts “features of the rectangular area extracted by the rectangular area coordinate extraction unit”; and the rectangular area feature extraction unit. This image processing device includes a rectangular area classification unit that classifies a rectangular area into character strings, graphs, photographs, line segments, and noise using features extracted from the area.

作  用 本発明は上記の構成により、画像から矩形領域座標抽出
部で矩形領域を抽出し、抽出した矩形領域に対し矩形領
域特徴抽出部で特徴を抽出し、抽出した特徴を矩形領域
分類部で文字列9図表、写真、線分、ノイズそれぞれに
あらかじめ用意した特徴と比較することにより、矩形領
域が文字列。
According to the above configuration, the present invention extracts a rectangular area from an image using the rectangular area coordinate extraction unit, extracts features from the extracted rectangular area using the rectangular area feature extraction unit, and uses the extracted features in the rectangular area classification unit. Character String 9 By comparing the features prepared in advance for each diagram, photograph, line segment, and noise, a rectangular area is transformed into a character string.

図表、写真、線分、ノイズのいずれかに該当するかを判
定する。
Determine whether it corresponds to a diagram, photograph, line segment, or noise.

実施例 以下、本発明の実施例について図面を参照しながら説明
する。
EXAMPLES Hereinafter, examples of the present invention will be described with reference to the drawings.

第1図は、本発明による画像処理装置の一実施例の構成
図である。1は画像入力部であシ文字列。
FIG. 1 is a block diagram of an embodiment of an image processing apparatus according to the present invention. 1 is a character string in the image input section.

図表、写真、線分、ノイズを含む画像を走査し、2値信
号で画像メモリ部2に格納する。3は矩形領域座標抽出
部であシ文字列1図表、写真、a分。
Images including charts, photographs, line segments, and noise are scanned and stored in the image memory section 2 as binary signals. 3 is a rectangular area coordinate extraction section with character string 1 chart, photo, a minute.

ノイズを囲む、最小の矩形領域座標を抽出する。Extract the coordinates of the smallest rectangular area surrounding the noise.

4は矩形領域特徴抽出部であり、矩形領域座標抽出部3
で抽出した文字列2図表、写真、線分、ノイズを囲む矩
形領域の特徴を抽出する。5は矩形領域分類部であり、
矩形領域特徴抽出部4で抽出分類する。
4 is a rectangular area feature extraction unit, and rectangular area coordinate extraction unit 3
Extract the characteristics of the rectangular area surrounding the extracted character string 2 charts, photographs, line segments, and noise. 5 is a rectangular area classification unit;
The rectangular area feature extraction unit 4 performs extraction and classification.

以上のように構成された画像処理装置について、第2図
に示す入力画像Pを例に説明する。
The image processing apparatus configured as described above will be explained using an input image P shown in FIG. 2 as an example.

画像入力部1から、入力された画像Pは文字列。The image P input from the image input unit 1 is a character string.

図表、写真、線分、ノイズ部の黒画素を1、背景部の白
画素を0の2値データで画像メモリ部2に蓄えられる。
Binary data of 1 for black pixels in graphs, photographs, line segments, and noise areas and 0 for white pixels in background areas is stored in the image memory unit 2.

矩形領域座標抽出部3では、画像メモリ部2に蓄えられ
ている入力画像Pを横方向に走査して黒画素間の距離が
あらかじめ定めたしきい値R1以下の場合、その黒画素
どうしは連結しているものとする。同様に画像メモリ部
2に蓄えられている入力画像Pを縦方向に走査して黒画
素間の距離があらかじめ定めたしきい値R2以下の場合
、その黒画素どうしは連結しているものとする。横方向
The rectangular area coordinate extraction unit 3 scans the input image P stored in the image memory unit 2 in the horizontal direction, and if the distance between black pixels is equal to or less than a predetermined threshold value R1, the black pixels are connected to each other. It is assumed that Similarly, when the input image P stored in the image memory unit 2 is scanned in the vertical direction and the distance between black pixels is less than or equal to a predetermined threshold value R2, the black pixels are considered to be connected. . Lateral direction.

縦方向に走査して得られた黒画素間の連結情報に着目し
文字列2図表、写真、線分、ノイズ部分のいずれかを囲
む最小の矩形領域の左上点座標(xmi、n yYmi
 n ) r 右下点座標(xm& ! j Ym& 
! )を抽出する。第3図に文字列の矩形領域を抽出し
た状態を座標を用いて示す。第4図に第2図の入力画像
Pから矩形領域座標抽出部3で抽出したすべての矩形領
域を示す。
Focusing on the connection information between black pixels obtained by scanning in the vertical direction, we calculate the coordinates of the upper left point of the smallest rectangular area (xmi, n yYmi
n) r Lower right point coordinates (xm&! j Ym&
! ). FIG. 3 shows a state in which a rectangular region of a character string is extracted using coordinates. FIG. 4 shows all rectangular areas extracted by the rectangular area coordinate extraction unit 3 from the input image P of FIG. 2.

矩形領域特徴抽出部4では、矩形領域座標抽出部3で抽
出した文字列9図表、写真、線分、ノイズの矩形領域座
標から、矩形領域の幅Wを式(1)によって求める。
The rectangular area feature extraction unit 4 calculates the width W of the rectangular area from the rectangular area coordinates of the character strings 9 charts, photographs, line segments, and noise extracted by the rectangular area coordinate extraction unit 3 using equation (1).

W=XmaニーXm1n+1    ・・・・・・・・
・・・・(1)同様に矩形領域座標から矩形領域の高さ
Hを式@)によって求める。
W=XmanyXm1n+1 ・・・・・・・・・
(1) Similarly, the height H of the rectangular area is determined from the rectangular area coordinates using the formula @).

H=Y   −Y   +1    ・・・・・・・・
・・・・C2)maw   min 矩形領域の幅Wと高さHから、矩形領域の文字列方向垂
直高さVを式(3)により、て求める。
H=Y −Y +1 ・・・・・・・・・
...C2) maw min From the width W and height H of the rectangular area, the vertical height V of the rectangular area in the character string direction is determined using equation (3).

矩形領域の幅Wと高さHから、矩形領域サイズSを式(
イ)によって求める。
From the width W and height H of the rectangular area, calculate the rectangular area size S using the formula (
A).

S=wH・・・・・・・・・・・・←)矩形領域の幅W
と高さHから、矩形領域縦横比Eを式(5)によって求
める。
S=wH・・・・・・・・・・・・←) Width W of rectangular area
and the height H, the rectangular area aspect ratio E is determined by equation (5).

矩形領域サイズSと矩形領域内の黒画素数Bから矩形領
域の黒画素密度りを式(6)によって求める。
The black pixel density of the rectangular area is calculated from the rectangular area size S and the number B of black pixels in the rectangular area using equation (6).

D=丁       ・・・・・・・・・・・塵)矩形
領域分類部6では、−数的な文書の文字列。
D = ding (dust) In the rectangular area classification unit 6, - a character string of a numerical document.

図表、写真、線分、ノイズは矩形領域特徴抽出部4で抽
出した矩形領域の文字列方向垂直高さV。
For charts, photographs, line segments, and noise, the vertical height V in the character string direction of the rectangular area extracted by the rectangular area feature extraction unit 4 is used.

矩形領域サイズS、矩形領域縦横比E、矩形領域の黒画
素密度りが特定の性質を持つことを利用して分類を行う
。具体的には、矩形領域の文字列方向垂直高さVがあら
かじめ定めたしきい値vthr以上の場合、その矩形領
域は図表、または写真と分類され、Vがvthr未溝の
場合は文字列、線分。
Classification is performed using the fact that the rectangular area size S, the rectangular area aspect ratio E, and the black pixel density of the rectangular area have specific properties. Specifically, if the vertical height V of a rectangular area in the character string direction is greater than or equal to a predetermined threshold value vthr, the rectangular area is classified as a diagram or a photograph, and if V is not vthr, then the character string, line segment.

ノイズのうちいずれかであると分類される。文字列、線
分、ノイズのうちいずれかであると分類された矩形領域
は、矩形領域サイズSがあらかじめ定めたしきい値8t
hr以上の場合は文字列、線分と分類され、Sが8th
r未満の場合は、ノイズであると分類される。文字列、
線分と分類された矩形領域は、矩形領域縦横比Eがあら
かじめ定めたしきい値Ethr以上の場合は、線分と分
類され、EがEthr未滴の場合は文字列と分類される
。図表または写真と分類された矩形領域は、矩形領域の
黒画素密度りがあらかじめ定めたしきい値Dthr以上
の場合は、写真と分類され、DがDthr未満の場合は
図表と分類される。第5図に矩形領域の分類条件の説明
図を示す。
It is classified as either noise. A rectangular area classified as a character string, line segment, or noise is determined by a predetermined threshold value of 8t for the rectangular area size S.
If it is hr or more, it is classified as a character string or line segment, and S is 8th
If it is less than r, it is classified as noise. string,
A rectangular area classified as a line segment is classified as a line segment if the rectangular area aspect ratio E is equal to or greater than a predetermined threshold value Ethr, and is classified as a character string if E is not filled with Ethr. A rectangular area classified as a diagram or a photograph is classified as a photograph if the black pixel density of the rectangular area is greater than or equal to a predetermined threshold value Dthr, and is classified as a diagram if D is less than Dthr. FIG. 5 shows an explanatory diagram of the classification conditions for rectangular areas.

以上のように構成された画像処理装置では文字列2図表
、写真、線分、ノイズの混在する画像から文字列9図表
、写真、線分、ノイズを抽出し、分類することができる
The image processing apparatus configured as described above can extract and classify character strings 9, charts, photographs, line segments, and noise from an image containing character strings 2, charts, photographs, line segments, and noise.

尚、本実施例の画像処理装置を文字認識装置に接続する
ことにより、文字列と分類された矩形領域から文字を切
り出し、認識することができる。
Note that by connecting the image processing device of this embodiment to a character recognition device, characters can be extracted and recognized from rectangular areas classified as character strings.

発明の詳細 な説明したように、本発明によれば不特定な書式の文書
の入力画像から簡易な方法で自動的に文字列9図表、写
真、線分、ノイズの領域を抽出することができる。この
方法を使用して、文字列の領域はすでに知られている文
字認識技術によって1文字毎に切り出して認識し、図表
、写真、線分、ノイズの領域はそれぞれ固有の処理を行
うことによって入力画像をより柔軟に加工することがで
き、その実用的効果は大きい。
As described in detail, according to the present invention, text strings, graphs, photographs, line segments, and noise regions can be automatically extracted from an input image of a document in an unspecified format using a simple method. . Using this method, character string regions are extracted and recognized character by character using already known character recognition technology, and diagrams, photographs, line segments, and noise regions are each input by performing unique processing. Images can be processed more flexibly, which has great practical effects.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明における一実施例の画像処理装置の構成
図、第2図は入力画像の説明図、第3図は抽出した文字
列の矩形領域座標を示す説明図、第4図は第2図の入力
画像に対して抽出したすべての矩形領域を示す説明図、
第5図は矩形領域の分類条件を示す説明図である。 1・・・・・・画像入力部、2・・・・・・画像メモリ
部、3・・・・・・矩形領域座標抽出部、4・・・・・
・矩形領域特徴抽出部、6・・・・・・矩形領域分類部
、6・・・・・・文字列領域、7・・・・・・線分領域
、8・・・・・・写真領域、9・・・・・・図表領域、
P・・・・・・入力画像。 代理人の氏名 弁理士 粟 野 重 孝 ほか1名第1
図 / 第 図
FIG. 1 is a configuration diagram of an image processing apparatus according to an embodiment of the present invention, FIG. 2 is an explanatory diagram of an input image, FIG. 3 is an explanatory diagram showing rectangular area coordinates of extracted character strings, and FIG. An explanatory diagram showing all rectangular areas extracted for the input image in Figure 2,
FIG. 5 is an explanatory diagram showing classification conditions for rectangular areas. 1... Image input unit, 2... Image memory unit, 3... Rectangular area coordinate extraction unit, 4...
・Rectangular area feature extraction unit, 6... Rectangular area classification unit, 6... Character string area, 7... Line segment area, 8... Photograph area , 9...Chart area,
P...Input image. Name of agent: Patent attorney Shigetaka Awano and 1 other person 1st
Figure / Diagram

Claims (1)

【特許請求の範囲】[Claims] 文字列、図表、写真、線分、ノイズのうち、少なくとも
2組の要素からなる画像情報を入力する画像情報入力部
と、前記画像情報入力部に入力された前記画像情報を格
納する画像情報メモリ部と、前記画像情報メモリ部に格
納された画像情報から文字列、図表、写真、線分、ノイ
ズの矩形領域を抽出する矩形領域座標抽出部と、前記矩
形領域座標抽出部で抽出した矩形領域の特徴を抽出する
矩形領域特徴抽出部と、前記矩形領域特徴抽出部で抽出
した特徴を用いて、矩形領域を文字列、図表、写真、線
分、ノイズに分類する矩形領域分類部を有することを特
徴とする画像処理装置。
an image information input section for inputting image information consisting of at least two sets of elements among character strings, diagrams, photographs, line segments, and noise; and an image information memory for storing the image information input to the image information input section. a rectangular area coordinate extraction unit that extracts a rectangular area of a character string, diagram, photograph, line segment, or noise from the image information stored in the image information memory unit; and a rectangular area extracted by the rectangular area coordinate extraction unit. and a rectangular area classification unit that uses the features extracted by the rectangular area feature extraction unit to classify the rectangular area into character strings, diagrams, photographs, line segments, and noise. An image processing device characterized by:
JP1075366A 1989-03-27 1989-03-27 Image processing device Expired - Fee Related JP2939985B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1075366A JP2939985B2 (en) 1989-03-27 1989-03-27 Image processing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1075366A JP2939985B2 (en) 1989-03-27 1989-03-27 Image processing device

Publications (2)

Publication Number Publication Date
JPH02253383A true JPH02253383A (en) 1990-10-12
JP2939985B2 JP2939985B2 (en) 1999-08-25

Family

ID=13574151

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1075366A Expired - Fee Related JP2939985B2 (en) 1989-03-27 1989-03-27 Image processing device

Country Status (1)

Country Link
JP (1) JP2939985B2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0728940A (en) * 1993-06-30 1995-01-31 Internatl Business Mach Corp <Ibm> Image segmentation for document processing and classification of image element
US5696843A (en) * 1994-06-22 1997-12-09 Sharp Kabushiki Kaisha Automatic image quality controlling apparatus for use in an electronic copier
US5757957A (en) * 1991-11-29 1998-05-26 Ricoh Company, Ltd. Apparatus and method for area separation for image, having improved separation accuracy
US6771842B1 (en) 1998-05-28 2004-08-03 Fujitsu Limited Document image skew detection method

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5757957A (en) * 1991-11-29 1998-05-26 Ricoh Company, Ltd. Apparatus and method for area separation for image, having improved separation accuracy
JPH0728940A (en) * 1993-06-30 1995-01-31 Internatl Business Mach Corp <Ibm> Image segmentation for document processing and classification of image element
US5696843A (en) * 1994-06-22 1997-12-09 Sharp Kabushiki Kaisha Automatic image quality controlling apparatus for use in an electronic copier
US6771842B1 (en) 1998-05-28 2004-08-03 Fujitsu Limited Document image skew detection method

Also Published As

Publication number Publication date
JP2939985B2 (en) 1999-08-25

Similar Documents

Publication Publication Date Title
US7697776B2 (en) Model-based dewarping method and apparatus
US6738154B1 (en) Locating the position and orientation of multiple objects with a smart platen
US6574375B1 (en) Method for detecting inverted text images on a digital scanning device
US7016536B1 (en) Method and apparatus for automatic cleaning and enhancing of scanned documents
US5854853A (en) Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks
JP2940936B2 (en) Tablespace identification method
EP0843277A2 (en) Page analysis system
US6351559B1 (en) User-enclosed region extraction from scanned document images
US6393150B1 (en) Region-based image binarization system
EP0472313B1 (en) Image processing method and apparatus therefor
Clark et al. Combining statistical measures to find image text regions
US6289120B1 (en) Method and system for processing images of forms which have irregular construction and/or determining whether characters are interior to a form
Dori et al. Segmentation and recognition of dimensioning text from engineering drawings
JPH02253383A (en) Picture processor
JP3285686B2 (en) Area division method
JPH03126181A (en) Area dividing method for document image
EP0975146A1 (en) Locating the position and orientation of multiple objects with a smart platen
JPH0548510B2 (en)
EP0767941B1 (en) Automatic determination of landscape scan in binary images
JPS5949671A (en) Optical character reader
JP2877548B2 (en) Document image attribute discrimination method
JP3162414B2 (en) Ruled line recognition method and table processing method
JPH0535914A (en) Picture inclination detection method
JP2983448B2 (en) Drawing recognition method
JPH0728934A (en) Document image processor

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees