JPH02253383A - Picture processor - Google Patents
Picture processorInfo
- Publication number
- JPH02253383A JPH02253383A JP1075366A JP7536689A JPH02253383A JP H02253383 A JPH02253383 A JP H02253383A JP 1075366 A JP1075366 A JP 1075366A JP 7536689 A JP7536689 A JP 7536689A JP H02253383 A JPH02253383 A JP H02253383A
- Authority
- JP
- Japan
- Prior art keywords
- rectangular area
- noise
- extracted
- character
- character string
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012545 processing Methods 0.000 claims abstract description 13
- 239000000284 extract Substances 0.000 claims abstract description 11
- 238000000605 extraction Methods 0.000 claims description 21
- 238000010586 diagram Methods 0.000 claims description 18
- 238000000034 method Methods 0.000 abstract description 6
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
Landscapes
- Character Input (AREA)
- Image Analysis (AREA)
Abstract
Description
【発明の詳細な説明】 産業上の利用分野 新聞、雑誌等の不特定な書式の文書から文字列。[Detailed description of the invention] Industrial applications Character strings from documents in unspecified formats such as newspapers and magazines.
図表、写真、線分、ノイズの領域を抽出し、分類する画
像処理装置に関するものである。The present invention relates to an image processing device that extracts and classifies regions of charts, photographs, line segments, and noise.
従来の技術
文字1図形の混在する画像から、文字領域7図形領域を
切り分ける画像処理装置には、入力画像を表示しオペレ
ータがマウスなどを使用して指定するものと、オペレー
タの介在なくして自動的に行うものがある。オペレータ
の介在なくして自動的に行うものには、画像全体を文字
と図形に区別することなく所定のフォーマットに基づき
強制的に文字として1文字毎の小領域に切り出し、すで
に知られているパターン認識技術を用いて認識し、その
1文字毎の小領域の認識結果が文字として認識可能であ
るか否かを判定し、その判定結果を用いて1文字毎の小
領域どうしの連続性を調べて文字領域と図形領域を分類
していた(例えば、特開昭61−11888号公報)。Conventional technical image processing devices that separate character areas and seven graphic areas from an image containing a mixture of one character and one figure include those that display the input image and allow an operator to specify it using a mouse, and those that automatically specify the area without operator intervention. There is something to do. Automatic pattern recognition without operator intervention involves forcibly cutting out the entire image into small areas for each character based on a predetermined format without distinguishing between characters and figures. technology is used to recognize each character, the recognition results of the small areas for each character are determined whether or not they can be recognized as characters, and the continuity of the small areas for each character is examined using the determination results. Character areas and graphic areas were classified (for example, Japanese Patent Laid-Open No. 11888/1988).
発明が解決しようとする課題
しかしながら、上記のような従来の技術では、文字認識
処理のための文字領域と文字以外の領域の分類に主眼が
おかれており、画像内の文字以外の領域は図表、写真、
線分、ノイズの領域というように、細かく分類すること
ができないという欠点を有していた。Problems to be Solved by the Invention However, in the above-mentioned conventional techniques, the main focus is on classifying text areas and non-text areas for character recognition processing, and non-text areas in an image are classified into figures, graphs, etc. ,photograph,
This method has the disadvantage of not being able to classify finely into areas such as line segments and noise areas.
本発明はかかる点に鑑みてなされたものであり、画像内
の文字列2図表、写真、線分、ノイズの領域を簡易な方
法で、自動的に抽出し分類する画像処理装置を提供する
ことを目的としている。The present invention has been made in view of the above points, and an object of the present invention is to provide an image processing device that automatically extracts and classifies character strings, graphs, photographs, line segments, and noise regions in an image using a simple method. It is an object.
課題を解決するための手段
本発明は上記目的を達成するために、画像から文字列9
図表、写真、線分、ノイズの矩形領域を抽出する矩形領
域座標抽出部と、前記矩形領域座標抽出部で抽出した矩
形領域の特徴”を抽出する矩形領域特徴抽出部と、前記
矩形領域特徴抽出部から抽出した特徴を用いて、矩形領
域を文字列9図表、写真、線分、ノイズに分類する矩形
領域分類部を備えた画像処理装置である。Means for Solving the Problems In order to achieve the above object, the present invention provides character strings 9 from an image.
a rectangular area coordinate extraction unit that extracts rectangular areas of diagrams, photographs, line segments, and noise; a rectangular area feature extraction unit that extracts “features of the rectangular area extracted by the rectangular area coordinate extraction unit”; and the rectangular area feature extraction unit. This image processing device includes a rectangular area classification unit that classifies a rectangular area into character strings, graphs, photographs, line segments, and noise using features extracted from the area.
作 用
本発明は上記の構成により、画像から矩形領域座標抽出
部で矩形領域を抽出し、抽出した矩形領域に対し矩形領
域特徴抽出部で特徴を抽出し、抽出した特徴を矩形領域
分類部で文字列9図表、写真、線分、ノイズそれぞれに
あらかじめ用意した特徴と比較することにより、矩形領
域が文字列。According to the above configuration, the present invention extracts a rectangular area from an image using the rectangular area coordinate extraction unit, extracts features from the extracted rectangular area using the rectangular area feature extraction unit, and uses the extracted features in the rectangular area classification unit. Character String 9 By comparing the features prepared in advance for each diagram, photograph, line segment, and noise, a rectangular area is transformed into a character string.
図表、写真、線分、ノイズのいずれかに該当するかを判
定する。Determine whether it corresponds to a diagram, photograph, line segment, or noise.
実施例
以下、本発明の実施例について図面を参照しながら説明
する。EXAMPLES Hereinafter, examples of the present invention will be described with reference to the drawings.
第1図は、本発明による画像処理装置の一実施例の構成
図である。1は画像入力部であシ文字列。FIG. 1 is a block diagram of an embodiment of an image processing apparatus according to the present invention. 1 is a character string in the image input section.
図表、写真、線分、ノイズを含む画像を走査し、2値信
号で画像メモリ部2に格納する。3は矩形領域座標抽出
部であシ文字列1図表、写真、a分。Images including charts, photographs, line segments, and noise are scanned and stored in the image memory section 2 as binary signals. 3 is a rectangular area coordinate extraction section with character string 1 chart, photo, a minute.
ノイズを囲む、最小の矩形領域座標を抽出する。Extract the coordinates of the smallest rectangular area surrounding the noise.
4は矩形領域特徴抽出部であり、矩形領域座標抽出部3
で抽出した文字列2図表、写真、線分、ノイズを囲む矩
形領域の特徴を抽出する。5は矩形領域分類部であり、
矩形領域特徴抽出部4で抽出分類する。4 is a rectangular area feature extraction unit, and rectangular area coordinate extraction unit 3
Extract the characteristics of the rectangular area surrounding the extracted character string 2 charts, photographs, line segments, and noise. 5 is a rectangular area classification unit;
The rectangular area feature extraction unit 4 performs extraction and classification.
以上のように構成された画像処理装置について、第2図
に示す入力画像Pを例に説明する。The image processing apparatus configured as described above will be explained using an input image P shown in FIG. 2 as an example.
画像入力部1から、入力された画像Pは文字列。The image P input from the image input unit 1 is a character string.
図表、写真、線分、ノイズ部の黒画素を1、背景部の白
画素を0の2値データで画像メモリ部2に蓄えられる。Binary data of 1 for black pixels in graphs, photographs, line segments, and noise areas and 0 for white pixels in background areas is stored in the image memory unit 2.
矩形領域座標抽出部3では、画像メモリ部2に蓄えられ
ている入力画像Pを横方向に走査して黒画素間の距離が
あらかじめ定めたしきい値R1以下の場合、その黒画素
どうしは連結しているものとする。同様に画像メモリ部
2に蓄えられている入力画像Pを縦方向に走査して黒画
素間の距離があらかじめ定めたしきい値R2以下の場合
、その黒画素どうしは連結しているものとする。横方向
。The rectangular area coordinate extraction unit 3 scans the input image P stored in the image memory unit 2 in the horizontal direction, and if the distance between black pixels is equal to or less than a predetermined threshold value R1, the black pixels are connected to each other. It is assumed that Similarly, when the input image P stored in the image memory unit 2 is scanned in the vertical direction and the distance between black pixels is less than or equal to a predetermined threshold value R2, the black pixels are considered to be connected. . Lateral direction.
縦方向に走査して得られた黒画素間の連結情報に着目し
文字列2図表、写真、線分、ノイズ部分のいずれかを囲
む最小の矩形領域の左上点座標(xmi、n yYmi
n ) r 右下点座標(xm& ! j Ym&
! )を抽出する。第3図に文字列の矩形領域を抽出し
た状態を座標を用いて示す。第4図に第2図の入力画像
Pから矩形領域座標抽出部3で抽出したすべての矩形領
域を示す。Focusing on the connection information between black pixels obtained by scanning in the vertical direction, we calculate the coordinates of the upper left point of the smallest rectangular area (xmi, n yYmi
n) r Lower right point coordinates (xm&! j Ym&
! ). FIG. 3 shows a state in which a rectangular region of a character string is extracted using coordinates. FIG. 4 shows all rectangular areas extracted by the rectangular area coordinate extraction unit 3 from the input image P of FIG. 2.
矩形領域特徴抽出部4では、矩形領域座標抽出部3で抽
出した文字列9図表、写真、線分、ノイズの矩形領域座
標から、矩形領域の幅Wを式(1)によって求める。The rectangular area feature extraction unit 4 calculates the width W of the rectangular area from the rectangular area coordinates of the character strings 9 charts, photographs, line segments, and noise extracted by the rectangular area coordinate extraction unit 3 using equation (1).
W=XmaニーXm1n+1 ・・・・・・・・
・・・・(1)同様に矩形領域座標から矩形領域の高さ
Hを式@)によって求める。W=XmanyXm1n+1 ・・・・・・・・・
(1) Similarly, the height H of the rectangular area is determined from the rectangular area coordinates using the formula @).
H=Y −Y +1 ・・・・・・・・
・・・・C2)maw min
矩形領域の幅Wと高さHから、矩形領域の文字列方向垂
直高さVを式(3)により、て求める。H=Y −Y +1 ・・・・・・・・・
...C2) maw min From the width W and height H of the rectangular area, the vertical height V of the rectangular area in the character string direction is determined using equation (3).
矩形領域の幅Wと高さHから、矩形領域サイズSを式(
イ)によって求める。From the width W and height H of the rectangular area, calculate the rectangular area size S using the formula (
A).
S=wH・・・・・・・・・・・・←)矩形領域の幅W
と高さHから、矩形領域縦横比Eを式(5)によって求
める。S=wH・・・・・・・・・・・・←) Width W of rectangular area
and the height H, the rectangular area aspect ratio E is determined by equation (5).
矩形領域サイズSと矩形領域内の黒画素数Bから矩形領
域の黒画素密度りを式(6)によって求める。The black pixel density of the rectangular area is calculated from the rectangular area size S and the number B of black pixels in the rectangular area using equation (6).
D=丁 ・・・・・・・・・・・塵)矩形
領域分類部6では、−数的な文書の文字列。D = ding (dust) In the rectangular area classification unit 6, - a character string of a numerical document.
図表、写真、線分、ノイズは矩形領域特徴抽出部4で抽
出した矩形領域の文字列方向垂直高さV。For charts, photographs, line segments, and noise, the vertical height V in the character string direction of the rectangular area extracted by the rectangular area feature extraction unit 4 is used.
矩形領域サイズS、矩形領域縦横比E、矩形領域の黒画
素密度りが特定の性質を持つことを利用して分類を行う
。具体的には、矩形領域の文字列方向垂直高さVがあら
かじめ定めたしきい値vthr以上の場合、その矩形領
域は図表、または写真と分類され、Vがvthr未溝の
場合は文字列、線分。Classification is performed using the fact that the rectangular area size S, the rectangular area aspect ratio E, and the black pixel density of the rectangular area have specific properties. Specifically, if the vertical height V of a rectangular area in the character string direction is greater than or equal to a predetermined threshold value vthr, the rectangular area is classified as a diagram or a photograph, and if V is not vthr, then the character string, line segment.
ノイズのうちいずれかであると分類される。文字列、線
分、ノイズのうちいずれかであると分類された矩形領域
は、矩形領域サイズSがあらかじめ定めたしきい値8t
hr以上の場合は文字列、線分と分類され、Sが8th
r未満の場合は、ノイズであると分類される。文字列、
線分と分類された矩形領域は、矩形領域縦横比Eがあら
かじめ定めたしきい値Ethr以上の場合は、線分と分
類され、EがEthr未滴の場合は文字列と分類される
。図表または写真と分類された矩形領域は、矩形領域の
黒画素密度りがあらかじめ定めたしきい値Dthr以上
の場合は、写真と分類され、DがDthr未満の場合は
図表と分類される。第5図に矩形領域の分類条件の説明
図を示す。It is classified as either noise. A rectangular area classified as a character string, line segment, or noise is determined by a predetermined threshold value of 8t for the rectangular area size S.
If it is hr or more, it is classified as a character string or line segment, and S is 8th
If it is less than r, it is classified as noise. string,
A rectangular area classified as a line segment is classified as a line segment if the rectangular area aspect ratio E is equal to or greater than a predetermined threshold value Ethr, and is classified as a character string if E is not filled with Ethr. A rectangular area classified as a diagram or a photograph is classified as a photograph if the black pixel density of the rectangular area is greater than or equal to a predetermined threshold value Dthr, and is classified as a diagram if D is less than Dthr. FIG. 5 shows an explanatory diagram of the classification conditions for rectangular areas.
以上のように構成された画像処理装置では文字列2図表
、写真、線分、ノイズの混在する画像から文字列9図表
、写真、線分、ノイズを抽出し、分類することができる
。The image processing apparatus configured as described above can extract and classify character strings 9, charts, photographs, line segments, and noise from an image containing character strings 2, charts, photographs, line segments, and noise.
尚、本実施例の画像処理装置を文字認識装置に接続する
ことにより、文字列と分類された矩形領域から文字を切
り出し、認識することができる。Note that by connecting the image processing device of this embodiment to a character recognition device, characters can be extracted and recognized from rectangular areas classified as character strings.
発明の詳細
な説明したように、本発明によれば不特定な書式の文書
の入力画像から簡易な方法で自動的に文字列9図表、写
真、線分、ノイズの領域を抽出することができる。この
方法を使用して、文字列の領域はすでに知られている文
字認識技術によって1文字毎に切り出して認識し、図表
、写真、線分、ノイズの領域はそれぞれ固有の処理を行
うことによって入力画像をより柔軟に加工することがで
き、その実用的効果は大きい。As described in detail, according to the present invention, text strings, graphs, photographs, line segments, and noise regions can be automatically extracted from an input image of a document in an unspecified format using a simple method. . Using this method, character string regions are extracted and recognized character by character using already known character recognition technology, and diagrams, photographs, line segments, and noise regions are each input by performing unique processing. Images can be processed more flexibly, which has great practical effects.
第1図は本発明における一実施例の画像処理装置の構成
図、第2図は入力画像の説明図、第3図は抽出した文字
列の矩形領域座標を示す説明図、第4図は第2図の入力
画像に対して抽出したすべての矩形領域を示す説明図、
第5図は矩形領域の分類条件を示す説明図である。
1・・・・・・画像入力部、2・・・・・・画像メモリ
部、3・・・・・・矩形領域座標抽出部、4・・・・・
・矩形領域特徴抽出部、6・・・・・・矩形領域分類部
、6・・・・・・文字列領域、7・・・・・・線分領域
、8・・・・・・写真領域、9・・・・・・図表領域、
P・・・・・・入力画像。
代理人の氏名 弁理士 粟 野 重 孝 ほか1名第1
図
/
第
図FIG. 1 is a configuration diagram of an image processing apparatus according to an embodiment of the present invention, FIG. 2 is an explanatory diagram of an input image, FIG. 3 is an explanatory diagram showing rectangular area coordinates of extracted character strings, and FIG. An explanatory diagram showing all rectangular areas extracted for the input image in Figure 2,
FIG. 5 is an explanatory diagram showing classification conditions for rectangular areas. 1... Image input unit, 2... Image memory unit, 3... Rectangular area coordinate extraction unit, 4...
・Rectangular area feature extraction unit, 6... Rectangular area classification unit, 6... Character string area, 7... Line segment area, 8... Photograph area , 9...Chart area,
P...Input image. Name of agent: Patent attorney Shigetaka Awano and 1 other person 1st
Figure / Diagram
Claims (1)
2組の要素からなる画像情報を入力する画像情報入力部
と、前記画像情報入力部に入力された前記画像情報を格
納する画像情報メモリ部と、前記画像情報メモリ部に格
納された画像情報から文字列、図表、写真、線分、ノイ
ズの矩形領域を抽出する矩形領域座標抽出部と、前記矩
形領域座標抽出部で抽出した矩形領域の特徴を抽出する
矩形領域特徴抽出部と、前記矩形領域特徴抽出部で抽出
した特徴を用いて、矩形領域を文字列、図表、写真、線
分、ノイズに分類する矩形領域分類部を有することを特
徴とする画像処理装置。an image information input section for inputting image information consisting of at least two sets of elements among character strings, diagrams, photographs, line segments, and noise; and an image information memory for storing the image information input to the image information input section. a rectangular area coordinate extraction unit that extracts a rectangular area of a character string, diagram, photograph, line segment, or noise from the image information stored in the image information memory unit; and a rectangular area extracted by the rectangular area coordinate extraction unit. and a rectangular area classification unit that uses the features extracted by the rectangular area feature extraction unit to classify the rectangular area into character strings, diagrams, photographs, line segments, and noise. An image processing device characterized by:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1075366A JP2939985B2 (en) | 1989-03-27 | 1989-03-27 | Image processing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1075366A JP2939985B2 (en) | 1989-03-27 | 1989-03-27 | Image processing device |
Publications (2)
Publication Number | Publication Date |
---|---|
JPH02253383A true JPH02253383A (en) | 1990-10-12 |
JP2939985B2 JP2939985B2 (en) | 1999-08-25 |
Family
ID=13574151
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP1075366A Expired - Fee Related JP2939985B2 (en) | 1989-03-27 | 1989-03-27 | Image processing device |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP2939985B2 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0728940A (en) * | 1993-06-30 | 1995-01-31 | Internatl Business Mach Corp <Ibm> | Image segmentation for document processing and classification of image element |
US5696843A (en) * | 1994-06-22 | 1997-12-09 | Sharp Kabushiki Kaisha | Automatic image quality controlling apparatus for use in an electronic copier |
US5757957A (en) * | 1991-11-29 | 1998-05-26 | Ricoh Company, Ltd. | Apparatus and method for area separation for image, having improved separation accuracy |
US6771842B1 (en) | 1998-05-28 | 2004-08-03 | Fujitsu Limited | Document image skew detection method |
-
1989
- 1989-03-27 JP JP1075366A patent/JP2939985B2/en not_active Expired - Fee Related
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5757957A (en) * | 1991-11-29 | 1998-05-26 | Ricoh Company, Ltd. | Apparatus and method for area separation for image, having improved separation accuracy |
JPH0728940A (en) * | 1993-06-30 | 1995-01-31 | Internatl Business Mach Corp <Ibm> | Image segmentation for document processing and classification of image element |
US5696843A (en) * | 1994-06-22 | 1997-12-09 | Sharp Kabushiki Kaisha | Automatic image quality controlling apparatus for use in an electronic copier |
US6771842B1 (en) | 1998-05-28 | 2004-08-03 | Fujitsu Limited | Document image skew detection method |
Also Published As
Publication number | Publication date |
---|---|
JP2939985B2 (en) | 1999-08-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7697776B2 (en) | Model-based dewarping method and apparatus | |
US6738154B1 (en) | Locating the position and orientation of multiple objects with a smart platen | |
US6574375B1 (en) | Method for detecting inverted text images on a digital scanning device | |
US7016536B1 (en) | Method and apparatus for automatic cleaning and enhancing of scanned documents | |
US5854853A (en) | Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks | |
JP2940936B2 (en) | Tablespace identification method | |
EP0843277A2 (en) | Page analysis system | |
US6351559B1 (en) | User-enclosed region extraction from scanned document images | |
US6393150B1 (en) | Region-based image binarization system | |
EP0472313B1 (en) | Image processing method and apparatus therefor | |
Clark et al. | Combining statistical measures to find image text regions | |
US6289120B1 (en) | Method and system for processing images of forms which have irregular construction and/or determining whether characters are interior to a form | |
Dori et al. | Segmentation and recognition of dimensioning text from engineering drawings | |
JPH02253383A (en) | Picture processor | |
JP3285686B2 (en) | Area division method | |
JPH03126181A (en) | Area dividing method for document image | |
EP0975146A1 (en) | Locating the position and orientation of multiple objects with a smart platen | |
JPH0548510B2 (en) | ||
EP0767941B1 (en) | Automatic determination of landscape scan in binary images | |
JPS5949671A (en) | Optical character reader | |
JP2877548B2 (en) | Document image attribute discrimination method | |
JP3162414B2 (en) | Ruled line recognition method and table processing method | |
JPH0535914A (en) | Picture inclination detection method | |
JP2983448B2 (en) | Drawing recognition method | |
JPH0728934A (en) | Document image processor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
LAPS | Cancellation because of no payment of annual fees |