JPH01266689A

JPH01266689A - Image recognition device

Info

Publication number: JPH01266689A
Application number: JP63094568A
Authority: JP
Inventors: Shinko Ishitani; 石谷　新子
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1988-04-19
Filing date: 1988-04-19
Publication date: 1989-10-24

Abstract

PURPOSE:To obtain a device to recognize a document and a graphic on an original precisely by extracting the graphic from image information by a graphic recognizing means, and extracting and recognizing a character and a mark related to the graphic by a character string recognizing means. CONSTITUTION:The original 1 is read by an image reader 2, and the image information of the original is stored in a buffer memory 3. A circular graph recognition part 4 recognizes a circular graph in the image information developed on the memory 3. A character recognition part 5 recognizes a character pattern in the image information developed in the memory 3 based on information in a character font ROM 6 and the information obtained at the recognition part 4. The character pattern recognized at the recognition part 5 reads out a corresponding character pattern from the ROM 6, and performs a processing to write it on a VRAM 7 that is a RAM for display. Also, a various kinds of patterns stored in the VRAM 7 are displayed on a CRT 9 by a graphic control GC 8. A CPU 10 controls the whole of the device.

Description

【発明の詳細な説明】［産業上の利用分野］本発明は画像認識装置、特に原稿から入力された画像情
報から文字を認識する能力をもつ画像認識装置に関する
ものである。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to an image recognition device, and particularly to an image recognition device capable of recognizing characters from image information input from a document.

［従来の技術］従来から、原稿読み取り装置から入力した画像情報中に
あるパターンから文字を認識する文字認識装置がある。[Prior Art] Conventionally, there have been character recognition devices that recognize characters from patterns in image information input from a document reading device.

この文字認識装置により、手書きした原稿等を活字等の
清書体にするばかりでなく、認識したパターンをコード
化（多くの場合にはＪＩＳコード化）することにより、
例えば外部記憶装置に記憶するときに少量のデータに圧
縮することもできる。This character recognition device not only converts handwritten manuscripts into typeface, but also converts recognized patterns into codes (JIS codes in most cases).
For example, the data can be compressed into a small amount of data when stored in an external storage device.

［発明が解決しようとする課題］しかし、この種の装置で認識対象となる原稿あるいは画
像情報は、文字記号パターンのみの存在を前提としてい
るものである。従って、図形パターンのみの原稿、ある
いは文字記号パターンと図・形パターンとが混在してい
る原稿の場合には、図形パターンを認識できないといっ
た欠点がある。[Problems to be Solved by the Invention] However, the document or image information to be recognized by this type of device is premised on the existence of only character symbol patterns. Therefore, in the case of a document containing only graphic patterns or a document containing a mixture of character/symbol patterns and figure/shape patterns, there is a drawback that the graphic pattern cannot be recognized.

更に、図形パターンを認識できるものでも、図形の付近
に存在する文字列が図形の一部であるか文章の一部であ
るかを区別できないため、文章の認識及び図形の認識が
正確にできないという欠点がある。Furthermore, even if a device can recognize graphic patterns, it cannot distinguish whether a character string near a graphic is part of the graphic or a text, making it unable to accurately recognize text and shapes. There are drawbacks.

本発明は、前記従来例の欠点を除去し、原稿上の文書と
図形とを正確に認識する画像認識装置を提供する。The present invention eliminates the drawbacks of the conventional example and provides an image recognition device that accurately recognizes documents and figures on a manuscript.

また、図形が円グラフの場合に、引き出し線を抽出して
認識することにより、円グラフの付近にある文字列を認
識する画像認識装置を提供する。Furthermore, when the figure is a pie chart, an image recognition device is provided that recognizes character strings near the pie chart by extracting and recognizing leader lines.

更に、図形が円グラフの場合に、仕切り線と円弧で囲ま
れた扇形領域を抽出し、その延長領域に文字領域全体又
は１部が含まれることを調べることにより、円グラフの
外側付近にある文字列を認識する画像認識装置を提供す
る。Furthermore, when the figure is a pie chart, by extracting the fan-shaped area surrounded by the partition line and arc and checking whether the entire character area or a part of the character area is included in the extended area, An image recognition device that recognizes character strings is provided.

［課題を解決するための手段］この課題を解決するために、本発明の画像認識装置は、
文章と図形とが混在する画像情報を入力する入力手段と
、該入力手段より入力された前記画像情報から図形を抽
出して、該図形の特徴を認識する図形認識手段と、該図
形認識手段の認識結果に基づいて、前記図形と関連する
文字及び記号を抽出することにより、前記文字及び記号
を認識する文字列認識手段とを備える。[Means for Solving the Problem] In order to solve this problem, the image recognition device of the present invention has the following features:
an input means for inputting image information in which text and figures are mixed; a figure recognition means for extracting a figure from the image information inputted from the input means and recognizing characteristics of the figure; The apparatus further includes character string recognition means for recognizing the characters and symbols by extracting the characters and symbols associated with the graphics based on the recognition results.

前記図形認識手段は、円を抽出する円抽出手段と、該円
の中心と円周とを結ぶ直線を抽出して仕切り線とする仕
切り線認識手段とを備える。The figure recognition means includes a circle extraction means for extracting a circle, and a partition line recognition means for extracting a straight line connecting the center and circumference of the circle and using it as a partition line.

前記文字列認識手段は、前記仕切り線以外の直線のうち
、少なくとも一端が前記円の外にある直線を抽出して引
き出し線とする切り出し線認識手段を備える。The character string recognition means includes a cut-out line recognition means that extracts a straight line having at least one end outside the circle from among the straight lines other than the partition line and uses it as a lead line.

又、前記文字列認識手段は、前記仕切り線と円弧で囲ま
れた扇形の延長領域の文字及び記号を抽出する文字列抽
出手段を備える。Further, the character string recognition means includes a character string extraction means for extracting characters and symbols in a fan-shaped extension area surrounded by the partition line and the arc.

［作用］かかる構成において、図形認識手段により、入力手段よ
り入力された画像情報から図形を抽出し、文字列認識手
段は、該図形認識手段の認識結果に基づいて、前記図形
と関連する文字及び記号を抽出することにより、前記文
字及び記号を認識する。[Operation] In this configuration, the figure recognition means extracts a figure from the image information input from the input means, and the character string recognition means extracts the characters and characters associated with the figure based on the recognition result of the figure recognition means. The characters and symbols are recognized by extracting the symbols.

前記図形認識手段は、円抽出手段により円を抽出し、仕
切り線認識手段により仕切り線を認識し、入力された前
記画像情報から円グラフを抽出して認識する。The figure recognition means extracts a circle using a circle extraction means, recognizes a partition line using a partition line recognition means, and extracts and recognizes a pie chart from the inputted image information.

前記文字列認識手段は、切り出し線認識手段により、前
記仕切り線以外の直線のうち、少なくとも一端が前記内
の外にある直線を抽出して引き出し線とし、該引き出し
線と所定の関係にある文字及び記号を前記円グラフの比
率を示す文字列とする。The character string recognition means uses the extraction line recognition means to extract a straight line having at least one end outside the inside from among the straight lines other than the partition line as a leader line, and extracts a character having a predetermined relationship with the leader line. and symbols are character strings indicating the ratio of the pie chart.

又、前記文字列認識手段は、文字列抽出手段により、前
記仕切り線と円弧で囲まれた扇形の延長領域の文字及び
記号を抽出し、該文字及び記号を前記円グラフの比率を
示す文字列とする。Further, the character string recognition means extracts characters and symbols in a sector-shaped extension area surrounded by the partition line and the arc, and converts the characters and symbols into a character string indicating the ratio of the pie chart. shall be.

［実施例］以下、添付図面に従って、本発明の画像認識装置の実施
例を詳細に説明する。[Embodiments] Hereinafter, embodiments of the image recognition apparatus of the present invention will be described in detail with reference to the accompanying drawings.

第１図は、本実施例の画像認識装置のブロック構成図で
ある１図中、１は原稿であり、２は原稿を光学的に読み
取る画像読み取り装置である。FIG. 1 is a block diagram of the image recognition apparatus according to the present embodiment. In FIG. 1, 1 is a document, and 2 is an image reading device that optically reads the document.

３は読み取った画像情報を格納するバッファメモリ、４
はバッファメモリ３内に展開された画像情報中の円グラ
フを認識する円グラフ認識部、５は文字フォントＲＯＭ
ｅ内の情報及び円グラフ認識部４で、得られた情報を基
に、バッファメモリ３内に展開された画像情報中の文字
記号パターンを認識する文字認識部である。3 is a buffer memory for storing read image information; 4
5 is a pie chart recognition unit that recognizes a pie chart in the image information developed in the buffer memory 3, and 5 is a character font ROM.
This is a character recognition unit that recognizes the character symbol pattern in the image information developed in the buffer memory 3 based on the information in e and the information obtained by the pie chart recognition unit 4.

この文字認識部５で認識された文字パターンは、文字フ
ォントＲＯＭ６から対応する文字パターンを読出し、表
示用のＲＡＭであるＶＲＡＭ７に書込む処理をする。ま
た、このＶＲＡＭ７に格納された各種パターンは、グラ
フィックコントローラＧＣ８によりＣＲＴ９に表示され
る。The character pattern recognized by the character recognition unit 5 is processed by reading the corresponding character pattern from the character font ROM 6 and writing it into the VRAM 7 which is a display RAM. Further, the various patterns stored in this VRAM 7 are displayed on the CRT 9 by the graphic controller GC8.

更に、１０は本装置全体を制御するＣＰｔＪであり、装
置全体を制御するための制御プログラムを格納するＲＯ
Ｍ１０ａと、ＣＰＵｌ０のワークエリアとして使用する
ＲＡＭ１０ｂとを備えている。Furthermore, 10 is a CPtJ that controls the entire device, and an RO that stores a control program for controlling the entire device.
The RAM 10b is used as a work area for the CPU 10.

以上の構成における本装置の基本的な処理手順を、第２
図に示すフローチャートに従って説明する。先ずステッ
プＳ１では、イメージスキャナ等の画像読み取り装置２
より原稿１を読み込み、バッファメモリ２に格納する０
次にステップＳ２では、細線化処理を行う。ステップＳ
３で文字パターンと図形パターンとを判別する。この判
別の基準は、例えば線分の長さが短いものは文字パター
ンの一部、長いものは図形パターンの一部と判断する。The basic processing procedure of this device with the above configuration is explained in the second section.
The explanation will be given according to the flowchart shown in the figure. First, in step S1, an image reading device 2 such as an image scanner is used.
Read original 1 from 0 and store it in buffer memory 2.
Next, in step S2, thinning processing is performed. Step S
In step 3, character patterns and graphic patterns are discriminated. The criteria for this determination is, for example, if a line segment is short, it is determined to be part of a character pattern, and if it is long, it is determined to be part of a graphic pattern.

文字パターンの場合は、ステップＳ９に進んで、文字切
り出しステップＳＩＯで文字を認識してステップＳｌｌ
で表示する。If it is a character pattern, proceed to step S9, recognize the character in character extraction step SIO, and proceed to step Sll.
Display in .

一方、図形パターンの一部と判断された場合はステｔツ
ブＳ４に進んで、ステップＳ４では点列に分けて該点列
毎に直線近似を行って直線化した結果のベクトルを得る
。第３図は画像情報をベクトル情報にする過程を示した
ものである。次に、ステップＳ５では、ステップＳ４で
得られたベクトルをもとに円の中心点を算出し、この中
心点を基に半径を求める。On the other hand, if it is determined that it is part of a graphic pattern, the process proceeds to step S4, where the point sequence is divided into point sequences, linear approximation is performed for each point sequence, and a linearized vector is obtained. FIG. 3 shows the process of converting image information into vector information. Next, in step S5, the center point of the circle is calculated based on the vector obtained in step S4, and the radius is determined based on this center point.

円の中心点を算出するアルゴリズムについて第４図で詳
しく説明する。今、ベクトルがＶＯ。The algorithm for calculating the center point of the circle will be explained in detail with reference to FIG. Vector is now VO.

■、・・・ｖｎ個存在する場合には、ベクトルの垂直二
等分線は、氾３．λ２．氾、・・・β、本できる。■,... If there are vn pieces, the perpendicular bisector of the vector is 3. λ2. Flood...β, I can write a book.

℃１とβ、の交点、β、とβ３の交点・・・、ρ、と４
１の交点、β２とβ８、β、とβ４・・・、β２とβｎ
、ｆ１３と１２４、Ａｓとｆ２ｓ・・・、ρ、と１２．
　＊ａ＊１１ｎ−１と４７の交点を算出していくと、全
部で（ｎ−１）１個の交点が算出される０次に、該交点
の座標値（ｘ、ｙ）の分布のヒストグラムを作成すると
、第５図に示すように一部（傾斜部分）に交点が集中す
る。従って、この領域の中間値（Ｘｗａｍｘ　＊　Ｖｗ
ａ□）を円グラフの円の中心座標と決定する。Intersection of ℃1 and β, intersection of β and β3..., ρ, and 4
1 intersection, β2 and β8, β and β4..., β2 and βn
, f13 and 124, As and f2s..., ρ, and 12.
When the intersections of *a*11n-1 and 47 are calculated, a total of (n-1) 1 intersections are calculated. Next, the histogram of the distribution of the coordinate values (x, y) of the intersections. When , the intersection points are concentrated in a part (slanted part) as shown in FIG. Therefore, the intermediate value (Xwamx * Vw
Determine a□) as the center coordinates of the circle of the pie chart.

次に、このようにして求められた中心座標（ｘｓ□＋　
ｙｍ□）から、さらにベクトル■ｌ。Next, the center coordinates (xs□+
From ym□), further vector ■l.

Ｖｓ、Ｖｓ・・・ｖｎにそれぞれ中線を引くと、全部で
ｎ本の中線が出来る。この中線とベクトル間の角度をα
１．α２・・・α。とすると、・ａｎ＞０７（θ７はス
レッショルド）ならばＶｎは円の一部を形成する円ベク
トル、又α７≦θ丁ならばｖ７は直線を形成する直線ベ
クトルと認識する。If we draw a median line on each of Vs, Vs...vn, a total of n median lines will be created. The angle between this median line and the vector is α
1. α2...α. Then, if an>07 (θ7 is a threshold), Vn is recognized as a circular vector forming a part of a circle, and if α7≦θd, v7 is recognized as a straight line vector forming a straight line.

第６図では、α。〉θアなのでｖＩは円ベクトル、そし
てα７〉０丁なのでＶ、は直線エレメントとなる。この
ようにして、すべてのベクトルを円工・レメントと直線
エレメントに分類し、円エレメントと中心点を結ぶ直線
の長さをヒストグラムにとりピーク値を半径と定める。In Figure 6, α. Since 〉θa, vI is a circular vector, and since α7〉0, V is a straight line element. In this way, all vectors are classified into circular elements, element elements, and straight line elements, and the length of the straight line connecting the circular element and the center point is made into a histogram, and the peak value is determined as the radius.

第７図（ａ）は、この半径の抽出アルゴリズムを図化し
たもので、中心点（Ｘｍａｘ　、Ｖ□８）と円エレメン
トγ１とすると、γ１．γ２．γ３・・・γ。とデータ
ができる。このデータをヒストグラムにとると第７図（
ｂ）に示すようなヒストグラムが得られ、このピーク値
が円の半径の値となる。FIG. 7(a) is a diagram of this radius extraction algorithm, where the center point (Xmax, V□8) and the circular element γ1 are γ1. γ2. γ3...γ. and data can be created. When this data is plotted as a histogram, Figure 7 (
A histogram as shown in b) is obtained, and this peak value becomes the value of the radius of the circle.

このようにして中心座標と半径が求められ、円を認識す
ることができる。In this way, the center coordinates and radius are determined, and a circle can be recognized.

また、円の半径が求まると、円エレメントの中でも中心
点（ｘｌ、８．ｙ、、、）からの距離が半径の長さと異
なるエレメントは、新たに直線エレメントとみなされる
。このようにして、抽出した直線エレメントの中から、
次に仕切り線と引き出し線とを抽出していく。Furthermore, when the radius of the circle is determined, among the circular elements, elements whose distance from the center point (xl, 8.y, . . . ) is different from the length of the radius are newly regarded as linear elements. In this way, from the extracted straight line elements,
Next, partition lines and leader lines are extracted.

ステップＳ６では、まず仕切り線の抽出を行う。両端の
点座標が、円の内側にある直線エレメントを仕切り線と
認識した後、ステップＳ７では、のこりの直線エレメン
トの中から、少なくとも１端が円の外側にある直線エレ
メントを、引き出し線と認識している。第８図は、すべ
てのエレメントを、円エレメント、仕切り線、引き出し
線に分類した図である。このように円エレメント。In step S6, partition lines are first extracted. After recognizing a straight line element whose point coordinates at both ends are inside the circle as a partition line, in step S7, from among the remaining straight line elements, a straight line element whose at least one end is outside the circle is recognized as a leader line. are doing. FIG. 8 is a diagram in which all elements are classified into circle elements, partition lines, and leader lines. A circle element like this.

仕切り線、引き出し線を抽出した後は、ステップＳ８で
、引き出し線の円の外側の端の位置をもとに、比率を表
す文字等の文字領域を抽出する。After extracting the partition line and leader line, in step S8, a character area such as a character representing a ratio is extracted based on the position of the outer end of the circle of the leader line.

第９図で示すように、引き出し線の先端Ｐ１にもっと近
い文字ベクトルＵ＋を抽出する０次に、このＵｌの先端
から半径７０以内に端がある文字ベクトルＵ２を捜し、
同様の処理を行っていくと、＝Ｕｔ　、Ｕｓ　、Ｕｓ・
・・Ｕ、が求まり、引き出し線で導かれる文字等を抽出
し、これを比率（パーセント）を表す文字列であると認
識することができる。その後、ステップＳ９では文字の
切り出しを行い、ステップＳＩＯで文字の認識を行う、
ステップＳｌｌで認識結果を表示する。As shown in FIG. 9, a character vector U+ that is closer to the tip P1 of the leader line is extracted. Next, a character vector U2 whose end is within a radius of 70 from the tip of this Ul is searched,
When similar processing is performed, =Ut, Us, Us・
. . U is determined, characters etc. guided by the lead line are extracted, and this can be recognized as a character string representing a ratio (percentage). After that, in step S9, the characters are cut out, and in step SIO, the characters are recognized.
The recognition result is displayed in step Sll.

［第２実施例］本実施例では、第１０図に示すように引出し線の無い円
グラフの場合の認識手順を示す。この場合、前述した実
施例では比率を表す文字等を抽出できない。そこでステ
ップＳ８で更に次のような処理を行う。[Second Example] In this example, a recognition procedure for a pie chart without a leader line as shown in FIG. 10 will be described. In this case, in the above-described embodiments, it is not possible to extract characters representing ratios. Therefore, the following processing is further performed in step S8.

ステップ３１〜Ｓ６の処理で中心座標と半径、さらに仕
切り線が認識されると、仕切り線と円弧とで囲まれた扇
形の領域が抽出されるので、ステップＳ８では、まずこ
の扇形領域を抽出する。第１１図（ａ）に示すように、
直線、ｇｌ＋１２２、ρ、のＸ軸との角度θ３．θ２．
θ３の大きさを調べると、β、〈βｇ　＜　１２　ｓと
なる。When the center coordinates, radius, and partition line are recognized in the processing of steps 31 to S6, a fan-shaped area surrounded by the partition line and the circular arc is extracted, so in step S8, this fan-shaped area is first extracted. . As shown in FIG. 11(a),
The angle θ3 of the straight line, gl+122, ρ, with the X axis. θ2.
Examining the magnitude of θ3, we find that β, <βg < 12 s.

また、円グラフの半径をγとすると、円の方程式はｘ”
　＋ｙ”　＝γ２となる。全黒画素について、第１１図
（ｂ）の領域Ａ、Ｂ、Ｃのどの領域に含まれるかを調べ
ていくと、座標（ｘ、ｙ）がｘ２＋ｙ２＜γ２をみたし
、且つ（０，Ｏ）と（ｘ、ｙ）を結ぶ直線の角度θ□　
がθ１くθ８ｙくθ３をみたしている場合には領域Ａに
含まれるとし、またθ２くθつ、くθ３の場合には領域
Ｂ、０くθ、ｌｙ＜θ８、又θ３くθ８ｙの場合には領
域Ｃに含まれるとする。Also, if the radius of the pie chart is γ, then the equation of the circle is x”
+y" = γ2. When examining which area of the area A, B, or C in FIG. 11(b) the all-black pixel is included, we find that the coordinates (x, y) satisfy x2+y2<γ2. And the angle θ□ of the straight line connecting (0, O) and (x, y)
If it satisfies θ1 * θ8y * θ3, it is considered to be included in region A, and if θ2 * θ * θ3, it is included in region B, and if 0 * θ, ly < θ8, and θ3 * θ8y. is included in area C.

次に、Ａ、Ｂ、Ｃの各領域に含まれる黒画素の座標値の
Ｘ座標、Ｘ座標の最大値、最小値を求めると、第１１図
（Ｃ）のように文字領域が抽出される・。このように、
円グラフ内の文字領域が抽出されると、文字列がない扇
形領域では文字領域が存在しないことから第１２図で示
すように、文字列が存在する扇形の領域と空白の扇形領
域を得る。Next, by finding the X coordinate, the maximum value, and the minimum value of the X coordinate of the black pixel coordinate values included in each area of A, B, and C, the character area is extracted as shown in Figure 11 (C).・. in this way,
When the character area in the pie chart is extracted, since there is no character area in the fan-shaped area where there is no character string, a fan-shaped area where the character string exists and a blank fan-shaped area are obtained as shown in FIG.

次に、この空白領域の延長領域内に文字が存在するか否
かを調べ、円の外側にある文字等の抽出を行う、第１３
図は、該文字等の抽出方法を示した図である。斜線部分
は、円グラフの空白の扇形領域Ａを、半径局の長さだけ
円の外側へ拡張した領域である。この斜線領域中に文字
ベクトルが少なくとも１個以上存在する場合にのみ、近
傍の文字ベクトルの集合（文字領域）を抽出していく、
抽出された文字列は比率を示す文字列と認識される。Next, it is checked whether or not characters exist within the extension area of this blank area, and characters, etc. located outside the circle are extracted.
The figure shows a method for extracting the characters, etc. The shaded area is an area obtained by extending the blank fan-shaped area A of the pie chart to the outside of the circle by the length of the radius station. Only when at least one character vector exists in this diagonal area, a set of nearby character vectors (character area) is extracted.
The extracted character string is recognized as a character string indicating a ratio.

以下、前述実施例と同様にステップＳ９で文字を切り出
し、ステップＳ１０で文字の認識を行い、ステップＳｌ
ｌで認識結果を表示する。Thereafter, as in the above embodiment, characters are cut out in step S9, characters are recognized in step S10, and characters are recognized in step S1.
Display the recognition result with l.

以上説明したように、円グラフ等の図形を認識すること
によって、文章と図形とからなる原稿をスキャナから入
力し、編集することが可能になり、原稿からの入力情報
量が増大した。As explained above, by recognizing figures such as pie charts, it has become possible to input and edit manuscripts consisting of text and figures using a scanner, increasing the amount of information input from manuscripts.

しかも、円グラフの円の外にある比率（パーセント）を
表す文字等の図形中の文字や記号の認識が可能になった
ため、複雑なパターンも認識することができるようにな
った。Moreover, it has become possible to recognize characters and symbols inside figures, such as characters representing percentages outside the circle of a pie chart, making it possible to recognize complex patterns.

また、認識速度から見ても、円グラフ及び比率と表す文
字等を簡単な処理方法で認識表示することができ、この
ため、処理時間が短く済み、よって高速の文字図形認識
が可能となった。In addition, in terms of recognition speed, it is possible to recognize and display pie charts, characters representing ratios, etc. using a simple processing method, which reduces processing time and enables high-speed character/figure recognition. .

又、円グラフの引き出し線及び円の外にある比率を表す
文字等の認識が可能になったため、複雑なパターンの円
グラフも認識することができるようになった。Additionally, it has become possible to recognize leader lines of pie charts and characters representing ratios outside the circle, making it possible to recognize pie charts with complex patterns.

更に、円グラフの一部である比率を表す文字等の認識が
可能になったため、複雑なパターンが円グラフも認識す
ることがで・きるようになった。Furthermore, it has become possible to recognize characters that represent ratios that are part of a pie chart, making it possible to recognize complex patterns in pie charts.

尚、本実施例では、円グラフと比率を示す文字列の関連
に限定したが、認識した図形の特徴に応じて文字列を抽
出することにより、一般に図形と該図面に関連する文字
及び記号を認識できる０本発明の画像の認識により、文
字と図形に関連する文字列とは明確に区別されて認識さ
れるため、画像の記憶時の圧縮あるいは編集等が可能と
なる。In this example, the relationship is limited to the relationship between a pie chart and a character string indicating a ratio, but by extracting a character string according to the characteristics of the recognized shape, it is generally possible to associate a shape with characters and symbols related to the drawing. By the image recognition of the present invention, characters and character strings related to graphics can be clearly distinguished and recognized, so that it becomes possible to compress or edit images when they are stored.

［発明の効果］本発明により、原稿上の文書と図形とを正確に認識する
画像認識装置を提供できる。また、文書の文字と図形中
の文字とを識別することにより、文字と図形とからなる
原稿の編集も可能になった。[Effects of the Invention] According to the present invention, it is possible to provide an image recognition device that accurately recognizes documents and figures on a manuscript. Furthermore, by distinguishing between characters in a document and characters in graphics, it has become possible to edit manuscripts consisting of characters and graphics.

また、図形が円グラフの場合に、引き出し線を抽出して
認識することにより、円グラフの付近にある文字列を認
識する画像認識装置を提供できる。Furthermore, when the figure is a pie chart, by extracting and recognizing leader lines, it is possible to provide an image recognition device that recognizes character strings near the pie chart.

更に、図形が円グラフの場合に、仕切り線と円弧で囲ま
れた扇形領域を抽出し、その延長領域に文字領域全体又
は１部が含まれることを調べることにより、円グラフの
外側付近にある文字列を認識する画像認識装置を提供で
きる。Furthermore, when the figure is a pie chart, by extracting the fan-shaped area surrounded by the partition line and arc and checking whether the entire character area or a part of the character area is included in the extended area, An image recognition device that recognizes character strings can be provided.

[Brief explanation of the drawing]

第１図は本実施例の画像認識装置のブロック図、第２図は本実施例の画像認識装置の処理手順のフロ、−
チャート、第３図、第１０図は画像情報の変換を示す図、第４図は
円の中心の求め方を示す図、第５図は中心点の分布図を示す図、第６図は円エレメントと直線エレメントの求め方を示す
図、第７図（ａ）は円の半径を示す図、第７図（ｂ）は円の半径のヒストグラムを示す図、第８図はエレメントの分類図を示す図、第９図は比率を
表す文字等の抽出を示す図、第１１図（ａ）〜（ｃ）は
扇形領域の抽出方法を示す図、第１２図は扇形空白領域を示す図、第１３図は円外の文字列の抽出法を示す図である。図中、１・・・原稿、２・・・画像読み取り装置、３・
・・バッファメモリ、４・・・円グラフ認識部、５・・
・文字認識部、６・・・文字フォントＲＯＭ、７・・・
ＶＲＡＭ、８−Ｇ　−Ｃ，９−ＣＲＴ、１０−・・ＣＰ
Ｕ、１０ａ・・・ＲＯＭ、１０ｂ・・・ＲＡＭである。FIG. 1 is a block diagram of the image recognition device of this embodiment, and FIG. 2 is a flowchart of the processing procedure of the image recognition device of this embodiment.
Charts, Figures 3 and 10 are diagrams showing the conversion of image information, Figure 4 is a diagram showing how to find the center of a circle, Figure 5 is a diagram showing the distribution of center points, Figure 6 is a diagram showing a circle Figure 7(a) is a diagram showing the radius of a circle. Figure 7(b) is a diagram showing a histogram of the radius of a circle. Figure 8 is a diagram showing the classification of elements. Figure 9 is a diagram showing the extraction of characters representing ratios, Figures 11 (a) to (c) are diagrams showing the method for extracting a fan-shaped area, Figure 12 is a diagram showing a fan-shaped blank area, FIG. 13 is a diagram showing a method for extracting character strings outside the circle. In the figure, 1... original, 2... image reading device, 3...
... Buffer memory, 4... Pie chart recognition unit, 5...
・Character recognition unit, 6...Character font ROM, 7...
VRAM, 8-G-C, 9-CRT, 10-...CP
U, 10a...ROM, 10b...RAM.

Claims

[Claims] (1) An input means for inputting image information containing a mixture of text and figures, and a figure that extracts a figure from the image information inputted from the input means and recognizes the characteristics of the figure. Image recognition comprising: a recognition means; and a character string recognition means for recognizing the characters and symbols by extracting characters and symbols associated with the figure based on the recognition results of the figure recognition means. Device. (2) The figure recognition means includes a circle extraction means for extracting a circle, and a partition line recognition means for extracting a straight line connecting the center and the circumference of the circle and using it as a partition line, and the figure recognition means is provided with the input image. 2. The image recognition apparatus according to claim 1, wherein the image recognition apparatus extracts and recognizes a pie chart from the information. (3) The character string recognition means includes a leader line recognition means for extracting a straight line having at least one end outside the circle from among straight lines other than the partition line and using the extracted line as a leader line; 3. The image recognition apparatus according to claim 2, wherein the related characters and symbols are character strings indicating the ratio of the pie chart. (4) The character string recognition means includes a character string extraction means for extracting characters and symbols within a fan-shaped extension area surrounded by the partition line and the circular arc, and converts the extracted characters and symbols into the ratio of the pie chart. 3. The image recognition device according to claim 2, wherein the image recognition device is a character string indicating .