JPS63158669A

JPS63158669A - Drawing recognition device

Info

Publication number: JPS63158669A
Application number: JP61306942A
Authority: JP
Inventors: Masaaki Itasaka; 板坂　政昭; Rieko Kishi; 岸　利江子; Takayuki Kageyama; 影山　尊之
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1986-12-23
Filing date: 1986-12-23
Publication date: 1988-07-01

Abstract

PURPOSE:To efficiently extract a character or a graphic with a high by extracting the quantity of feature constituting the character or the graphic from the picture data of a drawing and sorting and shaping the quantity of the feature according to the format of an area to which the quantity of the feature belongs. CONSTITUTION:The drawing to be read is divided into the plural formats and the specific graphic is enters the respective formats. The quantity of feature extraction part 1 extracts the quantity of the feature constituting the character or the graphic from the picture data of the drawing. Specific format information promised in the respective formats and the position information of the area are stored in a format information memory 4. An area extraction part 2 corrects a dislocation or an inclination at the time of reading the drawing with respect to the quantity of the feature extracted from the drawing to extract the format area to which the respective quantities of the feature belong. Then, the quantity of the feature is sorted and shaped as the specific graphic corresponding to the format according to the output of the memory 4 by a character and graphic processing part 3. In such a way, the character or the graphic can be efficiently extracted with the high recognition ratio.

Description

【発明の詳細な説明】産業上の利用分野本発明は、図面等をスキャナ等で読み込み、コンピュー
タシステムで処理することにより、文字や図形を認識す
る装置に関するもので、フォーマット化された図面中の
文字や図形を、文字や図形が槙するフォーマット領域の
７オーマノトに応じた認識処理を行う図面認識装置に関
するものである。[Detailed Description of the Invention] Industrial Application Field The present invention relates to a device that recognizes characters and figures by reading drawings etc. with a scanner etc. and processing them with a computer system. The present invention relates to a drawing recognition device that performs recognition processing for characters and figures according to a format area in which the characters and figures overlap.

従来の技術従来、文字と図形が混在する図面から文字や図形を認識
する手法としては、例えば回路図面の認識に関するもの
がある。2. Description of the Related Art Conventionally, as a method for recognizing characters and figures from drawings in which characters and figures coexist, there is a method relating to recognition of circuit drawings, for example.

以下図面を参照しながら、上述した従来の図面認識装置
の一例について説明する。第５図は従来の図面認識装置
のブロック図を示すものである。An example of the above-mentioned conventional drawing recognition device will be described below with reference to the drawings. FIG. 5 shows a block diagram of a conventional drawing recognition device.

第５図において６１は画像データから特徴量を抽出する
特徴量抽出部、６２は前記特徴量から文字領域等を切り
出す特徴量分類部、５３は前記特徴量に対し、切れやか
すれ等に対する整形化を施す整形化部、５４は分類、整
形化された前記特徴量から文字や図形を認識する文字１
図形認識部である。以上のように構成された図面認識装
置について、以下その処理フローを説明する。特徴量抽
出部６１において、画像データから水平線、垂直線等の
線分領域および前記線分領域以外の領域等を、文字や図
形を構成する特徴量として抽出する。次に特徴量分類部
５２において、前記特徴量から文字領域等を抽出する。In FIG. 5, 61 is a feature extraction unit that extracts feature quantities from image data, 62 is a feature classification unit that extracts character areas, etc. from the feature quantities, and 53 is a formatting unit for the feature quantities to prevent cuts, blurring, etc. 54 is a character 1 that recognizes characters and figures from the classified and formatted feature amounts;
This is a figure recognition section. The processing flow of the drawing recognition apparatus configured as described above will be described below. A feature extraction unit 61 extracts line segment areas such as horizontal lines and vertical lines, areas other than the line segment areas, etc. from the image data as feature quantities constituting characters and figures. Next, the feature amount classification section 52 extracts character areas and the like from the feature amounts.

図面をスキャナ等で入力する際に線切れやかすれが生じ
る場合があり、このような線切れやかすれに対処する為
、整形化部５３において各特徴量間の距離や位置関係等
を参照して、前記線切れ等で分断された水平線、垂直線
等をつなぐ整形化を行う。整形化された前記特徴量と文
字領域の画像データをもとに、文字０図形認識部５４に
おいて文字や図形の認識を行う（例えば情報処理学会第
３２目金国大会予稿集４Ｎ−１ｐ、ｐ、１３ｓｅｉ）。When inputting a drawing using a scanner or the like, line breakage or blurring may occur. , performs shaping to connect the horizontal lines, vertical lines, etc. that have been separated by the line break or the like. Based on the formatted features and the image data of the character area, the character and figure recognition unit 54 recognizes characters and figures (for example, Information Processing Society of Japan 32nd Gold Country Conference Proceedings 4N-1p, p. , 13sei).

また文字と図形が混在するフォーマット化された図面中
から、図面のフォーマット情報を利用して文字や図形を
分類し、認識する手法は従来にはみもれないものである
。Furthermore, a method for classifying and recognizing characters and figures from a formatted drawing containing a mixture of characters and figures using the format information of the drawing is unprecedented.

発明が解決しようとする問題点タイミングチャート等のフォーマット化された図面にお
いては、例えば文字のみが書かれる領域、特定の文字と
記号が混在して描かれる領域等、領域毎にフォーマット
が定められている。このような図面に対し、前記のよう
な構成では、フォーマット情報が利用できないため、第
６図に示すような、線切れに対する整形化の際に領域を
越えて誤った整形化が行われたり、あるいは本来文字の
みが書かれる領域で、図形をも対象とした処理を行って
処理の効率が低下したり、前記領域で誤って図形を抽出
して認識率が低下する問題が生じる。Problems to be Solved by the Invention In formatted drawings such as timing charts, formats are determined for each area, such as areas where only characters are written, areas where specific characters and symbols are mixed, etc. There is. For such a drawing, format information cannot be used with the above configuration, so when formatting a line break as shown in FIG. 6, incorrect formatting may be performed beyond the area, or Alternatively, processing that also targets graphics may be performed in an area where only characters are normally written, resulting in a decrease in processing efficiency, or a problem may arise in which graphics are erroneously extracted in the area, resulting in a decrease in recognition rate.

本発明は上記問題点に鑑み、フォーマット化された図面
において、図面のフォーマット情報を利用し、各フォー
マット領域に適した処理を施すことにより、図面中の文
字や図形を効率良く、しがも高い認識率で抽出する図面
認識装置を提供するものである。In view of the above-mentioned problems, the present invention utilizes the format information of the drawing in a formatted drawing and performs processing suitable for each format area, thereby efficiently and efficiently editing characters and figures in the drawing. The present invention provides a drawing recognition device that performs extraction based on recognition rate.

問題点を解決するための手段前記問題点を解決するために本発明の図面認識装置は、
入力図面の画像データから水平線、垂直線等の特徴量を
抽出する特徴量抽出部と、前記各特徴量に対し、図面入
力時の傾きや位置ずれに対する補正を施した後、前記特
徴量が図面中のどの７オーマノト領域に記入されている
かを正確に抽出する領域抽出部と、各フォーマント領域
内で、当該フォーマットに応じた特徴量の分類、整形化
および文字３図形認識処理を施す図形処理部と、各フォ
ーマット領域のフォーマット情報等を格納するフォーマ
ット情報メモリを備えている。Means for Solving the Problems In order to solve the above problems, the drawing recognition device of the present invention includes:
A feature extraction unit extracts features such as horizontal lines and vertical lines from image data of an input drawing, and after correcting each feature for tilt and positional deviation at the time of drawing input, the feature is extracted from the image data of the drawing. A region extraction unit that accurately extracts which 7-ormanoto region is written in, and a graphic processing that performs feature amount classification, shaping, and character 3 shape recognition processing according to the format in each formant region. and a format information memory for storing format information, etc. of each format area.

作用本発明は前記した構成によって、フォーマット化された
図面から特徴量を抽出し、前記各特徴量が属するフォー
マット領域を正確に抽出することにより、各フォーマス
ト領域内で、当該フォーマットに適した特徴量の分類、
整形化処理を行い、文字および図形の認識を効率良く、
高い認識率で行うものである。Effect of the present invention With the above-described configuration, the feature values are extracted from a formatted drawing, and by accurately extracting the format area to which each of the feature values belongs, features suitable for the format are extracted within each format area. Classification of quantity,
Performs formatting processing to efficiently recognize characters and shapes.
This is done with a high recognition rate.

実施例以下本発明の一実施例のフォーマット化された図面に対
する図面認識装置について、図面を参照しながら説明す
る。Embodiment Hereinafter, a drawing recognition device for formatted drawings according to an embodiment of the present invention will be described with reference to the drawings.

第１図は本発明の一実施例における図面認識装置の、ブ
ロック図を示すものである。第１図において、１は図面
の画像データを入力とし、図面中の特徴量を抽出する特
徴量抽出部、２はｎｆｆ記特微量に対し、図面入力時の
ずれや傾きを補正し、６ｆ７記各特微量が図面中のどの
フォーマット領域に属するかを抽出する領域抽出部、３
は前記特徴量をフォーマットに応じて分類、整形化し、
文字１図形を認識する文字３図形処理部、４は図面中の
各７オーマノト領域の位置情報やフォーマント情報を格
納するフォーマット情報メモリである。第２図は本発明
の一実施例における、フォーマット化された図面の例を
示すものである。第２図において太い実線の枠は、入力
図面のずれや傾き検出用のマーカーであり、破線の枠は
文字や図形を記入する各フォーマット領域の境界（７オ
ーマノト枠）を示している。また図中ａ　−ｇは各領域
のフォーマットの種別の例を示している。FIG. 1 shows a block diagram of a drawing recognition device according to an embodiment of the present invention. In Fig. 1, 1 is a feature extraction unit that inputs image data of a drawing and extracts feature quantities in the drawing, and 2 is a feature quantity extraction unit that corrects deviations and inclinations at the time of drawing input for the nff feature quantity, and 6f7 an area extraction unit that extracts which format area in the drawing each feature belongs to; 3;
classifies and formats the feature amounts according to the format,
A character 3 graphic processing section recognizes character 1 graphics, and 4 is a format information memory that stores position information and formant information of each 7-omanoto area in the drawing. FIG. 2 shows an example of a formatted drawing in an embodiment of the present invention. In FIG. 2, the thick solid-line frames are markers for detecting shifts and inclinations of the input drawing, and the broken-line frames indicate the boundaries of each format area (7-ohmanoto frame) in which characters and figures are written. Further, symbols a to g in the figure indicate examples of format types of each area.

以下、第１図から第４図を用いてその動作を説明する。The operation will be explained below using FIGS. 1 to 4.

まず図面の画像データをもとに特徴量抽出部１において
、水平線、垂直線、線分以外の領域等の特徴量を抽出す
る（例えば情報処理学会箱３２目金国大会予稿集４Ｎ−
１ｐ　ｐ　１３８５　）。First, based on the image data of the drawing, the feature amount extraction unit 1 extracts feature amounts such as horizontal lines, vertical lines, areas other than line segments (for example, Information Processing Society of Japan Box 32 Gold National Conference Proceedings 4N-
1p p 1385).

次に前記各特徴量に対し領域抽出部２において第３図に
示す処理を施し、各特徴量が属する７オーマスト領域を
抽出する。Next, each of the feature amounts is subjected to the processing shown in FIG. 3 in the region extracting section 2, and seven ohmast regions to which each feature amount belongs are extracted.

まずステップ３１において、前記特徴量から第２図の太
い実線で示すマーカーの候補となる水平線および垂直線
を抽出し、ステップ３２においてマーカーが抽出できた
ことが確認さｎたならば、ステップ３３において、入力
図面の前記マーカーの位置情報と、フォーマット情報メ
モリ４に登録されているマーカーの位置情報を比較し、
図面入力時のずれや傾き、伸縮を検出する。図面入力時
のずれや傾き、伸ｍは１例えばアフィン変換の係数を求
めることにより得られる。ステップ３４において、［）
ｒＩ紀ずれや傾き、伸縮が許容値以内であることが確認
さ扛れば、ステップ３６において各特徴量に対して座標
変換を施し、位置情報を補正する。座標変換としては、
例えば各特徴量の中心座標に対し、ステップ３３で求め
たアフィン変換の逆変換を行う方法がある。なお前記座
標変換は各特徴量の中心位置以外の座標値を用いてもか
まわない。次にステップ３６において、位置補正された
前記各特徴量と、フォーマット情報メモリ４に格納され
ている各フォーマット領域の境界の位置を比較し、前記
各特徴量の属するフォーマット領域を抽出する。ステッ
プ３７は、ステップ３２あるいはステップ３４において
入力図面の情報が正しくなかった場合に、例えば処理を
中止させる等のりジェクト処理を行う。First, in step 31, horizontal lines and vertical lines that are candidates for the marker shown by the thick solid line in FIG. , compare the position information of the marker in the input drawing with the position information of the marker registered in the format information memory 4,
Detects deviations, tilts, and expansion/contraction when inputting drawings. The deviation, inclination, and expansion m at the time of inputting the drawing can be obtained by calculating the coefficients of affine transformation, for example. In step 34, [)
If it is confirmed that the rI offset, inclination, and expansion/contraction are within tolerance values, coordinate transformation is performed on each feature in step 36 to correct the position information. As a coordinate transformation,
For example, there is a method of performing an inverse transformation of the affine transformation obtained in step 33 on the central coordinates of each feature. Note that the coordinate transformation may use coordinate values other than the center position of each feature. Next, in step 36, the positions of the respective feature quantities whose positions have been corrected are compared with the positions of the boundaries of the respective format areas stored in the format information memory 4, and the format areas to which the respective feature quantities belong are extracted. In step 37, if the information on the input drawing is incorrect in step 32 or step 34, a transfer process is performed, such as stopping the process.

次に文字６図形処理部３において、第４図に示す処理を
行い、図面中の文字や図形を認識する。Next, the character 6 figure processing section 3 performs the processing shown in FIG. 4 to recognize the characters and figures in the drawing.

捷ずステ、プ４１においてフォーマット情報メモリ４を
参照することにより各領域内の特徴量を、フォーマット
毎に約束さｎた特定フォーマット情報に応じて効率良く
分類する。例えば、文字のみが書かれる領域であるなら
ば、前記領域内の特徴量は全て文字候補として分類し、
文字１図形が混在して書かれる領域に対しては、前記領
域内の特徴量を、その形状や接続関係を調べることによ
り、文字候補と、図形候補等に分類する。ステップ４２
では、各領域内の文字以外の特徴量に対し、線切れやか
すれ等の整形化を行う。ステ、プ４２において、各領域
内の特徴量に注目して、当該領域のフォーマットに応じ
た整形化を行うことにより、第６図に示すような誤った
整形化が回避できる。By referring to the format information memory 4 in the unsorting step 41, the feature amounts in each area are efficiently classified according to the specific format information specified for each format. For example, if it is an area where only characters are written, all the features in the area are classified as character candidates,
For an area where a mixture of characters and figures is written, the feature amounts in the area are classified into character candidates, figure candidates, etc. by examining their shapes and connection relationships. Step 42
Then, the features other than characters in each region are modified to remove line breakage, blurring, etc. In step 42, erroneous shaping as shown in FIG. 6 can be avoided by paying attention to the feature amounts in each area and performing shaping according to the format of the area.

すなわち本装置においては、第６図の例の場合、各々同
一フォーマット枠内にあるｈｌ　とマ１の組あるいはｈ
２と７２の組が整形化の対象となり、ｈ、　　とｖ２の
ｍが整形化の対象となることはない。In other words, in the case of the example shown in FIG.
The set of 2 and 72 is the target of formatting, and m of h, and v2 is not the target of formatting.

ステップ４３では、ステップ４１において文字とみなさ
れた特徴量の画像データを用いて文字認識を行う（例え
ば、情報処理学会箱３２目金国大会予稿集４Ｎ−４ｐ、
ｐ、１３９１　）。ステップ４４では、前記文字認識結
果と、フォーマット情報メモリ４を参照して文字列の生
成を行う。In step 43, character recognition is performed using the image data of the feature amounts that were deemed to be characters in step 41 (for example, Information Processing Society of Japan, Box 32, Gold Country Conference Proceedings 4N-4p,
p. 1391). In step 44, a character string is generated by referring to the character recognition result and the format information memory 4.

発明の効果以上のように本発明は、文字や図形が描かれているフォ
ーマット化された図面の画像データから特徴量を抽出し
、ボＩ微量微量が属する領域のフォ　　゛−マットに応
じた特徴量の分類・整形化処理を行うものであるので、
短時間の認識処理であっても文字や図形の誤抽出を防ぎ
、図面の認識率を向上させることができるとともに、前
記特徴量が属する領域を抽出する際、前記特徴量の座標
値を用いて図面入力時のずれや傾きの位置補正を行う為
、文字や図形が属するフォーマット領域を効率的に、し
かも正確に抽出することができる。Effects of the Invention As described above, the present invention extracts feature amounts from image data of formatted drawings in which characters and figures are drawn, and extracts features according to the format of the region to which a small amount of text and figures belong. Since it performs quantity classification and formatting processing,
Even in a short recognition process, it is possible to prevent erroneous extraction of characters and figures and improve the recognition rate of drawings. Since positional corrections are made for shifts and inclinations when inputting drawings, the format area to which characters and figures belong can be efficiently and accurately extracted.

[Brief explanation of the drawing]

第１図は本発明の一実施例における図面認識装置の全体
構成を示すブロック図、第２図は本発明の対象とするフ
ォ−マットの説明図、第３図は同図面認識装置の領域抽
出部のフローチャート、第４図は同装置の文字１図形処
理部の処理のフローである。１・・・・・・特微量抽出部、２・・・・・・領域抽出
部、３・・・・・・文字・図形処理部、４・・・・・・
フォーマット情報メモリ。代理人の氏名　弁理士　中　尾　敏　男　ほか１名菓　
１　図第２図 α〜？　　フォーマット４１戚第３図第　４　図Fig. 1 is a block diagram showing the overall configuration of a drawing recognition device in an embodiment of the present invention, Fig. 2 is an explanatory diagram of the format targeted by the present invention, and Fig. 3 is an area extraction diagram of the drawing recognition device. FIG. 4 is a flowchart of the processing of the character 1 graphic processing section of the same apparatus. 1...Feature quantity extraction unit, 2...Region extraction unit, 3...Character/figure processing unit, 4......
Format information memory. Name of agent: Patent attorney Toshio Nakao and one other name
1 Figure 2 α~? Format 41 Figure 3 Figure 4

Claims

[Claims]

(1) A drawing to be read is divided into multiple formats, and a specific figure is entered in each format, and a feature quantity extraction unit extracts the feature quantities constituting characters and figures from the image data of the drawing. a format information memory that stores specific format information and area position information promised in each of the formats, an area extractor that detects in which format area in the drawing the feature amount is written, and the format It is characterized by comprising means for classifying and formatting the feature amount as a specific figure corresponding to the format based on the output of the information memory, and a character and figure processing unit that recognizes the output figure of the classification and formatting means. drawing recognition device.

(2) The area extracting unit corrects the deviation and inclination of the feature quantities extracted from the drawing when reading the drawing, and extracts the format area to which each feature quantity belongs. The drawing recognition device according to item 1.