JPH0528305A

JPH0528305A - Device and method for image recognition

Info

Publication number: JPH0528305A
Application number: JP3184242A
Authority: JP
Inventors: Junichi Koizumi; 潤一小泉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-07-24
Filing date: 1991-07-24
Publication date: 1993-02-05

Abstract

PURPOSE:To improve recognition ratio by extracting character strings as candidate character strings, evaluating the recognition result of the constituting characters of respective candidate character strings and feeding the evaluation result back to determine whether or not the character string constitution is proper. CONSTITUTION:An input means 1 reading an image 5, and a character string extraction part 2 extracting plural character strings as the candidate character strings when respective character elements are integrated in accordance with environmental conditions and when the character elements can be integrated into plural character strings, are provided. A character recognition part 3 segmenting the constituting characters of the candidate character strings, preparing a normalized image of the constituting characters corresponding to their tilt, extracting the features of the normalized image, checking the features with the features of the standard pattern of a dictionary and recognizing the constituting characters, is provided. A character string evaluation part 4 outputting the candidate character strings in which the recognition inavailable ratio of the plural candidate character strings is minimized, as the recognition result, is also provided.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字列の方向とその文
字列を構成する各文字の底辺が基本的に平行である書式
に従った各種設計図や地図の如き文字と図形が混在する
原稿（以下画像と称する）を読み取る画像認識装置に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention mixes characters and figures such as various design drawings and maps according to a format in which the direction of a character string and the bases of the characters constituting the character string are basically parallel. The present invention relates to an image recognition device that reads a document (hereinafter referred to as an image).

【０００２】ＣＡＤ／ＣＡＭ、コンピュータ・マッピン
グの普及に伴い、過去に作成した設計図面、地図などの
画像をディジタル化する手段として、光学式読み取り装
置を用いた画像の自動入力がある。With the spread of CAD / CAM and computer mapping, there is automatic image input using an optical reader as a means for digitizing images such as design drawings and maps created in the past.

【０００３】これに伴い、画像を読み取り、読み取った
画像を正しく認識することが重要視されている。Along with this, it is important to read an image and correctly recognize the read image.

【０００４】[0004]

【従来の技術】図７は画像認識装置の文字列抽出部を示
す図である。図８は画像認識装置の文字認識部を示す図
である。2. Description of the Related Art FIG. 7 is a diagram showing a character string extraction unit of an image recognition apparatus. FIG. 8 is a diagram showing a character recognition unit of the image recognition device.

【０００５】図１１は従来例を示す図である。図１２、
図１３は従来例のフローチャートである。図１４はベク
ターのグループ化を示す図である。FIG. 11 is a diagram showing a conventional example. 12,
FIG. 13 is a flowchart of a conventional example. FIG. 14 is a diagram showing grouping of vectors.

【０００６】図１５は文字列の作成例を示す図である。
図１１において、スキャナ２０１より文字と図面が混在
した画像を読み込み、ラスター・データ化しフレームメ
モリ２０９に格納する（図１２のＳ１）。FIG. 15 is a diagram showing an example of creating a character string.
In FIG. 11, an image in which characters and drawings are mixed is read by the scanner 201, converted into raster data, and stored in the frame memory 209 (S1 in FIG. 12).

【０００７】その際のスキャナ２０１の制御、並びにフ
レームメモリ２０９へのラスター・データの格納はスキ
ャナ制御部２０２により行われる。フレームメモリ２０
９のラスター・データに対して、ラスター＝ベクター変
換部（Ｒ＝Ｖ変換部) ２０３がラスター・データを細線
化、折線近似等の手段により、ベクター・データを発生
させ、結果を図形情報テーブル２１１に格納する（図１
２のＳ２）。At this time, the scanner controller 202 controls the scanner 201 and stores the raster data in the frame memory 209. Frame memory 20
For the raster data of No. 9, the raster = vector conversion unit (R = V conversion unit) 203 generates vector data by means of thinning the raster data, approximation of broken lines, etc., and outputs the result to the graphic information table 211. Stored in (Fig. 1
2 S2).

【０００８】図形情報テーブル２１１の各ベクターに対
して、文字・図形分離部２０４で相互に連続なベクター
をグループ化し（図１２のＳ３）、文字は比較的小さな
領域を持ち、他の図形・文字から孤立しているという特
徴を使って、全グループの計測が終了するまで（図１２
のＳ４）各グループの大きさを調べ（図１２のＳ５）、
グループの大きさが文字サイズに適合すれば（図１２の
Ｓ６）文字要素として抽出し（図１２のＳ７）、また、
文字要素以外のグループを図形要素として抽出する。
（図１２のＳ８）。For each vector in the graphic information table 211, the character / graphic separation unit 204 groups mutually continuous vectors (S3 in FIG. 12), and the character has a relatively small area, and the other graphic / character Until the measurement of all groups is completed using the feature of being isolated from (Fig. 12
S4) Check the size of each group (S5 in FIG. 12),
If the size of the group matches the character size (S6 in FIG. 12), it is extracted as a character element (S7 in FIG. 12).
Extract groups other than character elements as graphic elements.
(S8 of FIG. 12).

【０００９】分離された文字要素は文字情報テーブル２
１２に格納される。従って、図１４に示すようにグルー
プ１０１及びグループ１０２は文字要素、グループ１０
３は図形要素となる。The separated character elements are the character information table 2
Stored in 12. Therefore, as shown in FIG. 14, the groups 101 and 102 are character elements, group 10
3 is a graphic element.

【００１０】図７に示すように文字情報テーブル２１２
に格納された文字要素に対して（図１３のＳ１０）、文
字列抽出部２０５の周辺探索部４０１で注目する文字要
素の周辺に他の文字要素が存在するかどうか探索する
（図１３のＳ１１）。As shown in FIG. 7, a character information table 212
With respect to the character element stored in (S10 of FIG. 13), the surrounding search unit 401 of the character string extraction unit 205 searches for other character elements around the focused character element (S11 of FIG. 13). ).

【００１１】探索した結果、文字要素が存在する場合は
（図１３のＳ１２）、注目する文字要素の重心と探索さ
れた任意の他の文字要素の重心の距離を距離計算部４０
４が計算し、任意の他の文字要素の傾きと任意の他の文
字要素に注目する文字要素を統合したときの傾きとの差
を角度計算部４０５が計算し、その得られた距離および
角度より統合評価部４０２において文字列として統合で
きる可能性を評価値として表し、距離および角度が小さ
ければ小さい程高い評価値が与えられる。If a character element exists as a result of the search (S12 in FIG. 13), the distance calculation unit 40 calculates the distance between the center of gravity of the character element of interest and the center of gravity of any other searched character element.
4 calculates, the angle calculation unit 405 calculates the difference between the inclination of any other character element and the inclination when the character element of interest to any other character element is integrated, and the obtained distance and angle are calculated. The possibility of being integrated as a character string in the integrated evaluation unit 402 is represented as an evaluation value, and the smaller the distance and the angle, the higher the evaluation value.

【００１２】但し、角度と距離の評価の重みは距離の重
みの方が高いようになっている。周辺探索部４０１によ
って複数の文字要素が探索された場合には（図１３のＳ
１３）、探索された任意の文字要素を探索された他の文
字要素に変え、前述の処理を同様に行う。However, the weight of the evaluation of the angle and the distance is higher than the weight of the distance. When a plurality of character elements are searched by the peripheral search unit 401 (S in FIG. 13).
13), the searched arbitrary character element is changed to another searched character element, and the above-mentioned processing is performed in the same manner.

【００１３】以上の処理を上記注目する文字要素につい
て探索された全ての文字要素に対して行い、それぞれの
評価値を求める。判定部４０３は、文字要素の統合が１
つしか存在しない場合には、その文字要素の統合を（図
１３のＳ１４）、また、文字要素の統合が複数存在する
場合には、求められた評価値から最も高い評価値をもつ
文字要素の統合を文字列として基本的に抽出する（図１
３のＳ１５）。The above processing is performed for all the character elements searched for the noted character element to obtain respective evaluation values. The determination unit 403 determines that the integration of character elements is 1
If there is only one, the character element is integrated (S14 in FIG. 13). If there are multiple character element integrations, the character element having the highest evaluation value from the obtained evaluation values is selected. The integration is basically extracted as a character string (Fig. 1
3 S15).

【００１４】但し、最も高い評価値であってもその評価
値が予め定められたレベル値より低い場合はその注目す
る文字要素は独立している、つまり、どこにも統合しな
いものとして抽出する。However, even if the evaluation value is the highest, if the evaluation value is lower than the predetermined level value, the character element of interest is extracted as independent, that is, it is not integrated anywhere.

【００１５】即ち、従来は評価値がレベル値以上で且つ
最大のものを抽出する。この文字列抽出の処理を文字情
報テーブル２１２に格納されている全ての文字要素に対
して行う（図１３のＳ９）。That is, conventionally, the evaluation value that is equal to or higher than the level value and is maximum is extracted. This character string extraction processing is performed on all the character elements stored in the character information table 212 (S9 in FIG. 13).

【００１６】例えば、図１５のように注目する文字要素
１０４に対して、統合可能範囲に文字列１０５と文字列
１０６というような複数の文字列が存在する場合は、文
字要素１０４の重心に対して両文字列の最近傍文字の重
心との距離Ｌ１、Ｌ２を距離計算部４０４が計測する。For example, when there are a plurality of character strings such as a character string 105 and a character string 106 in the integrable range for the character element 104 of interest as shown in FIG. The distance calculation unit 404 measures the distances L1 and L2 from the center of gravity of the nearest characters of both character strings.

【００１７】また、文字列１０５の傾きと文字列１０５
に文字要素１０４を統合したときの傾きとの差と、文字
列１０６の傾きと文字列１０６に文字要素１０４を統合
したときの傾きとの差を角度計算部４０５が計測する。Further, the inclination of the character string 105 and the character string 105
The angle calculation unit 405 measures the difference between the inclination when the character element 104 is integrated with the character string 104 and the difference between the inclination of the character string 106 and the inclination when the character element 104 is integrated with the character string 106.

【００１８】そして、統合評価部４０２において距離計
算部４０４および角度計算部４０５で計算された距離、
角度によりそれぞれ評価値を求め、判定部４０３でこの
２つの評価値により文字要素１０４を文字列１０５に統
合するか、文字列１０６に統合するか、または何れにも
統合しないかを決定する。Then, in the integrated evaluation unit 402, the distance calculated by the distance calculation unit 404 and the angle calculation unit 405,
An evaluation value is obtained from each angle, and the determination unit 403 determines whether to integrate the character element 104 into the character string 105, the character string 106, or neither of them by the two evaluation values.

【００１９】文字列抽出の結果は、文字情報テーブル２
１２に格納される。図８に示すように文字情報テーブル
２１２に格納された文字列の傾き情報及び各文字の位置
情報に基づき、文字認識部２０６の画像切り出し部５０
１はフレームメモリ２０９より一文字分の画像を切り出
す（図１３のＳ１７）。The result of the character string extraction is the character information table 2
Stored in 12. As shown in FIG. 8, the image cutout unit 50 of the character recognition unit 206 is based on the inclination information of the character string and the position information of each character stored in the character information table 212.
1 cuts out an image for one character from the frame memory 209 (S17 in FIG. 13).

【００２０】切り出された文字画像を認識するために
は、文字画像の特徴と辞書５０５の標準パターンの特徴
と照合する。その際、文字の特徴と辞書５０５の標準パ
ターンの特徴との照合の方向性を合わせるために図８に
示す画像回転部５０２によって文字列の傾きに応じた回
転処理を切り出された文字画像に施して正規化画像を作
成し、特徴抽出部５０３によって特徴を抽出した後、そ
の正規化画像の特徴と辞書５０５の標準パターンの特徴
を特徴照合部５０４で照合し文字の認識結果を得る（図
１３のＳ１８）。In order to recognize the cut out character image, the characteristics of the character image are compared with the characteristics of the standard pattern in the dictionary 505. At that time, in order to match the direction of collation between the character feature and the feature of the standard pattern of the dictionary 505, a rotation process according to the inclination of the character string is applied to the clipped character image by the image rotation unit 502 shown in FIG. After the normalized image is created by the feature extracting unit 503, the feature extracting unit 503 extracts the feature, and the feature matching unit 504 matches the feature of the normalized image with the feature of the standard pattern of the dictionary 505 to obtain a character recognition result (FIG. 13). S18).

【００２１】以上の文字列を構成する文字の認識処理を
全文字列について行い（図１３のＳ１６）、得られた文
字の認識結果は文字情報テーブル２１２に格納される
（図１３のＳ１９）。The recognition processing of the characters constituting the above character string is performed for all character strings (S16 in FIG. 13), and the obtained character recognition result is stored in the character information table 212 (S19 in FIG. 13).

【００２２】一方、図形情報テーブル２１２の図形要素
に対しては、ベクター演算部２０７において、直線・円
・円弧等の図形プリミティブ認識が以下のように行われ
る。まず、ベクターの長さと連続する２本のベクター間
の角度から、円弧の候補となるベクターを検索する。On the other hand, with respect to the graphic elements of the graphic information table 212, graphic primitive recognition of straight lines, circles, arcs, etc. is performed in the vector operation unit 207 as follows. First, a vector that is a candidate for a circular arc is searched from the length of the vector and the angle between two consecutive vectors.

【００２３】次に検索されたベクターから、可能性のあ
る円弧の半径と中心の候補を求め、得られた候補のう
ち、与えられたしきい値内で最も多くのベクターからな
るものを円、円弧の抽出結果とする。Next, from the searched vectors, candidates for the radius and center of a possible arc are obtained, and among the obtained candidates, the one consisting of the largest number of vectors within a given threshold is circled, This is the extraction result of the arc.

【００２４】円、円弧以外のベクターを直線の候補と
し、その中で誤差がしきい値以下で、ある一定以上の長
さを持つベクターの並びを直線とする。図形プリミティ
ブ認識が行われた結果は再び図形情報テーブル２１１に
格納される。Vectors other than circles and arcs are set as straight line candidates, and among them, an array of vectors whose error is less than or equal to a threshold value and which has a certain length or more is set as a straight line. The result of the graphic primitive recognition is stored in the graphic information table 211 again.

【００２５】以上により画像の認識が終了し、図形情報
テーブル２１１及び文字情報テーブル２１２の認識結果
が通信制御部２０８を経由してホストコンピュータ２１
４へ出力される。The image recognition is completed as described above, and the recognition results of the graphic information table 211 and the character information table 212 are transferred to the host computer 21 via the communication control unit 208.
4 is output.

【００２６】ホストコンピュータ２１４では、認識結果
を表示してオペレータによる確認・修正作業が行われた
後、ＣＡＤ図面データベースあるいはマッピング・デー
タベースへ格納され、ＣＡＤシステムにて、例えば図面
を作成する際に情報として用いられる。In the host computer 214, after the recognition result is displayed and confirmation / correction work is performed by the operator, the recognition result is stored in the CAD drawing database or the mapping database, and the information is created when the CAD system creates a drawing, for example. Used as.

【００２７】[0027]

【発明が解決しようとする課題】しかしながら、従来で
は文字要素を統合し文字列として抽出を行う時点におい
て、複数の文字列への統合可能な場合でも、周囲状況の
みから文字列の統合を行うため誤った文字列の抽出が成
され、認識不能となるという問題を生じていた。However, in the prior art, when the character elements are integrated and extracted as a character string, the character strings are integrated only from the surrounding situation even if they can be integrated into a plurality of character strings. There was a problem that the character string was erroneously extracted and it became unrecognizable.

【００２８】即ち、文字と図形の混在する設計図等の画
像では、前記した文字列の方向とその文字列を構成する
各文字の底辺が基本的に平行であるという書式に従い、
且つ任意の傾きをもって文字列が記入される。That is, in an image such as a design drawing in which characters and figures are mixed, according to the format that the direction of the character string and the bases of the characters forming the character string are basically parallel,
In addition, the character string is written with an arbitrary inclination.

【００２９】その文字列を構成する各文字を認識する場
合は、まず、文字列を構成する文字を切り出し、辞書の
標準パターンの特徴と照合させる際、照合の方向性を合
わせるため文字列の傾きに応じて文字に回転処理を施し
た正規化画像を作成し、その正規化画像の特徴抽出を行
う必要がある。When recognizing each character forming the character string, first, when the character forming the character string is cut out and collated with the characteristics of the standard pattern of the dictionary, the inclination of the character string is aligned to match the direction of the collation. It is necessary to create a normalized image in which characters are rotated according to the above, and to extract the features of the normalized image.

【００３０】そのため、誤った文字列の抽出を行うと、
その文字列を構成する各文字に誤った回転処理を施した
正規化画像が作成され、その正規化画像の特徴と辞書の
標準パターンの特徴を照合するため、文字列内の多くの
文字の認識が正常に行われず、抽出された文字列は認識
不能となる可能性が大きい。Therefore, if an incorrect character string is extracted,
A normalization image is created by performing an incorrect rotation process on each character that makes up the character string, and the characteristics of the normalization image are compared with the characteristics of the standard pattern in the dictionary. Is not normally performed, and the extracted character string is likely to be unrecognizable.

【００３１】従って、本発明は、文字列の方向とその文
字列を構成する各文字の底辺が基本的に平行であるとい
う書式に従った画像を読み取り、文字列を抽出する際、
文字要素が複数の文字列に統合可能な場合、複数の候補
文字列を挙げ、各候補文字列内の個々の文字の認識結果
を評価して文字列構成の適否へフィードバックすること
により、誤った文字列の抽出を削減し、認識度を向上さ
せることができる画像認識装置を提供することを目的と
する。Therefore, according to the present invention, when an image is read according to a format in which the direction of the character string and the bases of the characters forming the character string are basically parallel, and the character string is extracted,
When a character element can be integrated into multiple character strings, multiple candidate character strings are listed, the recognition result of each character in each candidate character string is evaluated, and feedback is given to the propriety of the character string configuration. An object of the present invention is to provide an image recognition device capable of reducing the extraction of character strings and improving the degree of recognition.

【００３２】[0032]

【課題を解決するための手段】図１は本発明の第１の原
理図である。図中、１は入力手段であり、少なくとも所
定の書式に従って書かれた文字を有する画像５を読み取
るものである。FIG. 1 is a diagram showing the first principle of the present invention. In the figure, reference numeral 1 is an input means for reading an image 5 having at least characters written according to a predetermined format.

【００３３】２は文字列抽出部であり、個々の文字を統
合し文字列として抽出する際に、複数の文字列へ統合可
能な場合はその複数の統合可能な文字列を候補文字列と
して抽出するものである。Reference numeral 2 denotes a character string extraction unit that, when individual characters are integrated and extracted as a character string, and if they can be integrated into a plurality of character strings, the plurality of integrable character strings are extracted as candidate character strings. To do.

【００３４】３は文字認識部であり、文字列抽出部２で
抽出された各候補文字列の構成要件である文字列の傾
き、文字位置に従って、各文字を切り出し認識するもの
である。A character recognition unit 3 cuts out and recognizes each character according to the inclination and the character position of the character string which are the constituent requirements of each candidate character string extracted by the character string extraction unit 2.

【００３５】４は文字列評価部であり、文字認識部３で
認識した各候補文字列内の文字要素の認識不能文字数を
計数し、認識不能率（候補文字列を構成する全文字数と
その候補文字列の認識不能文字数の割合）が最小である
候補文字列を認識結果として出力するものである。A character string evaluation unit 4 counts the number of unrecognizable characters of the character elements in each candidate character string recognized by the character recognition unit 3, and recognizes the unrecognizable rate (the total number of characters forming the candidate character string and its candidates). The candidate character string having the smallest ratio of the number of unrecognizable characters of the character string) is output as the recognition result.

【００３６】図２は本発明の第２の原理図である。図
中、１は入力手段、２は文字列抽出部、３は文字認識部
であって、前述の図１と同様であるため、その具体的な
説明を省略する。FIG. 2 is a second principle diagram of the present invention. In the figure, 1 is an input unit, 2 is a character string extraction unit, and 3 is a character recognition unit, which are the same as those in FIG. 1 described above, so a detailed description thereof will be omitted.

【００３７】６は文字列選択部であり、文字認識部３で
認識した各候補文字列内の文字要素の認識不能文字数を
計数し、認識不能率が設定基準を満たす候補文字列を全
て認識結果として出力し、その何れかを選択させるもの
である。A character string selecting unit 6 counts the number of unrecognizable characters of the character elements in each candidate character string recognized by the character recognizing unit 3 and recognizes all candidate character strings whose unrecognizable rate satisfies the setting criterion. Is output, and any one of them is selected.

【００３８】[0038]

【作用】本発明の請求項１では、文字列抽出部２におい
て、文字列の抽出を行う時点で文字要素が複数の文字列
に統合可能な場合、その複数の統合可能な文字列を候補
文字列として全て抽出する。According to the first aspect of the present invention, when the character element can be integrated into a plurality of character strings at the time of extracting the character string in the character string extracting unit 2, the plurality of the character strings that can be integrated are candidate characters. Extract all as columns.

【００３９】その文字列抽出部２によって抽出された複
数の候補文字列の構成文字を文字列の傾き、文字位置に
従って切り出し、文字認識部３で辞書の標準パターンの
特徴と照合できるようにするために、文字列の傾きに応
じて切り出された文字に回転処理を施した正規化画像を
作成し特徴抽出を行った後、その正規化画像の特徴と辞
書の標準パターンの特徴とを照合し、認識処理を行う。In order to make the constituent characters of the plurality of candidate character strings extracted by the character string extraction unit 2 cut out according to the inclination and character position of the character string, and to allow the character recognition unit 3 to collate with the characteristics of the standard pattern of the dictionary. In addition, after creating a normalized image obtained by performing rotation processing on the characters cut out according to the inclination of the character string and performing feature extraction, the features of the normalized image and the features of the standard pattern of the dictionary are collated, Perform recognition processing.

【００４０】斜めまたは垂直方向の文字列を所定の書式
に従って書くと、その文字列を構成する個々の文字は文
字列の方向と各文字の底辺が平行になるように傾く（文
字の底辺と平行な方向を文字の方向とする）。When a character string in an oblique or vertical direction is written according to a predetermined format, the individual characters forming the character string are inclined so that the direction of the character string is parallel to the base of each character (parallel to the base of the character. And the direction of the character is).

【００４１】そのため、文字列を構成する各文字を認識
するためには、文字列の方向を文字の方向とみなし、傾
いた状態で文字を切り出す。一方、文字の特徴と辞書の
標準パターンの特徴の照合を行う際、照合の方向性を合
わせる必要があるため、傾いた状態で切り出された文字
の方向と辞書の標準パターンの方向とを一致させるよう
に切り出した文字を回転させる正規化を行わなければな
らない。Therefore, in order to recognize each character forming the character string, the direction of the character string is regarded as the direction of the character, and the character is cut out in an inclined state. On the other hand, when matching the characteristics of the character with the characteristics of the standard pattern of the dictionary, it is necessary to match the directionality of the matching. Therefore, the direction of the character cut out in an inclined state matches the direction of the standard pattern of the dictionary. Normalization must be done to rotate the cut out characters.

【００４２】そのため、誤った文字列抽出を行うと、文
字列の方向とその文字列を構成する各文字の方向が一致
しない。従って、抽出された文字列の構成文字を切り出
し、文字の方向と標準パターンの方向を一致させるため
に切り出された文字を回転する正規化を行い、照合を行
ったとしても認識不能となる可能性が大きい。Therefore, if the wrong character string is extracted, the direction of the character string does not match the direction of each character forming the character string. Therefore, the constituent characters of the extracted character string may be cut out, and the cut out characters may be rotated in order to match the direction of the character with the direction of the standard pattern. Is big.

【００４３】しかし、本発明では文字要素が複数の文字
列に統合可能な場合、その複数の統合可能な文字列を候
補文字列として全て抽出し、その抽出された候補文字列
を構成する文字の認識を行った後、文字列評価部４にお
いて、各候補文字列について認識不能文字数を計数し、
認識不能率が最小である候補文字列を認識結果として出
力するようにしたため、抽出されるべき文字列が出力さ
れる可能性が大きい。However, in the present invention, when a character element can be integrated into a plurality of character strings, all of the plurality of character strings that can be integrated are extracted as candidate character strings, and the characters constituting the extracted candidate character strings are extracted. After recognition, the character string evaluation unit 4 counts the number of unrecognizable characters for each candidate character string,
Since the candidate character string having the smallest unrecognizable rate is output as the recognition result, there is a high possibility that the character string to be extracted is output.

【００４４】これらのことにより、誤った文字列抽出に
よる文字の誤認識及び認識不能を削減することができ、
高精度の認識結果を得ることができる。本発明の請求項
２では、候補文字列を構成する各文字を認識した後、文
字列選択部６において、各候補文字列の認識不能文字数
を計数し、認識不能率が設定基準を満たす候補文字列を
全て認識結果として出力し、その何れかを選択させるよ
うにしている。Due to the above, it is possible to reduce erroneous recognition and unrecognizable character due to erroneous character string extraction,
It is possible to obtain a highly accurate recognition result. According to claim 2 of the present invention, after recognizing each character forming the candidate character string, the character string selecting unit 6 counts the number of unrecognizable characters of each candidate character string, and the unrecognizable rate satisfies the setting criterion. All columns are output as recognition results, and any one of them is selected.

【００４５】よって、請求項２の手段を用いると、請求
項１によって出力される認識結果を含む複数の候補文字
列の中から認識結果を選択できるため、請求項１よりさ
らに高精度の認識結果を得ることができる。Therefore, when the means of claim 2 is used, the recognition result can be selected from a plurality of candidate character strings including the recognition result output by claim 1, so that the recognition result with higher accuracy than that of claim 1 is obtained. Can be obtained.

【００４６】[0046]

【実施例】まず、第１の実施例について説明する。図３
は本発明の第１の実施例を示す図である。EXAMPLE First, a first example will be described. Figure 3
FIG. 3 is a diagram showing a first embodiment of the present invention.

【００４７】図４、図５、図１２にそのフローチャート
を示す。図６は本実施例の候補文字列の抽出例を示す図
である。図７は画像認識装置の文字列抽出部を示す図で
ある。The flowcharts are shown in FIGS. 4, 5 and 12. FIG. 6 is a diagram showing an example of extracting candidate character strings in this embodiment. FIG. 7 is a diagram showing a character string extraction unit of the image recognition device.

【００４８】図８は画像認識装置の文字認識部を示す図
である。画像の読み込み、ベクター・データの発生、ベ
クターのグループ化および文字要素と図形要素の分離は
前記従来例のように行うため、その詳細な説明は省略す
る（図１２参照）。FIG. 8 is a diagram showing a character recognition unit of the image recognition apparatus. Since the image reading, vector data generation, vector grouping, and character element / graphic element separation are performed as in the conventional example, detailed description thereof is omitted (see FIG. 12).

【００４９】図７に示すように文字情報テーブル２１２
に格納された文字要素に対して（図４のＳ２１）、文字
列抽出部２０５の周辺探索部４０１は注目する文字要素
の周辺に他の文字要素が存在するかどうか探索する（図
４のＳ２２）。As shown in FIG. 7, the character information table 212
For the character element stored in (S21 in FIG. 4), the peripheral search unit 401 of the character string extraction unit 205 searches for another character element around the target character element (S22 in FIG. 4). ).

【００５０】探索した結果、文字要素が存在する場合は
（図４のＳ２３）、注目する文字要素の重心と探索され
た任意の他の文字要素の重心の距離を距離計算部４０４
が計算し、任意の他の文字要素の傾きと任意の他の文字
要素に注目する文字要素を統合したときの傾きとの差を
角度計算部４０５が計算し、その得られた距離および角
度より統合評価部４０２において文字列として統合でき
る可能性を評価値として表し、距離および角度が小さけ
れば小さい程高い評価値が与えられる。If a character element exists as a result of the search (S23 in FIG. 4), the distance calculation unit 404 calculates the distance between the center of gravity of the character element of interest and the center of gravity of any other searched character element.
Is calculated by the angle calculation unit 405, and the angle calculation unit 405 calculates the difference between the inclination of any other character element and the inclination when the character element of interest to the other arbitrary character element is integrated, and from the obtained distance and angle. The possibility of being integrated as a character string in the integrated evaluation unit 402 is represented as an evaluation value, and the smaller the distance and the angle, the higher the evaluation value.

【００５１】但し、角度と距離の評価の重みは距離の重
みの方が高いようになっている。周辺探索部４０１によ
って複数の文字要素が探索された場合には、探索された
任意の文字要素を探索された他の文字要素に変え、前述
の処理を同様に行う。However, the weight of the evaluation of the angle and the distance is higher than the weight of the distance. When a plurality of character elements are searched by the peripheral search unit 401, the searched arbitrary character element is changed to another searched character element, and the above-described processing is similarly performed.

【００５２】以上の処理を上記注目する文字要素につい
て探索された全ての文字要素に対して行い、それぞれの
評価値を求める。判定部４０３でしきい値を設定し、統
合評価部４０２によって得られた評価値の中からしきい
値より大きい評価値をもつ全ての文字要素の統合を候補
文字列として抽出する。The above processing is performed for all the character elements searched for the noted character element to obtain respective evaluation values. The determination unit 403 sets a threshold value, and from the evaluation values obtained by the integrated evaluation unit 402, the integration of all character elements having an evaluation value larger than the threshold value is extracted as a candidate character string.

【００５３】このため、ある文字要素が複数の文字列へ
統合可能な場合が存在する。この文字列抽出の処理を文
字情報テーブル２１２の全ての文字要素に対して行う
（図４のＳ２０）。Therefore, there is a case where a certain character element can be integrated into a plurality of character strings. This character string extraction processing is performed for all the character elements of the character information table 212 (S20 of FIG. 4).

【００５４】図６のように文字要素が複数の文字列に統
合可能な場合（図４のＳ２４）は、その複数の文字列を
候補文字列（図６（ａ）のＴ４１、Ｔ４２、Ｔ４３と
（ｂ）のＴＴＴ、４４４、１２３）として抽出し（図４
のＳ２５）、各候補文字列の構成文字及び文字を共有す
る他の候補文字列をリンク情報として各候補文字列に付
与する。When the character elements can be integrated into a plurality of character strings as shown in FIG. 6 (S24 in FIG. 4), the plurality of character strings are regarded as candidate character strings (T41, T42, T43 in FIG. 6A). (B) TTT, 444, 123) and extracted (FIG. 4).
S25), the constituent characters of each candidate character string and other candidate character strings sharing the character are added to each candidate character string as link information.

【００５５】また、文字要素が１つの文字列のみに統合
可能な場合は独立文字列（ある文字列の構成文字が他の
文字列の構成文字とならない文字列）としてに抽出する
（図４のＳ２６）。When a character element can be integrated into only one character string, it is extracted as an independent character string (a character string in which the constituent characters of one character string do not become the constituent characters of another character string) (see FIG. 4). S26).

【００５６】文字列抽出の結果及びリンク情報は、文字
情報テーブル２１２に格納される。文字情報テーブル２
１２に格納された独立文字列または候補文字列の傾き情
報及び各文字の位置情報に基づき、文字認識部２０６の
画像切り出し部５０１はフレームメモリ２０９より一文
字分の画像を切り出す（図４のＳ２８）。The result of the character string extraction and the link information are stored in the character information table 212. Character information table 2
The image cutout unit 501 of the character recognition unit 206 cuts out an image of one character from the frame memory 209 based on the inclination information of the independent character string or the candidate character string stored in 12 and the position information of each character (S28 in FIG. 4). .

【００５７】切り出された文字画像を認識するために
は、文字画像の特徴と辞書５０５の標準パターンの特徴
と照合する。その際、文字の特徴と辞書５０５の標準パ
ターンの特徴との照合の方向性を合わせるために図８に
示す画像回転部５０２によって独立文字列または候補文
字列の傾きに応じた回転処理を切り出された各文字画像
に施して正規化画像を作成し、特徴抽出部５０３によっ
て特徴を抽出した後、その正規化画像の特徴と辞書５０
５の標準パターンの特徴を特徴照合部５０４で照合し文
字の認識結果を得る（図４のＳ２９）。In order to recognize the cut out character image, the characteristics of the character image are compared with the characteristics of the standard pattern in the dictionary 505. At that time, in order to match the direction of collation between the character feature and the feature of the standard pattern of the dictionary 505, the image rotation unit 502 shown in FIG. 8 cuts out rotation processing according to the inclination of the independent character string or the candidate character string. The normalized image is created by applying it to each character image, the features are extracted by the feature extraction unit 503, and then the features of the normalized image and the dictionary 50 are extracted.
The characteristics of the standard pattern of No. 5 are collated by the characteristic collating unit 504 to obtain a character recognition result (S29 in FIG. 4).

【００５８】この認識処理を全ての独立文字列及び候補
文字列について行い、得られた文字の認識結果は文字情
報テーブル２１２に格納する（図４のＳ２７）。文字列
評価部３０１は、複数の候補文字列（図６（ａ）と
（ｂ）、矢印は文字列の方向）を持つ文字群が存在する
場合に、その文字群の複数の候補文字列の各々につい
て、認識結果を評価する。This recognition processing is performed for all the independent character strings and candidate character strings, and the obtained character recognition result is stored in the character information table 212 (S27 in FIG. 4). When a character group having a plurality of candidate character strings (FIGS. 6 (a) and 6 (b), the arrow indicates the direction of the character string) is present, the character string evaluation unit 301 determines whether or not a plurality of candidate character strings of the character group are present. For each, the recognition result is evaluated.

【００５９】具体的には、候補文字列を構成する文字の
認識不能文字数を計数して（図４のＳ３１）、リンク情
報に基づいた全候補文字列の認識不能文字数を計数後
（図４のＳ３０）、認識不能率が最小である候補文字列
を正解として出力し、文字情報テーブル２１２の他の候
補文字列についてはこれを削除する（図５のＳ３２）。Specifically, the number of unrecognizable characters of the characters forming the candidate character string is counted (S31 of FIG. 4), and the number of unrecognizable characters of all candidate character strings based on the link information is counted (see FIG. 4). S30), the candidate character string having the smallest unrecognizable rate is output as a correct answer, and the other candidate character strings in the character information table 212 are deleted (S32 in FIG. 5).

【００６０】図６（ａ）における候補文字列の文字列方
向は水平であるため、候補文字列を構成する各文字に回
転処理を施し正規化を行う必要はない。そして、その候
補文字列を構成する各文字の特徴を抽出し、辞書５０５
の標準パターンの特徴と照合すると、各文字は正方向で
あるため正しく認識される。Since the character string direction of the candidate character string in FIG. 6A is horizontal, it is not necessary to perform rotation processing on each character forming the candidate character string for normalization. Then, the feature of each character that constitutes the candidate character string is extracted, and the dictionary 505 is extracted.
When compared with the features of the standard pattern of, each character is correctly recognized because it is in the positive direction.

【００６１】一方、同図（ｂ）における候補文字列の文
字列方向は垂直であるため、その候補文字列を構成する
各文字に９０度の回転処理を施した正規化画像を作成す
る。しかし、その正規化画像から特徴を抽出し、辞書５
０５の標準パターンの特徴と照合したとしても、候補文
字列を構成する各文字は正方向でないため認識不能また
は誤認識となる。On the other hand, since the character string direction of the candidate character string in FIG. 6B is vertical, a normalized image is created by rotating each character forming the candidate character string by 90 degrees. However, the features are extracted from the normalized image, and the dictionary 5
Even if the characters of the standard pattern of No. 05 are matched, each character forming the candidate character string is not in the normal direction, and thus cannot be recognized or is erroneously recognized.

【００６２】従って、図６（ａ）と（ｂ）に示される各
候補文字列の認識不能率を比較すると、（ａ）に示され
る候補文字列の認識不能率のほうが低いため、（ａ）に
示される候補文字列が認識結果として出力され、（ｂ）
に示される候補文字列は削除される。Therefore, comparing the unrecognizable rates of the candidate character strings shown in FIGS. 6A and 6B, the unrecognizable rate of the candidate character strings shown in FIG. 6A is lower. The candidate character string shown in is output as a recognition result, and (b)
The candidate character string shown in is deleted.

【００６３】次に第２の実施例について説明する。図９
は本発明の第２の実施例を示す図である。図４、図１
０、図１２にそのフローチャートを示す。Next, the second embodiment will be described. Figure 9
FIG. 6 is a diagram showing a second embodiment of the present invention. 4 and 1
0, the flowchart is shown in FIG.

【００６４】第２の実施例が第１の実施例と異なる点
は、文字列評価部３０１に代えて文字列選択部３０２に
した点である。尚、第１の実施例と同様の処理を行うと
ころはその説明を省略する（図１２及び図４参照）。The second embodiment differs from the first embodiment in that the character string evaluation unit 301 is replaced by a character string selection unit 302. Incidentally, the description of the same processing as that of the first embodiment will be omitted (see FIGS. 12 and 4).

【００６５】文字列選択部３０２において各候補文字列
の認識不能文字数を計数し（図４のＳ３１）、計数し終
わった後（図４のＳ３０）、文字情報テーブル２１２の
候補文字列のうち、設定基準（例えば、認識不能率が１
／３以下など）を満たさない候補文字列は削除し、設定
基準を満たしている候補文字列は通信制御部２０８を経
由してホストコンピュータ２１４に送り、ホストコンピ
ュータ２１４の制御に基づきディスプレイ（ＤＰ）２１
５に表示される（図１０のＳ３３）。In the character string selection unit 302, the number of unrecognizable characters of each candidate character string is counted (S31 in FIG. 4), and after counting (S30 in FIG. 4), among the candidate character strings in the character information table 212, Setting criteria (for example, the unrecognizable rate is 1
/ 3 or less) is deleted, and the candidate character string satisfying the setting criterion is sent to the host computer 214 via the communication control unit 208, and the display (DP) is controlled under the control of the host computer 214. 21
5 is displayed (S33 in FIG. 10).

【００６６】ディスプレイ２１５に表示された候補文字
列をホストコンピュータ２１４に接続されるマンマシン
インターフェース（ＭＭＩ）２１６でオペレータに何れ
かの選択を行わせるが、表示された候補文字列が認識さ
れるべき画像と異なるとオペレータが判断した場合、リ
ンク情報に基づき他の候補文字列を表示し選択させる
（図１０のＳ３４）。The man-machine interface (MMI) 216 connected to the host computer 214 allows the operator to select one of the candidate character strings displayed on the display 215, but the displayed candidate character string should be recognized. If the operator determines that the image is different from the image, another candidate character string is displayed and selected based on the link information (S34 in FIG. 10).

【００６７】一方、第１および第２の実施例における図
形情報テーブル２１１の図形要素に対しては、ベクター
演算部２０７において、従来同様、直線・円・円弧等の
図形プリミティブ認識が行われ、結果は再び図形情報テ
ーブル２１１に格納される。On the other hand, with respect to the graphic elements of the graphic information table 211 in the first and second embodiments, in the vector operation unit 207, graphic primitives such as straight lines, circles and arcs are recognized as in the conventional case, and the result is obtained. Are again stored in the graphic information table 211.

【００６８】以上により画像の認識が終了し、図形情報
テーブル２１１及び文字情報テーブル２１２の認識結果
が通信制御部２０８を経由してＣＡＤプログラムまた
は、マッピング・プログラムを有するワークステーショ
ン等のホストコンピュータ２１４へ出力される。The image recognition is completed as described above, and the recognition results of the graphic information table 211 and the character information table 212 are passed to the host computer 214 such as a workstation having a CAD program or a mapping program via the communication control unit 208. Is output.

【００６９】ホストコンピュータ２１４では、認識結果
を表示してオペレータによる確認・修正作業が行われた
後、ＣＡＤ図面データベースあるいはマッピング・デー
タベースへ格納され、ＣＡＤシステムにて、例えば図面
を作成する際に情報として用いられる。In the host computer 214, after the recognition result is displayed and confirmation / correction work is performed by the operator, the recognition result is stored in the CAD drawing database or the mapping database, and information is created when the CAD system creates a drawing, for example. Used as.

【００７０】[0070]

【発明の効果】以上説明したように、本発明によれば、
文字列の方向とその文字列を構成する各文字の底辺が基
本的に平行である書式に従った画像を読み取る画像認識
装置において、精度の高い文字認識機能を提供すること
が可能であり、設計図面・地図等のデータ・ベース作成
の効率化に寄与するところが大きい。As described above, according to the present invention,
It is possible to provide a highly accurate character recognition function in an image recognition device that reads an image according to a format in which the direction of a character string and the bases of the characters that make up the character string are basically parallel. It greatly contributes to the efficiency of data base creation such as drawings and maps.

[Brief description of drawings]

【図１】本発明の第１の原理図である。FIG. 1 is a first principle diagram of the present invention.

【図２】本発明の第２の原理図である。FIG. 2 is a second principle diagram of the present invention.

【図３】本発明の第１の実施例である。FIG. 3 is a first embodiment of the present invention.

【図４】本発明の第１及び第２の実施例のフローチャー
トである。FIG. 4 is a flowchart of the first and second embodiments of the present invention.

【図５】本発明の第１の実施例のフローチャートであ
る。FIG. 5 is a flowchart of the first embodiment of the present invention.

【図６】本実施例の候補文字列抽出例を示す図である。FIG. 6 is a diagram showing an example of candidate character string extraction according to the present embodiment.

【図７】画像認識装置の文字列抽出部を示す図である。FIG. 7 is a diagram showing a character string extraction unit of the image recognition device.

【図８】画像認識装置の文字認識部を示す図である。FIG. 8 is a diagram showing a character recognition unit of the image recognition device.

【図９】本発明の第２の実施例である。FIG. 9 is a second embodiment of the present invention.

【図１０】本発明の第２の実施例のフローチャートであ
る。FIG. 10 is a flow chart of a second embodiment of the present invention.

【図１１】従来例を示す図である。FIG. 11 is a diagram showing a conventional example.

【図１２】従来例のフローチャート（その１）である。FIG. 12 is a flowchart (part 1) of a conventional example.

【図１３】従来例のフローチャート（その２）である。FIG. 13 is a flowchart (part 2) of a conventional example.

【図１４】ベクターのグループ化を示す図である。FIG. 14 is a diagram showing grouping of vectors.

【図１５】文字列の作成例を示す図である。FIG. 15 is a diagram showing an example of creating a character string.

[Explanation of symbols]

１入力手段２文字列抽出部３文字認識部４文字列評価部５画像６文字列選択部 1 Input means 2 Character string extractor 3 Character recognition part 4 Character string evaluation section 5 images 6 Character string selection section

Claims

[Claims]

1. An image recognition device for recognizing a character by reading an image (5) having a character written according to at least a predetermined format and comparing the feature extracted from the character with the feature of a standard pattern in a dictionary. In the case where the characters can be integrated into a plurality of character strings when the character is integrated by the input means (1) for reading at least the character in the image (5) and extracted as a character string according to ambient conditions. A character string extraction unit (2) for extracting the plurality of character strings as a candidate character string, cutting out the constituent characters of the candidate character string, and creating a normalized image of the constituent characters according to the inclination of the candidate character string, A character recognition unit (3) for recognizing the constituent characters by matching the characteristics of the normalized image with the characteristics of the standard pattern of the dictionary, counting the number of unrecognizable characters of the plurality of candidate character strings, and calculating the plurality of candidates. String of Image recognition apparatus characterized by having a character string evaluation unit 識不 efficiency is output as the recognition result candidate character string is a minimum (4).

2. An image recognition apparatus for recognizing a character by reading an image (5) having a character written according to at least a predetermined format, comparing the feature extracted from the character with the feature of a standard pattern of a dictionary. In the case where the characters can be integrated into a plurality of character strings when the character is integrated by the input means (1) for reading at least the character in the image (5) and extracted as a character string according to ambient conditions. A character string extraction unit (2) for extracting the plurality of character strings as a candidate character string, cutting out the constituent characters of the candidate character string, and creating a normalized image of the constituent characters according to the inclination of the candidate character string, A character recognition unit (3) for recognizing the constituent characters by matching the characteristics of the normalized image with the characteristics of the standard pattern of the dictionary, counting the number of unrecognizable characters of the plurality of candidate character strings, and calculating the plurality of candidates. String of Output as all recognition result candidate character string 識不 efficiency meets the set criteria, the image recognition apparatus characterized by having character string selection unit for selecting one of said recognition result (6).

3. A recognition method for recognizing a character by reading an image having at least a character written according to a predetermined format, comparing the feature extracted from the character with the feature of a standard pattern of a dictionary, and recognizing the character. Of at least the above characters are read, and when the individual characters are integrated according to ambient conditions and extracted as a character string, when the characters can be integrated into a plurality of character strings, the plurality of character strings are extracted as candidate character strings, The constituent characters of the candidate character string are cut out, a normalized image of the constituent characters is created according to the inclination of the candidate character string, and the characteristics of the normalized image and the characteristics of the standard pattern of the dictionary are compared to perform the constituent characters. Is recognized, the number of unrecognizable characters of the plurality of candidate character strings is counted, and the candidate character string having the smallest unrecognizable rate of the plurality of candidate character strings is output as a recognition result. Method.

4. A recognition method for recognizing a character by reading an image having at least a character written according to a predetermined format, comparing the feature extracted from the character with the feature of a standard pattern in a dictionary, and recognizing the character. Of at least the above characters are read, and when the individual characters are integrated according to ambient conditions and extracted as a character string, when the characters can be integrated into a plurality of character strings, the plurality of character strings are extracted as candidate character strings, The constituent characters of the candidate character string are cut out, a normalized image of the constituent characters is created according to the inclination of the candidate character string, and the characteristics of the normalized image and the characteristics of the standard pattern of the dictionary are compared to perform the constituent characters. Is recognized, the number of unrecognizable characters of the plurality of candidate character strings is counted, all candidate character strings whose unrecognizable rate of the plurality of candidate character strings satisfy a setting criterion are output as a recognition result, and the recognition result is obtained. Recognition method for causing selected one of.