JP2993533B2

JP2993533B2 - Information processing device and character recognition device

Info

Publication number: JP2993533B2
Application number: JP3213665A
Authority: JP
Inventors: 浩史吉田; 義征山下
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-08-26
Filing date: 1991-08-26
Publication date: 1999-12-20
Anticipated expiration: 2014-12-20
Also published as: JPH0554183A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to an information processing apparatus capable of determining the order relation among character blocks in a document image composed of a plurality of character blocks, and to a character recognition apparatus provided with this information processing apparatus.

[0002]

[Prior Art] If a machine could correctly obtain the information carried by a document from a document image on a medium, it would be possible to build information processing apparatuses with various functions, for example a filing apparatus that stores document images, a character recognition apparatus, an apparatus that outputs recognized characters as speech, or an apparatus that translates recognized characters. However, many document images consist, as shown for example in FIG. 3, of ruled lines or blank space (blank space in the illustrated example) together with a plurality of character blocks (31 to 34 in FIG. 3), in each of which character strings are densely arranged and which are separated from their surroundings by the blank space. Unlike the processing of document images whose format is known, recognition processing of such document images requires the special processing of extracting the character blocks and determining the order relation among them.

[0003] As a conventional apparatus capable of doing this, there is, for example, the automatic reading apparatus for documents of unknown format disclosed in Reference 1: IEICE Technical Report PRU86-33. In the apparatus disclosed in this reference, document components such as heading character blocks and body-text character blocks are extracted from a document of unknown format using peripheral distribution features, line density features, and circumscribed rectangle features; individual characters are then extracted from each of these character blocks; and feature extraction is performed on each character to recognize it.

[0004] In this case, the order relation of the character blocks has generally been determined, as disclosed for example in Reference 2: Transactions of the Institute of Electronics, Information and Communication Engineers (1986.3), Vol. J69-D, No. 3, p. 404, by calculating a position evaluation value h of each character block in the document according to the following equation (1) and ordering the blocks by ascending h.

[0005] h = Xc + ε·Yc   (1)

[0006] Here, Xc is the X coordinate of the upper-left corner of each character block, Yc is the Y coordinate of the upper-left corner of each character block, and ε is a constant of about 0.1.

[0007]

[Problem to Be Solved by the Invention] However, the automatic reading apparatus disclosed in the above reference follows the position evaluation value h. Consequently, for a document image such as that shown in FIG. 3, even when the proper order relation of character blocks 31 to 34 is 31 → 33 → 32 → 34, the order is determined to be 31 → 32 → 33 → 34. The apparatus therefore cannot correctly determine the order relation of character blocks set in different typefaces.

[0008] Accordingly, when this method is applied to a filing apparatus that stores document images, the images cannot be stored in the correct order, and the stored document image data cannot be used efficiently. When this method is applied to a character recognition apparatus, character recognition cannot be performed in the correct order, so the apparatus is not fit for practical use. Moreover, to obtain accurate recognition results, a character recognition apparatus sometimes performs some post-processing after character-by-character recognition, for example language processing using linguistic information. Such post-processing almost always operates on the sequence of recognized characters, so when recognition is not performed in the correct order, as described above, the post-processing does not function at all and recognition accuracy drops markedly. Preventing this, or correcting the result, requires the operator to specify the order or to make corrections; because this work is laborious, operability deteriorates, and in the end a high-performance character recognition apparatus that is effective in practice cannot be realized.

[0009] The present invention has been made in view of these points. Accordingly, an object of the invention is to provide an information processing apparatus capable of accurately determining the order relation among character blocks, and a character recognition apparatus provided with it.

[0010]

[Means for Solving the Problem] To achieve this object, the information processing apparatus of the present invention comprises: a character block extraction unit that extracts character blocks from a document image and determines a provisional order relation among the character blocks based on their positional relationship; a character segmentation unit that cuts out character patterns from the extracted character blocks; a dictionary unit having a plurality of types of dictionaries classified by character attribute; a character recognition unit that matches the character patterns cut out by the character segmentation unit against the dictionaries and outputs, for each character block, candidate character names together with the dictionary names of the candidate characters; a block feature extraction unit that extracts, based on the dictionary names of the candidate characters, a character block attribute feature for determining the attribute of the character block, that is, what attribute of characters the character block is composed of; and a determination unit that determines, from the provisional order relation among the character blocks, a new order relation based on the attributes of the character blocks.

[0011] The character recognition apparatus of the present invention comprises: an image input unit that inputs a document image; the information processing apparatus according to claim 1; a recognition result storage unit that stores, for each character block, the character recognition results of the character recognition unit in the information processing apparatus; and a block selection unit that outputs recognition results from the recognition result storage unit based on the result of the determination of the order relation of the character blocks made by the determination unit in the information processing apparatus.

[0012]

[Operation] According to the information processing apparatus of the present invention, a provisional order relation among character blocks is determined based on the positional relationship of the character blocks extracted from the document image. Character patterns cut out from the character blocks are matched against a plurality of dictionaries prepared in advance and classified by character attribute such as typeface, so that recognition yields candidate character names together with the dictionary names of the candidate characters. Based on these dictionary names, for example the dictionary name referenced most often among the dictionary names of the first-place candidate characters, a character block attribute feature is extracted for determining what attribute of characters each character block is composed of. Based on this character block attribute feature, a new order relation that preserves the continuity of attributes among character blocks is obtained from the provisional order relation. Therefore, even when character blocks are extracted and output from a document whose block layout has no regularity, each block can be output in the correct order relation based on the attributes of its characters.

[0013] When the information processing apparatus of the present invention is applied to a character recognition apparatus, character recognition can be performed on character blocks that are output in the correct order as described above, so that, for example, characters in a document whose layout has no regularity can be recognized in accordance with the flow of the document.

[0014]

[Embodiments] Embodiments of the information processing apparatus and the character recognition apparatus of the present invention are described below with reference to FIGS. 1 to 6.

[0015] FIG. 1 is a block diagram showing the configuration of a document image filing apparatus 100 using an information processing apparatus 120 of the present invention. The document image filing apparatus 100 comprises an image input unit 110, the information processing apparatus 120 of the present invention, a block selection unit 130, an image storage unit 140, and an output terminal 150. The information processing apparatus 120 comprises a character block extraction unit 121, a character segmentation unit 122, a character recognition unit 123, a dictionary unit 126, a block feature extraction unit 124, and a determination unit 125. The dictionary unit 126 consists of a first dictionary 126a and a second dictionary 126b, which store dictionary matrices for different typefaces.

[0016] FIG. 2 is a block diagram showing the configuration of the character recognition unit 123, which comprises a sub-pattern extraction unit 201, a feature extraction unit 202, and a matching unit 203.

[0017] FIG. 3 shows an example of a document image. The document image 30 consists of character blocks 31 and 33 in a roman typeface and character blocks 32 and 34 in a sans-serif typeface, separated by blank space.

[0018] FIG. 4 is a diagram for explaining the first dictionary 126a and the second dictionary 126b in the dictionary unit 126. In this embodiment, for example, the first dictionary uses a roman typeface as its standard patterns and the second dictionary uses a sans-serif typeface as its standard patterns.

[0019] FIG. 5 is a diagram for explaining the block feature extraction unit 124. Part (A) shows the character string of the first line of character block 31, the first-place candidate characters, and the dictionary numbers of the dictionary matrices from which those candidates were obtained. Parts (B), (C), and (D) likewise show the first-line character string, the first-place candidate characters, and the dictionary numbers of the dictionary matrices for character blocks 32, 33, and 34, respectively.

[0020] (First Embodiment) The operation of each component of the document image filing apparatus using the information processing apparatus of the present invention, which constitutes the first embodiment, is described in detail below with reference to FIGS. 1 to 5.

[0021] The image input unit 110 can be composed of, for example, a conventionally known image sensor. It converts the optical signal S from a document image 30 such as that shown in FIG. 3, on which characters, figures, and the like (hereinafter referred to as characters) are written, into a binary digital signal in which, for example, character stroke portions are represented by black bits and background portions by white bits (this signal is referred to as input image data), and outputs the input image data to the character block extraction unit 121 of the information processing apparatus 120.

[0022] The character block extraction unit 121 of the information processing apparatus 120 extracts, from the input image data supplied by the image input unit 110, the data corresponding to the character blocks of the document image (referred to as character block data), and outputs it to the character segmentation unit 122 and the block selection unit 130. In the example of the document image 30 shown in FIG. 3, it extracts regions 31 to 34, in which character strings are regularly and densely arranged and which are separated from their surroundings by blank space or the like; each of these regions is a character block.

[0023] Character block data can be extracted by various conventionally known methods; in this embodiment it is done as follows. First, the input image data is scanned in the vertical direction (the direction corresponding to the direction indicated by y in FIG. 3) to create a peripheral distribution (projection profile). Next, in this peripheral distribution, the region extending from the scanning line at which the number of black bits changes from 0 to 1 or more up to the scanning line immediately before the number changes from 1 or more to 0 is detected as one block in the column direction.

[0024] Next, the portion of the input image data corresponding to this one column-direction block is scanned in the horizontal direction (the direction corresponding to the direction indicated by x in FIG. 3). In the resulting horizontal peripheral distribution, if the width of the region extending from the scanning line at which the number of black bits changes from 0 to 1 or more up to the scanning line immediately before it changes from 1 or more to 0 is larger than a threshold TH, the data in the overlap between this region and the column-direction block is extracted as character block data. The threshold TH is a parameter determined by character size, font, and so on; in this embodiment TH = 140.
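The two-pass extraction of paragraphs [0023] and [0024] can be sketched roughly as follows. This is a minimal illustration, not the apparatus itself: the image is assumed to be a list of rows of 0/1 values (1 = black bit), the first pass is assumed to produce a profile indexed by x and the second a profile indexed by y, and the function names `nonzero_runs` and `extract_blocks` are hypothetical.

```python
def nonzero_runs(profile):
    """Return (start, end) pairs, end inclusive, of maximal runs where profile > 0."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i                      # count changed from 0 to 1 or more
        elif value == 0 and start is not None:
            runs.append((start, i - 1))    # line just before the count returns to 0
            start = None
    if start is not None:
        runs.append((start, len(profile) - 1))
    return runs


def extract_blocks(image, th):
    """Return bounding boxes (x0, y0, x1, y1) of character blocks.

    image: list of rows of 0/1 values (assumed representation).
    th:    minimum extent of a region in the second pass (TH in the text).
    """
    height, width = len(image), len(image[0])
    # First pass: black-bit count for each x position gives column-direction blocks.
    col_profile = [sum(image[y][x] for y in range(height)) for x in range(width)]
    blocks = []
    for x0, x1 in nonzero_runs(col_profile):
        # Second pass: profile restricted to the column-direction block.
        row_profile = [sum(image[y][x0:x1 + 1]) for y in range(height)]
        for y0, y1 in nonzero_runs(row_profile):
            if y1 - y0 + 1 > th:           # keep only regions larger than TH
                blocks.append((x0, y0, x1, y1))
    return blocks
```

With TH = 140 as in the embodiment, regions thinner than the threshold (isolated marks, for instance) would be discarded in the second pass.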

[0025] The output order of the extracted character blocks (referred to as the provisional order relation) is obtained by using the position information of each character block to compute, by equation (2), a position evaluation value E_i of each character block in the document image (where i is the character block number, for example 31 to 34 in FIG. 3), and outputting the blocks in ascending order of E_i.

[0026] E_i = y_ei + f·x_ei   (2)

[0027] In equation (2), x_ei and y_ei are the x and y coordinates of the upper-left corner of the rectangle circumscribing the character block, and f is a predetermined constant, 10 in this embodiment. The x and y coordinates follow the x, y coordinate system shown in FIG. 3. The output order of the character blocks of the document image shown in FIG. 3 is therefore 31 → 32 → 33 → 34.
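As a small illustration of equation (2), the provisional order can be computed by sorting the blocks on E_i. The corner coordinates below are hypothetical values chosen only so that the result matches the order stated in the text; the actual layout of FIG. 3 is not given numerically in the description, and the function name `provisional_order` is likewise an assumption.

```python
def provisional_order(corners, f=10):
    """Sort block numbers by E_i = y_ei + f * x_ei (equation (2)).

    corners: dict mapping block number -> (x, y) of the upper-left corner
             of the circumscribing rectangle.
    """
    return sorted(corners, key=lambda i: corners[i][1] + f * corners[i][0])


# Hypothetical upper-left corners for blocks 31 to 34 (not taken from FIG. 3).
layout = {31: (0, 0), 32: (0, 500), 33: (400, 0), 34: (400, 500)}
print(provisional_order(layout))  # [31, 32, 33, 34]
```

Because f multiplies the x coordinate, blocks are ordered mainly by horizontal position, and the y coordinate only breaks ties among blocks at similar x.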

[0028] The character segmentation unit 122 cuts out, from the character block data supplied by the character block extraction unit 121, the character lines that make up the character block, then cuts out character patterns one character at a time from each character line, and outputs the cut-out character patterns, block by block, to the sub-pattern extraction unit 201 (see FIG. 2) in the character recognition unit 123.

[0029] Character lines are cut out by scanning the character block data supplied by the character block extraction unit 121 sequentially, with the character line direction as the main scanning direction and the character column direction as the sub-scanning direction, creating a distribution of black pixels, and detecting as character line data the span from the position where the distribution changes from 0 to 1 or more up to the position immediately before it changes from 1 or more to 0. Character patterns are cut out from a character line by scanning the cut-out character line data with the character column direction as the main scanning direction and the character line direction as the sub-scanning direction, creating a distribution of black pixels, and sequentially cutting out as character pattern data each span from the position where the distribution changes from 0 to 1 or more up to the position immediately before it changes from 1 or more to 0.

[0030] The sub-pattern extraction unit 201 in FIG. 2 scans the character pattern supplied by the character segmentation unit 122 in a plurality of directions, detects on each scanning line runs of black pixels that continue for at least a predetermined value h (h = 5 in this embodiment), and extracts these runs as the black pixel components of a sub-pattern. It thereby extracts from the character pattern one sub-pattern per scanning direction and outputs the sub-patterns sequentially to the feature extraction unit 202. In this embodiment the scanning directions are the direction perpendicular to the character line direction (hereinafter, the X-axis direction), that is, the vertical direction; the direction parallel to it, that is, the horizontal direction; the direction 45° counterclockwise from the X axis (left diagonal direction); and the direction 45° clockwise from the X axis (right diagonal direction). The character pattern is scanned in each of these directions to extract four sub-patterns, one per direction.

[0031] For example, to extract the vertical sub-pattern, the vertical direction is taken as the main scanning direction, the character pattern data is scanned vertically, runs of consecutive black pixels (black runs) on each scanning line are detected, and black runs whose length L satisfies L ≥ h are extracted as the black pixel portion of the vertical sub-pattern. The sub-patterns for the remaining directions are extracted in the same way, taking each direction in turn as the main scanning direction.
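The run-length filtering described in paragraphs [0030] and [0031] might look like the following for the horizontal and vertical directions. This is a sketch under assumptions: the character pattern is taken to be a list of rows of 0/1 values, the function names are hypothetical, and the two diagonal directions are omitted for brevity.

```python
def horizontal_subpattern(pattern, h=5):
    """Keep only black pixels that lie in horizontal black runs of length >= h."""
    out = [[0] * len(row) for row in pattern]
    for y, row in enumerate(pattern):
        x = 0
        while x < len(row):
            if row[x] == 1:
                start = x
                while x < len(row) and row[x] == 1:
                    x += 1                 # advance to the end of the black run
                if x - start >= h:         # run length L satisfies L >= h
                    for k in range(start, x):
                        out[y][k] = 1
            else:
                x += 1
    return out


def vertical_subpattern(pattern, h=5):
    """Same filter along columns, implemented by transposing the pattern."""
    transposed = [list(col) for col in zip(*pattern)]
    filtered = horizontal_subpattern(transposed, h)
    return [list(row) for row in zip(*filtered)]
```

Reusing the horizontal filter on the transposed pattern keeps the two directions consistent; a diagonal variant would walk the pattern along its diagonals in the same manner.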

[0032] The feature extraction unit 202 sets, on each extracted directional sub-pattern, a rectangular region corresponding to the circumscribing frame of the character pattern, divides the rectangular region into N × M small regions (N and M being any suitable natural numbers), extracts for each small region a feature quantity representing the length of the character strokes of the sub-pattern contained in it, normalizes the feature quantity by the size of the character circumscribing frame, and creates a feature matrix F consisting of the normalized feature quantities f_i, which it outputs to the matching unit 203. In this embodiment the division numbers are N = M = 8, and the normalization by the size of the character circumscribing frame uses the value (dX + dY)/2, where dX is the horizontal length and dY the vertical length of the character circumscribing frame. When the small regions are numbered sequentially with i (i = 1, 2, …, N × M), f_i denotes the feature quantity of small region i and is an element of the feature matrix F.
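One way to picture the feature matrix of paragraph [0032] is sketched below. Two assumptions are made that the text does not confirm: the sub-pattern is already cropped to the character circumscribing frame, and the number of black pixels in a small region stands in for the "length of the character strokes". The function name `feature_matrix` is hypothetical.

```python
def feature_matrix(subpattern, n=8, m=8):
    """Return the N*M normalized feature quantities f_i of one sub-pattern.

    subpattern: list of rows of 0/1 values, assumed cropped to the
                character circumscribing frame (dY rows, dX columns).
    """
    d_y, d_x = len(subpattern), len(subpattern[0])
    norm = (d_x + d_y) / 2                 # normalization value (dX + dY) / 2
    features = []
    for r in range(n):
        y0, y1 = r * d_y // n, (r + 1) * d_y // n
        for c in range(m):
            x0, x1 = c * d_x // m, (c + 1) * d_x // m
            # Assumption: black-pixel count approximates stroke length.
            count = sum(subpattern[y][x] for y in range(y0, y1) for x in range(x0, x1))
            features.append(count / norm)
    return features
```

Dividing by (dX + dY)/2 makes the features roughly independent of character size, which is presumably why the same dictionary can serve characters of different sizes.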

[0033] The matching unit 203 matches the feature matrix F against feature dictionary matrices G of standard patterns, which are prepared in advance, stored in the dictionary unit 126, and classified into two or more types by character attribute, and obtains the similarity R given by the following equation (3). The character names of dictionary matrices whose similarity R is at least a predetermined value P are taken as candidate character names and are ranked in descending order of similarity as the first-place candidate character, the second-place candidate character, and so on. One or more candidate character names, together with the type of standard-pattern dictionary from which each candidate character name was obtained, are output to the block feature extraction unit 124 as the recognition result.

[0034]

[Equation 1]

[0035] Here, g_i denotes an element of the dictionary matrix.

[0036] The dictionary unit 126 stores the feature matrices of the standard patterns, that is, the dictionary matrices, against which the matching unit 203 in the character recognition unit 123 matches the feature matrix extracted from a character pattern. Two or more dictionary matrices are prepared for each character name according to character attributes such as typeface, upright/italic style, line width, and aspect ratio.

[0037] In this embodiment the dictionary unit 126 consists of two dictionaries: the first dictionary 126a (dictionary number 1), created using a roman typeface such as that shown in FIG. 4(A) as standard patterns, and the second dictionary 126b (dictionary number 2), created using a sans-serif typeface such as that shown in FIG. 4(B) as standard patterns. The matching unit 203 in the character recognition unit 123 outputs the candidate character names resulting from matching, together with a dictionary number indicating whether the dictionary used to obtain each candidate character name was the first or the second dictionary. FIGS. 5(A) to 5(D) show examples of the recognition results obtained when the first line of each of character blocks 31 to 34 in FIG. 3 is recognized using the dictionary unit 126 of this embodiment.

[0038] The block feature extraction unit 124 tallies, for each character block, the candidate character names in the recognition results supplied by the character recognition unit 123 and the types of dictionary from which they were obtained, and extracts a character block attribute feature for determining the attribute of the character block, that is, what attribute of characters the block is composed of. Specifically, it tallies the dictionary numbers of the first-place candidate characters in the recognition results of all characters making up the character block, and outputs the dictionary number that yielded the most first-place candidates to the determination unit 125 as the attribute feature of that character block.
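The tally in paragraph [0038] amounts to a majority vote over dictionary numbers. A minimal sketch follows, assuming the recognition result for a block has already been reduced to a list holding the dictionary number of each character's first-place candidate; the function name `block_attribute` is hypothetical, and the text does not say how ties are resolved.

```python
from collections import Counter


def block_attribute(first_place_dictionary_numbers):
    """Return the dictionary number that yielded the most first-place candidates."""
    counts = Counter(first_place_dictionary_numbers)
    # most_common(1) returns a one-element list of (value, count) pairs.
    return counts.most_common(1)[0][0]


# A block whose characters mostly matched dictionary 1 (roman).
print(block_attribute([1, 1, 2, 1, 1]))  # 1
```

A few characters matching the "wrong" dictionary do not change the block attribute, so the vote tolerates occasional per-character recognition errors.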

[0039] For character blocks 31, 32, 33, and 34 in FIG. 3, as illustrated in FIGS. 5(A) to 5(D), the dictionary that appears most often is the roman dictionary 1 for block 31, the sans-serif dictionary 2 for block 32, the roman dictionary 1 for block 33, and the sans-serif dictionary 2 for block 34, giving the character block attribute feature values F_31 = 1, F_32 = 2, F_33 = 1, and F_34 = 2. Here F_31, F_32, F_33, and F_34 are the character block attribute feature values of character blocks 31, 32, 33, and 34 in FIG. 3.

[0040] Using the character block attribute feature values output by the block feature extraction unit 124, the determination unit 125 determines, from the provisional order relation obtained in the character block extraction unit 121, a new order relation of the character blocks based on their attributes, and outputs the result to the block selection unit 130. The new order relation is determined as follows.

[0041] First, the character block attribute feature values output by the block feature extraction unit 124 are compared with one another across character blocks to determine whether two character blocks have the same attribute feature value, that is, whether they are composed of characters with the same attribute. This determination is made according to whether the character block attribute feature values of the two character blocks satisfy the following equation (4).

[0042] F_i = F_j   (4)

[0043] Here F_i is the character block attribute feature value of character block i, and F_j is that of character block j. In the example form of FIG. 3, the character block attribute feature values are, as described above, F_31 = 1, F_32 = 2, F_33 = 1, and F_34 = 2, so only the two combinations F_31 = F_33 and F_32 = F_34 satisfy equation (4). Character blocks 31 and 33, and character blocks 32 and 34, are therefore each determined to be character blocks with the same attribute, that is, blocks composed of characters in the same typeface.

[0044] Further, based on these character block comparison results, the determination unit 125 examines the provisional order relation by the following procedure, selecting character blocks one after another to obtain the new order relation.

【００４５】先ず、仮の順序関係が第１番目である文
字ブロックを第１の文字ブロックとして選択し、かつ基
準の文字ブロックとする。First, the character block that is first in the provisional order relation is selected as the first character block and is set as the reference character block.

【００４６】次に、前記基準の文字ブロックと同一の
種類の文字で構成されている文字ブロックを、前記文字
ブロック比較結果に基づき、仮の順序関係の順に選択す
る。Next, the character blocks composed of characters of the same type as the reference character block are selected, in provisional order, based on the character block comparison results.

【００４７】前記の選択を、仮に順序関係の最後ま
で終了したら、再び仮の順序関係上位の文字ブロックか
ら順次検定し、未だ前記及びによって未選択の文字
ブロックを探索する。When the selection of same-type character blocks has been carried through to the end of the provisional order relation, the character blocks are examined again in order from the top of the provisional order relation to search for a character block that has not yet been selected by the preceding steps.

【００４８】前記により最初に探索された未選択の
文字ブロックを選択すると共に基準の文字ブロックと
し、前記以降の処理を未選択の文字ブロックを対象に
して再び繰り返す。The unselected character block found first by this search is selected and set as the new reference character block, and the processing from the same-type selection step onward is repeated for the remaining unselected character blocks.

【００４９】前記の探索により未選択の文字ブロッ
クが存在しなかった場合には、文字ブロックの選択を終
了する。If the search finds no unselected character block, the selection of character blocks ends.

【００５０】上述したような手順で得られた新たな順序
関係は、ブロック選択部１３０に出力される。図３に示
すような帳票の文字ブロックは、前述した手順によれ
ば、最終的に得られる文字ブロックの順序関係は３１→
３３→３２→３４となる。The new order relation obtained by the above procedure is output to the block selection unit 130. For the character blocks of a form such as that shown in FIG. 3, the above procedure finally yields the order relation 31 → 33 → 32 → 34.
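
The selection procedure described above can be sketched as follows. This is a minimal sketch rather than the patented implementation: the function name `reorder_blocks` is hypothetical, and the provisional order 31, 32, 33, 34 is an assumption made for illustration (the feature values are those of the FIG. 3 example).

```python
def reorder_blocks(provisional_order, features):
    """Derive a new block order from a provisional order.

    provisional_order: block ids in provisional (position-based) order.
    features: mapping of block id -> character block attribute feature value.
    """
    new_order = []
    selected = set()
    # Scanning from the top of the provisional order finds the first
    # unselected block, which becomes the reference block.
    for reference in provisional_order:
        if reference in selected:
            continue
        new_order.append(reference)
        selected.add(reference)
        # Select, in provisional order, every unselected block whose
        # feature value equals that of the reference block.
        for block in provisional_order:
            if block not in selected and features[block] == features[reference]:
                new_order.append(block)
                selected.add(block)
    # The loop ends when no unselected block remains.
    return new_order

# Assumed provisional order with the FIG. 3 feature values.
print(reorder_blocks([31, 32, 33, 34], {31: 1, 32: 2, 33: 1, 34: 2}))
# [31, 33, 32, 34]
```

Under these assumptions the sketch reproduces the order 31 → 33 → 32 → 34 stated in the text.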

【００５１】ブロック選択部１３０においては、前記文
字ブロック抽出部１２１から入力された文字ブロックデ
ータを、図示せぬ文字ブロックメモリに格納しておき、
情報処理装置１２０内の判定部１２５から出力された順
序関係判定結果に従って、順次文字ブロックデータを画
像記憶部１４０に出力する。The block selection unit 130 stores the character block data input from the character block extraction unit 121 in a character block memory (not shown), and sequentially outputs the character block data to the image storage unit 140 in accordance with the order relation determination result output from the determination unit 125 in the information processing device 120.

【００５２】画像記憶部１４０は、ブロック選択部１３
０から順序づけられて出力される各文字ブロックデータ
を順次文書画像データとして格納する為の記憶装置であ
り、ハードディスク、光磁気ディスク、フレキシブルデ
ィスク等で容易に実現出来る。また、画像記憶部１４０
は、図示しない制御装置により制御されており、指定さ
れた文書画像データを任意に選択し出力端子１５０に出
力できる。The image storage unit 140 is a storage device for sequentially storing, as document image data, the character block data output in order from the block selection unit 130, and can easily be realized by a hard disk, a magneto-optical disk, a flexible disk, or the like. The image storage unit 140 is controlled by a control device (not shown), and can select any designated document image data and output it to the output terminal 150.

【００５３】（第２の実施例）図６は、本発明の情報処
理装置を用いた文字認識装置の構成を示すブロック図で
あり、文字認識装置６００は、画像入力部６１０、情報
処理装置６２０、認識結果記憶部６３０、ブロック選択
部６４０、及び出力端子６５０を具え、また、この情報
処理装置６２０は、文字ブロック抽出部６２１、文字切
り出し部６２２、文字認識部６２３、辞書部６２６、ブ
ロック特徴抽出部６２４、判定部６２５を具える。なお
辞書部６２６は書体の異なる辞書マトリクスを格納した
第１の辞書６２６ａ及び第２の辞書６２６ｂより構成さ
れている。(Second Embodiment) FIG. 6 is a block diagram showing the configuration of a character recognition device using the information processing device of the present invention. The character recognition device 600 includes an image input unit 610, an information processing device 620, a recognition result storage unit 630, a block selection unit 640, and an output terminal 650. The information processing device 620 includes a character block extraction unit 621, a character cutout unit 622, a character recognition unit 623, a dictionary unit 626, a block feature extraction unit 624, and a determination unit 625. The dictionary unit 626 consists of a first dictionary 626a and a second dictionary 626b, which store dictionary matrices for different typefaces.

【００５４】この実施例の画像入力装置６１０、文字ブ
ロック抽出部６２１、文字切り出し部６２２、ブロック
特徴抽出部６２４、判定部６２５、辞書部６２６は其々
第１の実施例における画像入力装置１１０、文字ブロッ
ク抽出部１２１、文字切り出し部１２２、ブロック特徴
抽出部１２４、判定部１２５、辞書部１２６と同様の構
成要素であり、此等の説明は省略する。The image input device 610, character block extraction unit 621, character cutout unit 622, block feature extraction unit 624, determination unit 625, and dictionary unit 626 of this embodiment are the same components as the image input device 110, character block extraction unit 121, character cutout unit 122, block feature extraction unit 124, determination unit 125, and dictionary unit 126 of the first embodiment, respectively, and their description is omitted.

【００５５】文字認識部６２３においては、第１の実施
例と同様の方法により、入力された文字パタンよりサブ
パタンを抽出し、各サブパタンの文字線の長さを表す特
徴を抽出し、正規化された特徴量ｆ_iからなる特徴マト
リクスＦを、予め辞書部６２６に格納され文字の属性に
より２以上の種類に分類された標準文字パタンの特徴辞
書マトリクスＧと照合し、前式（３）で表される類似度
Ｒを求め、類似度が予め定めた閾値Ｐ以上である辞書マ
トリクスの文字名を候補文字名とし、更に、類似度の高
い順に第１位候補文字、第２位候補文字と順位付けを行
い、候補文字名及び当該候補文字名を得る対象となった
標準文字パタンの辞書の種類をブロック特徴抽出部６２
４に出力すると共に候補文字名を認識結果記憶部６３０
に文字ブロック毎に出力する。In the character recognition unit 623, in the same manner as in the first embodiment, sub-patterns are extracted from the input character pattern, and a feature representing the length of the character lines of each sub-pattern is extracted. The feature matrix F, composed of the normalized feature quantities f_i, is collated with the feature dictionary matrices G of standard character patterns, which are stored in advance in the dictionary unit 626 and classified into two or more types according to character attributes, and the similarity R given by expression (3) above is obtained. The character names of the dictionary matrices whose similarity is equal to or greater than a predetermined threshold P are taken as candidate character names, and the candidates are ranked in descending order of similarity as the first candidate character, the second candidate character, and so on. The candidate character names and the type of standard character pattern dictionary from which each candidate character name was obtained are output to the block feature extraction unit 624, and the candidate character names are output to the recognition result storage unit 630 for each character block.
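
The thresholding and ranking of candidates can be illustrated with a short Python sketch. It assumes the similarity R of expression (3) has already been computed for each dictionary entry; the function name `rank_candidates`, the tuple layout, and all numeric values are hypothetical and chosen only for illustration.

```python
def rank_candidates(scored_entries, threshold):
    """Keep entries whose similarity R is at least the threshold P and
    rank them in descending order of similarity.

    scored_entries: list of (character_name, dictionary_id, similarity).
    Returns the candidates as a list, first candidate first.
    """
    kept = [entry for entry in scored_entries if entry[2] >= threshold]
    return sorted(kept, key=lambda entry: entry[2], reverse=True)

# Hypothetical similarities against two dictionaries (ids 1 and 2).
scores = [("A", 1, 0.92), ("A", 2, 0.85), ("R", 1, 0.60), ("H", 2, 0.88)]
candidates = rank_candidates(scores, threshold=0.80)
print(candidates)  # [('A', 1, 0.92), ('H', 2, 0.88), ('A', 2, 0.85)]

# The dictionary ids of the candidates are what a block feature
# extraction stage would consume; the character names are what a
# recognition result store would keep.
print([dictionary_id for _, dictionary_id, _ in candidates])  # [1, 2, 2]
```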

【００５６】認識結果記憶部６３０は、文字認識部６２
３より出力された各文字ブロックの認識結果を各文字ブ
ロック毎に記憶する記憶部である。また、認識結果記憶
部６３０は図示しない制御信号により、後述するブロッ
ク選択部６４０により選択された文字ブロックの文字単
位の認識結果を選択された順に出力できる構成となって
いる。The recognition result storage unit 630 is a storage unit that stores, for each character block, the recognition result of each character block output from the character recognition unit 623. In response to a control signal (not shown), the recognition result storage unit 630 can output the character-by-character recognition results of the character blocks selected by the block selection unit 640, described later, in the order in which they are selected.

【００５７】ブロック選択部６４０においては、判定部
６２５から出力された順序関係判定結果に従って文字ブ
ロック毎の認識結果を認識結果記憶部６３０より順次読
み込み、出力端子６５０に出力する。The block selection unit 640 sequentially reads the recognition result for each character block from the recognition result storage unit 630 in accordance with the order relation determination result output from the determination unit 625, and outputs it to the output terminal 650.
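
As a rough illustration of this read-out step, the sketch below models the recognition result store as a dictionary keyed by block id and emits results in the determined order. The function name `output_in_order` and the recognized strings are hypothetical.

```python
def output_in_order(order, results_by_block):
    """Read the stored recognition result of each block in the
    determined order and return the results as a list."""
    return [results_by_block[block] for block in order]

# Hypothetical recognition results stored per character block.
stored = {31: "AB", 32: "CD", 33: "EF", 34: "GH"}
print(output_in_order([31, 33, 32, 34], stored))  # ['AB', 'EF', 'CD', 'GH']
```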

【００５８】以上、この発明の情報処理装置及び文字認
識装置の実施例について詳細に説明したが、この発明は
上述の実施例のみに限られるものではなく以下に説明す
るような種々の変更が可能である。Although embodiments of the information processing device and the character recognition device of the present invention have been described in detail above, the invention is not limited to these embodiments, and various modifications such as those described below are possible.

【００５９】例えば、文字ブロックの抽出、文字行の切
り出し、文字パタンの切り出し、また文字認識に当たっ
ての特徴マトリクスの作成や類似度計算等は、実施例で
説明した方法に限られるものではなく他の好適な方法を
用い得ることは明かである。For example, the extraction of character blocks, the cutting out of character lines and character patterns, and the creation of feature matrices and the similarity calculation for character recognition are not limited to the methods described in the embodiments; other suitable methods can obviously be used.

【００６０】また、本実施例では文字ブロック特徴を得
るための辞書番号の抽出は、当該文字ブロックの全ての
文字パタンについて行うものとしたが、これに限られる
ものではなく、任意に選択した一部の文字、例えば各文
字ブロック先頭文字行の文字パタンのみから特徴抽出を
行い、該先頭文字行の文字パタンの特徴から当該文字ブ
ロックの文字ブロック特徴を得るようにしても良い。In this embodiment, the dictionary numbers used to obtain the character block feature are extracted for all character patterns in the character block. However, this is not limiting: features may be extracted from only an arbitrarily selected subset of the characters, for example the character patterns in the first character line of each character block, and the character block feature of the block may be obtained from the features of those character patterns.

【００６１】また、本実施例では、情報処理装置中の辞
書部においては、ローマン系の文字を標準パタンとする
第１の辞書と、サンセリフ系の文字を標準パタンとする
第２の辞書とを有する構成としたが、これに限られるも
のではなく、その他の書体、メディアム／ボールド等の
線幅、スタンダード／コンデンス等の縦横比、等の文字
属性の違いにより、２以上の辞書を作成しても良い。In this embodiment, the dictionary unit in the information processing device has a first dictionary whose standard patterns are Roman-style characters and a second dictionary whose standard patterns are sans-serif characters. However, the configuration is not limited to this; two or more dictionaries may be created according to differences in other character attributes, such as other typefaces, line weight (medium/bold, etc.), and aspect ratio (standard/condensed, etc.).

【００６２】また、上述の実施例では本発明の情報処理
装置を文書画像ファイリング装置及び文字認識装置に適
用したが、この情報処理装置は、文字ブロックの順序関
係を決定しこの順序に従い文字ブロックを順次に出力す
る画像処理装置、さらに、文字領域の順序関係を抽出し
て文書のレイアウト構造を抽出する装置等にも適用出来
る。In the embodiments above, the information processing device of the present invention is applied to a document image filing device and a character recognition device. However, it can also be applied to an image processing device that determines the order relation of character blocks and outputs the character blocks sequentially in that order, and to a device that extracts the layout structure of a document by extracting the order relation of character regions.

【００６３】[0063]

【発明の効果】以上詳細に説明したように、この発明の
情報処理装置によれば、文書画像から仮の順序関係に従
って文字ブロックを抽出し、該文字ブロックから文字パ
タンを切り出し、文字パタンの特徴により分類されてい
る辞書と前記切り出された文字パタンを照合し、認識を
行い、１以上の候補文字の辞書名を得、該認識結果の候
補文字の辞書名より文字ブロック属性特徴を抽出し、こ
の文字ブロック属性特徴を用いて、各文字ブロックの新
たな順序関係を判定することが出来る。As described in detail above, the information processing device of this invention extracts character blocks from a document image in accordance with a provisional order relation, cuts out character patterns from each character block, collates the cut-out character patterns with dictionaries classified by character pattern features to perform recognition, and obtains the dictionary names of one or more candidate characters. It then extracts a character block attribute feature from the dictionary names of the candidate characters in the recognition result, and can use this attribute feature to determine a new order relation among the character blocks.

【００６４】従って、各文字ブロックのレイアウトに規
則性の無い文書から文字ブロックを抽出し出力する場合
でも、単に各文字ブロックの文書画像中の位置座標に基
づいて各文字ブロックを出力していた従来装置に比べ、
各ブロックを正しい順序関係で出力出来る。Therefore, even when character blocks are extracted and output from a document whose character block layout has no regularity, the blocks can be output in the correct order relation, in contrast to conventional devices that output each character block simply on the basis of its position coordinates in the document image.

【００６５】従って、例えば文書画像ファイリング装置
等のような各種の文書画像処理装置に適用した場合に
は、正しい順序関係で文字ブロックのデータの利用が可
能となるので、高性能な各種の文書画像処理装置が可能
となる。Therefore, when the invention is applied to various document image processing devices, such as a document image filing device, the character block data can be used in the correct order relation, which makes high-performance document image processing devices possible.

【００６６】また、この発明の情報処理装置を文字認識
装置に適用した場合には、上述のように正しい順序で出
力される文字ブロックについて文字認識を行うことが出
来るため例えばレイアウトに規則性の無い文書の文字認
識を文書の流れに即して行うことが出来、実用上有効な
文字認識装置が実現できる。また、正しい順序で、認識
結果文字列が得られるので、文字単位の認識の後に行う
後処理が正しく行え、従って認識精度が向上し、またオ
ペレータの作業が減少し、ひいては高速に入力が出来
る。従って、レイアウトに規則性の無い文書においても
正確に高速に入力が可能となるような、高性能な文字認
識装置が実現できる。Also, when the information processing device of this invention is applied to a character recognition device, character recognition can be performed on character blocks output in the correct order as described above, so that, for example, the characters of a document whose layout has no regularity can be recognized in accordance with the flow of the document, and a practically effective character recognition device can be realized. In addition, since the recognized character strings are obtained in the correct order, the post-processing performed after character-by-character recognition can be carried out correctly. Recognition accuracy therefore improves, operator work decreases, and input becomes faster. Consequently, a high-performance character recognition device can be realized that allows accurate, high-speed input even for documents whose layout has no regularity.

[Brief description of the drawings]

【図１】本発明の情報処理装置を用いた文書画像ファイ
リング装置の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a document image filing apparatus using an information processing apparatus according to the present invention.

【図２】文字認識部の構成を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration of a character recognition unit.

【図３】文書画像の一例を示す図である。FIG. 3 is a diagram illustrating an example of a document image.

【図４】辞書部の説明に供する図である。FIG. 4 is a diagram provided for explanation of a dictionary unit.

【図５】ブロック特徴抽出部の説明に供する図である。FIG. 5 is a diagram provided for explanation of a block feature extraction unit.

【図６】本発明の情報処理装置を用いた文字認識装置の
構成を示すブロック図である。FIG. 6 is a block diagram illustrating a configuration of a character recognition device using the information processing device of the present invention.

[Explanation of symbols]

１００文書画像ファイリング装置１１０画像入力部１２０情報処理装置１３０ブロック選択部１４０画像記憶部１５０出力端子６００文字認識装置６１０画像入力部６２０情報処理装置６３０認識結果記憶部６４０ブロック選択部６５０出力端子
Reference Signs List
100 document image filing device
110 image input unit
120 information processing device
130 block selection unit
140 image storage unit
150 output terminal
600 character recognition device
610 image input unit
620 information processing device
630 recognition result storage unit
640 block selection unit
650 output terminal

Claims

(57) [Claims]

1. An information processing apparatus comprising: a character block extraction unit that extracts character blocks from a document image and determines a provisional order relation between the character blocks based on the positional relationship between them; a character cutout unit that cuts out character patterns from the extracted character blocks; a dictionary unit having a plurality of types of dictionaries classified according to character attributes; a character recognition unit that collates the character patterns cut out by the character cutout unit with the dictionaries and outputs, for each character block, the dictionary name of each candidate character together with the candidate character name; a block feature extraction unit that determines, based on the dictionary names of the candidate characters, a character block attribute feature indicating what attribute of characters the character block is composed of; and a determination unit that determines a new order relation from the provisional order relation between the character blocks, based on the attributes of the character blocks.

2. A character recognition apparatus comprising: an image input unit for inputting a document image; the information processing apparatus according to claim 1; a recognition result storage unit that stores, for each character block, the character recognition results of the character recognition unit in the information processing apparatus; and a block selection unit that outputs recognition results from the recognition result storage unit based on the character block order relation determination result of the determination unit in the information processing apparatus.