JPH05174114A

JPH05174114A - Information processor and character recognizing device using the same

Info

Publication number: JPH05174114A
Application number: JP3339154A
Authority: JP
Inventors: Hiroshi Yoshida; 浩▲史▼ 吉田; Yoshiyuki Yamashita; 義征山下
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-12-21
Filing date: 1991-12-21
Publication date: 1993-07-13
Anticipated expiration: 2015-09-04
Also published as: JP3083609B2

Abstract

PURPOSE:To accurately segment a character pattern along with context. CONSTITUTION:In a document listing plural character blocks having different context in the same page, generally, the character blocks forming the same context are made to have similar features of the character pattern such as character line width or character height and the character blocks having different context are made to have the features of the different character pattern. A pattern characteristic extraction part 16 obtains the average of all or part of the features of the character pattern within the character blocks at every character block, and an order deciding part 18 divides each character block into groups having the similar average feature of the character pattern. The order deciding part 18 obtains an evaluation value E=X+F.Y (F: a prescribed constant) from the position of each character block (X, Y) and systematizes the character blocks at every group in the ascending order of the evaluation value E. According to the sequencing, the character pattern is segmented from each character block to be recognized.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、文字媒体から抽出し
た文字ブロックの順序関係を判定する情報処理装置及び
それを用いた文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information processing apparatus for determining the order relation of character blocks extracted from a character medium and a character recognition apparatus using the same.

【０００２】[0002]

【従来の技術】書式未知の文書が持つ情報を文脈に沿っ
て正確に抽出することができれば、種々の機能を有する
情報処理装置、例えば書式未知の文書の文字認識を行う
文字認識装置、この認識文字を音声で或は翻訳して出力
する装置を構築でき、情報処理装置の用途拡大が容易に
なる。2. Description of the Related Art An information processing apparatus having various functions, such as a character recognition apparatus for recognizing a character of a document of unknown format, if the information of the document of unknown format can be accurately extracted in accordance with the context. It is possible to construct a device that outputs characters by voice or by translating, and it is easy to expand the application of the information processing device.

【０００３】書式未知文書の文字認識装置として、例え
ば文献１：電子情報通信学会技術報告ＰＲＵ８６−３３
に開示されているものがある。この従来装置では、書式
未知の文書画像から周辺分布特徴、線密度特徴及び外接
矩形特徴を抽出し、これら特徴を用いて文書画像から見
出しの文字ブロックや本文の文字ブロック等の文書構成
要素を抽出し、さらにこれら各文字ブロックから文字パ
タンを抽出し、さらに各文字パタンの特徴抽出を行って
文字認識を行う。As a character recognition device for an unknown format document, for example, Document 1: IEICE Technical Report PRU86-33.
Are disclosed in. This conventional apparatus extracts marginal distribution features, line density features, and circumscribed rectangle features from a document image of unknown format, and uses these features to extract document constituent elements such as a character block of a headline and a character block of a body from the document image. Then, a character pattern is extracted from each of these character blocks, and the feature of each character pattern is extracted to perform character recognition.

【０００４】そして本文の文字ブロックの文字認識を行
う場合、例えば文献２：電子情報通信学会論文誌ＶＯ
Ｌ．Ｊ６９−Ｄ，Ｎｏ．３，ｐ．４００〜４０９に開示
されているように、一般的に、次式（１）で表される位
置評価値Ｈを用いて本文の文字ブロックに関し文字認識
を行う順序を決定する。すなわち位置評価値Ｈの小さい
順に本文の文字ブロックの文字認識を行う。In the case of performing character recognition of a character block of the body, for example, Reference 2: IEICE Transactions on VO
L. J69-D, No. 3, p. As disclosed in 400 to 409, generally, the position evaluation value H represented by the following equation (1) is used to determine the order in which character recognition is performed on the character block of the text. That is, character recognition of the character blocks of the body is performed in the ascending order of the position evaluation value H.

【０００５】Ｈ＝Ｘｃ＋ε・Ｙｃ …（１）ここで、Ｘｃは文字ブロックの左上隅点のＸ座標、Ｙｃ
は文字ブロックの左上隅点のＹ座標、及びεは例えば
０．１程度の値の定数である。H = Xc + ε · Yc (1) where Xc is the X coordinate of the upper left corner point of the character block, and Yc
Is the Y coordinate of the upper left corner point of the character block, and ε is a constant having a value of about 0.1, for example.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら本文の文
字ブロックのなかには、例えば図２にも示すように太い
文字線の文字ブロック３４及び３６と、細い文字線の文
字ブロック３８及び４０とでそれぞれ異なる文脈を構成
するようにしたものもある。従って上述したように位置
評価値ｈの小さい順に文字認識を行うようにすると、図
２の例では文字ブロック３４、３８、３６及び４０の順
に文字認識を行うこととなり、その結果文脈に沿って正
確に認識結果を得ることはできなくなる。However, among the character blocks of the text, for example, as shown in FIG. 2, the character blocks 34 and 36 having thick character lines and the character blocks 38 and 40 having thin character lines have different contexts, respectively. There are some that are configured. Therefore, if the character recognition is performed in the ascending order of the position evaluation value h as described above, the character recognition is performed in the order of the character blocks 34, 38, 36 and 40 in the example of FIG. It will not be possible to obtain recognition results.

【０００７】また文字認識においては、認識精度を高め
るため、一文字単位に認識処理を行った後に何らかの後
処理、例えば言語情報を用いた言語処理（知識処理）を
行う場合がある。この後処理の処理対象は、通常、文の
構造であるので、文脈に沿って認識結果が得られないと
後処理が意味を成さなくなり、その結果、認識精度が低
下し誤認識が増える。そこでオペレータは、これを防止
するため或は誤認識を修正するために、本文の文字ブロ
ックに関し文字認識を行う順序を指定したり或は誤認識
結果を修正したりする必要が生じる。これら順序の指定
作業や誤認識の修正作業は煩雑であり作業性が非常に悪
い。In the character recognition, in order to improve the recognition accuracy, after the recognition processing is performed character by character, some post-processing, for example, language processing (knowledge processing) using language information may be performed. Since the processing target of this post-processing is usually a sentence structure, the post-processing does not make sense unless a recognition result is obtained in accordance with the context, and as a result, the recognition accuracy decreases and erroneous recognition increases. Therefore, in order to prevent this or correct the erroneous recognition, the operator needs to specify the order in which the character recognition is performed on the character block of the body or correct the erroneous recognition result. The task of specifying the order and the task of correcting the incorrect recognition are complicated and the workability is very poor.

【０００８】この発明の目的は上述した従来の問題点を
解決するため、文字ブロックを同一種類の情報を担う文
字ブロック毎により精度良く順序付けることができる情
報処理装置及びそれを用いた文字認識装置を提供するこ
とにある。In order to solve the above-mentioned conventional problems, an object of the present invention is to provide an information processing apparatus and a character recognizing apparatus using the same, in which character blocks can be accurately ordered by character blocks carrying the same type of information. To provide.

【０００９】[0009]

【課題を解決するための手段】この目的の達成を図るた
め第一発明の情報処理装置は、文字媒体の画像データか
ら文字ブロックの位置情報を抽出する文字ブロック抽出
部と、文字ブロックの順序関係を判定する順序判定部
と、文字ブロックから文字パタンの切出し情報を抽出す
る切出し部と、文字パタンの切出し情報を利用して各文
字ブロック毎に文字パタンの特徴を抽出するパタン特徴
抽出部とを備え、順序判定部は、文字ブロックの位置情
報から位置評価値を求めると共に文字ブロックを文字パ
タンの特徴が類似するもの同志にグループ分けし、文字
ブロックの順序を各グループ毎に位置評価値の小さい順
或は大きい順に決定することを特徴とする。In order to achieve this object, an information processing apparatus according to the first aspect of the present invention comprises a character block extracting section for extracting position information of a character block from image data of a character medium, and an order relation of the character blocks. An order determination unit that determines the character pattern, a cutout unit that extracts the cutout information of the character pattern from the character block, and a pattern feature extraction unit that uses the cutout information of the character pattern to extract the characteristics of the character pattern for each character block. The order determination unit obtains the position evaluation value from the position information of the character block and divides the character blocks into groups having similar character pattern characteristics, and arranges the character blocks in order of small position evaluation value for each group. It is characterized in that it is determined in order or in descending order.

【００１０】また第二発明の文字認識装置は、文字媒体
の画像データを生成する画像生成部と、画像データが含
む文字ブロックを文字パタンの特徴が類似するもの同志
にグループ分けし文字ブロックの順序を各グループ毎に
決定する第一発明の情報処理装置と、各グループ毎に、
決定された順序に従って順次に文字ブロックを選択する
ブロック選択部と、文字ブロックの選択順次に文字ブロ
ックから文字パタンを切り出す切出し部と、文字パタン
を認識する認識部とを備えて成ることを特徴とする。Further, the character recognition device of the second aspect of the present invention groups an image generation unit for generating image data of a character medium and a character block included in the image data into groups having similar character pattern characteristics, and orders the character blocks. An information processing device of the first invention for determining for each group, and for each group,
It is characterized by comprising a block selection unit for sequentially selecting character blocks according to the determined order, a cutout unit for sequentially selecting character patterns from the character blocks, and a recognition unit for recognizing the character patterns. To do.

【００１１】[0011]

【作用】第一発明の情報処理装置によれば、文字ブロッ
クの位置情報から位置評価値を求める。これと共に文字
ブロックを文字パタンの特徴が類似するもの同志にグル
ープ分けする。そして文字ブロックの順序を各グループ
毎に位置評価値の小さい順或は大きい順に決定する。According to the information processing apparatus of the first invention, the position evaluation value is obtained from the position information of the character block. Along with this, the character blocks are grouped into comrades having similar character patterns. Then, the order of the character blocks is determined for each group in ascending or descending order of the position evaluation value.

【００１２】従って文字ブロックが含む文字パタンの特
徴を文字ブロック単位で異ならせ、文字ブロックが担う
情報の種類を文字パタンの特徴と対応付けている文書、
帳票等の文字媒体において、文字ブロックの順序を同一
種類の情報毎に精度良く決定できる。Therefore, a document in which the characteristic of the character pattern included in the character block is made different for each character block and the type of information carried by the character block is associated with the characteristic of the character pattern,
In a character medium such as a form, the order of character blocks can be accurately determined for each information of the same type.

【００１３】例えば文字媒体を文書とし異なる文脈の文
字ブロックを文書の同一紙面に掲載してある場合を考え
る。この場合、文脈がつながる文字ブロック同志におい
ては一般に、これら各文字ブロックの文字パタンは例え
ば文字線の太さが等しいといった共通の特徴を備える。
また文脈がつながらない文字ブロック同志に関しては一
般に、これら各文字ブロックの文字パタンは互いに例え
ば文字線の太さが異なるといった異なる特徴を備える。
従ってこのような文書の一般的性質に着目すれば、文字
パタンの特徴が互いに類似する文字ブロック同志は文脈
がつながり、また文字パタンの特徴が類似しない文字ブ
ロック同志は文脈がつながらないと判断できる。従って
異なる文脈の文字ブロックを文書の同一紙面に掲載して
ある場合においては、文字ブロックを文字パタンの特徴
が類似するもの同志にグループ分けし文字ブロックの順
序を各グループ毎に位置評価値の小さい順或は大きい順
に決定することによって、各文字ブロックをそれぞれの
文脈に沿ってより精度良く順序付けることができる。For example, consider a case where a character medium is a document and character blocks in different contexts are posted on the same page of the document. In this case, generally, in character blocks that are connected to each other in context, the character patterns of these character blocks have common features such as the same thickness of character lines.
Further, regarding character blocks that do not connect with each other in context, the character patterns of these character blocks generally have different characteristics such as different thicknesses of character lines.
Therefore, by paying attention to the general property of such a document, it can be determined that character block comrades having similar character pattern characteristics are connected with each other in context, and character block comrades having dissimilar character pattern characteristics are not connected with context. Therefore, when character blocks with different contexts are posted on the same page of a document, the character blocks are grouped into groups of similar character patterns, and the order of the character blocks is small for each group. By determining the order of the characters or the order of increasing, the character blocks can be ordered more accurately according to the respective contexts.

【００１４】また第二発明の文字認識装置によれば、上
述の第一発明の作用で説明したように、文字ブロックを
文字パタンの特徴が類似するもの同志にグループ分けし
文字ブロックの順序を各グループ毎に決定する。そして
各グループ毎に、決定された順序に従って順次に文字ブ
ロックを選択し文字ブロックの選択順次に文字ブロック
から文字パタンを切り出す。その結果、複数の異なる文
脈を文書の同一紙面に掲載してある場合でも、各文脈毎
に文脈に沿ってより精度良く文字パタンを切り出すこと
ができる。Further, according to the character recognition device of the second invention, as described in the operation of the first invention, the character blocks are grouped into groups having similar character pattern characteristics, and the order of the character blocks is divided into groups. Determined for each group. Then, for each group, the character blocks are sequentially selected according to the determined order, and the character patterns are sequentially cut out from the character blocks. As a result, even when a plurality of different contexts are posted on the same page of the document, the character pattern can be more accurately cut out according to each context.

【００１５】[0015]

【実施例】以下、図面を参照し、これら発明の実施例に
つき説明する。尚、図面はこれら発明が理解できる程度
に概略的に示してあるにすぎず、従ってこれら発明を図
示例に限定するものではない。以下の説明では、第二発
明の文字認識装置の実施例の説明と共に第一発明の情報
処理装置の実施例を説明する。Embodiments of the present invention will be described below with reference to the drawings. It should be noted that the drawings are merely schematic illustrations to the extent that the invention can be understood, and therefore the invention is not limited to the illustrated examples. In the following description, an embodiment of the character recognition device of the second invention and an embodiment of the information processing device of the first invention will be described.

【００１６】図１は第一及び第二発明の第一実施例の全
体構成を概略的に示す機能ブロック図である。FIG. 1 is a functional block diagram schematically showing the overall construction of the first embodiment of the first and second inventions.

【００１７】同図において１０は第一発明の第一実施例
としての情報処理装置を示し、この情報処理装置１０
は、文字媒体の画像データから文字ブロックの位置情報
を抽出する文字ブロック抽出部１２と文字ブロックの順
序関係を判定する順序判定部１８と文字ブロックから文
字パタンの切出し情報を抽出する切出し部１４と文字パ
タンの切出し情報を利用して各文字ブロック毎に文字パ
タンの特徴を抽出するパタン特徴抽出部１６とを備え、
順序判定部１８は、文字ブロックの位置情報から位置評
価値を求めると共に文字ブロックを文字パタンの特徴が
類似するもの同志にグループ分けし、文字ブロックの順
序を各グループ毎に位置評価値の小さい順或は大きい順
に決定する。In the figure, reference numeral 10 denotes an information processing apparatus as a first embodiment of the first invention.
Is a character block extraction unit 12 that extracts the position information of the character blocks from the image data of the character medium, an order determination unit 18 that determines the order relationship of the character blocks, and a cutout unit 14 that extracts the cutout information of the character patterns from the character blocks. A pattern feature extraction unit 16 for extracting the feature of the character pattern for each character block using the cut-out information of the character pattern,
The order determination unit 18 obtains the position evaluation value from the position information of the character blocks, divides the character blocks into groups having similar character pattern characteristics, and arranges the character blocks in order from the smallest position evaluation value for each group. Or decide in descending order.

【００１８】また２０は第二発明の第一実施例としての
文字認識装置を示し、この文字認識装置２０は文字媒体
の画像データを生成する画像生成部２２と、画像データ
が含む文字ブロックを文字パタンの特徴が類似するもの
同志にグループ分けし文字ブロックの順序を各グループ
毎に決定する情報処理装置１０と、各グループ毎に、決
定された順序に従って順次に文字ブロックを選択するブ
ロック選択部２４と、文字ブロックの選択順次に文字ブ
ロックから文字パタンを切り出す切出し部２６と、文字
パタンを認識する認識部２８とを備えて成る。尚、３０
は文字認識装置２０の出力端子を示す。Reference numeral 20 denotes a character recognition device as a first embodiment of the second invention. The character recognition device 20 is an image generation unit 22 for generating image data of a character medium, and a character block included in the image data. Information processing apparatus 10 that determines the order of the character blocks by grouping them into groups having similar pattern characteristics, and block selecting unit 24 that sequentially selects the character blocks according to the determined order for each group. And a selection unit 26 for sequentially extracting character patterns from the character block and a recognition unit 28 for recognizing the character pattern. Incidentally, 30
Indicates an output terminal of the character recognition device 20.

【００１９】次に図２に示す文書の文字認識を例に取っ
てこの実施例の動作につき説明する。図２は文書の一例
を示す図である。同図において３２は文字媒体としての
文書を示し、文書３２は文字高さが高い文字から成りひ
とつの文脈を形成する文字ブロック３４及び３６と、文
字高さが低い文字から成り別のひとつの文脈を形成する
文字ブロック３８及び４０とを有する。ここに言う文字
は記号及び図形を含む。図中、文字ブロック３４〜４０
をそれぞれ一点鎖線で囲んで示した。Next, the operation of this embodiment will be described by taking the character recognition of the document shown in FIG. 2 as an example. FIG. 2 is a diagram showing an example of a document. In the figure, reference numeral 32 denotes a document as a character medium. The document 32 has character blocks 34 and 36 which are composed of characters having a high character height and which form one context, and another context which is composed of a character having a low character height. And character blocks 38 and 40 forming The characters mentioned here include symbols and figures. In the figure, character blocks 34-40
Are surrounded by a dashed line.

【００２０】画像生成部２２はイメージセンサを備え、
主走査方向を文字行方向（以下、水平方向と称す）Ｘと
し及び副走査方向を文字行方向と直交する方向（以下、
垂直方向と称す）Ｙとして文書３２を光学的に走査す
る。文書３２の文字行方向は従来周知の方法により、予
め検出されているものとする。そして画像生成部２２は
文書３２からの光信号Ｓを白黒２値のディジタル信号
（画像データ）に変換し、この画像データを図示しない
画像メモリに格納する。画像データの黒ビットは例えば
文字線及び白ビットは文字背景部分を表す。The image generator 22 has an image sensor,
The main scanning direction is a character line direction (hereinafter referred to as a horizontal direction) X, and the sub-scanning direction is a direction orthogonal to the character line direction (hereinafter,
The document 32 is optically scanned as Y (referred to as the vertical direction). It is assumed that the character line direction of the document 32 has been detected in advance by a conventionally known method. Then, the image generator 22 converts the optical signal S from the document 32 into a monochrome binary digital signal (image data), and stores the image data in an image memory (not shown). The black bit and the white bit of the image data represent, for example, a character line and a character background portion.

【００２１】ここでは、文書３２上に主走査方向をＸ軸
方向及び副走査方向をＹ軸方向としたＸ−Ｙ座標系を設
定し、文書３２の走査位置をこの座標系の座標（Ｘ、
Ｙ）で表すものとする。また画像メモリ上には文書３２
上のＸ−Ｙ座標系に相対応するＸ−Ｙ座標系を仮想的に
設定し、画像メモリの各格納場所の位置をメモリ上の座
標系の座標（Ｘ、Ｙ）で表す。そして文書３２上の走査
位置（Ｘ、Ｙ）の画素の画像データを、当該走査位置
（Ｘ、Ｙ）に対応する画像メモリ上の座標（Ｘ、Ｙ）の
格納場所に格納する。Here, an XY coordinate system in which the main scanning direction is the X-axis direction and the sub-scanning direction is the Y-axis direction is set on the document 32, and the scanning position of the document 32 is the coordinate (X,
Y). Also, the document 32 is stored in the image memory.
An XY coordinate system corresponding to the above XY coordinate system is virtually set, and the position of each storage location of the image memory is represented by coordinates (X, Y) of the coordinate system on the memory. Then, the image data of the pixel at the scanning position (X, Y) on the document 32 is stored in the storage location of the coordinate (X, Y) on the image memory corresponding to the scanning position (X, Y).

【００２２】文字ブロック抽出部１２は文書３２の画像
データを走査し、文書３２が含む文字ブロック３４〜４
０の画像データを抽出すると共に文字ブロック３４〜４
０の位置を検出する。文字ブロック３４〜４０はそれぞ
れ一又は複数の文字列を含む領域であり、各文字ブロッ
ク３４〜４０は空白或は罫線そのほかの分割要素により
それぞれ互いに区別できるように画定されている。例え
ば図２の例では、文字ブロック３４〜４０はそれぞれ、
複数の文字列がほぼ規則正しく密に配列して一塊と成っ
ている領域である。The character block extraction unit 12 scans the image data of the document 32, and the character blocks 34 to 4 included in the document 32 are scanned.
The image data of 0 is extracted and the character blocks 34 to 4 are also extracted.
The position of 0 is detected. The character blocks 34 to 40 are areas containing one or a plurality of character strings, and the character blocks 34 to 40 are defined by blanks, ruled lines or other dividing elements so that they can be distinguished from each other. For example, in the example of FIG. 2, the character blocks 34 to 40 are
This is an area where multiple character strings are arranged in a regular and dense manner to form a lump.

【００２３】この実施例では各文字ブロック３４〜４０
を空白で区別するようにしている場合に文字ブロックの
画像データを抽出し及び位置を検出する例につき説明す
る。尚、文字ブロック３４〜４０の位置検出及び画像デ
ータ抽出に当たっては、従来周知の種々の方法を用いる
ことができる。In this embodiment, each character block 34-40 is
An example in which the image data of the character block is extracted and the position is detected in the case where the characters are distinguished by a blank will be described. Note that various conventionally known methods can be used for detecting the positions of the character blocks 34 to 40 and extracting the image data.

【００２４】まず文字ブロック抽出部１２は、走査範囲
を文書３２全面、主走査方向を垂直方向Ｙ及び副走査方
向を水平方向Ｘとして、文書３２の画像データを走査し
走査範囲内の垂直な走査線上の黒ビット累積個数を各副
走査位置Ｘ毎に求め、求めた黒ビット累積個数を副走査
位置Ｘの小さい順に参照してゆく。ここで黒ビット累積
個数が所定個数例えば１個未満となる走査線を白線及び
黒ビット累積個数が所定個数例えば１個以上となる走査
線を黒線と表す。そして黒ビット累積個数の参照過程で
白線より黒線に変化した時の当該黒線を黒線Ａまた黒線
より白線に変化した時の当該黒線を黒線Ｂと表せば、第
ｈ番目に検出した垂直な黒線Ａ及びＢが挟む領域を第ｈ
番目の水平ブロック候補領域として検出する。この第ｈ
番目の水平ブロック候補領域の始端及び終端位置はこれ
ら第ｈ番目の垂直な黒線Ａ及びＢの位置Ｘである。図２
の例であれば、文書３２全面のうちの位置Ｘ１及びＸ２
の垂直な走査線が挟む領域が第１番目の水平ブロック候
補領域、また位置Ｘ３及びＸ４の垂直な走査線が挟む領
域が第２番目の水平ブロック候補領域である。First, the character block extraction unit 12 scans the image data of the document 32 with the scanning range as the entire surface of the document 32, the main scanning direction as the vertical direction Y and the sub-scanning direction as the horizontal direction X, and the vertical scanning within the scanning range. The cumulative number of black bits on the line is obtained for each sub-scanning position X, and the obtained cumulative number of black bits is referred to in ascending order of the sub-scanning position X. Here, a scanning line in which the cumulative number of black bits is less than a predetermined number, for example, 1 is represented by a white line, and a scanning line in which the cumulative number of black bits is more than a predetermined number, for example, 1 is denoted by a black line. If the black line when changing from a white line to a black line in the process of referring to the cumulative number of black bits is represented as a black line A and the black line when changing from a black line to a white line is referred to as a black line B, The area between the detected vertical black lines A and B is the h-th
The second horizontal block candidate area is detected. This h
The start and end positions of the th horizontal block candidate area are the positions X of these h-th vertical black lines A and B. Figure 2
For example, the positions X1 and X2 on the entire surface of the document 32
The area sandwiched by the vertical scanning lines is the first horizontal block candidate area, and the area sandwiched by the vertical scanning lines at the positions X3 and X4 is the second horizontal block candidate area.

【００２５】次に文字ブロック抽出部１２は、水平ブロ
ック候補領域をひとつずつ着目ブロックとし、走査範囲
を着目ブロック、主走査方向を水平方向Ｘ及び副走査方
向を垂直方向Ｙとして、文書３２の画像データを走査し
走査範囲内の水平な走査線上の黒ビット累積個数を各副
走査位置Ｙ毎に求め、求めた黒ビット累積個数を副走査
位置Ｙの小さい順に参照してゆく。この参照過程で、第
１番目に検出した水平な黒線Ａの位置Ｙを第１番目の垂
直ブロック候補領域の始端位置として検出する。そして
第１番目の水平な黒線Ａの検出したら第ｉ−１番目（ｉ
＝２、３、４、……）に検出される水平な黒線Ｂと第ｉ
番目に検出される水平な黒線Ａとの離間間隔を求める。
この離間間隔は隣接する文字行間の空白の幅を表す。ｉ
の小さい順に、順次に、求めた離間間隔を閾値ＴＨと比
較してゆき、閾値ＴＨを越える離間間隔を有する第ｉ−
１番目の黒線Ｂ及び第ｉ番目の黒線Ａを検出したら、こ
のときの第ｉ−１番目の黒線Ｂの位置Ｙを当該着目ブロ
ックに関連する第１番目の垂直ブロック候補領域の終端
位置として検出し、またこのときの第ｉ番目の黒線Ａの
位置Ｙを当該着目ブロックに関連する第２番目の垂直ブ
ロック候補領域の始端位置として検出する。閾値ＴＨ
は、文字ブロックを分割する分割要素としての空白を検
出するためのパラメータであり、文字の大きさ、フォン
ト及びそのほかを考慮して決定され例えばＴＨ＝１４０
である。以下同様にして、当該着目ブロックに関連す
る、第２番目の垂直ブロック候補領域の終端位置、第３
番目の垂直ブロック候補領域の始端及び終端位置、第４
番目のブロック候補領域の始端位置、……を順次に検出
してゆく。そして文字ブロック抽出部１２はひとつの着
目ブロックにつき垂直ブロック候補領域の検出を終えた
ら、次の他の着目ブロックに関連する垂直ブロック候補
領域を検出する。Next, the character block extraction unit 12 sets each horizontal block candidate area as a target block, the scanning range as the target block, the main scanning direction as the horizontal direction X and the sub-scanning direction as the vertical direction Y, and the image of the document 32. The data is scanned and the cumulative number of black bits on a horizontal scanning line within the scanning range is obtained for each sub-scanning position Y, and the obtained cumulative number of black bits is referred to in ascending order of the sub-scanning position Y. In this reference process, the position Y of the first horizontal black line A detected is detected as the starting end position of the first vertical block candidate area. When the first horizontal black line A is detected, the (i-1) th (i
= 2, 3, 4, ...) and the horizontal black line B and the i-th
The distance between the second black line A and the horizontal line A detected is calculated.
This spacing represents the width of the space between adjacent character lines. i
Of the i-th which has a separation interval exceeding the threshold TH, by sequentially comparing the calculated separation interval with the threshold value TH in ascending order of
When the first black line B and the i-th black line A are detected, the position Y of the i−1-th black line B at this time is determined as the end of the first vertical block candidate area associated with the block of interest. The position Y of the i-th black line A at this time is detected as the start end position of the second vertical block candidate area associated with the target block. Threshold TH
Is a parameter for detecting a blank as a dividing element that divides a character block, and is determined in consideration of the character size, font, and others, for example, TH = 140.
Is. In the same manner, the end position of the second vertical block candidate area related to the target block, the third position
The start and end positions of the th vertical block candidate area, the fourth
The start position of the th block candidate area, ... is sequentially detected. Then, when the character block extraction unit 12 finishes detecting the vertical block candidate area for one target block, it detects a vertical block candidate area related to the next other target block.

【００２６】図２の例では、第１番目の水平ブロック候
補領域（位置Ｘ１及びＸ２を通る垂直な走査線が挟む領
域）に関連する垂直ブロック候補領域として、位置Ｙ１
及びＹ２を通る水平な走査線が挟む領域と、位置Ｙ３及
びＹ４を通る水平な走査線が挟む領域とが検出される。
また第２番目の水平ブロック候補領域（位置Ｘ３及びＸ
４を通る垂直な走査線が挟む領域）に関連する垂直ブロ
ック候補領域として、位置Ｙ５及びＹ６を通る水平な走
査線が挟む領域と位置Ｙ７及びＹ８を通る垂直な走査線
が挟む領域とが検出される。In the example of FIG. 2, the position Y1 is set as the vertical block candidate area related to the first horizontal block candidate area (the area sandwiched by the vertical scanning lines passing through the positions X1 and X2).
And a region sandwiched by horizontal scanning lines passing through Y2 and a region sandwiched by horizontal scanning lines passing through positions Y3 and Y4 are detected.
The second horizontal block candidate area (positions X3 and X
The area sandwiched by the horizontal scanning lines passing through the positions Y5 and Y6 and the area sandwiched by the vertical scanning lines passing through the positions Y7 and Y8 are detected as vertical block candidate areas related to the area sandwiched by the vertical scanning lines passing through 4). To be done.

【００２７】文字ブロックは関連する水平及び垂直ブロ
ック候補領域が重なり合う領域であり、これら関連する
ブロック候補領域のうち、水平ブロック候補領域の始端
及び終端位置が文字ブロックの垂直方向における始端及
び終端位置を表しまた垂直ブロック候補領域の始端及び
終端位置が文字ブロックの水平方向における始端及び終
端位置を表す。図２の例において例えばＸ１≧Ｘ≧Ｘ２
かつＹ１≧Ｙ≧Ｙ２を満足する領域が、文字ブロックの
ひとつすなわち文字ブロック３４となる。A character block is an area in which related horizontal and vertical block candidate areas overlap each other. Of these related block candidate areas, the start and end positions of the horizontal block candidate areas are the start and end positions of the character block in the vertical direction. In addition, the start and end positions of the vertical block candidate area represent the start and end positions of the character block in the horizontal direction. In the example of FIG. 2, for example, X1 ≧ X ≧ X2
An area that satisfies Y1 ≧ Y ≧ Y2 is one of the character blocks, that is, the character block 34.

【００２８】文字ブロック抽出部１２は文字ブロックの
抽出順次に、文字ブロック内の画像データ（ブロックデ
ータ）を切出し部１４及びブロック選択部２４に出力し
また文字ブロックの位置を順序判定部１８に出力する。The character block extraction unit 12 sequentially outputs the image data (block data) in the character block to the cutout unit 14 and the block selection unit 24 and the position of the character block to the order determination unit 18 in order of the extraction of the character block. To do.

【００２９】切出し部１４はブロックデータを各文字ブ
ロック毎に格納する。そして文字ブロックをひとつずつ
順次に着目ブロックとし、着目ブロックのブロックデー
タを走査して着目ブロックが含む全部又は一部の文字行
を切出し、さらに文字行内のブロックデータを走査して
着目ブロックの文字行内の文字パタンを切出す。切出し
部１４は文字行及び文字パタンの切出し過程で得られる
情報やデータ或はこれらを切出した結果得られる情報や
データを、切出し情報としてパタン抽出部１６へ出力す
る。The cutout unit 14 stores the block data for each character block. Then, character blocks are set as target blocks one by one, and the block data of the target block is scanned to cut out all or some of the character lines included in the target block, and further the block data in the character line is scanned to scan the character line of the target block. Cut out the character pattern. The cutout unit 14 outputs information or data obtained in the process of cutting out character lines and character patterns or information or data obtained as a result of cutting out these to the pattern extraction unit 16 as cutout information.

【００３０】文字行及び文字パタンの切出しは従来周知
の種々の方法により行うことができるが、この実施例で
は次のようにして行う。The character line and the character pattern can be cut out by various conventionally known methods. In this embodiment, it is carried out as follows.

【００３１】切出し部１４は着目ブロック内の例えば第
１行目の文字行のみを切出す。このため切出し部１４は
走査範囲を着目ブロック、主走査方向を水平方向Ｘ及び
副走査方向を垂直方向Ｙとし、副走査位置Ｙの小さい順
に、走査範囲内の水平な走査線上の黒ビット累積個数を
求める。そして副走査位置Ｙの小さい順に黒ビット累積
個数を求めてゆく過程で、第１番目に検出した水平な黒
線Ａ及びＢの位置Ｙを第１行目の文字行の垂直方向にお
ける切出し開始及び終了位置とする。また着目ブロック
の水平方向における始端及び終端位置を、第１行目の文
字行の水平方向における切出し開始及び終了位置とす
る。第１行目の文字行は、当該文字行の垂直方向におけ
る切出し開始及び終了位置を通る水平な２つの走査線が
挟み、かつ当該文字行の水平方向における切出し開始及
び終了位置を通る垂直な２つの走査線が挟む領域であ
る。The cutout unit 14 cuts out, for example, only the first character line in the target block. Therefore, the cutout unit 14 sets the scanning range as the target block, the main scanning direction as the horizontal direction X and the sub-scanning direction as the vertical direction Y, and the sub-scanning position Y in ascending order, the cumulative number of black bits on the horizontal scanning lines in the scanning range. Ask for. Then, in the process of obtaining the cumulative number of black bits in ascending order of the sub-scanning position Y, the position Y of the first detected horizontal black line A and B is cut out in the vertical direction of the first character line and Set as the end position. Further, the start and end positions in the horizontal direction of the block of interest are set as the cutout start and end positions in the horizontal direction of the first character line. The first character line is sandwiched by two horizontal scanning lines passing through the cutout start and end positions of the character line in the vertical direction, and two vertical scan lines passing through the cutout start and end positions of the character line in the horizontal direction. The area between two scanning lines.

【００３２】次に切出し部１４は第１行目の文字行内の
全部又は一部、例えば全部の文字パタンを切出し、文字
パタンの文字切出し位置として文字外接枠の位置を検出
する。このため切出し部１４は、走査範囲を着目ブロッ
クの第１行目の文字行、主走査方向を垂直方向Ｙ及び副
走査方向を水平方向Ｘとし、走査範囲内の垂直な走査線
上の黒ビット累積個数を各副走査位置Ｘ毎に求める。そ
してこの求めた黒ビット累積個数を副走査位置Ｘの小さ
い順に参照してゆき、第ｊ番目に検出した垂直な黒線Ａ
及びＢの位置Ｘ（これら黒線Ａ及びＢの位置Ｘは文字外
接枠の左端及び右端位置を表す）を第ｊ番目の文字パタ
ンに関する水平方向の切出し開始及び終了位置とする。
次いで走査範囲を文字行内の、これら第ｊ番目の垂直な
黒線Ａ及びＢで挟む領域、主走査方向を水平方向Ｘ及び
副走査方向を垂直方向Ｙとして、各副走査位置Ｙ毎に走
査範囲内の水平な走査線上の黒ビット累積個数を求め
る。そしてこの求めた黒ビット累積個数を副走査位置Ｙ
の小さい順に参照して水平な黒線Ａ及びＢを検出し、こ
れら水平な黒線Ａの副走査位置Ｙのうち最大のＹ（この
Ｙは文字外接枠の上端位置を表す）を第ｊ番目の文字パ
タンに関する垂直方向の切出し開始位置としまたこれら
水平な黒線Ｂの副走査位置Ｙのうち最小のＹ（このＹは
文字外接枠の下端位置を表す）を第ｊ番目の文字パタン
に関する垂直方向の切出し終了位置とする。ひとつの着
目ブロックにつき第１行目の文字行の文字パタンを切出
し終えたら次の着目ブロックにつき第１行目の文字行の
文字パタンを切出す。尚、文字外接枠は、当該枠の左端
及び右端位置を通る２本の垂直な走査線と、上端及び下
端位置を通る２本の水平な走査線との交点を結んで得ら
れる矩形枠である。Next, the cutout unit 14 cuts out all or part, for example, all the character patterns in the first character line, and detects the position of the character circumscribing frame as the character cutting position of the character pattern. Therefore, the cutout unit 14 sets the scanning range to the first character line of the target block, sets the main scanning direction to the vertical direction Y and the sub-scanning direction to the horizontal direction X, and accumulates the black bits on the vertical scanning lines in the scanning range. The number is obtained for each sub-scanning position X. Then, the obtained cumulative number of black bits is referred to in the ascending order of the sub-scanning position X, and the j-th detected vertical black line A is detected.
The positions X of B and B (the positions X of the black lines A and B represent the left end and right end positions of the character circumscribing frame) are set as the horizontal cutting start and end positions for the j-th character pattern.
Next, the scanning range is a region sandwiched by these j-th vertical black lines A and B in the character line, the main scanning direction is the horizontal direction X and the sub-scanning direction is the vertical direction Y, and the scanning range is for each sub-scanning position Y. The cumulative number of black bits on the horizontal scanning line in is calculated. Then, the obtained cumulative number of black bits is set to the sub-scanning position Y.
The horizontal black lines A and B are detected by referring to the ascending order of, and the maximum Y (Y represents the upper end position of the character circumscribing frame) among the sub-scanning positions Y of these horizontal black lines A is set to the j-th position. Of the horizontal scanning line B, and the minimum Y (Y represents the lower end position of the circumscribing frame) of the horizontal black line B is the vertical cutout position for the j-th character pattern. The cutout end position in the direction. After cutting out the character pattern of the first character line for one target block, the character pattern of the first character line for the next target block is cut out. The character circumscribing frame is a rectangular frame obtained by connecting the intersections of two vertical scanning lines passing through the left end and right end positions of the frame and two horizontal scanning lines passing through the upper end and lower end positions. ..

【００３３】切出し部１４は着目ブロックの文字行及び
文字パタンの切出し順次に、切出し情報をパタン特徴抽
出部１６へ出力する。パタン特徴抽出部１６は切出し情
報を利用して各文字ブロック毎に文字パタンの特徴を抽
出する。切出し情報は例えば、文字パタン及び文字行の
切出し位置、文字行内の画像データ、或は文字外接枠内
の画像データである。The cutout unit 14 sequentially outputs the cutout information to the pattern feature extraction unit 16 in order to cut out the character line and the character pattern of the block of interest. The pattern feature extraction unit 16 uses the cut-out information to extract the feature of the character pattern for each character block. The cutout information is, for example, the cutout position of the character pattern and the character line, the image data in the character line, or the image data in the character circumscribing frame.

【００３４】この実施例では、切出し部１４は文字パタ
ンの切出し位置を切出し情報として出力し、パタン特徴
抽出部１６は文字パタンの文字高さを文字ブロックｊの
文字パタンの特徴Ｆ_jとして求める。ｊは文字ブロック
番号であり、図２に示す例ではｊ＝３４、３６、３８又
は４０である。文字高さは、文字パタンの垂直方向にお
ける切出し開始及び終了位置の離間間隔で表せる。In this embodiment, the cutout unit 14 outputs the cutout position of the character pattern as cutout information, and the pattern feature extraction unit 16 obtains the character height of the character pattern as the feature F _j of the character pattern of the character block j. j is a character block number, and j = 34, 36, 38 or 40 in the example shown in FIG. The character height can be represented by the space between the cutout start and end positions in the vertical direction of the character pattern.

【００３５】しかもパタン特徴抽出部１６は、文字ブロ
ックｊが含む全部又は一部の文字パタンに関して得た特
徴の平均値を文字パタンの特徴Ｆ_jとして求める。例え
ば図２に示す例において、文字ブロックｊの第１行目の
文字行が含む全ての文字パタンに関して得た平均特徴
を、文字パタンの特徴Ｆ_jとすれば、文字ブロック３４
の特徴Ｆ₃₄はＦ₃₄＝３９．１、文字ブロック３６の特徴
Ｆ₃₆はＦ₃₆＝３７．５、文字ブロック３８の特徴Ｆ₃₈は
Ｆ₃₈＝３１．２、文字ブロック４０の特徴Ｆ₄₀はＦ₄₀＝
３２．１となる。パタン特徴抽出部１６は各文字ブロッ
クｊ毎に求めた特徴Ｆ_jを順序判定部１８へ出力する。Moreover, the pattern feature extraction unit 16 obtains the average value of the features obtained for all or some of the character patterns included in the character block _j as the feature F _j of the character pattern. For example, in the example shown in FIG. 2, if the average feature obtained for all the character patterns included in the first character line of the character block j is the character pattern feature F _j , the character block 34
The characteristic F ₃₄ of the character block 36 is F ₃₄ = 39.1, the characteristic F ₃₆ of the character block 36 is F ₃₆ = 37.5, the characteristic F ₃₈ of the character block ₃₈ is F ₃₈ = 31.2, and the characteristic F ₄₀ of the character block ₄₀ is F ₄₀ =
It becomes 32.1. The pattern feature extraction unit 16 outputs the feature F _j obtained for each character block _j to the order determination unit 18.

【００３６】この実施例の理解を助けるため、文字ブロ
ック３４、３６、３８及び４０に関する第１行目の文字
列とこの文字列の各文字パタンの文字外接枠及び文字高
さとを図３及び図４に示す。図３（Ａ）は文字ブロック
３４に関する図、図３（Ｂ）は文字ブロック３８に関す
る図、図４（Ａ）は文字ブロック３６に関する図及び図
４（Ｂ）は文字ブロック４０に関する図である。In order to facilitate understanding of this embodiment, the character string on the first line of the character blocks 34, 36, 38 and 40 and the character circumscribing frame and character height of each character pattern of this character string are shown in FIGS. 4 shows. 3A is a diagram relating to the character block 34, FIG. 3B is a diagram relating to the character block 38, FIG. 4A is a diagram relating to the character block 36, and FIG. 4B is a diagram relating to the character block 40.

【００３７】順序判定部１８は文字ブロック抽出部１２
から入力した各文字ブロックの位置情報を利用し、次式
（２）に従い文字ブロックｊの位置評価値Ｅ_jを求め
る。The order determination unit 18 is a character block extraction unit 12
Using the position information of each character block input from, the position evaluation value E _j of the character block j is obtained according to the following equation (2).

【００３８】Ｅ_j＝Ｙ_ej＋Ｆ・Ｘ_ej ……（２）但し、Ｙ_ej及びＸ_ejは文字ブロックｊの左上隅点Ｐの画
像メモリ上のＹ及びＸ座標を表す。一例として、文字ブ
ロック３４の左上隅点Ｐを図２に示す。またＦは任意好
適に定められる定数を示し、例えばＦ＝１０である。E _j = Y _ej + F · X _ej (2) where Y _ej and X _ej represent the Y and X coordinates on the image memory of the upper left corner point P of the character block j. As an example, the upper left corner point P of the character block 34 is shown in FIG. Further, F represents a constant that is arbitrarily determined and is, for example, F = 10.

【００３９】この例では、位置評価値Ｅ_jの小さい順に
各文字ブロックｊに対し仮の順序を定め、従って図２の
例では文字ブロック３４〜４０の仮の順序は、文字ブロ
ック３４、３８、３６及び４０の順となる。尚、位置評
価値Ｅ_jの大きい順に仮の順序を定めてもよい。In this example, a temporary order is set for each character block j in the ascending order of the position evaluation value E _j . Therefore, in the example of FIG. 2, the temporary order of the character blocks 34-40 is the character blocks 34, 38 ,. The order is 36 and 40. Note that a temporary order may be set in descending order of the position evaluation value E _j .

【００４０】これと共に順序判定部１８は、パタン特徴
抽出部１６から入力した各文字ブロックｊの特徴Ｆ_jを
利用し、各文字ブロックｊを特徴Ｆ_jが類似するもの同
志（ほぼ等しいもの同志）にグループ分けし、各文字ブ
ロックｊにいずれのグループに属するかを識別するため
のグループ識別情報を付与する。例えば、次式（３）を
満足する特徴Ｆ_jを有する文字ブロックをひとつのグル
ープとすればよい。At the same time, the order determination unit 18 uses the feature F _j of each character block j input from the pattern feature extraction unit 16, and the character blocks _j are similar to each other in feature F _j (almost equal to each other). And group identification information for identifying which group each character block j belongs to. For example, the character blocks having the feature F _j satisfying the following expression (3) may be set as one group.

【００４１】｜Ｆ_j1−Ｆ_j2｜＜Ｕ ……（３）但し、ｊ１及びｊ２は文字ブロック番号を示し、ｊ１≠
ｊ２である。またＵはイメージスキャナの解像度、文字
媒体の種類、文字の大きさそのほかを考慮して任意好適
に定められる定数を示し、例えばＵ＝５である。| F _j1 −F _j2 | <U (3) where j1 and j2 are character block numbers, and j1 ≠
j2. Further, U represents a constant arbitrarily determined in consideration of the resolution of the image scanner, the type of character medium, the size of characters, and the like, and U = 5, for example.

【００４２】図２に示す文字ブロック３４〜４０におい
ては、文字パタン特徴Ｆ₃₄＝３９．１、Ｆ₃₆＝３７．
５、Ｆ₃₈＝３１．２及びＦ₄₀＝３２．１であったので、
Ｕ＝５として（３）式を満足する文字ブロックのグルー
プは２つでき、ひとつのグループは文字ブロック３４及
び３６が構成し、他のひとつのグループは文字ブロック
３８及び４０が構成することとなる。In the character blocks _{34 to} 40 shown in FIG. 2, character pattern features F ₃₄ = 39.1, F ₃₆ = 37.
5, F ₃₈ = 31.2 and F ₄₀ = 32.1.
When U = 5, two groups of character blocks satisfying the expression (3) can be formed. One group is composed of the character blocks 34 and 36, and the other group is composed of the character blocks 38 and 40. ..

【００４３】同じグループに属する文字ブロックは、共
通の種類或は属性を有する文字（例えば文字高さが互い
に等しい文字）を含む文字ブロックであり、従って共通
の情報例えば同一文脈を構成する文字ブロックである。Character blocks belonging to the same group are character blocks including characters having a common type or attribute (for example, characters having the same character height), and thus common information, for example, character blocks forming the same context. is there.

【００４４】次に順序判定部１８は位置評価値Ｅ_iとグ
ループ識別情報とを利用し、各グループ毎に位置評価値
の小さい順に文字ブロックの正式の順序を定める。例え
ば次に示す１）〜４）の処理に従って正式の順序を定め
る処理を行う。Next, the order determining unit 18 uses the position evaluation value E _i and the group identification information to determine the formal order of the character blocks for each group in ascending order of the position evaluation value. For example, a process for determining a formal order is performed according to the following processes 1) to 4).

【００４５】１）まず文書３２が含む全ての文字ブロッ
クを選択対象とする。1) First, all character blocks included in the document 32 are selected.

【００４６】２）次に選択対象のなかから位置評価値の
最も小さい文字ブロックを検出し、この文字ブロックに
対し正式の順序番号１を付与すると共に、当該文字ブロ
ックを選択対象から除外する。2) Next, the character block having the smallest position evaluation value is detected from the selection targets, the formal sequence number 1 is given to this character block, and the character block is excluded from the selection targets.

【００４７】３）次に選択対象のグループ識別情報を位
置評価値の小さい順に参照し、処理２）で検出した順序
番号１の文字ブロックと同じグループの文字ブロック
を、選択対象のなかから検出する。この検出する過程に
おいて、第ｋ番目（ｋ＝１、２、……）に検出した、順
序番号１の文字ブロックと同じグループの文字ブロック
に対し正式の順序番号ｋ＋１を付与すると共に、当該順
序番号を付与した文字ブロックを選択対象から除外す
る。処理３）の開始時点での選択対象の全てにつきグル
ープ識別情報の参照を終了したら、処理３）を終了す
る。3) Next, the group identification information of the selection target is referred to in the ascending order of the position evaluation value, and the character block of the same group as the character block of the sequence number 1 detected in the process 2) is detected from the selection targets. .. In the process of this detection, the formal sequence number k + 1 is given to the k-th (k = 1, 2, ...) Detected character block of the same group as the character block of sequence number 1, and the sequence number is added. Exclude the character block with "" from selection. When the reference of the group identification information is completed for all the selection targets at the time of starting the process 3), the process 3) is ended.

【００４８】処理２）及び３）によって、同一グループ
に属する全ての文字ブロックに対しそれぞれ正式の順序
番号が付与され、しかもより位置評価値の小さい文字ブ
ロックに対しより小さい順序番号が付与される。例えば
同一グループ内において順序番号がより小さい文字ブロ
ックをより先順位の文字ブロックとして、各文字ブロッ
クの正式の順序を定める。By the processes 2) and 3), a formal sequence number is given to all the character blocks belonging to the same group, and a smaller sequence number is given to the character block having a smaller position evaluation value. For example, the formal order of each character block is determined by regarding the character block with the smaller order number in the same group as the character block with the higher order.

【００４９】４）次に選択対象となる文字ブロックが残
存するか否かを判定する。残存すれば、残りの他のグル
ープに関し正式の順序番号１を付与すべき文字ブロック
を検出するため、処理２）を再び行う。選択対象となる
文字ブロックが残存しなければ、文書３２が含む全ての
文字ブロックに対し各グループ毎に正式の順序番号を付
与し終えたので、正式の順序を定める処理を終了する。4) Next, it is determined whether the character block to be selected remains. If it remains, the process 2) is performed again in order to detect the character block to which the formal sequence number 1 is to be added for the remaining other groups. If no character block to be selected remains, the process of determining the formal order ends because all the character blocks included in the document 32 have been given the formal order numbers for each group.

【００５０】例えば図２に示す例では、上述の処理１）
〜４）により、まずひとつのグループに属する文字ブロ
ック３４及び３６に対し正式の順序番号１及び２が付与
され、次いで残りの他のグループに属する文字ブロック
３８及び４０に対し正式の順序番号１及び２が付与され
る。For example, in the example shown in FIG. 2, the above processing 1)
4), the formal sequence numbers 1 and 2 are first given to the character blocks 34 and 36 belonging to one group, and then the formal sequence numbers 1 and 2 are given to the character blocks 38 and 40 belonging to the other remaining groups. 2 is given.

【００５１】順序判定部１８は各文字ブロックの正式の
順序番号とグループ識別情報とをブロック選択部２４へ
出力する。The order determination section 18 outputs the formal order number of each character block and the group identification information to the block selection section 24.

【００５２】ブロック選択部２４は文字ブロック抽出部
１２から入力したブロックデータを各文字ブロック毎に
図示しないブロックデータメモリに格納する。そして各
文字ブロックの正式の順序番号及びグループ識別情報に
基づいて、各グループ毎にブロックデータを正式の順序
で順次に切出し部２６へ出力する。ブロック選択部２４
はひとつのブロックデータを切出し部２６へ出力する
と、切出し部２６が当該ブロックデータにつき全ての文
字パタンの切出しを終了するまで次のブロックデータの
出力を待ち、当該ブロックデータの全文字パタンの切出
しが終了すると、次のブロックデータを切出し部２６へ
出力する。The block selection unit 24 stores the block data input from the character block extraction unit 12 for each character block in a block data memory (not shown). Then, based on the formal sequence number and group identification information of each character block, the block data is sequentially output to the cutout unit 26 for each group in the formal sequence. Block selection unit 24
Outputs one block data to the cutout unit 26, waits for the output of the next block data until the cutout unit 26 finishes cutting out all the character patterns for the block data, and cuts out all the character patterns of the block data. When finished, the next block data is output to the cutout unit 26.

【００５３】切出し部２６はブロックデータの入力順次
に、ブロックデータから従来周知の方法により文字パタ
ンを切出し、文字パタンの画像データ（文字データ）と
して文字外接枠内の画像データを認識部２８へ出力す
る。The cut-out unit 26 sequentially cuts out character data from the block data by a conventionally known method, and outputs the image data in the character circumscribed frame to the recognition unit 28 as image data (character data) of the character pattern. To do.

【００５４】認識部２８は文字データに基づいて文字パ
タンの認識を行い、その認識結果を次段の装置例えば言
語処理或は知識処理を行う装置へ出力する。認識部２８
の構成及び認識処理は従来周知の種々のものとすること
ができるが、この実施例では、認識部２８を図５に示す
構成のものとする。The recognition unit 28 recognizes a character pattern based on the character data, and outputs the recognition result to a device in the next stage, for example, a device that performs language processing or knowledge processing. Recognition unit 28
Although the structure and the recognition process of FIG. 5 can be variously known conventionally, in this embodiment, the recognition unit 28 has the structure shown in FIG.

【００５５】図５は認識部の構成の一例を示す機能ブロ
ック図であり、同図にも示すようにこの実施例の認識部
２８はサブパタン抽出部４２、特徴抽出部４４及び照合
部４６を備える。FIG. 5 is a functional block diagram showing an example of the structure of the recognition unit. As shown in FIG. 5, the recognition unit 28 of this embodiment includes a sub-pattern extraction unit 42, a feature extraction unit 44 and a collation unit 46. ..

【００５６】サブパタン抽出部４２は、文字データを図
示しない文字パタンメモリに格納し、主走査方向を異な
る複数種類の方向として文字データを走査する。そして
各走査線毎に走査線上で所定個数ｍ（例えばｍ＝５）以
上連続する黒ビットの塊を検出し、この黒ビットの塊を
当該塊を検出した主走査方向に関するサブパタンの文字
線成分として抽出する。サブパタン抽出部４２は、各主
走査方向毎に、検出したサブパタンの文字線成分をサブ
パタンメモリに格納する。ひとつの文字データから、主
走査方向の種類の個数と同個数ｎのサブパタンを抽出す
る。The sub-pattern extracting section 42 stores the character data in a character pattern memory (not shown) and scans the character data with the main scanning direction being a plurality of different directions. Then, a lump of black bits consecutive for a predetermined number m (for example, m = 5) or more on each scan line is detected, and this lump of black bits is used as a character line component of the sub-pattern in the main scanning direction in which the lump is detected. Extract. The sub-pattern extraction unit 42 stores the character line component of the detected sub-pattern in the sub-pattern memory for each main scanning direction. From one character data, the same number n of sub patterns as the number of types in the main scanning direction are extracted.

【００５７】文字パタン及びサブパタンメモリ上には文
書３２上に設定したＸ−Ｙ座標系に相対応するＸ−Ｙ座
標系を設定し、これらメモリの格納場所にそれぞれ座標
（Ｘ、Ｙ）を付与する。そして文字データを構成する黒
ビット及び白ビットを、当該ビットの文書３２上での座
標と対応する座標を有する文字パタンメモリの格納場所
に格納し、またサブパタンの文字線成分を、当該文字線
成分の文書３２上での座標と対応する座標を有するサブ
パタンメモリの格納場所に格納する。サブパタンメモリ
の文字線成分が格納されなかった格納場所にはサブパタ
ンの文字背景成分としての白ビットを格納する。On the character pattern and sub pattern memory, an XY coordinate system corresponding to the XY coordinate system set on the document 32 is set, and coordinates (X, Y) are respectively set in the storage locations of these memories. Give. Then, the black bit and the white bit forming the character data are stored in the storage location of the character pattern memory having the coordinates corresponding to the coordinates of the bit on the document 32, and the character line component of the sub-pattern is set to the character line component. It is stored in the storage location of the sub-pattern memory having the coordinates corresponding to the coordinates on the document 32. A white bit as a character background component of the sub pattern is stored in the storage location where the character line component of the sub pattern memory is not stored.

【００５８】具体的に一例を挙げれば、文字行方向をＸ
軸方向（水平方向）とし、文字文字パタンの主走査方向
をＸ軸方向、Ｙ軸方向（垂直方向）、Ｘ軸から反時計回
りに４５°回転した方向（左斜め方向）及びＸ軸から時
計回りに４５°回転した方向（右斜め方向）の４つの異
なる方向として文字データを走査する。従ってこの場
合、ひとつの文字データから水平、垂直、左斜め及び右
斜めサブパタンの４個のサブパタンを抽出することとな
る。As a specific example, the character line direction is X.
The axial direction (horizontal direction), the main scanning direction of the character pattern is the X-axis direction, the Y-axis direction (vertical direction), the direction rotated 45 ° counterclockwise from the X-axis (left diagonal direction), and the X-axis clockwise. Character data is scanned as four different directions of a direction rotated by 45 ° (obliquely to the right). Therefore, in this case, four sub patterns of horizontal, vertical, diagonal left and diagonal right are extracted from one character data.

【００５９】垂直サブパタンを抽出する場合には、主走
査方向を垂直方向として文字データを走査し、垂直な走
査線上で連続する黒ビットの塊（黒ラン）を検出する。
この黒ランを構成する黒ビットの総個数（黒ランの長さ
Ｌ）がＬ≧ｍを満足するとき当該黒ランを垂直サブパタ
ンの文字線成分として抽出する。Ｌ≧ｍを満足しない長
さＬの黒ランは垂直サブパタンの文字背景成分となる。
残りの３個のサブパタンも、垂直サブパタンの場合と同
様にして、抽出する。When the vertical sub-pattern is extracted, character data is scanned with the main scanning direction as the vertical direction, and a continuous block of black bits (black run) is detected on the vertical scanning line.
When the total number of black bits forming the black run (the length L of the black run) satisfies L ≧ m, the black run is extracted as the character line component of the vertical sub-pattern. A black run having a length L that does not satisfy L ≧ m becomes a character background component of the vertical sub-pattern.
The remaining three sub-patterns are also extracted in the same manner as the vertical sub-pattern.

【００６０】特徴抽出部４４は、文字データから抽出し
たｎ個のサブパタンそれぞれにつき特徴マトリクスＦを
抽出し、ひとつの文字データに関しｎ個の特徴マトリク
スＦを抽出する。特徴抽出部４４は各文字データ毎に得
たｎ個の特徴マトリクスＦを照合部４６に出力する。The feature extraction unit 44 extracts the feature matrix F for each of the n sub patterns extracted from the character data, and extracts the n feature matrix F for one character data. The feature extraction unit 44 outputs the n feature matrices F obtained for each character data to the matching unit 46.

【００６１】サブパタンから特徴マトリクスＦを抽出す
るに当たっては、当該サブパタンを得た文字データの文
字外接枠を文書３２上での座標位置と対応するサブパタ
ンメモリ上の座標位置に設定する。次いで文字外接枠が
囲むサブパタンメモリ上の領域をＮ×Ｍ個（Ｎ及びＭは
それぞれ任意好適に定められる自然数）の小領域に分割
する。Ｎ×Ｍ個の小領域をそれぞれ小領域ｉと表す。そ
してＮ×Ｍ個の小領域のそれぞれにつき小領域ｉ内に存
在する文字線成分の長さを表す特徴量ｅ_iを求め、これ
ら特徴量ｅ_iをそれぞれ文字外接枠の大きさで正規化す
る。正規化された特徴量ｅ_iを特徴量ｆ_iと表す。特徴
量ｆ_iは特徴マトリクスＦの要素値であり、特徴マトリ
クスＦはＮ×Ｍ個の特徴量ｆ_iから成る。In extracting the feature matrix F from the sub pattern, the character circumscribing frame of the character data for which the sub pattern is obtained is set at the coordinate position on the sub pattern memory corresponding to the coordinate position on the document 32. Then, the area on the sub-pattern memory surrounded by the character circumscribing frame is divided into N × M small areas (N and M are natural numbers arbitrarily determined). Each of the N × M small areas is represented as a small area i. Then, for each of the N × M small areas, a characteristic quantity e _i representing the length of the character line component existing in the small area _i is obtained, and these characteristic quantities e _i are each normalized by the size of the character circumscribing frame. .. The normalized feature value e _i represents the feature amount f _i. The feature quantity f _i is an element value of the feature matrix F, and the feature matrix F is composed of N × M feature quantities f _i .

【００６２】例えばＮ＝Ｍ＝８とし、特徴量ｅ_iを（ｄ
Ｘ＋ｄＹ）／２で除して得た値を特徴量ｆ_iとする。ｄ
Ｘ及びｄＹは文字外接枠の水平及び垂直方向における長
さを表す。For example, when N = M = 8, the feature quantity e _i is (d
X + dY) / 2 The value obtained by dividing the feature amount f _i. d
X and dY represent the lengths of the character circumscribing frame in the horizontal and vertical directions.

【００６３】照合部４６は、図示せずも、標準文字パタ
ンの特徴マトリクス（辞書マトリクス）Ｇを格納した辞
書メモリを備える。辞書マトリクスＧは特徴マトリクス
Ｆと同様にして標準文字パタンから抽出した特徴量であ
り、例えば、標準文字パタンの水平、垂直、左斜め及び
右斜めサブパタンからそれぞれ特徴マトリクスを抽出
し、これら４個の特徴マトリクスをそれぞれ当該標準パ
タンの辞書マトリクスＧとしている。The collating unit 46 includes a dictionary memory (not shown) in which a characteristic matrix (dictionary matrix) G of standard character patterns is stored. The dictionary matrix G is a feature quantity extracted from the standard character pattern in the same manner as the feature matrix F. For example, the feature matrix is extracted from the horizontal, vertical, left diagonal, and right diagonal sub-patterns of the standard character pattern, and these four matrixes are extracted. The feature matrix is the dictionary matrix G of the standard pattern.

【００６４】照合部４６は、特徴マトリクスＦ及び辞書
マトリクスＧのサブパタンの種類が同じもの同志例えば
垂直サブパタンの特徴マトリクスＦ及び辞書マトリクス
Ｇ同志を照合し、これらマトリクス間の類似度Ｒを次式
（４）に従って求める。そしてサブパタンの各種類毎に
求めた類似度Ｒがそれぞれ予め定めた値Ｐ以上となる標
準文字パタンに付与されている文字名を、当該特徴マト
リクスＦを得た文字データの候補文字名として検出す
る。照合部４６は一又は複数の候補文字名を、ひとつの
文字データにつき検出し認識結果として次段の装置へ出
力する。複数の候補文字名を検出した場合には、これら
候補文字名に対し類似度Ｒが高い順に第１位、第２位、
……と順位付けし、これら順位付けした候補文字名を認
識結果とする。The collation unit 46 collates the feature matrix F and the dictionary matrix G having the same type of sub-pattern, for example, the feature matrix F and the dictionary matrix G having the vertical sub-pattern, and the similarity R between these matrices is given by the following equation ( Obtain according to 4). Then, a character name given to a standard character pattern whose similarity R obtained for each type of sub-pattern is a predetermined value P or more is detected as a candidate character name of the character data for which the characteristic matrix F is obtained. .. The collation unit 46 detects one or a plurality of candidate character names for one character data and outputs the result as a recognition result to the next apparatus. When a plurality of candidate character names are detected, the first rank, the second rank,
.. are ranked, and these ranked candidate character names are used as recognition results.

【数１】[Equation 1]

【００６５】 [0065]

【００６６】但し、ｇ_iは辞書マトリクスＧの要素値を
示す。However, g _i indicates the element value of the dictionary matrix G.

【００６７】図６は第一及び第二発明の第二実施例の構
成を概略的に示す機能ブロック図である。尚、第一実施
例の構成成分に対応する構成成分については同一の符号
を付して示す。以下の第二実施例の説明では、主として
第一実施例と相違する点につき説明し、第一実施例と同
様の点についてはその詳細な説明を省略する。FIG. 6 is a functional block diagram schematically showing the configuration of the second embodiment of the first and second inventions. The constituents corresponding to those of the first embodiment are designated by the same reference numerals. In the following description of the second embodiment, differences from the first embodiment will be mainly described, and detailed description of the same points as the first embodiment will be omitted.

【００６８】同図において４８は第一発明の第二実施例
としての情報処理装置を示し、この情報処理装置４８は
文字ブロック抽出部５０、切出し部５２、パタン特徴抽
出部１６及び順序判定部１８を備える。また５６は第二
発明の第二実施例としての文字認識装置を示し、この文
字認識装置５６は画像生成部２２、情報処理装置４８、
ブロック選択部５８及び認識部２８を備える。この実施
例では、情報処理装置４８において文字パタンの特徴を
抽出するための文字パタンを切り出す切出し部５２を、
文字認識装置５４において文字パタンの認識のため文字
パタンを切り出す切出し部としても用いる。In the figure, reference numeral 48 denotes an information processing apparatus as a second embodiment of the first invention. This information processing apparatus 48 includes a character block extracting section 50, a cutting section 52, a pattern feature extracting section 16 and an order determining section 18. Equipped with. Reference numeral 56 represents a character recognition device as a second embodiment of the second invention. The character recognition device 56 includes the image generation unit 22, the information processing device 48,
The block selection unit 58 and the recognition unit 28 are provided. In this embodiment, the information processing device 48 includes a cutout unit 52 that cuts out a character pattern for extracting characteristics of the character pattern.
The character recognizing device 54 is also used as a cutout unit for cutting out a character pattern for recognition of the character pattern.

【００６９】第二実施例では、文字ブロック抽出部５０
は文書３２の画像データから文字ブロックを抽出し、抽
出した文字ブロックの位置情報を順序判定部１８へ出力
すると共に、文字ブロック内の画像データ（ブロックデ
ータ）を切出し部５２へ出力する。文字ブロック抽出部
５０はブロック選択部５８へはブロックデータを出力し
ない。In the second embodiment, the character block extraction unit 50
Extracts a character block from the image data of the document 32, outputs the position information of the extracted character block to the order determination unit 18, and outputs the image data (block data) in the character block to the cutout unit 52. The character block extraction unit 50 does not output the block data to the block selection unit 58.

【００７０】切出し部５２は、各文字ブロック毎に、ブ
ロックデータを図示しないブッロクデータメモリに格納
する。そして文書３２の全ての文字ブロックをひとつず
つ順次に着目ブロックとし、着目ブロック内のブロック
データを走査して、着目ブロック内の全ての文字行を切
り出す。次いで文字行内のブロックデータを走査して文
字パタンを切り出し、最終的に着目ブロック内の全ての
文字パタンを切り出す。そして切出し部５２は、着目ブ
ロック内の全部又は一部の文字パタンの切出し情報をパ
タン特徴抽出部５４へ出力し、これと共に着目ブロック
内の全部の文字パタンの画像データをブロック選択部５
８へ出力する。The slicing section 52 stores the block data for each character block in a block data memory (not shown). Then, all the character blocks of the document 32 are sequentially set as the target block one by one, the block data in the target block is scanned, and all the character lines in the target block are cut out. Next, the block data in the character line is scanned to cut out a character pattern, and finally all the character patterns in the target block are cut out. Then, the cutout unit 52 outputs the cutout information of all or some of the character patterns in the target block to the pattern feature extraction unit 54, and at the same time, outputs the image data of all the character patterns in the target block to the block selection unit 5
Output to 8.

【００７１】ブロック選択部５８は、各文字ブロック毎
に、文字パタンの画像データを図示しない文字パタンメ
モリに格納する。そしてブロック選択部５８は順序判定
部１８から入力した各文字ブロックの正式の順序番号及
びグループ識別情報に基づいて、各グループ毎に正式の
順序番号に従って文字ブロックを選択し、選択順次に文
字ブロック内の文字データを認識部２８へ出力する。The block selection unit 58 stores the image data of the character pattern for each character block in a character pattern memory (not shown). Then, the block selection unit 58 selects a character block according to the formal sequence number for each group based on the formal sequence number and the group identification information of each character block input from the sequence determination unit 18, and sequentially selects the character blocks. The character data of is output to the recognition unit 28.

【００７２】図７は第一及び第二発明の第三実施例の全
体構成を概略的に示す機能ブロック図である。尚、第一
実施例の構成成分に対応する構成成分については同一の
符号を付して示す。以下の第三実施例の説明では、主と
して第一実施例と相違する点につき説明し、第一実施例
と同様の点についてはその詳細な説明を省略する。FIG. 7 is a functional block diagram schematically showing the overall structure of the third embodiment of the first and second inventions. The constituents corresponding to those of the first embodiment are designated by the same reference numerals. In the following description of the third embodiment, differences from the first embodiment will be mainly described, and detailed description of the same points as the first embodiment will be omitted.

【００７３】同図において６０は第一発明の第三実施例
としての情報処理装置を示し、この情報処理装置６０は
文字ブロック抽出部１２、切出し部１４、パタン特徴抽
出部６２及び順序判定部１８を備える。また６４は第二
発明の第三実施例としての文字認識装置を示し、この文
字認識装置６４は画像生成部２２、情報処理装置６０、
ブロック選択部２４、切出し部２６及び認識部２８を備
える。In the figure, reference numeral 60 denotes an information processing apparatus as a third embodiment of the first invention, and the information processing apparatus 60 includes a character block extraction unit 12, a cutout unit 14, a pattern feature extraction unit 62 and an order determination unit 18. Equipped with. Reference numeral 64 denotes a character recognition device as a third embodiment of the second invention, and the character recognition device 64 is an image generation unit 22, an information processing device 60,
The block selection unit 24, the cutout unit 26, and the recognition unit 28 are provided.

【００７４】次に図８に示す文書の文字認識を例に取っ
てこの実施例の動作につき説明する。図８は文書の他の
例を示す図である。同図において６６は文字媒体として
の文書を示し、文書６６は文字線幅が太い文字から成り
ひとつの文脈を形成する文字ブロック６８及び７０と、
文字線幅が細い文字から成り別のひとつの文脈を形成す
る文字ブロック７２及び７４とを有する。Next, the operation of this embodiment will be described by taking the character recognition of the document shown in FIG. 8 as an example. FIG. 8 is a diagram showing another example of a document. In the figure, reference numeral 66 denotes a document as a character medium, and the document 66 includes character blocks 68 and 70 which are formed of characters having a large character line width and form one context,
The character blocks 72 and 74 are formed of characters having a narrow character line width and form another context.

【００７５】画像データ生成部２２が文書６６の画像デ
ータを生成し終わると、文字ブロック抽出部１２は文書
６６から文字ブロック６８〜７４をそれぞれ抽出し、次
いで切出し部１４は文字ブロック６８〜７４から文字パ
タンを切り出し文字パタンの切出し情報をパタン特徴抽
出部６２へ出力する。When the image data generation unit 22 finishes generating the image data of the document 66, the character block extraction unit 12 extracts the character blocks 68 to 74 from the document 66, and the cutout unit 14 then extracts the character blocks 68 to 74. A character pattern is cut out, and the cut-out information of the character pattern is output to the pattern feature extraction unit 62.

【００７６】この実施例では、文字パタンの水平方向に
おける切出し開始及び終了位置を文字外接枠の左端及び
右端位置とし、また文字パタンの垂直方向における切出
し開始及び終了位置を文字行の垂直方向における切出し
開始及び終了位置とする。そして文字外接枠の左端及び
右端位置の間の文字行内の領域の画像データを文字デー
タとし、この文字データを切出し情報として出力する。In this embodiment, the cutout start and end positions in the horizontal direction of the character pattern are the left and right end positions of the character circumscribing frame, and the cutout start and end positions in the vertical direction of the character pattern are cutout in the vertical direction of the character line. The start and end positions. Then, the image data of the area in the character line between the left end and right end positions of the character circumscribing frame is set as character data, and this character data is output as cutout information.

【００７７】パタン特徴抽出部６２は、文字データを図
示しない文字パタンメモリに格納し、文字パタンの特徴
Ｆ_j（この例ではｊ＝６８、７０、７２又は７４であ
る。）として文字パタンの線幅Ｗを抽出する。線幅Ｗの
抽出は従来周知の種々の方法で行って良いが、この実施
例では次に述べるようにして抽出する。The pattern feature extraction unit 62 stores the character data in a character pattern memory (not shown), and as a feature F _{j of the} character pattern (j = 68, 70, 72 or 74 in this example), the line of the character pattern. The width W is extracted. The line width W may be extracted by various conventionally known methods, but in this embodiment, it is extracted as described below.

【００７８】まずパタン特徴抽出部６２は文字データを
走査し、文字データの文字外接枠の上端及び下端位置を
検出する。次に文字外接枠内の文字データが含む黒ビッ
トの総個数Ｐを求め、これと共に文字外接枠内の文字デ
ータを例えば２画素×２画素の広さを有する窓を用いて
線順次に走査しこの窓内の画素が全て黒ビットとなる回
数Ｑを求める。そしてこれらＰ及びＱより従来周知の次
式（５）に従って、一つ一つの文字パタンにつき線幅Ｗ
を求める。First, the pattern feature extraction unit 62 scans the character data and detects the upper and lower end positions of the character circumscribing frame of the character data. Next, the total number P of black bits included in the character data in the character circumscribing frame is determined, and together with this, the character data in the character circumscribing frame is line-sequentially scanned using a window having a width of, for example, 2 pixels × 2 pixels. The number of times Q that all pixels in this window are black bits is calculated. From these P and Q, the line width W for each character pattern is calculated according to the following well-known equation (5).
Ask for.

【００７９】Ｗ＝１／｛１−（Ｑ／Ｐ）｝ ……（５）この実施例の理解を助けるため、図８に示す文字ブロッ
ク６８、７０、７２及び７４に関する第１行目の文字列
とこの文字列の各文字パタンの線幅Ｗとを図９及び図１
０に示す。図９（Ａ）は文字ブロック６８に関する図、
図９（Ｂ）は文字ブロック７２に関する図、図１０
（Ａ）は文字ブロック７０に関する図及び図１０（Ｂ）
は文字ブロック７４に関する図である。W = 1 / {1- (Q / P)} (5) To facilitate understanding of this embodiment, the characters on the first line of the character blocks 68, 70, 72 and 74 shown in FIG. The line and the line width W of each character pattern of this character string are shown in FIG. 9 and FIG.
It shows in 0. FIG. 9A is a diagram regarding the character block 68,
FIG. 9B is a diagram relating to the character block 72, and FIG.
FIG. 10A is a diagram relating to the character block 70 and FIG.
Is a diagram relating to a character block 74.

【００８０】ここでは文字ブロックｊが含む全ての文字
パタンに関して得た線幅Ｗの平均値を、当該文字ブロッ
クｊの文字パタン特徴Ｆ_jとする。この場合、図８に示
す例では文字ブロック６８の特徴Ｆ₆₈はＦ₆₈＝９．１、
文字ブロック７０の特徴Ｆ₇₀はＦ₇₀＝８．５、文字ブロ
ック７２の特徴Ｆ₇₂はＦ₇₂＝４．２及び文字ブロック７
４の特徴Ｆ₇₄はＦ₇₄＝３．９となる。Here, the average value of the line width W obtained for all the character patterns included in the character block j is set as the character pattern feature F _j of the character block j. In this case, in the example shown in FIG. 8, the feature F ₆₈ of the character block ₆₈ is F ₆₈ = 9.1,
The feature F ₇₀ of the character block 70 is F ₇₀ = 8.5, the feature F ₇₂ of the character block ₇₂ is F ₇₂ = 4.2, and the character block 7 is
The feature F ₇₄ of 4 is F ₇₄ = 3.9.

【００８１】順序判定部１８は、上述の（３）式の定数
Ｕを例えばＵ＝３として文字ブロックｊをグループ分け
する。図８に示す文字ブロック６８〜７４においてはＦ
₆₈＝９．１、Ｆ₇₀＝８．５、Ｆ₇₂＝４．２及びＦ₇₄＝
３．９であったので（３）式を満足する文字ブロックの
グループは２つでき、ひとつのグループは文字ブロック
６８及び７０が構成し、他のひとつのグループは文字ブ
ロック７２及び７４が構成する。次に順序判定部１８は
文字ブロックｊに対し各グループ毎に正式の順序を付与
する。図８に示す例では、ひとつのグループを構成する
文字ブロック６８及び７０に対し正式の順序番号１及び
２が付与され、残りの他のグループを構成する文字ブロ
ック７２及び７４に対し正式の順序番号１及び２が付与
される。The order determining unit 18 divides the character block j into groups by setting the constant U in the above equation (3) to U = 3, for example. In the character blocks 68 to 74 shown in FIG.
₆₈ = 9.1, F ₇₀ = 8.5, F ₇₂ = 4.2 and F ₇₄ =
Since it is 3.9, two groups of character blocks satisfying the expression (3) can be formed. One group is composed of the character blocks 68 and 70, and the other group is composed of the character blocks 72 and 74. .. Next, the order determination unit 18 gives the character block j a formal order for each group. In the example shown in FIG. 8, formal sequence numbers 1 and 2 are given to the character blocks 68 and 70 forming one group, and formal sequence numbers are given to the character blocks 72 and 74 forming the remaining other groups. 1 and 2 are given.

【００８２】第一及び第二発明は上述した実施例にのみ
限定されるものではなく、従って各構成成分の入出力信
号、動作の流れ、数値的条件、処理方法及びそのほかを
任意好適に変更することができる。The first and second inventions are not limited to the above-mentioned embodiments, and therefore, the input / output signals of the respective constituent components, the flow of operation, the numerical conditions, the processing method and others are arbitrarily changed. be able to.

【００８３】例えば文字ブロックの抽出方法、文字行及
び文字パタンの切出し方法、文字認識の際の特徴マトリ
クスの作成方法及び類似度算出方法そのほかの処理を、
任意好適な種々の方法に変更できる。For example, a method of extracting a character block, a method of cutting out a character line and a character pattern, a method of creating a feature matrix at the time of character recognition, a method of calculating a degree of similarity, and other processes,
It can be changed to any suitable various methods.

【００８４】また文字パタン特徴を文字高さ或は線幅と
するほか、文字幅、文字ピッチ、行高さ、文字外接枠の
縦横比、文字の傾き、文字パタンの黒画素の分布から得
られる特徴、文字パタンから抽出したサブパタンの線
幅、サブパタンの文字線量及び２種類以上の種類の異な
るサブパタン間の文字線量の差（例えば同一の文字パタ
ンから抽出した垂直及び水平サブパタン間の文字線量の
差）のいずれかひとつとしても良い。また１種類の文字
パタン特徴を用いて文字ブロックをグループ分けするの
みならず、異なる複数種類の文字パタン特徴を用いて文
字ブロックをグループ分けするようにしても良い。複数
種類の文字パタン特徴を用いる場合には、これら複数種
類の文字パタン特徴を用いてより高次なひとつの特徴を
導き出すようにするのが良い。In addition to the character height or line width as the character pattern feature, it can be obtained from the character width, character pitch, line height, aspect ratio of the circumscribing frame, inclination of the character, and distribution of black pixels in the character pattern. Characteristic, line width of sub-pattern extracted from character pattern, character dose of sub-pattern and difference in character dose between two or more different sub-patterns (eg difference in character dose between vertical and horizontal sub-patterns extracted from the same character pattern) ) Any one of Further, not only the character blocks may be grouped by using one type of character pattern feature, but the character blocks may be grouped by using different types of character pattern features. When using a plurality of types of character pattern features, it is preferable to derive a higher-order feature by using these types of character pattern features.

【００８５】また上述した例では第一発明の情報処理装
置を用いて文字認識装置を構成した例につき説明した
が、第一発明の適用を文字認識装置にのみ限定するもの
ではなく、このほか、文字ブロックの順序関係を決定し
決定した順序関係に従って文字ブロック内の画像データ
を順次に出力する装置や、文字ブロックの順序関係を抽
出して文字媒体のレイアウト構造を抽出する装置を構成
するのに第一発明の情報処理装置を用いるようにしても
良い。Further, in the above-mentioned example, an example in which the character recognition device is configured by using the information processing device of the first invention has been described, but the application of the first invention is not limited to the character recognition device. To configure a device that determines the order relationship of character blocks and sequentially outputs the image data in the character blocks according to the determined order relationship, or a device that extracts the order relationship of the character blocks and extracts the layout structure of the character medium. You may make it use the information processing apparatus of 1st invention.

【００８６】[0086]

【発明の効果】上述した説明からも明らかなように、第
一発明の情報処理装置によれば、文字ブロックの位置情
報から位置評価値を求める。これと共に文字ブロックを
文字パタンの特徴が類似するもの同志にグループ分けす
る。そして文字ブロックの順序を各グループ毎に位置評
価値の小さい順或は大きい順に決定する。As is apparent from the above description, according to the information processing apparatus of the first invention, the position evaluation value is obtained from the position information of the character block. Along with this, the character blocks are grouped into comrades having similar character patterns. Then, the order of the character blocks is determined for each group in ascending or descending order of the position evaluation value.

【００８７】従って文字ブロックが含む文字パタンの特
徴を文字ブロック単位で異ならせ、文字ブロックが担う
情報の種類を文字パタンの特徴と対応付けている文字媒
体において文字ブロックの順序を決定する場合、文字ブ
ロックを同一種類の情報毎に精度良く順序付けることが
できる。Therefore, when the characteristic of the character pattern included in the character block is made different for each character block and the type of information carried by the character block is associated with the characteristic of the character pattern, the order of the character block is determined in the character medium. It is possible to accurately order blocks by the same type of information.

【００８８】例えば異なる文脈の文字ブロックを、各文
脈毎に文字パタンの特徴を異ならせて同一紙面に掲載し
てある文書にあっては、文字パタン特徴が類似するグル
ープ毎に文字ブロックの順序を定めることができ、従っ
て各文字ブロックをそれぞれの文脈に沿って精度良く順
序付けることができる。For example, in a document in which character blocks having different contexts are printed on the same paper with different character pattern characteristics for each context, the order of the character blocks may be changed for each group having similar character pattern characteristics. Therefore, each character block can be accurately ordered according to each context.

【００８９】また第二発明の文字認識装置によれば、上
述の第一発明の情報処理装置を備えるので、文字ブロッ
クが含む文字パタンの特徴を文字ブロック単位で異なら
せ、文字ブロックが担う情報の種類を文字パタンの特徴
と対応付けている文字媒体の文字認識において、文字ブ
ロックを同一種類の情報毎に精度良く順序付けて選択す
ることができ、従って文字パタンを同一種類の情報毎に
精度良く順序付けて切り出せる。その結果、例えば複数
の異なる文脈を各文脈毎に文字パタンの特徴を異ならせ
て同一紙面に掲載してある文書の文字認識を行う場合、
文字パタンを、各文脈毎に文脈に沿って精度良く切り出
し認識することができる。文字パタンを文脈に沿って精
度良く認識できる結果、言語処理による認識精度の向上
を効果的に達成しオペレータが誤認識を確認或は訂正す
る作業を軽減し、またオペレータが文脈毎に文字ブロッ
クを順序付ける作業を省け、従って文字認識処理の作業
効率を高めることができる。Further, according to the character recognition device of the second invention, since the information processing device of the first invention described above is provided, the characteristics of the character pattern included in the character block are made different for each character block, and the information carried by the character block is changed. In character recognition of a character medium in which types are associated with characteristics of character patterns, character blocks can be accurately ordered and selected for each information of the same type, and therefore character patterns can be accurately ordered for each information of the same type. Can be cut out. As a result, for example, when character recognition is performed on a document that is printed on the same page with different character patterns for different contexts,
It is possible to accurately cut out and recognize a character pattern for each context according to the context. As a result of accurately recognizing character patterns according to the context, it is possible to effectively improve the recognition accuracy by language processing and reduce the work for the operator to confirm or correct erroneous recognition, and for the operator to recognize character blocks for each context. The work of ordering can be omitted, and therefore the work efficiency of the character recognition processing can be improved.

[Brief description of drawings]

【図１】第一及び第二発明の第一実施例の構成を概略的
に示す機能ブロック図である。FIG. 1 is a functional block diagram schematically showing a configuration of a first embodiment of the first and second inventions.

【図２】文書の一例を示す図である。FIG. 2 is a diagram showing an example of a document.

【図３】（Ａ）及び（Ｂ）はそれぞれ文字ブロックの第
１行目の文字列とこの文字列の各文字パタンの文字外接
枠及び文字高さとを例示した図である。3A and 3B are diagrams respectively illustrating a character string on the first line of a character block, a character circumscribing frame of each character pattern of the character string, and a character height.

【図４】（Ａ）及び（Ｂ）はそれぞれ文字ブロックの第
１行目の文字列とこの文字列の各文字パタンの文字外接
枠及び文字高さとを例示した図である。FIGS. 4A and 4B are diagrams illustrating a character string on the first line of a character block, a character circumscribing frame and a character height of each character pattern of the character string, respectively.

【図５】認識部のより具体的な構成の一例を示す図であ
る。FIG. 5 is a diagram showing an example of a more specific configuration of a recognition unit.

【図６】第一及び第二発明の第二実施例の構成を概略的
に示す機能ブロック図である。FIG. 6 is a functional block diagram schematically showing a configuration of a second embodiment of the first and second inventions.

【図７】第一及び第二発明の第三実施例の構成を概略的
に示す機能ブロック図である。FIG. 7 is a functional block diagram schematically showing a configuration of a third embodiment of the first and second inventions.

【図８】文書の他の例を示す図である。FIG. 8 is a diagram showing another example of a document.

【図９】（Ａ）及び（Ｂ）はそれぞれ文字ブロックの第
１行目の文字列とこの文字列の各文字パタンの文字線幅
とを例示した図である。9A and 9B are diagrams respectively illustrating a character string on the first line of a character block and a character line width of each character pattern of this character string.

【図１０】（Ａ）及び（Ｂ）はそれぞれ文字ブロックの
第１行目の文字列とこの文字列の各文字パタンの文字線
幅とを例示した図である。10A and 10B are diagrams respectively illustrating a character string on the first line of a character block and a character line width of each character pattern of this character string.

[Explanation of symbols]

１０、４８、６０：情報処理装置１２、５０：文字ブロック抽出部１４、２６、５２：切出し部１６、６２：パタン特徴抽出部１８：順序判定部２０、５６、６４：文字認識装置２２：画像生成部２４、５８：ブロック選択部２８：認識部 10, 48, 60: information processing device 12, 50: character block extraction unit 14, 26, 52: cutout unit 16, 62: pattern feature extraction unit 18: order determination unit 20, 56, 64: character recognition device 22: image Generation unit 24, 58: Block selection unit 28: Recognition unit

Claims

[Claims]

1. An information processing apparatus comprising: a character block extraction unit that extracts position information of a character block from image data of a character medium; and an order determination unit that determines an order relationship of the character blocks. A cutout unit that extracts cutout information of the pattern, and a pattern feature extraction unit that extracts the feature of the character pattern for each character block using the cutout information of the character pattern, the order determination unit, the order determination unit, the character block The position evaluation value is obtained from the position information, and the character blocks are divided into groups having similar character pattern characteristics, and the order of the character blocks is determined for each group in ascending or descending position evaluation value. An information processing device characterized by the above.

2. The characteristics of the character pattern are obtained from the character height, the character width, the character pitch, the line height, the aspect ratio of the character circumscribing frame, the character line width, the character inclination, and the black pixel distribution of the character pattern. Features, line width of sub patterns extracted from character patterns,
The information processing apparatus according to claim 1, wherein one or more of the character dose of the sub-pattern and the difference of the character dose of the two or more sub-patterns are provided.

3. The information processing apparatus according to claim 1, wherein the characteristic of the character pattern is an average characteristic obtained for all or some of the character patterns included in the character block.

4. An image generation unit for generating image data of a character medium, and character blocks included in the image data are grouped into groups having similar character patterns, and the order of the character blocks is determined for each group. The information processing apparatus according to claim 1, a block selection unit that sequentially selects character blocks in each group according to a determined order, and a cutout unit that sequentially selects character patterns from the character blocks. And a recognition unit for recognizing the character pattern.