JP2002312719A

JP2002312719A - Character segmentation device, character segmentation method and program for character segmentation

Info

Publication number: JP2002312719A
Application number: JP2001112309A
Authority: JP
Inventors: Naganori Ishidera; 永記石寺
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2001-04-11
Filing date: 2001-04-11
Publication date: 2002-10-25

Abstract

PROBLEM TO BE SOLVED: To provide a character segmentation device and a character segmentation method, by which characters are segmented correctly without the prerequisite of row extraction even in the case that a plurality of rows are inputted simultaneously. SOLUTION: The character segmentation device is provided with a character string image storage part 11 for storing character string images, a connection component extraction part 12 for binarizing the character string images and extracting connection components, a minimum unit preparation part 13 for judging whether or not they are in contact for all the connection components, turning the connection component itself to a minimum unit at the time of non- contact and cutting the connection component and newly preparing the minimum unit at the time of contact, a two-dimensional connection relation preparation part 14 for obtaining the connection relation of the minimum units with each other, a character segmentation candidate output part 15 for preparing a combination pattern by a plurality of the minimum units in the connection relation with each other and outputting the pattern with the high possibility of being a correct character as a character segmentation candidate, and a character segmentation establishment part 16 for performing a recognition processing, a language knowledge processing and layout analysis on the basis of the character segmentation candidate and outputting a character segmentation result.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は光学的に走査された
文字列画像から文字の切り出しを行う装置、方法に関
し、特に、文字行の抽出を前提としないで文字切り出し
を行う文字切り出し装置及び文字切り出し方法並びに文
字切り出し用プログラムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus and a method for extracting a character from an optically scanned character string image, and more particularly to a character extraction apparatus and a character for extracting a character without assuming extraction of a character line. The present invention relates to a clipping method and a character clipping program.

【０００２】[0002]

【従来の技術】郵便物や帳票上に記載された文字列を光
学的文字読み取り装置(ＯＣＲ)で読み取って文字認識す
る場合、認識処理に先立って前処理が実施され、この前
処理において、文字行を１行づつ抽出し、その後１文字
を切り出す際の候補となる幾つかの文字切り出し候補が
求められる。そして、文字切り出し候補に対して認識処
理を施し、言語的な知識処理を施すことによって文字切
り出し結果と文字列の読み取り結果が得られる。2. Description of the Related Art When a character string written on a mail or a form is read by an optical character reading device (OCR) and character recognition is performed, preprocessing is performed prior to recognition processing. Lines are extracted line by line, and then several character cutout candidates that are candidates for cutting out one character are obtained. Then, a recognition process is performed on the character cutout candidate, and a linguistic knowledge process is performed to obtain a character cutout result and a character string reading result.

【０００３】[0003]

【発明が解決しようとする課題】従来、このような文字
切り出し候補の生成では、始めに入力された文字列パタ
ーンから、一行づつ行を抽出することが必要であった。
つまり、従来の技術ではすでに行が一行づつ抽出された
状態を前提としていたので、複数行が同時に入力された
とき、正しく一行づつ切り出すことに失敗した場合、正
しく文字切り出し候補を生成することが困難であった。Conventionally, in generating such a character segmentation candidate, it has been necessary to extract lines one by one from a character string pattern input first.
In other words, in the conventional technology, it is assumed that lines are already extracted line by line, so if multiple lines are input at the same time, if it fails to cut out one line at a time, it is difficult to generate character extraction candidates correctly. Met.

【０００４】例えば、特願平7-8793号公報に開示されて
いる文字切り出し候補を作成するための技術は、入力と
して正しく一行づつ入力されることを前提としている。
また、特願平8-52844号公報や特願平11-30686号公報や
特願平11-57792号公報に開示されている技術は、すべて
文字行(文字列)を抽出する処理が初期段階で施されてい
る。これらの技術では、いずれの方式を用いても入力パ
ターンから正しく一行づつ切り出すことに失敗した場
合、正しく文字切り出し候補を生成することが困難であ
る。[0004] For example, the technique for creating a character segmentation candidate disclosed in Japanese Patent Application No. 7-8793 is based on the premise that characters are correctly input line by line.
Further, in the technologies disclosed in Japanese Patent Application Nos. 8-52844, 11-30686 and 11-57792, the process of extracting character lines (character strings) is an initial stage. It is given in. With these techniques, it is difficult to correctly generate a character cutout candidate if it fails to cut out one line at a time from the input pattern using any of the methods.

【０００５】一方で、文字列が複数でしかも蛇行してい
ても、１文字毎に文字を正確に切り出すことができる技
術が特願平7-236232号公報に開示されているが、やは
り、初期段階の処理において文字列を切り出すための処
理が施されており、文字列抽出に失敗した場合は、やは
り正しく文字切り出し候補を生成することが困難であ
る。On the other hand, Japanese Patent Application No. 7-236232 discloses a technique for accurately extracting characters one by one even if a plurality of character strings meander. A process for extracting a character string is performed in the processing at the stage, and if extraction of the character string fails, it is also difficult to correctly generate a character extraction candidate.

【０００６】例えば、図２(a)に示すような文字列が入
力された場合、これが一行から成るものか、二行からな
るものかを正しく判定することは容易ではなく、もし図
２(b)のように二行として判定された場合は、一行目が
「月月丁」二行目が「ナ日田三」となってしまい、正し
く「有明町三」と認識できるような文字切り出し候補を
生成することは不可能であるし、同様に読み取りも困難
である。For example, when a character string as shown in FIG. 2A is input, it is not easy to correctly determine whether the character string is composed of one line or two lines. ), The first line is `` Tsukitsukicho '' and the second line is `` Na Hita 3 '', and character extraction candidates that can be correctly recognized as `` Ariake Machi 3 '' It is impossible to generate and similarly difficult to read.

【０００７】一方で、正しく一行づつの入力を前提とし
なくても正しく文字列を読み取ることができる方式が特
願平7-203148号公報に開示されているが、この方式で
も、文字切り出し候補生成時に正しい候補が生成されな
かった場合には、正しく文字列を読み取ることは不可能
である。また、この方式によると、文字候補の作成は、
黒画素の連結成分のあらゆる組み合わせを考えることに
なるため、組み合わせの爆発が起こり、処理が破綻する
可能性があるとともに、接触した文字パターンが含まれ
ている場合、正しく文字切り出しを行うことが困難にな
るという問題があった。On the other hand, Japanese Patent Application No. 7-203148 discloses a method capable of correctly reading a character string without assuming a correct line-by-line input. Sometimes, when a correct candidate is not generated, it is impossible to read a character string correctly. Also, according to this method, creation of character candidates is
Since all combinations of connected components of black pixels are considered, explosion of combinations may occur and processing may be broken, and it is difficult to cut out characters correctly when a touched character pattern is included There was a problem of becoming.

【０００８】そのため、文字列の読み取り精度向上の観
点からは、複数行が同時に入力された場合でも、行抽出
を行うことを前提としないで、組み合わせが爆発するこ
となく、接触した文字パターンも正しく切断できて、正
しい文字切り出しを行うことが可能な装置が必要である
と言える。Therefore, from the viewpoint of improving the reading accuracy of a character string, even when a plurality of lines are input at the same time, without assuming that line extraction is performed, the combination does not explode and the contacted character pattern is correctly detected. It can be said that a device that can be cut and can perform correct character segmentation is required.

【０００９】本発明は、上記問題点に鑑みてなされたも
のであって、その主たる目的は、複数行が同時に入力さ
れた場合でも、行抽出を行うことを前提としないで、組
み合わせが爆発することなく、接触した文字パターンも
正しく切断できて、正しい文字切り出しを行うことが可
能な文字切り出し装置及び文字切り出し方法並びに文字
切り出し用プログラムを提供することにある。The present invention has been made in view of the above problems, and a main object of the present invention is that even when a plurality of lines are inputted at the same time, the combination explodes without assuming that line extraction is performed. It is another object of the present invention to provide a character cutout device, a character cutout method, and a character cutout program that can cut a contacted character pattern correctly and perform correct character cutout.

【００１０】[0010]

【課題を解決するための手段】上記目的を達成するた
め、本発明の文字切り出し装置は、文字列画像から文字
切り出し候補を生成する文字切り出し装置において、文
字列画像を入力して記憶する文字列画像記憶部と、前記
文字列画像を二値化して連結成分を抽出する連結成分抽
出部と、全ての連結成分に対して接触しているかどうか
の判定を行い、連結成分が接触していない判定されたと
きに該連結成分自体を最小単位とし、連結成分が接触し
ていると判定されたときに該連結成分を切断して新たに
最小単位を作成する最小単位作成部と、二次元空間上に
配置された状態の前記最小単位同士の二次元的な接続関
係を求める二次元接続関係作成部と、互いに接続関係に
ある複数の前記最小単位同士を組み合わせて組み合わせ
パターンを作成し、正しい文字である可能性の高い組み
合わせパターンを文字切り出し候補として出力する文字
切り出し候補出力部と、を少なくとも有するものであ
る。In order to achieve the above object, a character cutout device according to the present invention is a character cutout device for generating a character cutout candidate from a character string image. An image storage unit, a connected component extraction unit that binarizes the character string image to extract connected components, and determines whether or not all connected components are in contact with each other, and determines whether the connected components are not in contact with each other. A minimum unit creation unit that creates the minimum unit by cutting the connected component when it is determined that the connected component is in contact with the connected component itself as a minimum unit when A two-dimensional connection relationship creating unit that determines a two-dimensional connection relationship between the minimum units arranged in a state, and creates a combination pattern by combining a plurality of the minimum units in a connection relationship with each other, A character segmentation candidate output unit for outputting a likely combination pattern is Shii characters as character segmentation candidates, the one having at least.

【００１１】また、本発明の文字切り出し装置は、文字
列画像から文字切り出し候補を生成する文字切り出し装
置において、文字列画像を入力して記憶する文字列画像
記憶部と、前記文字列画像を二値化して連結成分を抽出
する連結成分抽出部と、全ての連結成分に対して接触し
ているかどうかの判定を行い、連結成分が接触していな
い判定されたときに該連結成分自体を最小単位とし、連
結成分が接触していると判定されたときに該連結成分を
切断して新たに最小単位を作成する最小単位作成部と、
二次元空間上に配置された状態の前記最小単位同士の二
次元的な接続関係を求める二次元接続関係作成部と、互
いに接続関係にある複数の前記最小単位同士を組み合わ
せて組み合わせパターンを作成し、正しい文字である可
能性の高い組み合わせパターンを文字切り出し候補とし
て出力する文字切り出し候補出力部と、該文字切り出し
候補を基に、認識処理と言語知識処理とレイアウト解析
を行い、文字切り出し結果を出力する文字切り出し確定
部と、を少なくとも有するものである。Further, according to the character extracting apparatus of the present invention, a character extracting apparatus for generating a character extracting candidate from a character string image is provided. A connected component extraction unit that extracts a connected component by converting the value into a value and determines whether or not all connected components are in contact with each other. When it is determined that the connected components are not in contact with each other, the connected component itself is a minimum unit. And a minimum unit creating unit that creates a new minimum unit by cutting the connected component when it is determined that the connected component is in contact,
A two-dimensional connection relationship creating unit that determines a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space, and a combination pattern created by combining a plurality of the minimum units having a connection relationship with each other. A character cutout candidate output unit that outputs a combination pattern likely to be a correct character as a character cutout candidate, and performs a recognition process, a linguistic knowledge process, and a layout analysis based on the character cutout candidate, and outputs a character cutout result And a character cutout confirming unit.

【００１２】本発明においては、前記最小単位作成部に
おいて、前記文字列画像から抽出された全連結成分の幅
及び高さの平均値と、着目する連結成分の幅及び高さと
を参照して、接触を起こしている連結成分かどうかが統
計的に判定される構成とすることができ、更に、前記着
目する連結成分形状を表す画像パターンを参照して、接
触を起こしている連結成分かどうかが統計的に判定され
る構成とすることもできる。In the present invention, the minimum unit creation unit refers to the average value of the width and height of all connected components extracted from the character string image and the width and height of the connected component of interest, It can be configured to be statistically determined whether or not the connected components that are in contact, further, with reference to the image pattern representing the connected component shape of interest, whether the connected components that are in contact It is also possible to adopt a configuration that is determined statistically.

【００１３】また、本発明においては、前記二次元接続
関係作成部において、前記文字列画像から抽出された前
記最小単位の隣接関係を求めるに際し、ボロノイ分割に
よるボロノイ辺を共有する最小単位間は隣接関係にある
と判断され、更に、該隣接関係にある最小単位同士が二
つのボロノイ辺を共有する場合には、前記最小単位同士
は隣接関係にないと判断されることが好ましい。In the present invention, in the two-dimensional connection relation creation unit, when obtaining the adjacent relation of the minimum unit extracted from the character string image, the minimum unit sharing a Voronoi side by Voronoi division is adjacent to the minimum unit. When it is determined that there is a relationship, and when the adjacent minimum units share two Voronoi sides, it is preferable that it is determined that the minimum units are not adjacent.

【００１４】また、本発明においては、前記二次元接続
関係作成部において、隣接関係にあると判断された前記
複数の最小単位の各々の幅、高さ及び中心座標値と、前
記複数の最小単位を組み合わせて作成される外接矩形の
幅、高さ及び中心座標値と、前記文字列画像から抽出さ
れた全最小単位の幅及び高さの平均値とを参照して、統
計的に評価して接続関係が求められる構成とすることが
でき、更に、前記複数の最小単位の各々の形状を表す画
像パターンを参照して、接続関係が求められる構成とす
ることもできる。Also, in the present invention, the two-dimensional connection relationship creating unit may include a width, height, and center coordinate value of each of the plurality of minimum units determined to be adjacent to each other, and the plurality of minimum units. By referring to the width, height and center coordinate values of the circumscribed rectangle created by combining and the average value of the width and height of all the minimum units extracted from the character string image, statistically evaluated The connection relationship may be determined, and further, the connection relationship may be determined with reference to the image patterns representing the shapes of the plurality of minimum units.

【００１５】また、本発明においては、前記文字切り出
し候補出力部において、前記組み合わせパターンを作成
するに際し、前記二次元接続関係作成部で求められた前
記接続関係をたどることにより複数の最小単位の組み合
わせパターンが作成されることが好ましい。Further, in the present invention, the character segmentation candidate output unit traces the connection relationship obtained by the two-dimensional connection relationship creation unit when creating the combination pattern, thereby combining a plurality of minimum units. Preferably, a pattern is created.

【００１６】また、本発明においては、前記文字切り出
し候補出力部において、前記組み合わせパターンの幅、
高さ及び中心座標値と、前記文字列画像から抽出された
全最小単位の幅及び高さの平均値とを参照して、又は、
前記組み合わせパターンの幅、高さ及び中心座標値と、
該組み合わせパターンに隣接する前記最小単位の幅、高
さ及び中心座標値と、前記組み合わせパターン及び前記
最小単位を組み合わせて作成される外接矩形幅、高さ及
び中心座標値と、前記文字列画像から抽出された全最小
単位の幅及び高さの平均値とを参照して、前記組み合わ
せパターンに隣接する前記最小単位の数と同じ回数だけ
統計的に評価して、前記組み合わせパターンを文字切り
出し候補として出力するかどうかが決定される構成とす
ることができ、更に、前記組み合わせパターンの形状を
表す画像パターンと、該組み合わせパターンに隣接する
前記最小単位の形状を表す画像パターンとを参照して、
前記組み合わせパターンを文字切り出し候補として出力
するかどうかが決定される構成とすることもできる。Also, in the present invention, the character segmentation candidate output section may include a width of the combination pattern,
With reference to the height and the center coordinate value, and the average value of the width and height of all the minimum units extracted from the character string image, or
Width, height and center coordinate value of the combination pattern,
From the width, height and center coordinate values of the minimum unit adjacent to the combination pattern, the circumscribed rectangle width, height and center coordinate value created by combining the combination pattern and the minimum unit, and from the character string image With reference to the average value of the width and height of all the extracted minimum units, statistical evaluation is performed for the same number of times as the number of the minimum units adjacent to the combination pattern, and the combination pattern is used as a character cutout candidate. Whether or not to output can be determined, further, with reference to an image pattern representing the shape of the combination pattern and an image pattern representing the shape of the minimum unit adjacent to the combination pattern,
A configuration may be adopted in which it is determined whether to output the combination pattern as a character segmentation candidate.

【００１７】本発明の文字切り出し方法は、文字列画像
から文字切り出し候補を生成する文字切り出し方法にお
いて、文字列画像を入力して記憶する文字列画像記憶ス
テップと、前記文字列画像を二値化して連結成分を抽出
する連結成分抽出ステップと、全ての連結成分に対して
接触しているかどうかの判定を行い、連結成分が接触し
ていない判定されたときに該連結成分自体を最小単位と
し、連結成分が接触していると判定されたときに該連結
成分を切断して新たに最小単位を作成する最小単位作成
ステップと、二次元空間上に配置された状態の前記最小
単位同士の二次元的な接続関係を求める二次元接続関係
作成ステップと、互いに接続関係にある複数の前記最小
単位同士を組み合わせて組み合わせパターンを作成し、
正しい文字である可能性の高い組み合わせパターンを文
字切り出し候補として出力する文字切り出し候補出力ス
テップと、を少なくとも有するものである。A character extracting method according to the present invention is a character extracting method for generating a character extracting candidate from a character string image, wherein a character string image storing step of inputting and storing a character string image, and binarizing the character string image. A connected component extraction step of extracting connected components, and determining whether or not all connected components are in contact, and when it is determined that the connected components are not in contact, the connected component itself is used as a minimum unit, A minimum unit creation step of cutting the connected component to create a new minimum unit when it is determined that the connected component is in contact with the connected component, and a two-dimensional configuration of the minimum units arranged in a two-dimensional space. Two-dimensional connection relationship creating step to determine a typical connection relationship, to create a combination pattern by combining a plurality of the minimum units in a connection relationship with each other,
A character extraction candidate output step of outputting a combination pattern having a high possibility of being a correct character as a character extraction candidate.

【００１８】また、本発明の文字切り出し方法は、文字
列画像から文字切り出し候補を生成する文字切り出し方
法において、文字列画像を入力して記憶する文字列画像
記憶ステップと、前記文字列画像を二値化して連結成分
を抽出する連結成分抽出ステップと、全ての連結成分に
対して接触しているかどうかの判定を行い、連結成分が
接触していない判定されたときに該連結成分自体を最小
単位とし、連結成分が接触していると判定されたときに
該連結成分を切断して新たに最小単位を作成する最小単
位作成ステップと、二次元空間上に配置された状態の前
記最小単位同士の二次元的な接続関係を求める二次元接
続関係作成ステップと、互いに接続関係にある複数の前
記最小単位同士を組み合わせて組み合わせパターンを作
成し、正しい文字である可能性の高い組み合わせパター
ンを文字切り出し候補として出力する文字切り出し候補
出力ステップと、該文字切り出し候補を基に、認識処理
と言語知識処理とレイアウト解析を行い、文字切り出し
結果を出力する文字切り出し確定ステップと、を少なく
とも有するものである。The character extracting method according to the present invention is a character extracting method for generating a character extracting candidate from a character string image, wherein a character string image storing step of inputting and storing a character string image is performed. A connected component extracting step of extracting connected components by converting the value to a value, and determining whether or not all the connected components are in contact. When it is determined that the connected components are not in contact, the connected component itself is the minimum unit. And a minimum unit creation step of cutting the connected component to create a new minimum unit when it is determined that the connected components are in contact with each other, and the minimum unit between the minimum units arranged in a two-dimensional space. A two-dimensional connection relationship creation step for obtaining a two-dimensional connection relationship, and combining the plurality of minimum units having a connection relationship with each other to create a combination pattern, A character cutout candidate output step of outputting a combination pattern having a high possibility as a character cutout candidate, and a character cutout that performs recognition processing, linguistic knowledge processing, and layout analysis based on the character cutout candidate and outputs a character cutout result And at least steps.

【００１９】本発明のプログラムは、文字列画像を入力
して記憶する文字列画像記憶処理と、前記文字列画像を
二値化して連結成分を抽出する連結成分抽出処理と、全
ての前記連結成分に対して接触しているかどうかの判定
を行い、連結成分が接触していない判定されたときに該
連結成分自体を最小単位とし、連結成分が接触している
と判定されたときに該連結成分を切断して新たに最小単
位を作成する最小単位作成処理と、二次元空間上に配置
された状態の前記最小単位同士の二次元的な接続関係を
求める二次元接続関係作成処理と、互いに接続関係にあ
る複数の前記最小単位同士を組み合わせて組み合わせパ
ターンを作成し、正しい文字である可能性の高い組み合
わせパターンを文字切り出し候補として出力する文字切
り出し候補出力処理と、をコンピュータに実行させるも
のである。A program according to the present invention includes a character string image storing process for inputting and storing a character string image, a connected component extracting process for binarizing the character string image and extracting a connected component, It is determined whether or not the connected component is in contact, and when it is determined that the connected component is not in contact, the connected component itself is used as a minimum unit, and when it is determined that the connected component is in contact, the connected component is determined. A minimum unit creation process for creating a new minimum unit by cutting off, a two-dimensional connection relationship creation process for finding a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space, and connecting with each other A character cutout candidate output process that creates a combination pattern by combining a plurality of the minimum units having a relationship and outputs a combination pattern that is likely to be a correct character as a character cutout candidate If is intended to execute on a computer.

【００２０】また、本発明のプログラムは、文字列画像
を入力して記憶する文字列画像記憶処理と、前記文字列
画像を二値化して連結成分を抽出する連結成分抽出処理
と、全ての前記連結成分に対して接触しているかどうか
の判定を行い、連結成分が接触していない判定されたと
きに該連結成分自体を最小単位とし、連結成分が接触し
ていると判定されたときに該連結成分を切断して新たに
最小単位を作成する最小単位作成処理と、二次元空間上
に配置された状態の前記最小単位同士の二次元的な接続
関係を求める二次元接続関係作成処理と、互いに接続関
係にある複数の前記最小単位同士を組み合わせて組み合
わせパターンを作成し、正しい文字である可能性の高い
組み合わせパターンを文字切り出し候補として出力する
文字切り出し候補出力処理と、該文字切り出し候補を基
に、認識処理と言語知識処理とレイアウト解析とを行
い、文字切り出し結果を出力する文字切り出し確定処理
と、をコンピュータに実行させるものである。Further, the program of the present invention includes a character string image storing process for inputting and storing a character string image, a connected component extracting process for binarizing the character string image and extracting a connected component, and It is determined whether or not the connected component is in contact, and when it is determined that the connected component is not in contact, the connected component itself is used as the minimum unit, and when it is determined that the connected component is in contact, the A minimum unit creation process of cutting a connected component to create a new minimum unit, and a two-dimensional connection relationship creation process of finding a two-dimensional connection relationship between the minimum units in a state arranged in a two-dimensional space, A character cutout candidate that creates a combination pattern by combining a plurality of the minimum units connected to each other and outputs a combination pattern that is likely to be a correct character as a character cutout candidate And power processing, based on the character segmentation candidate performs the recognition processing and language knowledge processing and layout analysis, in which to execute the character cut-out determination processing of outputting the character extraction result to the computer.

【００２１】[0021]

【発明の実施の形態】次に、本発明の一実施の形態につ
いて、図面を参照して詳細に説明する。図１は本発明の
実施の形態に係る文字切り出し装置の構成を示すブロッ
ク図である。図１に示す実施の形態の構成は、例えば、
光学的手段を用いて文字列画像を読み取る文字列画像読
み取り装置１８で取得した文字列画像を入力して記憶す
る文字列画像記憶部１１と、文字列画像を二値化して連
結成分を抽出する連結成分抽出部１２と、全ての連結成
分に対して接触しているかどうかの判定を行い、連結成
分が接触していない判定されたときにその連結成分自体
を最小単位とし、連結成分が接触していると判定された
ときにその連結成分を切断して新しく最小単位を作成す
る最小単位作成部１３と、二次元空間上に配置された状
態の最小単位同士の二次元的な接続関係を求める二次元
接続関係作成部１４と、互いに接続関係にある複数の最
小単位同士を組み合わせて組み合わせパターンを作成
し、正しい文字である可能性の高い組み合わせパターン
を文字切り出し候補として出力する文字切り出し候補出
力部１５と、文字切り出し候補を基に、認識処理と言語
知識処理とレイアウト解析を行い、文字切り出し結果を
出力する文字切り出し確定部１６と、がコンピュータ１
７内の記憶／演算部に組み込まれており、文字切り出し
確定部１６にはその結果を表示する表示手段１９が接続
され、更に、コンピュータ１７には上記手段１１〜１６
をプログラムとして格納する記憶媒体２０が接続されて
いる。Next, an embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram showing a configuration of a character segmenting device according to an embodiment of the present invention. The configuration of the embodiment shown in FIG.
A character string image storage unit 11 that inputs and stores a character string image obtained by a character string image reading device 18 that reads a character string image using optical means, and binarizes the character string image to extract connected components. The connected component extraction unit 12 determines whether or not all connected components are in contact. When it is determined that the connected components are not in contact, the connected component itself is used as the minimum unit, and the connected components are contacted. The minimum unit creation unit 13 that cuts the connected component to create a new minimum unit when it is determined that the minimum unit is present, and obtains a two-dimensional connection relationship between the minimum units arranged in the two-dimensional space. A combination pattern is created by combining the two-dimensional connection relationship creation unit 14 with a plurality of minimum units having a connection relationship with each other, and a combination pattern that is likely to be a correct character is a character cutout candidate. A character cutout candidate output unit 15 to and outputting, based on the character segmentation candidate performs recognition processing and language knowledge processing and layout analysis, and character segmentation determination unit 16 for outputting a character segmentation results, but the computer 1
7, a display means 19 for displaying the result is connected to the character cutout determination section 16, and the computer 17 is provided with the above means 11-16.
Is stored as a program.

【００２２】次に、本実施の形態の動作について説明す
る。基本的に、本発明は入力された文字列パターンを、
始めに細かい部分パターンに分割し、その部分パターン
を再度統合することによって文字切り出し候補を決定す
る方式であり、その点では従来から知られている方式と
大差はない。ただし、従来から知られている方式と大き
くことなる点は、本発明が入力として、すでに正しく一
行づつ切り出されたものを前提としていない点である。
このような文字切り出し候補を与え、特願平7-203148号
公報に記載されている文字認識コストと言語的コストと
レイアウトコストを同時に用いて文字列を読み取る方式
を用いれば、複数行が同時に入力された場合でも、行抽
出を行うことを前提としないで、組み合わせが爆発する
ことなく、接触した文字パターンも正しく切断できて、
正しい文字切り出しを行うことが可能になる。Next, the operation of this embodiment will be described. Basically, the present invention converts the input character string pattern into
This is a method in which a character segmentation candidate is determined by first dividing the pattern into fine partial patterns and then integrating the partial patterns again, and in that respect, there is no great difference from a conventionally known method. However, a significant difference from the conventionally known method is that the present invention does not assume that the input has already been correctly cut out line by line.
Given such a character segmentation candidate, if a method of reading a character string using simultaneously the character recognition cost, linguistic cost, and layout cost described in Japanese Patent Application No. 7-203148, a plurality of lines are input simultaneously. Even if it is done, without assuming that line extraction is performed, the character pattern that touched can be cut correctly without explosion of the combination,
Correct character segmentation can be performed.

【００２３】文字列画像記憶部１１では、例えば、光学
的に走査された文字列画像を入力して記憶する。これ
は、通常のスキャナとメモリーを用いて容易に実現が可
能である。連結成分抽出部１２では文字列画像を入力し
て、二値化した後に文字切り出しのための連結成分を抽
出する。The character string image storage section 11 inputs and stores, for example, a character string image scanned optically. This can be easily realized using a normal scanner and memory. The connected component extraction unit 12 inputs a character string image, binarizes the image, and extracts a connected component for character extraction.

【００２４】この処理の実現にはいくつかの方法が考え
られる。例えば、二値化は大津の判別分析による二値化
を用いることで容易に実現可能である。連結成分の抽出
には例えば、通常のラベリング処理や輪郭追跡処理によ
って連結成分の抽出は容易に実現が可能であるし、他に
も、通常のラベリング処理によって連結成分を抽出した
後に輪郭追跡を施して連結成分の輪郭のチェインコード
も同時に抽出することができる。また、連結成分自体を
画像パターンとして格納することも可能であるし、連結
成分の外接矩形を格納することも可能である。図を用い
て説明すると、図２(a)のような二値化されたパターン
が入力されると図３のように８個の連結成分(CC1からCC
8まで)が抽出される。図３の例では、見やすさのため
に、各連結成分の外接矩形も表示していると同時に、外
接矩形の中心座標が高い順に連結成分の順番を決めてい
る。There are several methods for realizing this processing. For example, binarization can be easily realized by using binarization based on Otsu's discriminant analysis. For the extraction of connected components, for example, extraction of connected components can be easily realized by normal labeling processing and contour tracking processing.In addition, contour tracking is performed after extracting connected components by normal labeling processing. Thus, the chain code of the outline of the connected component can be simultaneously extracted. Further, the connected component itself can be stored as an image pattern, and a circumscribed rectangle of the connected component can be stored. To explain with reference to the drawing, when a binarized pattern as shown in FIG. 2A is input, eight connected components (CC1 to CC
Up to 8) are extracted. In the example of FIG. 3, the circumscribed rectangle of each connected component is also displayed for easy viewing, and the order of the connected components is determined in descending order of the center coordinates of the circumscribed rectangle.

【００２５】図２(a)のようなパターンが入力され、連
結成分抽出部１２で得られた連結成分のデータとして
は、図４のように、連結成分の通し番号であるIDと、外
接矩形の幅と高さ(W_i,H_i)と、外接矩形中心の座標(X_i,Y
_i)と、連結成分の画像自体x_iと、チェインコードの追跡
開始点座標(C_xi,C_yi)と、チェインコード長CL_iと、チェ
インコードC_iとが連結成分情報として出力される。ここ
で、添字のiはi番目の連結成分を意味し、IDの番号と対
応している。また、図４においては、簡単のため、多次
元のベクトル情報である画像パターンx_iと、チェインコ
ードC_iの中身についての記載は省略する。なお、画像パ
ターンのデータ長は外接矩形の幅と高さから知ることが
可能である。A pattern as shown in FIG. 2A is input, and the data of the connected component obtained by the connected component extraction unit 12 includes an ID which is a serial number of the connected component and a circumscribed rectangle as shown in FIG. The width and height (W _i , H _i ) and the coordinates (X _i , Y
_i ), the connected component image itself x _i , the chain code tracking start point coordinates (C _xi , C _yi ), the chain code length CL _i, and the chain code C _i are output as connected component information. Here, the subscript i means the i-th connected component, and corresponds to the ID number. Further, in FIG. 4, for simplicity, the image pattern x _i is a multi-dimensional vector information, the description of the contents of the chain code C _i is omitted. The data length of the image pattern can be known from the width and height of the circumscribed rectangle.

【００２６】さて、連結成分抽出部１２で得られた連結
成分は、図３のCC4の例のように複数の文字が接触を起
こしたことによって一つの連結成分として抽出されてし
まう場合がある。つまり、連結成分抽出部１２では入力
された文字列画像の文字に接触があった場合に、接触箇
所が分離できないので、最小単位作成部１３において接
触した連結成分を分離する。最小単位作成部１３の処理
は大きく分けると二つの処理から成る。一つ目は接触を
起こしている連結成分の検出であり、二つ目は接触箇所
の切断である。Now, the connected component obtained by the connected component extraction unit 12 may be extracted as one connected component due to contact between a plurality of characters as in the example of CC4 in FIG. In other words, when the character of the input character string image is touched by the connected component extraction unit 12, the touched part cannot be separated. Therefore, the connected component touched by the minimum unit creation unit 13 is separated. The processing of the minimum unit creation unit 13 is roughly divided into two processings. The first is the detection of the connected component that is making contact, and the second is the disconnection of the contact point.

【００２７】接触した連結成分を分離するためには、ま
ず始めにどの連結成分が接触を起こしているか検出しな
ければならない。この検出にはいくつかの方法が考えら
れるが、例えば、入力画像から得られた全連結成分の外
接矩形を考え、この全外接矩形の幅と高さの平均を(W
_Cmean,H_Cmean)として求め、この平均値と、着目する連
結成分の幅と高さ(W_i,H_i)を用いて接触の有無の検出を
行うことができる。ここで、添字のiはi番目の連結成分
であることを意味する。着目する連結成分が接触してい
るかどうかの判定に用いる特徴量として、接触検出特徴
(W_Cmean,H_Cmean,W _i,H_i)という４次元のベクトルを考
え、これを用いて統計的に学習を行う方法が考えられ
る。または、(W_i/W_Cmean,H_i/H_Cmean)の２次元ベクトル
を用いることも可能である。In order to separate the connected connected components,
First detect which connected components are in contact.
I have to. There are several ways to detect this.
However, for example, out of all the connected components obtained from the input image,
Consider the tangent rectangle and calculate the average of the width and height of all circumscribed rectangles as (W
_Cmean, H_Cmean).
The width and height of the component (W_i, H_i) To detect contact
It can be carried out. Where the subscript i is the ith connected component
Means that The connected components of interest are in contact
Whether the contact detection feature
(W_Cmean, H_Cmean, W _i, H_iConsider the four-dimensional vector
It is possible to use this method to perform statistical learning.
You. Or (W_i/ W_Cmean, H_i/ H_Cmean) Two-dimensional vector
Can also be used.

【００２８】例えば、学習用の文字列画像を準備し、あ
らかじめ(W_Cmean,H_Cmean)と、全連結成分の(W_i,H_i)を求
めておき、接触を起こしている連結成分の教師データを
T_i=1、そうでない連結成分の教師データをT_i=0と置き学
習を行うことができる。統計的学習として例えば回帰分
析を行う場合、(W_Cmean,H_Cmean,W_i,H_i)を説明変数とし
て、教師データTiを推定する問題として考えることがで
きる。同様に(W_i/W_C _mean,H_i/H_Cmean)を説明変数とし
て、教師データTiを推定する問題として考えることもで
きる。つまり、説明変数に接触検出特徴を与え、その特
徴に対応した教師データを推定することを行えば良い。For example, a character string image for learning is prepared, and (W _Cmean , H _Cmean ) and (W _i , H _i ) of all connected components are obtained in advance, and the teacher of the connected components that are in contact with each other is obtained. Data
T _i = 1, can be learned every teacher data of connected components otherwise the T _i = 0. For example, when performing regression analysis as statistical learning, it can be considered as a problem of estimating the teacher data Ti using (W _Cmean , H _Cmean , W _i , H _i ) as an explanatory variable. Similarly, it can be considered as a problem of estimating the teacher data Ti using (W _i / W _C _mean , H _i / H _Cmean ) as an explanatory variable. In other words, a contact detection feature may be given to the explanatory variable, and teacher data corresponding to the feature may be estimated.

【００２９】またバックプロパゲーションで学習する多
層パーセプトロン型のニューラルネットを用いる場合、
入力層に４次元ベクトル(W_Cmean,H_Cmean,W_i,H_i)を与
え、教師データとしてTiを与えて学習を行えば良い。こ
のときのニューラルネットの構造としては、入力層に４
個のニューロンを、中間層に５個のニューロンを、出力
層に１個のニューロンを持つ構造にすることも可能であ
るし、中間層には２個から１０００個まで自由にニュー
ロンの数を変えることも可能である。もちろん入力には
２次元ベクトルの(W_i/W_Cmean,H_i/H_Cmean)を与えること
も可能であり、この場合は入力層は２個のニューロンに
なる。つまり、ニューラルネットの入力層に接触検出特
徴を与えれば良いので、接触検出特徴の次元数と同じだ
けの個数のニューロンを入力層に与えれば良い。When using a multilayer perceptron type neural network learning by back propagation,
Four-dimensional vectors to the input layer _{_{(W Cmean, H Cmean, W}} i, H i) gave, may be performed learning gives Ti as teacher data. At this time, the structure of the neural network is as follows.
It is also possible to have a structure with 5 neurons in the hidden layer, 1 neuron in the output layer, and freely change the number of neurons from 2 to 1000 in the hidden layer. It is also possible. Of course, a two-dimensional vector (W _i / W _Cmean , H _i / H _Cmean ) can be given to the input. In this case, the input layer becomes two neurons. That is, since the contact detection feature may be provided to the input layer of the neural network, the same number of neurons as the number of dimensions of the contact detection feature may be provided to the input layer.

【００３０】また、判別分析を用いる場合、T_iはカテゴ
リーであると考え、(W_Cmean,H_Cmean,W_i,H_i)を用いて判
別分析を行うことも可能であるし、(W_i/W_Cmean,H_i/H
_Cmean)を用いることもできる。つまり接触検出特徴と、
その特徴に対応するカテゴリーを与えて判別分析を行え
ば良い。Further, when using the discriminant analysis, considered T _i are _{_{category, (W Cmean, H Cmean,}} W i, H i) it is also possible to perform the discrimination analysis using, (W _i / W _Cmean , H _i / H
_Cmean ) can also be used. In other words, the contact detection feature,
Discriminant analysis may be performed by giving a category corresponding to the feature.

【００３１】以上説明したように、学習済のニューラル
ネットや回帰係数や判別軸を用いて、あらかじめ定めら
れた閾値より大きな値が得られた連結成分に対して接触
を起こしていると判定することが可能である。すなわ
ち、学習済のニューラルネットを用いる場合は、(W
_Cmean,H_Cmean,W_i,H_i)の４次元のベクトルを入力して、
出力が0.5以上であった場合に接触を起こしていると判
定することができるし、重回帰分析でも同様な判定を行
うことができる。As described above, it is determined using the learned neural network, the regression coefficient, and the discriminant axis that the connected component having a value larger than the predetermined threshold value is in contact with the connected component. Is possible. That is, when using a trained neural network, (W
_Cmean , H _Cmean , W _i , H _i )
When the output is 0.5 or more, it can be determined that contact has occurred, and a similar determination can be made by multiple regression analysis.

【００３２】同様に、(W_i/W_Cmean,H_i/H_Cmean)の２次元
のベクトルを入力して、出力が0.5以上であった場合に
接触を起こしていると判定することができるし、重回帰
分析でも同様な判定を行うことができる。このとき閾値
は0.5に限定するものではなく、0から1までの値を自由
に設定可能である。また、判別分析の場合、(W_Cmean,H_C
_mean,W_i,H_i)に対する判別得点を用いることが可能であ
り、あらかじめ定められた閾値以上の判別得点が得られ
た場合に接触を起こしていると判定することができる。
もちろん、判定の際には、(W_i/W_Cmean,H_i/H_Cmean)を用
いることもできる。Similarly, a two-dimensional vector of (W _i / W _Cmean , H _i / H _Cmean ) is input, and if the output is 0.5 or more, it can be determined that contact has occurred. A similar determination can be made in multiple regression analysis. At this time, the threshold value is not limited to 0.5, and a value from 0 to 1 can be freely set. In the case of discriminant analysis, (W _Cmean , H _C
It is possible to use a discrimination score for _mean , W _i , H _i ), and it is possible to determine that contact has occurred when a discrimination score equal to or greater than a predetermined threshold is obtained.
Of course, (W _i / W _Cmean , H _i / H _Cmean ) can be used for the determination.

【００３３】他の方法として例えば、(W_Cmean,H_Cmean,W
_i,H_i)のデータが存在する４次元空間をFとして、Ti=1の
事象とTi=0の事象を考えて、確率密度関数p(F|T_i=1)とp
(F|T_i=0)を求め、この確率密度関数p(F|T_i=1)の値が一
定の閾値以上の値を与えるような(W_Cmean,H_Cmean,W_i,
H_i)である場合に、i番目の連結成分は接触していると判
定することも可能であるし、T_i=1の事象とT_i=0の事象の
事前確率P(T_i=1)とP(T_i=0)をあらかじめ求めておき、ベ
イズの定理からT_i=1の事象とT_i=0の事象の事後確率を求
め、T_i=1の事象の事後確率が大きければi番目の連結成
分は接触していると判定することも可能であるし、T_i=1
の事象の事後確率があらかじめ定められた閾値(0から1
の間に設定される値)よりも大きければi番目の連結成分
は接触していると判定することも可能である。もちろ
ん、この際には、(W_i/W_Cmean,H_i/H_Cme _an)を用いること
もできる。ここでは、４次元空間や２次元空間で説明し
たが、接触検出特徴の次元数を持つようなデータ空間を
Fとして、上記のような方式を用いることが可能であ
る。As another method, for example, (W _Cmean , H _Cmean , W
_i , H _i ), the four-dimensional space where the data exists is F, and considering the event of Ti = 1 and the event of Ti = 0, the probability density functions p (F | T _i = 1) and p
(F | T _i = 0), and the value of this probability density function p (F | T _i = 1) gives a value equal to or greater than a certain threshold (W _Cmean , H _Cmean , W _i ,
H _i ), it is possible to determine that the i-th connected component is in contact, and the prior probability P (T _i = 1) of the event of T _i = 1 and the event of T _i = 0 ) and obtained in advance the P (T _i = 0), I asked the posterior probability of an event of T _i = 1 of the event and T _i = 0 from Bayes' theorem, the larger the posterior probability of T _i = 1 event It is possible to determine that the i-th connected component is in contact, and T _i = 1
The posterior probability of the event is a predetermined threshold (0 to 1
If the value is larger than (a value set during), it is possible to determine that the i-th connected component is in contact. Of course, when this can be used _{_{(W i / W Cmean, H}} i / H Cme an). Here, a four-dimensional space or a two-dimensional space has been described.
As F, the above-described method can be used.

【００３４】このとき、確率密度関数を求めるには例え
ば混合正規分布を用いてEMアルゴリズムによって求める
ことが可能であるし、各事象の事前確率はあらかじめ学
習データに接触パターンがどのくらい含まれるかの比率
を求めることにより容易に求めることが可能である。At this time, the probability density function can be obtained by, for example, an EM algorithm using a mixture normal distribution, and the prior probability of each event is determined in advance by the ratio of how much the contact pattern is included in the learning data. Can be easily obtained by obtaining.

【００３５】さらに、接触検出特徴として、(W_Cmean,H
_Cmean,W_i,H_i)や(W_i/W_Cmean,H_i/H_Cmea _n)だけでなく、連
結成分の画像情報も付加して用いることも考えられる。
今まで説明した４次元のデータや２次元のデータを用い
る場合、連結成分は矩形情報として扱われており、その
内部の画像パターンがどのようなパターンであるかを知
ることが困難であったが、画像情報を用いることで連結
成分自体の形状パターンを知ることが可能になる。この
画像情報としては、例えば連結成分の画像を１６×１６
の大きさに正規化して１６×１６次元のベクトルとして
そのまま用いることもできる。このように作成された画
像情報を用いると、接触検出特徴は４次元＋１６×１６
次元のベクトルや２次元＋１６×１６次元のベクトルと
なる。Further, as a contact detection feature, (W _Cmean , H
_Cmean, W _{_i,} H _i) and _{_{(W i / W Cmean, H}} i / H Cmea n) as well as image information of the connected components also it is considered to use added.
In the case of using the four-dimensional data or the two-dimensional data described so far, the connected component is treated as rectangular information, and it is difficult to know what pattern the image pattern inside is. By using image information, it is possible to know the shape pattern of the connected component itself. As this image information, for example, a 16 × 16
, And can be used as it is as a 16 × 16-dimensional vector. Using the image information created in this way, the contact detection feature is 4D + 16 × 16
It is a two-dimensional vector or a two-dimensional + 16 × 16-dimensional vector.

【００３６】また、多くの学習サンプルとしての連結成
分の画像を１６×１６の大きさに正規化して主成分分析
を行い、固有値の大きな順番に固有ベクトルを１０個用
いて、固有ベクトルと入力パターンとの内積の値を用い
ることもできる。この場合、画像情報は１０個の固有ベ
クトルとの内積が用いられるので、１０次元ベクトルに
なる。ここでは、固有値の大きな順番に１０個の固有ベ
クトルを用いると説明したが、１０個に限るものではな
く、１個から２５６個まで自由に設定可能である。ま
た、ここまでの説明では画像を１６×１６の大きさに正
規化すると説明したが、この大きさに限るものでは無
く、必要に応じてN×Mとして任意の大きさに正規化する
ことが可能である。このとき、NとMはそれぞれ１から２
５６までの範囲で自由に設定可能である。Also, the principal component analysis is performed by normalizing the images of the connected components as many learning samples to a size of 16 × 16, and ten eigenvectors are used in descending order of the eigenvalues. The value of the inner product can also be used. In this case, the image information is a 10-dimensional vector because the inner product of the image information with 10 eigenvectors is used. Here, it has been described that ten eigenvectors are used in the descending order of eigenvalues, but the number is not limited to ten and can be set freely from one to 256. In the above description, the image is normalized to a size of 16 × 16. However, the image is not limited to this size, and may be normalized to an arbitrary size as N × M as necessary. It is possible. At this time, N and M are 1 to 2 respectively.
It can be set freely up to 56.

【００３７】例えば、文字が横方向に記載されている場
合、縦方向をN=1として横方向のMを連結成分の横幅と同
じ値に取った場合、このベクトルは文字列の記載方向に
黒画素の投影ヒストグラムを取って用いる場合と等価に
なる。つまり、本発明は従来から用いられている黒画素
の投影ヒストグラムを用いる方式の一般化になっている
とも言える。しかし、横方向のMを、一つ一つの連結成
分の横幅と同じに設定すると各連結成分から計算される
接触検出特徴の次元数が一つ一つ異なってしまうので、
例えばN=1、M=64等に設定することで統計的な学習が可
能になる。For example, when a character is described in the horizontal direction, when the vertical direction is set to N = 1 and the horizontal direction M is set to the same value as the horizontal width of the connected component, this vector becomes black in the writing direction of the character string. This is equivalent to the case where a projection histogram of pixels is used. In other words, it can be said that the present invention is a generalization of a conventional method using a projection histogram of black pixels. However, if M in the horizontal direction is set to be the same as the width of each connected component, the number of dimensions of the contact detection feature calculated from each connected component will be different one by one.
For example, by setting N = 1, M = 64, etc., statistical learning becomes possible.

【００３８】また、オリジナルな画像領域を小領域に区
切り、この領域を少しずつオーバーラップさせながら黒
画素の含有率を求め、これを画像情報とすることも可能
である。含有率とは、領域のなかに占める黒画素の比率
であり、例えば小領域の大きさが１０×１０の大きさ
で、その中に黒画素が３０個あったとすると、黒画素の
含有率は３０／１０×１０＝０．３となる。小領域の区
切り方を図５を用いて説明すると、連結成分の画像を９
個の小領域に分割する。図５は図３の例におけるCC1に
対応した説明図になっている。CC1の外接矩形は幅と高
さが(W₁,H₁)であるが、これを(W₁/2,H₁/2)の小さい矩形
で区切ることで小領域を作る。このとき、この小領域は
隣接する小領域と少しづつオーバーラップさせて、図５
に示すようにArea1からArea9までの９個の領域を作る。
この領域内の黒画素数をカウントして、領域の面積で割
ることにより小領域毎の黒画素占有率を求めることがで
き、これを画像情報とすることができる。このように作
成された画像情報を用いると、接触検出特徴は４次元＋
９次元のベクトルまたは２次元＋９次元のベクトルとな
る。It is also possible to divide the original image area into small areas, obtain the content ratio of black pixels while overlapping the areas little by little, and use this as image information. The content ratio is a ratio of black pixels occupying in the region. For example, if the size of the small region is 10 × 10 and there are 30 black pixels in the small region, the content ratio of the black pixels is 30/10 × 10 = 0.3. How to divide the small area will be described with reference to FIG.
Divided into small areas. FIG. 5 is an explanatory diagram corresponding to CC1 in the example of FIG. Although CC1 of the circumscribed rectangle is the width and height (W _{_1,} H _1), making small areas by separating this small rectangular of _{_{(W 1/2, H 1}} /2). At this time, this small area slightly overlaps with an adjacent small area,
As shown in the figure, 9 areas from Area1 to Area9 are created.
By counting the number of black pixels in this area and dividing the area by the area of the area, the black pixel occupancy for each small area can be obtained, which can be used as image information. Using the image information created in this way, the contact detection feature is 4D +
It is a 9-dimensional vector or a 2-dimensional + 9-dimensional vector.

【００３９】以上説明したように、i番目の連結成分に
対する画像情報をImg_iとすると接触検出特徴として、(W
_Cmean,H_Cmean,W_i,H_i,Img_i)を与えることができる。また
(W_i/W_Cmean,H_i/H_Cmean,Img_i)を与えることもできる。As described above, when the image information for the i-th connected component is Img _i , (W
_{_{_{Cmean, H Cmean, W i,}}} H i, it is possible to give the Img _i). Also
_{_{(W i / W Cmean, H}} i / H Cmean, Img i) can be given.

【００４０】このようにして、接触を起こした連結成分
が検出されると、つぎにこれらの連結成分を切断するこ
とが必要になる。この連結成分を接触箇所において切断
するためにはいくつかの方法が考えられるが、例え
ば、”石寺他：「ストローク抽出に基づく文字切り出し
の一手法」(信学ソ大、Ｄ−３４８、１９９６)”におい
て提案されている方法を用いることができるし、特願平
9-318638号公報に記載されている方法を用いることもで
きる。When the connected components that have come into contact are detected in this way, it is necessary to cut these connected components next. Several methods are conceivable for cutting this connected component at the contact point. For example, “Ishidera et al .:“ A method of character segmentation based on stroke extraction ”(Digital Society of Japan, D-348, 1996) And the method proposed in
The method described in JP-A-9-318638 can also be used.

【００４１】この処理をフローチャートで説明すると図
６のようになる。ST11では、入力画像の連結成分を入力
して、全連結成分の平均値(W_Cmean,H_Cmean)を求める。S
T12では、一つづつ連結成分について着目し、i番目の連
結成分に対する接触検出特徴を作成する。ST13におい
て、統計的に接触を起こしたかどうかの判定を行い、接
触を起こした連結成分についてはST14において切断処理
が施され、ST15において切り出し最小単位として出力さ
れる。また、ST13において接触を起こしていないと判定
された連結成分はST14の処理を経ずに直接ST15に移り、
切り出し最小単位として出力される。この処理を全ての
連結成分に対して行う。FIG. 6 is a flowchart illustrating this process. In ST11, the connected components of the input image are input, and the average value (W _Cmean , H _Cmean ) of all the connected components is _obtained . S
At T12, attention is paid to the connected components one by one, and a contact detection feature for the i-th connected component is created. In ST13, it is determined whether or not contact has occurred statistically. The connected component that has caused contact is subjected to a cutting process in ST14, and is output as a cutout minimum unit in ST15. In addition, the connected component determined to have not caused a contact in ST13 moves directly to ST15 without performing the processing of ST14,
It is output as the minimum cutout unit. This process is performed for all connected components.

【００４２】図３の例に対応した図４の連結成分情報が
入力された場合の動作の例を示すならば、ST11にて(W
_Cmean,H_Cmean)=(96.3,107.9)と求められ、ST12において
一個づつの連結成分に対して接触判定特徴が求められ
る。連結成分の通し番号1から3まで(CC1からCC3)は、ニ
ューラルネットの出力がすべて0.1以下となり、接触を
起こしていないと判定される。しかし、通し番号4の連
結成分(CC4)に対するニューラルネットの出力は0.9以上
になり、通し番号4の連結成分は接触を起こしていると
判定され、切断処理を施される。通し番号5から8の連結
成分(CC5からCC8)に対するニューラルネットの出力は再
び0.1以下となり、接触を起こしていないと判定され
る。If an example of the operation when the connected component information of FIG. 4 corresponding to the example of FIG. 3 is input is shown, at ST11 (W
_Cmean , H _Cmean ) = (96.3, 107.9), and the contact determination feature is obtained for each connected component in ST12. For the serial numbers 1 to 3 (CC1 to CC3) of the connected components, the outputs of the neural networks are all 0.1 or less, and it is determined that no contact has occurred. However, the output of the neural network for the connected component (CC4) with the serial number 4 is 0.9 or more, the connected component with the serial number 4 is determined to be in contact, and the cutting process is performed. The output of the neural network for the connected components (CC5 to CC8) with serial numbers 5 to 8 becomes 0.1 or less again, and it is determined that no contact has occurred.

【００４３】CC1、CC2、CC3とCC5、CC6、CC7、CC8の連
結成分は接触を起こしていないと判定され、連結成分情
報がそのまま最小単位情報として出力される。一方、連
結成分CC4はST14の切断処理によって図７のように４個
の最小単位PS02とPS05とPS06とPS08に分割される。この
際、新たに生成されることになる４個の最小単位に対し
て、連結成分情報と同様なデータ形式を持つ最小単位情
報を作成する必要がある。この処理は、CC4に切断を施
した後の画像を用いて、再び連結成分抽出部における処
理と同様な処理を行えば良い。このようにして最小単位
作成部にて作成された最小単位情報の例を図８に示す。
なお、図８において、最小単位情報PS01とPS03とPS04と
PS07とPS09とPS10とPS11はそれぞれ連結成分情報CC1とC
C2とCC3とCC5とCC6とCC7とCC8に対応しており、内容は
まったく同様である。よって、連結成分CC4の切断処理
によって新たに作成された最小単位情報はPS02とPS05と
PS06とPS08である。It is determined that the connected components CC1, CC2, CC3 and CC5, CC6, CC7, CC8 are not in contact, and the connected component information is output as the minimum unit information as it is. On the other hand, the connected component CC4 is divided into four minimum units PS02, PS05, PS06, and PS08 by the cutting process in ST14 as shown in FIG. At this time, it is necessary to create minimum unit information having the same data format as the connected component information for the four minimum units to be newly generated. In this process, the same process as that in the connected component extraction unit may be performed again using the image obtained by cutting the CC4. FIG. 8 shows an example of the minimum unit information created by the minimum unit creating unit in this way.
In FIG. 8, the minimum unit information PS01, PS03, and PS04
PS07, PS09, PS10, and PS11 are connected component information CC1 and C, respectively.
It corresponds to C2, CC3, CC5, CC6, CC7 and CC8, and the contents are exactly the same. Therefore, the minimum unit information newly created by the cutting process of the connected component CC4 is PS02 and PS05.
PS06 and PS08.

【００４４】つぎに、二次元接続関係作成部１４では二
次元空間上で最小単位間の接続関係を作成する。接続関
係と隣接関係は同じような関係であるが、ここでは、隣
接関係とは単純に図形的に隣り合うことを意味する。接
続関係については後に詳しく述べる。Next, the two-dimensional connection relation creating unit 14 creates a connection relation between the minimum units in the two-dimensional space. The connection relationship and the adjacency relationship are similar, but here, the adjacency relationship simply means that they are graphically adjacent to each other. The connection relationship will be described later in detail.

【００４５】まず、隣接関係の重要性と意義について説
明する。従来から知られている文字切り出し候補の作成
方式では、入力文字列はすでに一行づつ切り出されてい
ることが前提であり、文字列の記載方向の軸上で最小単
位をソートすることで隣接関係は簡単に作成可能であっ
た。すなわち、最小単位を記載位置でソートしてあれ
ば、すでに順番が決まっているので順番が隣り合う最小
単位同士が隣接していると考えることができるのは自明
である。隣接関係がわかっていれば、明らかに不要な文
字切り出し候補をわざわざ処理する必要がなく、また誤
って明らかに不要な文字切り出し候補を出力してしまう
ことも回避することが可能である。First, the significance and significance of the adjacency will be described. The conventionally known method of creating a character segmentation candidate is based on the premise that the input character string has already been segmented line by line, and by sorting the minimum unit on the axis in the writing direction of the character string, the adjacency relationship is obtained. It was easy to create. That is, if the minimum units are sorted by the description position, the order is already determined, and it is obvious that the minimum units adjacent to each other can be considered to be adjacent to each other. If the adjacency relationship is known, it is not necessary to separately process apparently unnecessary character cutout candidates, and it is also possible to avoid erroneously outputting apparently unnecessary character cutout candidates.

【００４６】図９の例で説明する。この例は、横書きさ
れたパターンであることと、一行しか記入されていない
ということと、接触箇所はすでに切断してあるという条
件があらかじめ与えられた場合の例である。文字列パタ
ーンが与えられ、PS21からPS27までの最小単位が、外接
矩形の中心位置が左から順番にソートされて得られたと
して、例えばPS22を飛ばしてPS21とPS23の組み合わせを
作成してもこれは正しい１文字を表すことは非常にまれ
であると言える。同様にPS22とPS23を飛ばしてPS21とPS
24の組み合わせを作成してもこれが正しい１文字を表す
ことは非常にまれである。図９のように、組み合わせを
作成する際に、隣接する最小単位を飛ばして組み合わせ
を作成しても、そのパターンが正しい文字を表すとは考
えられないので、明らかに不要な文字切り出し候補であ
る。このように、明らかに不要な組み合わせを作らない
ようにするために、あらかじめ隣接関係を決めておくこ
とは重要である。This will be described with reference to the example of FIG. This example is an example in which a condition is given in advance that the pattern is a horizontally written pattern, that only one line is written, and that the contact point has already been cut. Given a character string pattern and the minimum unit from PS21 to PS27 was obtained by sorting the center position of the circumscribed rectangle in order from the left, for example, skipping PS22 and creating a combination of PS21 and PS23 Can be said to be very rare to represent the correct character. Similarly skip PS22 and PS23 and PS21 and PS
It is very rare that even if you make 24 combinations, this represents the correct one character. As shown in FIG. 9, when a combination is created by skipping adjacent minimum units, the pattern is not considered to represent a correct character. . As described above, it is important to determine the adjacency in advance so as not to create an unnecessary combination.

【００４７】しかし、本発明は、入力されたパターンが
一行しか記入されていないという前提がない状況で文字
切り出し候補を作成することが目的である。つまり、入
力パターンが何行から成っているかが未知であり、本発
明においては、最小単位作成部の処理が終了した時点で
得られている最小単位は二次元空間上にばらまかれた状
態にすぎない。However, an object of the present invention is to create a character segmentation candidate in a situation where there is no assumption that the input pattern has only one line. That is, it is unknown how many lines the input pattern consists of, and in the present invention, the minimum unit obtained at the time when the processing of the minimum unit creating unit ends is merely a state scattered on the two-dimensional space. Absent.

【００４８】さて、二次元でこの隣接関係を定義しよう
とすると、単純なソートを用いれば良いというわけに行
かなくなる。二次元上に散らばった最小単位の隣接関係
を定義するにはいくつかの方法が考えられる。例えば、
ボロノイ分割を利用して、ボロノイ辺を共有する最小単
位同士を隣接関係にあると定義することができる。Now, when trying to define this adjacency in two dimensions, it is impossible to use a simple sort. There are several methods for defining the minimum unit adjacency scattered on two dimensions. For example,
Using Voronoi division, the minimum units sharing Voronoi sides can be defined as having an adjacent relationship.

【００４９】より詳細に説明すると、各最小単位の輪郭
をトレースしてチェインコードを取得したならば、輪郭
上の点の座標値はチェインコードを用いれば容易に計算
することができる。この座標値の系列を座標値チェイン
コードと呼ぶことにすると、座標値チェインコードを5
個おきにサンプリングして母点とすることができる。図
１０の(a)に示されたパターンに対して母点を生成した
結果の図が図１０の(b)であり、黒い丸印が輪郭上に生
成された母点を示している。More specifically, if a chain code is obtained by tracing the outline of each minimum unit, the coordinate values of points on the outline can be easily calculated by using the chain code. This series of coordinate values is called a coordinate value chain code.
A sampling point can be sampled every other and used as a generating point. FIG. 10B shows the result of generating the generating points for the pattern shown in FIG. 10A, and the black circles indicate the generating points on the contour.

【００５０】これらの母点を全ての最小単位に対して生
成し、この母点に対してボロノイ分割を行うことができ
る。次に、同じ最小単位に所属する母点同士から生成さ
れたボロノイ辺を削除して、最終的に最小単位同士のボ
ロノイ分割結果を得る。このボロノイ分割結果は、最小
単位のボロノイ分割の近似になっており、近似精度を上
げる場合には、座標値チェインコードで5個おきにサン
プリングして母点とするのではなく、座標値チェインコ
ードで3個おき、2個おきとサンプリングする間隔を小さ
くすることで精度を上げることが可能であるし、精度を
少し下げても処理時間を高速化したい場合にはサンプリ
ング間隔をもっと大きくすることが可能である。図１０
の(c)に最小単位をボロノイ分割した結果の例を示す。
図１０の(c)では、同じ最小単位から作られるボロノイ
辺を点線で図示し、異なる最小単位から作られるボロノ
イ辺を太線で示している。また、図２(a)の文字列画像
のパターンに対して、連結成分の抽出と接触パターンの
切断が終り、１１個の最小単位(PS01からPS11)が得られ
ている状態からボロノイ分割を行った結果を図１１に示
す。ボロノイ辺は点線で示されている。These generating points can be generated for all the minimum units, and Voronoi division can be performed on the generating points. Next, Voronoi sides generated from generating points belonging to the same minimum unit are deleted, and finally a Voronoi division result of the minimum units is obtained. This Voronoi division result is an approximation of the Voronoi division of the minimum unit.In order to improve the approximation accuracy, instead of sampling every fifth coordinate value with a coordinate value chain code and using it as a mother point, a coordinate value chain code is used. It is possible to increase the accuracy by reducing the sampling interval between every third and every two samples, and if you want to speed up the processing time even if you lower the accuracy a little, you can increase the sampling interval further It is possible. FIG.
(C) shows an example of the result of Voronoi division of the minimum unit.
In (c) of FIG. 10, Voronoi sides made from the same minimum unit are shown by dotted lines, and Voronoi sides made from different minimum units are shown by thick lines. In addition, for the pattern of the character string image in FIG. 2A, the extraction of the connected components and the cutting of the contact pattern are completed, and Voronoi division is performed from a state in which 11 minimum units (PS01 to PS11) are obtained. The results are shown in FIG. Voronoi sides are indicated by dotted lines.

【００５１】隣接関係を求める処理をフローチャートで
説明すると図１２のようになる。ST21では、全最小単位
の輪郭上に母点を生成し、ST22において生成された全母
点を用いてボロノイ分割を行いボロノイ辺を求める。ST
23ではある一つのボロノイ辺に着目し、ST24においてそ
のボロノイ辺が同一の最小単位の母点間から得られたも
のと判定された場合、そのボロノイ辺はST25において削
除され、そうでなかった場合はST26に移る。ST26では全
ボロノイ辺に着目した場合にはST27に移るがそうでない
場合はST23に戻る。ST27において、残ったボロノイ辺を
用いて、同一のボロノイ辺を共有する最小単位のペアを
隣接関係にあるとして隣接関係情報を出力する。図２
(a)のパターンが入力されたときに得られる隣接関係情
報の出力例を図１３に示す。FIG. 12 is a flowchart for explaining the processing for obtaining the adjacency. In ST21, a generating point is generated on the contour of all minimum units, and Voronoi division is performed by using all generating points generated in ST22 to perform Voronoi division. ST
In 23, focusing on one Voronoi side, if it is determined in ST24 that the Voronoi side is obtained between the generating points of the same minimum unit, the Voronoi side is deleted in ST25, otherwise Moves to ST26. In ST26, if attention is paid to all Voronoi sides, the process proceeds to ST27; otherwise, the process returns to ST23. In ST27, by using the remaining Voronoi sides, the minimum unit pair sharing the same Voronoi side is determined to have an adjacent relation, and the adjacent relation information is output. FIG.
FIG. 13 shows an output example of the adjacency information obtained when the pattern (a) is input.

【００５２】隣接関係情報はマトリクスになっており、
ある最小単位iと最小単位jが隣接する場合、i行j列の箇
所(j行i列も同様)の値が1で、隣接していない場合は値
が0である。The adjacency information is a matrix,
When a certain minimum unit i and a minimum unit j are adjacent to each other, the value at the position of the i-th row and the j-th column (same for the j-th row and the i-th column) is 1, and when they are not adjacent, the value is 0.

【００５３】また、一般にボロノイ分割は、縦軸と横軸
が等距離であるとすることが多いが、図１４(a)のよう
に楕円上の点を等距離とすることも可能である。このと
き、図１４(b)のように、母点の横座標の値を0.5倍にし
て与え、縦座標の値をそのままの値で与えて、通常のボ
ロノイ分割(縦軸と横軸が等距離であるとする処理)を行
うことにより、図１４(a)のように楕円の部分が等距離
になるようなボロノイ分割を行うことも可能である。こ
こでは、横座標の値を0.5倍、縦座標の値をそのまま(1.
0倍)と説明したが、この比率は自由に設定することが可
能であり、横軸をx倍、縦軸をy倍に自由に設定してもよ
い。In general, the Voronoi division often assumes that the vertical and horizontal axes are equidistant, but points on an ellipse can be equidistant as shown in FIG. 14 (a). At this time, as shown in FIG. 14 (b), the value of the abscissa of the generating point is given by multiplying by 0.5, and the value of the ordinate is given as it is, and ordinary Voronoi division (the vertical and horizontal axes are equal) 14A, it is also possible to perform Voronoi division such that the elliptical portions are equidistant as shown in FIG. 14A. Here, the value of the abscissa is multiplied by 0.5 and the value of the ordinate is unchanged (1.
However, this ratio can be freely set, and the horizontal axis may be freely set to x times and the vertical axis to y times.

【００５４】次に、最小単位の隣接関係に基づき接続可
否判定を行う。この接続可否判定は、互いに隣接関係に
ある任意の最小単位のペアに着目して、そのペアが接続
しても良いかどうか判定するものである。Next, a determination is made as to whether connection is possible based on the minimum unit adjacent relationship. This connection availability determination focuses on an arbitrary minimum unit pair that is adjacent to each other, and determines whether the pair may be connected.

【００５５】ここで、接続関係についてその意義を説明
する。例えば、図１５(a)のようなパターンが与えられ
たとき、最小単位PS101からPS103は、ボロノイ辺を共有
しあっており、互いに隣接しあっている。ここで、最小
単位PS101とPS103の共有するボロノイ辺は二つ(B1とB2)
あることが重要である。つまり、PS101とPS103のように
二つのボロノイ辺を共有するパターンを用いて組み合わ
せを作成し文字切り出し候補を作成すると、図１５(b)
のように、文字切り出し候補はその内部に閉じたボロノ
イ領域を持つことになる。このようなパターンが、正し
い文字として得られることは非常にまれなため、PS101
とPS103は隣接しているが、この二つで組み合わせを作
成しても意味がない。つまり、PS101とPS103のみから成
る文字切り出し候補は作成しないようにする必要があ
る。このようなことを避けるために接続関係を作成す
る。すなわち、PS101とPS103のように二つのボロノイ辺
を共有するパターン同士は接続関係に無いと決めれば、
PS101とPS103のみから成る文字切り出し候補を作成しな
いように制御できる。Here, the significance of the connection relationship will be described. For example, when a pattern as shown in FIG. 15A is given, the minimum units PS101 to PS103 share Voronoi sides and are adjacent to each other. Here, the Voronoi sides shared by the minimum units PS101 and PS103 are two (B1 and B2)
It is important that there is. That is, when a combination is created using a pattern that shares two Voronoi sides like PS101 and PS103 to create a character segmentation candidate, FIG. 15 (b)
, The character segmentation candidate has a Voronoi region closed inside. It is very rare that such a pattern is obtained as a correct character, so the PS101
And PS103 are adjacent, but there is no point in creating a combination of the two. That is, it is necessary not to create a character cutout candidate consisting of only PS101 and PS103. Create a connection to avoid this. That is, if it is determined that patterns sharing two Voronoi sides like PS101 and PS103 are not connected,
It is possible to control not to create a character cutout candidate consisting of only PS101 and PS103.

【００５６】ただし、PS101とPS102は隣接関係でもある
し接続関係にもあるとして、同様にPS102とPS103も隣接
関係でもあるし接続関係にもあるとすれば、PS101とPS1
02とPS103の組み合わせは作成することが可能であると
することができる。つまり、PS101とPS103は直接の接続
はできないが、組み合わせ作成の際にPS102を共有する
ことで、間接的に接続することができる。However, assuming that PS101 and PS102 are both adjacent and connected, and if PS102 and PS103 are also adjacent and connected, PS101 and PS1 are similarly connected.
A combination of 02 and PS103 can be created. In other words, PS101 and PS103 cannot be directly connected, but can be indirectly connected by sharing PS102 when creating a combination.

【００５７】以上説明したように、互いに隣接関係にあ
る任意の最小単位のペアが二つのボロノイ辺を共有する
場合は、接続関係に無いと判断することができる。つま
り、接続可否判定において、接続してはならないと判定
することが可能である。As described above, when arbitrary pairs of adjacent minimum units share two Voronoi sides, it can be determined that there is no connection. In other words, it is possible to determine that connection should not be made in the connection availability determination.

【００５８】接続可否判定の精度を向上させるために、
さらに他のいくつかの方法を併用することが考えられ
る。その方法の例として、例えば、図１６のように二つ
の最小単位が与えられたとき、最小単位iの外接矩形情
報として最小単位の外接矩形の中心点座標(X_i,Y_i)と外
接矩形の幅と高さ(W_i,H_i)を抽出し、同様に最小単位jか
らも中心点座標(X_j,Y_j)と外接矩形の幅と高さ(W_j,H_j)を
抽出し、これらを組み合わせた際にできる新たな外接矩
形P_ijの中心点座標(X_ij,Y_ij)と外接矩形の幅と高さ
(W_ij,H_ij)を作成し、さらに全ての最小単位の平均幅と
平均高さ(W_PSmean,H_PSm _ean)を求め、これらの値を用い
て接続判定特徴(X_i,Y_i,W_i,H_i,X_j,Y_j,W_j,H_j,X_ij,Y _ij,
W_ij,H_ij,W_PSmean,H_PSmean)を用いて統計的に接続可否判
定を行うことができる。これは、１４次元のベクトルと
して表される。または、(X_i/W_PSmean,Y_i/H_PS _mean,W_i/W
_PSmean,H_i/H_PSmean,X_j/W_PSmean,Y_j/H_PSmean,W_j/
W_PSmean,H_j/H_PSmean,X _ij/W_PSmean,Y_ij/H_PSmean,W_ij/W
_PSmean,H_ij/H_PSmean)という１２次元のベクトルを用い
ることも可能である。In order to improve the accuracy of connection determination,
It is possible to use several other methods together
You. As an example of the method, for example, as shown in FIG.
Given the minimum unit of, the circumscribed rectangle information of the minimum unit i
The coordinates of the center point (X_i, Y_i) And outside
The width and height of the tangent rectangle (W_i, H_i) And extract the minimum unit j
Also the center point coordinates (X_j, Y_j) And the width and height (W_j, H_j)
New circumscribed rectangles created by extracting and combining these
P_ijCenter point coordinates (X_ij, Y_ij) And the bounding rectangle width and height
(W_ij, H_ij), And the average width of all the smallest units and
Average height (W_PSmean, H_PSm _ean), And using these values
Connection judgment feature (X_i, Y_i, W_i, H_i, X_j, Y_j, W_j, H_j, X_ij, Y _ij,
W_ij, H_ij, W_PSmean, H_PSmean) Statistically determines whether connection is possible
Settings can be made. This is a 14-dimensional vector
It is expressed as Or (X_i/ W_PSmean, Y_i/ H_PS _mean, W_i/ W
_PSmean, H_i/ H_PSmean, X_j/ W_PSmean, Y_j/ H_PSmean, W_j/
W_PSmean, H_j/ H_PSmean, X _ij/ W_PSmean, Y_ij/ H_PSmean, W_ij/ W
_PSmean, H_ij/ H_PSmean) Using a 12-dimensional vector
It is also possible.

【００５９】例えば、学習用の文字列画像を準備し、あ
らかじめ隣接関係にある全ての最小単位のペアから接続
判定特徴を求めておき、最小単位iと最小単位jが互いに
一つの文字の中に含まれるものである場合に教師データ
をT_ij=1、そうでない場合(つまり最小単位iとjは違う文
字に含まれている場合)に教師データをT_ij=0と置き学習
を行うことができる。For example, a character string image for learning is prepared, and connection determination features are obtained in advance from all pairs of adjacent minimum units so that the minimum unit i and the minimum unit j are included in one character. If it is included, the training data is set to T _ij = 1, otherwise (i.e., if the minimum unit i and j are included in different characters), the training data is set to T _ij = 0 and learning is performed. it can.

【００６０】統計的学習として例えば回帰分析を行う場
合、接続判定特徴を説明変数として、教師データT_ijを
推定する問題として考えることができる。またバックプ
ロパゲーションで学習する多層パーセプトロン型のニュ
ーラルネットを用いる場合、入力層に接続判定特徴を与
え、教師データとしてT_ijを与えて学習を行えば良い。
このときのニューラルネットの構造としては、入力層に
あるニューロンの個数は接触判定特徴の次元数と同じだ
けの個数を与え、中間層に５個のニューロンを、出力層
に１個のニューロンを持つ構造にすることも可能である
し、中間層には２個から１０００個まで自由にニューロ
ンの数を変えることも可能である。また、判別分析を用
いる場合、T_ijはカテゴリーであると考え、接続判定特
徴を用いて判別分析を行うことも可能である。また、主
成分分析やマハラノビス距離等を用いることも考えられ
る。When performing, for example, regression analysis as statistical learning, it can be considered as a problem of estimating teacher data _Tij using the connection determination feature as an explanatory variable. When a multilayer perceptron-type neural network that learns by back propagation is used, the learning may be performed by providing a connection determination feature to the input layer and providing _Tij as teacher data.
The structure of the neural network at this time is such that the number of neurons in the input layer is the same as the number of dimensions of the contact determination feature, with five neurons in the intermediate layer and one neuron in the output layer. The number of neurons can be freely changed from 2 to 1000 in the intermediate layer. When discriminant analysis is used, _Tij is considered to be a category, and discriminant analysis can be performed using the connection judgment feature. It is also conceivable to use principal component analysis, Mahalanobis distance, or the like.

【００６１】以上説明したように、学習済のニューラル
ネットや回帰係数や判別軸を用いて、あらかじめ定めら
れた閾値より大きな値が得られた最小単位のペアに対し
て接続可能であると判定することが可能である。すなわ
ち、学習済のニューラルネットを用いる場合は、接続判
定特徴のベクトルを入力して、出力が0.5以上であった
場合に接続可能なペアであると判定することができる
し、重回帰分析でも同様な判定を行うことができる。ま
た、判別分析の場合、接続判定特徴に対する判別得点を
用いることが可能であり、あらかじめ定められた閾値以
上の判別得点が得られた場合に接触を起こしていると判
定することができる。As described above, it is determined that a connection can be made to a minimum unit pair having a value larger than a predetermined threshold value using a learned neural network, a regression coefficient, and a discriminant axis. It is possible. In other words, when a learned neural network is used, a connection determination feature vector is input, and if the output is 0.5 or more, it can be determined that the pair is connectable, and the same applies to multiple regression analysis. Can be determined. In the case of discriminant analysis, it is possible to use a discrimination score for the connection determination feature, and it is possible to determine that contact has occurred when a discrimination score equal to or greater than a predetermined threshold is obtained.

【００６２】この処理を行う意義を説明する。例えば、
図１１のPS05とPS07のように隣接関係にある最小単位の
ペアが、互いに遠い場所に斜め方向に位置している場合
には、このペアがお互いに同一の文字を構成する要素で
あることはまれである。一方で、図１１のPS03とPS05
や、PS05とPS06は互いに近い場所に横方向や縦方向で位
置しており、このようなペアの場合お互いに同一の文字
を構成する可能性が高いと言える。The significance of performing this processing will be described. For example,
When pairs of adjacent minimum units, such as PS05 and PS07 in FIG. 11, are positioned diagonally far from each other, it is not possible that these pairs constitute the same character as each other. It is rare. On the other hand, PS03 and PS05 in FIG.
In addition, PS05 and PS06 are located in the horizontal and vertical directions close to each other, and it can be said that in such a pair, there is a high possibility that the same character is constituted by each other.

【００６３】本発明のように、隣接関係をボロノイ分割
で定義した場合、隣接するペアの数が多くなってしまう
ことがあるので、上記のような方式によって、より精密
な接続関係を求める必要性が高い。すなわち、余分な文
字切り出し候補を少しでも作成しないようにするため
に、本処理が必要であると言える。When the adjacency relation is defined by Voronoi division as in the present invention, the number of adjacent pairs may increase. Therefore, it is necessary to obtain a more precise connection relation by the above method. Is high. That is, it can be said that this processing is necessary in order not to create any extra character cutout candidates.

【００６４】さらに、学習のための接続判定特徴とし
て、１４次元のベクトルや１２次元のベクトルだけでな
く、最小単位の画像情報も付加して用いることも考えら
れる。今まで説明した１４次元のデータや１２次元のデ
ータを用いる場合、最小単位は矩形情報として扱われて
おり、その内部の画像パターンがどのようなパターンで
あるかを知ることが困難であったが、画像情報を用いる
ことで最小単位自体の形状パターンを知ることが可能に
なる。この画像情報としては、例えば最小単位iの画像
パターンを１６×１６の大きさに正規化して１６×１６
次元のベクトルをImg_iとして求め、同様に最小単位jの
パターンからも１６×１６次元のベクトルをImg_jとして
求め、Img_iとImg_jを画像情報として用いることもでき
る。この場合、接続判定特徴は１４次元＋(１６×１６)
×２次元のベクトルや１２次元＋(１６×１６)×２次元
のベクトルとなる。Further, as a connection determination feature for learning, not only a 14-dimensional vector and a 12-dimensional vector but also image information of a minimum unit may be added and used. When 14-dimensional data or 12-dimensional data described above is used, the minimum unit is treated as rectangular information, and it is difficult to know what kind of image pattern is inside the rectangular information. By using image information, it becomes possible to know the shape pattern of the minimum unit itself. As this image information, for example, the image pattern of the minimum unit i is normalized to a size of 16 × 16 to obtain a 16 × 16
It is also possible to obtain a dimensional vector as Img _i , similarly obtain a 16 × 16 dimensional vector as Img _j from the pattern of the minimum unit j, and use Img _i and Img _j as image information. In this case, the connection determination feature is 14 dimensions + (16 × 16)
× 2 dimensional vector or 12 dimensional + (16 × 16) × 2 dimensional vector.

【００６５】また、多くの学習サンプルとしての最小単
位の画像を１６×１６の大きさに正規化して主成分分析
を行い、固有値の大きな順番に固有ベクトルを１０個用
いて、固有ベクトルと入力パターンとの内積を用いるこ
ともできる。この場合、画像情報は１０次元ベクトルに
なり、接続判定特徴は２０次元だけ次元数の増えたベク
トルとなる。ここでは、固有値の大きな順番に１０個の
固有ベクトルを用いると説明したが、１０個に限るもの
ではなく、１個から２５６個まで自由に設定可能であ
る。Further, the principal unit analysis is performed by normalizing the image of the minimum unit as many learning samples to a size of 16 × 16, and ten eigenvectors are used in descending order of the eigenvalue, and the eigenvector and the input pattern are compared. Dot product can also be used. In this case, the image information is a 10-dimensional vector, and the connection determination feature is a vector whose dimension is increased by 20 dimensions. Here, it has been described that ten eigenvectors are used in the descending order of eigenvalues, but the number is not limited to ten and can be set freely from one to 256.

【００６６】また、ここまでの説明では１６×１６の大
きさで説明したが、この大きさに限るものでは無く、必
要に応じてN×Mとして任意の大きさに正規化することが
可能である。このとき、NとMはそれぞれ１から２５６ま
での範囲で自由に設定可能である。また、着目する最小
単位のオリジナルな画像領域を小領域に区切り、この領
域を少しずつオーバーラップさせながら黒画素の含有率
を求め、これを画像情報とすることも可能である。含有
率とは、領域のなかに占める黒画素の比率であり、例え
ば小領域の大きさが１０×１０の大きさで、その中に黒
画素が３０個あったとすると、黒画素の含有率は３０／
１０×１０＝０．３となる。小領域の区切り方を図５を
用いて説明すると、連結成分の画像を９個の小領域に分
割する。この小領域は少しずつ隣接する小領域とオーバ
ーラップしているが、この領域内の黒画素数をカウント
して、領域の面積で割ることにより小領域毎の黒画素占
有率を求めることができ、これを画像情報とすることが
できる。In the above description, the size is 16 × 16. However, the size is not limited to this, and it is possible to normalize the size to N × M as needed. is there. At this time, N and M can be freely set in the range of 1 to 256, respectively. It is also possible to divide the original image area of the minimum unit of interest into small areas, obtain the content ratio of black pixels while overlapping the areas little by little, and use this as image information. The content ratio is a ratio of black pixels occupying in the region. For example, if the size of the small region is 10 × 10 and there are 30 black pixels in the small region, the content ratio of the black pixels is 30 /
10 × 10 = 0.3. Explaining how to divide a small area with reference to FIG. 5, an image of a connected component is divided into nine small areas. This small area slightly overlaps with the adjacent small area, but the number of black pixels in this area is counted and divided by the area of the area to obtain the black pixel occupancy for each small area. , Which can be used as image information.

【００６７】以上説明したように、最小単位iの画像情
報をImg_i、最小単位jの画像情報をImg_jとすると接続判
定特徴として、(X_i,Y_i,W_i,H_i,X_j,Y_j,W_j,H_j,X_ij,Y_ij,
W_ij,H_ij,W _PSmean,H_PSmean,Img_i,Img_j)を用いることがで
きる。または、(X_i/W_PSmean,Y_i/H_P _Smean,W_i/W_PSmean,H_i
/H_PSmean,X_j/W_PSmean,Y_j/H_PSmean,W_j/W_PSmean,H_j/H
_PSmean,X_ij/W_PSmean,Y_ij/H_PSmean,W_ij/W_PSmean,H_ij/H
_PSmean,Img_i,Img_j)を用いることも可能である。As described above, the image information of the minimum unit i is
Img_i, Img the image information of the minimum unit j_jAnd connection
As a fixed feature, (X_i, Y_i, W_i, H_i, X_j, Y_j, W_j, H_j, X_ij, Y_ij,
W_ij, H_ij, W _PSmean, H_PSmean, Img_i, Img_j) Can be used
Wear. Or (X_i/ W_PSmean, Y_i/ H_P _Smean, W_i/ W_PSmean, H_i
/ H_PSmean, X_j/ W_PSmean, Y_j/ H_PSmean, W_j/ W_PSmean, H_j/ H
_PSmean, X_ij/ W_PSmean, Y_ij/ H_PSmean, W_ij/ W_PSmean, H_ij/ H
_PSmean, Img_i, Img_j) Can also be used.

【００６８】このように最小単位の画像パターンも利用
することの意義について説明する。例えば図１７のPS08
とPS09との関係に着目すると、PS08では縦ストロークの
下の部分に返しがある。漢字においては、この返しは文
字の終了を意味する場合が多く、このようなパターンが
書かれている場合、それよりも下の最小単位と組み合わ
せても正しい文字を表すことはあまり多くない。このよ
うなパターンを検出して、接続関係の決定に用いること
で、少しでも余分な文字切り出し候補が作成されること
を抑える効果があり、組み合わせの爆発を防ぐことがで
きる。The significance of using the minimum unit image pattern will be described. For example, PS08 in FIG.
Paying attention to the relationship between PS09 and PS09, there is a return in the part below the vertical stroke in PS08. In kanji, this return often means the end of a character, and when such a pattern is written, it is not very often that a correct character is represented even when combined with the smallest unit below it. Detecting such a pattern and using it for determining the connection relationship has the effect of suppressing the creation of any extra character cutout candidates, thereby preventing the explosion of combinations.

【００６９】接続関係を求める処理をフローチャートで
説明すると図１８のようになる。ST31ではi番目の最小
単位に着目し、ST32において隣接関係情報を参照しなが
らi番目の最小単位に隣接するj番目の最小単位との間で
計算される接続判定特徴を求める。この際に、i番目の
最小単位とj番目の最小単位とで二つのボロノイ辺を共
有しているかどうかの情報も付加して作成する。ST33に
おいて、i番目の最小単位とj番目の最小単位とで二つの
ボロノイ辺を共有している場合直ちに両者は接続関係に
無いと判定し、そうでなかった場合は接続判定特徴を用
いて統計的に接続しているかどうかを判定する。ST34で
は、最小単位iに隣接した全ての最小単位jに着目した場
合ST35へ移り、そうでない場合はST32へ戻る。ST35で
は、全ての最小単位について着目した場合ST36へ移り、
そうでない場合はST31へ戻る。ST36では、ST33にて判定
された接続関係を接続関係情報として出力する。FIG. 18 is a flowchart for explaining the processing for obtaining the connection relationship. At ST31, focusing on the i-th minimum unit, a connection determination feature calculated between the i-th minimum unit and the j-th minimum unit adjacent to the i-th minimum unit is obtained with reference to the adjacent relation information at ST32. At this time, it is created by adding information as to whether the two Voronoi sides are shared by the i-th minimum unit and the j-th minimum unit. In ST33, if the i-th minimum unit and the j-th minimum unit share two Voronoi sides, it is immediately determined that they do not have a connection relationship, and if not, statistical analysis is performed using the connection determination feature. It is determined whether or not the connection is established. In ST34, when attention is paid to all the minimum units j adjacent to the minimum unit i, the process proceeds to ST35, and otherwise, the process returns to ST32. In ST35, when attention is paid to all minimum units, the process proceeds to ST36,
Otherwise, return to ST31. In ST36, the connection relation determined in ST33 is output as connection relation information.

【００７０】ここで、図２(a)のパターンが入力された
ときに作成される接続関係情報の例を図１９に示す。接
続関係情報は隣接関係情報と同様な形式を持つ。例えば
PS01とPS03はボロノイ分割によって隣接していると判定
されたが、両者は矩形中心間の距離があいているため
に、接続関係に無いと判定されている。またPS02とPS03
もボロノイ分割によって隣接していると判定されたが、
両者は矩形中心間距離があいており、かつ斜め方向に隣
接しているために、接続関係に無いと判定されている。
PS05とPS07も同様である。FIG. 19 shows an example of connection relation information created when the pattern shown in FIG. 2A is input. The connection relation information has the same format as the adjacent relation information. For example
Although PS01 and PS03 are determined to be adjacent by Voronoi division, they are determined to be out of connection because the distance between the centers of the rectangles is large. Also PS02 and PS03
Are also determined to be adjacent by Voronoi division,
Since both have a rectangular center-to-center distance and are obliquely adjacent to each other, it is determined that there is no connection relationship.
The same applies to PS05 and PS07.

【００７１】以上説明したように接続可否が決定され、
接続関係が与えられた後に、文字切り出し候補出力部１
５において、いくつかの最小単位から組み合わせパター
ンを作成し、その組み合わせを文字切り出し候補として
も良いかどうかの判定、すなわち出力可否判定を行う。As described above, whether connection is possible or not is determined.
After the connection relation is given, the character segmentation candidate output unit 1
In step 5, a combination pattern is created from some of the minimum units, and it is determined whether or not the combination may be used as a character cutout candidate, that is, whether output is possible.

【００７２】組合せパターンの作成においては、いくつ
の最小単位を組み合わせるかについてあらかじめ決めて
おく必要がある。In creating a combination pattern, it is necessary to determine in advance how many minimum units are to be combined.

【００７３】ここで、ある組み合わせパターンが一つの
最小単位から成っている場合、組み合わせ数を１と定義
する。また、ある組み合わせパターンが二つの最小単位
から成っている場合の組み合わせ数を２と定義し、同様
にある組み合わせパターンがN個の最小単位から成って
いる場合の組み合わせ数はNであるとする。Here, when a certain combination pattern is composed of one minimum unit, the number of combinations is defined as one. Further, the number of combinations when a certain combination pattern is composed of two minimum units is defined as 2, and the number of combinations when a certain combination pattern is composed of N minimum units is N.

【００７４】組み合わせパターンを作成する際には、例
えば組み合わせ数が１から１０までの組み合わせパター
ンを作成すればよい。ここでは、組み合わせ数の下限は
１、上限は１０と説明したが、組み合わせ数の上限は自
由に設定可能であり、２から１００までの値を設定する
ことが可能である。When creating a combination pattern, for example, a combination pattern having 1 to 10 combinations may be created. Here, the lower limit of the number of combinations is 1 and the upper limit is 10, but the upper limit of the number of combinations can be set freely, and a value from 2 to 100 can be set.

【００７５】組み合わせ数１のパターンを作成する場
合、一つの最小単位を組み合わせパターンであるとすれ
ば良い。組み合わせ数２のパターンを作成する場合、例
えばある最小単位iに着目して、その最小単位iに接続可
能であると判定された最小単位jと組み合わせることで
作成可能である。このときi≠jである。また、組み合わ
せ数３のパターンを作成する場合、例えばある最小単位
iに着目して、その最小単位iに接続可能であると判定さ
れた最小単位jと、最小単位iか最小単位jのいづれかに
接続可能であると判定された最小単位kを組み合わせる
ことで作成可能である。このときi≠j≠kである。同様
にして、組み合わせ数Nの組み合わせパターンを作成す
ることが可能である。When a pattern having the number of combinations of 1 is created, one minimum unit may be a combination pattern. When creating a pattern having two combinations, for example, a pattern can be created by focusing on a certain minimum unit i and combining it with a minimum unit j determined to be connectable to the minimum unit i. At this time, i ≠ j. When creating a pattern having three combinations, for example, a certain minimum unit
Focusing on i, created by combining the minimum unit j determined to be connectable to the minimum unit i and the minimum unit k determined to be connectable to either the minimum unit i or the minimum unit j It is possible. At this time, i ≠ j ≠ k. Similarly, it is possible to create a combination pattern of the number N of combinations.

【００７６】このようにして作成された組み合わせパタ
ーンを文字切り出し候補として出力してもよいかどうか
の出力可否判定を文字切り出し候補出力部１５にて行
う。The character cutout candidate output section 15 determines whether or not the combination pattern created in this way may be output as a character cutout candidate.

【００７７】この判定には、いくつかの方法が考えられ
るが、例えば、いくつかの最小単位の組み合わせを作っ
たときに、内部に閉じたボロノイ領域を持つパターンは
文字切り出し候補として出力してはならないと判定する
ことができる。For this determination, several methods are conceivable. For example, when a combination of some minimum units is created, a pattern having a Voronoi region closed inside should be output as a character extraction candidate. Can not be determined.

【００７８】より詳しく説明すると、図２０の(a)や(b)
のパターンは、それぞれ三つの最小単位があるが、(a)
の例ではPSa1とPSa3だけの組み合わせを作成すると、内
部に閉じたボロノイ領域ができてしまう。同様に、(b)
の例ではPSb1とPSb2だけの組み合わせでも、内部に閉じ
たボロノイ領域ができてしまう。一般に、文字列中にお
ける文字は、最小単位をボロノイ分割したときに一つの
閉領域を作るという性質を持ち、これに反するような文
字パターンは非常にまれであるという性質を利用して、
内部に閉じたボロノイ領域を持つパターンは文字切り出
し候補として出力しないという制御を行うことが可能で
ある。More specifically, (a) and (b) of FIG.
Each pattern has three minimum units, but (a)
In the example, if only a combination of PSa1 and PSa3 is created, a closed Voronoi region will be created inside. Similarly, (b)
In the above example, even with a combination of PSb1 and PSb2, a Voronoi region closed inside is created. In general, characters in a character string have the property that one closed area is created when the minimum unit is Voronoi-divided, and the property that character patterns contrary to this are very rare is used.
It is possible to perform control so that a pattern having a Voronoi region closed inside is not output as a character extraction candidate.

【００７９】さらに、異なる判定方式によって組み合わ
せパターンの出力可否判定を行うことで、さらに判定精
度を向上させることが可能である。例えば、ある組み合
わせパターンに着目し、その組み合わせパターンの外接
矩形の高さと幅を(H_C,W_C)として、統計的に文字切り出
し候補として出力してよいかどうかの出力可否判定する
ことができる。ここで、添字のCはC番目の組み合わせパ
ターンであることを意味する。Further, the determination accuracy can be further improved by determining the output propriety of the combination pattern using different determination methods. For example, paying attention to a certain combination pattern, the height and width of the circumscribed rectangle of the combination pattern are set to (H _C , W _C ), and it is possible to determine whether or not to output statistically as a character cutout candidate. . Here, the subscript C means the C-th combination pattern.

【００８０】例えば、統計的な判定の方法として、学習
用にあらかじめ正しい文字候補になっている組み合わせ
パターンを多数用意して、その外接矩形の幅と高さを(W
_L,H_L)としたときに、WとHによって張られる二次元空間
上に(W_L,H_L)をプロットし、例えば５０×５０のメッシ
ュに区切って、あるメッシュの中に正しい文字候補が含
まれている頻度を求め、その頻度を用いることが可能で
ある。ここで、添字のLはL番目の学習用データを意味す
る。このとき、WとHで張られる空間は本来ゼロから無限
まで広がっているがあらかじめ学習データ(W_L,H_L)の存
在する範囲を求め、その範囲を５０×５０のメッシュに
区切れば良い。あらかじめ設定したメッシュの範囲を逸
脱した箇所は、頻度をゼロとすることも可能である。For example, as a statistical determination method, a large number of combination patterns that are correct character candidates are prepared in advance for learning, and the width and height of the circumscribed rectangle are set to (W
_L, when the H _L), W and (W on a two-dimensional space spanned by H _L, plots the H _L), for example, separated in a 50 × 50 mesh, correct character candidate in Mesh Is found, and the frequency can be used. Here, the suffix L means the L-th learning data. At this time, the space spanned by W and H originally extends from zero to infinity, but the range in which the learning data (W _L , H _L ) exists is determined in advance, and the range may be divided into 50 × 50 meshes. . It is also possible to set the frequency of a portion that deviates from a preset mesh range to zero.

【００８１】ある組み合わせパターンを考えた場合、こ
の頻度の値がある一定の閾値以上であった場合に文字切
り出し候補として出力してよいと判定することができ
る。この判定により、明らかに異常なパターン(幅や高
さや面積が大きすぎたり、縦横の比率が異常につぶれて
いたり縦長だったりするパターン)を排除することが可
能になり、余分な文字切り出し候補を出力してしまうこ
とを抑えることができる。When a certain combination pattern is considered, it can be determined that a character cutout candidate can be output when the frequency value is equal to or greater than a certain threshold value. By this determination, it is possible to eliminate apparently abnormal patterns (patterns whose width, height, area is too large, the aspect ratio is crushed abnormally or vertically long), and extra character cutout candidates are Output can be suppressed.

【００８２】また、文字切り出し候補として出力してよ
いかの判定には組み合わせパターンの外接矩形の幅と高
さを(W_C,H_C)として用いると説明したが、ある一枚の文
字列画像から得られる全ての最小単位の幅と高さの平均
値を(W_PSmean,H_PSmean)として求め、正解である組み合
わせパターンCの外接矩形の幅と高さを平均値で正規化
した量として(W_C/H_PSmean,H_C/W_PSmean)を求め、これを
用いることも可能である。Also, it has been described that the width and height of the circumscribed rectangle of the combination pattern are used as (W _C , H _C ) to determine whether or not it is possible to output as a character cutout candidate. The average value of the width and height of all the minimum units obtained from is obtained as (W _PSmean , H _PSmean ), and the width and height of the circumscribed rectangle of the combination pattern C that is the correct answer are normalized by the average value as ( W _C / H _PSmean , H _C / W _PSmean ) can be obtained and used.

【００８３】通常、異なる文字列画像から得られる文字
の幅と高さは、そのサイズが大きく異なるが、各画像の
最小単位の幅と高さで正規化された量を用いることで、
サイズの変動分を抑えることが可能になる。Normally, the width and height of characters obtained from different character string images differ greatly in size, but by using quantities normalized by the minimum unit width and height of each image,
It is possible to reduce the size variation.

【００８４】この判定に加えて、さらなる判定方式を加
えることも考えられる。例えば、着目する組み合わせパ
ターンCが最小単位i,j,kから成り立っているとしたとき
に、最小単位i,j,kのいずれかと接続関係にある他の最
小単位lを考える、このときl≠i≠j≠kである。組み合
わせパターンの外接矩形の中心座標(X_C,Y_C)と外接矩形
の幅と高さ(W_C,H_C)を求め、同様に組み合わせパターンC
と接続する最小単位lからも中心点座標(X_Cl,Y_Cl)と外接
矩形の幅と高さ(W_Cl,H_Cl)を求め、これらを組み合わせ
た際にできる新たな外接矩形Pの中心点座標(X_P,Y_P)と外
接矩形の幅と高さ(W_P,H_P)を作成し、さらに全ての最小
単位の平均幅と平均高さ(W_PSmean,H_PSmean)を求め、こ
れらの値を用いて統計的に文字切り出し候補として出力
してよいかの出力可否判定を行うことができる。In addition to this determination, a further determination method may be added. For example, assuming that the combination pattern C of interest is made up of the minimum units i, j, k, consider another minimum unit l that is connected to one of the minimum units i, j, k. i ≠ j ≠ k. The center coordinates (X _C , Y _C ) of the circumscribed rectangle of the combination pattern and the width and height (W _C , H _C ) of the circumscribed rectangle are obtained.
The center point coordinates (X _Cl , Y _Cl ) and the width and height (W _Cl , H _Cl ) of the circumscribed rectangle are also obtained from the minimum unit l connected to the center, and the center of the new circumscribed rectangle P created when these are combined Create the point coordinates (X _P , Y _P ) and the width and height (W _P , H _P ) of the circumscribed rectangle, and further calculate the average width and average height (W _PSmean , H _PSmean ) of all minimum units, By using these values, it is possible to statistically determine whether or not to output as character extraction candidates.

【００８５】すなわち、出力可否判定特徴として(X_C,
Y_C,W_C,H_C,X_Cl,Y_Cl,W_Cl,H_Cl,X_P,Y_P,W_P,H_P,W_PSmean,H
_PSmean)を用いて統計的に出力可否判定を行うことがで
きる。また、(X_C/W_PSmean,Y_C/H_PSmean,W_C/W_PSmean,H_C/H
_PSmean,X_Cl/W_PSmean,Y_Cl/H_PSmean,W _Cl/W_PSmean,H_Cl/H
_PSmean,X_P/W_PSmean,Y_P/H_PSmean,W_P/W_PSmean,H_P/
H_PSmean)を用いることも可能である。That is, (X_C,
Y_C, W_C, H_C, X_Cl, Y_Cl, W_Cl, H_Cl, X_P, Y_P, W_P, H_P, W_PSmean, H
_PSmean) Can be used to statistically determine whether output is possible.
Wear. Also, (X_C/ W_PSmean, Y_C/ H_PSmean, W_C/ W_PSmean, H_C/ H
_PSmean, X_Cl/ W_PSmean, Y_Cl/ H_PSmean, W _Cl/ W_PSmean, H_Cl/ H
_PSmean, X_P/ W_PSmean, Y_P/ H_PSmean, W_P/ W_PSmean, H_P/
H_PSmean) Can also be used.

【００８６】例えば、学習用の文字列画像を準備し、あ
らかじめ隣接関係に基づいて組み合わせ数1からNまでの
組み合わせパターンCP_Cを作成する。ここで、Cは通し番
号でありCP_Cはi番目の組み合わせパターンを意味する。
次に、組み合わせパターンCP _Cと隣接する他の最小単位
のセットPS_Clを求める。ここで添字のCはCP_Cに隣接して
いるという意味をあらわし、lはCP_Cに隣接しているl番
目の最小単位であることを意味する。ここで、全てのCP
_CとPS_Clのペアから出力可否判定特徴を求めておき、組
み合わせパターンCP_Cが正しい文字を表しているときに
教師データをT_Cl=1、そうでない場合(つまり組み合わせ
パターンCP_Cが正しい文字でない場合)に教師データをT
_Cl=0と置き学習を行うことができる。For example, a character string image for learning is prepared.
From 1 to N combinations based on the adjacency
Combination pattern CP_CCreate Where C is a serial number
And CP_CMeans the i-th combination pattern.
Next, the combination pattern CP _CAnd other adjacent minimum units
Set PS_ClAsk for. Where the subscript C is CP_CAdjacent to
L means CP_CNumber l adjacent to
It is the smallest unit of the eye. Where all CPs
_CAnd PS_ClThe output feasibility judgment feature is obtained from the pair of
Combination pattern CP_CRepresents the correct character
Teacher data T_Cl= 1, otherwise (ie combination
Pattern CP_C(If is not a correct character)
_Cl0 can be set for learning.

【００８７】統計的学習として例えば回帰分析を行う場
合、出力可否判定特徴を説明変数として、教師データT
_Clを推定する問題として考えることができる。またバッ
クプロパゲーションで学習する多層パーセプトロン型の
ニューラルネットを用いる場合、入力層に出力可否判定
特徴を与え、教師データとしてT_Clを与えて学習を行え
ば良い。このときのニューラルネットの構造としては、
入力層にあるニューロンの個数は接触判定特徴の次元数
と同じだけの個数を与え、中間層に５個のニューロン
を、出力層に１個のニューロンを持つ構造にすることも
可能であるし、中間層には２個から１０００個まで自由
にニューロンの数を変えることも可能である。また、判
別分析を用いる場合、T_Clはカテゴリーであると考え、
出力可否判定特徴を用いて判別分析を行うことも可能で
ある。また、主成分分析やマハラノビス距離等を用いる
ことも考えられる。When performing, for example, regression analysis as statistical learning, the output data determination feature is used as an explanatory variable and the teacher data T
It can be considered as a problem of estimating _Cl . When a multilayer perceptron-type neural network that learns by back propagation is used, an input layer is provided with an output availability determination feature, and learning is performed by providing T _Cl as teacher data. The structure of the neural network at this time is as follows:
The number of neurons in the input layer is given the same number as the number of dimensions of the contact determination feature, and it is possible to have a structure having five neurons in the intermediate layer and one neuron in the output layer, The number of neurons in the intermediate layer can be freely changed from two to 1,000. Also, when using discriminant analysis, T _Cl is considered a category,
It is also possible to perform discriminant analysis using the output availability determination feature. It is also conceivable to use principal component analysis, Mahalanobis distance, or the like.

【００８８】以上説明したように、学習済のニューラル
ネットや回帰係数や判別軸を用いて、あらかじめ定めら
れた閾値より大きな値が得られた組み合わせパターンCP
_Cを文字切り出し候補として出力することが可能であ
る。すなわち、学習済のニューラルネットを用いる場合
は、出力可否判定特徴のベクトルを入力して、出力が0.
5以上であった場合に正しい文字を表している組み合わ
せパターンである可能性が高いとして、文字切り出し候
補として出力することができるし、重回帰分析でも同様
な判定を行うことができる。また、判別分析の場合、出
力可否判定特徴に対する判別得点を用いることが可能で
あり、あらかじめ定められた閾値以上の判別得点が得ら
れた場合に正しい文字を表している組み合わせパターン
である可能性が高いとして、文字切り出し候補として出
力することができる。As described above, using the learned neural network, the regression coefficient and the discriminant axis, the combination pattern CP having a value larger than the predetermined threshold value is obtained.
It is possible to output _C as a character extraction candidate. In other words, when using a trained neural network, input a vector of the output availability determination feature and output 0.
If the number is 5 or more, it is highly likely that the combination pattern represents a correct character, the character pattern can be output as a character segmentation candidate, and a similar determination can be made by multiple regression analysis. Also, in the case of discriminant analysis, it is possible to use a discriminant score for an output propriety judgment feature, and if a discriminant score equal to or greater than a predetermined threshold is obtained, it is possible that the combination pattern represents a correct character. If it is high, it can be output as a character segmentation candidate.

【００８９】また、一般に一つの組み合わせパターンCP
_Cに対して、複数の隣接する最小単位PS_Clが得られる。
このとき、CP_Cの出力可否判定は、隣接する最小単位PS
_Clの個数と同じ回数だけ行われる。このとき、l回の判
定のなかで1回でも閾値以下の判定が行われた場合、CP_C
は正しい文字を表す可能性が低いとして棄却することも
可能であるし、l回の判定のうち正しい文字を表すと判
定された比率、すなわちm回正しい文字を表すと判定さ
れた場合に、m/lの値があらかじめ定められた閾値より
も大きかった場合には、正しい文字を表す可能性が高い
として文字切り出し候補として出力することができる。Also, generally, one combination pattern CP
_{For C} , a plurality of adjacent minimum units PS _Cl are obtained.
In this case, the output determination of CP _C is the minimum unit adjacent PS
_This is performed as many times as the number of _Cl . At this time, if at least one of the l determinations is determined to be equal to or less than the threshold, the CP _C
Can be rejected as having a low possibility of representing a correct character, or the ratio of l times that the correct character is determined to represent the correct character, i.e., if it is determined that m represents the correct character, m When the value of / l is larger than a predetermined threshold, it is possible to output a character extraction candidate as a possibility that it represents a correct character.

【００９０】ここで、図２１を用いて説明する。図２１
の(a)から(e)は、組み合わせパターンCP₁からCP₅の一例
を示している。図２１の(a)の例で説明するならば、CP₁
を評価する際には、まずCP₁とPS11の関係からCP₁の出力
可否判定を行い、次にCP₁とPS12の関係からCP₁の出力可
否判定を行う。Here, a description will be given with reference to FIG. FIG.
From (a) (e) illustrates an example of CP ₅ from combination pattern CP _1. If described in example (a) of FIG. 21, CP ₁
When evaluating, first performs output determination of CP ₁ from the relation CP ₁ and PS11, to output determination of CP ₁ then the relationship CP ₁ and PS12.

【００９１】図２１(a)の例で、CP₁とPS₁₁の関係からCP
₁の出力可否判定を行う場合の出力可否判定特徴を作成
する際の様子を図２２に示す。In the example of FIG. 21A, the relationship between CP ₁ and PS ₁₁
FIG. 22 shows a state in which the output permission / inhibition determination feature is created in the case of performing the output permission / inhibition determination of ₁ .

【００９２】ここで、このような出力可否判定を行う意
義について説明する。例えば、図２１(a)のパターンは
それぞれ正しい文字を表す文字切り出し候補である。
(a)のCP ₁が、それぞれサイズの観点で妥当な文字切り出
し候補であったとして、そのまわりの最小単位PS₁₁とPS
₁₂とに対してCP₁が空間的にどのような配置関係を持つ
か評価すると、PS₁₁とPS₁₂両者に対して縦方向に配置さ
れていることがわかる。文字列は、縦方向や横方向に記
載されることが多いので、CP₁はまわりの最小単位との
配置も妥当であると言える。(c)の場合も同様である。Here, the meaning of making such output availability determination is as follows.
I will explain justice. For example, the pattern in FIG.
Each is a character cutout candidate representing a correct character.
(a) CP ₁However, character extraction that is appropriate in terms of size
As a candidate, and the minimum unit PS around it₁₁And PS
₁₂Against CP₁What spatial relationship has
If you evaluate, PS₁₁And PS₁₂Vertically positioned for both
You can see that it is. Character strings can be written vertically or horizontally.
Because it is often listed, CP₁Is the smallest unit around
It can be said that the arrangement is appropriate. The same applies to the case (c).

【００９３】一方、(e)のパターンは組み合わせパター
ンを構成する内部の最小単位が斜めに組み合わさってお
り、しかも同一文字ではないと仮定している二つの最小
単位PS₅₁とPS₅₂がやはり斜めに組み合わせパターンの外
接矩形内に入り込んでおり、このようなパターンは文字
切り出し候補として明らかに妥当ではない。On the other hand, in the pattern (e), the two minimum units PS ₅₁ and PS ₅₂ which are assumed to be diagonally combined with the internal minimum units forming the combination pattern and are not the same character are also diagonal. , The pattern is clearly not valid as a character segmentation candidate.

【００９４】つまり、組み合わせパターン自体のサイズ
(幅と高さ)だけで文字としての妥当性を評価しようとす
ると、(e)のようなパターンがひとたび妥当なサイズで
あると判定されてしまうと、まわりとの配置関係がいび
つなパターンであっても、それを排除できないという問
題がある。That is, the size of the combination pattern itself
If you try to evaluate the validity of a character using only (width and height), once the pattern as shown in (e) is judged to be a valid size, the pattern of the arrangement with the surroundings is irregular. Even so, there is a problem that it cannot be excluded.

【００９５】図２１の(b)と(d)は、(a)と(c)の二つのパ
ターンと(e)のパターンとの中間にあるような例である
が、両者を比較すると(b)の例よりも(d)のパターンのほ
うが文字切り出し候補として適さないと判定されて排除
される可能性が高いと期待できる。このような微妙な判
定を精度良く行うために、統計的な判定を利用すること
も意義の大きいことである。FIGS. 21 (b) and (d) show an example in which the two patterns (a) and (c) are in the middle of the pattern (e). It can be expected that the pattern of (d) is more likely to be determined to be unsuitable as a character segmentation candidate and eliminated than the example of ()). It is also significant to use statistical judgment in order to make such a delicate judgment with high accuracy.

【００９６】さらに、学習のための出力可否判定特徴と
して、１４次元のベクトルや１２次元のベクトルだけで
なく、組み合わせパターンCP_Cとそれに隣接する最小単
位PS_C _lの画像情報も付加して用いることも考えられる。
今まで説明した１４次元のデータや１２次元のデータを
用いる場合、組み合わせパターンと最小単位は矩形情報
として扱われており、その内部の画像パターンがどのよ
うなパターンであるかを知ることが困難であったが、画
像情報を用いることで組み合わせパターンと最小単位自
体の形状パターンを知ることが可能になる。この画像情
報としては、例えば組み合わせパターンCPCの画像パタ
ーンを１６×１６の大きさに正規化して１６×１６次元
のベクトルImg_Cとして求め、同様に最小単位PS_Clのパタ
ーンからも１６×１６次元のベクトルをImg_Clとして求
め、Img_CとImg_Clを画像情報として用いることもでき
る。この場合、特徴量は(１６×１６)×２次元だけ次元
数の増えたベクトルになる。[0096] Further, as the output determination features for learning, be used also added image information of the minimum unit PS _C _l adjacent to the well vector of 14-dimensional vectors and 12-D, a combination pattern CP _C it Is also conceivable.
When 14-dimensional data or 12-dimensional data described above is used, the combination pattern and the minimum unit are treated as rectangular information, and it is difficult to know what kind of image pattern is inside. However, using the image information makes it possible to know the combination pattern and the shape pattern of the minimum unit itself. As this image information, for example, an image pattern of the combination pattern CPC is normalized to a size of 16 × 16 to obtain a 16 × 16-dimensional vector Img _C , and a 16 × 16-dimensional vector is similarly obtained from the pattern of the minimum unit PS _Cl . The vector can be obtained as Img _Cl , and Img _C and Img _Cl can be used as image information. In this case, the feature amount is a vector whose dimension is increased by (16 × 16) × 2 dimensions.

【００９７】また、多くの学習サンプルとしての組み合
わせパターンと最小単位の画像をそれぞれ１６×１６の
大きさに正規化して主成分分析を行い、それぞれの画像
について固有値の大きな順番に固有ベクトルを１０個用
いて、固有ベクトルと入力パターンとの内積を用いるこ
ともできる。この場合、それぞれの画像情報は１０次元
ベクトルになり、特徴量は２０次元だけ次元数の増えた
ベクトルになる。ここでは、固有値の大きな順番に１０
個の固有ベクトルを用いると説明したが、１０個に限る
ものではなく、１個から２５６個まで自由に設定可能で
ある。Also, the combination pattern as a large number of learning samples and the image of the minimum unit are each normalized to a size of 16 × 16, and the principal component analysis is performed. For each image, ten eigenvectors are used in the order of large eigenvalue. Thus, the inner product of the eigenvector and the input pattern can be used. In this case, each piece of image information is a 10-dimensional vector, and the feature amount is a vector whose dimension is increased by 20 dimensions. In this case, 10
Although it has been described that the number of eigenvectors is used, the number is not limited to ten, but can be set freely from one to 256.

【００９８】また、ここまでの説明では１６×１６の大
きさで説明したが、この大きさに限るものでは無く、必
要に応じてN×Mとして任意の大きさに正規化することが
可能である。このとき、NとMはそれぞれ１から２５６ま
での範囲で自由に設定可能である。また、着目する組み
合わせパターンと最小単位のオリジナルな画像領域を小
領域に区切り、この領域を少しずつオーバーラップさせ
ながら黒画素の含有率を求め、これを画像情報とするこ
とも可能である。含有率とは、領域のなかに占める黒画
素の比率であり、例えば小領域の大きさが１０×１０の
大きさで、その中に黒画素が３０個あったとすると、黒
画素の含有率は３０／１０×１０＝０．３となる。小領
域の区切り方を図５を用いて説明すると、連結成分の画
像を９個の小領域に分割する。この小領域は少しずつ隣
接する小領域とオーバーラップしているが、この領域内
の黒画素数をカウントして、領域の面積で割ることによ
り小領域毎の黒画素占有率を求めることができ、これを
画像情報とすることができる。In the above description, the size is 16 × 16. However, the size is not limited to this, and it is possible to normalize the size to N × M as needed. is there. At this time, N and M can be freely set in the range of 1 to 256, respectively. It is also possible to divide the combination image of interest and the original image area of the minimum unit into small areas, obtain the content ratio of black pixels while overlapping the areas little by little, and use this as image information. The content ratio is a ratio of black pixels occupying in the region. For example, if the size of the small region is 10 × 10 and there are 30 black pixels in the small region, the content ratio of the black pixels is 30/10 × 10 = 0.3. Explaining how to divide a small area with reference to FIG. 5, an image of a connected component is divided into nine small areas. This small area slightly overlaps with the adjacent small area, but the number of black pixels in this area is counted and divided by the area of the area to obtain the black pixel occupancy for each small area. , Which can be used as image information.

【００９９】以上説明したように、出力可否判定特徴と
して画像情報も加えて(X_C,Y_C,W_C,H_C,X_Cl,Y_Cl,W_Cl,H_Cl,X
_P,Y_P,W_P,H_P,W_PSmean,H_PSmean,Img_C,Img_Cl)を用いて統計
的に出力可否判定を行うことができる。また、(X_C/W
_PSmean,Y_C/H_PSmean,W_C/W_PSmean,H_C/H_PSmean,X_Cl/W
_PSmean,Y_Cl/H_PSmean,W_Cl/W_PSmean,H_Cl/H_PSmean,X_P/W
_PSmean,Y_P/H_PSmean,W_P/W_PSmean,H_P/H_PSmean,Img_C,Im
g_Cl)を用いることも可能である。As described above, image information is also added as an output availability determination feature (X _C , Y _C , W _C , H _C , X _Cl , Y _Cl , W _Cl , H _Cl , X
_P , Y _P , W _P , H _P , W _Psmean , H _Psmean , Img _C , Img _Cl ) can be used to statistically determine whether output is possible. Also, (X _C / W
_PSmean , Y _C / H _PSmean , W _C / W _PSmean , H _C / H _PSmean , X _Cl / W
_PSmean , Y _Cl / H _PSmean , W _Cl / W _PSmean , H _Cl / H _PSmean , X _P / W
_{_{_{PSmean, Y P / H PSmean,}}} W P / W PSmean, H P / H PSmean, Img C, Im
g _Cl ) can also be used.

【０１００】文字切り出し候補出力部の動作をフローチ
ャートで説明すると図２３のようになる。ST41では、組
み合わせ数が1からNまでの組み合わせパターンを作成
し、ST42では生成されたC番目の組み合わせパターンCP_C
に着目し、ST43において接続関係情報を参照しながらCP
_Cに接続するl番目の最小単位PS_Clに着目する。ST44にお
いてCP_CとPS_Clとの間で得られる出力可否判定特徴を求
める。このとき、出力可否判定特徴に加えてCP_Cが作る
ボロノイ領域内に、異なる最小単位が作る閉じたボロノ
イ領域があるかどうかの情報も付加される。ST45では、
CP_Cに閉じたボロノイ領域が含まれる場合には出力して
はならないと判定され次のステップST46に移るが、そう
でなかった場合に出力可否判定特徴を用いて統計的に出
力可否判定が行われる。ST46では全てのlに着目した場
合ST47に移り、そうでない場合はST43に戻る。ST47では
全てのCに着目した場合ST48に移り、そうでない場合はS
T42に戻る。ST48では、ST45で出力可と判定された組み
合わせパターンを文字切り出し候補として出力する。The operation of the character segmentation candidate output unit will be described with reference to a flowchart shown in FIG. In ST41, a combination pattern in which the number of combinations is 1 to N is created, and in ST42, the generated C-th combination pattern CP _C
Pay attention to the CP and refer to the connection-related information in ST43.
_Focus on the l-th minimum unit PS _Cl connected to _C. In ST44, an output availability determination feature obtained between CP _C and PS _Cl is obtained. At this time, the Voronoi region created by the CP _C in addition to the output determination features, is also added information whether there is a Voronoi region closed making different minimum unit. In ST45,
Outputs but proceeds to next step ST46 is determined to not be statistically output determination rows by using output determination characterized if not so, when containing the Voronoi region closed CP _C Will be In ST46, when attention is paid to all l, the process proceeds to ST47, and otherwise, the process returns to ST43. In ST47, when all the Cs are focused on, the process proceeds to ST48, otherwise, S is performed.
Return to T42. In ST48, the combination pattern determined to be output in ST45 is output as a character cutout candidate.

【０１０１】図２(a)のようなパターンが入力されたと
きに、組み合わせ数の上限を４と設定した場合の出力さ
れる文字切り出し候補の例を図２４に示す。縦に区切ら
れた左の列から組み合わせ数１、一番右の列が組み合わ
せ数４の文字切り出し候補を示している。図２(a)の例
では、一行のみ記載された例を示したが、このようなパ
ターンに行の切り出し処理を施しても、一行として切り
出される場合もあれば、二行になって切り出されること
もあり、ひとたび行切り出しを誤ると、正しい文字切り
出し候補を作成することも困難となるが、また、図２
(a)は通常であれば一行のみ記載されていると説明した
が、もしこの文字列画像が何らかの暗号のようなもの
で、一行目が「月月丁」であり二行目が「ナ日田三」と
意図して書かれたものであっても、本方式によればそち
らに対応する文字切り出し候補が生成されている。つま
り、本方式によれば、一行か二行かわからなくても、ど
ちらのケースにも対応できるように文字切り出し候補を
生成することが可能になる。FIG. 24 shows an example of a character cutout candidate output when the upper limit of the number of combinations is set to 4 when a pattern as shown in FIG. 2A is input. From the left column vertically separated, the number of combinations is 1, and the rightmost column shows character cutout candidates with the number of combinations of 4. In the example of FIG. 2A, an example in which only one line is described is shown. However, even if such a pattern is subjected to a line cutout process, the line may be cut out as one line, or may be cut out as two lines. In some cases, once line segmentation is erroneous, it becomes difficult to create a correct character segmentation candidate.
(a) explained that normally only one line is described, but if this character string image is something like a cipher, the first line is `` Tsukitsukicho '' and the second line is `` Na Hita Even if it is written with the intention of "3", according to this method, a corresponding character segmentation candidate is generated. In other words, according to the present method, it is possible to generate a character segmentation candidate so as to be able to cope with either case without knowing whether it is one line or two lines.

【０１０２】本発明のように、行抽出処理を前提とせず
に文字切り出し候補を作成することによって、行抽出が
困難な文字列パターンであっても正しく文字切り出し候
補を得ることが可能になる。このとき、文字切り出し候
補は、最小単位を組み合わせた画像パターンとその外接
矩形の幅と高さと、外接矩形中心の座標のデータとして
得られる。As in the present invention, by creating character extraction candidates without assuming line extraction processing, character extraction candidates can be correctly obtained even for character string patterns for which line extraction is difficult. At this time, the character cutout candidate is obtained as data of the image pattern obtained by combining the minimum units, the width and height of the circumscribed rectangle, and the coordinates of the center of the circumscribed rectangle.

【０１０３】図２４に対応するデータは、文字切り出し
候補情報として図２５に示すような形式で保持される。
すなわち、ある文字切り出し候補はその通し番号である
IDと、いくつの最小単位を組み合わせたかの組み合わせ
数と、外接矩形の幅と高さ(Wi,Hi)と外接矩形中心の座
標(Xi,Yi)と、組み合わせに用いられた最小単位のIDと
が文字切り出し候補情報として出力される。ここで、添
字のiはi番目の文字切り出し候補を意味し、IDの番号と
対応している。また、文字切り出し候補の画像パターン
は、組み合わせに用いられた全ての最小単位のIDを用い
て最小単位情報を参照すれば画像パターンを取り出せる
ので、容易に作成することが可能である。The data corresponding to FIG. 24 is held as character cutout candidate information in a format as shown in FIG.
That is, a character cutout candidate is its serial number
ID, the number of combinations of how many minimum units are combined, the width and height (Wi, Hi) of the circumscribed rectangle, the coordinates (Xi, Yi) of the center of the circumscribed rectangle, and the ID of the minimum unit used in the combination It is output as character cutout candidate information. Here, the subscript i indicates the i-th character cutout candidate, and corresponds to the ID number. Further, the image pattern of the character cutout candidate can be easily created because the image pattern can be extracted by referring to the minimum unit information using the IDs of all the minimum units used in the combination.

【０１０４】以上説明したように、本発明は、文字切り
出しにおける最小単位間の接続関係を二次元的に定義
し、接続関係をたどることで組み合わせパターンを作成
するので、行抽出処理を前提とせずに文字切り出し候補
を作成することが可能であり、行抽出が困難な文字列パ
ターンであっても正しく文字切り出し候補を得ることが
できる。また、組み合わせパターンを文字切り出し候補
として出力する際に、組み合わせパターンのまわりの最
小単位間との配置関係を評価し、また組み合わせパター
ンの画像的なパターンも評価するので、余分な文字切り
出し候補が出力されてしまうことを抑えることが可能で
あり、組み合わせの爆発を防ぐことができる。As described above, according to the present invention, the connection relationship between the minimum units in character segmentation is defined two-dimensionally, and a combination pattern is created by following the connection relationship. It is possible to create a character segmentation candidate, and to correctly obtain a character segmentation candidate even for a character string pattern for which line extraction is difficult. In addition, when outputting a combination pattern as a character segmentation candidate, the arrangement relationship between the minimum units around the combination pattern is evaluated, and the image-like pattern of the combination pattern is also evaluated. It is possible to suppress that it is done, and it is possible to prevent the explosion of the combination.

【０１０５】文字切り出し確定部１６は、文字切り出し
候補出力部１５で得られた文字切り出し候補を基に、認
識処理と言語知識処理とレイアウト解析を行い文字切り
出し結果を出力する。文字切り出し確定部は、特願平7-
203148号公報に記載されている方式を用いることにより
容易に実現することが可能である。The character cutout determining section 16 performs recognition processing, linguistic knowledge processing, and layout analysis based on the character cutout candidates obtained by the character cutout candidate output section 15 and outputs a character cutout result. The character cutout confirmation section is described in Japanese Patent Application
This can be easily realized by using the method described in JP-A-203148.

【０１０６】特願平7-203148号公報に記載されている方
式を用いた文字切り出し確定部１６は、文字認識結果が
二次元的にばらまかれた状態から処理をスタートし、始
めに言語知識処理を施して複数の文字列候補を作成し、
これらの文字列候補に対してレイアウトコストを計算
し、文字認識コストと言語的コストとレイアウトコスト
を同時に用いて文字列を読み取る方式であり、文字列の
読み取り結果と同時に文字切り出し結果を確定すること
が可能である。The character cutout determining unit 16 using the method described in Japanese Patent Application No. 7-203148 starts processing from a state where character recognition results are scattered two-dimensionally. To create multiple character string candidates,
This method calculates the layout cost for these character string candidates and reads the character string using the character recognition cost, linguistic cost, and layout cost at the same time. Is possible.

【０１０７】図２４のような文字切り出し候補が得られ
たとき、文字切り出し確定部１６で言語知識として住所
の知識を用いて図２(a)の入力を読み取ると、「有明町
三」と読み取られ、最終的に文字切り出し結果は図２６
のようになる。このとき、文字切り出し結果情報として
図２７に示すとおり、文字切り出し候補のIDを持つデー
タ形式で出力される。処理結果の表示等、文字切り出し
処理の後に文字切り出し結果のデータを用いる場合は、
最小単位情報と文字切り出し候補情報を参照することで
容易に取り扱うことが可能である。When the character cutout candidate as shown in FIG. 24 is obtained, the character cutout determining section 16 reads the input of FIG. 2A using the knowledge of the address as the linguistic knowledge. Finally, the character cutout result is shown in FIG.
become that way. At this time, as shown in FIG. 27, character extraction result information is output in a data format having a character extraction candidate ID. When using data of the character extraction result after character extraction processing, such as displaying the processing result,
It can be easily handled by referring to the minimum unit information and the character segmentation candidate information.

【０１０８】なお、上記実施の形態では、文字切り出し
候補生成装置及び文字切り出し候補生成方法について記
載したが、本発明には上記文字切り出し候補生成処理を
コンピュータで実現するためのプログラム、更にはこの
プログラムを記録した記録媒体も含まれ、該プログラム
をコンピュータにロードし実現することにより上記実施
の形態記載の効果が得られる。In the above embodiment, the character cutout candidate generating apparatus and the character cutout candidate generating method have been described. However, the present invention provides a program for realizing the above character cutout candidate generating processing by a computer, and furthermore, this program Is also included, and the effects described in the above embodiments can be obtained by loading and realizing the program on a computer.

【０１０９】[0109]

【発明の効果】以上説明したように、本発明は、文字切
り出しにおける最小単位間の接続関係を二次元的に定義
し、接続関係をたどることで組み合わせパターンを作成
するので、行抽出処理を前提とせずに文字切り出し候補
を作成することが可能であり、行抽出が困難で一行なの
か二行なのかわからない文字列パターンであっても両者
の場合に対応した文字切り出し候補を得ることができ
る。また、組み合わせパターンを文字切り出し候補とし
て出力する際に、組み合わせパターンのまわりの最小単
位間との配置関係を評価し、また組み合わせパターンの
画像的なパターンも評価するので、余分な文字切り出し
候補が出力されてしまうことを抑えることが可能であ
る。そして、文字認識コストと言語的コストとレイアウ
トコストを同時に用いて文字列を読み取るので複数行が
同時に入力された場合でも、行抽出を行うことを前提と
しないで、組み合わせが爆発することなく、接触した文
字パターンも正しく切断できて、正しい文字切り出しを
行うことが可能になる。As described above, according to the present invention, the connection relationship between the minimum units in character segmentation is defined two-dimensionally, and a combination pattern is created by following the connection relationship. It is possible to create a character segmentation candidate without removing the character segment, and it is possible to obtain a character segmentation candidate corresponding to both cases even for a character string pattern in which it is difficult to extract a line and it is not clear whether it is one line or two lines. In addition, when outputting a combination pattern as a character segmentation candidate, the arrangement relationship between the minimum units around the combination pattern is evaluated, and the image-like pattern of the combination pattern is also evaluated. It is possible to suppress being done. Then, since the character string is read using the character recognition cost, the linguistic cost, and the layout cost at the same time, even if multiple lines are input at the same time, without assuming that line extraction is performed, the combination does not explode, The extracted character pattern can also be correctly cut, and correct character cutout can be performed.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の一実施の形態に係る文字切り出し候補
作成装置の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a character segmentation candidate creation device according to an embodiment of the present invention.

【図２】文字列画像の例を示す図である。FIG. 2 is a diagram illustrating an example of a character string image.

【図３】図２(a)の例に対して連結成分を抽出した例を
示す図である。FIG. 3 is a diagram showing an example in which connected components are extracted from the example of FIG. 2 (a).

【図４】図２(a)の例に対応した連結成分情報の例を示
す図である。FIG. 4 is a diagram illustrating an example of connected component information corresponding to the example of FIG.

【図５】画像情報を作成する際の動作を説明する図であ
る。FIG. 5 is a diagram illustrating an operation when creating image information.

【図６】最小単位作成部における手順を示すフローチャ
ート図である。FIG. 6 is a flowchart illustrating a procedure in a minimum unit creating unit.

【図７】図２(a)の例に対して最小単位を作成した例を
示す図である。FIG. 7 is a diagram illustrating an example in which a minimum unit is created for the example of FIG.

【図８】図２(a)の例に対応した最小単位情報の例を示
す図である。FIG. 8 is a diagram illustrating an example of minimum unit information corresponding to the example of FIG.

【図９】隣接関係を定義する意義を説明するための図で
ある。FIG. 9 is a diagram for explaining the significance of defining an adjacency relationship.

【図１０】ボロノイ分割処理を説明するための図であ
る。FIG. 10 is a diagram for explaining Voronoi division processing.

【図１１】図２(a)の例に対してボロノイ分割を行い、
ボロノイ辺を抽出した例を示す図である。FIG. 11 performs Voronoi division on the example of FIG.
It is a figure showing the example which extracted Voronoi side.

【図１２】隣接関係を作成する手順を示すフローチャー
ト図である。FIG. 12 is a flowchart illustrating a procedure for creating an adjacent relationship.

【図１３】図２(a)の例に対応した隣接関係情報の例を
示す図である。FIG. 13 is a diagram illustrating an example of adjacent relation information corresponding to the example of FIG.

【図１４】縦軸と横軸で距離尺度を変えてボロノイ分割
するときの動作例を説明する図である。FIG. 14 is a diagram illustrating an operation example when Voronoi division is performed by changing the distance scale between the vertical axis and the horizontal axis.

【図１５】接続関係を作成する意義を説明するための図
である。FIG. 15 is a diagram for explaining the significance of creating a connection relationship.

【図１６】接続判定特徴を作成する際の動作を説明する
図である。FIG. 16 is a diagram illustrating an operation when creating a connection determination feature.

【図１７】接続判定特徴に画像情報を付加する意義を説
明する図である。FIG. 17 is a diagram illustrating the significance of adding image information to a connection determination feature.

【図１８】接続関係を作成する手順を示すフローチャー
ト図である。FIG. 18 is a flowchart illustrating a procedure for creating a connection relationship.

【図１９】図２(a)の例に対応した接続関係情報の例を
示す図である。FIG. 19 is a diagram illustrating an example of connection relation information corresponding to the example of FIG.

【図２０】組み合わせパターンの出力可否判定の一動作
を説明する図である。FIG. 20 is a diagram illustrating an operation of determining whether to output a combination pattern.

【図２１】組み合わせパターンとその周辺の最小単位と
の関係を説明する図である。FIG. 21 is a diagram illustrating a relationship between a combination pattern and a minimum unit around the combination pattern.

【図２２】出力可否判定特徴を作成する際の動作を説明
する図である。FIG. 22 is a diagram illustrating an operation when creating an output possibility determination feature.

【図２３】文字切り出し候補作成部における手順を示す
フローチャート図である。FIG. 23 is a flowchart illustrating a procedure in a character segmentation candidate creation unit.

【図２４】図２(a)の例に対応した文字切り出し候補の
例を示す図である。FIG. 24 is a diagram illustrating an example of a character cutout candidate corresponding to the example of FIG.

【図２５】図２４の例に対応した文字切り出し候補情報
の例を示す図である。FIG. 25 is a diagram illustrating an example of character cutout candidate information corresponding to the example of FIG. 24;

【図２６】図２(a)の例に対応した文字切り出し結果の
例を示す図である。FIG. 26 is a diagram illustrating an example of a character cutout result corresponding to the example of FIG.

【図２７】図２６の例に対応した文字切り出し結果情報
の例を示す図である。FIG. 27 is a diagram illustrating an example of character cutout result information corresponding to the example of FIG. 26;

[Explanation of symbols]

１１文字列画像記憶部１２連結成分抽出部１３最小単位作成部１４二次元接続関係作成部１５文字切り出し候補出力部１６文字切り出し確定部１７コンピュータ１８文字列画像読み取り装置１９表示手段２０記録媒体 Reference Signs List 11 character string image storage unit 12 connected component extraction unit 13 minimum unit creation unit 14 two-dimensional connection relationship creation unit 15 character cutout candidate output unit 16 character cutout determination unit 17 computer 18 character string image reading device 19 display means 20 recording medium

Claims

[Claims]

1. A character segmentation apparatus for generating a character segmentation candidate from a character string image, comprising: a character string image storage unit for inputting and storing the character string image; and binarizing the character string image to extract a connected component The connected component extraction unit performs a determination as to whether or not all the connected components are in contact with each other. When it is determined that the connected components are not in contact with each other, the connected component itself is used as a minimum unit, and the connected components are contacted. A minimum unit creating unit that cuts the connected component to create a new minimum unit when it is determined that the minimum unit is formed, and a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space. A two-dimensional connection relationship creating unit for obtaining a combination pattern, and creating a combination pattern by combining a plurality of the minimum units having a connection relationship with each other, and extracting a combination pattern having a high possibility of being a correct character. A character cutout candidate output unit for outputting as a candidate.

2. A character extracting apparatus for generating a character extracting candidate from a character string image, comprising: a character string image storage unit for inputting and storing the character string image; and binarizing the character string image to extract a connected component. The connected component extraction unit performs a determination as to whether or not all the connected components are in contact, and when it is determined that the connected components are not in contact, the connected component itself is used as a minimum unit, and the connected components are A minimum unit creating unit that cuts the connected component to create a new minimum unit when it is determined that the minimum unit is formed, and a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space. A two-dimensional connection relationship creating unit for obtaining a combination pattern, and creating a combination pattern by combining a plurality of the minimum units having a connection relationship with each other, and extracting a combination pattern having a high possibility of being a correct character. A character extraction candidate output unit that outputs as a candidate, and a character extraction determination unit that performs recognition processing, linguistic knowledge processing, and layout analysis based on the character extraction candidate, and outputs a character extraction result, is provided. Character extraction device.

3. The minimum unit creation unit refers to the average value of the width and height of all connected components extracted from the character string image and the width and height of the connected component of interest to generate contact. 3. The character segmenting device according to claim 1, wherein whether the component is a connected component is statistically determined.

4. The method according to claim 1, wherein the minimum unit creation unit further statistically determines whether or not the connected component is in contact with reference to an image pattern representing the shape of the focused connected component. The character segmentation device according to claim 3, wherein

5. The two-dimensional connection relation creation unit determines that adjacent minimum units that share Voronoi sides by Voronoi division are adjacent when determining the adjacent relation of the minimum unit extracted from the character string image. Further, when the minimum units having the adjacent relation share two Voronoi sides, it is determined that the minimum units are not in the adjacent relation. A character cutout device according to one of the preceding claims.

6. The two-dimensional connection relationship creating unit creates a combination of the width, height, and center coordinate value of each of the plurality of minimum units determined to be in an adjacent relationship, and the plurality of minimum units. With reference to the width, height, and center coordinate values of the circumscribed rectangle and the average value of the width and height of all the minimum units extracted from the character string image, the connection relationship is determined by statistical evaluation. 6. The character segmenting device according to claim 5, wherein:

7. The character according to claim 6, wherein the two-dimensional connection relation creation unit further obtains a connection relation with reference to an image pattern representing a shape of each of the plurality of minimum units. Cutting device.

8. The character cutout candidate output section creates a plurality of minimum unit combination patterns by tracing the connection relationship obtained by the two-dimensional connection relationship creation section when creating the combination pattern. The character cutout device according to any one of claims 1 to 7, wherein:

9. The character cutout candidate output unit refers to the width, height, and center coordinate value of the combination pattern and the average value of the width and height of all minimum units extracted from the character string image. 9. The character cutout device according to claim 8, wherein whether to output the combination pattern as a character cutout candidate is determined by statistical evaluation.

10. The character segmentation candidate output unit,
Width, height and center coordinate value of the combination pattern,
The width, height and center coordinate values of the minimum unit adjacent to the combination pattern, the width, height and center coordinate values of a circumscribed rectangle created by combining the combination pattern and the minimum unit, and the character string image With reference to the average value of the width and height of all the minimum units extracted from the above, statistical evaluation is performed for the same number of times as the number of the minimum units adjacent to the combination pattern, and the combination pattern is a character cutout candidate. 9. The character segmenting device according to claim 8, wherein whether to output the character is determined.

11. The character segmentation candidate output unit,
Furthermore, with reference to the image pattern representing the shape of the combination pattern and the image pattern representing the shape of the minimum unit adjacent to the combination pattern, it is determined whether to output the combination pattern as a character cutout candidate. The character cutout device according to claim 9 or 10, wherein:

12. A character extracting method for generating a character extracting candidate from a character string image, comprising: a character string image storing step of inputting and storing the character string image; and binarizing the character string image to extract a connected component. Performing a connected component extraction step, and determining whether or not all the connected components are in contact.When it is determined that the connected components are not in contact, the connected component itself is used as a minimum unit, and the connected components are A minimum unit creation step of cutting the connected component to create a new minimum unit when it is determined that the minimum unit has been established; and a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space. A two-dimensional connection relationship creating step of obtaining a combination pattern, and creating a combination pattern by combining a plurality of the minimum units having a connection relationship with each other, and a combination having a high possibility of being a correct character. A character segmentation candidate output step of outputting a matching pattern as a character segmentation candidate.

13. A character clipping method for generating a character clipping candidate from a character string image, comprising: a character string image storing step of inputting and storing the character string image; and binarizing the character string image to extract a connected component. Performing a connected component extraction step, and determining whether or not all the connected components are in contact.When it is determined that the connected components are not in contact, the connected component itself is used as a minimum unit, and the connected components are A minimum unit creating step of creating a new minimum unit by cutting the connected component when it is determined that the minimum unit has been connected; and a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space. A two-dimensional connection relationship creating step of obtaining a combination pattern, and creating a combination pattern by combining a plurality of the minimum units having a connection relationship with each other, and a combination having a high possibility of being a correct character. A character cutout candidate output step of outputting a matching pattern as a character cutout candidate, and a character cutout determining step of performing recognition processing, linguistic knowledge processing, and layout analysis based on the character cutout candidate, and outputting a character cutout result. A character extraction method characterized by having at least a character.

14. In the minimum unit creation step, contact is generated by referring to the average value of the width and height of all connected components extracted from the character string image and the width and height of the connected component of interest. 14. The character segmentation method according to claim 12, wherein it is statistically determined whether the component is a connected component.

15. The method according to claim 15, wherein in the minimum unit creating step, it is further determined statistically whether or not the connected component is in contact with reference to an image pattern representing the shape of the focused connected component. The character segmentation method according to claim 14.

16. In the two-dimensional connection relation creation step, when determining the adjacent relation of the minimum unit extracted from the character string image, it is determined that the minimum units sharing the Voronoi sides by Voronoi division are adjacent. And then
The character according to any one of claims 12 to 15, wherein when the minimum units having the adjacent relation share two Voronoi sides, it is determined that the minimum units are not in the adjacent relation. Cutting method.

17. The method for determining the adjacency relation of the minimum unit includes the steps of: (a) creating a mother point on the contour of all of the minimum units; and (b) Voronoi using all the created mother points. Dividing and obtaining a Voronoi edge; and (c) focusing on a certain Voronoi edge in the Voronoi edge and, if the Voronoi edge is obtained between the generating points of the same minimum unit, (D) determining whether all Voronoi sides have been focused on, and returning to (c) if there are Voronoi sides that have not been focused on; and (e) determining the same Voronoi sides. 17. The method according to claim 16, further comprising: determining that the minimum units to be shared have an adjacent relationship.

18. The two-dimensional connection relationship creating step, wherein the width, height, and center coordinate value of each of the plurality of minimum units determined to be in an adjacent relationship is combined with the plurality of minimum units. Statistically evaluating the width, height, and center coordinate value of the circumscribed rectangle and the average value of the width and height of all the minimum units extracted from the character string image to obtain a connection relationship. 18. The character segmentation method according to claim 16, wherein:

19. The character segmentation according to claim 18, wherein in the two-dimensional connection relation creating step, a connection relation is further obtained by referring to an image pattern representing the shape of each of the plurality of minimum units. Method.

20. In the character cutout candidate output step, when the combination pattern is created, by following the connection relationship obtained in the two-dimensional connection relationship creation step, a combination pattern of a plurality of minimum units is created. The character cutout method according to any one of claims 12 to 19, characterized in that:

21. In the character cutout candidate outputting step, the width, height, and center coordinate value of the combination pattern and the average value of the width and height of all the minimum units extracted from the character string image are referred to. 21. The character cutout method according to claim 20, wherein the character pattern is statistically evaluated to determine whether to output the combination pattern as a character cutout candidate.

22. In the character segmentation candidate output step, the width, height and center coordinate value of the combination pattern, the width, height and center coordinate value of the minimum unit adjacent to the combination pattern, and the combination pattern The width, height, and center coordinate value of a circumscribed rectangle created by combining the minimum unit and the average value of the width and height of all minimum units extracted from the character string image, 21. The character segmentation method according to claim 20, wherein whether to output the combination pattern as a character segmentation candidate is determined by statistically evaluating the same number of times as the number of the minimum units adjacent to the pattern.

23. The character segmentation candidate output step, further comprising: referring to an image pattern representing the shape of the combination pattern and an image pattern representing the shape of the minimum unit adjacent to the combination pattern. 23. The character extracting method according to claim 21, wherein it is determined whether or not to output as a character extracting candidate.

24. A determination as to whether or not to output as a character segmentation candidate includes the steps of: (a) creating a combination pattern having 1 to N combinations; and (b) the C-th combination pattern from the combination patterns. a step of focusing on the combination pattern CPc, (c) is connected to the CP _C l th the minimum unit PS
A step of focusing on _Cl, and performing an output acceptability decision obtained between the PS _Cl (d) and the CP _C, it is determined whether or not focusing on all l (e), are focused (F) determining whether or not all Cs have been focused, and returning to (b) if there are Cs not focused; 24. The character extracting method according to claim 20, further comprising: g) outputting the combination pattern determined to be output-possible in (d) as a character extracting candidate.

25. A character string image storing process for inputting and storing a character string image, a connected component extracting process for extracting a connected component by binarizing the character string image, and contacting all the connected components. Determine whether or not, when the connected component is determined not to be in contact with the connected component itself as the minimum unit, cut off the connected component when it is determined that the connected component is in contact A minimum unit creation process for creating a new minimum unit, a two-dimensional connection relationship creation process for obtaining a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space, Character combination candidate output processing for creating a combination pattern by combining the minimum units with each other and outputting a combination pattern having a high possibility of being a correct character as a character extraction candidate. Character segmentation program to be executed.

26. A character string image storing process for inputting and storing a character string image, a connected component extracting process for extracting a connected component by binarizing the character string image, and contacting all the connected components. Determine whether or not, when the connected component is determined not to be in contact with the connected component itself as the minimum unit, cut off the connected component when it is determined that the connected component is in contact A minimum unit creation process for creating a new minimum unit, a two-dimensional connection relationship creation process for obtaining a two-dimensional connection relationship between the minimum units arranged in a two-dimensional space, A character segmentation candidate output process for creating a combination pattern by combining the minimum units with each other and outputting a combination pattern likely to be a correct character as a character segmentation candidate; Complement based on, performs the recognition processing and language knowledge processing and layout analysis, character segmentation program for executing a character segmentation decision process of outputting the character extraction result to the computer.

27. The minimum unit creation processing includes the steps of: averaging the width and height of all connected components extracted from the character string image;
27. The character segmentation program according to claim 25, further comprising a process of referring to the width and height of the connected component of interest to statistically determine whether the connected component is in contact.

28. The minimum unit creation processing further includes the step of referring to an image pattern representing the shape of the target connected component,
28. The character segmenting program according to claim 27, further comprising a process of statistically determining whether or not the connected component is in contact.

29. In the two-dimensional connection relationship creation processing, when obtaining the adjacent relationship of the minimum unit extracted from the character string image, it is determined that the minimum units sharing the Voronoi side by Voronoi division have an adjacent relationship. 29. The method according to claim 25, further comprising a step of determining that the minimum units are not in an adjacent relationship when the adjacent minimum units share two Voronoi sides. Character extraction program.

30. A process for judging the adjacency relationship of the minimum unit, comprising: (a) a process of creating a mother point on all the contours of the minimum unit; and (b) a process of using all the created mother points. (C) paying attention to a certain Voronoi edge in the Voronoi edge, and if the Voronoi edge is obtained from the same minimum unit between the generating points, the Voronoi edge (D) determining whether all Voronoi sides have been focused on, and returning to (c) when there is a Voronoi side that has not been focused on; (e) the same Voronoi side 30. The computer-readable storage medium according to claim 29, further comprising: a process of determining that the minimum units sharing the same are adjacent to each other.

31. The two-dimensional connection relationship creation process, wherein the width of each of the plurality of minimum units determined to be adjacent to each other,
Height and center coordinate value, and width, height and center coordinate value of a circumscribed rectangle created by combining the plurality of minimum units,
31. The character segmentation program according to claim 29, further comprising a process of statistically evaluating a connection relationship by referring to an average value of widths and heights of all minimum units extracted from the character string image.

32. The two-dimensional connection relationship creation processing further includes:
32. The character cutout program according to claim 31, further comprising a process of obtaining a connection relationship by referring to an image pattern representing each shape of the plurality of minimum units.

33. A process for creating a plurality of minimum unit combination patterns by following the connection relationship obtained in the two-dimensional connection relationship creation step in the character cutout candidate output process when creating the combination pattern. 33. The character segmentation program according to claim 25, comprising:

34. The character cutout candidate output process refers to the width, height, and center coordinate value of the combination pattern and the average value of the width and height of all minimum units extracted from the character string image. 34. The character cutout program according to claim 33, further comprising a process of statistically evaluating to determine whether to output the combination pattern as a character cutout candidate.

35. In the character cutout candidate output processing, the width, height and center coordinate value of the combination pattern, the width, height and center coordinate value of the minimum unit adjacent to the combination pattern, and the combination pattern The width, height, and center coordinate value of a circumscribed rectangle created by combining the minimum unit and the average value of the width and height of all minimum units extracted from the character string image, The character cutout program according to claim 33, further comprising a process of statistically evaluating the same number of times as the number of the minimum units adjacent to the pattern to determine whether to output the combination pattern as a character cutout candidate.

36. In the character cutout candidate output processing, the combination pattern is further referred to by referring to an image pattern representing the shape of the combination pattern and an image pattern representing the shape of the minimum unit adjacent to the combination pattern. 36. The character extraction program according to claim 34, further comprising a process of determining whether or not to output a character as a character extraction candidate.

37. A process for determining whether or not to output as a character segmentation candidate, comprising: (a) a process of creating a combination pattern having a combination number of 1 to N; a process of focusing on the combination pattern CPc of, (c) is connected to the CP _C l th the minimum unit PS
A process for focusing on the _Cl, the process for outputting determination obtained between the PS _Cl (d) and the CP _C, it is determined whether or not focusing on all l (e), are focused When there is no l, the process returns to (c). (F) It is determined whether all Cs have been focused on, and when there is a C that has not been focused on, the process returns to (b). 37. The character cutout program according to any one of claims 34 to 36, further comprising: g) outputting the combination pattern determined to be outputtable in (d) as a character cutout candidate.