JP2010039615A

JP2010039615A - Character recognition method and character recognition apparatus

Info

Publication number: JP2010039615A
Application number: JP2008199477A
Authority: JP
Inventors: Minenobu Seki; 峰伸関; Katsumi Marukawa; 勝美丸川; Hiroshi Shinjo; 広新庄; Takeshi Eisaki; 健永崎
Original assignee: Hitachi Omron Terminal Solutions Corp
Current assignee: Hitachi Omron Terminal Solutions Corp
Priority date: 2008-08-01
Filing date: 2008-08-01
Publication date: 2010-02-18
Anticipated expiration: 2028-08-01
Also published as: JP5039659B2

Abstract

<P>PROBLEM TO BE SOLVED: To reduce processing time and obtain an accurate character recognition result in recognizing characters printed in half-, full-, and double-width characters even if characters are blurred due to contact and joint components are divided. <P>SOLUTION: A half-width likelihood indicating the degree of half-width likeliness of characters in a character line, a double-width likelihood indicating the degree of double-width likeliness, a character blur likelihood indicating the degree of character blur, and a character contact likelihood indicating the degree of character contact are calculated. Pre-segmentation is performed using the character contact likelihood and double-width likelihood, and forced separation is performed using the character blur likelihood and half-width likelihood. Character recognition is performed on character patterns resulting from the pre-segmentation and forced separation. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、入力された画像から文字を認識し、テキスト情報を取得する文字認識装置に関する。 The present invention relates to a character recognition device that recognizes characters from an input image and acquires text information.

文字認識処理では、始めに、文字が並んでいる領域である文字行を入力画像から抽出する文字行抽出処理を実行し、文字行の中から一つ一つの文字を切り出す文字切出処理を実行し、切り出された文字である文字パタンの特徴を抽出する。次に、予め複数の文字種の特徴を辞書として用意しておき、用意された辞書の特徴と認識対象である文字パタンの特徴とをマッチングする文字識別処理を実行し、マッチングの度合いである類似度（識別尤度）が最も高い文字種を認識結果として出力する。 In the character recognition process, first, the character line extraction process that extracts the character line that is the area where the characters are arranged from the input image is executed, and the character extraction process that extracts each character from the character line is executed. Then, the feature of the character pattern that is the extracted character is extracted. Next, characteristics of a plurality of character types are prepared in advance as a dictionary, a character identification process for matching the characteristics of the prepared dictionary and the characteristics of the character pattern to be recognized is executed, and the degree of similarity that is the degree of matching The character type having the highest (identification likelihood) is output as the recognition result.

文字切出処理では、黒画素の連なりである連結成分を生成し、連結成分又は連結成分を上下方向にまとめたエレメントを構成要素の単位とし、それらの構成要素を組み合わせることによって文字パタンを生成するプレセグメンテーション処理を行う。以後、構成要素の単位をエレメントとするが、構成要素の単位は連結成分となることもある。文字がつぶれる、又は文字が接近して印刷されることによって、文字同士が接触する場合には、複数の文字が合わさって一つのエレメントとなってしまうことがある。そのため、文字切出処理は、1つのエレメントを切断し、複数のエレメントを生成する強制切断処理を含む。 In the character extraction process, a connected component that is a series of black pixels is generated, and a character pattern is generated by combining the connected components or an element obtained by combining the connected components in the vertical direction as a unit. Perform pre-segmentation processing. Hereinafter, the unit of the constituent element is referred to as an element, but the unit of the constituent element may be a connected component. When the characters are crushed or the characters are printed close to each other and the characters come into contact with each other, a plurality of characters may be combined into one element. Therefore, the character cutting process includes a forced cutting process that cuts one element and generates a plurality of elements.

一般に、印刷される文字には半角、全角、倍角の文字幅がある。従来は、特許文献１に記載されているように、文字認識する場合には、どの文字幅の文字が記載されているかわからないため、印刷された文字がどの文字幅であるかを判定し、判定された文字幅で文字を切り出し、切り出された文字について文字識別処理を実行していた。 In general, characters to be printed have character widths of half-width, full-width, and double-width. Conventionally, as described in Patent Document 1, when character recognition is performed, since it is not known which character width is described, it is determined which character width the printed character is, The character is cut out with the cut character width, and the character identification process is executed on the cut out character.

また、手書き文字のように文字幅が特定できない場合は、文字切り出し処理において、一つ一つの文字の区切りが必ずしも一意に決められない。そのため複数の文字切り出し候補を生成し、それらのネットワークである切り出しネットワークを構成し、文字識別結果である識別尤度が高くなるように文字識別結果の並びを選択する最適パス探索を実行し、認識結果を得る。 Further, when the character width cannot be specified as in the case of handwritten characters, the character segmentation is not necessarily uniquely determined in the character segmentation process. Therefore, a plurality of character segmentation candidates are generated, a segmentation network that is a network of them is generated, and an optimal path search is performed to select a sequence of character identification results so that the identification likelihood that is the character identification result is high, and recognition is performed. Get results.

また、予め単語のリストである単語辞書を用意しておき、文字パタンの識別尤度が高い複数の文字種を組み合わせて、単語辞書内の単語と照合し、最適パス探索において照合した文字種の組み合わせを優先して認識結果を得ることもできる（単語照合）。 In addition, a word dictionary that is a list of words is prepared in advance, a plurality of character types having a high likelihood of character pattern identification are combined, collated with words in the word dictionary, and combinations of character types collated in the optimum path search are determined. The recognition result can also be obtained with priority (word matching).

さらに、予め単語リスト内にある単語Ａと単語Ｂが連続して出現する確率である単語バイグラムを用意しておき、最適パス探索において連続する単語の確率を考慮することもできる。単語照合や単語バイグラムを用いることによって、文字識別結果の第一位の候補が不正解であったとしても、不正解とされた候補を修正した認識結果を得ることができる。これを知識処理と呼ぶ。これらの処理は、国際ワークショップthe 10th International Workshop on Frontiers in Handwriting Recognitionにおいて開示されている。
特開平０６−１５００５５号公報 Katsumi Marukawa、外２名,"Online Recognition of Unconstrained Handwritten Japanese Text Using Statistical Information",２００６年 Furthermore, it is possible to prepare a word bigram, which is a probability that words A and B appear in the word list in advance, and to consider the probability of consecutive words in the optimum path search. By using word matching or word bigram, even if the first candidate of the character identification result is incorrect, a recognition result obtained by correcting the incorrect candidate can be obtained. This is called knowledge processing. These processes are disclosed in the international workshop on Frontiers in Handwriting Recognition.
Japanese Patent Laid-Open No. 06-150055 Katsumi Marukawa, 2 others, "Online Recognition of Unconstrained Handwritten Japanese Text Using Statistical Information", 2006

しかし、印刷された文字を認識する際、漢字やひらがなは連結成分が左右に分かれる場合があるため、特許文献１に記載された方法を用いたとしても、半角、全角、倍角の判別結果が必ず正しいとは限らない。さらに、つぶれて接触した文字を多く含む場合、及び文字がかすれて連結成分が細かく分かれている場合は、より半角、全角、倍角の判別が難しくなる。もし、半角、全角、倍角の判定を誤った場合、誤った文字認識結果を出力する可能性が高い。これに対し、上記手書き文字認識の方法のように複数の文字切り出しの候補を生成し、最適な文字パタンの組み合わせを選択することにより、精度の高い認識結果が得られる。 However, when recognizing printed characters, kanji, hiragana, and connected components may be divided into left and right, so even if the method described in Patent Document 1 is used, the discrimination result of half-width, full-width, and double-width is always It is not always correct. Furthermore, when many characters that are crushed and touched are included, and when characters are blurred and the connected components are finely divided, it is more difficult to distinguish between half-width, full-width, and double-width. If the determination of half-width, full-width, or double-width is wrong, there is a high possibility that an erroneous character recognition result will be output. On the other hand, by generating a plurality of character segmentation candidates as in the above-described handwritten character recognition method and selecting an optimum combination of character patterns, a highly accurate recognition result can be obtained.

しかし、前述した手書き文字認識の方法は、半角、全角、倍角を想定したすべての文字切り出しの候補を生成することになり、生成される文字パタンの数が多くなるため、処理時間が多くなり、誤って切り出した文字パタンを誤認識するという問題が発生する。 However, the above-described handwritten character recognition method generates all character extraction candidates assuming half-width, full-width, and double-width, and increases the number of character patterns to be generated. There arises a problem of erroneously recognizing erroneously cut out character patterns.

本発明は、半角、全角、倍角で印刷された文字を認識する際に、つぶれて接触した文字を多く含む場合、及び文字がかすれて連結成分が細かく分かれている場合にも、処理時間を抑制し、かつ、高精度な文字認識結果を得ることを目的とする。 When recognizing characters printed in half-width, full-width, or double-width, the present invention suppresses processing time even when many characters are crushed and touched, and when characters are blurred and connected components are finely divided. In addition, an object is to obtain a highly accurate character recognition result.

本発明の代表的な一例を示せば以下の通りである。 A typical example of the present invention is as follows.

すなわち、本発明の一実施形態によると、画像中から文字を認識する文字認識方法であって、文字を含む画像から文字を構成する黒画素の連なりである連結成分を生成する連結成分生成処理と、文字が並んでいる領域となる文字行を抽出する文字行抽出処理と、文字行内にある文字の半角らしさの度合いを示す半角尤度を算出する半角判別処理、文字行内にある倍角らしさの度合いを示す倍角尤度を算出する倍角判別処理、文字行内にある文字のかすれ度合いを示す文字かすれ尤度を算出する文字かすれ判別処理、文字行内にある文字の接触度合いを示す文字接触尤度を算出する文字接触判別処理、文字行内へのノイズの混入している度合いを示すノイズ尤度を算出するノイズ判別処理と、文字接触尤度及び倍角文字尤度を用いて、1つ又は複数の連結成分を組み合わせた情報である文字パタンを生成するプレセグメンテーション処理と、文字かすれ尤度及び半角文字尤度を用いて、連結成分を切断する文字パタンを選択し、連結成分の切断する位置を決定する強制切断処理と、前記プレセグメンテーション処理と強制切断処理により生成された文字パタンの特徴を抽出し、予め用意された識別辞書にある各文字種の特徴との類似度を算出する文字識別処理と、を備えることを特徴とする。 That is, according to one embodiment of the present invention, there is provided a character recognition method for recognizing characters from an image, and a connected component generation process for generating a connected component that is a series of black pixels constituting a character from an image including characters. , Character line extraction processing to extract the character line that is the area where the characters are lined up, half-width discrimination processing to calculate the half-width likelihood indicating the degree of half-width character of the character in the character line, degree of double-angle likelihood in the character line Double angle discrimination processing to calculate the double angle likelihood indicating the character, character blur detection processing to calculate the character blur likelihood indicating the blurring degree of the character in the character line, character contact likelihood indicating the contact degree of the character in the character line Character contact determination processing, noise determination processing for calculating a noise likelihood indicating the degree of noise mixed in the character line, and character contact likelihood and double-width character likelihood. Select the character pattern that cuts the connected component using the pre-segmentation process that generates the character pattern that is a combination of several connected components and the character blur likelihood and half-width character likelihood, and the position where the connected component is cut And a character identification process for extracting the characteristics of the character pattern generated by the pre-segmentation process and the forced cutting process and calculating the similarity with the characteristics of each character type in the prepared identification dictionary And.

本発明の一実施形態によると、半角、全角、倍角で印刷された文字を認識する処理時間を抑制し、かつ、高精度な文字認識結果を得ることが可能となる。 According to an embodiment of the present invention, it is possible to suppress processing time for recognizing characters printed in half-width, full-width, and double-width, and obtain a highly accurate character recognition result.

以下、本発明を適用した文字認識方法及び文字認識装置について、図面を用いて詳細に説明する。 Hereinafter, a character recognition method and a character recognition device to which the present invention is applied will be described in detail with reference to the drawings.

図２は、本発明の実施の形態に係る文字認識装置の構成を示すブロック図である。 FIG. 2 is a block diagram showing the configuration of the character recognition apparatus according to the embodiment of the present invention.

本発明の実施形態の文字認識装置２０８は、通信装置２０１、画像取得装置２０２、表示装置２０３、外部記憶装置２０４、メモリ２０５、ＣＰＵ(Central Processing Unit)２０６、及び、キーボードやマウスなどの入力装置２０７を備え、これらの各構成はＰＣＩバスなどの通信線で接続されている。 A character recognition device 208 according to an embodiment of the present invention includes a communication device 201, an image acquisition device 202, a display device 203, an external storage device 204, a memory 205, a CPU (Central Processing Unit) 206, and an input device such as a keyboard and a mouse. These components are connected by a communication line such as a PCI bus.

本発明の実施形態の文字認識方法を実行するプログラムは、外部記憶装置２０４及び／又はメモリ２０５等の記憶装置に格納され、ＣＰＵ２０６によって実行される。ＣＰＵ２０６に入力される文字を含む画像は、スキャナ、ＯＣＲ又はデジタルカメラなどの画像取得装置２０２又は通信装置２０１から入力されてもよい。また、文字を含む画像は、外部記憶装置２０４に記憶されてもよい。 A program for executing the character recognition method according to the embodiment of the present invention is stored in a storage device such as the external storage device 204 and / or the memory 205 and executed by the CPU 206. An image including characters input to the CPU 206 may be input from the image acquisition device 202 or the communication device 201 such as a scanner, an OCR, or a digital camera. In addition, an image including characters may be stored in the external storage device 204.

文字認識の結果は、表示装置２０３に出力される。また、文字認識の結果は、通信装置２０１を経由して外部に出力されてもよく、文字認識装置２０８で実行される他のプログラムに用いられてもよい。 The result of character recognition is output to the display device 203. Further, the result of character recognition may be output to the outside via the communication device 201 or may be used for another program executed by the character recognition device 208.

図１は、本発明の実施の形態の文字認識処理の全体を示すフローチャートである。 FIG. 1 is a flowchart showing the entire character recognition process according to the embodiment of the present invention.

はじめに、文字行抽出処理１０２において、入力画像１０１を入力とし、文字が並んでいる領域となる文字行画像が抽出される。 First, in the character line extraction process 102, the input image 101 is input, and a character line image that is an area where characters are arranged is extracted.

図３は、本発明の実施の形態の文字行抽出処理１０２の結果の一例を示す説明図である。 FIG. 3 is an explanatory diagram illustrating an example of a result of the character line extraction processing 102 according to the embodiment of this invention.

以後の説明では、文字行画像上の位置は、文字行画像の左上端を原点とし、左右方向の右をｘ軸の正の方向、上下方向の下をｙ軸の正の方向とする。 In the following description, the position on the character line image is the origin at the upper left corner of the character line image, the right in the left-right direction is the positive direction of the x-axis, and the lower direction in the vertical direction is the positive direction of the y-axis.

次に、連結成分／エレメント生成処理１０３では、入力された文字行画像から、黒画素の連なりである連結成分を生成し、上下方向に重なる連結成分をまとめたもの、例えば外接矩形であるエレメントを生成する。 Next, in the connected component / element generation process 103, a connected component that is a series of black pixels is generated from the input character line image, and the connected components that overlap in the vertical direction are collected, for example, an element that is a circumscribed rectangle. Generate.

その後、倍角判別処理１０４、文字接触判別処理１０５、半角判別処理１０６、文字かすれ判別処理１０７及びノイズ判別処理１０８を実行する。これらの処理は、並行して独立に実行されることが望ましいが、順に実行されてもよい。 Thereafter, a double-angle discrimination process 104, a character contact discrimination process 105, a half-angle discrimination process 106, a character blurring discrimination process 107, and a noise discrimination process 108 are executed. These processes are desirably executed independently in parallel, but may be executed in order.

その後、プレセグメンテーション処理１０９を実行する。プレセグメンテーション処理１０９では、半角尤度、倍角尤度、文字接触尤度、文字かすれ尤度、ノイズ尤度、エレメント、連結成分及び文字行画像が入力され、基準となるエレメントを決め、基準のエレメントを左端にするように文字パタンが生成される。 Thereafter, a pre-segmentation process 109 is executed. In the pre-segmentation processing 109, half-angle likelihood, double-angle likelihood, character contact likelihood, character blur likelihood, noise likelihood, element, connected component, and character line image are input, a reference element is determined, and a reference element A character pattern is generated so that is at the left end.

その後、強制切断処理１１０を実行し、一つの文字パタンを数の文字パタンに分離する。強制切断処理１１０でも、半角尤度、倍角尤度、文字接触尤度、文字かすれ尤度、ノイズ尤度、エレメント、連結成分及び文字行画像が用いられる。 Then, the forced cutting process 110 is executed to separate one character pattern into a number of character patterns. Also in the forced cutting process 110, half-width likelihood, double-angle likelihood, character contact likelihood, character blur likelihood, noise likelihood, element, connected component, and character line image are used.

その後、文字識別１１１が実行される。文字識別１１１では、プレセグメンテーション処理１０９及び強制切断処理１１０によって生成された文字パタンの特徴が抽出され、予め用意された識別辞書にある各文字種の特徴との類似度が算出される。 Thereafter, character identification 111 is executed. In the character identification 111, the characteristics of the character patterns generated by the pre-segmentation process 109 and the forced cutting process 110 are extracted, and the similarity with the characteristics of each character type in the identification dictionary prepared in advance is calculated.

その後、知識処理１１２が実行される。知識処理１１２では、文字切り出しネットワークと文字識別結果が入力され、文字行画像の文字認識結果が出力する。 Thereafter, knowledge processing 112 is executed. In the knowledge processing 112, the character segmentation network and the character identification result are input, and the character recognition result of the character line image is output.

その後、認識結果利用文字幅検定１１３を行って、文字認識結果１１４を得る。 Thereafter, a recognition result use character width test 113 is performed to obtain a character recognition result 114.

次に、各処理の詳細について説明する。まず、連結成分／エレメント生成処理１０３の詳細について説明する。 Next, details of each process will be described. First, details of the connected component / element generation process 103 will be described.

図４は、本発明の実施の形態の連結成分の例を示す図である。 FIG. 4 is a diagram illustrating an example of a connected component according to the embodiment of this invention.

文字行画像の連結画素を囲む外接矩形が、一つの連結成分として生成される。 A circumscribed rectangle surrounding the connected pixels of the character line image is generated as one connected component.

図４に示す例では、「秋」を示す画像には黒枠の外接矩形で囲われた４つの連結成分が含まれる。 In the example illustrated in FIG. 4, an image indicating “autumn” includes four connected components surrounded by a circumscribed rectangle with a black frame.

図５は、本発明の実施の形態のエレメントの例を示す図である。 FIG. 5 is a diagram showing an example of an element according to the embodiment of the present invention.

連結成分を垂直方向にまとめた外接矩形が、一つのエレメントとして生成される。 A circumscribed rectangle in which connected components are collected in the vertical direction is generated as one element.

図５に示す例では、「秋」を示す画像には黒枠で囲われた２つのエレメントが含まれる。 In the example shown in FIG. 5, the image indicating “autumn” includes two elements surrounded by a black frame.

次に、倍角判別処理１０４、文字接触判別処理１０５、半角判別処理１０６、文字かすれ判別処理１０７及びノイズ判別処理１０８の詳細について説明する。これらの処理の際、文字行の高さCLhと幅CLwを用いる。 Next, details of the double-angle discrimination process 104, the character contact discrimination process 105, the half-angle discrimination process 106, the character blurring discrimination process 107, and the noise discrimination process 108 will be described. In these processes, the height CLh and the width CLw of the character line are used.

CLhとCLwの最も簡単な算出式は、
（式１）CLh = 黒画素のy座標最大値 - y座標最小値 + 1
（式２）CLw = 黒画素のx座標最大値 - x座標最小値 + 1
である。 The simplest formula for calculating CLh and CLw is
(Expression 1) CLh = maximum y-coordinate value of black pixel-minimum y-coordinate value + 1
(Expression 2) CLw = black pixel x-coordinate maximum value-x-coordinate minimum value + 1
It is.

この文字行の高さと幅の算出方法は文字行の傾きを考慮していないが、文字行の傾きを考慮してCLhとCLwを算出してもよい。 Although the method for calculating the height and width of the character line does not consider the inclination of the character line, CLh and CLw may be calculated in consideration of the inclination of the character line.

そして、エレメントEの幅をEwとし、文字行画像内にn個のエレメント｛E1，…，Ei，…，En｝が存在する場合、それぞれのエレメントの幅を｛Ew1，…，Ewi，…，Ewn｝とする。さらに、連結成分Sの幅をSw及び高さをShとし、文字行画像内にm個の連結成分｛S1，…，Si，…，Sm｝が存在する場合、それぞれの幅を｛Sw1，…，Swi，…，Swm｝及び高さを｛Sh1，…，Shi，…，Shm｝とする。 If the width of the element E is Ew and there are n elements {E1, ..., Ei, ..., En} in the character line image, the width of each element is set to {Ew1, ..., Ewi, ..., Ewn}. Furthermore, when the width of the connected component S is Sw and the height is Sh, and m connected components {S1,..., Si,..., Sm} exist in the character line image, each width is set to {Sw1,. , Swi, ..., Swm} and the height is {Sh1, ..., Shi, ..., Shm}.

倍角判別処理１０４では、文字行画像、連結成分及びエレメントを入力し、文字行画像内の文字が倍角の文字である度合いを示す倍角尤度Ldを算出する。文字行画像、連結成分及びエレメントを利用し、Ldを算出する方法は、様々考えられる。 In the double angle discrimination process 104, a character line image, a connected component, and an element are input, and a double angle likelihood Ld indicating the degree to which the character in the character line image is a double angle character is calculated. There are various methods for calculating Ld using a character line image, a connected component, and an element.

例えば、以下に示す方法を採用することができる。
設定値Vdを予め定めておき、この設定値Vdを用いて、
（条件式１）Ewi > CLh×Vd
を満たすエレメントのEwiの総和Elwtを算出する。次にElwtを用いて、倍角尤度Ldは、
（式３）Ld = Elwt／ΣEwi
と算出できる。 For example, the following method can be employed.
A set value Vd is determined in advance, and using this set value Vd,
(Condition 1) Ewi> CLh × Vd
Calculate the total Ewt of Ewi of the elements that satisfy. Next, using Elwt, the double-angle likelihood Ld is
(Formula 3) Ld = Elwt / ΣEwi
And can be calculated.

このとき、Vdを１より大きい値に設定することによって、Ldは文字行中に幅の広いエレメントがどのくらいの割合で存在するかを示す値となる。そのため、Ldが大きいほど文字行内の文字が倍角である可能性が高いことになる。 At this time, by setting Vd to a value larger than 1, Ld becomes a value indicating how much wide elements exist in the character line. Therefore, the larger Ld is, the higher the possibility that the characters in the character line are double angle.

半角判別処理１０６では、文字行画像、連結成分及びエレメントを入力し、文字行画像内の文字が半角の文字である度合いを示す半角尤度Lhを算出する。文字行画像、連結成分及びエレメントを利用し、Lhを算出する方法は、様々考えられる。 In the half-width discrimination process 106, a character line image, a connected component, and an element are input, and a half-width likelihood Lh indicating the degree to which the characters in the character line image are half-width characters is calculated. There are various methods for calculating Lh using a character line image, a connected component, and an element.

例えば、以下に示す方法を採用することができる。
CLhとEwi及びエレメント数nを用いて、半角尤度Lhは、
（式４）Lh = CLh×n／ΣEwi
と算出できる。 For example, the following method can be employed.
Using CLh and Ewi and the number of elements n, the half-width likelihood Lh is
(Formula 4) Lh = CLh × n / ΣEwi
And can be calculated.

Lhは、文字行の高さに対する文字幅の割合の平均を算出し、算出された平均の逆数である。そのため、Lhが大きいほど半角である可能性が高いことになる。 Lh is the reciprocal of the calculated average of the ratio of the character width to the height of the character line. Therefore, the greater the Lh, the higher the probability of being a half-width.

文字接触判別処理１０５では、文字行画像、連結成分及びエレメントを入力し、文字行画像内の文字が接触している度合いを示す文字接触尤度Lcを算出する。文字行画像、連結成分及びエレメントを利用し、Lcを算出する方法は、様々考えられる。 In the character contact determination process 105, a character line image, a connected component, and an element are input, and a character contact likelihood Lc indicating the degree of contact of characters in the character line image is calculated. There are various methods for calculating Lc using a character line image, a connected component, and an element.

例えば、以下に示す方法を採用することができる。
設定値Vcを予め定めておき、この設定値Vcを用いて、
（条件式２）Ew／CLh > Vc
を満たすエレメントの数Ncを算出する。次にNcを用いて、文字接触尤度Lcは、
（式５）Lc = Nc×CLh／ΣEwi
と算出できる。 For example, the following method can be employed.
A set value Vc is determined in advance, and using this set value Vc,
(Condition 2) Ew / CLh> Vc
The number Nc of elements satisfying the condition is calculated. Next, using Nc, the character contact likelihood Lc is
(Formula 5) Lc = Nc x CLh / ΣEwi
And can be calculated.

例えば、Vcを２．５と設定することによって、倍角文字よりも幅の広い連結成分が文字行内にどのくらいの割合で存在するかを示す値となる。そのため、Lcが大きいほど文字行内の文字の接触している度合いが高いことになる。 For example, by setting Vc to 2.5, it becomes a value indicating how much a connected component having a width wider than a double-width character is present in the character line. Therefore, the larger the Lc, the higher the degree of contact of characters in the character line.

文字かすれ判別処理１０７では、文字行画像、連結成分及びエレメントを入力とし、文字行画像内の文字がかすれている度合いを示す文字かすれ尤度Lbを算出する。文字行画像、連結成分及びエレメントを利用し、Lbを算出する方法は、様々考えられる。 In the character fading determination process 107, the character line image, the connected component, and the element are input, and the character fading likelihood Lb indicating the degree of fading of the character in the character line image is calculated. There are various methods for calculating Lb using a character line image, a connected component, and an element.

例えば、以下に示す方法を採用することができる。
設定値Vb1、Vb2を予め定めておき、設定値Vb1、Vb2を用いて、
（条件式３）Vb1 < Swi <Vb2 かつ Vb1 < Shi < Vb2
を満たす連結成分の数Nbを算出する。次にNbを用いて、文字かすれ尤度Lbは、
（式６）Lb = Nb×CLh／ΣEwi
と算出できる。 For example, the following method can be employed.
Set values Vb1 and Vb2 are determined in advance, and using set values Vb1 and Vb2,
(Condition 3) Vb1 <Swi <Vb2 and Vb1 <Shi <Vb2
The number Nb of connected components that satisfy the above is calculated. Next, using Nb, the character fading likelihood Lb is
(Formula 6) Lb = Nb × CLh / ΣEwi
And can be calculated.

例えば、Vb1の値を１／３、Vb2の値を２／３と調整設定することによって、文字行の高さに対して、連結成分の高さと幅とが１／３から２／３である連結成分が文字行内にどのくらいの割合で存在するかを示す値となる。文字がかすれた場合、本来の文字の連結成分が途切れ途切れになるため、途切れ途切れになった連結成分の存在する割合は、文字行内の文字がかすれている度合いとなる。そのため、Lbが大きいほど文字行内の文字がかすれている度合いが高いことになる。 For example, by adjusting and setting the value of Vb1 to 1/3 and the value of Vb2 to 2/3, the height and width of the connected component is 1/3 to 2/3 with respect to the height of the character line. This is a value indicating how much of the connected component is present in the character line. When a character is faint, the connected component of the original character is interrupted. Therefore, the proportion of the discontinuous connected component is a degree that the character in the character line is faint. For this reason, the greater the Lb, the higher the degree of fading of characters in the character line.

ノイズ判別処理１０８では、文字行画像、連結成分及びエレメントを入力し、文字行内にノイズが存在する度合いを示すノイズ尤度Lnを算出する。文字行画像、連結成分及びエレメントを利用し、Lnを算出する方法は、様々考えられる。例えば、以下に示す方法を採用することができる。
設定値Vnを予め定めておき、設定値Vnを用い、
（条件式４）Shi < Vn かつ Swi < Vn
を満たす連結成分の数Nnを算出する。次にNnを用いて、ノイズ尤度Lnは、
（式７）Ln = Nn×CLh／ΣEwi
と算出できる。 In the noise discrimination processing 108, a character line image, a connected component, and an element are input, and a noise likelihood Ln indicating a degree of noise in the character line is calculated. There are various methods for calculating Ln using character line images, connected components, and elements. For example, the following method can be employed.
Predetermining the set value Vn, using the set value Vn,
(Condition 4) Shi <Vn and Swi <Vn
The number Nn of connected components satisfying the above is calculated. Next, using Nn, the noise likelihood Ln is
(Expression 7) Ln = Nn × CLh / ΣEwi
And can be calculated.

VLnは、文字行の高さに対して、連結成分の高さと幅とがVnより小さい連結成分が文字行内にどのくらいの割合で存在するかを示す値となる。例えば一文字サイズの領域内における小さい連結成分の存在割合では、ノイズがあるか否かの判別は難しい。これに対し、文字行の中における小さい連結成分が存在する割合を用いることで、ノイズが存在しているか否かを判別しやすくなる。そのため、Lnが大きいほど文字行内にノイズが存在している度合いが高いことになる。 VLn is a value indicating the proportion of connected components in the character line in which the height and width of the connected component are smaller than Vn with respect to the height of the character line. For example, it is difficult to determine whether or not there is noise in the presence ratio of a small connected component in an area of one character size. On the other hand, it is easy to determine whether or not noise is present by using the proportion of small connected components in the character line. Therefore, the greater the Ln, the higher the degree of noise present in the character line.

次に、プレセグメンテーション処理１０９の詳細について説明する。プレセグメンテーション処理１０９では、基準のエレメントの左端から一定距離Px内にあるエレメントが含まれる文字パタンが複数生成される。 Next, details of the pre-segmentation processing 109 will be described. In the pre-segmentation process 109, a plurality of character patterns including elements within a certain distance Px from the left end of the reference element are generated.

図６は、本発明の実施の形態のプレセグメンテーション処理１０９の一例を示す説明図である。図６に示す例では、「語い」を構成するエレメント四つが存在した場合に、エレメント１を基準のエレメントにして生成された文字パタンを示している。プレセグメンテーション処理１０９では、エレメント２を基準のエレメントにした文字パタン、エレメント３を基準のエレメントにした文字パタン、エレメント４を基準のエレメントにした文字パタンも生成される。図６に示す例では、文字パタン２が一文字分のエレメントのみを含み、文字パタン１は一部が欠け、文字パタン３と文字パタン４は余計なエレメントを含んでいる。 FIG. 6 is an explanatory diagram illustrating an example of the pre-segmentation process 109 according to the embodiment of this invention. In the example shown in FIG. 6, when there are four elements constituting “vocabulary”, a character pattern generated using element 1 as a reference element is shown. In the pre-segmentation process 109, a character pattern using element 2 as a reference element, a character pattern using element 3 as a reference element, and a character pattern using element 4 as a reference element are also generated. In the example shown in FIG. 6, the character pattern 2 includes only an element for one character, the character pattern 1 is partially missing, and the character pattern 3 and the character pattern 4 include extra elements.

図７は、本発明の実施の形態の全角文字のプレセグメンテーション処理１０９の一例を示す説明図、図８は、本発明の実施の形態の倍角文字のプレセグメンテーション処理１０９の一例を示す説明図、図９は、本発明の実施の形態の半角文字のプレセグメンテーション処理１０９の一例を示す説明図である。 FIG. 7 is an explanatory diagram illustrating an example of the full-width character pre-segmentation processing 109 according to the embodiment of the present invention. FIG. 8 is an explanatory diagram illustrating an example of the double-byte character pre-segmentation processing 109 according to the embodiment of the present invention. FIG. 9 is an explanatory diagram illustrating an example of the pre-segmentation process 109 for half-width characters according to the embodiment of this invention.

図７、図８、図９に示すように、全角文字と倍角文字と半角文字とでは一文字分のエレメント間の距離が異なる。必ず倍角文字を生成するようにPxの値を設定した場合には、プレセグメンテーション処理１０９は、図６の文字パタン３や文字パタン４のような余分な文字パタンを生成してしまう。逆に全角文字を生成するようにPxの値を設定すると、倍角文字を含む文字行において、図６の文字パタン１のような文字パタンが生成される。 As shown in FIGS. 7, 8, and 9, the distance between the elements for one character differs between full-width characters, double-width characters, and half-width characters. When the value of Px is set so as to always generate a double-width character, the pre-segmentation processing 109 generates an extra character pattern such as the character pattern 3 and the character pattern 4 in FIG. Conversely, when the value of Px is set so as to generate a full-width character, a character pattern such as the character pattern 1 in FIG. 6 is generated in a character line including a double-width character.

このように、Pxが小さすぎると一つの文字を構成するエレメントを含む文字パタンが生成されず、文字認識精度が下がる。また、Pxが大きすぎると余分な文字パタンが増加するため、処理時間が多くなる。さらに、文字のかすれ及び文字接触によって、余分な文字パタンを高い尤度で認識してしまう可能性もあるため、文字認識精度が下がることがある。 Thus, if Px is too small, a character pattern including elements constituting one character is not generated, and the character recognition accuracy is lowered. Also, if Px is too large, extra character patterns increase, and processing time increases. Furthermore, since there is a possibility that an extra character pattern may be recognized with high likelihood due to character blurring and character contact, character recognition accuracy may be lowered.

本発明の実施の形態では、半角尤度Lh、倍角尤度Ld、文字かすれ尤度Lb、文字接触尤度Lc、ノイズ尤度Lnを用いて、Pxの値を制御することによって、半角、全角、倍角の文字を含む文字行であっても、正しい文字パタンを生成し、余分な文字パタンの増加を抑制する。このとき、Pxの値は、各尤度においてPxがどのように変化するかを示すテーブルを用いて決められる。 In the embodiment of the present invention, half-width, full-width, Lh, double-angle likelihood Ld, character blur likelihood Lb, character contact likelihood Lc, and noise likelihood Ln are used to control the value of Px. Even if a character line includes double-width characters, a correct character pattern is generated, and an increase in extra character patterns is suppressed. At this time, the value of Px is determined using a table indicating how Px changes at each likelihood.

具体的には、文字接触が多い文字行では、接触している複数の文字を一つのエレメントと認識し、生成されるエレメントの幅が大きくなり倍角尤度は高くなる。このとき、半角尤度Lhと文字接触尤度Lcとを用いて、（Lh−Lc）の値が一定閾値よりも低い場合にのみPxの値を大きくすることによって、余分な文字パタンの生成が抑制される。また、文字かすれ尤度Lbが一定閾値よりも低い場合にのみPxの値を大きくすることによって、余分な文字パタンの生成が抑制される。また、倍角尤度Ldが所定の閾値よりも高い場合にのみPxの値を大きくすることによって、倍角文字を含む文字行において正しい文字パタンを生成し、かつ全角文字を含む文字行において、余分な文字パタンの生成が抑制される。ここで、ノイズ尤度Lnが一定閾値よりも高い場合は、設定値Vpより幅が小さく、かつ設定値Vpより高さが小さいエレメントを除いて文字パタンを生成すると、さらに余分な文字パタンの生成を抑制することができる。 Specifically, in a character line with many character touches, a plurality of touching characters are recognized as one element, the width of the generated element is increased, and the double likelihood is increased. At this time, by using the half-width likelihood Lh and the character contact likelihood Lc, by increasing the value of Px only when the value of (Lh−Lc) is lower than a certain threshold, generation of an extra character pattern It is suppressed. Also, the generation of an extra character pattern is suppressed by increasing the value of Px only when the character blur likelihood Lb is lower than a certain threshold. Also, by increasing the value of Px only when the double-angle likelihood Ld is higher than a predetermined threshold, a correct character pattern is generated in a character line including double-byte characters, and an extra character is included in a character line including double-byte characters. Generation of character patterns is suppressed. Here, if the noise likelihood Ln is higher than a certain threshold value, generating a character pattern excluding elements whose width is smaller than the set value Vp and smaller than the set value Vp generates extra character patterns. Can be suppressed.

次に、強制切断処理１１０の詳細について説明する。 Next, details of the forced disconnection processing 110 will be described.

図１０は、本発明の実施の形態の全角強制切断処理の一例を示す説明図である。図１０に示す例では、「開」と「発」の文字が接触している文字パタンが入力され、それらが分離され、二つの文字パタンが生成されている。 FIG. 10 is an explanatory diagram illustrating an example of the full-width forced cutting process according to the embodiment of this invention. In the example shown in FIG. 10, a character pattern in which the characters “open” and “departure” are in contact is input, they are separated, and two character patterns are generated.

図１２は、本発明の実施の形態の強制切断処理１１０を示すフローチャートである。半角尤度１２２２、倍角尤度１２２３、文字接触尤度１２２４、文字かすれ尤度１２２５、ノイズ尤度１２２６及び文字パタン１２０１が入力される。文字パタン１２０１は、パタンバッファ１２０３に格納され、一つずつ倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７及び半角旁接触判定１２２０に入力される。また、文字行情報生成１２２７では、文字パタン１２０１が入力され、文字行の高さや幅などの情報が生成され、生成された文字行の情報は倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７及び半角旁接触判定１２２０に入力される。 FIG. 12 is a flowchart showing the forced disconnection processing 110 according to the embodiment of this invention. Half-width likelihood 1222, double-angle likelihood 1223, character contact likelihood 1224, character blur likelihood 1225, noise likelihood 1226, and character pattern 1201 are input. The character pattern 1201 is stored in the pattern buffer 1203 and input to the double-angle contact determination 1205, the double-angle eye contact determination 1208, the full-angle contact determination 1211, the full-angle contact determination 1214, the half-angle contact determination 1217, and the half-angle contact determination 1220 one by one. The In the character line information generation 1227, a character pattern 1201 is input, and information such as the height and width of the character line is generated. The generated character line information includes double-angle contact determination 1205, double-angle heel contact determination 1208, and full-width contact. The information is input to the determination 1211, the full-angle heel contact determination 1214, the half-angle heel contact determination 1217, and the half-angle heel contact determination 1220.

全角接触判定１２１１では、文字パタン内の左に存在する全角サイズの文字が、右の文字と接触しているか否かを判定する。全角接触判定１２１１の方法は、様々考えられる。 In full-width contact determination 1211, it is determined whether or not a full-size character existing on the left in the character pattern is in contact with the right character. Various methods of the full-width contact determination 1211 are conceivable.

例えば、以下に示す方法を採用できる。
設定値Vncを予め定めておき、設定値Vncを用い、
（式８）THn＝文字行高さ×Vnc
によってThnを求め、入力文字パタンの幅が求められたTHnより大きい場合、全角文字が接触していると判定することができる。例えば、（式８）のVncを１と設定することによって、文字行高さよりも大きいパタンは全角文字が接触している可能性があると判定することができる。 For example, the following method can be adopted.
Predetermining the set value Vnc, using the set value Vnc,
(Formula 8) THn = text line height x Vnc
If Thn is obtained by the above and the width of the input character pattern is larger than the obtained THn, it can be determined that the full-width character is in contact. For example, by setting Vnc of (Equation 8) to 1, it can be determined that there is a possibility that full-width characters are in contact with a pattern larger than the character line height.

図１１は、本発明の実施の形態の全角旁強制切断の一例を示す説明図である。全角旁接触判定１２１４では、図１１に示す例のように、文字パタン内の左に旁のみが存在する全角サイズの文字が、右の文字と接触しているか否かを判定する。全角旁接触判定１２１４の方法は様々考えられる。 FIG. 11 is an explanatory diagram illustrating an example of full-width rivet forced cutting according to the embodiment of this invention. In the full-width heel contact determination 1214, as in the example shown in FIG. 11, it is determined whether or not a full-size character having only a heel on the left in the character pattern is in contact with the right character. Various methods of the full-angle wrinkle contact determination 1214 are conceivable.

例えば、（式８）のTHnを利用し、入力文字パタンの幅がTHnより大きい場合、全角文字の旁が接触していると判定することができる。例えば、（式８）のVncを１と設定することによって、文字行高さよりも大きいパタンは全角文字の旁が接触している可能性があると判定することができる。 For example, when THn in (Equation 8) is used and the width of the input character pattern is greater than THn, it can be determined that a double-byte character heel is touching. For example, by setting Vnc of (Equation 8) to 1, it can be determined that a pattern larger than the character line height may be in contact with a double-byte character heel.

半角接触判定１２１７では、文字パタン内の左に存在する半角文字が、右の文字と接触しているか否かを判定する。半角接触判定１２１７の方法は様々考えられる。 In the half-width contact determination 1217, it is determined whether or not the half-width character existing on the left in the character pattern is in contact with the right character. Various methods of the half-angle contact determination 1217 can be considered.

例えば、（式８）のTHnを利用し、入力文字パタンの幅がTHnより大きい場合、全角文字が接触していると判定することができる。例えば、（式８）のVncを１／２と設定とすることによって、文字行高さの半分よりも大きいパタンは半角文字が接触している可能性があると判定することができる。 For example, using THn in (Equation 8) and the width of the input character pattern is greater than THn, it can be determined that full-width characters are in contact. For example, by setting Vnc in (Equation 8) to 1/2, it can be determined that there is a possibility that a half-width character is in contact with a pattern that is larger than half the character line height.

半角旁接触判定１２２０では、文字パタン内の左に旁のみが存在する半角文字が、右の文字と接触しているか否かを判定する。半角旁接触判定１２２０の方法は様々考えられる。 In the half-width wrinkle contact determination 1220, it is determined whether or not a half-width character having only a wrinkle on the left in the character pattern is in contact with the right character. Various methods of the half-angle wrinkle contact determination 1220 are conceivable.

例えば（式８）のTHnを利用し、入力文字パタンの幅がTHnより大きい場合、倍角文字の旁が接触していると判定することができる。例えば、（式８）のVncを１／２と設定することによって、文字行高さの半分よりも大きいパタンは半角文字が接触している可能性があると判定することができる。 For example, when THn in (Equation 8) is used and the width of the input character pattern is greater than THn, it can be determined that a double-width character heel is touching. For example, by setting Vnc in (Equation 8) to 1/2, it can be determined that there is a possibility that a half-width character is in contact with a pattern that is larger than half the character line height.

倍角接触判定１２０５では，文字パタン内の左に存在する倍角文字が、右の文字と接触しているか否かを判定する。倍角接触判定１２０５の方法は様々考えられる。 In double angle contact determination 1205, it is determined whether or not the double angle character existing on the left in the character pattern is in contact with the right character. Various methods of the double-angle contact determination 1205 can be considered.

例えば、（式８）のTHnを利用し、入力文字パタンの幅がTHnより大きい場合、倍角文字が接触していると判定することができる。例えば、（式８）のVncを２と設定することによって、文字行高さの２倍よりも大きいパタンは倍角文字が接触している可能性があると判定することができる。 For example, using THn in (Equation 8), if the width of the input character pattern is greater than THn, it can be determined that double-width characters are in contact. For example, by setting Vnc in (Equation 8) to 2, it can be determined that there is a possibility that a double-width character is in contact with a pattern larger than twice the character line height.

倍角旁接触判定１２０８では、文字パタン内の左にの旁のみが存在する倍角文字が、右の文字と接触しているか否かを判定する。倍角旁接触判定１２０８の方法は様々考えられる。 In double-angle heel contact determination 1208, it is determined whether or not a double-angle character that has only a left heel in the character pattern is in contact with the right character. Various methods of double-angle wrinkle contact determination 1208 are conceivable.

例えば、（式８）のTHnを利用し、入力文字パタンの幅がTHnより大きい場合、倍角文字の旁が接触していると判定することができる。例えば、（式８）のVncを２と設定とすることによって、文字行高さの２倍よりも大きいパタンは倍角文字の旁が接触している可能性があると判定することができる。 For example, when THn in (Equation 8) is used and the width of the input character pattern is larger than THn, it can be determined that a double-width character heel is touching. For example, by setting Vnc in (Equation 8) to 2, it can be determined that a pattern larger than twice the character line height may be in contact with a double-width character wrinkle.

図１３は、本発明の実施の形態の強制切断処理の概要を示す説明図である。 FIG. 13 is an explanatory diagram showing an outline of the forced disconnection process according to the embodiment of this invention.

全角位置強制切断１２１２では、全角接触判定１２１１で全角文字が接触している可能性があると判定された文字パタンを入力し、入力された文字パタンの切断位置が決定され、入力された文字パタンが決定された切断位置で切断され、新たに二つの文字パタンが生成される。切断位置の決定する方法は様々考えられる。 In full-width position forced cutting 1212, a character pattern determined to have a possibility that full-width characters are in contact in full-width contact determination 1211 is input, a cutting position of the input character pattern is determined, and the input character pattern is determined. Are cut at the determined cutting position, and two new character patterns are generated. There are various methods for determining the cutting position.

例えば、以下に示す方法を採用することができる。はじめに、倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７及び半角旁接触判定１２２０のいずれかで接触の可能性があると判定された文字パタン１３０１が入力され、y方向に黒画素を投影した黒画素ヒストグラム１３０２が生成される。次に、切断位置探索中心を決定するZxを算出し、原点からｘ軸の正方向にZxの距離を切断位置探索中心として、Zx−hからZx＋hの区間で黒画素ヒストグラム１３０２の度数を探索し、最も度数の低い位置を切断位置とする。 For example, the following method can be employed. First, a character pattern that is determined to be possibly touched by any one of double-angle contact determination 1205, double-angle eye contact determination 1208, full-angle contact determination 1211, full-angle eye contact determination 1214, half-angle contact determination 1217, and half-angle eye contact determination 1220. 1301 is input, and a black pixel histogram 1302 in which black pixels are projected in the y direction is generated. Next, Zx for determining the cutting position search center is calculated, and the frequency of the black pixel histogram 1302 is searched in the section from Zx−h to Zx + h with the distance of Zx in the positive direction of the x axis from the origin as the cutting position search center. The position with the lowest frequency is set as the cutting position.

具体的には、設定値Vpcを予め定めておき、設定値Vpcを用い、
（式９）Zx＝文字行高さ×Vpc（Vpcは設定値）
によってZxを求める。例えば、Vpcを１と設定することによって、文字パタンの左端から文字行高さと同じx位置付近で文字パタンを切断することができる。すなわち、入力された文字パタン１３０１の左端から全角文字幅の位置で切断され、全角文字幅の文字パタンが二つ生成される。ただし、切断すべき位置を含む文字パタンが必ず入力されるとは限らない。そのため、Zx−hからZx＋h区間内のすべての度数が一定値よりも高い場合は切断を行わない。 Specifically, the set value Vpc is determined in advance, and the set value Vpc is used.
(Formula 9) Zx = text line height x Vpc (Vpc is the set value)
To find Zx. For example, by setting Vpc to 1, the character pattern can be cut from the left end of the character pattern near the same x position as the character line height. That is, the input character pattern 1301 is cut from the left end at a full-width character width position to generate two full-width character width character patterns. However, the character pattern including the position to be cut is not always input. Therefore, cutting is not performed when all the frequencies in the zone from Zx−h to Zx + h are higher than a certain value.

全角旁位置強制切断１２１５では、全角旁接触判定１２１４で全角文字の旁が接触している可能性があると判定された文字パタンを入力し、入力された文字パタンの切断位置が決定され、入力された文字パタンが決定された切断位置で切断され、新たに二つの文字パタンが生成される。切断位置を決定する方法は様々考えられる。 In full-width heel position forced cutting 1215, a character pattern that is determined to have a possibility that a full-width character heel is in contact in full-width heel contact determination 1214 is input, and the cutting position of the input character pattern is determined and input. The determined character pattern is cut at the determined cutting position, and two new character patterns are generated. Various methods for determining the cutting position are conceivable.

例えば、全角位置強制切断１２１２と同じ方法を採用し、例えば、Vpcを１と調整することによって、全角文字の旁に相当する幅を持つ文字パタンを生成することができる。 For example, by adopting the same method as the full-width position forced cutting 1212 and adjusting Vpc to 1, for example, it is possible to generate a character pattern having a width corresponding to a full-width character font.

半角位置強制切断１２１８では、半角接触判定１２１７で半角文字が接触している可能性があると判定された文字パタンを入力し、入力された文字パタンの切断位置が決定され、入力された文字パタンが決定された切断位置で切断され、新たに二つの文字パタンが生成される。切断位置を決定する方法は様々考えられる。 In the half-width position forced cutting 1218, the character pattern determined as having a possibility that the half-width character is in contact in the half-width contact determination 1217 is input, the cutting position of the input character pattern is determined, and the input character pattern is determined. Are cut at the determined cutting position, and two new character patterns are generated. Various methods for determining the cutting position are conceivable.

例えば、全角位置強制切断１２１２と同じ方法を採用し、例えば、Vpcを０．５と設定することによって、半角文字に相当する幅を持つ文字パタンを生成することができる。 For example, a character pattern having a width corresponding to a half-width character can be generated by adopting the same method as the full-width position forced cutting 1212 and setting Vpc to 0.5, for example.

半角旁位置強制切断１２２１では、半角旁接触判定１２２０で半角文字の旁が接触している可能性があると判定された文字パタンを入力し、入力された文字パタンの切断位置が決定され、入力された文字パタンが決定された切断位置で切断され、新たに二つの文字パタンが生成される。切断位置を決定する方法は様々考えられる。 In the half-width wrinkle position forced cutting 1221, the character pattern that is determined to be possibly touched by the half-width wrinkle contact determination 1220 is input, the cutting position of the input character pattern is determined, and the input The determined character pattern is cut at the determined cutting position, and two new character patterns are generated. Various methods for determining the cutting position are conceivable.

例えば、全角位置強制切断１２１２と同じ方法を採用し、例えば、Vpcを０．５と設定することによって、半角文字の旁に相当する幅を持つ文字パタンを生成することができる。 For example, by adopting the same method as full-width position forced cutting 1212 and setting Vpc to 0.5, for example, it is possible to generate a character pattern having a width corresponding to a half-width character font.

倍角位置強制切断１２０６では、倍角接触判定１２０５で倍角文字が接触している可能性があると判定された文字パタンを入力し、入力された文字パタンの切断位置が決定され、入力された文字パタンが決定された切断位置で切断され、新たに二つの文字パタンが生成される。切断位置を決定する方法は様々考えられる。 In the double-angle position forced cutting 1206, the character pattern determined to be possibly touched by the double-angle contact determination 1205 is input, the cutting position of the input character pattern is determined, and the input character pattern is determined. Are cut at the determined cutting position, and two new character patterns are generated. Various methods for determining the cutting position are conceivable.

例えば、全角位置強制切断１２１２と同じ方法を採用し、例えば、Vpcを２と設定することによって、倍角文字に相当する幅を持つ文字パタンを生成することができる。 For example, by adopting the same method as full-width position forced cutting 1212 and setting Vpc to 2, for example, a character pattern having a width corresponding to a double-width character can be generated.

倍角旁位置強制切断１２０９では、倍角旁接触判定１２０８で倍角文字の旁が接触している可能性があると判定された文字パタンを入力し、入力された文字パタンの切断位置が決定され、入力された文字パタンが決定された切断位置で切断され、新たに二つの文字パタンが生成される。切断位置を決定する方法は様々考えられる。 In the double-angle heel position forced cutting 1209, the character pattern determined to have a possibility that the double-angle character heel is in contact in the double-angle heel contact determination 1208 is input, and the cutting position of the input character pattern is determined and input. The determined character pattern is cut at the determined cutting position, and two new character patterns are generated. Various methods for determining the cutting position are conceivable.

例えば、全角位置強制切断１２１２と同じ方法を採用し、例えば、Vpcを２と設定することによって、倍角文字の旁に相当する幅を持つ文字パタンを生成することができる。 For example, by adopting the same method as the full-width position forced cutting 1212 and setting Vpc to 2, for example, a character pattern having a width corresponding to a double-width character font can be generated.

全角位置強制切断１２１２、全角旁位置強制切断１２１５、半角位置強制切断１２１８、半角旁位置強制切断１２２１、倍角位置強制切断１２０６及び倍角旁位置強制切断１２０９で生成された文字パタンは、パタンバッファ１２０３に格納され、再度、倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７及び半角旁接触判定１２２０に入力される。 Character patterns generated by full-width position forced cut 1212, full-width saddle position forced cut 1215, half-width position forced cut 1218, half-angle saddle position forced cut 1221, double-angle position forced cut 1206, and double-angle saddle position forced cut 1209 are stored in pattern buffer 1203. It is stored and input again to the double-angle contact determination 1205, the double-angle eyelid contact determination 1208, the full-angle eye contact determination 1211, the full-angle eye contact determination 1214, the half-angle contact determination 1217, and the half-angle eye contact determination 1220.

これは、全角位置強制切断１２１２、全角旁位置強制切断１２１５、半角位置強制切断１２１８、半角旁位置強制切断１２２１、倍角位置強制切断１２０６及び倍角旁位置強制切断１２０９は、一つの文字パタンを入力とし、二つの文字パタンを生成するため、三つ以上の文字が連なって接触している文字パタンが入力されると、生成された二つの文字パタンのどちらかには、接触した複数の文字パタンが含まれるからである。 This is because full-width position forced cutting 1212, full-width collar position forced cutting 1215, half-angle position forced cutting 1218, half-angle collar position forced cutting 1221, double-angle position forced cutting 1206, and double-angle collar position forced cutting 1209 are input as one character pattern. In order to generate two character patterns, when a character pattern in which three or more characters are in contact with each other is input, one of the two generated character patterns has a plurality of touched character patterns. Because it is included.

倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７及び半角旁接触判定１２２０はパタンバッファ１２０３内のすべて文字パタンが処理されるまで行われる。 The double-angle contact determination 1205, the double-angle heel contact determination 1208, the full-angle contact determination 1211, the full-angle heel contact determination 1214, the half-angle contact determination 1217, and the half-angle heel contact determination 1220 are performed until all the character patterns in the pattern buffer 1203 are processed.

従来技術における強制切断処理では、入力された文字パタンが倍角文字、全角文字及び半角文字か否かは不定であるため、常にすべての入力文字パタンに対し、倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７、半角旁接触判定１２２０、全角位置強制切断１２１２、全角旁位置強制切断１２１５、半角位置強制切断１２１８、半角旁位置強制切断１２２１、倍角位置強制切断１２０６及び倍角旁位置強制切断１２０９が実行される。このために、倍角文字、全角文字及び半角文字を想定した文字パタンの切断処理がされて、余分な文字パタンが数多く生成されることによって、処理時間が多くかかっていた。 In the forced cutting process in the prior art, whether or not the input character pattern is a double-width character, a full-width character, or a half-width character is undefined. Therefore, the double-angle contact determination 1205 and the double-angle heel contact determination 1208 are always performed for all input character patterns. , Full angle contact determination 1211, full angle heel contact determination 1214, half angle contact determination 1217, half angle heel contact determination 1220, full angle position forced cutting 1212, full angle heel position forced cutting 1215, half angle position forced cutting 1218, half angle heel position forced cutting 1221, double angle A position forced cut 1206 and a double angle saddle position forced cut 1209 are executed. For this reason, the processing of cutting character patterns assuming double-width characters, full-width characters, and half-width characters is performed, and many extra character patterns are generated, which takes a lot of processing time.

また、余分な文字パタンの増加によって、認識精度が下がるといった問題もあった。例えば、倍角文字を含む文字パタンが入力された場合、全角位置強制切断１２１２、全角旁位置強制切断１２１５、半角位置強制切断１２１８及び半角旁位置強制切断１２２１において、文字パタンを全角文字幅と半角文字幅で切断することによって、余分な文字パタンが多く生成されてしまう。余分な文字パタンが増加によって、文字識別１１１、知識処理１１２、認識結果利用文字幅検定１１３の処理時間が多くなる。さらに、余分な文字パタンを誤って認識してしまうという問題も発生する。また、半角文字を含む文字が入力された場合、倍角接触判定１２０５や倍角旁接触判定１２０８の処理は余分な処理である。 In addition, there is a problem that recognition accuracy decreases due to an increase in extra character patterns. For example, when a character pattern including double-width characters is input, the full-width position forced cut 1212, full-width heel position forced cut 1215, half-width position forced cut 1218, and half-width heel position forced cut 1221 are changed to full-width character width and half-width character. By cutting by width, many extra character patterns are generated. The processing time of the character identification 111, the knowledge processing 112, and the recognition result utilization character width test 113 increases due to the increase of the extra character patterns. Furthermore, there is a problem that an extra character pattern is erroneously recognized. In addition, when a character including a half-width character is input, the processing of double-angle contact determination 1205 and double-angle contact determination 1208 is an extra process.

そこで、本発明の実施の形態では倍角接触判定１２０５、倍角旁接触判定１２０８、全角接触判定１２１１、全角旁接触判定１２１４、半角接触判定１２１７及び半角旁接触判定１２２０に、半角尤度１２２２、倍角尤度１２２３、文字接触尤度１２２４、文字かすれ尤度１２２５及びノイズ尤度１２２６を入力し、接触判定を制御する。接触判定の制御によって、強制切断処理も制御できる。このように、強制切断の処理を半角尤度１２２２、倍角尤度１２２３、文字接触尤度１２２４、文字かすれ尤度１２２５及びノイズ尤度１２２６により制御することによって、余分な文字パタンの生成が抑制されて、処理時間を抑制し、かつ、高精度の文字認識を得ることが可能となる。 Therefore, in the embodiment of the present invention, the double-angle contact determination 1205, the double-angle heel contact determination 1208, the full-angle contact determination 1211, the full-angle heel contact determination 1214, the half-angle contact determination 1217, and the half-angle heel contact determination 1220 include the half-angle likelihood 1222, the double-angle likelihood. The degree 1223, the character contact likelihood 1224, the character blur likelihood 1225, and the noise likelihood 1226 are input, and the contact determination is controlled. The forced cutting process can also be controlled by the contact determination control. In this way, by controlling the forced cutting process using the half-angle likelihood 1222, the double-angle likelihood 1223, the character contact likelihood 1224, the character blur likelihood 1225, and the noise likelihood 1226, the generation of extra character patterns is suppressed. Thus, it is possible to suppress the processing time and obtain highly accurate character recognition.

具体的には、倍角尤度１２２３が設定値Vdcよりも高い場合、半角接触判定１２１７と半角旁接触判定１２２０を実行しないようにすることによって、倍角文字を含む文字パタンが入力されても、半角文字幅で文字パタンを切断することを抑制することができる。 Specifically, when the double angle likelihood 1223 is higher than the set value Vdc, the half-width contact determination 1217 and the half-angle heel contact determination 1220 are not executed, so that even if a character pattern including double-width characters is input, Cutting the character pattern with the character width can be suppressed.

すわなち、余分な文字パタンの生成を抑制することができる。また、半角尤度１２２２が設定値Vhc1よりも高い場合、倍角接触判定１２０５、倍角旁接触判定１２０８を実行しないようにすることによって、処理時間を短縮できる。また、半角尤度１２２２が設定値Vhc2よりも低く、かつ文字かすれ尤度１２２５が設定値Vbcよりも高い場合、半角旁接触判定１２２０を実行しないようにすることによって、全角文字を含む文字パタンを半角旁幅で切断することが抑制される。また、文字接触尤度１２２４が設定値Vccよりも低い場合は、倍角接触判定１２０５、倍角旁接触判定１２０８、半角接触判定１２１７、半角旁接触判定１２２０を実行することによって、文字パタンを切断することを抑制することができ、処理時間を短縮できる。また、ノイズ尤度１２２６が設定値Vncよりも高い場合は、半角接触判定１２１７、半角旁接触判定１２２０を実行することによって、文字パタンを切断することを抑制することができ、処理時間を短縮できる。 In other words, generation of extra character patterns can be suppressed. Further, when the half-angle likelihood 1222 is higher than the set value Vhc1, the processing time can be shortened by not performing the double-angle contact determination 1205 and the double-angle heel contact determination 1208. Further, when the half-width likelihood 1222 is lower than the set value Vhc2 and the character blur likelihood 1225 is higher than the set value Vbc, by preventing the half-width wrinkle contact determination 1220 from being executed, a character pattern including full-width characters can be obtained. Cutting with a half-angle width is suppressed. When the character contact likelihood 1224 is lower than the set value Vcc, the character pattern is cut by executing the double-angle contact determination 1205, the double-angle heel contact determination 1208, the half-angle contact determination 1217, and the half-angle heel contact determination 1220. Can be suppressed, and the processing time can be shortened. When the noise likelihood 1226 is higher than the set value Vnc, the character pattern can be prevented from being cut and the processing time can be shortened by executing the half-angle contact determination 1217 and the half-angle contact determination 1220. .

図１２に戻り、強制切断処理１１０のフローチャートの説明を続ける。 Returning to FIG. 12, the description of the flowchart of the forcible disconnection processing 110 will be continued.

パタンバッファ１２０３内に格納されたすべての文字パタンはマージパタン１２２８に入力される。 All the character patterns stored in the pattern buffer 1203 are input to the merge pattern 1228.

切断された文字パタンには、文字の偏のみ又は文字の旁のみを含む文字パタンが存在する。マージパタン１２２８では、切断された偏のみを含む文字パタンと、対応する旁のみを含む文字パタンとを合わせた文字パタンが生成される。また、切断された旁のみを含む文字パタンと、対応する偏の文字パタンを合わせた文字パタンが生成される。マージパタン１２２８には、切断偏パタンマージ処理と切断旁パタンマージ処理があり、どちらもプレセグメンテーション処理１０９と同様の処理がなされる。 The cut character pattern includes a character pattern including only a character deviation or only a character heel. In the merge pattern 1228, a character pattern is generated by combining a character pattern including only the cut bias and a character pattern including only the corresponding heel. In addition, a character pattern is generated by combining a character pattern including only the cut heel and a corresponding biased character pattern. The merge pattern 1228 includes a cut-off pattern merge process and a cut-off pattern merge process, both of which are the same as the pre-segmentation process 109.

切断偏パタンマージ処理では、切断された文字パタンの左端から左右方向の右に設定値Vxの距離内にある文字パタンとあわせた文字パタンを生成する。これにより、偏のみが含まれている文字パタンと対応する旁の文字パタンを合わせた文字パタンが生成される。 In the cut partial pattern merge process, a character pattern is generated that is combined with a character pattern within a distance of a set value Vx from the left end of the cut character pattern to the right in the horizontal direction. As a result, a character pattern is generated by combining the character pattern including only the bias and the corresponding character pattern of the tile.

切断旁パタンマージ処理では、切断された文字パタンの右端から左右方向の左に設定値Vxの距離内にある文字パタンとあわせた文字パタンを生成する。これにより、切断された文字パタンに旁のみが含まれている場合、この文字パタンとこれに対応する偏の文字パタンを合わせた文字パタンが生成される。 In the cut-off pattern merge process, a character pattern is generated that is combined with a character pattern within a distance of the set value Vx from the right end of the cut character pattern to the left in the left-right direction. As a result, when the cut character pattern includes only 旁, a character pattern in which this character pattern is combined with the corresponding biased character pattern is generated.

従来技術では、倍角の文字パタンを必ず生成するように前記設定値Vxの値を設定した場合、全角文字を含む文字パタン、半角文字を含む文字パタンが入力されると、余分な文字パタンを生成してしまうという問題があった。また、半角、全角の文字パタンを生成するようにVxの値を設定した場合、倍角文字の文字パタンが生成されない場合があるという問題があった。 In the prior art, when the value of the setting value Vx is set so that a double-width character pattern is always generated, an extra character pattern is generated when a character pattern including full-width characters or a character pattern including half-width characters is input. There was a problem of doing. In addition, when the value of Vx is set so that half-width and full-width character patterns are generated, there is a problem that character patterns of double-width characters may not be generated.

本発明の実施の形態では、マージパタン１２２８に、半角尤度１２２２、倍角尤度１２２３、文字接触尤度１２２４、文字かすれ尤度１２２５及びノイズ尤度１２２６が入力され、これらの値によって、余分な文字パタンの生成を抑制し、かつ、半角文字、全角文字及び倍角文字の文字パタンを正しく生成する。例えば、Vxの値を半角文字と全角文字の文字パタンが生成される値に設定しておき、倍角尤度１２２３の値が設定値Vdmより高い場合のみ、Vxの値を、倍角文字の文字パタンを生成する値に変更して処理をする。これによって、倍角文字を含む文字パタンを正しく生成し、かつ、半角文字、全角文字の文字パタンが入力された場合にも余分な文字パタンの生成を抑制することができる。 In the embodiment of the present invention, half-width likelihood 1222, double-angle likelihood 1223, character contact likelihood 1224, character fading likelihood 1225, and noise likelihood 1226 are input to merge pattern 1228. Generation of patterns is suppressed, and character patterns of half-width characters, full-width characters, and double-width characters are correctly generated. For example, the value of Vx is set to a value that generates a character pattern of half-width characters and full-width characters, and the value of Vx is changed to the character pattern of a double-width character only when the value of double-angle likelihood 1223 is higher than the set value Vdm. Change to a value that generates. As a result, it is possible to correctly generate a character pattern including double-width characters and to suppress generation of an extra character pattern even when a character pattern of half-width characters or full-width characters is input.

このように、プレセグメンテーション処理１０９、強制切断処理１１０で生成された文字パタンは、図１４に示すように生成された文字パタンの接続関係を持つ文字切り出しネットワークとしてデータ保持される。 As described above, the character patterns generated by the pre-segmentation processing 109 and the forced cutting processing 110 are retained as a character cut-out network having a connection relationship of the generated character patterns as shown in FIG.

文字切り出しネットワークを生成する上述の一連の処理は、日本語以外の言語の文字、例えば、中国語、韓国語（ハングル文字）にも適応することができる。 The above-described series of processing for generating a character segmentation network can be applied to characters in languages other than Japanese, for example, Chinese and Korean (Hangul characters).

図１６は、本発明の実施の形態の文字切り出しネットワーク（簡体字）の一例を示す説明図である。なお、繁体字でも同様の処理が可能なことは言うまでもない。 FIG. 16 is an explanatory diagram illustrating an example of a character segmentation network (simplified characters) according to the embodiment of this invention. Needless to say, the same processing can be performed for traditional characters.

図１７は、本発明の実施の形態の文字切り出しネットワーク（ハングル文字）の一例を示す説明図である。 FIG. 17 is an explanatory diagram illustrating an example of a character segmentation network (Hangul character) according to the embodiment of this invention.

このとき、各尤度を算出する際に用いる閾値を調節することで切り出しネットワークを変更できる。これによって、精度と処理速度とを調整することが可能となる。例えば、半角、全角、倍角の文字を含む文字行において、半角尤度が大きい場合に、閾値を調整し半角での強制切断を行わないようにすると、切り出しネットワークは少なくなり処理時間は速くなる。 At this time, the cut-out network can be changed by adjusting the threshold value used when calculating each likelihood. This makes it possible to adjust the accuracy and processing speed. For example, in a character line including half-width, full-width, and double-width characters, if the half-width likelihood is large, adjusting the threshold value so as not to perform half-width forced cutting results in fewer cutout networks and faster processing time.

文字識別１１１では、プレセグメンテーション処理１０９、強制切断処理１１０で生成された文字パタンの特徴が抽出され、予め用意された識別辞書にある各文字種の特徴との類似度が算出される。そして、各文字種の特徴との類似度を文字識別結果とする。 In the character identification 111, the characteristics of the character patterns generated by the pre-segmentation process 109 and the forced cutting process 110 are extracted, and the similarity with the characteristics of each character type in the identification dictionary prepared in advance is calculated. The similarity with the characteristics of each character type is used as a character identification result.

知識処理１１２では、文字切り出しネットワーク１４０１と文字識別結果が入力され、文字行画像の文字認識結果１（図１５の１５０１）が出力される。文字認識結果１（図１５の１５０１）の出力方法は様々考えられる。 In the knowledge processing 112, the character segmentation network 1401 and the character identification result are input, and the character recognition result 1 (1501 in FIG. 15) of the character line image is output. There are various methods for outputting the character recognition result 1 (1501 in FIG. 15).

例えば、以下に示す方法を採用することができる。文字切り出しネットワーク上で、左から右への一方向の経路で、各文字パタンの識別結果で最も類似度の高い値を算出し、算出された値の平均値が最も高くなるような経路を選択する（最適パス探索）。そして、得られた経路上の文字パタンの識別結果で最も類似度の高い値を持つ字種が文字認識結果１として出力される。 For example, the following method can be employed. On the character segmentation network, in the one-way route from left to right, calculate the value with the highest similarity in the identification result of each character pattern, and select the route with the highest average value of the calculated values. (Optimum path search). Then, the character type having the highest similarity in the obtained character pattern identification result on the path is output as the character recognition result 1.

また、各文字パタンの識別結果で類似度の高い文字種の組み合わせと予め用意された単語辞書にある単語とが照合され、照合された単語に含まれる文字パタンの字種が優先して認識結果として出力されるようにパス探索を選択する方法もある。この方法は、単語照合と呼ぶことにする。 In addition, a combination of character types with high similarity in the identification result of each character pattern is collated with a word in a word dictionary prepared in advance, and the character type of the character pattern included in the collated word is given priority as a recognition result. There is also a method of selecting a path search so that it is output. This method is called word matching.

さらに、予め単語リストを用意しておき、単語リストに登録されている単語と単語リストに登録されている別の単語とが連続して文字行内に出現する確率を単語バイグラムとして保持しておく。最適パス探索時に、単語バイグラムを参照して、文字行内に、連続して出現する確率が高い単語の並びを優先して文字認識結果１として出力する方法もある。これらは非特許文献１に詳細が記載されている。このように単語照合や単語バイグラムを用いることで、文字パタンの識別結果で最も類似度が高い文字種が不正解であったとしても、その他の正しい文字種を文字認識結果として出力することが可能となる。すなわち、より正しい文字認識結果が得ることが可能となる。 Furthermore, a word list is prepared in advance, and the probability that a word registered in the word list and another word registered in the word list appear in the character line in succession is held as a word bigram. There is also a method of outputting the character recognition result 1 with priority given to a sequence of words that have a high probability of appearing continuously in a character line, with reference to a word bigram when searching for an optimal path. These are described in detail in Non-Patent Document 1. By using word collation and word bigram in this way, even if the character type with the highest similarity in the character pattern identification result is incorrect, it is possible to output other correct character types as character recognition results. . That is, a more correct character recognition result can be obtained.

しかし、このようにして得られた文字認識結果１（図１５の１５０１）は正しい確率が高いが、それでも認識誤りが含まれる場合がある。 However, the character recognition result 1 (1501 in FIG. 15) obtained in this way has a high probability of being correct, but may still include a recognition error.

これに対し、本発明の実施の形態では、認識結果利用文字幅検定１１３を行う。まず、文字認識結果１（図１５の１５０１）に含まれる文字パタンの文字種と、及び文字パタンの幅及び高さから、文字行内の文字パタンが半角又は全角又は倍角であるかを表す文字幅判定結果が算出される。 On the other hand, in the embodiment of the present invention, the recognition result utilization character width test 113 is performed. First, from the character type of the character pattern included in the character recognition result 1 (1501 in FIG. 15), and the width and height of the character pattern, the character width determination indicating whether the character pattern in the character line is half-width, full-width, or double-width. The result is calculated.

次に、文字幅判定結果に応じて、前記文字認識結果１内の文字パタンの字種と、及び文字パタンの幅と高さとを用いて、前記文字認識結果１（図１５の１５０１）内に不適切な認識結果が含まれているか否かが判定さる。 Next, according to the character width determination result, the character recognition result 1 (1501 in FIG. 15) is used by using the character type of the character pattern in the character recognition result 1 and the width and height of the character pattern. It is determined whether or not an inappropriate recognition result is included.

不適切な認識結果が含まれている場合には、文字幅判定結果と文字パタンの字種と及び文字パタンの幅と高さとを用いて、プレセグメンテーション処理と強制切断処理により生成された文字パタンから適切な文字パタンと識別結果が選択される。そして、選択された文字パタンと識別結果のみを用いて、知識処理が行われ、修正された文字認識結果が出力される。 If an inappropriate recognition result is included, the character pattern generated by the pre-segmentation process and the forced cutting process using the character width determination result, the character pattern character type, and the character pattern width and height are used. From this, an appropriate character pattern and identification result are selected. Then, knowledge processing is performed using only the selected character pattern and identification result, and a corrected character recognition result is output.

図１５は、本発明の実施の形態の認識結果利用文字幅検定を示す説明図である。 FIG. 15 is an explanatory diagram showing a recognition result utilization character width test according to the embodiment of this invention.

まず、文字認識結果１（１５０１）が入力され、認識結果利用文字幅推定処理１５０２にて、文字認識結果１に含まれる文字パタンの文字種と文字パタンの幅と高さから、文字行内の文字パタンが半角又は全角又は倍角であるかを示す文字幅判定結果が算出される。 First, the character recognition result 1 (1501) is input, and the character pattern in the character line is determined from the character type of the character pattern included in the character recognition result 1 and the width and height of the character pattern in the recognition result use character width estimation processing 1502. A character width determination result indicating whether is a half-width, full-width, or double-width is calculated.

そして、文字パタン・識別結果選定処理１５０４にて、文字パタン・文字幅判定結果に応じて、前記文字認識結果１内の文字パタンの字種と及び文字パタンの幅と高さとを用いて、前記文字認識結果１内に不適切な認識結果が含まれているか否かが判定される。 In the character pattern / identification result selection processing 1504, the character pattern of the character pattern in the character recognition result 1 and the width and height of the character pattern are used according to the character pattern / character width determination result. It is determined whether or not an inappropriate recognition result is included in the character recognition result 1.

不適切な認識結果が含まれている場合には、文字幅判定結果、文字パタンの字種と及び文字パタンの幅と高さとを用いて、前記プレセグメンテーション処理と前記強制切断処理とで生成された文字パタンから、適切な文字パタンと識別結果とが選択される。 If an inappropriate recognition result is included, it is generated by the pre-segmentation process and the forced cutting process using the character width determination result, the character type of the character pattern, and the width and height of the character pattern. From the selected character pattern, an appropriate character pattern and identification result are selected.

そして、知識処理１５０５にて、選択された文字パタンと識別結果とのみを用いて、知識処理１１２と同様の処理を行い、文字認識結果２（１５０５）が出力される。 Then, in knowledge processing 1505, using only the selected character pattern and identification result, processing similar to knowledge processing 112 is performed, and character recognition result 2 (1505) is output.

認識結果利用文字幅推定処理１５０２や文字パタン・識別結果選定処理１５０４の方法にも様々な方法が考えられる。以下でその例を述べる。 Various methods are conceivable for the recognition result use character width estimation processing 1502 and the character pattern / identification result selection processing 1504. Examples are described below.

例えば、認識結果利用文字幅推定処理１５０２では、以下の処理がなされる。 For example, in the recognition result utilization character width estimation processing 1502, the following processing is performed.

文字認識結果１（１５０１）の中から特定の文字種の文字パタンを抽出し、抽出した文字パタンの幅と高さとを用いて
（式１０）Rwh = 幅／高さ
を算出する。次に、設定値Vrwhを予め定めておき、設定値Vrwhを用いて、
（条件式５）Rwh > Vrwh
を満たす文字パタンの数Nrwhを算出する。さらに、Nrwhと文字識別結果の文字パタンの数を用いて、
（式１１）Pwh = Nrwh ／（文字識別結果の文字パタンの数）
を算出する。設定値THrwhを予め定めておき、Pwhと設定値THrwhを用いて文字幅の推定を行う。 A character pattern of a specific character type is extracted from the character recognition result 1 (1501), and the width and height of the extracted character pattern are used to calculate (Equation 10) Rwh = width / height. Next, the set value Vrwh is determined in advance, and using the set value Vrwh,
(Condition 5) Rwh> Vrwh
The number of character patterns Nrwh that satisfies the above is calculated. Furthermore, using Nrwh and the number of character patterns of the character identification result,
(Formula 11) Pwh = Nrwh / (number of character patterns of character identification result)
Is calculated. A set value THrwh is determined in advance, and the character width is estimated using Pwh and the set value THrwh.

具体的には、文字認識結果１（１５０１）の中から、例えば、文字種が漢字である文字パタン（漢字結果文字パタン）を抽出し、（式１０）を用いてRwhを算出し、設定値Vrwhを１．１と定めて、（条件式５）を満たす文字パタン（幅広漢字結果文字パタン）の数Nrwhを算出する。さらに（式１１）を用いて、Pwhを算出する。このとき、THrwhは０．６と定め、PwhとTHrwhとの関係が、
（条件式６）Pwh < THrwh
を満たす場合は、文字行内の文字は半角又は倍角の文字幅であると判定される。 Specifically, for example, a character pattern (kanji result character pattern) whose character type is Kanji is extracted from the character recognition result 1 (1501), Rwh is calculated using (Equation 10), and the set value Vrwh Is 1.1 and the number Nrwh of character patterns (wide kanji result character patterns) satisfying (conditional expression 5) is calculated. Furthermore, Pwh is calculated using (Equation 11). At this time, THrwh is set to 0.6, and the relationship between Pwh and THrwh is
(Condition 6) Pwh <THrwh
If the condition is satisfied, it is determined that the characters in the character line have a half-width or double-width character width.

文字パタン・識別結果選定処理１５０４では、文字認識結果１（１５０１）とプレセグメンテーション処理１０９で生成された文字パタンと強制切断処理１１０で生成された文字パタンとが入力され、文字行の高さHが推定され、設定値Vvを予めておき、設定値Vvをもちいて、文字認識結果１（１５０１）の中に、
（条件式７）（文字パタンの幅）＞ H × Vv
を満たす文字パタンが存在するか否かを判定する。なお、文字行の高さは文字パタンの高さの最大値などで推定する。 In the character pattern / identification result selection process 1504, the character recognition result 1 (1501), the character pattern generated in the pre-segmentation process 109, and the character pattern generated in the forced cutting process 110 are input, and the character line height H Is estimated in advance, and the set value Vv is set in advance. Using the set value Vv, the character recognition result 1 (1501) includes:
(Condition 7) (Width of character pattern)> H × Vv
It is determined whether or not there is a character pattern that satisfies. The height of the character line is estimated by the maximum value of the character pattern height.

該当する文字パタンがあった場合、プレセグメンテーション処理１０９で生成された文字パタンと強制切断処理１１０で生成された文字パタンとの中から（条件式７）の背反を満たす文字パタン（非幅広文字パタン）のみが選択される。そして、知識処理１５０５では、非幅広文字パタンのみを入力として、知識処理が行われ文字認識結果２が決定される。 When there is a corresponding character pattern, a character pattern (non-wide character pattern) that satisfies the contradiction of (conditional expression 7) from the character pattern generated by the pre-segmentation processing 109 and the character pattern generated by the forced cutting processing 110. Only) is selected. In the knowledge processing 1505, only the non-wide character pattern is input, and knowledge processing is performed to determine the character recognition result 2.

このように、単純な構成要素の大きさの分布だけでなく、文字識別と知識処理によって得られる文字認識結果の文字種に応じた大きさを考慮した文字幅推定を行って文字パタンを選択し、それらを用いて再度知識処理を行うことで、より高精度な文字認識結果が得られる。 Thus, not only the distribution of the size of simple components, but also the character width estimation considering the size according to the character type of the character recognition result obtained by character identification and knowledge processing, the character pattern is selected, By performing knowledge processing again using them, a more accurate character recognition result can be obtained.

前記知識処理は、他言語の文字、例えば、中国語（簡体字、繁体字）、韓国語（ハングル文字）においても可能である。 The knowledge processing is also possible for characters in other languages, such as Chinese (simplified and traditional) and Korean (Hangul).

本発明の実施の形態の文字認識処理の全体を示すフローチャートである。It is a flowchart which shows the whole character recognition process of embodiment of this invention. 本発明の実施の形態の文字認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the character recognition apparatus of embodiment of this invention. 本発明の実施の形態の文字行抽出処理の結果の一例を示す説明図である。It is explanatory drawing which shows an example of the result of the character line extraction process of embodiment of this invention. 本発明の実施の形態の連結成分の例を示す図である。It is a figure which shows the example of the connection component of embodiment of this invention. 本発明の実施の形態のエレメントの例を示す図である。It is a figure which shows the example of the element of embodiment of this invention. 本発明の実施の形態のプレセグメンテーション処理の一例を示す説明図である。It is explanatory drawing which shows an example of the pre-segmentation process of embodiment of this invention. 本発明の実施の形態の全角文字のプレセグメンテーション処理の一例を示す説明図である。It is explanatory drawing which shows an example of the pre-segmentation process of the full-width character of embodiment of this invention. 本発明の実施の形態の倍角文字のプレセグメンテーション処理の一例を示す説明図である。It is explanatory drawing which shows an example of the pre-segmentation process of the double-width character of embodiment of this invention. 本発明の実施の形態の半角文字のプレセグメンテーション処理の一例を示す説明図である。It is explanatory drawing which shows an example of the pre-segmentation process of the half-width character of embodiment of this invention. 本発明の実施の形態の全角強制切断処理の一例を示す説明図である。It is explanatory drawing which shows an example of the full-width forced cutting process of embodiment of this invention. 本発明の実施の形態の全角旁強制切断処理の一例を示す説明図である。It is explanatory drawing which shows an example of the full width wrinkle forced cutting process of embodiment of this invention. 本発明の実施の形態の強制切断処理を示すフローチャートである。It is a flowchart which shows the forced cutting process of embodiment of this invention. 本発明の実施の形態の強制切断処理の概要を示す説明図である。It is explanatory drawing which shows the outline | summary of the forced cutting process of embodiment of this invention. 本発明の実施の形態の文字切り出しネットワークの一例を示す説明図である。It is explanatory drawing which shows an example of the character segmentation network of embodiment of this invention. 本発明の実施の形態の認識結果利用文字幅検定を示す説明図である。It is explanatory drawing which shows the recognition result utilization character width test of embodiment of this invention. 本発明の実施の形態の文字切り出しネットワーク（簡体字）の一例を示す説明図である。It is explanatory drawing which shows an example of the character segmentation network (simplified character) of embodiment of this invention. 本発明の実施の形態の文字切り出しネットワーク（ハングル文字）の一例を示す説明図である。It is explanatory drawing which shows an example of the character segmentation network (Hangul character) of embodiment of this invention.

Explanation of symbols

１０１入力画像
１０２文字行抽出処理
１０３連結成分/エレメント生成処理
１０４倍角判別処理
１０５文字接触判別処理
１０６半角判別処理
１０７文字かすれ判別処理
１０８ノイズ判別処理
１０９プレセグメンテーション処理
１１０強制切断処理
１１１文字識別処理
１１２知識処理
１１３認識結果利用文字幅検定処理
１１４文字認識結果 DESCRIPTION OF SYMBOLS 101 Input image 102 Character line extraction process 103 Connected component / element generation process 104 Double angle discrimination process 105 Character contact discrimination process 106 Half angle discrimination process 107 Character blurring discrimination process 108 Noise discrimination process 109 Pre-segmentation process 110 Forced cutting process 111 Character identification process 112 Knowledge processing 113 Recognition result use character width verification process 114 Character recognition result

Claims

A character recognition method for recognizing characters from an image,
A connected component generation process for generating a connected component that is a series of black pixels constituting a character from an image including the character;
A character line extraction process for extracting a character line that is an area where characters are arranged;
Half-width discrimination processing for calculating a half-width likelihood indicating the degree of half-width character of a character in a character line;
A double angle discrimination process for calculating a double angle likelihood indicating the degree of doubleness of the characters in the character line;
Character blur determination processing for calculating a character blur likelihood indicating the blurring degree of characters in a character line,
A character contact determination process for calculating a character contact likelihood indicating a contact degree of a character in a character line;
A noise discrimination process for calculating a noise likelihood indicating the degree of noise mixed in the character line;
Pre-segmentation for generating a character pattern that is information combining one or a plurality of connected components using the character contact likelihood calculated by the character contact determination processing and the double-angle character likelihood calculated by the double-angle determination processing Processing,
Using the character blur likelihood calculated by the character blur determination process and the half-width character likelihood calculated by the half-width discrimination process, a character pattern for cutting a connected component is selected, and a position at which the connected component is cut is determined. Forced disconnection processing,
A character identification process for extracting the characteristics of the character pattern generated by the pre-segmentation process and the forced cutting process and calculating the similarity to the characteristics of each character type existing in the identification dictionary prepared in advance. Character recognition method.

The character recognition method generates an element in which connected components overlapping in the vertical direction are combined,
The character recognition method according to claim 1, wherein the half-width discrimination processing calculates a half-width likelihood using the number of elements, the width of the elements, and the height of the character line.

The character recognition method generates an element in which connected components overlapping in the vertical direction are combined,
2. The character recognition according to claim 1, wherein the double-angle discrimination processing calculates a double-angle likelihood using a width of an element satisfying a predetermined condition of a width of the element and a height of the character line. Method.

The character recognition method generates an element in which connected components overlapping in the vertical direction are combined,
In the character blur determination process, the character blur likelihood is calculated using the number of connected components in which the width and height of the connected component satisfy a predetermined condition, the height of the character line, and the width of the element. The character recognition method according to claim 1.

The character recognition method generates an element in which connected components overlapping in the vertical direction are combined,
In the character contact determination process, the character contact likelihood is calculated using the number of elements satisfying a predetermined width of the element, the height of the character line, and the width of the element. The character recognition method according to claim 1.

The character recognition method generates an element in which connected components overlapping in the vertical direction are combined,
The noise determination process calculates a noise likelihood using the number of connected components in which the width and height of the connected components satisfy a predetermined condition, the height of the character line, and the width of the element. The character recognition method according to claim 1, wherein:

The pre-segmentation process is performed by calculating the calculated half-angle likelihood, the calculated double-angle likelihood, the calculated character blur likelihood, the calculated character contact likelihood, and the calculated noise likelihood. The character pattern having one or a plurality of widths is generated by adjusting a width of a character pattern to be generated using one or a plurality of likelihoods among the two likelihoods. Character recognition method.

The forced cutting process is performed by calculating the calculated half angle likelihood, the calculated double angle likelihood, the calculated character fading likelihood, the calculated character contact likelihood, and the calculated noise likelihood. The character recognition method according to claim 1, wherein the width of the element to be cut and the position for forced cutting are adjusted using one or a plurality of likelihoods among the two likelihoods.

A character recognition device comprising a processor, a memory connected to the processor, and an interface connected to the processor, for recognizing characters from an input image,
A connected component generation unit that generates a connected component that is a series of black pixels that form a character from the input image;
A character line extraction unit that extracts a character line that is an area in which characters are arranged;
A half-width discriminator that calculates a half-width likelihood indicating the degree of half-width character of a character in a character line;
A double angle discriminating unit that calculates a double angle likelihood indicating the degree of doubleness likelihood of the characters in the character line;
A character blur determining unit for calculating a likelihood of blur of a character indicating a blurring degree of a character in a character line;
A character contact determination unit for calculating a character contact likelihood indicating a contact degree of a character in a character line;
A noise discriminating unit for calculating a noise likelihood indicating the degree of noise mixing in the character line;
Pre-segmentation for generating a character pattern that is information combining one or a plurality of connected components using the character contact likelihood calculated by the character contact determination unit and the double-angle character likelihood calculated by the double-angle determination unit Part,
Using the character blur likelihood calculated by the character blur determination process and the half-width character likelihood calculated by the half-width discrimination process, a character pattern for cutting a connected component is selected, and a position at which the connected component is cut is determined. Forced cutting part,
A character identification unit that extracts features of the character pattern generated by the pre-segmentation unit and the forced cutting unit and calculates a similarity with the feature of each character type existing in an identification dictionary prepared in advance. Character recognition device.

The character identification unit
A word collation unit that collates with a combination of character types with high similarity and a word list in the word dictionary prepared in advance;
A knowledge processing unit that determines a character recognition result 1 in a character line by using the similarity for each character type as a result of the character identification processing and the word matching result;
Character width determination result indicating whether the character pattern in the character line is half-width, full-width, or double-width from the character type of the character pattern, the width and height of the character pattern, and the number of character patterns included in the determined character recognition result 1 And an inappropriate recognition result is included in the character recognition result 1 using the character type of the character pattern in the character recognition result 1 and the width and height of the character pattern according to the character width determination result. If an inappropriate recognition result is included, the pre-segmentation process is performed using the character width determination result, the character type of the character pattern, and the width and height of the character pattern. A recognition result-use character width verification unit that selects an appropriate character pattern and identification result from the character pattern generated by the forced cutting process and performs knowledge processing using only the selected character pattern and identification result; With features Character recognition apparatus according to claim 9 that.

The pre-segmentation unit is configured to generate the calculated half-angle likelihood, the calculated double-angle likelihood, the calculated character blur likelihood, the calculated character contact likelihood, and the calculated noise likelihood generation. The character pattern having one or a plurality of widths is generated by adjusting a width of a character pattern to be generated using one or a plurality of likelihoods among the five likelihoods. The character recognition device described.

The forced cutting unit is configured to generate the calculated half-angle likelihood, the calculated double-angle likelihood, the calculated character blur likelihood, the calculated character contact likelihood, and the calculated noise likelihood generation. The character recognition device according to claim 9, wherein the width of the element to be cut and the position for forced cutting are adjusted using one or a plurality of likelihoods among the five likelihoods.

A character recognition device comprising a processor, a memory connected to the processor, and an interface connected to the processor, for recognizing characters from an input image,
A connected component generation unit that generates a connected component that is a series of black pixels that form a character from the input image;
A character line extraction process for extracting a character line that is an area where characters are arranged;
A pre-segmentation unit that generates a character pattern that is information obtained by combining one or a plurality of connected components;
Select a character pattern that cuts the connected component, and a forced cutting unit that determines the cutting position of the connected component;
A character identification unit that extracts the characteristics of the character pattern generated by the pre-segmentation unit and the forced cutting unit, and calculates a similarity to the characteristics of each character type existing in an identification dictionary prepared in advance;
A word collation processing unit that collates with a combination of character types having high similarity in the character identification process and a word list in the word dictionary prepared in advance;
A knowledge processing unit for determining a character recognition result 1 in a character line using a similarity for each character type and a word matching result as a result of the character identification process;
From the character type of the character pattern included in the character recognition result 1, the width and height of the character pattern, and the number of character patterns, a character width determination result representing whether the character pattern in the character line is half-width, full-width, or double-width is calculated. Whether or not an inappropriate recognition result is included in the character recognition result 1 using the character type of the character pattern in the character recognition result 1 and the width and height of the character pattern according to the character width determination result. If an inappropriate recognition result is included, the character segment determination result, the character type of the character pattern, and the width and height of the character pattern are used. A recognition result-use character width verification process for selecting an appropriate character pattern and an identification result from the generated character pattern and performing knowledge processing using only the selected character pattern and the identification result. Character recognition Apparatus.