JP2011146028A

JP2011146028A - Character recognition method and character recognition device

Info

Publication number: JP2011146028A
Application number: JP2010234922A
Authority: JP
Inventors: Bunken Jo; 文権徐; Yuan He; 源何; Shun Son; 俊孫; Hao Yu; 浩于; Satoshi Naoi; 聡直井; Yusaku Fujii; 勇作藤井; Hiroshi Tanaka; 宏田中; Yoshinobu Hotta; 悦伸堀田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2010-01-18
Filing date: 2010-10-19
Publication date: 2011-07-28
Also published as: CN102129560A; CN102129560B

Abstract

<P>PROBLEM TO BE SOLVED: To provide a character recognition method and a character recognition device, which exactly delimit a concatenation character. <P>SOLUTION: The character recognition device identifies one or more candidate division paths at each candidate division point in a character image which is a recognition target, divides the character image which is the recognition target based on each of identified candidate division paths, and generates one or more image segments. Then, the character recognition device recognizes each of one or more image segments and obtains one or more recognition results. Then, the character recognition device selects one or more best division points and best division path of each best division point of the character image from the candidate division points and the candidate division paths at candidate division points, based on the obtained recognition results. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、文字認識の分野、特に文字認識方法及び文字認識装置に関するものである。 The present invention relates to the field of character recognition, and more particularly to a character recognition method and a character recognition device.

従来、文字の切り分けが不正確であった場合、その後の文字認識に誤りを招く原因の一つとなるので、文字認識において、文字の切り分けの研究は重要である。 Conventionally, when character segmentation is inaccurate, it becomes one of the causes of errors in subsequent character recognition. Therefore, research on character segmentation is important in character recognition.

例えば、複数の文字を含む入力画像を１文字毎の画像に分割する文字認識装置が開示されている。具体的には、文字認識装置は、認識ロジックにより入力画像における連結部分を認識し、分類ロジックにより認識された連結部分に対して分類し、分類結果に応じてその連結部分が１文字の画像であるか否かを判定する。文字認識装置は、１文字の画像ではない場合、切り分けロジックにより連結部分をより小さいセグメントに再度に切り分け、あるいは合併ロジックにより隣接したセグメントを合併する。文字認識装置は、分割結果がある所定の条件を満たすまで分類、切り分け及び合併の処理を繰り返して実行する。 For example, a character recognition device that divides an input image including a plurality of characters into images for each character is disclosed. Specifically, the character recognition device recognizes a connected part in the input image by the recognition logic, classifies the connected part recognized by the classification logic, and the connected part is an image of one character according to the classification result. It is determined whether or not there is. When the character recognition device is not an image of one character, the connected portion is divided again into smaller segments by the separation logic, or the adjacent segments are merged by the merge logic. The character recognition device repeatedly executes the classification, segmentation, and merge processing until the division result satisfies a predetermined condition.

また、幾つかの文字画像の分割方法が開示されている。これらは、空白とピッチ解析法（White Space and Pitch Analysis）、投影解析法（Projection Analysis）、連結文字処理法（Connected Component Processing）、認識による文字分割方法、混合分割方法及び全体認識方法等を含む。 Also, several character image segmentation methods are disclosed. These include White Space and Pitch Analysis, Projection Analysis, Connected Component Processing, Character Division by Recognition, Mixed Division, and Whole Recognition. .

米国特許第５７８７１９４号明細書US Pat. No. 5,787,194

Richard G. Casey、Eric Lecolinet，「A Survey of Methods and Strategies in Character Segmentation」，IEEE Transactions on Pattern Analysis and Machine Intelligence，１９９６年７月，第１８巻，第７号，ｐ．６９０−７０６Richard G. Casey, Eric Lecolinet, “A Survey of Methods and Strategies in Character Segmentation”, IEEE Transactions on Pattern Analysis and Machine Intelligence, July 1996, Vol. 18, No. 7, p. 690-706

しかしながら、上述した従来の技術では、正確に連結文字を切り分けることができないという課題があった。 However, the above-described conventional technique has a problem that it is impossible to accurately separate connected characters.

例えば、手書き文字、機器で印刷されたイタリック体又は飾り書き、草書体等の連結文字の切り分けにおいて、正確な分割位置と分割パスを特定する処理は困難である。 For example, it is difficult to specify an accurate division position and division path in the separation of connected characters such as handwritten characters, italic type printed on a device, ornamental type, or cursive type.

具体的には、文字画像を切り分ける過程において、文字認識装置は、複数の文字を1文字毎に切り分けるために、各文字間の切り分け位置（以下、分割点と称する）及び切り分け位置毎における切り分けパス（以下、分割パスと称する）を特定する。例えば、文字認識装置は、分割点毎に一つの分割パスを特定することにより、文字画像中の各文字を分ける。図１３Ａは、文字認識装置が分割点毎に一つの分割パスを特定した一例を示す図である。しかしながら、多くの場合、特に連結文字（例えば手書き文字、機器で印刷されたイタリック体又は装飾文字、草書体等）を切り分ける過程では、各分割点のそれぞれに利用可能な分割パスが複数存在する可能性がある。図１３Ｂは、文字認識装置が文字画像中の複数の可能な分割点のそれぞれにおいて、一つまたは複数の可能な分割パスを特定した一例を示す図である。これらの可能な分割点（以下、候補分割点と称する）及びその可能な分割パス（以下、候補分割パスと称する）が正しいかどうかは、文字認識の正確性に直接的な影響を与える。既知の方法において、文字認識装置は、先ず、分割点毎に一つの分割パスを特定する。その後、文字認識装置は、このように切り分けられた画像が、所定の基準を満たしているかどうかを判定する。所定の基準を満たしていない場合には、文字認識装置は、当該所定の基準を満たすまで、再び分割または合併を行う。すなわち、文字認識装置は、各分割点のそれぞれとその対応する一つの分割パスを再び特定する。しかしながら、このような方法で特定された分割点と分割パスは、所定の基準を満たしているだけであり、最適な分割点と最適な分割パスではない場合が多い。 Specifically, in the process of segmenting a character image, the character recognition device uses a segmentation position between characters (hereinafter referred to as a segmentation point) and a segmentation path for each segmentation position in order to segment a plurality of characters one by one. (Hereinafter referred to as a split path). For example, the character recognition device divides each character in the character image by specifying one division path for each division point. FIG. 13A is a diagram illustrating an example in which the character recognition device specifies one division path for each division point. However, in many cases, particularly in the process of separating connected characters (for example, handwritten characters, italic or decorative characters printed on devices, cursive letters, etc.), there may be a plurality of division paths that can be used at each division point. There is sex. FIG. 13B is a diagram illustrating an example in which the character recognition device identifies one or a plurality of possible division paths at each of a plurality of possible division points in the character image. Whether these possible division points (hereinafter referred to as candidate division points) and their possible division paths (hereinafter referred to as candidate division paths) are correct directly affects the accuracy of character recognition. In a known method, the character recognition device first specifies one division path for each division point. Thereafter, the character recognition device determines whether or not the image cut in this way satisfies a predetermined criterion. If the predetermined standard is not satisfied, the character recognition device performs division or merge again until the predetermined standard is satisfied. That is, the character recognition device again specifies each of the division points and one corresponding division path. However, the division points and the division paths specified by such a method only satisfy a predetermined standard, and are not optimal division points and optimum division paths in many cases.

開示の技術は、上記に鑑みてなされたものであって、正確に連結文字を切り分けることができる文字認識方法及び文字認識装置を提供することを目的とする。 The disclosed technology has been made in view of the above, and an object thereof is to provide a character recognition method and a character recognition device capable of accurately separating connected characters.

開示技術のいくらかの側面についての基本的な理解のため、開示技術についての概要を以下に記載する。この概要は、開示技術について全てを記載するものではない。これは、開示技術のキーポイントまたは重要な部分を確定することを意図せず、また、開示技術の範囲を限定することを意図するものでもない。この目的は、簡潔な形態で開示技術の概念を与えることにある。この概要は、後述のより詳しい記載の前書きとなる。 For a basic understanding of some aspects of the disclosed technology, an overview of the disclosed technology is provided below. This summary is not an exhaustive list of disclosed technology. This is not intended to establish key points or important parts of the disclosed technology, nor is it intended to limit the scope of the disclosed technology. The purpose is to give the concept of the disclosed technology in a concise form. This summary is a preface to the more detailed description below.

開示技術の一つの方面によれば、文字認識方法が提供される。この方法は、認識対象である文字画像中の候補分割点毎における一つまたは複数の候補分割パスを特定すること（ここで、少なくとも一つの候補分割点には複数の候補分割パスが有る）、各候補分割パスに基づいて認識対象である文字画像を分割して、一つまたは複数の画像セグメントを生成すること、一つまたは複数の画像セグメントのそれぞれを認識して、一つまたは複数の認識結果を取得すること、認識結果に基づいて、候補分割点及び候補分割点における候補分割パスから文字画像の一つまたは複数の最適分割点と最適分割点毎の最適分割パスを選択することを備える。 According to one aspect of the disclosed technology, a character recognition method is provided. This method specifies one or a plurality of candidate division paths for each candidate division point in the character image to be recognized (wherein at least one candidate division point has a plurality of candidate division paths), Based on each candidate division pass, the character image to be recognized is divided to generate one or more image segments, and one or more image segments are recognized to recognize one or more Obtaining a result, and selecting one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and the candidate division paths at the candidate division points based on the recognition result. .

開示技術の一つの方面によれば、文字認識装置が提供される。その装置は、認識対象である文字画像中の候補分割点毎における一つまたは複数の候補分割パスを特定する（ここで、少なくとも一つの候補分割点には複数の候補分割パスが有る）候補パス特定部と、各候補分割パスに基づいて認識対象である文字画像を分割して、一つまたは複数の画像セグメントを生成する画像セグメント生成部と、一つまたは複数の画像セグメントのそれぞれを認識して、一つまたは複数の認識結果を取得する認識部と、認識結果に基づいて、候補分割点及び候補分割点における候補分割パスから文字画像の一つまたは複数の最適分割点と最適分割点毎の最適分割パスを選択するパス探索部とを備える。 According to one aspect of the disclosed technology, a character recognition device is provided. The apparatus identifies one or a plurality of candidate division paths for each candidate division point in the character image to be recognized (here, at least one candidate division point has a plurality of candidate division paths). Recognizing each of one or more image segments, a specific unit, an image segment generation unit that divides a character image to be recognized based on each candidate division path, and generates one or more image segments A recognition unit that acquires one or a plurality of recognition results, and one or more optimum division points and optimum division points of the character image from the candidate division points and the candidate division paths at the candidate division points based on the recognition results. And a path search unit for selecting the optimal divided path.

また、開示技術の実施例は、文字認識方法を実現するためのコンピュータプログラムを更に提供する。 The embodiment of the disclosed technique further provides a computer program for realizing the character recognition method.

なお、開示技術の実施例は、少なくともコンピュータによって読取可能な媒体形式で、文字認識方法を実現するためのコンピュータプログラムコードが記録されたコンピュータプログラム製品をさらに提供する。 Note that the embodiments of the disclosed technology further provide a computer program product in which computer program codes for realizing a character recognition method are recorded in at least a computer-readable medium format.

本願の開示する文字認識方法及び文字認識装置の一つの態様によれば、正確に連結文字を切り分けることができるという効果を奏する。 According to one aspect of the character recognition method and the character recognition device disclosed in the present application, it is possible to accurately separate connected characters.

図１は、実施例１に係る文字認識方法を模式的に示すフローチャートである。FIG. 1 is a flowchart schematically illustrating a character recognition method according to the first embodiment. 図２は、実施例１に係る文字認識方法の別例を模式的に示すフローチャートである。FIG. 2 is a flowchart schematically illustrating another example of the character recognition method according to the first embodiment. 図３は、実施例２に係る文字認識方法を模式的に示すフローチャートである。FIG. 3 is a flowchart schematically illustrating the character recognition method according to the second embodiment. 図４Ａは、候補分割点を第一標識によって標記した一例を示す図である。FIG. 4A is a diagram illustrating an example in which candidate division points are marked with a first marker. 図４Ｂは、候補分割パスを第二標識によって標記した一例を示す図である。FIG. 4B is a diagram illustrating an example in which candidate division paths are marked with a second indicator. 図５は、動的計画法を適用して最適分割パスの探索を行う場合の一例を模式的に示すフローチャートである。FIG. 5 is a flowchart schematically showing an example in which the optimum division path is searched by applying dynamic programming. 図６は、実施例３に係る文字認識方法を模式的に示すフローチャートである。FIG. 6 is a flowchart schematically illustrating the character recognition method according to the third embodiment. 図７Ａは、再度の分割の一例を示す図である。FIG. 7A is a diagram illustrating an example of another division. 図７Ｂは、合併の一例を示す図である。FIG. 7B is a diagram illustrating an example of merger. 図８は、実施例４に係る文字認識方法を模式的に示すフローチャートである。FIG. 8 is a flowchart schematically illustrating the character recognition method according to the fourth embodiment. 図９は、実施例５に係る文字認識装置を模式的に示すブロック図である。FIG. 9 is a block diagram schematically illustrating a character recognition apparatus according to the fifth embodiment. 図１０は、実施例６に係る文字認識装置を模式的に示すブロック図である。FIG. 10 is a block diagram schematically illustrating a character recognition apparatus according to the sixth embodiment. 図１１は、実施例７に係る文字認識装置を模式的に示すブロック図である。FIG. 11 is a block diagram schematically illustrating a character recognition device according to the seventh embodiment. 図１２は、開示技術による実施例の実施に適用可能なコンピュータの例を模式的に示すブロック図である。FIG. 12 is a block diagram schematically illustrating an example of a computer that can be applied to an embodiment of the disclosed technology. 図１３Ａは、文字認識装置が分割点毎に一つの分割パスを特定した一例を示す図である。FIG. 13A is a diagram illustrating an example in which the character recognition device specifies one division path for each division point. 図１３Ｂは、文字認識装置が文字画像中の複数の可能な分割点のそれぞれにおいて、一つまたは複数の可能な分割パスを特定した一例を示す図である。FIG. 13B is a diagram illustrating an example in which the character recognition device identifies one or a plurality of possible division paths at each of a plurality of possible division points in the character image.

以下に開示技術の実施例について図面を参照しながら説明する。開示技術の一つの図面あるいは一種の実施形態に記述された要素及び特徴は、その他の一つ又は複数の図面あるいは実施形態に示された要素及び特徴と組み合わせることができる。図面と説明には、開示技術を明確にするため、開示技術とは関係がなく当業者にとって既知の構成要素や処理についての表示や記載を省略している。 Hereinafter, embodiments of the disclosed technology will be described with reference to the drawings. Elements and features described in one drawing or type of embodiment of the disclosed technology may be combined with elements and features shown in one or more other drawings or embodiments. In the drawings and the description, in order to clarify the disclosed technique, display and description of components and processes that are not related to the disclosed technique and are known to those skilled in the art are omitted.

以下の開示技術の実施例について、図面を組合せた説明を参照すると、開示技術の目的、特徴と効果がより理解し易くなる。図面における構成要素は開示技術の原理を示すに過ぎない。また、図面中において、同一又は類似する技術的特徴や部品には、同一または類似する符号を付している。 Referring to the description of the embodiments of the disclosed technology in combination with the drawings, the objects, features, and effects of the disclosed technology can be understood more easily. The components in the drawings merely illustrate the principles of the disclosed technology. In the drawings, the same or similar technical features and parts are denoted by the same or similar reference numerals.

図１は、実施例１に係る文字認識方法を模式的に示すフローチャートである。図１に示すように、文字認識装置は、ステップＳ２０３〜２０９の処理を実行することで、候補分割点毎の一つまたは複数の候補分割パスから最適分割パスを探索する。 FIG. 1 is a flowchart schematically illustrating a character recognition method according to the first embodiment. As illustrated in FIG. 1, the character recognition device searches for an optimal division path from one or a plurality of candidate division paths for each candidate division point by executing the processes of steps S203 to S209.

ステップＳ２０３において、文字認識装置は、認識対象である文字画像中の候補分割点毎における一つまたは複数の候補分割パスを特定する。ここで、認識対象である文字画像中の複数の候補分割点のうち、少なくとも一つの候補分割点には複数の候補分割パスが存在すると仮定する。 In step S203, the character recognition device identifies one or a plurality of candidate division paths for each candidate division point in the character image to be recognized. Here, it is assumed that among a plurality of candidate division points in the character image to be recognized, a plurality of candidate division paths exist at at least one candidate division point.

ここで、認識対象である文字画像は、任意の適切なフォーマットであってもよい。ここでは全てを列挙しないが、認識対象である文字画像は、例えば、二値画像、JPG画像、ビットマップ画像等であってもよい。また、文字画像には、例えば、数字、符号、各種発音の字母または文字等、任意の種類の文字を含むことができる。ここでは、文字画像についてはいかなる限定もしない。 Here, the character image to be recognized may be in any appropriate format. Although not all are enumerated here, the character image to be recognized may be, for example, a binary image, a JPG image, a bitmap image, or the like. In addition, the character image can include any type of characters such as numerals, symbols, and letters or characters of various pronunciations. Here, no limitation is imposed on the character image.

また、文字認識装置は、任意の適切な方法を利用して候補分割点と候補分割パスを特定してもよい。例えば、文字認識装置は、空白とピッチ解析法（White Space and Pitch Analysis）、投影解析法（Projection Analysis）、常用の連結文字処理法（Connected Component Processing）及びアセンダー・ディセンダー解析法（Ascender-Descender Analysis）等を利用することができる。なお、ここでは、これらの既存の方法については、詳細な説明を省略する。 In addition, the character recognition device may specify candidate division points and candidate division paths using any appropriate method. For example, character recognition devices include white space and pitch analysis, projection analysis, connected component processing, and ascender-descender analysis. ) Etc. can be used. Here, detailed description of these existing methods is omitted.

文字認識装置は、最適分割パスを特定するために、候補分割点毎における全ての可能な分割パスを候補分割パスとして検出する。 The character recognition device detects all possible division paths for each candidate division point as candidate division paths in order to identify the optimum division path.

ステップＳ２０５において、文字認識装置は、各候補分割点における候補分割パスが特定されると、それらの候補分割パスに基づいて認識対象である文字画像を分割し、一つまたは複数の画像セグメントを生成する。 In step S205, when the candidate division path is specified at each candidate division point, the character recognition device divides the character image that is the recognition target based on the candidate division paths, and generates one or a plurality of image segments. To do.

ステップＳ２０７において、文字認識装置は、生成した一つまたは複数の画像セグメントのそれぞれを認識し、一つまたは複数の認識結果を取得する。 In step S207, the character recognition device recognizes each of the generated one or more image segments and acquires one or more recognition results.

ステップＳ２０７においては、文字認識装置は、任意の適切な方法を利用して画像セグメントを認識してもよい。図２は、実施例１に係る文字認識方法の別例を模式的に示すフローチャートである。図２に示したステップＳ７０３は、図１に示したステップＳ２０３と同様である。また、ステップＳ７０５は、ステップＳ２０５と同様である。また、ステップＳ７０９は、ステップＳ２０９と同様である。したがって、ここではこれらの説明を省略する。ステップＳ７０７において、文字認識装置は、画像セグメントを認識する。文字認識装置は、このステップでサブステップＳ７０７−１とサブステップＳ７０７−２を実行する。サブステップＳ７０７−１においては、文字認識装置は、生成された一つまたは複数の画像セグメントを分類する。文字認識装置は、画像中の文字の種類に応じて、任意の適切なパターン認識方法を利用して生成された複数の画像セグメントを分類する。例えば、文字認識装置は、ニューラルネットワーク、ベイズ決定、サポートベクトルマシン、構造的パターン認識方法等、及びこれらの方法の組み合わせを利用する。なお、これら方法についての詳細をここでは詳しく述べない。サブステップＳ７０７−２において、文字認識装置は、各画像セグメントの種類に基づいて、画像セグメント毎の当該種類の既知文字画像との間のマッチング距離を当該画像セグメントの認識結果として算出する。例えば、文字認識装置は、分類が行われた後、ある画像セグメントが字母“a”に属するものに分類されると、当該画像セグメントと字母“a”の既知画像との間のマッチング距離を算出する。図２のステップＳ７０７は、画像セグメントを認識する一つの例だけを示しているが、開示技術はこの例に限定されない。また、文字認識装置は、任意の適切な認識技術、例えば、ニューラルネットワーク、ベイズ決定、サポートベクトルマシン、構造的パターン認識方法等、及びこれらの方法の組み合わせを利用しても良い。なお、これらについては、詳細な説明を省略する。 In step S207, the character recognition device may recognize the image segment using any appropriate method. FIG. 2 is a flowchart schematically illustrating another example of the character recognition method according to the first embodiment. Step S703 shown in FIG. 2 is the same as step S203 shown in FIG. Step S705 is the same as step S205. Step S709 is the same as step S209. Therefore, these descriptions are omitted here. In step S707, the character recognition device recognizes an image segment. The character recognition device executes sub-step S707-1 and sub-step S707-2 in this step. In substep S707-1, the character recognition device classifies the generated one or more image segments. The character recognition device classifies a plurality of image segments generated using any appropriate pattern recognition method according to the type of character in the image. For example, the character recognition device uses a neural network, Bayesian decision, support vector machine, structural pattern recognition method, etc., and a combination of these methods. Details of these methods will not be described in detail here. In sub-step S707-2, the character recognition device calculates a matching distance between each type of image segment and the type of known character image as a recognition result of the image segment. For example, if a certain image segment is classified as belonging to the letter “a” after classification, the character recognition device calculates a matching distance between the image segment and the known image of the letter “a”. To do. Step S707 in FIG. 2 shows only one example of recognizing an image segment, but the disclosed technique is not limited to this example. In addition, the character recognition apparatus may use any appropriate recognition technology such as neural network, Bayesian determination, support vector machine, structural pattern recognition method, and a combination of these methods. Detailed description of these will be omitted.

ある文字種類に対して、対応する複数の既知画像が存在している例を用いて説明する。文字認識装置は、ある画像セグメントを当該種類に分類した場合、当該画像セグメントと複数の既知画像のそれぞれとの間のマッチング距離を算出する。その後、文字認識装置は、この中から最小のマッチング距離を当該画像セグメントに対応する認識結果として選択し、更にこのマッチング距離に対して平均値またはその合計値等を算出し、算出した値を当該文字画像に対応する認識結果とする。 A description will be given using an example in which a plurality of corresponding known images exist for a certain character type. When a certain image segment is classified into the type, the character recognition device calculates a matching distance between the image segment and each of the plurality of known images. After that, the character recognition device selects a minimum matching distance as a recognition result corresponding to the image segment, calculates an average value or a total value thereof for the matching distance, and calculates the calculated value The recognition result corresponding to the character image is used.

最後に、ステップＳ２０９においては、文字認識装置は、認識結果に基づいて、候補分割点及び候補分割点における候補分割パスから文字画像の一つまたは複数の最適分割点と最適分割点毎の最適分割パスを選択する。 Finally, in step S209, based on the recognition result, the character recognition device determines one or a plurality of optimum division points of the character image from the candidate division points and the candidate division paths at the candidate division points and the optimum division point for each optimum division point. Select a path.

例えば、認識対象である文字画像中におけるある文字の候補分割点において複数の候補分割パスが存在する可能性があるので、ステップＳ２０５において、文字認識装置は、当該文字に対応する複数の画像セグメントを生成する。これに伴い、ステップＳ２０７において、文字認識装置は、これらの画像セグメントに対応する複数の認識結果を取得する。例えば、図２の例では、文字認識装置は、ある文字に対応する複数の画像セグメントについて、複数のマッチング距離を算出する。ここで、算出されたマッチング距離が小さいほど、認識結果が良いことを意味する。したがって、文字認識装置は、最小のマッチング距離に対応する画像セグメントを選択する。すなわち、文字認識装置は、当該画像セグメントを分割して得ることができる候補分割点と候補分割パスから最適分割点及び最適分割点における最適分割パスとを選択することができる。 For example, since there may be a plurality of candidate division paths at a candidate division point of a character in the character image to be recognized, in step S205, the character recognition device selects a plurality of image segments corresponding to the character. Generate. Accordingly, in step S207, the character recognition device acquires a plurality of recognition results corresponding to these image segments. For example, in the example of FIG. 2, the character recognition device calculates a plurality of matching distances for a plurality of image segments corresponding to a certain character. Here, the smaller the calculated matching distance, the better the recognition result. Therefore, the character recognition device selects an image segment corresponding to the minimum matching distance. In other words, the character recognition device can select an optimal division point and an optimal division path at the optimal division point from candidate division points that can be obtained by dividing the image segment and candidate division paths.

上述した説明の方法において、文字認識装置は、一つの判定基準を予め定めなくてもよい。すなわち、文字認識装置は、一つの分割点において一つまたは複数の候補分割パスを特定することができる。したがって、文字認識装置は、一つまたは複数の候補分割パスから最適分割パスを選択することができるので、文字認識の正確性を大幅に向上することができる。 In the above-described method, the character recognition device does not have to determine one determination criterion in advance. That is, the character recognition device can specify one or a plurality of candidate division paths at one division point. Therefore, since the character recognition apparatus can select the optimum division path from one or a plurality of candidate division paths, the accuracy of character recognition can be greatly improved.

複数の候補分割点と一つまたは複数の候補分割パスから文字画像の最適分割点と最適分割パスを選択する方法はこの例に限定されない。実施例２において、文字認識装置は、候補分割点毎における各候補分割パスのそれぞれを標記すると共に、動的計画アルゴリズムを利用して、複数の候補分割点と一つまたは複数の候補分割パスから最適分割点及び最適分割点における最適分割パスを選択する。 The method for selecting the optimum division point and the optimum division path of the character image from the plurality of candidate division points and one or more candidate division paths is not limited to this example. In the second embodiment, the character recognition device marks each candidate division path for each candidate division point, and uses a dynamic programming algorithm from a plurality of candidate division points and one or a plurality of candidate division paths. The optimum division point and the optimum division path at the optimum division point are selected.

図３は、実施例２に係る文字認識方法を模式的に示すフローチャートである。図３に示すように、この方法において文字認識装置は、ステップＳ３０３、３０４、３０５、３０７及び３０９の処理を実行する。 FIG. 3 is a flowchart schematically illustrating the character recognition method according to the second embodiment. As shown in FIG. 3, in this method, the character recognition apparatus executes the processes of steps S303, 304, 305, 307, and 309.

ステップＳ３０３は、図１に示した実施例１におけるステップＳ２０３と同様であるので、ここでは説明を省略する。 Since step S303 is the same as step S203 in the first embodiment shown in FIG. 1, the description thereof is omitted here.

ステップＳ３０４において、文字認識装置は、候補分割点毎における各候補分割パスに対して二次元標識で標記する。このように、文字認識装置は、これらの候補分割パスの二次元標識により複数の画像セグメントを標記する。二次元標識は、各候補分割点を標記するための第一標識及び各候補分割パスを標記するための第二標識を含む。具体的には、文字認識装置が、認識対象である文字画像においてＩ（Ｉ＞１）個の候補分割点が含まれ、且つ第ｉ（１≦ｉ≦Ｉ）番目の候補分割点においてＪ（Ｊ≧１）個の候補分割パスを特定した場合を例に説明する。文字認識装置は、第ｉ番目の候補分割点の第ｊ（１≦ｊ≦Ｊ）番目の候補分割パスを（ｉ，ｊ）または(ｊ，ｉ)と標記する。このように、第ｉ番目の候補分割点の第ｊ番目の候補分割パス及び第ｍ(１≦ｍ≦Ｉ)番目の候補分割点における第ｎ番目の候補分割パスにより切り分けて得られた画像セグメントは｛（ｉ,ｊ）、（ｍ,ｎ）｝または｛（ｊ,ｉ）、（ｎ,ｍ）｝と標記する。なお、第ｍ番目の分割点上にＮ個の候補分割パスが存在し、Ｎ≧１、１≦ｎ≦Ｎであると仮定する。 In step S304, the character recognition device marks each candidate division path with a two-dimensional label for each candidate division point. As described above, the character recognition apparatus marks a plurality of image segments by the two-dimensional labeling of these candidate division paths. The two-dimensional mark includes a first mark for marking each candidate dividing point and a second mark for marking each candidate dividing path. Specifically, the character recognition device includes I (I> 1) candidate division points in the character image to be recognized, and J (1 ≦ i ≦ I) -th candidate division point J ( A case where J ≧ 1) candidate division paths are specified will be described as an example. The character recognition device marks the j-th (1 ≦ j ≦ J) -th candidate division path of the i-th candidate division point as (i, j) or (j, i). Thus, the image segment obtained by carving out by the j-th candidate division pass of the i-th candidate division point and the n-th candidate division pass at the m (1 ≦ m ≦ I) -th candidate division point. Is represented as {(i, j), (m, n)} or {(j, i), (n, m)}. It is assumed that N candidate division paths exist on the m-th division point, and N ≧ 1, 1 ≦ n ≦ N.

図４は、二次元的な標記を行う例を示している。図４Ａに示した画像は、認識対象である文字画像として手書き単語“ｎｅｔ”を含んでおり、その中に特定された候補分割点は“位置ｉ”、“位置ｉ＋１”と“位置ｉ＋２”を含む。ここで、ｉ≧０である。位置ｉにおける候補分割パスは（ｉ,１）と標記することができ、位置ｉ＋１における候補分割パスは（ｉ＋１,１）と標記することができる。そして、位置ｉ＋２における各候補分割パスのそれぞれは、（ｉ＋２,１）、（ｉ＋２,２）及び（ｉ＋２,３）と標記することができる。このように、図４において、位置ｉにおける候補分割パス（ｉ,１）と位置ｉ＋１における候補分割パス（ｉ＋１,１）を利用して切り分けて得られた画像セグメントは｛（ｉ,１）、（ｉ＋１,１）｝と標記することができる。 FIG. 4 shows an example of performing two-dimensional marking. The image shown in FIG. 4A includes a handwritten word “net” as a character image to be recognized, and candidate division points specified therein include “position i”, “position i + 1”, and “position i + 2”. Including. Here, i ≧ 0. The candidate split path at position i can be labeled (i, 1), and the candidate split path at position i + 1 can be labeled (i + 1,1). Then, each of the candidate division paths at the position i + 2 can be labeled as (i + 2, 1), (i + 2, 2), and (i + 2, 3). Thus, in FIG. 4, the image segment obtained by carving out using the candidate division path (i, 1) at the position i and the candidate division path (i + 1, 1) at the position i + 1 is {(i, 1), (I + 1,1)}.

ステップＳ３０５は、図１に示した、実施例１におけるステップＳ２０５と同様であるので、ここでは説明を省略する。また、ステップＳ３０７は、図１に示した、実施例１におけるステップＳ２０７と同様であるので、ここでは説明を省略する。 Since step S305 is the same as step S205 in the first embodiment shown in FIG. 1, the description thereof is omitted here. Further, step S307 is the same as step S207 in the first embodiment shown in FIG.

ステップＳ３０９においては、文字認識装置は、ステップＳ３０７において取得した認識結果に基づいて、複数の候補分割点及び候補分割点における候補分割パスから最適分割点及び最適分割パスを選択する。例えば、文字認識装置は、二次元標識に基づいて、動的計画法（dynamic programming method）を利用して、候補分割点及び候補分割点における候補分割パスから最適分割点及び最適分割点における最適分割パスを探索する。 In step S309, the character recognition device selects an optimal division point and an optimal division path from a plurality of candidate division points and candidate division paths at the candidate division points based on the recognition result acquired in step S307. For example, the character recognition device uses the dynamic programming method based on the two-dimensional label, and the optimal division point and the optimal division point at the optimal division point from the candidate division point and the candidate division path at the candidate division point. Search for a path.

図５は、動的計画法を利用して最適分割パスの探索を行う場合の一例を模式的に示すフローチャートである。 FIG. 5 is a flowchart schematically showing an example in the case of searching for an optimal division path using dynamic programming.

図５に示すように、複数の画像セグメントに対して二次元的な標記が行われた後、動的計画法を利用して、複数の候補分割点及び各候補分割点における候補分割パスから一つまたは複数の最適分割点及び最適分割点毎における最適分割パスを選択する。この方法において、文字認識装置は、ステップＳ１２０９−１、１２０９−２及び１２０９−３の処理を実行する。 As shown in FIG. 5, after two-dimensional marking is performed on a plurality of image segments, a dynamic programming method is used to identify a plurality of candidate division points and candidate division paths at each candidate division point. One or a plurality of optimum dividing points and an optimum dividing path for each optimum dividing point are selected. In this method, the character recognition device executes the processes of steps S1209-1, 1209-2, and 1209-3.

まず、ステップＳ１２０９−１において、文字認識装置は、複数の画像セグメントに対して分類する。具体的に、文字認識装置は、画像セグメントに対応する候補分割点と候補分割パスの二次元標識に基づいて、複数の画像セグメントをその左側（又は右側）の分割点毎に分類する。ここで、左側（又は右側）の候補分割点が同じ画像セグメントを候補分割パス毎に分類する。 First, in step S1209-1, the character recognition device classifies a plurality of image segments. Specifically, the character recognition device classifies the plurality of image segments for each of the left (or right) division points based on the candidate division points corresponding to the image segments and the two-dimensional markers of the candidate division paths. Here, image segments having the same candidate division point on the left side (or right side) are classified for each candidate division path.

次に、ステップＳ１２０９−２において、文字認識装置は、各画像セグメント間の接続関係を定義する。具体的には、文字認識装置は、以下の手順にしたがって、各画像セグメント間の接続関係を定義する、すなわち、文字認識装置は、一方の画像セグメントの右側の候補分割点と候補分割パスが、他方の画像セグメントの左側の候補分割点と候補分割パスと同じである場合に限り、その二つの画像セグメントが接続していると定義する。文字認識装置は、二次元標識を利用して、二つの画像セグメントの対応する候補分割点と候補分割パスは同じであるか否かを判定する。 Next, in step S1209-2, the character recognition device defines a connection relationship between the image segments. Specifically, the character recognition device defines the connection relationship between each image segment according to the following procedure, that is, the character recognition device has a candidate division point and a candidate division path on the right side of one image segment, Only when the candidate division point on the left side of the other image segment is the same as the candidate division path, the two image segments are defined as connected. The character recognition device determines whether or not the candidate division points corresponding to the two image segments and the candidate division paths are the same using the two-dimensional label.

その次に、ステップＳ１２０９−３において、文字認識装置は、パスの探索を行う。具体的には、文字認識装置は、定義された画像セグメント間の接続関係に基づいて、動的計画法を利用して最適な認識結果に対応する候補分割パスを画像の最適分割パスとして検出し、これらに対応する候補分割点を最適分割点とする。一例として、ある文字の種類に対して、対応する複数の既知画像が存在していてもよい。文字認識装置は、画像セグメント毎の複数の既知画像のそれぞれとの間のマッチング距離を算出して、複数のマッチング距離の平均値またはその合計を当該画像セグメントの認識結果とする。例えば、文字認識装置は、動的計画法を利用してマッチング距離が最も短い分割パスを検出する。ここで、文字認識装置は、マッチング距離が最も短い分割パスを構成する画像セグメントを認識結果が最も良いものと判定する。 Next, in step S1209-3, the character recognition device searches for a path. Specifically, the character recognition device detects the candidate division path corresponding to the optimum recognition result as the optimum division path of the image using dynamic programming based on the connection relation between the defined image segments. The candidate division points corresponding to these are set as the optimum division points. As an example, a plurality of corresponding known images may exist for a certain character type. The character recognition device calculates a matching distance between each of the plurality of known images for each image segment, and uses an average value or a total of the plurality of matching distances as a recognition result of the image segment. For example, the character recognition device detects a divided path with the shortest matching distance using dynamic programming. Here, the character recognition device determines that the image segment constituting the divided path with the shortest matching distance has the best recognition result.

上述の方法においては、文字認識装置は、動的計画法を利用して文字画像の最適分割パスの探索を行うので、マッチング距離の算出を効率化することができる。 In the above-described method, the character recognition device searches for the optimum division path of the character image using dynamic programming, so that the matching distance can be calculated efficiently.

実施例３において、文字認識装置は、認識対象である文字画像の各候補分割点と各候補分割パスを特定した後、得られた画像セグメントに対して再び合併及び／又は分割を行う。図６は、実施例３に係る文字認識方法を模式的に示すフローチャートである。図６に示すように、この方法において、文字認識装置は、ステップＳ６０３、６０５、６０６、６０７及び６０９の処理を実行する。 In Example 3, the character recognition apparatus identifies each candidate division point and each candidate division path of the character image to be recognized, and then merges and / or divides the obtained image segment again. FIG. 6 is a flowchart schematically illustrating the character recognition method according to the third embodiment. As shown in FIG. 6, in this method, the character recognition apparatus executes the processes of steps S603, 605, 606, 607, and 609.

ステップＳ６０３は、実施例１におけるステップＳ２０３と同様である。また、ステップＳ６０５は、実施例１におけるステップＳ２０５と同様である。したがって、ここでは、ステップＳ６０３及びステップＳ６０５についての説明を省略する。 Step S603 is the same as step S203 in the first embodiment. Step S605 is the same as step S205 in the first embodiment. Therefore, description of step S603 and step S605 is omitted here.

ステップＳ６０６においては、文字認識装置は、選択的に、ステップＳ６０５において得られた複数の画像セグメントに対して再度の分割及び／又は合併を行う。具体的には、文字認識装置は、隣接した画像セグメントを合併してもよく、及び／又は、複数の画像セグメントの中の一つまたは複数個に対して再度の分割をしてもよい。これによって、文字認識装置は、新たな画像セグメントを生成する。 In step S606, the character recognition apparatus selectively performs re-division and / or merge on the plurality of image segments obtained in step S605. Specifically, the character recognition device may merge adjacent image segments and / or may divide one or more of the plurality of image segments again. Thereby, the character recognition device generates a new image segment.

図７は再度の分割及び／または合併を行う例を示している。ここで、図７Ａは、再度の分割の例を示しており、図７Ｂは、合併を行う例を示している。図７Ａには、ステップＳ６０５を経た後、二つの画像セグメント、すなわち、図７Ａにおける「画像セグメント１」と「画像セグメント２」とが得られた場合を示している。文字認識装置は、この二つの画像セグメントに対して再度の分割を実行することができる。この例では、文字認識装置は、画像セグメント２に対して再度の分割を実行して画像セグメント３と画像セグメント４とを取得する。図７Ｂでは、文字認識装置は、ステップＳ６０５を経た後、四つの画像セグメント、すなわち図７Ｂにおける「画像セグメント１′-４′」を取得する。文字認識装置は、これらの画像セグメント中の隣接したセグメントに対して合併することができる。例えば、文字認識装置は、画像セグメント１′と画像セグメント２′を合併して画像セグメント５′を取得する。別例として、ステップＳ６０６においては、文字認識装置は、隣接した画像セグメントに対して合併を行うと同時に、一つまたは複数の画像セグメントに対して再度の分割を実行してもよい。 FIG. 7 shows an example of performing another division and / or merge. Here, FIG. 7A shows an example of re-division, and FIG. 7B shows an example of merging. FIG. 7A shows a case where two image segments, that is, “image segment 1” and “image segment 2” in FIG. 7A are obtained after step S605. The character recognition device can perform the division again on the two image segments. In this example, the character recognition device performs division again on the image segment 2 to obtain the image segment 3 and the image segment 4. In FIG. 7B, the character recognition apparatus obtains four image segments, that is, “image segment 1′-4 ′” in FIG. 7B after step S605. The character recognizer can merge with adjacent segments in these image segments. For example, the character recognition device acquires the image segment 5 ′ by merging the image segment 1 ′ and the image segment 2 ′. As another example, in step S606, the character recognition device may perform re-division on one or a plurality of image segments at the same time as merging the adjacent image segments.

ステップＳ６０７においては、文字認識装置は、ステップＳ６０５とステップＳ６０６において得られた全ての画像セグメントに対して認識を行う。例えば、図７Ａの例においては、文字認識装置は、得られた４つの画像セグメント１−４を認識して対応する認識結果を取得する。また、図７Ｂの例においては、文字認識装置は、得られた５つの画像セグメント（画像セグメント１′-５′）を認識して対応する認識結果を取得する。なお、この認識結果を取得するステップＳ６０７は、実施例１におけるステップＳ２０７と同様であるので、ここで説明を省略する。 In step S607, the character recognition device recognizes all the image segments obtained in steps S605 and S606. For example, in the example of FIG. 7A, the character recognition device recognizes the obtained four image segments 1-4 and acquires corresponding recognition results. In the example of FIG. 7B, the character recognition device recognizes the obtained five image segments (image segments 1′-5 ′) and acquires corresponding recognition results. Note that step S607 for obtaining the recognition result is the same as step S207 in the first embodiment, and a description thereof will be omitted here.

ステップＳ６０９は、実施例１におけるステップＳ２０９と同様であるので、ここでは、ステップＳ６０９についての説明を省略する。 Since step S609 is the same as step S209 in the first embodiment, description of step S609 is omitted here.

上述の方法においては、文字認識装置は、画像セグメントに対して再度の分割及び／又は合併を行うことにより、画像セグメントの誤分割を防止し、正確に最適分割パスを探索することができる。 In the above-described method, the character recognition device can divide the image segment again and / or merge to prevent erroneous division of the image segment and accurately search for the optimum division path.

図８は、実施例４に係る文字認識方法を模式的に示すフローチャートである。図８に示すように、当該方法において、文字認識装置は、ステップＳ８０１の連結部分を認識する処理を実行する。 FIG. 8 is a flowchart schematically illustrating the character recognition method according to the fourth embodiment. As shown in FIG. 8, in the method, the character recognition device executes a process of recognizing a connected part in step S801.

ステップＳ８０１において、文字認識装置は、認識対象である文字画像中の一つまたは複数の連結部分に対して認識を行う。例えば、入力された画像が、図１３に示した、手書き文字「chorghui@xxx.net」を含む画像であると仮定する。文字認識装置は、手書き文字「chorghui@xxx.net」を「chorghui@」と「xxx」と「.」と「net」とを含む四つの連結部分に分ける。文字認識装置は、任意の適切な方法、例えば空白とピッチ解析法、常用の連結部分解析法又はアセンダー・ディセンダー解析法等を利用して、連結部分を認識しても良い。なお、ここでは連結部分の認識についての詳細な説明を省略する。 In step S801, the character recognition device recognizes one or more connected portions in the character image to be recognized. For example, it is assumed that the input image is an image including the handwritten character “chorghui@xxx.net” shown in FIG. The character recognition device divides the handwritten character “chorghui@xxx.net” into four connected parts including “chorghui @”, “xxx”, “.”, And “net”. The character recognition apparatus may recognize the connected portion by using any appropriate method, for example, a space and pitch analysis method, a commonly used connected portion analysis method, or an ascender / descender analysis method. In addition, detailed description about recognition of a connection part is abbreviate | omitted here.

文字認識装置は、これらの連結部分に対してそれぞれの処理を実行する。ステップＳ８０３においては、文字認識装置は、連結部分毎の各候補分割点を特定して標記する。ステップＳ８０４において、文字認識装置は、候補分割点毎の一つまたは複数の候補分割パスを特定して標記する。ステップＳ８０３は、図３に示した、実施例２におけるステップＳ３０３と同様であり、ステップＳ８０４は、ステップＳ３０４と同様である。言い換えれば、ステップＳ８０３と８０４においては、文字認識装置は、連結部分中の候補分割点毎の一つ又は複数の候補分割パスを特定すると共に、二次元標識により各候補分割パスのそれぞれに対して標記を行う。なお、標記方法は実施例２と同様であり、ここでは説明を省略する。 The character recognition device executes each process for these connected portions. In step S803, the character recognition device specifies and marks each candidate division point for each connected portion. In step S804, the character recognition device identifies and marks one or more candidate division paths for each candidate division point. Step S803 is similar to step S303 in the second embodiment illustrated in FIG. 3, and step S804 is similar to step S304. In other words, in steps S803 and S804, the character recognition device specifies one or a plurality of candidate division paths for each candidate division point in the connected portion, and for each candidate division path by two-dimensional labeling. Do the title. The marking method is the same as that in the second embodiment, and the description thereof is omitted here.

ステップＳ８０５において、文字認識装置は、ステップＳ８０３及びステップＳ８０４により特定された候補分割点と候補分割パスを利用して、連結部分のそれぞれに対して分割を行うことで、複数の画像セグメントを生成する。 In step S805, the character recognition device generates a plurality of image segments by performing division on each of the connected portions using the candidate division points and candidate division paths identified in steps S803 and S804. .

ステップＳ８０６において、文字認識装置は、ステップＳ８０５で取得した複数の画像セグメントに対して再度の分割及び／又は合併を行う。具体的には、文字認識装置は、隣接した画像セグメントを合併することができ、及び／又は、複数の画像セグメントの中の一つ又は複数個を再度の分割を実行することもできる。このようにして、文字認識装置は、新たな画像セグメントを生成する。 In step S806, the character recognition apparatus performs another division and / or merge on the plurality of image segments acquired in step S805. Specifically, the character recognition apparatus can merge adjacent image segments and / or perform another division of one or more of the plurality of image segments. In this way, the character recognition device generates a new image segment.

実施例４において、文字認識装置は、更にそれらの新たな画像セグメントに対して標記を行うことができる。例えば、図７Ａに示した例において、文字認識装置は、画像セグメント２を再度分割する分割点を位置ｉ＋２と標記する。一方、文字認識装置は、その対応する候補分割パスを（ｉ＋２,ｊ）と標記する。ここで、１≦ｊ≦Ｊであり、Ｊは当該分割点における候補分割パスの数である。また、図７Ｂに示した例において、文字認識装置は、画像セグメント１′と画像セグメント２′を合併した画像セグメント５′を｛（ｉ,０）、（ｉ＋２,０）｝と標記する。画像セグメント５′の左境界が第ｉ番目の分割点の第０番目の候補分割パスであり、右境界が第ｉ＋２番目の分割点の第０番目の候補分割パスであることを示している。このようにして、文字認識装置は、合併した画像セグメントのそれぞれの標記を完成する。 In the fourth embodiment, the character recognition apparatus can further mark the new image segments. For example, in the example shown in FIG. 7A, the character recognition device marks a division point where the image segment 2 is divided again as a position i + 2. On the other hand, the character recognition apparatus marks the corresponding candidate division path as (i + 2, j). Here, 1 ≦ j ≦ J, where J is the number of candidate division paths at the division point. In the example shown in FIG. 7B, the character recognition apparatus marks {(i, 0), (i + 2,0)} as the image segment 5 ′ obtained by merging the image segment 1 ′ and the image segment 2 ′. The left boundary of the image segment 5 'indicates the 0th candidate division path of the i-th division point, and the right boundary indicates the 0th candidate division path of the i + 2th division point. In this way, the character recognition device completes the marking of each merged image segment.

ステップＳ８０７において、文字認識装置は、ステップＳ８０５と８０６で取得した全ての画像セグメントを認識して、認識結果を取得する。認識結果を取得する方法は実施例１におけるステップＳ２０７と同様であり、ここでは説明を省略する。 In step S807, the character recognition device recognizes all the image segments acquired in steps S805 and 806, and acquires a recognition result. The method for acquiring the recognition result is the same as that in step S207 in the first embodiment, and the description thereof is omitted here.

ステップＳ８０９においては、文字認識装置は、ステップＳ８０７で取得した認識結果に応じて、二次元標識に基づいて、動的計画法を利用して複数の候補分割点及び候補分割パスから最適分割点及び最適分割パスを探索する。 In step S809, the character recognition device uses the dynamic programming method based on the two-dimensional label according to the recognition result acquired in step S807, and determines the optimum division point and the candidate division point from the plurality of candidate division points and candidate division paths. Search for the optimal split path.

ステップＳ８１１においては、文字認識装置は、全ての連結部分に対する処理が完了したかどうかを判定する。文字認識装置は、全ての連結部分に対する処理が完了していない場合には、ステップＳ８０３に戻り以降の処理を実行する。文字認識装置は、全ての連結部分に対する処理が完了している場合には、文字認識装置は、処理を終了する。 In step S811, the character recognition device determines whether or not processing for all connected portions has been completed. If the processes for all the connected parts are not completed, the character recognition apparatus returns to step S803 and executes the subsequent processes. If the character recognition device has completed the processing for all the connected portions, the character recognition device ends the processing.

この実施例においては、文字認識装置は、入力画像中の各連結部分のそれぞれに対して処理を行うので、入力画像に対する処理を優位に効率化することができる。 In this embodiment, the character recognition apparatus performs processing on each of the connected portions in the input image, so that the processing on the input image can be made more efficient.

図９は、実施例５に係る文字認識装置９００を模式的に示すブロック図である。図９に示すように、文字認識装置９００は、候補パス特定部９０２、画像セグメント生成部９０４、認識部９０６及びパス探索部９０８を備える。 FIG. 9 is a block diagram schematically illustrating a character recognition apparatus 900 according to the fifth embodiment. As illustrated in FIG. 9, the character recognition device 900 includes a candidate path specifying unit 902, an image segment generation unit 904, a recognition unit 906, and a path search unit 908.

候補パス特定部９０２は、認識対象である文字画像中の候補分割点毎の一つまたは複数の候補分割パスを特定し、候補分割点と候補分割パスを画像セグメント生成部９０４へ出力する。ここで、画像の複数の候補分割点のうち、少なくとも一つの候補分割点には複数の候補分割パスが存在するものと仮定する。 The candidate path specifying unit 902 specifies one or a plurality of candidate division paths for each candidate division point in the character image to be recognized, and outputs the candidate division points and the candidate division paths to the image segment generation unit 904. Here, it is assumed that at least one candidate division point among a plurality of candidate division points of the image has a plurality of candidate division paths.

実施例１〜４と同様に、認識対象である文字画像は、任意の適切なフォーマットであっても良い。例えば、認識対象である文字画像は、二値画像、ＪＰＧ画像、ビットマップ画像等であるが、ここでは全てを列挙しない。文字画像には、任意の種類の文字、例えば数字、符号、各種発音の字母又は文字等を含むことができ、ここではいかなる限定もしない。また、候補パス特定部９０２は、任意の適切な方法を利用して、候補分割点と候補分割パスを特定する。例えば、候補パス特定部９０２は、空白とピッチ解析法（White Space and Pitch Analysis）、投影解析法（Projection Analysis）、常用の連結文字処理法（Connected Component Processing）及びアセンダー・ディセンダー解析法（Ascender-Descender Analysis）等を利用する。なお、ここでは、これらの方法についての詳細な説明を省略する。 As in the first to fourth embodiments, the character image to be recognized may have any appropriate format. For example, the character images to be recognized are binary images, JPG images, bitmap images, etc., but not all are listed here. The character image can include any type of character, for example, numbers, symbols, various pronunciation words or characters, and is not limited in any way here. In addition, the candidate path specifying unit 902 specifies a candidate division point and a candidate division path using any appropriate method. For example, the candidate path specifying unit 902 includes white space and pitch analysis, projection analysis, connected component processing, and ascender-descender analysis. Descender Analysis) is used. Note that detailed description of these methods is omitted here.

画像セグメント生成部９０４は、候補パス特定部９０２により特定された候補分割パスを利用して、認識対象である文字画像を分割し、一つまたは複数の画像セグメントを生成し、複数の画像セグメントを認識部９０６へ出力する。 The image segment generation unit 904 divides the character image to be recognized using the candidate division path specified by the candidate path specification unit 902, generates one or a plurality of image segments, and generates a plurality of image segments. The data is output to the recognition unit 906.

認識部９０６は、画像セグメント生成部９０４から受信した一つまたは複数の画像セグメントのそれぞれを認識して、一つまたは複数の認識結果を取得する。そして認識部９０６は、認識結果をパス探索部９０８へ出力する。 The recognition unit 906 recognizes each of one or more image segments received from the image segment generation unit 904 and acquires one or more recognition results. Then, the recognition unit 906 outputs the recognition result to the path search unit 908.

認識部９０６は、実施例１で説明した方法を利用して画像セグメントの認識を行う。なお、ここでは説明を省略する。 The recognition unit 906 recognizes an image segment using the method described in the first embodiment. In addition, description is abbreviate | omitted here.

パス探索部９０８は、認識部９０６から受信した認識結果に基づいて、一つまたは複数の候補分割点及び候補分割点における候補分割パスから文字画像の一つまたは複数の最適分割点と最適分割点毎の最適分割パスを選択する。 Based on the recognition result received from the recognition unit 906, the path search unit 908 determines one or more candidate division points and one or more optimum division points and optimum division points of the character image from the candidate division paths at the candidate division points. Select the optimal split path for each.

パス探索部９０８は、判定基準を予め定める必要がなく、文字画像の一つの分割点において複数の候補分割パスを特定することができ、特定した複数の候補分割パスの中から最適分割パスを選択する。したがって、文字認識装置９００は、文字認識の正確性を大幅に向上することができる。 The path search unit 908 can specify a plurality of candidate division paths at one division point of the character image without needing to set a determination criterion in advance, and selects an optimum division path from the plurality of identified candidate division paths. To do. Therefore, the character recognition device 900 can greatly improve the accuracy of character recognition.

図１０は、実施例６に係る文字認識装置１０００を模式的に示すブロック図である。図１０に示す文字認識装置１０００は、候補パス特定部１００２、画像セグメント生成部１００４、認識部１００６とパス探索部１００８とを有する。ここで、候補パス特定部１００２は、図９に示した実施例における候補パス特定部９０２と同様の機能を有する。また、画像セグメント生成部１００４は、図９に示した実施例における画像セグメント生成部９０４と同様の機能を有する。したがって、ここでは、候補パス特定部１００２と画像セグメント生成部１００４について、説明を省略する。 FIG. 10 is a block diagram schematically illustrating a character recognition apparatus 1000 according to the sixth embodiment. A character recognition device 1000 illustrated in FIG. 10 includes a candidate path specifying unit 1002, an image segment generation unit 1004, a recognition unit 1006, and a path search unit 1008. Here, the candidate path specifying unit 1002 has the same function as the candidate path specifying unit 902 in the embodiment shown in FIG. Further, the image segment generation unit 1004 has the same function as the image segment generation unit 904 in the embodiment shown in FIG. Therefore, description of the candidate path specifying unit 1002 and the image segment generating unit 1004 is omitted here.

図９に示した実施例と相違するのは、認識部１００６が分類部１００６ａとマッチング部１００６ｂを有することにある。分類部１００６ａは、画像セグメント生成部１００４により生成された一つまたは複数の画像セグメントを分類する。分類部１００６ａは、実施例１で説明した分類方法を利用する。なお、ここでは分類方法についての説明を省略する。マッチング部１００６ｂは、分類部１００６ａから出力された分類結果に基づいて、すなわち、各画像セグメントの種類に基づいて、画像セグメント毎の当該種類の既知文字画像との間のマッチング距離を当該画像セグメントの認識結果として算出する。例えば、分類部１００６ａによって分類された場合、マッチング部１００６ｂは、ある画像セグメントが字母“a”に属するものに分類されると、当該画像セグメントと字母“a”の既知画像との間のマッチング距離を算出する。例えば、ある文字種類に対して、認識部１００６には対応する複数の既知画像が保存されていてもよい。分類部１００６ａによってある画像セグメントを当該種類に分類されると、マッチング部１００６ｂは、当該画像セグメントと複数の既知画像のそれぞれとの間のマッチング距離を算出する。マッチング部１００６ｂは、算出した中で最小のマッチング距離を当該画像セグメントに対応する認識結果として選択する。さらに、マッチング部１００６ｂは、そのマッチング距離に対して平均値又はその合計値を算出し、算出した値を当該画像セグメントに対応する認識結果とする。 The difference from the embodiment shown in FIG. 9 is that the recognition unit 1006 has a classification unit 1006a and a matching unit 1006b. The classification unit 1006a classifies one or a plurality of image segments generated by the image segment generation unit 1004. The classification unit 1006a uses the classification method described in the first embodiment. Note that description of the classification method is omitted here. Based on the classification result output from the classification unit 1006a, that is, based on the type of each image segment, the matching unit 1006b calculates the matching distance between the type of known character image for each image segment. Calculated as a recognition result. For example, when the image is classified by the classification unit 1006a, the matching unit 1006b, when an image segment is classified as belonging to the letter “a”, the matching distance between the image segment and the known image of the letter “a”. Is calculated. For example, a plurality of corresponding known images may be stored in the recognition unit 1006 for a certain character type. When a certain image segment is classified into the type by the classification unit 1006a, the matching unit 1006b calculates a matching distance between the image segment and each of the plurality of known images. The matching unit 1006b selects the calculated minimum matching distance as a recognition result corresponding to the image segment. Furthermore, the matching unit 1006b calculates an average value or a total value thereof for the matching distance, and uses the calculated value as a recognition result corresponding to the image segment.

パス探索部１００８は、認識部１００６から出力された認識結果、すなわち算出されたマッチング距離に応じて、複数の候補分割点及び補分割点における候補分割パスから文字画像の最適分割点と最適分割パスを選択する。具体的には、文字画像中のある文字については、その対応する候補分割点において複数の候補分割パスが存在する可能性があるので、画像セグメント生成部１００４において当該文字に対応する複数の画像セグメントが生成される場合がある。これに応じて、認識部１００６は、これらの画像セグメントに対応する複数の認識結果を得る場合がある。例えば、ある文字に対応する複数の画像セグメントについて、マッチング部１００６ｂは、複数のマッチング距離を算出する。なお、算出されたマッチング距離が小さいほど、認識結果が良いことを示す。したがって、パス探索部１００８は、最小のマッチング距離に対応する画像セグメントを選択することができる。すなわち、パス探索部１００８は、当該画像セグメントを分割して得られる分割パスを対応する分割点における最適分割パスとして選択する。 The path search unit 1008 determines the optimum division point and optimum division path of the character image from the candidate division paths at the plurality of candidate division points and the complementary division points according to the recognition result output from the recognition unit 1006, that is, the calculated matching distance. Select. Specifically, for a certain character in a character image, there may be a plurality of candidate division paths at the corresponding candidate division point, so that a plurality of image segments corresponding to the character in the image segment generation unit 1004. May be generated. In response to this, the recognition unit 1006 may obtain a plurality of recognition results corresponding to these image segments. For example, for a plurality of image segments corresponding to a certain character, the matching unit 1006b calculates a plurality of matching distances. Note that the smaller the calculated matching distance, the better the recognition result. Therefore, the path search unit 1008 can select an image segment corresponding to the minimum matching distance. That is, the path search unit 1008 selects a division path obtained by dividing the image segment as the optimum division path at the corresponding division point.

さらに、一例として、候補パス特定部９０２／１００２は、候補分割点毎の一つまたは複数の候補分割パスのそれぞれを二次元標識で標記してもよい。ここで、二次元標識は、各候補分割点を標記するための第一標識及び各候補分割パスを標記するための第二標識を含む。候補パス特定部９０２／１００２は、実施例２で説明した標記方法を利用することができる。なお、ここでは標記方法についての説明を省略する。パス探索部９０８／１００８は、候補パス特定部９０２／１００２により標記された二次元標識に基づいて、動的計画法を利用して、複数の候補分割点及び候補分割点における候補分割パスから最適分割点及び最適分割パスを選択してもよい。ここで、パス探索部９０８／１００８は、実施例２で説明した動的計画法を利用してパス探索を行ってもよい。なお、ここでは、動的計画法を利用したパス探索についての説明を省略する。パス探索部９０８／１００８は、動的計画法を利用して文字の分割パスを探索することにより、処理効率を優位に高めることができる。 Furthermore, as an example, the candidate path specifying unit 902/1002 may mark each of one or a plurality of candidate division paths for each candidate division point with a two-dimensional label. Here, the two-dimensional marker includes a first marker for marking each candidate segmentation point and a second marker for marking each candidate segmentation path. The candidate path specifying unit 902/1002 can use the marking method described in the second embodiment. Note that description of the marking method is omitted here. Based on the two-dimensional label marked by the candidate path specifying unit 902/1002, the path search unit 908/1008 uses dynamic programming to optimize the candidate division points and candidate division paths at the candidate division points. A division point and an optimal division path may be selected. Here, the path search unit 908/1008 may perform a path search using the dynamic programming described in the second embodiment. Note that description of path search using dynamic programming is omitted here. The path search unit 908/1008 can improve processing efficiency by searching for a character split path using dynamic programming.

別の一例においては、画像セグメント生成部９０４／１００４は、画像に対して分割を行った後、隣接した画像セグメントを合併し、及び／又は複数の画像セグメントの中の一つまたは複数個に対して再度の分割を実行する。そして、画像セグメント生成部９０４／１００４は、得られた全ての画像セグメントを認識部９０６／１００６へ出力してもよい。画像セグメント生成部９０４／１００４は、実施例３及び４で説明した方法を利用して、生成された画像セグメントに対して再度の分割及び／又は合併を行うことができ、ここでは説明を省略する。画像セグメント生成部９０４／１００４が画像セグメントに対して再度の分割及び／又は合併を行うことにより、文字認識装置は、画像セグメントの誤分割を防止することができ、最適分割パスを正確に探索することができる。 In another example, the image segment generation unit 904/1004 divides an image, merges adjacent image segments, and / or applies to one or more of the plurality of image segments. Then perform another division. Then, the image segment generation unit 904/1004 may output all the obtained image segments to the recognition unit 906/1006. The image segment generation unit 904/1004 can perform division and / or merge again on the generated image segment using the method described in the third and fourth embodiments, and the description is omitted here. . When the image segment generation unit 904/1004 performs the division and / or merge again on the image segment, the character recognition device can prevent the image segment from being erroneously divided, and accurately searches for the optimum division path. be able to.

図１１は、実施例７に係る文字認識装置１１００を模式的に示すブロック図である。実施例６における文字認識装置１０００と同様に、図１１に示した文字認識装置１１００は、候補パス特定部１１０２、画像セグメント生成部１１０４、認識部１１０６とパス探索部１１０８とを有する。相違するのは、文字認識装置１１００は、さらに、連結部分特定部１１０１を有することである。 FIG. 11 is a block diagram schematically illustrating a character recognition device 1100 according to the seventh embodiment. Similar to the character recognition device 1000 in the sixth embodiment, the character recognition device 1100 illustrated in FIG. 11 includes a candidate path identification unit 1102, an image segment generation unit 1104, a recognition unit 1106, and a path search unit 1108. The difference is that the character recognition device 1100 further includes a connected portion specifying unit 1101.

連結部分特定部１１０１は、認識対象である文字画像中の一つまたは複数の連結部分を認識し、得られた連結部分を候補パス特定部１１０２へ出力する。連結部分特定部１１０１は、実施例４と同様の方法を利用して動作するので、ここでは説明を省略する。 The connected part specifying unit 1101 recognizes one or more connected parts in the character image to be recognized, and outputs the obtained connected part to the candidate path specifying unit 1102. Since the connection part specifying unit 1101 operates using the same method as in the fourth embodiment, the description thereof is omitted here.

候補パス特定部１１０２は、連結部分特定部１１０１によって特定された連結部分をそれぞれ処理する。候補パス特定部１１０２は、連結部分毎における候補分割点毎の一つ又は複数の候補分割パスを特定する。さらに、候補パス特定部１１０２は、二次元標識で各候補分割パスを標記してもよい。なお、標記の方法は実施例２で説明した方法と同様であるので、ここでは説明を省略する。 The candidate path specifying unit 1102 processes each of the connected parts specified by the connected part specifying unit 1101. The candidate path specifying unit 1102 specifies one or a plurality of candidate division paths for each candidate division point for each connected portion. Further, the candidate path specifying unit 1102 may mark each candidate division path with a two-dimensional marker. In addition, since the method of description is the same as the method demonstrated in Example 2, description is abbreviate | omitted here.

画像セグメント生成部１１０４は、候補パス特定部１１０２により特定された候補分割点と候補分割パスを利用して各連結部分に対して分割を行って、複数の画像セグメントを生成する。一例として、画像セグメント生成部１１０４は、これらの画像セグメントに対して再度の分割及び／又は合併を行うことができる。具体的には、画像セグメント生成部１１０４は、隣接した画像セグメントに対して合併を行うことができ、及び／又は、複数の画像セグメントの中の一つ又は複数個に対して再度の分割を実行することもできる。したがって、画像セグメント生成部１１０４は、新たな画像セグメントを生成する。画像セグメント生成部１１０４は、これらの新たな画像セグメントを標記することもできる。標記の方法は、図８で説明した実施例４と同様であるので、ここでは説明を省略する。 The image segment generation unit 1104 divides each connected portion using the candidate division points and candidate division paths identified by the candidate path identification unit 1102, and generates a plurality of image segments. As an example, the image segment generation unit 1104 can perform division and / or merge again on these image segments. Specifically, the image segment generation unit 1104 can merge adjacent image segments and / or execute re-division on one or more of the plurality of image segments. You can also Therefore, the image segment generation unit 1104 generates a new image segment. The image segment generation unit 1104 can also mark these new image segments. Since the title method is the same as that of the fourth embodiment described with reference to FIG. 8, the description thereof is omitted here.

認識部１１０６は、画像セグメント生成部１１０４によって生成された全ての画像セグメントを認識して、相応する認識結果を取得する。画像セグメントの認識は実施例１〜４で説明した方法と同様の方法で処理してもよいので、ここでも説明を省略する。パス探索部１１０８は、認識部１１０６から受信した認識結果に応じて、複数の候補分割点及びその候補分割パスから文字画像の最適分割点と最適分割パスを選択する。具体的な探索方法は、実施例１〜４で説明した方法と同様の方法で処理してもよい。例えば、パス探索部１１０８は、認識部１１０６によって取得された認識結果に応じて、候補パス特定部１１０２と画像セグメント生成部１１０４により標記された二次元標識に基づいて、動的計画法を利用する。この結果、パス探索部１１０８は、複数の候補分割点及び候補分割パスから最適分割点及び最適分割パスを選択する。 The recognition unit 1106 recognizes all the image segments generated by the image segment generation unit 1104, and acquires corresponding recognition results. The recognition of the image segment may be processed by the same method as that described in the first to fourth embodiments, and thus the description thereof is omitted here. The path search unit 1108 selects the optimum division point and the optimum division path of the character image from the plurality of candidate division points and the candidate division paths according to the recognition result received from the recognition unit 1106. A specific search method may be processed in the same manner as the method described in the first to fourth embodiments. For example, the path search unit 1108 uses dynamic programming based on the two-dimensional signs marked by the candidate path identification unit 1102 and the image segment generation unit 1104 according to the recognition result acquired by the recognition unit 1106. . As a result, the path search unit 1108 selects an optimal division point and an optimal division path from a plurality of candidate division points and candidate division paths.

この実施例７において、文字認識装置１１００は、入力画像における各連結部分をそれぞれ処理することにより、入力画像に対する処理を優位に効率化することができる。 In the seventh embodiment, the character recognition device 1100 can process the input image more efficiently by processing each connected portion in the input image.

本明細書中の実施例に記載した各例は、一例であり、開示技術はこれらに限定されない。 Each example described in the examples in this specification is an example, and the disclosed technology is not limited thereto.

上述の文字認識装置における各構成部は、ソフトウェア、ファームウェア、ハードウェアあるいはそれらを組み合わせて配置されてもよい。配置に使用可能な具体的な手法又は方式は当業者に熟知されているものであり、ここでは説明を省略する。ソフトウェア又はファームウェアにより実現される場合には、記憶媒体又はネットワークから専用のハードウェア構成を持つコンピュータ（例えば図１２に示した汎用コンピュータ１３００）に対してこのソフトウェアを構成するプログラムをインストールする。そして、各種プログラムがインストールされると、汎用コンピュータ１３００が各種機能等を実行する。 Each component in the character recognition device described above may be arranged as software, firmware, hardware, or a combination thereof. Specific techniques or schemes that can be used for the arrangement are well known to those skilled in the art, and will not be described here. When implemented by software or firmware, a program constituting the software is installed from a storage medium or a network to a computer having a dedicated hardware configuration (for example, the general-purpose computer 1300 shown in FIG. 12). When the various programs are installed, the general-purpose computer 1300 executes various functions.

図１２においては、中央処理装置（CPU）１３０１は、読取専用メモリ（ROM）１３０２に記憶されたプログラム又は記憶部１３０８からランダムアクセスメモリ（RAM）１３０３にアップロードされたプログラムにより、各種の処理を実行する。ＲＡＭ１３０３にも、必要に応じて、ＣＰＵ１３０１が各種の処理を実行するため等に必要なデータを記憶している。ＣＰＵ１３０１とＲＯＭ１３０２とＲＡＭ１３０３とは、バス１３０４を介して接続されている。同様に、入力／出力インターフェース１３０５は、バス１３０４に接続されている。 In FIG. 12, the central processing unit (CPU) 1301 executes various processes by a program stored in a read-only memory (ROM) 1302 or a program uploaded from the storage unit 1308 to a random access memory (RAM) 1303. To do. The RAM 1303 also stores data necessary for the CPU 1301 to execute various processes as necessary. The CPU 1301, ROM 1302, and RAM 1303 are connected via a bus 1304. Similarly, the input / output interface 1305 is connected to the bus 1304.

入力部１３０６と、出力部１３０７と、記憶部１３０８と、通信部１３０９とは、入力／出力インターフェース１３０５に接続されている。ここで、入力部１３０６は、キーボード、マウス等を含む。例えば、出力部１３０７は、ブラウン管（CRT）や液晶ディスプレイ（LCD）等のディスプレイ、スピーカ等を含む。記憶部１３０８は、ハードディスク等を含む。例えば、通信部１３０９は、ＬＡＮカード等のネットワークインターフェースカード、モデム等を含む。また、通信部１３０９は、ネットワーク、例えばインターネットを経由して通信処理を実行する。入力／出力インターフェース１３０５にはドライブ１３１０を接続してもよい。例えば、磁気ディスク、光ディスク、光磁気ディスク、半導体メモリ等のような着脱可能な媒体１３１１が、ドライブ１３１０に取り付けられており、この中から読み出されるコンピュータプログラムが記憶部１３０８にインストールされる。 The input unit 1306, the output unit 1307, the storage unit 1308, and the communication unit 1309 are connected to the input / output interface 1305. Here, the input unit 1306 includes a keyboard, a mouse, and the like. For example, the output unit 1307 includes a display such as a cathode ray tube (CRT) or a liquid crystal display (LCD), a speaker, and the like. The storage unit 1308 includes a hard disk and the like. For example, the communication unit 1309 includes a network interface card such as a LAN card, a modem, and the like. The communication unit 1309 executes communication processing via a network, for example, the Internet. A drive 1310 may be connected to the input / output interface 1305. For example, a removable medium 1311 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is attached to the drive 1310, and a computer program read from the medium 1311 is installed in the storage unit 1308.

ソフトウェアによって上述した一連の処理を実行する場合、汎用コンピュータ１３００は、ネットワーク又は記憶媒体からソフトウェアを構成するプログラムをインストールされる。ここで、ネットワークは、例えばインターネットであり、記憶媒体は、例えば着脱可能な媒体１３１１である。 When the above-described series of processing is executed by software, the general-purpose computer 1300 is installed with a program constituting the software from a network or a storage medium. Here, the network is, for example, the Internet, and the storage medium is, for example, a removable medium 1311.

このような記憶媒体は、着脱可能な媒体１３１１には限定されない。例えば、記憶媒体は、ＲＯＭ１３０２、記憶部１３０８に含まれるハードディスクなどでも良い。この中にプログラムが記憶されており、かつこれらを含むデバイスと一緒にユーザに配布される。なお、着脱可能な媒体１３１１は、図１２に示したような、その中にプログラムを記憶するものであって、デバイスから離れて配布されることで、ユーザにプログラムを提供する。例えば、着脱可能な媒体１３１１は、磁気ディスク（フロッピー（登録商標）ディスクを含む）、光ディスク（コンパクトディスクリードオンリーメモリ（CD−ROM）やディジタルヴァーサタイルディスク（DVD）を含む）、光磁気ディスク（ミニディスク（登録商標）を含む）及び半導体メモリを含む。 Such a storage medium is not limited to the removable medium 1311. For example, the storage medium may be a ROM 1302, a hard disk included in the storage unit 1308, or the like. A program is stored therein and distributed to users together with a device including the program. The removable medium 1311 stores a program therein as shown in FIG. 12, and provides the program to the user by being distributed away from the device. For example, the removable medium 1311 includes a magnetic disk (including a floppy (registered trademark) disk), an optical disk (including a compact disk read-only memory (CD-ROM) and a digital versatile disk (DVD)), and a magneto-optical disk (including Including a mini disk (registered trademark) and a semiconductor memory.

開示技術はさらに、機器で読取可能な命令コードが記憶されたプログラム製品を提供する。命令コードが機器で読取られて実行される場合、開示技術による実施例の方法を実行することができる。 The disclosed technology further provides a program product in which an instruction code readable by a device is stored. When the instruction code is read and executed by the device, the method of the embodiment according to the disclosed technology can be executed.

これに対して、機器で読取り可能な命令コードが記憶されたプログラム製品を搭載する記憶媒体も開示技術に含まれる。記憶媒体は、フロッピー（登録商標）ディスク、光ディスク、光磁気ディスク、メモリカード、メモリスティック等を含むが、これらに限定されない。 On the other hand, the disclosed technology also includes a storage medium on which a program product in which an instruction code readable by a device is stored. The storage medium includes, but is not limited to, a floppy (registered trademark) disk, an optical disk, a magneto-optical disk, a memory card, a memory stick, and the like.

以上の開示技術の具体的な実施例の記載においては、実施形態の一例について記載しており、実施形態の一例として示した特徴は、同一あるいは同様の形態、一つ又は複数の他の実施形態で使用されてもよい。また、他の実施形態における特徴と組み合わせてもよく、あるいは、他の実施形態における特徴と置き換えてもよい。 In the above description of specific examples of the disclosed technology, an example of the embodiment is described, and the characteristics shown as an example of the embodiment are the same or similar, and one or more other embodiments. May be used. Further, it may be combined with features in other embodiments, or may be replaced with features in other embodiments.

専門用語“含む／有する”が本文で使用される場合には、特徴、要素、ステップ又は構成部の存在を意味するが、一つ又は複数の他の特徴、要素、ステップ又は構成部の存在又は追加を排除しないことを補足する。 Where the term “including / having” is used in the text, it means the presence of a feature, element, step or component, but the presence or absence of one or more other features, elements, steps or components. It is supplemented not to exclude addition.

なお、開示技術の方法は、明細書において記載した説明の順序で実行することに限られず、その他の順序にしたがって、同時にあるいは独立に実行されてもよい。したがって、本明細書で記載した方法の実行順序は、開示技術の技術的範囲を限定するものではない。 The method of the disclosed technology is not limited to being executed in the order of explanations described in the specification, and may be executed simultaneously or independently according to other orders. Therefore, the execution order of the method described in this specification does not limit the technical scope of the disclosed technology.

このように開示技術の具体的実施例を記載することにより開示技術を説明したが、上述の実施例は一例であり、限定的なものではない。当業者は、開示技術の技術的思想の範囲内において、各種の修正、改良あるいは類似品を設計することができる。これらの修正、改良あるいは類似品は開示技術の保護範囲内に含まれるものであると考えられるべきである。 As described above, the disclosed technology has been described by describing a specific embodiment of the disclosed technology. However, the above-described embodiment is merely an example and is not limiting. Those skilled in the art can design various modifications, improvements, or similar products within the scope of the technical idea of the disclosed technology. These modifications, improvements, or similar products should be considered as being within the scope of protection of the disclosed technology.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following supplementary notes are further disclosed with respect to the embodiments including the above examples.

（付記１）文字認識装置が、
認識対象である文字画像中の候補分割点毎における一つまたは複数の候補分割パスを特定するステップと、
前記各候補分割パスに基づいて前記認識対象である文字画像を分割し、一つまたは複数の画像セグメントを生成するステップと、
前記一つまたは複数の画像セグメントのそれぞれを認識して、一つまたは複数の認識結果を取得するステップと、
前記認識結果に基づいて、前記候補分割点及び前記候補分割点における候補分割パスから前記文字画像の一つまたは複数の最適分割点と最適分割点毎における最適分割パスを選択するステップと
を実行することを特徴とする文字認識方法。 (Supplementary note 1) The character recognition device
Identifying one or more candidate division paths for each candidate division point in the character image to be recognized;
Dividing the character image to be recognized based on each candidate division path to generate one or a plurality of image segments;
Recognizing each of the one or more image segments to obtain one or more recognition results;
Selecting one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and the candidate division paths at the candidate division points based on the recognition result. A character recognition method characterized by the above.

（付記２）文字認識装置が、
前記候補分割点毎における候補分割パスを特定するステップを実行した後、前記各候補分割点を標記するための第一標識及び前記各候補分割パスを標記するための第二標識を含む二次元標識を使用して、前記候補分割点毎における一つまたは複数の候補分割パスのそれぞれを標記するステップを更に実行することを含む、付記１に記載の文字認識方法。 (Appendix 2) The character recognition device
After executing the step of identifying candidate division paths for each candidate division point, a two-dimensional indicator including a first indicator for marking each candidate division point and a second indicator for marking each candidate division path The character recognition method according to claim 1, further comprising the step of marking each of one or a plurality of candidate division paths for each candidate division point using.

（付記３）前記認識結果に基づいて、前記候補分割点及び前記候補分割点における候補分割パスから前記文字画像の一つまたは複数の最適分割点と最適分割点毎の最適分割パスを選択するステップは、
前記候補分割パス毎の二次元標識に基づいて、動的計画法により候補分割点及び候補分割点における候補分割パスから最適分割点及び最適分割点毎における最適分割パスを探索するステップを含む、付記２に記載の文字認識方法。 (Supplementary Note 3) Step of selecting one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and candidate division paths at the candidate division points based on the recognition result Is
Including a step of searching for an optimum division point and an optimum division path for each optimum division point from the candidate division points and the candidate division paths at the candidate division points by dynamic programming based on the two-dimensional label for each candidate division path. 2. The character recognition method according to 2.

（付記４）前記一つまたは複数の画像セグメントを認識して、一つまたは複数の認識結果を取得するステップは、
前記一つまたは複数の画像セグメントを分類するステップと、
前記各画像セグメントの種類に基づいて画像セグメント毎の当該種類の既知文字画像との間のマッチング距離を当該画像セグメントの認識結果として算出するステップと
を含む、付記１に記載の文字認識方法。 (Supplementary Note 4) Recognizing the one or more image segments and obtaining one or more recognition results includes:
Classifying the one or more image segments;
The character recognition method according to claim 1, further comprising: calculating a matching distance between each type of image segment and the type of known character image as a recognition result of the image segment based on the type of the image segment.

（付記５）文字認識装置が、
前記一つまたは複数の画像セグメントを認識して、一つまたは複数の認識結果を取得するステップを実行する前に、隣接した画像セグメントを合併し、及び／又は前記複数の画像セグメントの中の一つまたは複数個に対して再度の分割をするステップを更に実行することを含む、付記１に記載の文字認識方法。 (Supplementary note 5) The character recognition device
Before performing the step of recognizing the one or more image segments and obtaining one or more recognition results, merge adjacent image segments and / or one of the plurality of image segments. The character recognition method according to appendix 1, further comprising executing a step of dividing again one or more.

（付記６）文字認識装置が、
認識対象である文字画像中の候補分割点毎における一つまたは複数の候補分割パスを特定するステップを実行する前に、前記認識対象である文字画像中の一つまたは複数の連結部分を認識するステップを更に実行することを含む、付記１に記載の文字認識方法。 (Appendix 6) The character recognition device
Before executing the step of identifying one or more candidate division paths for each candidate division point in the character image to be recognized, one or more connected portions in the character image to be recognized are recognized. The character recognition method according to claim 1, further comprising executing the step.

（付記７）認識対象である文字画像中の候補分割点毎の一つまたは複数の候補分割パスを特定する候補パス特定部と、
前記各候補分割パスにより前記認識対象である文字画像を分割し、一つまたは複数の画像セグメントを生成する画像セグメント生成部と、
前記一つまたは複数の画像セグメントのそれぞれを認識して、一つまたは複数の認識結果を取得する認識部と、
前記認識結果に基づいて、前記候補分割点及び前記候補分割点における候補分割パスから前記文字画像の一つまたは複数の最適分割点と最適分割点毎における最適分割パスを選択するパス探索部と
を有することを特徴とする文字認識装置。 (Supplementary Note 7) A candidate path specifying unit that specifies one or a plurality of candidate division paths for each candidate division point in the character image to be recognized;
An image segment generation unit that divides the character image to be recognized by each candidate division pass and generates one or a plurality of image segments;
A recognition unit that recognizes each of the one or more image segments and obtains one or more recognition results;
A path search unit that selects one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and candidate division paths at the candidate division points based on the recognition result; A character recognition device comprising:

（付記８）前記候補パス特定部は更に、
前記各候補分割点を標記するための第一標識及び前記各候補分割パスを標記するための第二標識を含む二次元標識を使用して、前記候補分割点毎における一つまたは複数の候補分割パスのそれぞれを標記するように配置される、付記７に記載の文字認識装置。 (Supplementary Note 8) The candidate path specifying unit further includes:
One or more candidate divisions for each of the candidate division points using a two-dimensional marker that includes a first indicator for marking each candidate division point and a second indicator for marking each candidate division path The character recognition device according to appendix 7, which is arranged so as to mark each of the paths.

（付記９）前記パス探索部は更に、
前記二次元標識に基づいて、動的計画法により候補分割点及び候補分割パスから最適分割点及び最適分割パスを探索するように配置される、付記８に記載の文字認識装置。 (Supplementary Note 9) The path search unit further includes:
The character recognition device according to appendix 8, wherein the character recognition device is arranged so as to search for an optimum division point and an optimum division path from the candidate division points and the candidate division path based on the two-dimensional mark by dynamic programming.

（付記１０）前記認識部は、
前記一つまたは複数の画像セグメントを分類する分類部と、
前記各画像セグメントの種類に基づいて画像セグメント毎の当該種類の既知文字画像との間のマッチング距離を当該画像のセグメントの認識結果として算出するマッチング部と
を備える、付記７に記載の文字認識装置。 (Supplementary Note 10) The recognition unit
A classifier for classifying the one or more image segments;
The character recognition device according to appendix 7, further comprising: a matching unit that calculates a matching distance between the type of each image segment and the known character image of the type for each image segment as a recognition result of the segment of the image. .

（付記１１）前記画像セグメント生成部は更に、
隣接した画像セグメントを合併し、及び／又は前記複数の画像セグメントの中の一つまたは複数個に対して再度の分割を実行し、得られた画像セグメントを前記文字認識装置へ出力するように配置される、付記７に記載の文字認識装置。 (Supplementary Note 11) The image segment generation unit further includes:
Arranging adjacent image segments and / or performing re-division on one or more of the plurality of image segments and outputting the resulting image segments to the character recognition device The character recognition device according to appendix 7.

（付記１２）認識対象である文字画像中の一つ又は複数の連結部分を認識する連結部分特定部を更に備える、付記７に記載の文字認識装置。 (Additional remark 12) The character recognition apparatus of Additional remark 7 further provided with the connection part specific | specification part which recognizes the 1 or several connection part in the character image which is recognition object.

９００文字認識装置
９０２候補パス特定部
９０４画像セグメント生成部
９０６認識部
９０８パス探索部
１０００文字認識装置
１００２候補パス特定部
１００４画像セグメント生成部
１００６認識部
１００６ａ分類部
１００６ｂマッチング部
１００８パス探索部
１１００文字認識装置
１１０１連結部分特定部
１１０２候補パス特定部
１１０４画像セグメント生成部
１１０６認識部
１１０８パス探索部
１３００汎用コンピュータ
１３０１ＣＰＵ
１３０２ＲＯＭ
１３０３ＲＡＭ
１３０４バス
１３０５入力／出力インターフェース
１３０６入力部
１３０７出力部
１３０８記憶部
１３０９通信部
１３１０ドライブ
１３１１着脱可能な媒体 900 Character recognition device 902 Candidate path identification unit 904 Image segment generation unit 906 Recognition unit 908 Path search unit 1000 Character recognition device 1002 Candidate path identification unit 1004 Image segment generation unit 1006 Recognition unit 1006a Classification unit 1006b Matching unit 1008 Path search unit 1100 Character Recognizing device 1101 Linked part specifying unit 1102 Candidate path specifying unit 1104 Image segment generating unit 1106 Recognizing unit 1108 Path searching unit 1300 General-purpose computer 1301 CPU
1302 ROM
1303 RAM
1304 Bus 1305 Input / output interface 1306 Input unit 1307 Output unit 1308 Storage unit 1309 Communication unit 1310 Drive 1311 Removable medium

Claims

The character recognition device
Identifying one or more candidate division paths for each candidate division point in the character image to be recognized;
Dividing the character image to be recognized based on each candidate division path to generate one or a plurality of image segments;
Recognizing each of the one or more image segments to obtain one or more recognition results;
Selecting one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and the candidate division paths at the candidate division points based on the recognition result. A character recognition method characterized by the above.

The character recognition device
After executing the step of identifying candidate division paths for each candidate division point, a two-dimensional indicator including a first indicator for marking each candidate division point and a second indicator for marking each candidate division path The character recognition method according to claim 1, further comprising the step of marking each of one or a plurality of candidate division paths for each candidate division point using.

Based on the recognition result, selecting one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and candidate division paths at the candidate division points,
Searching for an optimum division point and an optimum division path for each optimum division point from the candidate division points and the candidate division points at the candidate division points by dynamic programming based on the two-dimensional label for each candidate division path. Item 3. A character recognition method according to Item 2.

Recognizing the one or more image segments and obtaining one or more recognition results,
Classifying the one or more image segments;
The character recognition method according to claim 1, further comprising: calculating a matching distance between each type of image segment and a known character image of that type for each image segment as a recognition result of the image segment.

The character recognition device
Before performing the step of recognizing the one or more image segments and obtaining one or more recognition results, merge adjacent image segments and / or one of the plurality of image segments. The character recognition method according to claim 1, further comprising executing a step of performing division again on one or more.

A candidate path specifying unit for specifying one or a plurality of candidate division paths for each candidate division point in the character image to be recognized;
An image segment generation unit that divides the character image to be recognized by each candidate division pass and generates one or a plurality of image segments;
A recognition unit that recognizes each of the one or more image segments and obtains one or more recognition results;
A path search unit that selects one or a plurality of optimum division points of the character image and an optimum division path for each optimum division point from the candidate division points and candidate division paths at the candidate division points based on the recognition result; A character recognition device comprising:

The candidate path specifying unit further includes:
One or more candidate divisions for each of the candidate division points using a two-dimensional marker that includes a first indicator for marking each candidate division point and a second indicator for marking each candidate division path The character recognition device according to claim 6, which is arranged so as to mark each of the paths.

The path search unit further includes:
The character recognition device according to claim 7, wherein the character recognition device is arranged so as to search for an optimum division point and an optimum division path from the candidate division points and the candidate division paths by dynamic programming based on the two-dimensional mark.

The recognition unit
A classifier for classifying the one or more image segments;
The character recognition of Claim 6 provided with the matching part which calculates the matching distance between the known character image of the said kind for every image segment based on the kind of each said image segment as a recognition result of the segment of the said image apparatus.

The image segment generation unit further includes
Arranging adjacent image segments and / or performing re-division on one or more of the plurality of image segments and outputting the resulting image segments to the character recognition device The character recognition device according to claim 6.