JP3187899B2

JP3187899B2 - Character recognition device

Info

Publication number: JP3187899B2
Application number: JP34351791A
Authority: JP
Inventors: 浩司山本; 進丸野; 泰治〆木; 茂生阪上; 敏行香田; 寿男丹羽; 良宏小島
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1991-12-25
Filing date: 1991-12-25
Publication date: 2001-07-16
Anticipated expiration: 2016-07-16
Also published as: JPH05174187A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明はパターン認識の分野にお
いて、文書からの文字切り出し処理と認識、言語処理を
相補的に用いることによって認識精度の向上を図った文
字認識装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition apparatus for improving the recognition accuracy in a field of pattern recognition by using character extraction processing from a document, recognition, and language processing in a complementary manner.

【０００２】[0002]

【従来の技術】従来の文字認識装置としては、例えば
「手書き文字列認識方法の一検討」、佐藤弥生、津雲
淳、１９９０年電子情報通信学会春季全国大会予稿、
Ｄ−４９１、７−２４３に示された装置が知られてい
る。2. Description of the Related Art As a conventional character recognition apparatus, for example, "Study of Handwritten Character String Recognition Method", Yayoi Sato and Jun Tsugumo, Proceedings of the 1990 IEICE Spring Conference,
D-490, 7-243 are known.

【０００３】図１１はこの従来の文字認識装置の主要構
成図を示すものである。FIG. 11 shows a main configuration diagram of this conventional character recognition device.

【０００４】まず、撮像され、入力された画像文字列２
０１は矩形化処理部２０２に２次元データとして渡され
る。ここで文字列構成成分に含まれる画素は１、含まれ
ない画素は０、文字列方向は便宜上横軸方向であり、文
字列の高さも既知であるとする。First, an image character string 2 captured and input
01 is passed to the rectangularization processing unit 202 as two-dimensional data. Here, it is assumed that the number of pixels included in the character string component is 1, the number of pixels not included is 0, the character string direction is the horizontal axis direction for convenience, and the height of the character string is also known.

【０００５】図１２は従来の矩形化処理部２０２の具体
的な構成図である。矩形化処理部２０２に入力された文
字列２０１は文字列データ記憶部２１１に保持される。
周辺分布算出部２１２は文字列データ記憶部２１１に保
持された画素値を縦方向に累積した値を算出し、その結
果をガウシアンフィルタ処理部２１３に出力する。ガウ
シアンフィルタ処理部２１３は周辺分布算出部２１２に
よって求められた文字列データの画素値の縦方向の累積
ヒストグラムに対してガウス型関数を畳み込むことによ
って平滑化を行うことで累積ヒストグラムの高周波成分
を除き、ノイズの影響を抑える。平滑化を受けた累積ヒ
ストグラムは左右端検出部２１４に入力される。左右端
検出部２１４は図１３に示すように累積ヒストグラムの
値が一定値（いき値）以下の部分を検出して、その境界
座標位置を順に基本矩形の左右端座標として列方向周辺
分布算出部２１５と基本矩形記憶部２１７に出力する。FIG. 12 is a diagram showing a specific configuration of the conventional rectangular processing section 202. As shown in FIG. The character string 201 input to the rectification processing unit 202 is stored in the character string data storage unit 211.
The margin distribution calculation unit 212 calculates a value obtained by accumulating the pixel values held in the character string data storage unit 211 in the vertical direction, and outputs the result to the Gaussian filter processing unit 213. The Gaussian filter processing unit 213 removes high frequency components of the cumulative histogram by performing smoothing by convolving a Gaussian function with the vertical cumulative histogram of the pixel values of the character string data obtained by the peripheral distribution calculating unit 212. , Reduce the effects of noise. The smoothed cumulative histogram is input to the left and right end detection unit 214. As shown in FIG. 13, the left and right edge detection unit 214 detects a portion where the value of the cumulative histogram is equal to or less than a certain value (threshold value), and sequentially sets the boundary coordinate positions as the left and right edge coordinates of the basic rectangle in the column direction peripheral distribution calculation unit. 215 and the basic rectangle storage unit 217.

【０００６】このとき、いき値はすべての基本矩形の横
幅が文字列の高さの一定倍以下になるように設定される
ものとする。At this time, it is assumed that the threshold is set so that the width of all the basic rectangles is equal to or less than a certain multiple of the height of the character string.

【０００７】列方向周辺分布算出部２１５は左右端検出
部２１４からの入力を受けて基本矩形の左右端座標に挟
まれる部分の画素値を文字列データ記憶部２１１から読
みだして横方向に斜影し、その結果を上下端検出部２１
６に出力する。さらに上下端検出部２１６は図１４に示
すように列方向周辺分布算出部２１５からの斜影結果の
文字構成成分が成す部分の最大値、最小値を求め、その
結果を対応する基本矩形の上下端として基本矩形記憶部
２１７に出力する。The column direction marginal distribution calculating section 215 receives the input from the left and right end detecting section 214, reads out the pixel values of the portion sandwiched between the left and right end coordinates of the basic rectangle from the character string data storage section 211, and performs horizontal oblique shading. And outputs the result to the upper and lower ends
6 is output. Further, as shown in FIG. 14, the upper / lower edge detecting section 216 obtains the maximum value and the minimum value of the portion formed by the character component of the oblique result from the column-direction marginal distribution calculating section 215, and compares the result with the upper and lower edge of the corresponding basic rectangle. Is output to the basic rectangle storage unit 217.

【０００８】以上の処理によって基本矩形記憶部２１７
には基本矩形がその左右端と上下端で表わされ、検出順
に記憶されることになる。By the above processing, the basic rectangle storage unit 217
The basic rectangle is represented by its left and right ends and its upper and lower ends, and is stored in the order of detection.

【０００９】図１５は、図１０の矩形統合認識部２０３
の具体的な構成図であり、図１６はその一部である重み
付きラティス作成部２４１の具体的な構成図である。以
下に矩形統合認識部２０３、ラティス作成部２４１の動
作を図に従って説明する。FIG. 15 is a block diagram showing the rectangular integrated recognition unit 203 shown in FIG.
FIG. 16 is a specific configuration diagram of the weighted lattice creating unit 241 which is a part of the configuration. Hereinafter, the operations of the rectangle integration recognition unit 203 and the lattice creation unit 241 will be described with reference to the drawings.

【００１０】基本矩形組合せ部２５１は上記基本矩形記
憶部２１７に保持されている基本矩形の座標値を読みだ
し、隣合う基本矩形を組合せた矩形の外郭幅が文字列の
高さの一定倍以下になるようなすべての組合せを検出
し、その結果を重み付きラティス記憶部２５２に出力す
る。The basic rectangle combination unit 251 reads out the coordinate values of the basic rectangles stored in the basic rectangle storage unit 217 and determines that the outline width of the rectangle obtained by combining the adjacent basic rectangles is not more than a fixed multiple of the character string height. Are detected, and the result is output to the weighted lattice storage unit 252.

【００１１】重み付きラティス記憶部２５２は、図１７
に示すように基本矩形組合せ部２５１によって検出され
たすべての基本矩形の組合せについて、組合せ矩形の上
下、左右端座標を記憶保持し、さらに各組合せ矩形に対
応する重み値及び、候補文字を記憶保持するためのメモ
リを確保する。The weighted lattice storage unit 252 stores a
As shown in the figure, the upper, lower, left and right coordinates of the combination rectangle are stored and retained for all the combinations of the basic rectangles detected by the basic rectangle combination unit 251, and the weight values and candidate characters corresponding to each combination rectangle are also stored and retained. To secure memory for

【００１２】重み値算出部２４２は、図１１の文字評価
値算出部２０４と矩形評価値算出部２０５からの入力を
受け、重み付きラティス記憶部２５２に記憶されている
各組合せ矩形に対応する重み値及び、候補文字を求め、
その結果を再び重み付きラティス記憶部２５２の対応す
るメモリに出力する。A weight value calculation unit 242 receives inputs from the character evaluation value calculation unit 204 and the rectangle evaluation value calculation unit 205 in FIG. 11 and receives weights corresponding to each combination rectangle stored in the weighted lattice storage unit 252. Find the value and candidate characters,
The result is output to the corresponding memory of the weighted lattice storage unit 252 again.

【００１３】ここでまず、文字評価値算出部２０４と矩
形評価値算出部２０５の動作について説明する。First, the operation of the character evaluation value calculation unit 204 and the rectangle evaluation value calculation unit 205 will be described.

【００１４】図１８は文字評価値算出部２０４の具体的
な構成図である。特徴量抽出部２７１は重み付きラティ
ス記憶部２５２に記憶されている組合せ矩形の上下、左
右端座標を参照し、前記座標内に含まれる画素を文字列
データ記憶部２１１から読みだして特徴量を得、類似度
算出部２７３に出力する。特徴量抽出部２７１によって
抽出される特徴量としては、例えばメッシュ特徴ベクト
ルなどが考えられる。メッシュ特徴ベクトルは以下のよ
うにして得られる。FIG. 18 is a specific configuration diagram of the character evaluation value calculation unit 204. The feature amount extraction unit 271 refers to the upper, lower, left and right coordinates of the combination rectangle stored in the weighted lattice storage unit 252, reads out the pixels included in the coordinates from the character string data storage unit 211, and calculates the feature amount. And outputs it to the similarity calculation unit 273. As the feature amount extracted by the feature amount extraction unit 271, for example, a mesh feature vector or the like can be considered. The mesh feature vector is obtained as follows.

【００１５】入力される画素が含まれる組合せ矩形の一
辺をｎ等分し、前記矩形全体をｎ×ｎ個のマスに分割
し、各マス内の文字構成成分に含まれる画素数がそのマ
ス全体の画素数に占める割合の値を順番に並べてできる
ｎ×ｎ次元ベクトルをメッシュ特徴ベクトルとする。One side of a combination rectangle including input pixels is divided into n equal parts, the entire rectangle is divided into n × n squares, and the number of pixels included in a character component in each square is determined by the entire square. The n × n-dimensional vector formed by arranging the values of the ratio of the pixel number to the pixel number in order is defined as the mesh feature vector.

【００１６】文字辞書２７２には、各文字の代表メッシ
ュ特徴ベクトルが記憶保持されている。各文字の代表メ
ッシュ特徴ベクトルは複数の標準文字のメッシュ特徴ベ
クトルの平均ベクトルとする。The character dictionary 272 stores and stores a representative mesh feature vector of each character. The representative mesh feature vector of each character is an average vector of the mesh feature vectors of a plurality of standard characters.

【００１７】類似度算出部２７３は各文字の代表メッシ
ュ特徴ベクトルと特徴量抽出部２７１から入力されたメ
ッシュ特徴ベクトルの２乗誤差の逆数値を求め、どの文
字にどのくらい近いかを示す類似度値として出力選出部
２７４に出力する。The similarity calculation unit 273 calculates the reciprocal value of the square error between the representative mesh feature vector of each character and the mesh feature vector input from the feature amount extraction unit 271, and calculates a similarity value indicating which character and how close it is. Is output to the output selection unit 274.

【００１８】出力選出部２７４はすべての文字に対する
類似度値が入力された後に、その中の最大値ｘと対応す
る候補文字を重み値算出部２４２に出力する。After the similarity values for all the characters are input, the output selection unit 274 outputs the candidate characters corresponding to the maximum value x to the weight value calculation unit 242.

【００１９】図１９は矩形評価値算出部２０５の具体的
な構成図である。方形度算出部２８１は重み付きラティ
ス記憶部２５２に記憶されている組合せ矩形の上下、左
右端座標を参照し、前記矩形の縦横比（方形度ｒ）を
（数１）に従って算出し、適合度算出部２８３に出力す
る。FIG. 19 is a specific configuration diagram of the rectangle evaluation value calculation unit 205. The squareness calculating unit 281 calculates the aspect ratio (squareness r) of the rectangle according to (Equation 1) by referring to the upper, lower, left and right end coordinates of the combination rectangle stored in the weighted lattice storage unit 252, and Output to calculation section 283.

【００２０】[0020]

【数１】ｒ＝矩形の縦長さ／矩形の横長さ空白度算出部２８２は重み付きラティス記憶部２５２に
記憶されている組合せ矩形の上下、左右端座標を参照
し、文字列データ記憶部２１１から前記矩形内の画素を
読みだし、そのうちの文字構成成分に含まれない画素数
が前記矩形全体の画素数に占める割合（空白度ｓ）を算
出して適合度算出部２８３に出力する。[Mathematical formula-see original document] r = vertical length of rectangle / horizontal length of rectangle The blankness calculation unit 282 refers to the upper, lower, left and right coordinates of the combination rectangle stored in the weighted lattice storage unit 252, and determines the character string data storage unit 211. , The pixels within the rectangle are read out, and the ratio of the number of pixels not included in the character component to the total number of pixels of the rectangle (blank degree s) is calculated and output to the matching degree calculation unit 283.

【００２１】適合度算出部２８３は（数２）に示す値ｙ
を適合度値として求め、重み値算出部２４２に出力す
る。The goodness-of-fit calculating section 283 calculates the value y shown in (Equation 2)
Is obtained as a fitness value, and is output to the weight value calculation unit 242.

【００２２】[0022]

【数２】ｙ＝ρ1(ｒ)・ρ2(ｓ) 但し、ｒは方形度算出部２８１からの入力、ｓは空白度算出部２８２からの入力、 ρ1、ρ2は適当な関数であり、処理対象によって変化す
る事はない。Y = ρ1 (r) · ρ2 (s) where r is an input from the squareness calculator 281, s is an input from the blankness calculator 282, ρ1 and ρ2 are appropriate functions, and It does not change depending on the target.

【００２３】以上説明したように文字評価値算出部２０
４が出力として出力選出部２６４からｘとそれに対応す
る候補文字、矩形評価値算出部２０５が出力として適合
度算出部２７３からｙを出すとき、重み値算出部２４２
は（数３）に示す値ｗを対応する組合せ矩形の重み付き
ラティス記憶部２５２の重み値として算出する。As described above, the character evaluation value calculating section 20
4, when the output selection unit 264 outputs x and the corresponding candidate character from the output selection unit 264 and the rectangle evaluation value calculation unit 205 outputs y from the suitability calculation unit 273 as an output, the weight value calculation unit 242
Calculates the value w shown in (Equation 3) as the weight value of the weighted lattice storage unit 252 of the corresponding combination rectangle.

【００２４】[0024]

【数３】ｗ＝１・・・(ｘ≧α2) ρ( (C0＋C1・ｘ＋C2・ｙ＋C3・ｘ・ｙ)/C4 ) ・・・(ｘ＜α2) 但し、C0＝β1・(α2−1)(α2−α1)−α1・β2、 C1＝α1・β1−α2・β1＋β2、C2＝α1・α2−α2²＋α2、 C3＝α2−α1−１、C4＝(α2−α1)・(β2−β1)、 ρ(ｚ)＝ｚ・・・（ｚ＞０）、ρ(ｚ)＝０・・・（ｚ≦０）。## EQU00003 ## w = 1 (x.gtoreq..alpha.2) .rho. ((C0 + C1.x + C2.y + C3.x.y) / C4) (x <.alpha.2) where C0 = .beta.1. (. Alpha.2-1) (α2-α1) -α1 · β2 , C1 = α1 · β1-α2 · β1 + β2, C2 = α1 · α2-α2 2 + α2, C3 = α2-α1-1, C4 = (α2-α1) · (β2-β1 ), Ρ (z) = z (z> 0), ρ (z) = 0 (z ≦ 0).

【００２５】なお、処理パラメータα1、α2、β1、β2
は一定値であり、処理対象によって変化する事はない。The processing parameters α1, α2, β1, β2
Is a constant value and does not change depending on the processing target.

【００２６】（数３）は以下の４つ条件を満たし、値が
大きい程文字らしいと考えられる。１．文字評価値算出部２５１から得られる評価値ｘがあ
る値α1より小さく、矩形評価値算出部２５２から得ら
れる評価値ｙがある値β1より小さい場合は、文字らし
くないと判断し、ｗ＝０とする。２．ｘがある値α2より大きいときは、文字らしいと判
断しｗ＝１とする。３．ｙがある値β2より大きいときは、文字である可能
性が大きい。４．重み値ｗはｘとｙに関して、単調に増加する。(Equation 3) satisfies the following four conditions, and it is considered that the larger the value, the more likely the character is. 1. If the evaluation value x obtained from the character evaluation value calculation unit 251 is smaller than a certain value α1, and the evaluation value y obtained from the rectangle evaluation value calculation unit 252 is smaller than a certain value β1, it is determined that the character is not a character, and w = 0. And 2. If x is larger than a certain value α2, it is determined that the character is character-like, and w = 1. 3. When y is larger than a certain value β2, it is highly possible that the character is a character. 4. The weight value w monotonically increases with respect to x and y.

【００２７】次に算出した重み値ｗと出力選出部２６４
から入力された候補文字を重み付きラティス記憶部２５
２の対応するメモリに出力する。Next, the calculated weight value w and the output selection unit 264
Lattice storage unit 25 stores candidate characters input from
2 to the corresponding memory.

【００２８】以上のようにしてすべての組合せ矩形に対
する重み値とそのときの候補文字が重み付きラティス記
憶部２５２に記憶保持された後、切り出し位置探索部２
４３が以下のようにして切り出し位置及び、認識結果を
決定する。After the weight values for all the combination rectangles and the candidate characters at that time are stored and held in the weighted lattice storage unit 252 as described above, the cutout position search unit 2
43 determines the cutout position and the recognition result as follows.

【００２９】重み付きラティス記憶部２５２の始端から
終端に至る経路のうちで経路上の一矩形当りの重み値が
最大になる経路を選択し、そのときの組合せ矩形を一文
字とし、その候補文字を認識結果２０６として出力す
る。From the paths from the start to the end of the weighted lattice storage unit 252, the path with the largest weight per rectangle on the path is selected, the combined rectangle at that time is set to one character, and the candidate character is Output as a recognition result 206.

【００３０】[0030]

【発明が解決しようとする課題】しかしながら上記のよ
うな構成では、単一文字のパターンマッチングだけから
得られる類似度値から候補文字列を求めているために、
常に言語的に正しいとは言えない。さらに矩形化処理に
おいて１つの基本矩形内に一文字以上の文字が含まれた
場合にそれにともなって文字切り出し結果や認識結果が
大きく間違ってしまう。さらに一定の矩形評価規準によ
って評価しているために不定ピッチの文字列を含む印刷
文書や手書き文書等のように文書毎、領域毎に最適な矩
形評価規準が異なる場合に矩形評価値の誤りが生じやす
くなる。However, in the above configuration, a candidate character string is obtained from a similarity value obtained only from pattern matching of a single character.
It is not always linguistically correct. Further, when one or more characters are included in one basic rectangle in the rectification process, the character cutout result and the recognition result are greatly incorrect. In addition, since the evaluation is performed according to a certain rectangular evaluation criterion, when the optimum rectangular evaluation criterion differs for each document and each region, such as a printed document or a handwritten document including a character string of an indefinite pitch, an error in the rectangular evaluation value may occur. It is easy to occur.

【００３１】以上の理由から処理系全体としての文字評
価規準の不適性によって認識精度が悪くなるという課題
を有していた。For the above reasons, there is a problem that the recognition accuracy is deteriorated due to the inappropriateness of the character evaluation criterion of the entire processing system.

【００３２】本発明はかかる従来の文字認識装置の課題
に鑑み、処理系全体の処理精度の向上を図り、文字の誤
切り出しを減少させ、認識精度の良い文字認識装置を提
供することを目的とする。The present invention has been made in view of the above problems of the conventional character recognition apparatus, and aims to improve the processing accuracy of the entire processing system, reduce erroneous segmentation of characters, and provide a character recognition apparatus with good recognition accuracy. I do.

【００３３】[0033]

【課題を解決するための手段】第１の本発明は、入力さ
れた文字列構成成分を文字列方向の軸に射影してできる
ヒストグラムの値の小さい部分で分割することで文字列
構成成分を複数の矩形として分割する矩形化処理手段
と、それら矩形内部の文字構成成分から特徴抽出を行
い、どの文字にどれくらい近いかを表す類似度を各文字
について算出することによって複数の候補文字を選出す
る候補文字選出手段と、前記候補文字選出手段の結果を
単語辞書、文法辞書と照合することによって文章として
の確からしさの評価値を算出する言語処理手段と、前記
言語処理手段の結果から文字として確定した矩形を検出
する確定文字検出手段と、前記確定文字検出手段の結果
から文字として確定した矩形の特徴を抽出する文字矩形
情報抽出手段と、前記文字矩形情報抽出手段によって得
られた矩形情報に基づいて、まだ文字として確定してい
ない矩形の統合と分割を行ない、その結果を前記矩形化
処理手段にフィードバックする矩形分割統合手段と、前
記確定文字検出手段の結果を認識結果として出力する認
識結果出力手段を備えたことを特徴とする文字認識装置
であり、前記言語処理手段は、候補文字を組み合わせて
単語辞書との整合性をみることによって複数の候補単語
を決定する候補単語決定手段と、前記候補単語決定手段
の結果から文節を推定し、前記推定した文節と文法辞書
との整合性をみることによって複数の候補文節を決定す
る候補文節決定手段と、前記候補文節決定手段によって
候補に挙げられた候補文節について前記候補文節を構成
する文字の類似度平均、前記候補文節を構成する単語の
文字数と所定値との差の自乗、前記候補文節の文字数、
前記候補文節を構成する単語の頻度とで表される評価関
数に基づいて評価値を算出した後、連続する前記候補文
節を組み合わせることによってできる一定長さの文節群
について、一文節当たりの前記評価値が最も良い文節群
を選出し、前記文節群の先頭文節を確定するとともに確
定された文節が持つ評価値を文節としての言語的正しさ
を表す評価値とする文節評価確定手段とを備え、前記矩
形化処理手段は、前記矩形分割統合手段からのフィード
バックを受け、まだ文字として確定していない部分にお
いて前記矩形分割統合手段によって新たに生成された矩
形を候補文字選出手段に出力し、前記言語処理手段、前
記確定文字検出手段、前記文字矩形情報抽出手段の処理
を全ての矩形が文字として確定するかあるいは一定回数
まで繰り返すことを特徴とする文字認識装置である。According to a first aspect of the present invention, a character string component is divided by a portion having a small histogram value formed by projecting an input character string component onto an axis in the character string direction. A plurality of candidate characters are selected by performing a feature extraction from character components inside the rectangles and calculating a similarity representing which character is close to which character for each character. Candidate character selecting means, language processing means for calculating the evaluation value of certainty as a sentence by comparing the result of the candidate character selecting means with a word dictionary and a grammar dictionary, and determining the character as a character from the result of the language processing means A determined character detecting means for detecting the determined rectangle; a character rectangle information extracting means for extracting a feature of the determined rectangle as a character from the result of the determined character detecting means; Based on the rectangle information obtained by the character rectangle information extraction unit, a rectangle not yet determined as a character is integrated and divided, and the result is fed back to the rectangularization processing unit. A character recognition device comprising a recognition result output unit that outputs a result of the detection unit as a recognition result, wherein the language processing unit includes a plurality of candidate characters that are combined with each other to check consistency with a word dictionary. Candidate word determining means for determining candidate words, and a candidate phrase determining means for estimating a phrase from the result of the candidate word determining means and determining a plurality of candidate phrases by checking the consistency between the estimated phrase and the grammar dictionary Means, for the candidate clauses listed as candidates by the candidate clause determining means, the similarity average of the characters constituting the candidate clauses, Square of the difference between the number of characters and a predetermined value of words formed, the number of characters in the candidate phrase,
After calculating an evaluation value based on an evaluation function represented by the frequency of words constituting the candidate clause, a phrase group having a fixed length formed by combining the continuous candidate clauses, the evaluation per phrase A phrase evaluation unit for selecting a phrase group having the best value, determining the head phrase of the phrase group, and using an evaluation value of the determined phrase as an evaluation value representing linguistic correctness as a phrase, The rectification processing means receives the feedback from the rectangle division integration means, outputs a rectangle newly generated by the rectangle division integration means to a candidate character selection means in a part which has not been determined as a character yet, and The processing of the processing unit, the confirmed character detection unit, and the character rectangle information extraction unit is performed until all rectangles are confirmed as characters or repeated a certain number of times. A character recognition apparatus characterized.

【００３４】第２の本発明は、前記矩形分割統合手段
は、入力文字列からの文字切り出し処理によって生じる
文字の外接長方形候補である矩形の縦横比、前記矩形の
面積と前記矩形面積に占める文字構成成分以外の部分の
面積の割合とで表される評価関数に基づいて分割すべき
矩形（1文字分以上の文字構成成分を含む矩形）を選出
する分割矩形候補選出手段と、前記分割矩形候補選出手
段によって選出された矩形を分割する矩形分割手段と、
前記矩形分割手段によって分割されて生じた矩形も含め
てすべての矩形を組み合わせて最適な矩形の統合の組み
合わせを決定する矩形統合手段とを備えたことを特徴と
する請求項１記載の文字認識装置である。According to a second aspect of the present invention, the rectangle dividing and integrating means includes an aspect ratio of a rectangle which is a circumscribed rectangle candidate of a character generated by a character extraction process from an input character string, an area of the rectangle, and a character occupying the rectangle area. A divided rectangle candidate selecting means for selecting a rectangle to be divided (a rectangle including one or more character components) based on an evaluation function represented by a ratio of an area of a portion other than a component, and the divided rectangle candidate Rectangle dividing means for dividing the rectangle selected by the selection means,
2. The character recognition device according to claim 1, further comprising: a rectangle integration unit that determines an optimal combination of rectangle integration by combining all rectangles including a rectangle generated by the rectangle division unit. It is.

【００３５】第３の本発明は、入力された文字列構成成
分を文字列方向の軸に射影してできるヒストグラムの値
の小さい部分で分割することで文字列構成成分を複数の
矩形として分割する矩形化処理手段と、それら矩形内部
の文字構成成分から特徴抽出を行い、どの文字にどれく
らい近いかを表す類似度を各文字について算出すること
によって複数の候補文字を選出する候補文字選出手段
と、前記候補文字選出手段の結果を単語辞書、文法辞書
と照合することによって文章としての確からしさの評価
値を算出する言語処理手段と、前記言語処理手段の結果
から文字として確定した矩形を検出する確定文字検出手
段と、前記確定文字検出手段の結果から文字として確定
した矩形の特徴を抽出する文字矩形情報抽出手段と、前
記確定文字検出手段の結果と前記文字矩形情報抽出手段
によって得られた矩形情報に基づいて入力文字列の筆記
者または印刷文字のフォントを決定し、その結果を前記
候補文字選出手段に出力する筆記者推定手段と、前記文
字矩形情報抽出手段によって得られた矩形情報に基づい
て、まだ文字として確定していない矩形の統合と分割を
行ない、その結果を前記矩形化処理手段にフィードバッ
クする矩形分割統合手段と、前記確定文字検出手段の結
果を認識結果として出力する認識結果出力手段とを備え
たことを特徴とする文字認識装置であり、前記筆記者推
定手段は、各筆記者が書く文字、あるいは印刷文字のフ
ォントの癖（各文字の特徴量の分布、文字の外接長方形
の縦横比、文字の大きさ）を記憶保持している筆記者辞
書と、前記筆記者辞書と言語的に正しいと確定された文
字との整合性をみることによって前記筆記者辞書のどの
筆記者またはどの印刷文字のフォントにどれだけ近いか
を表す評価値を算出する筆記者評価値算出手段と、前記
筆記者評価値算出手段の結果によって入力文字列の筆者
または印刷文字のフォントを特定する筆記者決定手段を
備え、前記矩形化処理手段は、前記矩形分割統合手段か
らのフィードバックを受け、まだ文字として確定してい
ない部分において前記矩形分割統合手段によって新たに
生成された矩形を候補文字選出手段に出力し、前記言語
処理手段、前記確定文字検出手段、前記文字矩形情報抽
出手段の処理を全ての矩形が文字として確定するかある
いは一定回数まで繰り返すことを特徴とする文字認識装
置である。According to a third aspect of the present invention, a character string component is divided into a plurality of rectangles by dividing an input character string component into small portions of a histogram value formed by projecting the character string component on an axis in the character string direction. Rectangularization processing means, and candidate character selection means for extracting a plurality of candidate characters by performing feature extraction from character constituent components inside the rectangles and calculating a similarity representing how close to each character for each character, A language processing means for calculating an evaluation value of certainty as a sentence by comparing the result of the candidate character selection means with a word dictionary and a grammar dictionary; and a determination for detecting a rectangle determined as a character from the result of the language processing means Character detection means, character rectangle information extraction means for extracting a rectangular feature determined as a character from the result of the fixed character detection means, and the fixed character detection means A writer estimating means for determining a writer of an input character string or a font of a printed character based on the result and the rectangle information obtained by the character rectangle information extracting means, and outputting the result to the candidate character selecting means; Based on the rectangle information obtained by the character rectangle information extraction unit, a rectangle not yet determined as a character is integrated and divided, and the result is fed back to the rectangularization processing unit. A recognition result output means for outputting a result of the detection means as a recognition result, wherein the writer estimation means comprises a character written by each writer or a font habit of a printed character. (The distribution of the characteristic amount of each character, the aspect ratio of the circumscribed rectangle of the character, and the size of the character). A writer evaluation value calculating means for calculating an evaluation value indicating how close to a writer or a font of a printed character of the writer dictionary by checking consistency with the determined character and the writer A writer determining unit that specifies the writer of the input character string or the font of the print character according to the result of the evaluation value calculating unit, wherein the rectification processing unit receives feedback from the rectangular division integration unit and determines the character as a character. The rectangle newly generated by the rectangle dividing and integrating unit is output to the candidate character selecting unit in the portion that has not been processed, and the processing of the language processing unit, the fixed character detecting unit, and the character rectangle information extracting unit is performed in a manner that all the rectangles are characters. The character recognition device is characterized in that the character recognition device is determined or repeated up to a certain number of times.

【００３６】[0036]

【作用】第１の本発明は、前記した構成により、候補文
字選出手段が出力する複数の候補文字から成る文字列の
中で言語的に正しいと思われる部分の文字を「文節内の
一文字当りの類似度が高いほうが文節として正しい可能
性が高い。」、「文節内の単語の文字数は多いほど文節
として正しい可能性が高い。」、「文節内の文字数はあ
る値に近いほど文節として正しい可能性が高い。」、
「文節を構成する単語の入力文字列中での出現頻度は高
いほうが文節として正しい可能性が高い。」という仮定
に基づき、文節単位で言語的な確からしさの評価値を算
出した結果によって、確定文字検出手段が確定文字とし
て検出し、認識結果出力手段が認識結果として出力する
一方、文字矩形情報検出手段が確定文字の矩形情報を検
出し、矩形評価関数の最適化を行う。矩形分割統合手段
は最適化された矩形評価関数に基づいてまだ確定されて
いない部分の基本矩形の分割・統合を行うことによって
新たな基本矩形を求め、再び候補文字選出手段以下の処
理を行う。According to the first aspect of the present invention, a character string which is considered to be linguistically correct in a character string composed of a plurality of candidate characters output by the candidate character selecting means is set to "per character in a phrase". The higher the degree of similarity, the higher the probability of correctness as a phrase. "," The greater the number of characters in a word in a phrase, the higher the possibility of correctness as a phrase. "," The closer the number of characters in a phrase to a certain value, the more correct the phrase. Probability is high.",
Based on the assumption that the higher the frequency of occurrence of words that form a phrase in the input character string is, the more likely it is to be correct as a phrase, the result of calculating the linguistic certainty evaluation value for each phrase is determined. The character detection means detects the determined character, and the recognition result output means outputs the result as a recognition result, while the character rectangle information detection means detects the rectangle information of the determined character and optimizes the rectangle evaluation function. The rectangle division integrating means obtains a new basic rectangle by dividing and integrating the basic rectangles of the undecided parts based on the optimized rectangle evaluation function, and performs the processing after the candidate character selection means again.

【００３７】このようにして最終的にすべての矩形が言
語的に文字として確定されるまで以上の処理を繰り返
す。The above processing is repeated until all the rectangles are finally determined linguistically as characters.

【００３８】以上のように文字評価結果をさらに言語処
理で評価した後に認識結果を決定するために認識精度が
高まる。またその結果を用いて矩形の文字らしさを評価
する評価関数自身を推定するために文書毎に最適な処理
が行える。As described above, since the character evaluation result is further evaluated by language processing and the recognition result is determined, the recognition accuracy is improved. In addition, since the result is used to estimate the evaluation function itself for evaluating the character likeness of a rectangular character, optimal processing can be performed for each document.

【００３９】第２の本発明は、前記した構成により、文
字の外接長方形候補である矩形に1文字以上の文字構成
成分が含まれた場合には、「横書き（縦書き）では矩形
は横長（縦長）になり、矩形の面積が他の矩形よりも大
きくなる可能性が高く、さらに矩形面積に占める文字構
成成分以外の部分の面積の割合が一定になる可能性が高
い。」という仮定に基づき、上記仮定を満たす矩形（1
文字分以上の文字構成成分を含む矩形）を分割すること
によって文字の誤切り出しを減少させることができる。According to the second aspect of the present invention, when a rectangle that is a circumscribing rectangle candidate for a character includes one or more character components, the rectangle is written horizontally (vertically). Vertical), the area of the rectangle is likely to be larger than other rectangles, and the ratio of the area of the portion other than the character component to the area of the rectangle is likely to be constant. " , A rectangle (1
By dividing a rectangle including character components equal to or more than characters, erroneous cutout of characters can be reduced.

【００４０】第３の本発明は前記した構成により、第１
の本発明と同様にして文字評価結果をさらに言語処理で
評価した後に認識結果を決定するために認識精度が高ま
る。さらにその結果を用いて筆記者あるいは印刷文字の
フォントを特定することによって候補文字選出手段にお
ける候補文字決定の際の文字辞書に前記特定された筆記
者、あるいは印刷文字のフォント用の文字辞書を使用す
ることによって候補文字選出手段の精度を向上させ、よ
り最適な処理が行える。According to the third aspect of the present invention, with the above-described structure, the first
Since the character evaluation result is further evaluated by language processing in the same manner as in the present invention, the recognition result is determined, so that the recognition accuracy is increased. Further, by using the result to specify the writer or the font of the printed character, the specified writer or the character dictionary for the font of the printed character is used as the character dictionary when determining the candidate character in the candidate character selecting means. By doing so, the accuracy of the candidate character selecting means is improved, and more optimal processing can be performed.

【００４１】すなわち、言語的に正しいと思われる部分
の文字を確定文字検出手段が確定文字として検出し、認
識結果出力手段が認識結果として出力する一方、文字矩
形情報検出手段が確定文字の矩形情報を検出し、矩形評
価関数の最適化を行なう。矩形分割統合手段は最適化さ
れた矩形評価関数に基づいてまだ確定されていない部分
の基本矩形の分割・統合を行うことによって新たな基本
矩形を求め、再び候補文字選出手段以下の処理を行な
う。また筆記者推定手段が確定文字から得た情報を用い
て候補文字選出手段で用いる辞書を最適化する。That is, the fixed character detecting means detects a character in a part considered to be linguistically correct as a fixed character, and the recognition result output means outputs the result as a recognition result. And optimize the rectangle evaluation function. The rectangle division integrating means obtains a new basic rectangle by dividing and integrating a basic rectangle of a part which has not yet been determined based on the optimized rectangle evaluation function, and performs the processing after the candidate character selection means again. The writer estimation unit optimizes the dictionary used by the candidate character selection unit by using information obtained from the determined character.

【００４２】[0042]

【実施例】以下、本発明の実施例について図面を参照し
て説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００４３】図１は、第３の本発明の一実施例における
文字認識装置の構成図を示すものである。第１、第２の
本発明は、この第３の本発明の構成要素を成すものであ
り、第３の本発明の実施例の中で同時に説明する。FIG. 1 shows a configuration diagram of a character recognition device according to an embodiment of the third invention. The first and second aspects of the present invention constitute constituent elements of the third aspect of the present invention, and will be described simultaneously in Examples of the third aspect of the present invention.

【００４４】図１において、入力文字列２０１は入力さ
れる画像文字、矩形化処理部２０２はその入力文字を矩
形化するための処理部、候補文字選出部１は候補となる
文字を選出する部、言語処理部２はその候補文字を言語
的に処理する部、確定文字検出部３は確定文字を検出す
る部、文字矩形情報抽出部４は文字矩形情報を検出する
部、矩形分割統合部５は矩形を分割統合する部、認識結
果出力部６は認識結果を出力する部、筆記者推定部７は
筆記者を推定する部である。In FIG. 1, an input character string 201 is an input image character, a rectification processing unit 202 is a processing unit for rectifying the input character, and a candidate character selection unit 1 is a unit for selecting candidate characters. The language processing unit 2 linguistically processes the candidate character, the fixed character detection unit 3 detects a fixed character, the character rectangle information extraction unit 4 detects character rectangle information, and the rectangle division integration unit 5 Is a unit that divides and integrates a rectangle, the recognition result output unit 6 is a unit that outputs a recognition result, and the writer estimation unit 7 is a unit that estimates a writer.

【００４５】以上のように構成された本実施例の文字認
識装置について、以下にその動作を説明する。The operation of the character recognition device of the present embodiment configured as described above will be described below.

【００４６】矩形化処理部２０２は従来の文字認識装置
における場合と同じ構成要素からなり、従来の文字認識
装置と同様にして基本矩形記憶部２１７に基本矩形を記
憶保持する。The rectification processing unit 202 has the same components as in the conventional character recognition device, and stores the basic rectangle in the basic rectangle storage unit 217 in the same manner as in the conventional character recognition device.

【００４７】図２は上記候補文字選出部１の具体的な構
成図であり、特徴抽出部２７１は特徴を抽出する部、文
字辞書２７２は文字情報を記憶した辞書、類似度算出部
２７３は文字辞書２７３を参照して類似度を算出する
部、文字選択部２１は類似度を参照して文字を選択する
部、候補文字記憶部２２は候補文字を記憶する部であ
る。FIG. 2 is a specific block diagram of the candidate character selection unit 1. The feature extraction unit 271 is a unit for extracting features, the character dictionary 272 is a dictionary storing character information, and the similarity calculation unit 273 is a character extraction unit. A unit that calculates the similarity with reference to the dictionary 273, the character selection unit 21 selects a character with reference to the similarity, and the candidate character storage unit 22 stores a candidate character.

【００４８】この候補文字選出部１の動作は次の通りで
ある。特徴抽出部２７１は基本矩形記憶部２１７から基
本矩形の座標位置を参照し、前記座標位置内の画素を矩
形化処理部２０２内の文字列データ記憶部２１１から読
みだし、従来の文字認識装置の場合と同様にして特徴量
を得て、類似度算出部２７３に出力する。次に類似度算
出部２７３は文字辞書２７２から各文字の代表特徴ベク
トルを読みだし、従来の文字認識装置の場合と同様にし
て各文字との類似度を算出して文字選択部２１に出力す
る。文字選択部２１は類似度の高い文字から順にｍ個の
文字とその類似度を候補文字記憶部２２に出力し、候補
文字記憶部２２は一矩形当りｍ個の候補文字と類似度を
記憶保持する。 The operation of the candidate character selection section 1 is as follows.
is there. The feature extraction unit 271 is based on the basic rectangle storage unit 217.
Referring to the coordinate position of this rectangle, the pixel within the coordinate position is rectangular.
Read from the character string data storage unit 211 in the shaping processing unit 202.
In the same way as the conventional character recognition device,
And outputs it to the similarity calculation unit 273. Next, similarity calculation
The output unit 273 obtains a representative feature vector of each character from the character dictionary 272.
And read it in the same way as with a conventional character recognition device.
To calculate the similarity with each character and output it to the character selection unit 21
You. The character selection unit 21 selects m characters in descending order of similarity.
The character and its similarity are output to the candidate character storage unit 22, and the candidate
The character storage unit 22 stores m candidate characters per rectangle and the similarity.
Remember.

【００４９】図３は第１の本発明である言語処理部２の
具体的な構成図である。図３において、候補単語決定部
３１は候補単語を決定する部、候補文節決定部３２は、
候補文節を決定する部、文節評価確定部３３は文節を評
価する部であり、さらに候補文字組合せ部３０１は候補
文字を組み合わせる部、単語整合部３０２は単語を整合
する部、単語辞書３０３は単語の辞書、候補単語記憶部
３０４は候補単語を記憶する部、候補文節出力部３０５
は候補文節出力部、文節整合部３０６は文節を整合する
部、文法辞書３０７は文法を記憶した辞書、候補文節記
憶部３０８は候補文節を記憶する部、文節評価値算出部
３０９は文節評価値を算出する部、候補文節組合せ部３
１０は候補文節を組み合わせる部、文節決定部３１１は
文節を決定する部である。FIG. 3 is a specific configuration diagram of the language processing unit 2 according to the first embodiment of the present invention. In FIG. 3, a candidate word determination unit 31 determines a candidate word, and a candidate phrase determination unit 32
A candidate phrase determining unit, a phrase evaluation determining unit 33 is a unit for evaluating phrases, a candidate character combining unit 301 is a unit for combining candidate characters, a word matching unit 302 is a unit for matching words, and a word dictionary 303 is a word dictionary. And a candidate word storage unit 304 is a unit for storing candidate words and a candidate phrase output unit 305
Is a candidate phrase output unit, a phrase matching unit 306 is a unit that matches phrases, a grammar dictionary 307 is a dictionary that stores grammar, a candidate phrase storage unit 308 is a unit that stores candidate phrases, and a phrase evaluation value calculation unit 309 is a phrase evaluation value. Calculating unit, candidate phrase combining unit 3
Reference numeral 10 denotes a unit for combining candidate phrases, and a phrase determining unit 311 denotes a unit for determining a phrase.

【００５０】候補文字組合せ部３０１は候補文字記憶部
２２から連続する矩形の候補文字を一矩形当り一文字ず
つ選び出し、選びだした文字を一個ないし複数個、組合
せた結果を単語整合部３０２に出力する。なお前記組合
せは網羅的に行なわれ、すべての組合せについて単語整
合部３０２に出力されるものとする。単語辞書３０３に
は各単語が文字毎の整合がとれるように木構造で表現さ
れており、各単語にはその品詞と分別記号が付随して記
憶保持されているものとする。単語整合部３０２は入力
された文字の組合せと単語辞書３０３に記憶保持されて
いる単語と一文字毎に整合、不整合の判断を行い、すべ
ての文字が整合した"文字の組合せ"についてその文字列
内の位置と分別記号、品詞を候補単語記憶部３０４に出
力する。候補単語記憶部３０４は単語整合部３０２から
の入力を順に記憶保持する。The candidate character combination unit 301 selects continuous rectangular candidate characters from the candidate character storage unit 22 one by one for each rectangle, and outputs the result of combining one or more selected characters to the word matching unit 302. . The above combinations are comprehensively performed, and all combinations are output to the word matching unit 302. In the word dictionary 303, each word is expressed in a tree structure so that each word can be matched, and it is assumed that each word has its part of speech and discrimination symbol attached and stored. The word matching unit 302 determines whether the combination of the input character and the word stored in the word dictionary 303 match or not for each character, and determines the character string of the "character combination" in which all the characters match. And outputs the position, classification symbol, and part of speech to the candidate word storage unit 304. The candidate word storage unit 304 sequentially stores and holds the inputs from the word matching unit 302.

【００５１】候補文節出力部３０５は候補単語記憶３０
４に記憶保持されている候補単語の情報を読みだし、隣
合う位置にある候補単語から候補文節を決定し文節整合
部３０６に出力する。文法辞書３０７には単語の活用
形、助詞との接続などの情報が記憶保持されているもの
とする。文節整合部３０６は入力された候補文節と文法
辞書３０７に記憶保持されている情報との整合、不整合
を判断し、整合した文節の位置及び候補文字記憶部２２
から前記文節を構成している文字の個数ｋ、前記文節内
の単語を構成している文字の類似度平均ｄと個数ｌ、前
記単語の文字列中における頻度ｑを候補文節記憶部３０
８に出力する。候補文節記憶部３０８は入力された候補
文節の上記の情報を記憶保持するメモリと以下で述べる
ように文節評価値算出部３０９からフィードバックされ
てくる評価値を記憶保持するメモリとからなり、入力情
報をそれらのメモリ内に記憶保持する。The candidate phrase output unit 305 stores the candidate word memory 30
4 is read out, the candidate phrase is determined from the candidate words at adjacent positions, and output to the phrase matching unit 306. It is assumed that the grammar dictionary 307 stores information such as the inflected forms of words and connections with particles. The phrase matching unit 306 determines whether the input candidate phrase matches information stored in the grammar dictionary 307, and determines the position of the matched phrase and the candidate character storage unit 22.
From the number k of characters constituting the phrase, the average similarity d and the number 1 of characters constituting the words in the phrase, and the frequency q of the word in the character string, the candidate phrase storage unit 30
8 is output. The candidate clause storage unit 308 includes a memory for storing and holding the above information of the input candidate clause and a memory for storing and holding the evaluation value fed back from the clause evaluation value calculation unit 309 as described below. Is stored in those memories.

【００５２】文節評価値算出部３０９は候補文節記憶部
３０８に記憶保持された上記に述べた文節の情報を読み
だし、（数４）のＨを文節の言語的正しさを表わす量と
して算出し、対応する文節の文節評価値として候補文節
記憶部３０８に出力する。The phrase evaluation value calculation unit 309 reads the above-described phrase information stored and held in the candidate phrase storage unit 308, and calculates H in (Equation 4) as an amount representing the linguistic correctness of the phrase. Is output to the candidate phrase storage unit 308 as the phrase evaluation value of the corresponding phrase.

【００５３】[0053]

【数４】(Equation 4) Ｈ＝ａ1・ｄ＋ａ2・ｋ＋ａ3・(Ｌ−ｌ)H = a1 · d + a2 · k + a3 · (L−1) ^2Two ＋ａ4・ｑ+ A4 · q 但し、ｄは文節を構成する文字の類似度平均、Where d is the average similarity of the characters constituting the phrase, ｋは文節を構成する単語の文字数、Ｌは一定値、k is the number of characters of a word constituting a phrase, L is a constant value, ｌは文節の文字数、ｑは文節を構成する単語の頻度、l is the number of characters in the phrase, q is the frequency of the words that make up the phrase, ａ1、ａ2、ａ3、ａ4は正規化係数であり、ａ１，ａ２，a1, a2, a3, a4 are normalization coefficients, and a1, a2,
ａ４は正の値、ａ３は負の値をとる。a4 has a positive value and a3 has a negative value.

【００５４】（数４）は「構成文字の類似度が大きい程
文節として正しい。単語の文字数が多い程文節として正
しい。文節の文字数はＬに近い程良い。単語の頻度が高
い程文節として確からしい。」ということを表わしてい
る。(Equation 4) indicates that "the greater the similarity of the constituent characters, the more correct the phrase. The greater the number of words, the more correct the phrase. The closer the number of characters in the phrase to L, the better. The higher the frequency of the word, the more reliable the phrase. It seems. "

【００５５】候補文節記憶部３０８に記憶保持されてい
るすべての候補文節について評価値が算出された後、候
補文節組合せ部３１０は候補文節記憶部３０８に記憶保
持されている候補文節の位置情報から一定の長さになる
ように連続する複数の文節で構成される"文節組合せ"を
抽出し、その結果を文節決定部３１１に出力する。従っ
て同じ位置で始まり、同じ位置で終わる"文節組合せ"が
複数、文節決定部３１１に入力されることになる。な
お"文節組合せ"は初期状態では入力文字列２０１の先頭
から順にすべて抽出するものとし、さらに以下のように
して文節決定部３１１によって文節が確定される毎に抽
出する先頭を確定された文節の次の位置にする。After the evaluation values have been calculated for all the candidate phrases stored in the candidate phrase storage unit 308, the candidate phrase combination unit 310 uses the position information of the candidate phrases stored in the candidate phrase storage unit 308 to calculate the evaluation values. A “phrase combination” composed of a plurality of continuous phrases so as to have a fixed length is extracted, and the result is output to the phrase determination unit 311. Therefore, a plurality of “phrase combinations” that start at the same position and end at the same position are input to the phrase determination unit 311. In the initial state, the phrase combination is assumed to be extracted in order from the beginning of the input character string 201. Further, each time a phrase is determined by the phrase determining unit 311 as described below, the beginning of the extracted phrase is determined. Move to the next position.

【００５６】文節決定部３１１は入力される"文節組合
せ"のうち構成する文節一つ当りの文節評価値が最大に
なる"文節組合せ"を選び、その先頭の文節を確定された
文節として位置情報および候補文字、文節評価値を確定
文字検出部３へ出力する。The phrase determining unit 311 selects a “phrase combination” that maximizes the phrase evaluation value of each of the constituent phrases from the input “phrase combination”, and sets the first phrase as the determined phrase as position information. The candidate character and the phrase evaluation value are output to the confirmed character detection unit 3.

【００５７】以上の処理によって入力文字列２０１は文
節単位に区切られ、各々の文節には文節の言語的正しさ
を表わす量として文節評価値がつけられる。確定文字検
出部３は入力された文節のうち、一定値以上の文節評価
値を持つ文節内の候補文字を認識結果の確定文字として
文字矩形情報抽出部４と認識結果出力部６に出力する。
なお文字矩形情報抽出部４と認識結果出力部６には文節
内の文字の位置情報も出力するものとする。By the above processing, the input character string 201 is divided into clause units, and each clause is given a clause evaluation value as a quantity representing the linguistic correctness of the clause. The confirmed character detection unit 3 determines candidate characters in a phrase having a phrase evaluation value equal to or greater than a certain value among the input phrases as a confirmed character of the recognition result.
The information is output to the character rectangle information extraction unit 4 and the recognition result output unit 6.
It is assumed that the character rectangle information extraction unit 4 and the recognition result output unit 6 also output position information of characters in a phrase.

【００５８】認識結果出力部６は確定文字検出部３から
入力される確定文字を前記確定文字に付随して入力され
る位置での認識結果として出力する。The recognition result output unit 6 outputs the determined character input from the determined character detection unit 3 as a recognition result at the position where it is input along with the determined character.

【００５９】文字矩形情報抽出部４は確定された文字か
ら矩形情報を抽出する。図４は文字矩形情報抽出部４の
具体的な構成図であり、方形度算出部２８１は方形度を
算出する部、空白度算出部２８２は空白度を算出する
部、矩形面積算出部４１は矩形面積を算出する部、方形
度評価関数決定部４２は方形度評価関数を決定する部、
空白度評価関数決定部４３は空白度評価関数を決定する
部、矩形面積評価関数決定部４４は矩形面積評価関数を
決定する部、評価関数記憶部４５は評価関数を記憶する
部である。The character rectangle information extracting section 4 extracts rectangle information from the determined character. FIG. 4 is a specific configuration diagram of the character rectangle information extraction unit 4. The squareness calculation unit 281 calculates the squareness, the blankness calculation unit 282 calculates the blankness, and the rectangular area calculation unit 41 calculates the squareness. A section for calculating a rectangular area, a squareness evaluation function determining section 42 for determining a squareness evaluation function,
The blankness evaluation function determining unit 43 is a unit that determines a blankness evaluation function, the rectangular area evaluation function determining unit 44 is a unit that determines a rectangular area evaluation function, and the evaluation function storage unit 45 is a unit that stores an evaluation function.

【００６０】方形度算出部２８１は入力された確定文字
の位置情報を参照し、基本矩形記憶部２１７から前記確
定文字の矩形の方形度を算出し、方形度評価関数決定部
４２に出力する。同様に空白度算出部２８２は入力され
た確定文字の位置情報を参照し、矩形化処理部２０２内
の文字列データ記憶部２１１から前記座標位置内の画素
のうち文字構成成分に含まれない画素数が全体の画素数
に占める割合（空白度）を算出し、空白度評価関数決定
部４３に出力する。矩形面積算出部４１は入力された確
定文字の位置情報を参照し、矩形化処理部２０２内の文
字列データ記憶部２１１から前記座標位置内の総画素数
を求め矩形面積評価関数決定部４４に出力する。The squareness calculator 281 calculates the squareness of the rectangle of the determined character from the basic rectangle storage unit 217 with reference to the input positional information of the determined character, and outputs it to the squareness evaluation function determining unit 42. Similarly, the blankness calculating unit 282 refers to the position information of the input fixed character, and obtains, from the character string data storage unit 211 in the rectification processing unit 202, a pixel not included in the character component among the pixels in the coordinate position. The ratio of the number of pixels to the total number of pixels (blank degree) is calculated and output to the blank degree evaluation function determining unit 43. The rectangular area calculation unit 41 refers to the input positional information of the determined character, obtains the total number of pixels in the coordinate position from the character string data storage unit 211 in the rectangularization processing unit 202, and sends it to the rectangular area evaluation function determination unit 44. Output.

【００６１】方形度評価関数決定部４２は例えば図５に
示すように入力された方形度の値のヒストグラムをと
り、評価関数の形を決定する。さらにヒストグラム値を
正規化することで評価値を決めることによって方形度評
価関数ρ1を決定する。同様にして空白度評価関数決定
部４３は入力された空白度の値のヒストグラムから空白
度評価関数ρ2を決定する。矩形面積評価関数決定部４
４は入力された矩形面積の値のヒストグラムから矩形面
積評価関数ρ3を決定する。 For example, FIG.
And the histogram of the input squareness values as shown
To determine the form of the evaluation function. And the histogram values
Squareness evaluation by determining the evaluation value by normalization
Determine the valence function ρ1. Determine blankness evaluation function in the same way
The unit 43 calculates blanks from the histogram of the input blankness values.
The degree evaluation function ρ2 is determined. Rectangular area evaluation function determination unit 4
4 is a rectangular surface from the histogram of the input rectangular area values.
The product evaluation function ρ3 is determined.

【００６２】方形度評価関数決定部４２、空白度評価関
数決定部４３と矩形面積評価関数４４によって決定され
た評価関数は評価関数記憶部４５に記憶保持される。The evaluation function determined by the squareness evaluation function determination unit 42, the blankness evaluation function determination unit 43, and the rectangular area evaluation function 44 is stored in the evaluation function storage unit 45.

【００６３】以上のようにして決定された評価関数は入
力文字列２０１から言語的に確定された文字から推定し
たものであり、評価関数としての精度は非常に高いと考
えられる。 The evaluation function determined as described above is
Estimated from linguistically determined characters from force string 201
It is considered that the accuracy as an evaluation function is very high.
available.

【００６４】図６は第２の本発明の矩形分割統合部５の
具体的な構成図である。FIG. 6 is a specific configuration diagram of the rectangular division integration section 5 of the second invention.

【００６５】分割矩形候補選出部６１は基本矩形記憶部
２１７に記憶保持されている矩形の中でまだ文字として
確定していない矩形を読みだし、前記矩形の方形度ｒ、
空白度ｓ、面積Ｍを求める。次に評価関数記憶部４３に
記憶保持されている評価関数から（数５）を満たす矩形
を複数の文字を含むと考えられる矩形として選び出す。The divided rectangle candidate selecting section 61 reads a rectangle that has not yet been determined as a character among the rectangles stored and held in the basic rectangle storage section 217 and reads the squareness r,
The blankness s and the area M are obtained. Next, a rectangle satisfying (Equation 5) is selected from the evaluation functions stored in the evaluation function storage unit 43 as a rectangle considered to include a plurality of characters.

【００６６】[0066]

【数５】 ρ1(ｒ)＜λr かつ ρ2(ｓ)＞λs かつ ρ3(Ｍ)＞λM 但し、λr、λs、λMは適当な値。Ρ1 (r) <λr and ρ2 (s)> λs and ρ3 (M)> λM where λr, λs and λM are appropriate values.

【００６７】（数５）は「ある程度以上横長で面積も大
きく、かつ空白度が大きい矩形は複数文字を含む可能性
が高い。」ということを表わしている。(Equation 5) indicates that "a rectangle that is longer than a certain length, has a large area, and has a large blank degree is likely to include a plurality of characters."

【００６８】分割矩形候補選出部６１は選出した矩形の
位置情報を矩形分割部６２に出力する。The divided rectangle candidate selecting section 61 outputs positional information of the selected rectangle to the rectangular dividing section 62.

【００６９】図７は矩形分割部６２の具体的な構成図で
ある。FIG. 7 is a specific configuration diagram of the rectangular division section 62.

【００７０】周辺分布算出部２１２は分割矩形候補選出
部６１から入力された位置に対応する文字列データ記憶
部２１１に保持された値を縦方向に累積した値を算出
し、その結果をガウスシアンフィルタ処理部２１３に出
力する。ガウスシアンフィルタ処理部２１３は周辺分布
算出部２１２によって求められた文字データの縦方向の
累積ヒストグラムに対してガウス型関数を畳み込むこと
によって平滑化を行うことで累積ヒストグラムの高周波
成分を除き、ノイズの影響を抑える。平滑化を受けた累
積ヒストグラムは分割点検出部７１に入力される。分割
点検出部７１は連続して累積ヒストグラムの値が一定値
（いき値）以下の部分を検出し、さらにその部分におい
て累積ヒストグラムの値が最小になる座標位置を現在注
目している矩形の分割点として分割矩形列方向周辺分布
算出部７２と分割矩形記憶部７４に出力する。The marginal distribution calculating section 212 calculates a value obtained by accumulating the values held in the character string data storage section 211 corresponding to the position input from the divided rectangle candidate selecting section 61 in the vertical direction, and calculates the result by Gaussian cyan. Output to the filter processing unit 213. The Gaussian filter processing unit 213 performs smoothing by convolving a Gaussian function with the vertical cumulative histogram of the character data obtained by the marginal distribution calculating unit 212, thereby removing high-frequency components of the cumulative histogram and removing noise. Reduce the effect. The smoothed cumulative histogram is input to the division point detection unit 71. The division point detection unit 71 continuously detects a portion where the value of the cumulative histogram is equal to or less than a certain value (threshold), and further divides the coordinate position where the value of the cumulative histogram becomes minimum in that portion into a rectangle which is currently focused on. The data is output to the divided rectangle column direction peripheral distribution calculation unit 72 and the divided rectangle storage unit 74 as points.

【００７１】このとき、いき値は分割されてできるすべ
ての矩形が（数５）を満たさなくなるように設定される
ものとする。At this time, it is assumed that the threshold is set so that all the rectangles formed by the division do not satisfy (Equation 5).

【００７２】分割矩形列方向周辺分布算出部７２は分割
点検出部７１からの入力を受けて基本矩形の左端座標と
入力された分割点及び、入力された分割点と基本矩形の
右端座標とで挟まれる部分の文字データを各々別々に横
方向に斜影し、その結果を分割矩形上下端検出部７３に
出力する。なお分割点が複数ある場合にはさらに分割点
間の文字データに対しても同様の処理を行なうものとす
る。Receiving the input from the division point detecting section 71, the division rectangle column direction peripheral distribution calculation section 72 calculates the left end coordinates of the basic rectangle and the input division points, and the input division points and the right end coordinates of the basic rectangle. The character data of the portion to be sandwiched is shaded separately in the horizontal direction, and the result is output to the divided rectangle upper / lower end detection unit 73. When there are a plurality of division points, the same processing is performed on the character data between the division points.

【００７３】分割矩形上下端検出部７３は図１３に示し
たのと同様に分割矩形列方向周辺分布算出部７２からの
斜影結果の文字構成成分が成す部分の最大値、最小値を
求め、その結果を対応する分割矩形の上下端として分割
矩形記憶部７４に出力する。As shown in FIG. 13, the divided rectangle upper / lower edge detecting section 73 obtains the maximum value and the minimum value of the portion formed by the character component of the oblique result from the divided rectangular column direction peripheral distribution calculating section 72, and The result is output to the divided rectangle storage unit 74 as the upper and lower ends of the corresponding divided rectangle.

【００７４】以上の処理によって分割矩形記憶部７４に
は分割されてできた矩形がその左右端と上下端で表わさ
れ、検出順に記憶されることになる。The divided rectangle storing section 74 displays the divided rectangles by their left and right ends and upper and lower ends by the above processing, and stores them in the order of detection.

【００７５】図８は矩形統合部６３の具体的な構成図で
ある。矩形統合部６３の動作を以下に図にしたがって説
明する。FIG. 8 is a specific configuration diagram of the rectangular integration unit 63. The operation of the rectangular integration unit 63 will be described below with reference to the drawings.

【００７６】図９は矩形統合部６３の統合重み付きラテ
ィス作成部８１の具体的な構成図であり、９１は矩形組
合せ部、９２は統合重み付きラティス記憶部である。矩
形組合せ部９１は基本矩形記憶部２１７及び、分割矩形
記憶部７４に保持されている文字として確定していない
部分の矩形の座標値を読みだし、隣合う矩形を組合せた
矩形の外郭幅が文字列の高さの一定倍以下になるような
すべての組合せを検出し、その結果を統合重み付きラテ
ィス記憶部９２に出力する。FIG. 9 is a specific configuration diagram of the integrated weighted lattice creating section 81 of the rectangular integrating section 63, where 91 is a rectangular combination section, and 92 is an integrated weighted lattice storage section. The rectangle combination unit 91 reads coordinate values of rectangles that are not determined as characters held in the basic rectangle storage unit 217 and the divided rectangle storage unit 74, and determines the outline width of a rectangle obtained by combining adjacent rectangles with a character. All combinations that are equal to or less than a fixed multiple of the column height are detected, and the results are output to the integrated weighted lattice storage unit 92.

【００７７】統合重み付きラティス記憶部９２は、図１
６に示したのと同様に矩形組合せ部８１によって検出さ
れたすべての矩形の組合せについて、組合せ矩形の上
下、左右端座標を記憶保持し、さらに各組合せ矩形に対
応する重み値を記憶保持するためのメモリを確保する。The integrated weighted lattice storage unit 92 stores the data in FIG.
As shown in FIG. 6, in order to store and hold the coordinates of the upper, lower, left and right ends of the combination rectangle and to store and hold the weight values corresponding to each combination rectangle, for all the combinations of rectangles detected by the rectangle combination unit 81 as shown in FIG. Allocate memory for

【００７８】統合重み値算出部８２は評価関数記憶部４
５に記憶保持されている評価関数と統合重み付きラティ
ス記憶部９２に記憶保持されている組合せ矩形の情報か
ら前記組合せ矩形の矩形評価値を算出し、前記組合せ矩
形に対応する重み値として統合重み付きラティス記憶部
９２に出力する。The integrated weight value calculation unit 82 is provided in the evaluation function storage unit 4
5 is calculated from the evaluation function stored in the storage unit 5 and the information on the combination rectangle stored in the integrated weighted lattice storage unit 92, and the integrated weight is calculated as a weight value corresponding to the combination rectangle. The result is output to the lattice storage unit 92.

【００７９】切り出し位置探索部２４３は重み付きラテ
ィス記憶部の始端から終端に至る経路のうちで経路上の
一矩形当りの重み値が最大になる経路を選択し、そのと
きの組合せ矩形を矩形統合結果として基本矩形記憶部２
１７に出力する。The cut-out position search unit 243 selects a route having the maximum weight per rectangle on the route from among the routes from the start end to the end of the weighted lattice storage unit, and integrates the combined rectangle at that time into a rectangle. As a result, the basic rectangle storage unit 2
17 is output.

【００８０】第１の本発明の文字認識装置では、すべて
の矩形が文字として確定され、認識結果出力部６によっ
て出力されるまで、再びこれまでに述べた候補文字選出
部１以降の処理を繰り返す。なお、繰り返しの処理回数
が一定以上になった場合はその時点での文節決定部３１
１の出力を認識結果として認識結果出力部６が出力す
る、あるいはまだ文字として確定されていない部分の認
識を棄却するものとしても良い。In the character recognition device according to the first aspect of the present invention, the above-described processes after the candidate character selection unit 1 are repeated again until all rectangles are determined as characters and output by the recognition result output unit 6. . If the number of times of the repetition processing becomes equal to or more than a certain value, the phrase determining unit 31 at that time is used.
1 may be output from the recognition result output unit 6 as a recognition result, or the recognition of a part that has not been determined as a character may be rejected.

【００８１】このように第１の本発明の実施例によれば
最初に矩形化処理部２０２によって適当な基本矩形に区
切られた入力文字列に対して候補文字選出部１が１つの
基本矩形に付き複数の候補文字を挙げ、それらの候補文
字がつくる文字列を言語処理部２が判断する。次に言語
的に正しいと思われる部分の文字を確定文字検出部３が
確定文字として検出し、認識結果出力部６が認識結果と
して出力する一方、文字矩形情報抽出部４が確定文字の
矩形情報を検出し、矩形評価関数の最適化を行なう。矩
形分割統合部５は最適化された矩形評価関数に基づいて
まだ確定されていない部分の基本矩形の分割・統合を行
うことによって新たな基本矩形を求め、再び候補文字選
出部１以下の処理を行なう。このように言語的に確定さ
れた文字を認識結果とし、さらにそれらの文字を処理中
の評価関数の決定に用いることで文書毎に精度の高い評
価関数を自動的に決定することができる。As described above, according to the first embodiment of the present invention, the candidate character selection unit 1 first converts the input character string divided into appropriate basic rectangles by the rectangularization processing unit 202 into one basic rectangle. A plurality of candidate characters are added, and the language processing unit 2 determines a character string formed by the candidate characters. Then detected as linguistically correct the confirmed character detecting unit 3 is determined character a character portion seems, while the recognition result output unit 6 is output as the recognition result, the rectangle information of the character rectangle information extracting unit 4 is fixed character And optimize the rectangle evaluation function. Based on the optimized rectangle evaluation function, the rectangle division integration unit 5 obtains a new basic rectangle by dividing and integrating a basic rectangle of a part that has not been determined yet, and again performs the processing of the candidate character selection unit 1 and subsequent steps. Do. Characters determined linguistically in this way are used as recognition results, and those characters are used to determine the evaluation function being processed, so that a highly accurate evaluation function can be automatically determined for each document.

【００８２】第３の本発明の文字認識装置では、以下に
述べるようにして第１の本発明の言語処理手段によって
言語的に確定された文字部分から、筆記者推定部が筆記
者あるいは印刷文字のフォントを決定し、その結果から
候補文字選出部１内の文字辞書２７２の内容を変更して
からこれまで述べた候補文字選出部１以降の処理を繰り
返す。したがって候補文字選出部１での繰り返し処理の
際に文字辞書２７２が文書に対応して最適化されるため
により精度の高い処理が行える。In the character recognition device according to the third aspect of the present invention, the writer estimating unit derives the writer or print character from the character part linguistically determined by the language processing means of the first aspect of the present invention as described below. Is determined, the content of the character dictionary 272 in the candidate character selection unit 1 is changed from the result, and the processing from the candidate character selection unit 1 described above is repeated. Therefore, at the time of the repetition processing in the candidate character selection unit 1, the character dictionary 272 is optimized according to the document, so that more accurate processing can be performed.

【００８３】筆記者推定部７は図１に示すように矩形化
処理部２０２、確定文字検出部３、文字矩形情報抽出部
４からの入力と候補文字選出部１への出力を持つ。As shown in FIG. 1, the writer estimating unit 7 has inputs from the rectangularization processing unit 202, the fixed character detecting unit 3, the character rectangle information extracting unit 4, and an output to the candidate character selecting unit 1.

【００８４】図１０は本発明の第３の本発明である筆記
者推定部７の具体的な構成図である。FIG. 10 is a specific configuration diagram of the writer estimating unit 7 according to the third embodiment of the present invention.

【００８５】筆記者辞書１０１は図１０に示すように、
あらかじめ文字認識装置の使用が予想される筆記者別、
あるいは文字の特徴量の分布が似ている筆記者群別の文
字辞書１１１ ₁ 〜１１１ _t （ｔは筆記者または筆記者群の
数）と方形度辞書１１２ ₁ 〜１１２ _t から構成されてい
る。但し、方形度辞書１１２ ₁ 〜１１２ _t は各文字が持つ
方形度の値が記憶保持されているものとする。 The scribe dictionary 101 is, as shown in FIG.
By writer who is expected to use a character recognition device in advance,
Or sentences by writer group with similar distribution of character features
Character dictionaries 111 _{1 to} 111 _t (t is a writer or a group of scribes
Number) and consists squareness dictionary 112 ₁ to 112 _t
You. However, each character has the squareness dictionary 112 _{1 to} 112 _t
It is assumed that the value of the squareness is stored and held.

【００８６】筆記者評価値算出部１０２は図１０に示す
ように特徴抽出部２７１、類似度平均算出部１２１から
なり、以下のようにして評価値を算出する。The writer evaluation value calculation unit 102 includes a feature extraction unit 271 and a similarity average calculation unit 121 as shown in FIG. 10, and calculates an evaluation value as follows.

【００８７】まず、特徴抽出部２７１は確定文字検出部
３から入力される確定文字の位置情報に基づいて画素値
を矩形化処理部２０２内の文字列データ記憶部２１１か
ら読みだし、候補文字選出部１と同様にして確定文字の
特徴量ベクトルを得、類似度平均算出部１２１に出力す
る。First, the feature extracting unit 271 reads a pixel value from the character string data storage unit 211 in the rectification processing unit 202 based on the positional information of the fixed character input from the fixed character detecting unit 3 and selects candidate characters. In the same manner as in the unit 1, the feature amount vector of the determined character is obtained and output to the similarity average calculating unit 121.

【００８８】類似度平均算出部１２１は特徴抽出部２７
１から確定文字の特徴量ベクトルを受け、文字矩形情報
抽出部４からは確定文字の方形度を受ける。類似度平均
算出部１２１は入力されるすべての確定文字の特徴量ベ
クトルについて候補文字選出部１内の類似度算出部２７
３と同様にして文字辞書１１１₁〜１１１_t内の対応する
確定文字の代表ベクトルとの類似度を求め、前記文字辞
書毎の類似度の和μを算出する一方、入力されるすべて
の確定文字の方形度と方形度辞書１１２₁〜１１２_t内の
対応する確定文字の方形度との自乗誤差を求め、前記文
字辞書毎の自乗誤差の和νを算出し、（数６）で表わさ
れるＥiを確定文字が文字辞書１１１_iにどのぐらい近い
かを表わす筆記者評価値として筆記者決定部１０３に出
力する。The similarity average calculating section 121 includes the feature extracting section 27
1 and the character vector of the determined character from the character rectangle information extraction unit 4. The similarity average calculating unit 121 performs the similarity calculating unit 27 in the candidate character selecting unit 1 with respect to the feature amount vectors of all the input fixed characters.
3, the similarity between the corresponding determined character in the character dictionaries 111 _{1 to} 111 _t and the representative vector is calculated, and the sum μ of the similarity for each character dictionary is calculated. And the squareness of the squareness of the corresponding fixed character in the squareness dictionaries 112 _{1 to} 112 _t are calculated, the sum ν of the squared errors for each of the character dictionaries is calculated, and Ei represented by (Equation 6) is obtained. Is output to the writer determining unit 103 as a writer evaluation value indicating how close the determined character is to the character dictionary 111 _i .

【００８９】[0089]

【数６】Ｅi＝（μ＋b1・ν）／Ｎ但し、b1は正規化定数、Ｎは入力された確定文字の数。Ei = (μ + b1 · ν) / N where b1 is a normalization constant, and N is the number of input confirmed characters.

【００９０】筆記者決定部１０３は筆記者評価値算出部
１０２からの入力の中で最大の筆記者評価値Ｅmaxを持
つ筆記者辞書１０１内の文字辞書１１１maxを選出し、
候補文字選出部１内の文字辞書２７２の内容を文字辞書
１１１maxの内容に変換する。The writer determining section 103 selects a character dictionary 111max in the writer dictionary 101 having the largest writer evaluation value Emax from the input from the writer evaluation value calculating section 102.
The contents of the character dictionary 272 in the candidate character selection unit 1 are converted into the contents of the character dictionary 111max.

【００９１】なおこの変換は前記最大の筆記者評価値Ｅ
maxがある値以上になるまで行なわないとしても良い。
また筆記者辞書１０１として文字列から得られる他の情
報（文字間ピッチ、文字の外接長方形の底辺の高さ位
置、文字の外接長方形の重心の高さ位置等）の辞書を持
っていても同様の効果を得ることができるのは明かであ
る。Note that this conversion is based on the maximum writer evaluation value E.
It may not be performed until max exceeds a certain value.
Also, even if the writer dictionary 101 has a dictionary of other information obtained from the character string (inter-character pitch, height position of the base of the circumscribed rectangle of the character, height position of the center of gravity of the circumscribed rectangle of the character, etc.) It is clear that the effect can be obtained.

【００９２】本発明の各種手段は、コンピュータを用い
てソフトウェア的に実現するのが通常であるが、各機能
を有するハード回路を用いて実現してもかまわない。The various means of the present invention are generally realized by software using a computer, but may be realized by using a hardware circuit having each function.

【００９３】[0093]

【発明の効果】以上説明したように、本発明によれば、
文書毎に処理のための文字らしさを表わす評価関数を自
動的に最適化でき、処理精度、認識率の向上が図れ、そ
の実用的効果は大きい。As described above, according to the present invention,
The evaluation function representing the character-likeness for processing can be automatically optimized for each document, the processing accuracy and the recognition rate can be improved, and the practical effect is large.

[Brief description of the drawings]

【図１】本発明における一実施例の文字認識装置の構成
図である。FIG. 1 is a configuration diagram of a character recognition device according to an embodiment of the present invention.

【図２】本発明における候補文字選出部１の構成図であ
る。FIG. 2 is a configuration diagram of a candidate character selection unit 1 according to the present invention.

【図３】本発明における言語処理部２の構成図である。FIG. 3 is a configuration diagram of a language processing unit 2 according to the present invention.

【図４】本発明における文字矩形情報抽出部４の構成図
である。FIG. 4 is a configuration diagram of a character rectangle information extraction unit 4 according to the present invention.

【図５】本発明における方形度評価関数の決定法の説明
図である。FIG. 5 is an explanatory diagram of a method of determining a squareness evaluation function according to the present invention.

【図６】本発明における矩形分割統合部５の構成図であ
る。FIG. 6 is a configuration diagram of a rectangular division integration unit 5 according to the present invention.

【図７】本発明における矩形分割部６２の構成図であ
る。FIG. 7 is a configuration diagram of a rectangular division unit 62 according to the present invention.

【図８】本発明における矩形統合部６３の構成図であ
る。FIG. 8 is a configuration diagram of a rectangular integration unit 63 in the present invention.

【図９】本発明における統合重み付きラティス作成部８
１の構成図である。FIG. 9 shows an integrated weighted lattice creation unit 8 according to the present invention.
1 is a configuration diagram.

【図１０】本発明における筆記者推定部７の構成図であ
る。FIG. 10 is a configuration diagram of a writer estimation unit 7 in the present invention.

【図１１】従来例における文字認識装置の構成図であ
る。FIG. 11 is a configuration diagram of a character recognition device in a conventional example.

【図１２】従来例における文字認識装置の矩形化処理部
２０１の構成図である。FIG. 12 is a configuration diagram of a rectangularization processing unit 201 of a character recognition device in a conventional example.

【図１３】従来例における左右端検出部２１４の動作の
説明図である。FIG. 13 is an explanatory diagram of an operation of a left and right end detection unit 214 in a conventional example.

【図１４】従来例における上下端検出部２１６の動作の
説明図である。FIG. 14 is an explanatory diagram of an operation of an upper and lower end detection unit 216 in a conventional example.

【図１５】従来例における矩形統合認識部２０３の構成
図である。FIG. 15 is a configuration diagram of a rectangle integration recognition unit 203 in a conventional example.

【図１６】従来例における重み付きラティス作成部２４
１の構成図である。FIG. 16 shows a weighted lattice creating unit 24 in a conventional example.
1 is a configuration diagram.

【図１７】従来例における重み付きラティス記憶部２５
２の説明図である。FIG. 17 shows a weighted lattice storage unit 25 in a conventional example.
FIG.

【図１８】従来例における文字評価値算出部２０４の構
成図である。FIG. 18 is a configuration diagram of a character evaluation value calculation unit 204 in a conventional example.

【図１９】従来例における矩形評価値算出部２０５の構
成図である。FIG. 19 is a configuration diagram of a rectangular evaluation value calculation unit 205 in a conventional example.

[Explanation of symbols]

１候補文字選出部（手段）２言語処理部（手段）３確定文字検出部（手段）４文字矩形情報抽出部（手段）５矩形分割統合部（手段）６認識結果出力部（手段）７筆記者推定部（手段）２１文字選択部（手段）２２候補文字記憶部（手段）３１候補単語決定部（手段）３２候補文節決定部（手段）３３文節評価確定部（手段）３０１候補文字組合せ部（手段）３０２単語整合部（手段）３０３単語辞書３０４候補単語記憶部（手段）３０５候補文節出力部（手段）３０６文節整合部（手段）３０７文法辞書３０８候補文節記憶部（手段）３０９候補文節組合せ部（手段）３１０文節評価値算出部（手段）３１１文節決定部（手段）４１矩形面積算出部（手段）４２方形度関数決定部（手段）４３空白度評価関数決定部（手段）４４矩形面積評価関数決定部（手段）４５評価関数記憶部（手段）６１分割矩形候補選出部（手段）６２矩形分割部（手段）６３矩形統合部（手段）７１分割点検出部（手段）７２分割矩形列方向周辺分布算出部（手段）７３分割矩形上下端検出部（手段）７４矩形記憶部（手段）８１統合重み付きラティス作成部（手段）８２統合重み算出部（手段）９１矩形組合せ部（手段）９２統合重み付きラティス記憶部（手段）１０１筆記者辞書１０２筆記者評価値算出部（手段）１０３筆記者決定部（手段）１１１筆記者辞書１０１内の文字辞書１１２筆記者辞書１０１内の方形度辞書１２１類似度平均算出部（手段）２０１入力文字列２０２矩形化処理部（手段）２０３矩形統合認識部（手段）２０４文字評価値算出部（手段）２０５矩形評価値算出部（手段）２０６認識結果２１１文字列データ記憶部（手段）２１２周辺分布算出部（手段）２１３ガウシアンフィルタ処理部（手段）２１４左右端検出部（手段）２１５列方向周辺分布算出部（手段）２１６上下端検出部（手段）２１７基本矩形記憶部（手段）２４１重み付きラティス作成部（手段）２４２重み値算出部（手段）２４３切り出し位置探索部（手段）２５１基本矩形組合せ部（手段）２５２重み付きラティス記憶部（手段）２７１特徴量抽出部（手段）２７２文字辞書２７３類似度算出部（手段）２７４出力選出部（手段）２８１方形度算出部（手段）２８２空白度算出部（手段）２８３適合度算出部（手段） 1 candidate character selection unit (means) 2 language processing unit (means) 3 fixed character detection unit (means) 4 character rectangle information extraction unit (means) 5 rectangle division integration unit (means) 6 recognition result output unit (means) 7 writing Estimator (means) 21 Character selection unit (means) 22 Candidate character storage unit (means) 31 Candidate word decision unit (means) 32 Candidate phrase decision unit (means) 33 Clause evaluation decision unit (means) 301 Candidate character combination unit (Means) 302 Word matching unit (Means) 303 Word dictionary 304 Candidate word storage unit (Means) 305 Candidate phrase output unit (Means) 306 Clause matching unit (Means) 307 Grammar dictionary 308 Candidate phrase storage unit (Means) 309 Candidate clause Combination unit (means) 310 Clause evaluation value calculation unit (means) 311 Clause determination unit (means) 41 Rectangular area calculation unit (means) 42 Squareness function decision unit (means) 43 Blankness evaluation Function determination unit (means) 44 Rectangular area evaluation function determination unit (means) 45 Evaluation function storage unit (means) 61 Divided rectangle candidate selection unit (means) 62 Rectangular division unit (means) 63 Rectangular integration unit (means) 71 Division inspection Outgoing unit (means) 72 Divided rectangular column direction peripheral distribution calculating unit (means) 73 Divided rectangular upper / lower end detecting unit (means) 74 Rectangular storage unit (means) 81 Lattice creation unit with integrated weights (means) 82 Integrated weight calculating unit ( Means 91 Rectangle combination unit (Means) 92 Lattice storage unit with integrated weight (Means) 101 Writer dictionary 102 Writer evaluation value calculation unit (Means) 103 Writer determination unit (Means) 111 Character dictionary in Writer dictionary 101 112 Squareness dictionary in the scribe dictionary 101 121 Similarity average calculation unit (means) 201 Input character string 202 Rectangularization processing unit (Means) 203 Rectangular integration recognition unit (Hand) Stage) 204 character evaluation value calculation unit (means) 205 rectangular evaluation value calculation unit (means) 206 recognition result 211 character string data storage unit (means) 212 marginal distribution calculation unit (means) 213 Gaussian filter processing unit (means) 214 left and right Edge detection unit (means) 215 Column-direction marginal distribution calculation unit (means) 216 Upper and lower end detection units (means) 217 Basic rectangle storage unit (means) 241 Weighted lattice creation unit (means) 242 Weight value calculation unit (means) 243 Extraction position search unit (means) 251 Basic rectangle combination unit (means) 252 Weighted lattice storage unit (means) 271 Feature extraction unit (means) 272 Character dictionary 273 Similarity calculation unit (means) 274 Output selection unit (means) 281 Squareness calculator (means) 282 Blankness calculator (means) 283 Suitability calculator (means)

───────────────────────────────────────────────────── フロントページの続き (72)発明者阪上茂生大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者香田敏行大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者丹羽寿男大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者小島良宏大阪府門真市大字門真1006番地松下電器産業株式会社内 (56)参考文献特開昭63−271588（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/82 ──────────────────────────────────────────────────の Continued on the front page (72) Inventor Shigeo Sakagami 1006 Kadoma Kadoma, Osaka Prefecture Inside Matsushita Electric Industrial Co., Ltd. (72) Inventor Toshio Niwa 1006 Kazuma Kadoma, Osaka Prefecture Inside Matsushita Electric Industrial Co., Ltd. JP-A-63-271588 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G06K 9/00-9/82

Claims

(57) [Claims]

A rectifying unit configured to divide the character string component into a plurality of rectangles by dividing the input character string component into small portions of a histogram value formed by projecting the input character string component on an axis in the character string direction; A candidate character selecting means for extracting a plurality of candidate characters by performing feature extraction from character constituent components inside the rectangles and calculating a similarity indicating how close to which character each character is, and the candidate character selecting means Language processing means for calculating an evaluation value of certainty as a sentence by comparing the result of the word dictionary with a grammar dictionary, and a fixed character detection means for detecting a rectangle determined as a character from the result of the language processing means, Character rectangle information extraction means for extracting a rectangular feature determined as a character from the result of the determined character detection means, and character rectangle information extraction means. Based on the obtained rectangle information, a rectangle not yet determined as a character is integrated and divided, and the result is fed back to the rectangularization processing means. A character recognition device comprising: a recognition result output unit that outputs a plurality of candidate words by combining candidate characters and checking consistency with a word dictionary. Word determination means, candidate phrase determination means for estimating a phrase from the result of the candidate word determination means, and determining a plurality of candidate phrases by checking the consistency between the estimated phrase and the grammar dictionary; and Means similarity average of the characters constituting the candidate clause for the candidate clauses listed as candidates by the means, the number of characters of the words constituting the candidate clause and a predetermined value After calculating an evaluation value based on the square of the difference, the number of characters of the candidate clause, and the frequency of the words constituting the candidate clause, a constant length that can be obtained by combining the consecutive candidate clauses For a phrase group, a phrase group with the best evaluation value per phrase is selected, the head phrase of the phrase group is determined, and the evaluation value of the determined phrase represents linguistic correctness as a phrase. The rectangularization processing means receives feedback from the rectangle division integration means, and generates a rectangle newly generated by the rectangle division integration means in a portion which has not been determined as a character yet. Output to the candidate character selection means, and determine whether the processing of the language processing means, the confirmed character detection means, and the character rectangle information extraction means is confirmed as all rectangles as characters. Alternatively, a character recognition device characterized by repeating a certain number of times.

2. The method according to claim 1, wherein the rectangular division integrating means includes an aspect ratio of a rectangle which is a circumscribed rectangle candidate of a character generated by a character cutout process from an input character string, an area of the rectangle, and a portion other than a character component occupying the rectangle area. A divided rectangle candidate selecting means for selecting a rectangle to be divided (a rectangle including one or more character components) based on the evaluation function represented by the area ratio of Rectangle dividing means for dividing the divided rectangle, and rectangle integrating means for determining an optimal combination of rectangular integration by combining all the rectangles including the rectangle generated by the rectangle dividing means. The character recognition device according to claim 1, wherein

3. A rectangle processing means for dividing a character string component into a plurality of rectangles by dividing an input character string component into small portions of a histogram value formed by projecting the character string component on an axis in the character string direction. A candidate character selecting means for extracting a plurality of candidate characters by performing feature extraction from character constituent components inside the rectangles and calculating a similarity indicating how close to which character each character is, and the candidate character selecting means Language processing means for calculating an evaluation value of certainty as a sentence by comparing the result of the word dictionary with a grammar dictionary, and a fixed character detection means for detecting a rectangle determined as a character from the result of the language processing means, Character rectangle information extracting means for extracting a feature of a rectangle determined as a character from the result of the determined character detecting means, and a result of the determined character detecting means and the character A writer estimating means for determining a writer of an input character string or a font of a printed character based on the rectangular information obtained by the shape information extracting means, and outputting the result to the candidate character selecting means; Based on the rectangle information obtained by the means, a rectangle not yet determined as a character is integrated and divided, and the result is fed back to the rectangularization processing means by a rectangular division integration means, and the result of the determined character detection means And a recognition result output means for outputting a character as a recognition result, wherein the writer estimation means comprises: a character written by each writer; A writer dictionary that stores and retains the distribution of the feature amount, the aspect ratio of the circumscribed rectangle of the character, and the size of the character, and a sentence that is determined to be linguistically correct with the writer dictionary. A writer evaluation value calculating means for calculating an evaluation value indicating how close to which writer or which font of a printed character is in the writer dictionary by seeing consistency with the writer dictionary; and Writer determination means for specifying the writer of the input character string or the font of the print character according to the result, wherein the rectangularization processing means receives feedback from the rectangular division integration means, and in a part which has not been determined as a character yet, A rectangle newly generated by the rectangle division integration unit is output to a candidate character selection unit, and the processing of the language processing unit, the fixed character detection unit, and the character rectangle information extraction unit is performed when all rectangles are determined as characters or A character recognition device characterized by repeating a certain number of times.