JPH03144788A

JPH03144788A - Pattern normalizing device

Info

Publication number: JPH03144788A
Application number: JP1283829A
Authority: JP
Inventors: Takeshi Kamimura; 健上村
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1989-10-30
Filing date: 1989-10-30
Publication date: 1991-06-20
Anticipated expiration: 2014-05-24
Also published as: JP2893760B2

Abstract

PURPOSE:To allow the pattern normalizing device to correspond to various deformation in an input pattern by repeating geometric conversion by means of an optimized calculation method. CONSTITUTION:A control part 14 executes repeat processing based upon the optimized calculation method so that a distance value 105 between respective feature vectors obtained from an input pattern and a reference pattern is minimized and geometric conversion is repeated so that the shape of the input pattern is more approximated with the reference pattern. The input pattern is deformed by all the reference patterns and several patterns are successively selected in order from the one having the minimum distance value as upper candidates. On the other hand, a geometric conversion part 10 executes deformation so that the sum of respective picture element value in the predeformed pattern and the deformed pattern is stored. Thus, the pattern normalizing device can be allowed to correspond to various deformation of the input pattern.

Description

【発明の詳細な説明】（産業上の利用分野）本発明はパタン認識においての形状を正規化する装置に
関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to an apparatus for normalizing shapes in pattern recognition.

（従来の技術）パタン認識の中でも重要な分野の１つである文字認識は
、一般に前処理、特徴抽出、識別、後処理の４つのプロ
セスに大別される。前処理では入力される文字画像に対
して、雑音の除去や形状の正規化を行い、特徴抽出にお
いてはマスク処理、輪郭トレース、射影等の画像処理手
法を用いて、交点・端点といった特徴点やストロークの
配置、濃度分布等に依存した特微量を求める０次に特微
量を成分として構成される特徴ベクトルと、各カテゴリ
に対応した特徴ベクトルとのマツチングによってカテゴ
リを識別し、最後に後処理として識別結果の確認を行う
。(Prior Art) Character recognition, which is one of the important fields in pattern recognition, is generally divided into four processes: preprocessing, feature extraction, identification, and postprocessing. Preprocessing removes noise and normalizes the shape of the input character image, and feature extraction uses image processing methods such as mask processing, contour tracing, and projection to extract feature points such as intersections and endpoints. Categories are identified by matching feature vectors composed of zero-order feature quantities as components to find feature quantities that depend on stroke arrangement, density distribution, etc., and feature vectors corresponding to each category.Finally, as post-processing, Check the identification results.

文字認識の難しさの要因の１つとして、同じカテゴリに
属する文字パタンの形状の多様性がある。One of the factors contributing to the difficulty in character recognition is the diversity of shapes of character patterns belonging to the same category.

これは特に手書き文字やマルチフォントの印刷文字にお
いて顕著に現れる問題であり、これを解決するために既
にいくつかの方式が提案されている。This problem is particularly noticeable in handwritten characters and multi-font printed characters, and several methods have already been proposed to solve this problem.

これらの方式のうち大部分のものは、形状に対しである
正規化基準を設け、その上で最適化がなされるように前
処理プロセスにおいて変形を行うものである０例えばＧ
、ネイギーとＮ、ツォンにより、“手書き数字の正規化
手法（ＮＯ「指ａｌｉｚａｔｉｏｎＴｅｃｈｎｉｑｕｅ
　ｆｏｒ　Ｈａｎｄｐｒｉｎｔｅｄ　Ｎｕｍｅｒａｌｓ
）　”　と題して、＾５ｓｏｃｉａｔｉｏｎ　　ｆｏｒ
　Ｃｏｍｐｕｔｉｎｇ　ＭａｃｈｉｎｅｒｙのＣｏＩＩ
ｍｕｎｉｃａｔｉｏｎ、　Ｖｏｌ、１３．Ｎｏ、８（１
９７０）の４７５ページから４８１ページに掲載された
論文においては、手書き数字に対して外接矩形を求め、
これが一定の形状となるような変形を行い形状を正規化
している。また津雲、田中により「階層的な位置ずれ補
正処理に基づく手書き漢字認識」と題して、電子情報通
信学会の研究会（資料番号ＰＲＵ８７−１０４．１９８
８年２月）に発表された論文においては、手書き漢字に
対して、ストロークの間隔が均等化されるようにストロ
ークの位置の補正を行う方式を提案している。Most of these methods set a certain normalization standard for the shape, and then transform it in the preprocessing process so that optimization can be performed.
, Nagy and N. Tseng, “Normalization Technique for Handwritten Digits”
for Handprinted Numerals
) ” entitled ^5 association for
CoII of Computing Machinery
communication, Vol. 13. No, 8 (1
In the paper published on pages 475 to 481 of 970), a circumscribed rectangle is found for handwritten digits,
The shape is normalized by deforming it so that it has a constant shape. In addition, Tsugumo and Tanaka published an article entitled "Handwritten Kanji Recognition Based on Hierarchical Positional Displacement Correction Processing" at a research meeting of the Institute of Electronics, Information and Communication Engineers (material number PRU87-104.198).
In a paper published in February 2007, a method was proposed for correcting the stroke positions of handwritten Chinese characters so that the stroke intervals would be equalized.

（発明が解決しようとする課Ｕ）以上に方式においては、ある正規化基準に対する最適な
変形が一定の処理により求められるから、処理量の面か
らみても効率的である。しかし正規化基準の選択は非常
に難しい問題であり、これは特徴抽出や識別のアルゴリ
ズムだけでなく認識の対象にも依存している。設定され
る正規化基準に対し、一定処理によって最適な変形を求
めることは一般に困難であり、その場合には反復収束計
算が必要であると考えられる。(Problem U to be Solved by the Invention) The method described above is efficient in terms of processing amount because the optimal transformation for a certain normalization standard is determined by a certain process. However, the selection of normalization criteria is a very difficult problem, which depends not only on the feature extraction and identification algorithms but also on the recognition target. It is generally difficult to obtain an optimal deformation through constant processing for a set normalization standard, and in that case, it is considered that iterative convergence calculations are necessary.

本発明の目的は、最適化計算手法を用いて幾何学変換を
反復することにより、更に汎用性の高いパタン正規化装
置を提供することにある。An object of the present invention is to provide a more versatile pattern normalization device by repeating geometric transformation using an optimization calculation method.

（課題を解決するための手段）本発明の装置は、パタン認識処理において、入力パタンと参照パタンから
各々特徴抽出を行って得られる特徴ベクトル間の距離が
最小となるように、入力パタンに対する幾何学変換を実
行し、該入力パタンの形状を正規化する装置であって、入力される２値パタンを、濃淡パタンに変換する多値化
処理部と、与えられた変形パラメータに従って、濃淡パタンに対す
る機何学を実行し、変形パタンを求める幾何学変換部と
、前記変形パタンに対し特徴抽出を行い、変形パタンの特
徴ベクトルを求める特徴抽出部と、個々の参照パタンに
対し、あらかじめ特徴抽出を行って得られる参照パタン
の特徴ベクトルを貯えておく辞書と、前記変形パタンの特徴ベクトルと、前記参照パタンの特
徴ベクトルとの間の距離を計算する距離計算部と、前記距離の値を最小とする前記変形パラメータの値を求
め、Ｉ＆適化されるまでの反復処理の制御を行う制御部
と、前記各参照パタン毎に最小化された前記距離の値を貯え
てソーティングを行い、値が最小のものから順に上位候
補として出力するソータとを含んで構成されることを特
徴とする特（作用）本発明のパタン正規化装置は、入力パタンと参照パタン
から各々得られる特徴ベクトル間の距離が最小となるよ
うに、最適化計算手法による反復処理を実行するもので
あり、入力パタンの形状が参照パタンにより近くなるよ
うに幾何学変換が繰り返される。入力パタンは全ての参
照パタンに対して変形され、その結果得られる距離値が
最小のものから順に幾つかを選択し、上位候補とする。(Means for Solving the Problems) In pattern recognition processing, the device of the present invention performs geometrical analysis on an input pattern so that the distance between feature vectors obtained by extracting features from an input pattern and a reference pattern is minimized. A device for normalizing the shape of an input pattern by performing a logical transformation, the device comprising: a multi-value processing unit that converts an input binary pattern into a shading pattern; A geometric transformation unit that performs mechanical analysis to obtain a deformation pattern; a feature extraction unit that performs feature extraction on the deformation pattern and obtains a feature vector of the deformation pattern; and a feature extraction unit that performs feature extraction on each reference pattern in advance. a dictionary for storing feature vectors of the reference pattern obtained by the above-described process; a distance calculation unit for calculating the distance between the feature vector of the deformed pattern and the feature vector of the reference pattern; a control unit that calculates the value of the deformation parameter and controls the iterative process until it is optimized; and a control unit that stores and sorts the distance values minimized for each reference pattern, and Features (Functions) The pattern normalization device of the present invention is characterized in that it is configured to include a sorter that sequentially outputs top candidates as top candidates. In this method, iterative processing is performed using an optimization calculation method so that the input pattern is minimized, and geometric transformation is repeated so that the shape of the input pattern becomes closer to the reference pattern. The input pattern is transformed with respect to all reference patterns, and some of the resulting patterns with the smallest distance values are selected as top candidates.

また幾何学変換部においては、変形前のパタンと変形後
のパタンにおいて、各々の画素値の総和が保存されるよ
うに変形が行なわれる。Further, in the geometrical transformation section, transformation is performed so that the sum of each pixel value is preserved in the pattern before transformation and the pattern after transformation.

（実施例）本発明の実施例について図面を参照して説明する。(Example) Embodiments of the present invention will be described with reference to the drawings.

第１図は本発明のパタン正規化装置の一実施例を示すブ
ロック図である。以下第１図を参照して、本発明の詳細
な説明を行う。FIG. 1 is a block diagram showing an embodiment of the pattern normalization device of the present invention. The present invention will be described in detail below with reference to FIG.

多値化処理部１６は、２値画像である入力パタン１０９
に対し、ガウシアン等のコンボリューションフィルタ処
理を行い、濃淡パタン１００を得る。５Ｘ５のガウシア
ンフィルタの係数の一例を以下に示す。The multivalue processing unit 16 receives an input pattern 109 which is a binary image.
A convolution filter process such as Gaussian is performed on the image to obtain a shading pattern 100. An example of the coefficients of a 5×5 Gaussian filter is shown below.

幾何学変換部１０は、濃淡パタン１００と変形パラメー
タ１０１の２つを入力とし、変形パラメータ１０１によ
り与えられる幾何学変換を濃淡パタン１００に対して施
した結果を、変形パタン１０２として出力する。The geometrical transformation unit 10 receives two inputs, a shading pattern 100 and a deformation parameter 101, performs a geometric transformation given by the deformation parameter 101 on the shading pattern 100, and outputs the result as a deformation pattern 102.

次に幾何学変換の方法について説明を行う。Next, the method of geometric transformation will be explained.

まず簡単のための１次元のパタンの場合を考える。変形
前のパタン１００のアドレスをｘ（０≦Ｘ≦Ｍ）、変形
後のパタン１０２のアドレスをｆ　（ｘ）と表す、ここ
で開数ｆ　（ｘ）は以下の条件を満たすものとする。First, consider the case of a one-dimensional pattern for simplicity. The address of the pattern 100 before transformation is expressed as x (0≦X≦M), and the address of the pattern 102 after transformation is expressed as f (x), where the open number f (x) satisfies the following conditions.

ｆ（０）＝Ｏ（１）ｆ　（Ｍ）　＝Ｍ　　　　　　　　　　　　　　（２）
ｆ　　（ｘ）　≧Ｏ（０≦Ｘ≦Ｍ）　　　　　　　　（
３）この３つの条件により、パタンは変形前と変形後で
同じ大きさを保つ、特に式（３）により、開数ｆは単調
増加となるため、パタンの各成分の順序を保ちながら変
形を行うことができる。f (0) = O (1) f (M) = M (2)
f (x) ≧O (0≦X≦M) (
3) Due to these three conditions, the pattern maintains the same size before and after deformation. In particular, according to equation (3), the fractional number f increases monotonically, so the pattern can be deformed while maintaining the order of each component of the pattern. It can be carried out.

例えばｆ　（ｘ）　　として、ｆ（Ｘ）”ａｓ　Ｘ’　”ａｔ　Ｘ’　”ａｔ　Ｘ＋ａ
ｏ　　　　　（４）と表される３次関数を用いる場合、
Ｘのほかにパラメータとしてａ　ｓ　ｒ　＆　２＊　ａ
　Ｉ＋　ａ　ｏの４ｆｌｌが必要である。しかし式（１
）、（２）を代入することにより次の式（５）が得られ
るから、実際に与えるパラメータは２個でよい、このパ
ラメータが、変形パラメータ１０１に対応している。For example, f (x), f(X)"as X'"atX'"at X+a
When using a cubic function expressed as o (4),
In addition to X, as parameters a s r & 2* a
4fll of I+ao is required. However, the formula (1
) and (2), the following equation (5) is obtained, so only two parameters are actually required. This parameter corresponds to the deformation parameter 101.

ｆ（Ｘ）’ａ、　Ｘ’　　”３２　Ｘ”　　＋（ｉ−ａ
ｓ　　Ｈ２−ａｘ　　Ｈ）ｘ（５）ｆ　（ｘ）として非線形の関数を用いれば、パタンのあ
る部分のみを膨張しまたは収縮するといった変形が可能
である。f(X)'a, X'"32X" + (i-a
s H2-ax H) x (5) If a nonlinear function is used as f (x), deformation such as expanding or contracting only a certain part of the pattern is possible.

次に２次元パタンの変形前のアドレスを（ｘ。Next, the address before transformation of the two-dimensional pattern is (x.

ｙ）、変形後のアドレスを（Ｘ、Ｙ）で表す、上記の条
件（１）〜（３）を満たす開数ｆとｇにより、以下の式
（６）、（７）に従って２次元パタンの幾何学変換を実
行すると、各方向ごとに独立な変形が行なわれる。y), the address after transformation is expressed as (X, Y), and the two-dimensional pattern is created according to the following equations (6) and (7) using the open numbers f and g that satisfy the above conditions (1) to (3). When a geometric transformation is performed, an independent transformation is performed in each direction.

Ｘ＝ｆ　（ｘ）　　　　　　　　　　　　　　（６）Ｙ
＝ｇ　（ｙ）　　　　　　　　　　　　　　（７）一方
、次の式（８）、（９）のように、他方向のアドレス値
の項を加えると、更に長方形から平行四辺形への変形や
、回転といったことが可能となる。X=f (x) (6)Y
=g (y) (7) On the other hand, if we add terms for address values in other directions, as in the following equations (8) and (9), we can further transform a rectangle into a parallelogram and rotate it. It becomes possible.

Ｘ＝Ｆ（ｘ、ｙ）＝ｆ（ｘ）＋ｂ−ｙ　　　（８）Ｙ＝
Ｇ（ｘ、ｙ）＝ｇ　（ｙ）＋ｃ−ｘ　　　（９）この場
合、変形パラメータ１０１の個数はＸ方向とｙ方向に関
して各３個の計６個となる。X=F(x,y)=f(x)+by (8)Y=
G(x,y)=g(y)+c−x (9) In this case, the number of deformation parameters 101 is six in total, three each in the X direction and the y direction.

以上の変形においては開数ｆ　（０）　、　ｆ　（Ｍ）
の値が一定であるから、第２図（ａ）に示すように、０
≦Ｘ≦Ｍ、０≦ｙ≦Ｎで表される矩形領域のうち両端の
成分は移動することができない、しかし第２図（ｂ）の
ように、パタンの外接正方形の外側に背景部分を加えて
正規化処理領域とすれば、パタンの両端の成分も移動す
ることができる。In the above transformation, the numerical numbers f (0), f (M)
Since the value of is constant, as shown in Figure 2(a), 0
The components at both ends of the rectangular area represented by ≦X≦M and 0≦y≦N cannot be moved, but as shown in Figure 2 (b), the background part is added outside the circumscribed square of the pattern. If this is used as the normalization processing area, components at both ends of the pattern can also be moved.

また更に領域によって異なる間数を用いれば局所的な変
形が実現される。Furthermore, local deformation can be realized by using different numbers of intervals depending on the region.

次に画素値の計算方法について説明を行う、変形前のパ
タンのアドレス（ｘ、ｙ）に格納されている値をｓｒｃ
　（ｘ、ｙ）、変形後のパタンのアドレス（ｘ、ｙ）に
格納される値をｄａｔ（ｘ。Next, we will explain how to calculate pixel values.The value stored at the address (x, y) of the pattern before transformation is
(x, y), the value stored at the address (x, y) of the pattern after transformation is dat(x.

ｙ）で表すとき、ツギノシキ（１０）に従って画素値を
求める。y), the pixel value is determined according to Tsuginoshiki (10).

Ｙ−０１ｘ、ｙｌすなわち関数ＦとＧによって（Ｘ、Ｙ）に対応づけられ
る（ｘ、ｙ）が複数存在する場合、言い換えると関数Ｆ
またはＧが多対一対応である場合、対応するｓｒｃ　（
ｘ、ｙ）の総和を求めてｄａｔ（Ｘ、Ｙ）に格納する。Y-01x, yl In other words, if there are multiple (x, y) that are associated with (X, Y) by the functions F and G, in other words, the function F
Or if G has many-to-one correspondence, the corresponding src (
x, y) and stores it in dat(X, Y).

この変形の意味を次に第３図を用いて説明を行う。Next, the meaning of this modification will be explained using FIG. 3.

第３図は１次元のパタンの変形例であり、（ａ）が変形
前、（ｂ）が参照パタンを表している。FIG. 3 shows a modified example of a one-dimensional pattern, where (a) shows the pattern before modification, and (b) shows the reference pattern.

（ａ）に対して両面から収縮変形を加えると、２つの山
の部分を近ずけることが可能であり、この変形を過剰に
行うと２つの山の位置が一致することが予想される。し
かし式（１０）に従って変形を行うと、位置が一致して
得られた山の高さは（Ｃ）のように２倍になるから、（
ｂ）のパタンと区別することが出来る。この変形では、
パタン画素値の総和が保存される。If contraction deformation is applied to (a) from both sides, it is possible to bring the two peaks closer together, and if this deformation is performed excessively, it is expected that the positions of the two peaks will coincide. However, if the transformation is performed according to equation (10), the height of the peak obtained when the positions match will be doubled as shown in (C), so (
It can be distinguished from pattern b). In this variant,
The sum of pattern pixel values is saved.

次に再び第１図を用いて本発明の詳細な説明を行う。Next, the present invention will be explained in detail using FIG. 1 again.

特徴抽出部１１は、変形パタン１０２に対して特徴抽出
処理を行い、得られる特徴ベクトルを１０３として出力
する。特徴抽出処理の例としては、一定のりサンプリン
グ点上で以下のような空間微分フィルタとのコンボリュ
ーション処理を行い、得られた値を特徴ベクトル１０３
の要素として出力する。The feature extraction unit 11 performs feature extraction processing on the deformed pattern 102 and outputs the obtained feature vector as 103. As an example of feature extraction processing, convolution processing with a spatial differential filter as shown below is performed on a constant sampling point, and the obtained value is converted into a feature vector 103.
Output as an element.

辞書１２は、特徴抽出部１１において述べた処理を、あ
らかじめ全参照パタンに対して実行して得られる特徴ベ
クトルを格納し、制御部１４からの読み出し要求信号１
０７に同期してこれを読み出し１０４とする。The dictionary 12 stores feature vectors obtained by performing the processing described in the feature extraction unit 11 on all reference patterns in advance, and receives the read request signal 1 from the control unit 14.
This is read out 104 in synchronization with 07.

距離計算部１３は、変形パタンから得られる特徴ベクト
ル１０３と、参照パタンから得られる特徴ベクトル１０
４を入力し、両者の距離値を求め１０５として出力する
。距離としては、例えばＬ　１ノルム、（、＝Σｌａｌ　　ｂ、１　　　　　　　　（１１）ま
たはし２ノルム、ＣＩ”Σ（ａ＋　　ｂ、）”　　　　　　　　（１２）
等を用いることができる。The distance calculation unit 13 calculates a feature vector 103 obtained from the deformed pattern and a feature vector 10 obtained from the reference pattern.
4 is input, the distance value between the two is calculated and output as 105. As a distance, for example, L 1 norm, (,=Σlal b, 1 (11) or 2 norm, CI"Σ(a+b,)" (12)
etc. can be used.

制御部１４は、変形パラメータ１０１と距離値１０５を
入力し、距離値１０５が最小となるような変形パラメー
タ１０１の値をｉ″ｉＭ化計算手計算手法た反復処理に
よって求める。求まったならば、その収束値と対応する
参照パタンのカテゴリ番号を信号１０６として出力し、
更に辞書１２に対して読み出し要求信号１０７を出力し
、次の参照パタンの特徴ベクトルの読み出しを行う。The control unit 14 inputs the deformation parameter 101 and the distance value 105, and calculates the value of the deformation parameter 101 that minimizes the distance value 105 through an iterative process using the i″iM calculation manual calculation method.Once determined, Outputting the convergence value and the corresponding reference pattern category number as a signal 106;
Furthermore, a read request signal 107 is output to the dictionary 12, and the feature vector of the next reference pattern is read.

次にａ適化計算生方の１つであ′る最急勾配法に関して
説明を行う、幾何学変換部１０へ入力する変形パラメー
タ１０１をベクトルＫ　＝（ｋ　＋ｋｚ　、　ｋｎ　、
・・・、ｋＬ）で表し、Ｋを与えたときに特徴抽出部１
１において得られる距離値１０５をＤ（Ｋ）とすると、
以下■〜■のようになる。Next, we will explain the steepest gradient method, which is one of the a optimization calculation methods.The deformation parameters 101 input to the geometric transformation unit 10 are expressed as vectors K = (k + kz , kn ,
..., kL), and when K is given, the feature extraction unit 1
If the distance value 105 obtained in 1 is D(K),
It will be as shown below.

■にの初期値を与える。■Give an initial value to.

■ｉ＝１．２．３．・・・しに対して、ｇ＋　＝Ｄ　（
Ｋｔ　”　）　　Ｄ　（Ｋｔ　−＞　　　　（１３）を
求める。但しベクトルＫｌ＋とに、−はＫの第１成分の
みを、各々正の方向と負の方向にΔ（〉Ｏ）だけ変化さ
せたベクトルであり、次の式（１１）のように表すこと
ができる。■i=1.2.3. ..., for g+ = D (
Find Kt '') D (Kt -> (13). However, in the vector Kl+, - is a vector in which only the first component of K is changed by Δ(〉O) in the positive direction and the negative direction, respectively. , and can be expressed as in the following equation (11).

Ｋｔ　　”　　＝（ｋｒ　　”　　　ｌ　　ｋ＋　　＋
　・　ｋｒ（ｊ＋ｉ）。Kt ” = (kr ” l k+ +
・kr(j+i).

ｋ＋”＝に＋＋Δ）とし、Ｄ’　＝Ｄ　（Ｋ−α・Ｇ／ＩＩＧＩ＋＞　　　　　（
１５）を最小にするような正のスカラ値αを求める。k+"= + + Δ), and D' = D (K-α・G/IIGI+> (
Find a positive scalar value α that minimizes 15).

■αが十分に小さくなり、Ｋの値が収束すれば終了、そ
うでなければ、Ｋ−に−α・Ｇ／ｌｌＧ１１　　　　　　　　（１６）
として■へ。■If α becomes small enough and the value of K converges, the process ends; otherwise, K- becomes −α・G/llG11 (16)
As ■.

ベクトルＧが最急勾配方向を表し、スカラーαが更新量
を表す、制御部１４は、変形パラメータ１０１と距離値
１０５を用いて、上述の■〜■を収束するまで繰り返す
。The control unit 14, in which the vector G represents the steepest gradient direction and the scalar α represents the update amount, uses the deformation parameter 101 and the distance value 105 to repeat the above-mentioned steps 1 to 2 until convergence.

この方法では、■においてベクトルにの全ての成分が更
新されるため、水平方向に関する変形と垂直方向に関す
る変形が同時に行なわれる。しかし■では変分法により
、他方向の変形を行わない状態でパラメータを変動させ
て勾配を求めている。In this method, all components of the vector are updated in step (3), so that horizontal and vertical transformations are performed simultaneously. However, in ■, the gradient is obtained by varying the parameters using the variational method without deforming in other directions.

一般に各方向に対する最適な変形の組合せが、両方向に
対する！＆適な変形になるとはいえない、そこで制御部
１４は、■〜■の反復処理をＸ方向とｙ方向の２つのフ
ェーズに分け、各フェーズで１方向の変形のみに関する
パラメータを更新し、両フェーズを１回ずつ実行して１
回の反復とする。In general, the optimal combination of deformations for each direction is for both directions! Therefore, the control unit 14 divides the iterative processing of ■ to ■ into two phases in the X direction and the y direction, updates parameters related to only one direction of deformation in each phase, and Execute the phases one time at a time
Repeated times.

ソータ１５は、制御部１４より出力される信号１０６を
貯えるメモリと、比較演算器等を含んで構成される。信
号１０６のデータのうち、距離の収束値をメモリ上でソ
ートし、その上位候補データを対応するカテゴリ番号と
共に信号１０８に出力する。The sorter 15 is configured to include a memory that stores the signal 106 output from the control section 14, a comparator, and the like. Among the data of the signal 106, the distance convergence values are sorted on the memory, and the upper candidate data is outputted as the signal 108 together with the corresponding category number.

ここでは２つの特徴ベクトル間の距離を最小にする装置
として説明を行ったが、例えば距離ではなく類似度を最
大にするような最適化基準を用いる場合も考えられる。Although the description has been made here as a device that minimizes the distance between two feature vectors, it is also possible to use an optimization criterion that maximizes the degree of similarity rather than the distance, for example.

この場合、類似度をＤ（Ｋ）で表すとすると、制御部１
４での最適化反復処理は以下のようになる。In this case, if the degree of similarity is expressed as D(K), then the control unit 1
The optimization iterative process in step 4 is as follows.

■にの初期値を与える。■Give an initial value to.

■ｉ＝１．２．３．・・・、Ｌに対して、ｇ＋　　＝Ｄ
　（Ｋｌ　　”　　）−Ｄ　　（Ｋｌ　　−）　　　　
　（１３）を求める。■i=1.2.3. ..., for L, g+ = D
(Kl ”)-D (Kl-)
Find (13).

■ベクトルＧ＝　（ｇ＋　、ｇ２１ｇｓ　、・・・ｇＬ
）とし、Ｄ’　＝Ｄ　（Ｋ−α・Ｇ／ＩＩＧＩ＋＞　　　　　（
１５）を最大とするような正のスカラ値αを求める。■Vector G= (g+ , g21gs ,...gL
), and D' = D (K-α・G/IIGI+> (
Find a positive scalar value α that maximizes 15).

■αが十分に小さくなり、Ｋの値が収束すれば終了、そ
うでなければ、Ｋ−に−α・Ｇ／ＩＩＧＩＩ　　　　　　　　（ｔｅ＞
として■へ。■If α becomes small enough and the value of K converges, the process ends; otherwise, K- is converted to −α・G/IIGII (te>
As ■.

更にソータ１５において収束値を最大値から順にソート
することにより、類似度を最大とするような変形が実現
される。Furthermore, by sorting the convergence values in order from the maximum value in the sorter 15, a transformation that maximizes the similarity is realized.

（発明の効果）以上に説明したように、本発明のパタン正規化装置を用
いることにより、従来に比べ、入力パタンの更に多様な
変形に対応することができる。また最適化の基準を、特
徴ベクトル間の距離の最小化だけでなく、認識対象に応
じて設定することにより、汎用性の高い正規化処理を実
現することができる。(Effects of the Invention) As described above, by using the pattern normalization device of the present invention, it is possible to deal with more diverse deformations of input patterns than in the past. Moreover, by setting the optimization criteria not only based on minimizing the distance between feature vectors but also according to the recognition target, highly versatile normalization processing can be realized.

また幾何学変換部においては、画素値の総和が保存され
るように変形が行なわれる。これにより例えば入力パタ
ンとして漢数字の「二」、参照パタンとして漢数字の「
−」を用いる場合、「二」の両ストロークの位置を上下
方向の収縮変形により一致させても、得られるストロー
クの高さが倍になるから、特徴ベクトル間の距離が大き
くなり、「−」と識別することができる６本発明にはこ
のような優れた効果がある。In addition, the geometric transformation unit performs transformation so that the sum of pixel values is preserved. As a result, for example, the input pattern is the kanji numeral ``2'', and the reference pattern is the kanji numeral ``2''.
When using "-", even if the positions of both strokes of "2" are matched by contraction deformation in the vertical direction, the height of the obtained stroke will be doubled, so the distance between the feature vectors will become large, and "-" The present invention has such excellent effects.

[Brief explanation of the drawing]

第１図は本発明のパタン正規化装置の一実施例を示すブ
ロック図であり、第２図は正規化領域内の幾何学変換の
例を示す図であり、第３図は幾何学変換における画素値
の計算例を示す図である。１０・・・幾何学変換部、１１・・・特徴抽出部、１２
・・・辞書、１３・・・距離計算部、１４・・・制御部
、１５・・・ソータ、１６・・・多値化処理部。第１図第図〜移動／（ｂ）第（ａ）変形前のパタン（Ｃ）変形後のパタンし図（ｂ）譬照パタンFIG. 1 is a block diagram showing an embodiment of the pattern normalization device of the present invention, FIG. 2 is a diagram showing an example of geometric transformation within the normalization area, and FIG. It is a figure which shows the example of calculation of a pixel value. 10...Geometric conversion unit, 11...Feature extraction unit, 12
... Dictionary, 13... Distance calculation section, 14... Control section, 15... Sorter, 16... Multivalue processing section. Figure 1 - Movement/ (b) (a) Pattern before transformation (C) Pattern after transformation (b) Comparison pattern

Claims

[Claims]

(1) In pattern recognition processing, geometric transformation is performed on the input pattern so that the distance between the feature vectors obtained by extracting features from the input pattern and the reference pattern is minimized, and the shape of the input pattern is A normalization device that includes a multi-value processing unit that converts an input binary pattern into a shading pattern, and a geometry unit that executes mechanics on the shading pattern to obtain a deformed pattern according to given deformation parameters. a feature extraction unit that performs feature extraction on the deformed pattern to obtain a feature vector of the deformed pattern; a distance calculation unit that calculates a distance between the feature vector of the deformation pattern and the feature vector of the reference pattern; a distance calculation unit that calculates the value of the deformation parameter that minimizes the value of the distance, and a sorter that stores the distance values minimized for each of the reference patterns, performs sorting, and outputs the distance values as top candidates in order from the one with the smallest value. A pattern normalization device comprising:

(2) In the pattern normalization device according to claim 1, the geometric transformation unit may
If a function representing the post-transformation address is determined by the transformation parameter, and there are a plurality of pre-transformation addresses corresponding to the post-transformation address, the sum of the values stored in each of the pre-transformation addresses is calculated, and the post-transformation address is A pattern normalization device characterized in that a pattern normalization device stores data in a pattern normalization device.