JPH041870A

JPH041870A - Method and device for recognizing pattern

Info

Publication number: JPH041870A
Application number: JP2101837A
Authority: JP
Inventors: Keiji Yamada; 山田　敬嗣
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1990-04-19
Filing date: 1990-04-19
Publication date: 1992-01-07
Anticipated expiration: 2013-07-23
Also published as: JP2778194B2

Abstract

PURPOSE:To improve reliability for the result of recognition by comparing assurance with a threshold value to define an output result as the result of recognition when the assurance is larger than the threshold value and to reject the result of recognition vice versa. CONSTITUTION:An output signal corresponding to each category is obtained from a pattern to be inputted or the pattern feature value with use of a multi- layer type neural circuit network. Then the maximum category is defined as the result of recognition. Furthermore the assurance R is calculated from an entire vector of the output value. When the assurance R is larger than the threshold value, the category having the maximum output value is selected as the result of recognition. Meanwhile a rejection signal is outputted when the R is smaller than the threshold value. Thus the uncertain result of recognition can be rejected and the reliability of the result of recognition is improved.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、多層構造の神経回路モデルを用いたパターン
認識装置において、神経回路網の出力信号から計算した
確信度に従って、認識結果のみならずリジェクト信号を
出力するパターン認識方法および装置に関するものであ
る。[Detailed Description of the Invention] [Industrial Application Field] The present invention provides a pattern recognition device using a multi-layered neural network model, in which not only the recognition result but also the recognition result is calculated according to the certainty factor calculated from the output signal of the neural network. The present invention relates to a pattern recognition method and device that outputs a reject signal.

[Conventional technology]

従来、多層神経回路網を用いたパターン認識では、パタ
ーンや特徴量を入力し、文献（“Ａ　Ｎｅｕ−ｒａｌ　
Ｎｅｔｗｏｒｋ　Ｄｉｇｉｔ　Ｒｅｃｏｇｎｉｔｉｏｎ
”　ｂｙ　Ｄ、Ｊ、Ｂｕｒｒ　：ＪＥＥＲＩｎｔｅｒｎ
ａｔｉｏｎａｌ　Ｃｏｎｆｅｒｅｎｃｅ　ｏｎ　Ｓｙｓ
ｔｅｍ、　Ｍａｎａｎｄ　Ｃｙｂｅｒｎｅｔｉｃｓ　：
　１９８６．　ｐｐ、１６２１−１６２５）に示される
ように、各要素がカテゴリへの類似度を表現するような
ベクトル量として出力信号を得る。多くの場合、出力信
号の要素は、０から１の数値または−１から１の数値と
して表現されている。ベクトル量の要素の中で最大の値
を取る要素が表すカテゴリを認識結果とするものである
。Conventionally, in pattern recognition using multilayer neural networks, patterns and feature quantities are input, and the literature (“A Neu-ral
Network Digit Recognition
” by D, J, Burr: JEERIntern
ational Conference on Sys.
tem, Manand Cybernetics:
1986. The output signal is obtained as a vector quantity in which each element expresses the degree of similarity to a category, as shown in (pp, pp. 1621-1625). Often, the elements of the output signal are expressed as numbers between 0 and 1 or between -1 and 1. The recognition result is the category represented by the element that takes the maximum value among the elements of the vector quantity.

パターンｖｌＷＡ装置では、認識対象でないパターンが
入力されたり、２つのカテゴリのうちのどちらともわか
らないようなパターンが入力されることがある。このよ
うな場合には、無理に認識結果を求めるよりは、認識不
能としてリジェクトする方が認識結果の信鯨性を高める
ことができる。In the pattern vlWA device, a pattern that is not a recognition target may be input, or a pattern that cannot be recognized as one of two categories may be input. In such a case, it is better to reject the recognition result as unrecognizable than to forcefully obtain the recognition result, thereby increasing the credibility of the recognition result.

そのため従来のパターン認識装置では、パターンの整合
度を入力パターンと各カテゴリ毎の標準パターンとの類
似度で評価し、最も類似度が高いカテゴリを認識結果と
するなどの方法がある。このような方法では、第１位候
補の類似度が成る一定値以上に達しないとリジェクトし
たり、１位候補と２位候補との類似度の差が成る一定値
以上ないとリジェクトするなどの方法が取られている。Therefore, in conventional pattern recognition devices, there is a method in which the degree of matching of patterns is evaluated by the degree of similarity between an input pattern and a standard pattern for each category, and the category with the highest degree of similarity is determined as a recognition result. In such a method, a candidate is rejected if the similarity of the first candidate does not reach a certain value or more, or it is rejected if the difference in similarity between the first candidate and the second candidate does not exceed a certain value, etc. A method is being taken.

[Problem to be solved by the invention]

神経回路網を用いたパターン認識方法においては、出力
信号は上限下限を持つ値を要素として持つベクトル値で
あるが、この値は従来のパターン認識方法に見られる類
似度とは異なり、上記の従来のリジェクト方法を用いた
場合には、認識誤りとリジェクトの双方を減らすように
しきい値を設定することは困難であり、本来正しく認識
されているパターンの多くがリジェクトされてしまう。In pattern recognition methods using neural networks, the output signal is a vector value whose elements are values with upper and lower limits, but this value is different from the similarity seen in conventional pattern recognition methods; When using the above rejection method, it is difficult to set a threshold to reduce both recognition errors and rejections, and many patterns that are originally correctly recognized end up being rejected.

上記のような従来のパターン認識方法におけるリジェク
ト方法では、１位候補の値のみを用いているか、または
２位候補までの値を用いている。In the rejection method in the conventional pattern recognition method as described above, only the value of the first candidate or the values up to the second candidate are used.

これでは、非常に似た複数のパターンを分類する場合に
はリジェクトが多く発生してしまう。またこのような方
法では、２つ以上の候補の類似度などの値を統一的に評
価してリジェクトを行うことはできない。This results in many rejections when classifying multiple very similar patterns. Further, with such a method, it is not possible to uniformly evaluate the values such as similarity of two or more candidates and reject them.

本発明の目的は、神経回路網の出力信号であるベクトル
値から認識結果の確信度を求め、その値としきい値とを
比較判定して認識結果を出力するか、あるいはリジェク
ト信号を出力することができるパターン認識方法および
装置を提供することにある。An object of the present invention is to obtain the confidence level of a recognition result from a vector value that is an output signal of a neural network, compare that value with a threshold value, and output a recognition result or output a reject signal. The object of the present invention is to provide a pattern recognition method and device that can perform the following steps.

[Means to solve the problem]

第１の発明は、多層構造の神経回路網を用いたパターン
認識方法において、入力パターンまたは入力パターンから抽出された特徴量
を入力して、多層構造の神経回路網によってカテゴリを
表す出力ベクトルに変換し、その出力ベクトルから、１
つの要素だけが値域の上限値の近くになり、その他の要
素は値域の下限値の近くであるときに確信度が最大とな
るように確信度を計算し、その確信度をしきい値と比較
して、しきい値よりも大きい場合には出力結果を認識結
果とし、確信度がしきい値よりも小さい場合にはリジェ
クトすることを特徴とする特第２の発明は、多層構造の神経回路網を用いたパターン
認識装置において、入力データからカテゴリを表す出力ベクトル信号へと変
換する神経回路網を実現する手段と、出力ベクトル信号
のうちで最大値を持つカテゴリを検出する手段と、出力ベクトル信号が、１つの要素だけが値域の上限値の
近くになり、その他の要素は値域の下限値の近くである
ときに確信度が最大となるように確信度を計算する手段
と、確信度がしきい値より大きい場合には最大値検出部の出
力をもって認識結果とし、確信度がしきい値以下の場合
にはリジェクトとして判定する手段とを備え、認識結果またはリジェクト信号を出力することを特徴と
する。The first invention is a pattern recognition method using a multilayer neural network, in which an input pattern or a feature extracted from the input pattern is input and converted into an output vector representing a category by the multilayer neural network. And from that output vector, 1
Calculate the confidence such that the confidence is maximum when only one element is near the upper bound of the range and the other elements are near the lower bound of the range, and compare that confidence with a threshold The second invention is characterized in that the output result is recognized as a recognition result when the confidence level is larger than the threshold value, and is rejected when the confidence level is smaller than the threshold value. In a pattern recognition device using a network, means for realizing a neural network that converts input data into an output vector signal representing a category, a means for detecting a category having a maximum value among the output vector signals, and an output vector. means for calculating the confidence such that the confidence is maximized when only one element of the signal is near the upper limit of the range and the other elements are near the lower limit of the range; It is characterized by comprising means for determining that the output of the maximum value detection unit is the recognition result when it is larger than the threshold value, and as a rejection when the confidence level is less than the threshold value, and outputting the recognition result or a rejection signal. shall be.

第３の発明は、多層構造の神経回路網を用いたパターン
認識装置において、入力データからカテゴリを表す出力ベクトル信号へと変
換する神経回路網を実現する手段と、出力信号を降順に
ソートする手段と、ソートされた結果の最大値のカテゴリを記憶する手段と
、ソートされた出力ベクトル信号のうちの上位の一定個数
の要素において、第１位の要素だけが値域の上限値の近
くになり、その他の要素は値域の下限値の近くであると
きに確信度が最大となるように確信度を計算する手段と
、確信度がしきい値より大きい場合には最大値カテゴリ記
憶部の内容をもって認識結果とし、確信度がしきい値以
下の場合にはリジェクトとして判定する手段とを備え、認識結果またはリジェクト信号を出力することを特徴と
する。A third invention is a pattern recognition device using a multilayered neural network, which includes means for realizing a neural network that converts input data into an output vector signal representing a category, and a means for sorting the output signals in descending order. and means for storing the category of the maximum value of the sorted results; and among the fixed number of high-order elements of the sorted output vector signal, only the first-ranked element is near the upper limit of the range; Other elements include means for calculating confidence so that the confidence is maximized when it is near the lower limit of the value range, and recognition based on the contents of the maximum value category storage unit when the confidence is greater than a threshold value. The method is characterized in that it includes a means for determining that the recognition result is rejected when the confidence level is less than a threshold value, and outputs a recognition result or a rejection signal.

[Effect]

第１の発明であるパターン認識方法では、多層型の神経
回路網を用いて、入力されるパターンまたはパターンの
特徴量から各カテゴリに対応する出力信号を求め、その
最大値をもつカテゴリを認識結果として定める。さらに
、出力値のベクトル全体から以下の式を用いて確信度を
計算する。ここで、認識対象のカテゴリ数をＮとして、
神経回路網からの出力ベクトルをＰｉ　　（１≦ｉ≦Ｎ
、　　０≦Ｐｉ≦１）、α、βを係数とすると、確信度
Ｒは、・　・　・（１）として求める。この値は、１つの値だけが上限値ｌに近
く、他の全ての値が下限値０に近いときに最大値を取り
、最大値を持つ要素Ｐｉが１から離れたり、第２位以下
の要素が最大値に接近した場合に、Ｒは小さくなる。そ
の値Ｒがしきい値よりも大きい場合には、最大の出力値
を持つカテゴリを認識結果として選択し、Ｒがしきい値
よりも小さい場合にはリジェクト信号を出力することに
より、不確かな認識結果をリジェクトすることができる
。The pattern recognition method, which is the first invention, uses a multilayer neural network to obtain an output signal corresponding to each category from the input pattern or feature amount of the pattern, and selects the category with the maximum value as the recognition result. Established as Furthermore, the confidence level is calculated from the entire vector of output values using the following formula. Here, the number of categories to be recognized is N,
The output vector from the neural network is Pi (1≦i≦N
, 0≦Pi≦1), α, and β are coefficients, the confidence level R is obtained as follows. This value takes the maximum value when only one value is close to the upper limit l and all other values are close to the lower limit 0, and if the element Pi with the maximum value is away from 1 or R becomes smaller when the element approaches its maximum value. If the value R is larger than the threshold value, the category with the maximum output value is selected as the recognition result, and if R is smaller than the threshold value, a reject signal is output, so that uncertain recognition can be avoided. Results can be rejected.

第２の発明であるパターン認識装置では、神経回路網の
出力ベクトルの全ての要素値から式（１）に従った演算
を実行する手段と、得られた確信度の値をしきい値と比
較して認識結果を出力するかリジェクトを行うかを判定
する手段とを備えることにより、不確かな認識結果をリ
ジェクトする。The pattern recognition device, which is the second invention, includes means for performing an operation according to formula (1) from all element values of the output vector of the neural network, and comparing the obtained confidence value with a threshold value. and means for determining whether to output the recognition result or reject the recognition result, thereby rejecting an uncertain recognition result.

第３の発明であるパターン認識装置では、神経回路網の
出力ベクトル値を降順に並べて、その上位の一定個数の
値から式（１）に従って確信度を計算する手段と、得ら
れた確信度の値をしきい値と比較して認識結果を出力す
るかリジェクトを行うかを判定する手段を備えることに
より、認識結果に影響を与え得る出力値の上位のカテゴ
リの値から計算した確信度に従って不確かな認識結果を
リジェクトする。The pattern recognition device, which is the third invention, includes means for arranging the output vector values of the neural network in descending order and calculating the confidence level from a certain number of higher-order values according to equation (1), and By providing a means for determining whether to output or reject the recognition result by comparing the value with a threshold value, uncertainty is calculated from the higher category values of the output value that may affect the recognition result. Reject recognition results.

〔Example〕

以下、本発明の実施例である文字認識方法と装置を図面
を参照して説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS A character recognition method and apparatus according to an embodiment of the present invention will be described below with reference to the drawings.

第１図は、本発明のパターン認識方法を説明する流れ図
を示す。認識対象となるパターンが入力され（ステップ
Ｓ　１０１）、次にそのパターンより特徴抽出がなされ
（ステップＳ　１０２）、ベクトル値に変換される。抽
出される特徴はどのようなものでもよく、文字認識の場
合ではガウスフィルタによる濃淡特徴や方向曲率を抽出
した特徴などがある。FIG. 1 shows a flowchart illustrating the pattern recognition method of the present invention. A pattern to be recognized is input (step S101), and then features are extracted from the pattern (step S102) and converted into vector values. The extracted features may be of any type, and in the case of character recognition, they may include shading features using a Gaussian filter, features extracted from directional curvature, and the like.

また、特徴抽出をしない場合には、入力パターン自体を
特徴ベクトル値として扱うことも可能である。−例とし
て、第２図を参照してガウスフィルタによる文字パター
ンの特徴抽出方法を説明する。Furthermore, when feature extraction is not performed, it is also possible to treat the input pattern itself as a feature vector value. - As an example, a method of character pattern feature extraction using a Gaussian filter will be explained with reference to FIG.

フィルタリング部２０３において、白黒の二値画像とし
て入力された文字パターン２０１に、５×５マトリクス
で与えられたガウスフィルタ２０２を畳み込む。入カニ
値画像を３２画素×４２西素の二値画像Ｂ　（ｉ、ｊ）
、５Ｘ５フイルタをＦ（ｋ、ｌ）、結果の多値画像をＧ
（ｉ、ｊ）とすると、Ｇ（ｉ、ｊ）・（２）なる式に従って計算が実行する。その結果のＧ（ｉ、ｊ
）は、リサンプリング部２０４で９×１１の９９個のデ
ータに間引かれ、濃淡特徴になる。第２図の濃淡時＠２
０６は、このデータを表示したもので、１つ１つの要素
の値を丸の大きさで示している。In the filtering unit 203, a Gaussian filter 202 given in a 5×5 matrix is convolved with the character pattern 201 input as a black and white binary image. The input value image is converted into a binary image B (i, j) of 32 pixels x 42 pixels.
, 5×5 filter as F(k,l), and the resulting multivalued image as G
(i, j), calculation is performed according to the formula G(i, j) (2). The resulting G(i, j
) is thinned out to 99 pieces of data (9×11) by the resampling unit 204, and becomes a grayscale feature. Fig. 2 shading @2
06 is a display of this data, and the value of each element is indicated by the size of the circle.

次に特徴ベクトル値は、多層神経回路網を用いて出力値
に変換する（ステップＳ　１０３）。計算方法を第３図
を用いて説明する。第３図では、第０層から第２層まで
の３層構造の神経回路網の計算方法を説明するが、本発
明では、３層構造に限るものではなく、２層のものや４
層以上のものにも容易に拡張が可能である。Next, the feature vector value is converted into an output value using a multilayer neural network (step S103). The calculation method will be explained using FIG. In FIG. 3, a calculation method for a neural network with a three-layer structure from the 0th layer to the 2nd layer will be explained.
It can be easily extended to more than just layers.

第３図では、第１図のステップ３１０２によって抽出さ
れた特徴ベクトル値を多層神経回路網に入力し、第０層
のベクトル値とする（ステップ５３０１）。次に、以下
の式（３）に従って第０層ベクトル値から第１層のマト
リクス積を計算する（ステップＳ　３０２）。第０層の
ベクトル値をｏ＝’（１≦１≦Ｎ　ｏ　）とし、第１層
重みマトリクス値をＷ＝＝’（１≦ｉ≦Ｎ０．１≦ｊ≦
Ｎ、）　、計算結果のベクトル値をｏ＝’（１≦ｊ≦Ｎ
、）とすると、０．１　　＝　Σ　　（ｏ４’Ｘｗ　点
Ｊｌ）　　十　０１厘　　　　　　　　　　　（３）と
なる。ただし、ここでθ、１はバイアス値である。次に
ｏ、’（１≦ｊ≦Ｎ、）の各々に対して以下の式（４）
に従ってＳ関数を計算する（ステップ５３０３）。In FIG. 3, the feature vector value extracted in step 3102 of FIG. 1 is input to the multilayer neural network and is set as the vector value of the 0th layer (step 5301). Next, the first layer matrix product is calculated from the zero layer vector values according to the following equation (3) (step S302). The vector value of the 0th layer is o=' (1≦1≦N o ), and the first layer weight matrix value is W==' (1≦i≦N0.1≦j≦
N, ), the vector value of the calculation result is o='(1≦j≦N
, ), then 0.1 = Σ (o4'Xw point Jl) 10 01 rin (3). However, here, θ, 1 is a bias value. Next, use the following equation (4) for each of o and '(1≦j≦N,)
The S function is calculated according to (step 5303).

ｏ’Ｊ’　＝　１／２　（１＋ｔａｎｈ　（ｏｊ’）　
）　　　　　（４）次に、第２層重みマトリクスをｗ、
、”（１≦ｊ≦Ｎ、、ｔ≦に≦ＮＺ）とし、バイアス値
をθ−として、ｏｋｚ＝Σ　（Ｏｊ、’　Ｘ　Ｗｊｋ”）＋θ−（５）
なる計算を行う（ステップＳ　３０４）。さらに、ｏ　
、　２（１≦に≦Ｎ　ｚ　）は各々の式（６）に従って
Ｓ関数が計算される（ステップＳ　３０５）。o'J' = 1/2 (1+tanh (oj')
) (4) Next, the second layer weight matrix is w,
," (1≦j≦N,, t≦≦NZ), and the bias value is θ-, okz=Σ (Oj,' X Wjk") + θ-(5)
A calculation is performed (step S304). Furthermore, o
, 2 (1≦and≦N z ), the S function is calculated according to each equation (6) (step S305).

ｏ’−＝　１／　２　（１＋ｔａｎｈ　（ｏｉｌ”）　
］　　　　　（６）最後に、第２層のデータは出力信号
として出力される（ステップＳ　３０６）。o'-= 1/2 (1+tanh (oil")
(6) Finally, the second layer data is output as an output signal (step S306).

ここで用いられる第１層および第２層の重みマトリクス
は、文献（“Ａｎ　Ｉｎｔｒｏｄｕｃｔｉｏｎ　ｔｏ　
Ｃｏｓｐｕ−ｔｉｎｇ　ｗｉｔｈ　Ｎｅｕｒａｌ　Ｎｅ
ｔｓ″ｂｙ　Ｒ，Ｐ、Ｌｉｐｐｍａｎｎ：ＩＥＥＥ。The weight matrices of the first and second layers used here are described in the literature (“An Introduction to
Cospu-ting with Neural Ne
ts″by R,P,Lippmann: IEEE.

ＡＳＳＰ：　Ａｐｒｉｌ　１９８７．　ｐｐ、４−２２
）に示されるような方法で予め学習により決定しである
ものとする。ASSP: April 1987. pp, 4-22
) is determined in advance through learning.

次に、出力データｏ’ｈ”（１≦に≦Ｎ、）の中から、
最大値を求める。その番号を最大値カテゴリとする（ス
テップＳ　１０４）。Next, from the output data o'h'' (1≦to≦N,),
Find the maximum value. The number is set as the maximum value category (step S104).

また、式（１）に従って出力データから確信度Ｒが計算
される（ステップＳ　１０５）。ここで、出力データは
Ｎ次元のベクトルで、ベクトルの要素Ｐｉ（１≦ｉ≦Ｎ
）は、それぞれＯから１までの値をとるとする。このパ
ターン認識方法を、数字認識のために用いる場合には、
１０個の文字を認識対象とするためＮの値は１０と設定
し、英数字を認識対象とする場合には、大文字２６文字
、小文字２６文字。Furthermore, the certainty factor R is calculated from the output data according to equation (1) (step S105). Here, the output data is an N-dimensional vector, and the vector element Pi (1≦i≦N
) take values from O to 1, respectively. When using this pattern recognition method for number recognition,
Since 10 characters are to be recognized, the value of N is set to 10, and when alphanumeric characters are to be recognized, 26 uppercase letters and 26 lowercase letters.

数字１０文字を認識するため、Ｎの値を６２に設定する
。また、式（１）の係数α、βをそれぞれｌとした。In order to recognize 10 numeric characters, the value of N is set to 62. Further, the coefficients α and β in equation (1) were each set to l.

これは、本質的な問題ではなく他の値を取ることも可能
である。また、式（１）では、出力データの各要素がＯ
から１までの値を仮定したが、−１から１の場合や、さ
らに−船釣に出力データの各要素がａからｂまでの値を
とる場合にも容易に拡張でき以下の式（７）で表現でき
る。This is not an essential problem, and other values are also possible. Also, in equation (1), each element of the output data is O
Although we assumed a value from -1 to 1, it can be easily extended to the case where each element of the output data takes a value from a to b in -boat fishing, as shown in equation (7) below. It can be expressed as

Ｒ＝−ａ　Σ　（Ｐ　１−ａ）　　（ｂ−Ｐ　ｉ）−β
　（Σ　（（Ｐ　１−ａ）／　（ｂ−ａ）　　）・　・
　・（７）次に、計算された確信度Ｒとしきい値Ｔとを比較して（
ステップＳ　１０６）、Ｒがしきい値以上ならばステッ
プ５１０７へ進み、ステップ５１０４で求められた最大
値のカテゴリ番号を出力し、Ｒがしきい値未満ならばス
テップ５１０８へすすみ、リジェクト信号である０を出
力する。しきい値Ｔの値は便宜上０．１とするが、これ
は本質的なものではなくどのような値でもよい。以上の
ステップで１パターンの認識処理を終了する。R=-a Σ (P 1-a) (b-P i)-β
(Σ ((P 1-a)/ (ba-a) )・・
・(7) Next, compare the calculated confidence R and the threshold T and calculate (
Step S106): If R is greater than or equal to the threshold, proceed to step 5107, and output the maximum category number obtained in step 5104; if R is less than the threshold, proceed to step 5108, which is a reject signal. Outputs 0. Although the value of the threshold T is assumed to be 0.1 for convenience, this is not essential and may be any value. The above steps complete the recognition process for one pattern.

次に第４図、第５図、第６図を参照して、第２の発明の
パターン認識装置の一実施例について説明する。Next, an embodiment of the pattern recognition device of the second invention will be described with reference to FIGS. 4, 5, and 6.

第４図は、文字認識装置の全体のブロック図を示す。こ
の文字認識装置は、特徴抽出部４０２と、認識部４０３
と、コード生成部４０４とから構成されている。FIG. 4 shows an overall block diagram of the character recognition device. This character recognition device includes a feature extraction section 402 and a recognition section 403.
and a code generation unit 404.

認識対象となる文字パターンは、端子４０１より入力さ
れ、特徴抽出部４０２において特徴ベクトルに変換され
る。この特徴抽出部は、本発明において本質的な問題で
はなく、従来の文字認識装置において用いられる方向特
徴、輪郭の曲率方向特徴などを用いてもよい。特徴抽出
部４０２で抽出されベクトルに変換された特徴データは
、認識部４０３に送られ、認識処理が行われ出力信号に
変換される。この出力信号は、１からＮのうちの認識結
果のカテゴリ番号、またはリジェクトを表す番号Ｏとす
る。Ｎは、認識対象のカテゴリ数である。この出力信号
はコード生成部４０４に送られて、認識結果がＪＩＳコ
ードなどの文字コードに変換され端子４０５より出力さ
れる。A character pattern to be recognized is input from a terminal 401 and converted into a feature vector by a feature extraction unit 402. This feature extraction unit is not an essential problem in the present invention, and the direction feature, contour curvature direction feature, etc. used in conventional character recognition devices may be used. The feature data extracted by the feature extraction unit 402 and converted into a vector is sent to the recognition unit 403, where it undergoes recognition processing and is converted into an output signal. This output signal is the category number of the recognition result from 1 to N, or the number O representing reject. N is the number of categories to be recognized. This output signal is sent to a code generation section 404, and the recognition result is converted into a character code such as a JIS code and outputted from a terminal 405.

認識部４０３の一例を第５図を用いて説明する。An example of the recognition unit 403 will be explained using FIG. 5.

この認識部は、多層神経回路網計算部５０１と、出力デ
ータ記憶部５０２と、最大カテゴリ検出部５０３と、確
信度計算部５０４と、認識結果・リジェクト判定部５０
５とから構成されている。This recognition unit includes a multilayer neural network calculation unit 501, an output data storage unit 502, a maximum category detection unit 503, a certainty calculation unit 504, and a recognition result/rejection determination unit 50.
It consists of 5.

特徴抽出部４０２で求められた特徴データは、多層神経
回路網計算部５０１に転送され、認識処理が行われて出
力ベクトルに変換される。ここでの出力ベクトルは、そ
の要素のそれぞれが認識対象のカテゴリのスコアを表し
ており、出力ベクトルの次元数Ｎは、認識対象とするカ
テゴリ数に等しい。The feature data obtained by the feature extraction unit 402 is transferred to the multilayer neural network calculation unit 501, where it is subjected to recognition processing and converted into an output vector. Each element of the output vector here represents the score of the category to be recognized, and the number of dimensions N of the output vector is equal to the number of categories to be recognized.

多層神経回路網計算部５０１の一例を第６図を用いて説
明する。この例では、３層型のモデルを用いているが、
これは本質的な問題ではなく、２層のもの、４層以上の
モデルを用いた場合、またはそれ以外の結合形式の神経
回路モデルを用いた場合でも容易に拡張が可能である。An example of the multilayer neural network calculation unit 501 will be explained using FIG. 6. In this example, a three-layer model is used, but
This is not an essential problem, and it can be easily extended even when using a two-layer model, a model with four or more layers, or a neural circuit model with other combination types.

この多層神経回路網計算部は、マトリクス積演算部６０
１　と、Ｓ関数部６０２と、中間層記憶部６０３と、マ
トリクス積演算部６０４と、Ｓ−関数部６０５と、第１
層重みマトリクス記憶部６０６と、第２層重みマトリク
ス記憶部６０７とから構成されている。This multilayer neural network calculation unit includes a matrix product calculation unit 60
1, the S function section 602, the intermediate layer storage section 603, the matrix product operation section 604, the S-function section 605, and the first
It is composed of a layer weight matrix storage section 606 and a second layer weight matrix storage section 607.

特徴抽出部４０２から入力されるベクトル値をｏｉ’（
１≦ｉ≦Ｎ、）とし、第１層重みマトリクス記憶部６０
６に保持されるマトリクス値をＷｉｊ（１≦ｉ≦Ｎ６，
１≦ｊ≦Ｎ１）、中間層記憶部６０３に保持されるベク
トル値を　’＝’（１≦ｊ≦Ｎ、）とすると、マトリク
ス積演算部６０１では、式（３）に従って計算を行う。The vector value input from the feature extraction unit 402 is expressed as oi'(
1≦i≦N,), and the first layer weight matrix storage unit 60
Wij(1≦i≦N6,
1≦j≦N1), and the vector value held in the intermediate layer storage unit 603 is '=' (1≦j≦N,), then the matrix product calculation unit 601 performs calculation according to equation (3).

ただし、ここでθＪ１　はバイアス値であり、第１層重
みマトリクス記憶部６０６に記憶される。結果のｏＪ＋
（１≦ｊ≦Ｎ、）は各々、Ｓ関数部６０２において式（
４）に従ってｏ″を求め、中間層記憶部６０３に記憶す
る。However, here, θJ1 is a bias value and is stored in the first layer weight matrix storage unit 606. Result oJ+
(1≦j≦N,) are each expressed by the formula (
4), and store it in the intermediate storage unit 603.

第２層重みマトリクス記憶部６０７に記憶される重みマ
トリクスをｗＪｍ”（１≦ｊ≦Ｎ、１≦に≦Ｎ２）とし
、この記憶部６０７に記憶されるバイアス値をθ−とす
るとマトリクス積演算部６０４では、式（５）を実現す
る計算を行う。結果の。−（ｌ≦に≦Ｎ、）は各々、Ｓ
関数部６０５において式（６）に従った計算によりｏ′
１を求め、出力信号として出力され、出力データ記憶部
５０２に記憶される。If the weight matrix stored in the second layer weight matrix storage section 607 is wJm'' (1≦j≦N, 1≦≦N2), and the bias value stored in this storage section 607 is θ-, matrix product operation is performed. The unit 604 performs calculations to realize equation (5).
In the function section 605, o' is calculated according to equation (6).
1 is calculated, outputted as an output signal, and stored in the output data storage section 502.

ここで用いられる重みマトリクスは予め学習により決定
されているものとする。また、Ｓ関数として式（４）１
式（６）を用いた場合には、関数の出力値は０から１の
間の小数値となる。It is assumed that the weight matrix used here has been determined in advance through learning. Also, as the S function, equation (4) 1
When formula (6) is used, the output value of the function is a decimal value between 0 and 1.

出力データ記憶部５０２から読み出されたベクトパター
ンは最大カテゴリ検出部５０３に送られて最大値をもつ
要素が求められ、その要素の表すカテゴリ番号を認識結
果・リジェクト判定部５０５に送る。The vector pattern read from the output data storage unit 502 is sent to the maximum category detection unit 503 to find the element with the maximum value, and the category number represented by that element is sent to the recognition result/rejection determination unit 505.

また、出力データ記憶部５０２から読み出されたベクト
ルデータは確信度計算部５０４にも送られる。The vector data read from the output data storage unit 502 is also sent to the certainty calculation unit 504.

確信度計算部５０４では、式（１）に従った演算により
確信度Ｒを求める。The reliability calculation unit 504 calculates the reliability R by calculation according to equation (1).

確信度計算部５０４で得られた値Ｒは認識結果リジェク
ト判定部５０５に送られ、しきい値Ｔと比較され、しき
い値Ｔよりも大きい場合には認識結果のカテゴリ番号が
コード生成部４０４へ転送され、確信度Ｒがしきい値Ｔ
よりも小さい場合には、リジェクト信号がコード生成部
４０４へ転送される。The value R obtained by the certainty calculation unit 504 is sent to the recognition result rejection determination unit 505 and compared with a threshold T. If it is larger than the threshold T, the category number of the recognition result is changed to the code generation unit 404. and the confidence R is the threshold value T
If it is smaller than , a reject signal is transferred to code generation section 404 .

ここでは、カテゴリ番号として１からＮ（Ｎはカテゴリ
数）までの数値、リジェクト信号として数値０とする。Here, the category number is a numerical value from 1 to N (N is the number of categories), and the reject signal is a numerical value 0.

次に、第３の発明の一実施例である文字認識装置を説明
する。Next, a character recognition device which is an embodiment of the third invention will be described.

文字認識装置全体の構成は第３図に従い、認識部の構成
が第２の発明と異なる。本実施例における認識部７００
を第７図に示す。この認識部７００は、多層神経回路網
計算部７０１と、出力データ記憶部７０２と、データソ
ート部７０３と、最大カテゴリ検出部７０４と、確信度
計算部７０５と、認識結果・リジェクト判定部７０６と
から構成されている。The overall structure of the character recognition device is according to FIG. 3, and the structure of the recognition section is different from that of the second invention. Recognition unit 700 in this embodiment
is shown in Figure 7. The recognition unit 700 includes a multilayer neural network calculation unit 701, an output data storage unit 702, a data sorting unit 703, a maximum category detection unit 704, a certainty calculation unit 705, and a recognition result/rejection determination unit 706. It consists of

入力パターンは、特徴抽出部４０２で、特徴ベクトルに
変換されて多層神経回路網計算部７０１に入力され認識
される。多層神経回路網計算部７０１での構成は、前述
の第６図での多層神経回路網計算部５０１の構成と同じ
である。そこからのＮ次元の出力データは、出力データ
記憶部７０２に保存される。ここで、Ｎは認識対象のカ
テゴリ数を表す。The input pattern is converted into a feature vector by the feature extraction unit 402 and input to the multilayer neural network calculation unit 701 for recognition. The configuration of the multilayer neural network calculation section 701 is the same as the configuration of the multilayer neural network calculation section 501 shown in FIG. 6 described above. The N-dimensional output data therefrom is stored in the output data storage unit 702. Here, N represents the number of categories to be recognized.

出力データ記憶部７０２から読み出されたデータは、デ
ータソート部７０３において大きい値から順に並べ直さ
れる。最大値を持つカテゴリ番号は最大カテゴリレジス
タ７０４に記憶される。The data read from the output data storage section 702 is rearranged in the data sorting section 703 in descending order of value. The category number with the maximum value is stored in maximum category register 704.

データソート部７０３から出力された大きい値のものか
ら順にＭ個の値は、確信度計算部７０５に入力され、・　・　・（８）に従った計算が実行される。ここで、ＭはＮに比べて小
さな数とし、実施例では、手書き数字認識ではＭを５と
し、英数字を認識対象とする場合にはＭを１０とした。The M values output from the data sorting unit 703 are input to the certainty calculation unit 705 in descending order of value, and the calculation according to (8) is executed. Here, M is a smaller number than N; in the embodiment, M is set to 5 for handwritten numeric recognition, and M is set to 10 when alphanumeric characters are to be recognized.

このＭの値は、本質的なものでなく他の値でもよい、ま
た、Ｐｉはデータソート部７０３の出力値で、ｉが小さ
いほどＰｉは大きな値をとる。α、βはそれぞれ１とし
た。The value of M is not essential and may be any other value. Pi is the output value of the data sorting section 703, and the smaller i is, the larger Pi takes. α and β were each set to 1.

確信度計算部７０５で計算された確信度の値は、認識結
果・リジェクト判定部７０６に送られて、予め設定され
たしきい値Ｔよりも大きい場合には、認識結果として最
大カテゴリレジスタ内のカテゴリ番号を、さもない場合
にはりジエクト信号をコード生成部４０４に転送し、こ
こで認識結果の文字コードが生成され出力される。ここ
で、しきい値Ｔを０．１としたが、これ以外の数値でも
実現は可能である。The confidence value calculated by the confidence calculation unit 705 is sent to the recognition result/rejection determination unit 706, and if it is larger than a preset threshold T, the confidence value calculated in the maximum category register is determined as the recognition result. The category number or, if not, the input signal is transferred to the code generation section 404, where the character code of the recognition result is generated and output. Here, the threshold value T is set to 0.1, but other values can also be used.

〔Effect of the invention〕

以上の説明から明らかなように、本発明のパターン認識
方法および装置では、神経回路網での認識部から出力さ
れた各カテゴリのスコアが、その値の取り得る範囲を（
ａ、ｂ）とした場合に、１つの要素だけがａの値を取り
、他の要素全てがｂの値を取った場合に明確にａの値を
取ったカテゴリが認識結果として正しいものと出力され
る。全カテゴリのうちでの最大値が、範囲の上限ａから
離れている場合や、最大値を取るカテゴリに、近い値を
とるカテゴリが複数個存在する場合に、確信度は減少し
、誤認識を起こしている確率の高い認識結果として、こ
の認識結果をリジェクトする。As is clear from the above description, in the pattern recognition method and apparatus of the present invention, the score of each category output from the recognition unit in the neural network has a range of possible values (
a, b), if only one element takes the value a and all other elements take the value b, then the category that clearly takes the value a is output as the correct recognition result. be done. When the maximum value among all categories is far from the upper limit a of the range, or when there are multiple categories whose values are close to the category that takes the maximum value, the confidence level decreases and false recognition can be avoided. This recognition result is rejected as a recognition result with a high probability of occurrence.

リジェクトされた場合には、他の認識方法を起動したり
、人手によって認識結果を修正するなどして認識結果の
信軌度を高めることができる。If the recognition result is rejected, the reliability of the recognition result can be increased by starting another recognition method or manually correcting the recognition result.

[Brief explanation of the drawing]

第１図は、本願第１の発明のパターン認識方法の説明図
、第２図は、特徴抽出部として、ガウスフィルタを用いた
場合の例を示す図、第３図は、３層構造の神経回路網の計算方法の説明図、第４図は、文字認識装置のブロック図、第５図は、認識
部の一例を示すブロック図、第６図は、多層神経回路網
計算部の一例を示すブロック図、第７図は、認識部の他の例を示すブロック図である。４０２　・・・・・特徴抽出部４０３　・・・・・認識部４０４　　・・・・・コード生成部５０１、７０１・・・多層神経回路網計算部５０２、７
０２・・・出力データ記憶部５０３　・・・・・最大カ
テゴリ検出部５０４、７０５・・・確信度計算部５０５、７０６・・・認識結果・リジェクト判定部７０
３　　・・・・・データソート部７０４　　・・・・・最大カテゴリレジスタ代理人　弁
理士　　岩　佐　　義　幸禅２図馬１図糸３図ｖｉ）６図第５図糸７図FIG. 1 is an explanatory diagram of the pattern recognition method of the first invention of the present application, FIG. 2 is a diagram showing an example when a Gaussian filter is used as the feature extraction section, and FIG. 3 is a diagram showing a three-layer neural structure. 4 is a block diagram of a character recognition device; FIG. 5 is a block diagram showing an example of a recognition unit; and FIG. 6 is an example of a multilayer neural network calculation unit. Block Diagram FIG. 7 is a block diagram showing another example of the recognition unit. 402 ... Feature extraction section 403 ... Recognition section 404 ... Code generation section 501, 701 ... Multilayer neural network calculation section 502, 7
02...Output data storage unit 503...Maximum category detection unit 504, 705...Confidence calculation unit 505, 706...Recognition result/rejection determination unit 70
3 ... Data sorting section 704 ... Maximum category register agent Patent attorney Yoshi Iwasa Kozen 2 Figures Horse 1 Figure Thread 3 Figure vi) Figure 6 Figure 5 Thread 7

Claims

[Claims]

(1) In a pattern recognition method using a multilayered neural network, input patterns or features extracted from the input patterns are input, are converted into output vectors representing categories by the multilayered neural network, and are converted into output vectors representing categories. From the output vector, 1
Calculate the confidence such that the confidence is maximum when only one element is near the upper bound of the range and the other elements are near the lower bound of the range, and compare that confidence with a threshold A pattern recognition method characterized in that the output result is recognized as a recognition result when the confidence level is larger than a threshold value, and is rejected when the confidence level is smaller than the threshold value.

(2) In a pattern recognition device using a multilayered neural network, means for realizing a neural network that converts input data into an output vector signal representing a category, and a method for determining a category having the maximum value among the output vector signals. means for detecting the output vector signal; and a means for determining that the output of the maximum value detection unit is the recognition result when the confidence level is greater than the threshold value, and as a rejection when the confidence level is less than the threshold value, and the recognition result or A pattern recognition device characterized by outputting a reject signal.

(3) In a pattern recognition device using a multilayered neural network, means for realizing a neural network that converts input data into an output vector signal representing a category, a means for sorting the output signals in descending order, and sorting. means for storing the category of the maximum value of the sorted result, and among a fixed number of high-order elements of the sorted output vector signal, only the first-ranked element is near the upper limit of the range, and the other elements means for calculating the confidence so that the confidence is maximized when the confidence is near the lower limit of the value range, and when the confidence is greater than the threshold value, the content of the maximum value category storage unit is used as the recognition result; What is claimed is: 1. A pattern recognition device comprising means for determining a pattern as a reject when the degree of certainty is less than a threshold, and outputting a recognition result or a rejection signal.