JPH08254991A - Pattern recognition device - Google Patents

Pattern recognition device

Info

Publication number
JPH08254991A
JPH08254991A JP7056396A JP5639695A JPH08254991A JP H08254991 A JPH08254991 A JP H08254991A JP 7056396 A JP7056396 A JP 7056396A JP 5639695 A JP5639695 A JP 5639695A JP H08254991 A JPH08254991 A JP H08254991A
Authority
JP
Japan
Prior art keywords
recognition
pattern
unit
threshold value
patterns
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP7056396A
Other languages
Japanese (ja)
Inventor
Hiroshi Isshiki
浩 一色
Kenichi Taki
賢一 滝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Zosen Corp
Original Assignee
Hitachi Zosen Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Zosen Corp filed Critical Hitachi Zosen Corp
Priority to JP7056396A priority Critical patent/JPH08254991A/en
Publication of JPH08254991A publication Critical patent/JPH08254991A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE: To provide a pattern recognition device capable of obtaining threshold values rationally and improving a recognition rate. CONSTITUTION: This device is provided with a threshold value setting part 6 setting threshold values by calculating threshold values for every pattern from the statistical quantity of the mean value and the standard deviation of the adaptabilities of patterns at the time of recognizing patterns previously registered in a storage part 5 in first order correctly and a recognition discriminating means 8 outputting a signal that an errorneous recognition is present when the adaptabilities of the patterns shown in the first order as the recognition result by a recognition means 7 are less than the threshold values is provided in a pattern matching part 9. Since the signal that the errorneous recognition is present is outputted when the adaptabilities of patterns shown in the first order as the result of the recognition, patterns in which probabilities of errorneous recognitions are high arc actively eliminated and, thus, the reliability of a pattern recognition is largely improved and, moreover, a recognition rate is improved because other recognition means and a human can perform the backup of the pattern recognition device.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、情報処理技術における
音声認識、画像認識などのパターン認識装置、特にその
誤認識を防止する技術に関するものである。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a pattern recognition apparatus such as voice recognition and image recognition in information processing technology, and more particularly to a technology for preventing erroneous recognition thereof.

【0002】[0002]

【従来の技術】従来、音声認識装置に限らず、パターン
認識装置における誤認識を防止する有効な方法として
は、認識装置固有の認識精度を上げる第1の方法や、認
識対象となるパターンを限定し、かつしきい値処理を行
う第2の方法がある。
2. Description of the Related Art Conventionally, as an effective method for preventing erroneous recognition not only by a voice recognition device but also by a pattern recognition device, a first method for increasing recognition accuracy peculiar to the recognition device and a pattern to be recognized are limited. And there is a second method of thresholding.

【0003】[0003]

【発明が解決しようとする課題】しかし、上記第1の方
法では、膨大な計算量が必要となることから実現が難し
く、よって確実に誤認識を防止することは難しく、また
第2の方法では、しきい値の取りかたに関する合理的な
規準がなく、認識率を向上させることはできないという
問題があった。
However, the first method is difficult to realize because it requires a huge amount of calculation, and thus it is difficult to reliably prevent erroneous recognition, and the second method is difficult. However, there is a problem that the recognition rate cannot be improved because there is no rational standard regarding how to set the threshold.

【0004】本発明は上記問題を解決するものであり、
しきい値を合理的に求め、認識率を改善できるパターン
認識装置を提供することを目的とするものである。
The present invention solves the above problems,
An object of the present invention is to provide a pattern recognition device that can reasonably obtain a threshold value and improve the recognition rate.

【0005】[0005]

【課題を解決するための手段】上記問題を解決するため
本発明のパターン認識装置は、パターン認識を行う信号
から特徴パラメータを抽出する特徴パラメータ分析部
と、この特徴パラメータ分析部による分析の結果得られ
たパターンと、予め登録されたパターンとの適合度を求
めてパターンの認識を行うパターンマッチング部を備
え、予め前記登録パターンを正しく第1順位に認識する
際のパターンの適合度の平均値と標準偏差の統計量から
各パターン毎のしきい値を演算して設定するしきい値設
定部を設け、前記パターンマッチング部に、認識の結
果、第1順位として示されたパターンの適合度が前記し
きい値未満のとき、誤認識ありの信号を出力する手段を
付加したことを特徴とするものである。
In order to solve the above problems, a pattern recognition apparatus of the present invention includes a characteristic parameter analyzing section for extracting characteristic parameters from a signal for pattern recognition, and a result of analysis by the characteristic parameter analyzing section. A pattern matching unit for recognizing the pattern by obtaining the matching degree between the registered pattern and the pre-registered pattern, and an average value of the matching degrees of the patterns when the registered pattern is correctly recognized in the first order in advance. A threshold value setting unit for calculating and setting a threshold value for each pattern from the statistical amount of standard deviation is provided, and the pattern matching unit recognizes the conformity of the pattern shown as the first rank as a result of the recognition. It is characterized in that a means for outputting a signal with erroneous recognition is added when it is less than the threshold value.

【0006】[0006]

【作用】上記構成により、予め登録パターンを正しく第
1順位に認識する際の平均値と標準偏差の統計量から各
パターン毎のしきい値を求め、認識の結果第1順位とし
て示されたパターンの適合度が前記しきい値未満のと
き、誤認識ありの信号を出力する。
With the above structure, the threshold value for each pattern is obtained from the statistical value of the average value and the standard deviation when the registered pattern is correctly recognized in the first rank, and the pattern shown as the first rank as a result of recognition. When the degree of conformity of is less than the threshold value, a signal with erroneous recognition is output.

【0007】[0007]

【実施例】以下、本発明の一実施例を図面に基づいて説
明する。図1は、本発明のパターン認識装置の一例であ
る音声認識装置の構成図である。
An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of a voice recognition device which is an example of the pattern recognition device of the present invention.

【0008】図示するように、マイクロホンなどの音声
入力部1から入力された音声信号は、前処理部2へ入力
される。この前処理部2は、ノイズフィルタと音声信号
が一定しきい値以上であることにより単語区間を検出す
るレベル検出回路などからなり、雑音除去の他に単語区
間の検出やプリエンファシスなどの認識に先立つ準備の
ための処理が行われ、音響分析部3へ出力される。音響
分析部3は、帯域フィルタ群、A/D変換器、マイクロ
コンピュータからなり、スペクトル分析が行われ、特徴
パラメータが抽出され、マイクロコンピュータからなる
認識部4へ送られる。
As shown in the figure, the audio signal input from the audio input unit 1 such as a microphone is input to the preprocessing unit 2. The pre-processing unit 2 is composed of a noise filter and a level detection circuit that detects a word section when the voice signal is equal to or higher than a certain threshold, and is used for word section detection and recognition of pre-emphasis in addition to noise removal. The preceding preparation process is performed and output to the acoustic analysis unit 3. The acoustic analysis unit 3 includes a band-pass filter group, an A / D converter, and a microcomputer, performs spectrum analysis, extracts characteristic parameters, and sends the characteristic parameters to the recognition unit 4 including a microcomputer.

【0009】認識部4は、予め各同一の単語についてサ
ンプルされた複数のパターン(たとえば、男性、女性、
幼児の発声や発声の長さが異なるパターン)が登録され
た記憶部5と、予め前記登録パターンを第1順位に認識
する際の適合度の平均値と標準偏差の統計量から各パタ
ーン毎のしきい値を演算して設定するしきい値設定部6
(詳細は後述する)と、2つのパターンの適合度を求め
る認識手段7および後述する認識判定手段8からなり、
単語を認識し、その認識結果を出力するとともに、誤認
識の可能性がある場合はその旨を出力するパターンマッ
チング部9とから構成されている。
The recognition unit 4 includes a plurality of patterns (eg, male, female,
The storage unit 5 in which babies' utterances and patterns having different utterance lengths) are registered, and from the statistical value of the average value of the fitness and the standard deviation when recognizing the registered pattern in the first order in advance, Threshold setting section 6 for calculating and setting thresholds
(Details will be described later), a recognition unit 7 for obtaining the matching degree of two patterns, and a recognition determination unit 8 described later,
The pattern matching unit 9 recognizes a word, outputs the recognition result, and outputs a message to that effect when there is a possibility of erroneous recognition.

【0010】しきい値設定部6における、しきい値設定
方法について図2のフローチャートにしたがって説明す
る。まず、外部よりしきい値設定命令信号を入力すると
(ステップ−1)、No1(n=1)の単語から順に(ス
テップ−2)、同一単語毎に、記憶部5に記憶された各
登録パターン(サンプル)についてその適合度を測定す
る命令を認識手段7へ出力する(ステップ−3)。認識
手段7は、そのパターンが正しく第1順位に認識される
もののパターン適合度をしきい値設定部6へ出力する。
A threshold setting method in the threshold setting unit 6 will be described with reference to the flowchart of FIG. First, when a threshold value setting command signal is input from the outside (step-1), the No1 (n = 1) word is sequentially (step-2), and the registered patterns stored in the storage unit 5 are stored for each same word. An instruction for measuring the conformity of the (sample) is output to the recognition means 7 (step-3). The recognizing means 7 outputs the pattern matching degree to the threshold value setting unit 6 although the pattern is correctly recognized in the first rank.

【0011】しきい値設定部6は表1に示すように、入
力したパターン適合度を記憶し(ステップ−4)、その
統計をとる。
As shown in Table 1, the threshold value setting unit 6 stores the input pattern conformity (step-4) and takes the statistics.

【0012】[0012]

【表1】 [Table 1]

【0013】統計値としては、たとえば、正規分布とみ
なされる場合には、認識手段が正しい第1順位をとるパ
ターンXの適合度Yの統計から求まる平均μx と標準偏
差σ x を演算し(ステップ−5)、さらにαx をパター
ンXに依存する適当な定数として、(μx −αx
σx )をパターンXのしきい値とする演算を行う(ステ
ップ−6)。
The statistical values are, for example, considered to be a normal distribution.
If it is done, the recognition means will take the correct first order.
Average μ obtained from statistics of conformity Y of turn XxAnd standard deviation
Difference σ xIs calculated (step-5), and αxThe putter
As an appropriate constant depending on X, (μx−αx
σx) Is used as the threshold value of pattern X (step
Up-6).

【0014】すなわち、正しく認識されたものの適合度
Yの統計を取ったとき、正規分布に従っているものと仮
定すると、図3に示す(μx −αx ・σx )<Y<(μ
x +αx ・σx )で定義される範囲に95%のデータが
入る。パターンが予め未知な場合にも、正しく認識され
るものは、正規分布に従うはずであり、また誤認識の場
合の適合度はたとえ第1順位と認識されてもより小さい
はずであるから、経験的に決定するαx を導入して、
(μx −αx ・σx )<Yとなる場合は正しい認識と考
えられるとする。すなわち、(μx −αx ・σx )をし
きい値とする。
That is, when the statistics of the goodness of fit Y of the correctly recognized ones are taken and it is assumed that they follow a normal distribution, (μ x −α x · σ x ) <Y <(μ
95% of the data is in the range defined by ( x + α x · σ x ). Even if the pattern is unknown in advance, what is recognized correctly should follow the normal distribution, and the goodness of fit in the case of misrecognition should be smaller even if it is recognized as the first rank. Introducing α x to determine
If (μ x −α x · σ x ) <Y, it is considered that the recognition is correct. That is, the threshold value is (μ x −α x · σ x ).

【0015】全ての単語(N個)についてステップ−3
〜6が実行される(ステップ−7,8)。上記しきい値
設定部6のしきい値設定により、各単語のしきい値(μ
x −αx・σx )が設定される。
Step-3 for all words (N)
~ 6 are executed (steps -7, 8). By setting the threshold value of the threshold value setting unit 6, the threshold value of each word (μ
x −α x · σ x ) is set.

【0016】パターンマッチング部9の認識手段7は、
上記しきい値が求められている状態において、音響分析
部3よりあるパターンが提示されると、この音響分析の
結果得られた単語のパターンと記憶部5に登録された登
録パターンとの適合度を求め、第1順位として示された
パターンXを認識結果として出力し、またその適合度Y
を認識判定手段8へ出力する。
The recognition means 7 of the pattern matching section 9 is
When a certain pattern is presented by the acoustic analysis unit 3 in the state where the threshold value is obtained, the matching degree between the word pattern obtained as a result of the acoustic analysis and the registered pattern registered in the storage unit 5 , The pattern X shown as the first rank is output as a recognition result, and the matching degree Y
Is output to the recognition determination means 8.

【0017】認識判定手段8は、入力した適合度Yが、
しきい値設定部6に記憶された、しきい値(μx −αx
・σx )以上のとき、その認識結果良好を出力し、しき
い値(μx −αx ・σx )未満のとき、誤認識ありを出
力する。
The recognition determining means 8 determines that the input goodness of fit Y is
The threshold value (μ x −α x stored in the threshold value setting unit 6
When σ x ) or more, the recognition result is good, and when it is less than the threshold value (μ x −α x · σ x ), false recognition is output.

【0018】このように、しきい値を統計から求め、誤
認識の可能性があるときに、その可能性を積極的に出力
することにより、信頼性を大きく向上することができ、
さらに他の認識手段や人間が音声認識装置をバックアツ
プすることができ、認識率を改善することができる。
Thus, the reliability can be greatly improved by obtaining the threshold value from the statistics and positively outputting the possibility when there is a possibility of erroneous recognition.
Furthermore, another recognition means or a person can back up the voice recognition device, and the recognition rate can be improved.

【0019】パターンマッチング部9の認識手段7にお
いて、誤認識が発生する2つのタイプとして、 a.登録されたパターン間の誤認識、 b.未登録のパターンを登録パターンと誤認識 があるが、本発明はいずれにも適用可能である。
In the recognition means 7 of the pattern matching section 9, there are two types of erroneous recognition: a. False recognition between registered patterns, b. The unregistered pattern may be erroneously recognized as a registered pattern, but the present invention is applicable to both.

【0020】なお、本実施例では、音声認識装置につい
て記載したが、画像認識装置にも同様に適用することが
可能である。
In the present embodiment, the voice recognition device is described, but the invention can be similarly applied to the image recognition device.

【0021】[0021]

【発明の効果】以上述べたように本発明によれば、予め
登録パターンを正しく第1順位に認識する際の平均値と
標準偏差の統計量から各パターン毎のしきい値を求め、
認識の結果第1順位として示されたパターンの適合度が
前記しきい値未満のとき、誤認識ありの信号を出力する
ことにより、誤認識の可能性の高いものを積極的に排除
でき、信頼性を大きく向上することができ、さらに誤認
識の可能性を積極的に出力することにより、他の認識手
段や人間がパターン認識装置をバックアツプすることが
でき、認識率を改善することができる。
As described above, according to the present invention, the threshold value for each pattern is obtained from the statistical value of the average value and the standard deviation when the registered pattern is correctly recognized in the first order,
When the conformity of the pattern shown as the first rank as a result of recognition is less than the threshold value, a signal with false recognition is output, so that a high probability of false recognition can be positively excluded, and reliability can be improved. The pattern recognition device can be backed up by other recognition means or human by positively outputting the possibility of erroneous recognition, and the recognition rate can be improved. .

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における音声認識装置の構成
図である。
FIG. 1 is a configuration diagram of a voice recognition device according to an embodiment of the present invention.

【図2】同音声認識装置のしきい値設定方法のフローチ
ャートである。
FIG. 2 is a flowchart of a threshold value setting method for the voice recognition device.

【図3】同音声認識装置の同一単語の適合度の分布図で
ある。
FIG. 3 is a distribution diagram of fitness of the same word in the same speech recognition apparatus.

【符号の説明】[Explanation of symbols]

1 音声入力部 2 前処理部 3 音響分析部(特徴パラメータ分析部) 4 認識部 5 記憶部 6 しきい値設定部 7 認識手段 8 認識判定手段 9 パターンマッチング部 1 voice input unit 2 pre-processing unit 3 acoustic analysis unit (feature parameter analysis unit) 4 recognition unit 5 storage unit 6 threshold value setting unit 7 recognition unit 8 recognition determination unit 9 pattern matching unit

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 パターン認識を行う信号から特徴パラメ
ータを抽出する特徴パラメータ分析部と、この特徴パラ
メータ分析部による分析の結果得られたパターンと、予
め登録されたパターンとの適合度を求めてパターンの認
識を行うパターンマッチング部を備え、 予め前記登録パターンを正しく第1順位に認識する際の
パターンの適合度の平均値と標準偏差の統計量から各パ
ターン毎のしきい値を演算して設定するしきい値設定部
を設け、前記パターンマッチング部に、認識の結果、第
1順位として示されたパターンの適合度が前記しきい値
未満のとき、誤認識ありの信号を出力する手段を付加し
たことを特徴とするパターン認識装置。
1. A feature parameter analysis unit for extracting a feature parameter from a signal for pattern recognition, a pattern obtained as a result of the analysis by the feature parameter analysis unit, and a degree of matching between a pattern registered in advance to obtain a pattern. A pattern matching unit for recognizing the pattern is provided, and a threshold value for each pattern is calculated and set from the average value of the goodness of fit of the pattern and the statistical amount of the standard deviation when the registered pattern is correctly recognized in the first order. A threshold value setting unit is provided, and the pattern matching unit is provided with a unit for outputting a signal indicating erroneous recognition when the conformity of the pattern shown as the first rank is less than the threshold value as a result of recognition. A pattern recognition device characterized in that
JP7056396A 1995-03-16 1995-03-16 Pattern recognition device Pending JPH08254991A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP7056396A JPH08254991A (en) 1995-03-16 1995-03-16 Pattern recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP7056396A JPH08254991A (en) 1995-03-16 1995-03-16 Pattern recognition device

Publications (1)

Publication Number Publication Date
JPH08254991A true JPH08254991A (en) 1996-10-01

Family

ID=13026058

Family Applications (1)

Application Number Title Priority Date Filing Date
JP7056396A Pending JPH08254991A (en) 1995-03-16 1995-03-16 Pattern recognition device

Country Status (1)

Country Link
JP (1) JPH08254991A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005326505A (en) * 2004-05-12 2005-11-24 Ntt Docomo Inc Recognition system and recognition method
JP2014145932A (en) * 2013-01-29 2014-08-14 Sogo Keibi Hosho Co Ltd Speaker recognition device, speaker recognition method, and speaker recognition program

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005326505A (en) * 2004-05-12 2005-11-24 Ntt Docomo Inc Recognition system and recognition method
JP4512417B2 (en) * 2004-05-12 2010-07-28 株式会社エヌ・ティ・ティ・ドコモ Recognition system and recognition method
JP2014145932A (en) * 2013-01-29 2014-08-14 Sogo Keibi Hosho Co Ltd Speaker recognition device, speaker recognition method, and speaker recognition program

Similar Documents

Publication Publication Date Title
EP2486562B1 (en) Method for the detection of speech segments
US20070129941A1 (en) Preprocessing system and method for reducing FRR in speaking recognition
US6134527A (en) Method of testing a vocabulary word being enrolled in a speech recognition system
JPH02238495A (en) Time series signal recognizing device
CN105989836B (en) Voice acquisition method and device and terminal equipment
EP1005019B1 (en) Segment-based similarity measurement method for speech recognition
CN112992191B (en) Voice endpoint detection method and device, electronic equipment and readable storage medium
US5159637A (en) Speech word recognizing apparatus using information indicative of the relative significance of speech features
JP2996019B2 (en) Voice recognition device
JPH08254991A (en) Pattern recognition device
CN106599765B (en) Method and system for judging living body based on video-audio frequency of object continuous pronunciation
JP2975772B2 (en) Voice recognition device
CN114582321A (en) Speech recognition accuracy improving method, system, electronic device and storage medium
CN112992174A (en) Voice analysis method and voice recording device thereof
JP2754960B2 (en) Voice recognition device
JPH01159697A (en) Voice recognition apparatus
JP3114757B2 (en) Voice recognition device
JP2001350494A (en) Device and method for collating
JPH0119597B2 (en)
JP3008593B2 (en) Voice recognition device
JP3484559B2 (en) Voice recognition device and voice recognition method
CN116206610A (en) Entrance guard processing method and device based on voiceprint recognition
JP3439602B2 (en) Voice recognition device
JPH05257493A (en) Voice recognizing device
Qiu et al. Text-Independent Speaker Verification Based on 3D Inception-Resnet

Legal Events

Date Code Title Description
FPAY Renewal fee payment (prs date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080620

Year of fee payment: 5

FPAY Renewal fee payment (prs date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090620

Year of fee payment: 6

FPAY Renewal fee payment (prs date is renewal date of database)

Year of fee payment: 6

Free format text: PAYMENT UNTIL: 20090620

FPAY Renewal fee payment (prs date is renewal date of database)

Year of fee payment: 7

Free format text: PAYMENT UNTIL: 20100620

FPAY Renewal fee payment (prs date is renewal date of database)

Year of fee payment: 7

Free format text: PAYMENT UNTIL: 20100620

FPAY Renewal fee payment (prs date is renewal date of database)

Year of fee payment: 8

Free format text: PAYMENT UNTIL: 20110620

FPAY Renewal fee payment (prs date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120620

Year of fee payment: 9

LAPS Cancellation because of no payment of annual fees