JPS6338720B2 - - Google Patents
Info
- Publication number
- JPS6338720B2
- Authority
- JP
- Japan
- Prior art keywords
- phoneme
- zero
- word
- dictionary
- matrix
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Description
(Industrial Application Field)
The present invention relates to a word speech recognition method that recognizes a word by using a confusion matrix to calculate the likelihood between the phoneme sequence recognized from input speech and the dictionary phoneme sequence of each entry in a phoneme-transcribed word dictionary.
(Structure of the Conventional Example and Its Problems)
A conventional word speech recognition method is described with reference to FIG. 1. As shown in FIG. 1, the input speech is first analyzed, its features are extracted, and the phonemes that make up the input speech are recognized. The recognized phoneme sequence is then matched against the dictionary phoneme sequence of each entry in the word dictionary. The likelihood between the two phoneme sequences is calculated by obtaining the recognition probability of each phoneme from a confusion matrix between phonemes (hereinafter abbreviated as C.M.), and the dictionary entry that maximizes the likelihood between the phoneme sequences is taken as the recognized word.
Table 1 shows an example of a word dictionary used in this word speech recognition method; each word is written according to the phoneme notation shown in Table 2. FIG. 2 shows part of the C.M. In FIG. 2, the vertical axis ...
[Table]
[Table]
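The conventional matching step described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function names, the toy dictionary, and the confusion matrix values are all hypothetical, and the sketch assumes the recognized and dictionary sequences have equal length and are aligned one to one (the text does not say how insertions or deletions are handled). The product of per-phoneme recognition probabilities is accumulated as a sum of logarithms.

```python
import math

def sequence_log_likelihood(recognized, dictionary_seq, cm):
    """Sum of log P(recognized phoneme | dictionary phoneme) over aligned positions."""
    # Assumption: only equal-length sequences are compared (no insertions/deletions).
    if len(recognized) != len(dictionary_seq):
        return -math.inf
    total = 0.0
    for rec, ref in zip(recognized, dictionary_seq):
        p = cm.get(ref, {}).get(rec, 0.0)
        if p <= 0.0:
            # A zero entry rules this dictionary entry out entirely.
            return -math.inf
        total += math.log(p)
    return total

def recognize_word(recognized, dictionary, cm):
    """Return the dictionary entry with the maximum likelihood, and its log score."""
    best_word, best_score = None, -math.inf
    for word, phonemes in dictionary.items():
        score = sequence_log_likelihood(recognized, phonemes, cm)
        if score > best_score:
            best_word, best_score = word, score
    return best_word, best_score

# Hypothetical toy data: CM[dictionary phoneme][recognized phoneme] = probability.
CM = {
    "a": {"a": 0.9, "o": 0.1},
    "o": {"o": 0.8, "a": 0.2},
    "k": {"k": 0.7, "t": 0.3},
    "t": {"t": 0.6, "k": 0.4},
}
DICTIONARY = {
    "aka": ["a", "k", "a"],
    "oto": ["o", "t", "o"],
    "ato": ["a", "t", "o"],
}

word, score = recognize_word(["a", "t", "a"], DICTIONARY, CM)
print(word, math.exp(score))
```

With these toy values, the recognized sequence a-t-a scores 0.9 × 0.3 × 0.9 against "aka", 0.9 × 0.6 × 0.2 against "ato", and 0.2 × 0.6 × 0.2 against "oto", so "aka" is selected.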
Claims (1)
1. A word speech recognition method in which a word is recognized by calculating the likelihood between the recognized phoneme sequence of input speech and the dictionary phoneme sequence of each dictionary entry of a phoneme-transcribed word dictionary, using a confusion matrix that indicates the probability with which each phoneme in the word dictionary is recognized as each phoneme, the method being characterized in that the confusion matrix is created in advance from analysis results or phoneme recognition results of word speech data whose correct answers are known, and in that, among the elements of the confusion matrix with a low probability of occurrence, an element that cannot occur given the nature of speech is assigned a probability of zero or a value that can be regarded as zero, while an element that can occur given the nature of speech is assigned a finite value that is close to zero but not zero.
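One possible reading of the matrix construction in claim 1 is sketched below. The claim does not give a threshold for "low probability of occurrence", a specific near-zero value, or a way to decide which confusions can occur, so `low_threshold`, `floor`, and the `plausible` set are hypothetical parameters introduced only for illustration. Taking the larger of the estimate and the floor, and leaving rows unnormalized afterwards, are also assumptions of this sketch.

```python
from collections import Counter, defaultdict

def build_confusion_matrix(pairs, phonemes, plausible,
                           low_threshold=0.01, floor=1e-3):
    """Estimate cm[ref][rec] = P(recognized as rec | dictionary phoneme ref).

    pairs     : (dictionary phoneme, recognized phoneme) pairs from labeled data
    plausible : set of (ref, rec) confusions assumed able to occur in speech
    """
    counts = defaultdict(Counter)
    for ref, rec in pairs:
        counts[ref][rec] += 1

    cm = {}
    for ref in phonemes:
        total = sum(counts[ref].values())
        row = {}
        for rec in phonemes:
            p = counts[ref][rec] / total if total else 0.0
            if p < low_threshold:
                if (ref, rec) in plausible:
                    # Can occur: keep a small but nonzero probability.
                    p = max(p, floor)
                else:
                    # Cannot occur given the nature of speech: force to zero.
                    p = 0.0
            row[rec] = p
        cm[ref] = row
    return cm

# Hypothetical labeled data: "a" and "o" are assumed to be confusable vowels.
PAIRS = ([("a", "a")] * 9 + [("a", "o")] * 1
         + [("o", "o")] * 10 + [("k", "k")] * 10)
PHONEMES = ["a", "o", "k"]
PLAUSIBLE = {("a", "o"), ("o", "a")}

cm = build_confusion_matrix(PAIRS, PHONEMES, PLAUSIBLE)
print(cm["o"]["a"], cm["a"]["k"])
```

In this toy data the confusion of "o" as "a" was never observed but is marked plausible, so it receives the floor value rather than zero, whereas "a" recognized as "k" is treated as impossible and stays at zero.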
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP57187542A JPS5978399A (en) | 1982-10-27 | 1982-10-27 | Recognition of word voice |
Publications (2)
Publication Number | Publication Date |
---|---|
JPS5978399A (en) | 1984-05-07 |
JPS6338720B2 (en) | 1988-08-01 |
Family
ID=16207904
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP57187542A Granted JPS5978399A (en) | 1982-10-27 | 1982-10-27 | Recognition of word voice |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS5978399A (en) |
Also Published As
Publication number | Publication date |
---|---|
JPS5978399A (en) | 1984-05-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JPS6338720B2 (en) | ||
JPH0126080B2 (en) | ||
JPS6411959B2 (en) | ||
JPS6310439B2 (en) | ||
JPH0158519B2 (en) | ||
JPS6281699A (en) | Forming and updating method for dictionary for voice word processor | |
JPH0158520B2 (en) | ||
JPS6310438B2 (en) | ||
JP3033132B2 (en) | Language processor | |
JP2656239B2 (en) | Speech recognition learning method | |
JPS6320359B2 (en) | ||
JPS59185400A (en) | Monosyllable sound recognition system | |
JPS61149997A (en) | Voice recognition equipment | |
JPS62111292A (en) | Voice recognition equipment | |
JPS62245295A (en) | Specified speaker's voice recognition equipment | |
JPS6335998B2 (en) | ||
JPS62217297A (en) | Word voice recognition equipment | |
JPS60159798A (en) | Voice recognition equipment | |
JPS60115993A (en) | Monosyllabic voice recognition equipment | |
JPS60159899A (en) | Voice recognition equipment with learning function | |
JPS60149099A (en) | Voice recognition | |
JPS60241097A (en) | Voice recognition applying equipment | |
JPH04180097A (en) | Word voice recognition device | |
JPS5939759B2 (en) | voice recognition device | |
JPS62218997A (en) | Word voice recognition equipment |