JPS6338720B2 - - Google Patents

Info

Publication number
JPS6338720B2
JPS6338720B2 JP57187542A JP18754282A JPS6338720B2 JP S6338720 B2 JPS6338720 B2 JP S6338720B2 JP 57187542 A JP57187542 A JP 57187542A JP 18754282 A JP18754282 A JP 18754282A JP S6338720 B2 JPS6338720 B2 JP S6338720B2
Authority
JP
Japan
Prior art keywords
phoneme
zero
word
dictionary
matrix
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP57187542A
Other languages
Japanese (ja)
Other versions
JPS5978399A (en
Inventor
Takao Irumano
Kunio Akiba
Hisanori Kanezashi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DENSHI KEISANKI KIPPON GIJUTSU KENKYU KUMIAI
Original Assignee
DENSHI KEISANKI KIPPON GIJUTSU KENKYU KUMIAI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by DENSHI KEISANKI KIPPON GIJUTSU KENKYU KUMIAI filed Critical DENSHI KEISANKI KIPPON GIJUTSU KENKYU KUMIAI
Priority to JP57187542A priority Critical patent/JPS5978399A/en
Publication of JPS5978399A publication Critical patent/JPS5978399A/en
Publication of JPS6338720B2 publication Critical patent/JPS6338720B2/ja
Granted legal-status Critical Current

Links

Description

【発明の詳細な説明】[Detailed description of the invention]

(産業上の利用分野) 本発明は、入力音声の認識された音素系列と音
素表記された単語辞書の各辞書項目の辞書音素系
列との尢度を、コンフユージヨンマトリクスを用
いて計算して単語を認識する単語音声認識方法に
関するものである。 (従来例の構成とその問題点) 従来の単語音声認識方法を第1図とともに説明
する。第1図に示すように、入力音声に対して先
ず分析を行ない、この入力音声の特徴を抽出して
入力音声を構成する音素を認識する。この認識さ
れた音素系列を、単語辞書中の各辞書項目の辞書
音素系列と照合し、2つの音素系列間の尤度を、
音素間のコンフユージヨンマトリクス
(Confusion Matrix、以下C.M.と略す)を用い
て、各音素毎の認識確率を求めることにより算出
し、音素系列間の尤度が最大となる辞書項目をも
つて認識単語とするものである。 第1表は、前記単語音声認識方法に用いる単語
辞書の一例を示しており、各単語は第2表に示す
音素表記法に従つて表記されている。第2図は前
記C.M.の一部を示す。第2図において、縦は単
(Industrial Application Field) The present invention calculates the degree of similarity between the recognized phoneme sequence of input speech and the dictionary phoneme sequence of each dictionary entry in a word dictionary in which phonemes are expressed, using a confusion matrix. The present invention relates to a word speech recognition method for recognizing words. (Structure of conventional example and its problems) A conventional word speech recognition method will be explained with reference to FIG. As shown in FIG. 1, input speech is first analyzed, features of the input speech are extracted, and phonemes making up the input speech are recognized. This recognized phoneme sequence is compared with the dictionary phoneme sequence of each dictionary entry in the word dictionary, and the likelihood between the two phoneme sequences is calculated as follows:
It is calculated by finding the recognition probability for each phoneme using a Confusion Matrix (hereinafter abbreviated as CM) between phonemes, and the recognition word is determined by finding the dictionary entry that has the maximum likelihood between phoneme sequences. That is. Table 1 shows an example of a word dictionary used in the word speech recognition method, and each word is written according to the phoneme notation shown in Table 2. FIG. 2 shows a part of the CM. In Figure 2, vertical is a single

【表】【table】

【表】【table】

Claims (1)

【特許請求の範囲】[Claims] 1 入力音声の認識された音素系列と音素表記さ
れた単語辞書の各辞書項目の辞書音素系列との尤
度を、単語辞書中の各音素がどのような音素に認
識されるかの確率を示すコンフユージヨンマトリ
クスを用いて計算して単語を認識するに際し、コ
ンフユージヨンマトリクスを予め正解のわかつて
いる単語音声データの分析結果あるいは音素認識
結果を用いて作成し、かつ、そのコンフユージヨ
ンマトリクスの要素のうち出現確率の低い要素に
ついては、その要素が音声の性質上生じ得ないも
のの場合には確率零または零とみなし得る値を与
え、その要素が音声の性質上生じ得るものの場合
には零に近いが零でない有限の値を与えてなるコ
ンフユージヨンマトリクスを用いることを特徴と
する単語音声認識方法。
1. The likelihood between the recognized phoneme sequence of the input speech and the dictionary phoneme sequence of each dictionary item in the word dictionary with phoneme notation indicates the probability of what kind of phoneme each phoneme in the word dictionary will be recognized as. When recognizing words by calculation using a conflation matrix, the conflation matrix is created using the analysis results of word audio data or the phoneme recognition results for which the correct answer is known in advance, and the confusion matrix is For elements with a low probability of occurrence, if the element cannot occur due to the nature of the voice, a value that can be considered as zero or zero is given to the probability, and if the element can occur due to the nature of the voice, a value that can be considered zero is given. A word speech recognition method characterized by using a conflation matrix that is given a finite value that is close to zero but not zero.
JP57187542A 1982-10-27 1982-10-27 Recognition of word voice Granted JPS5978399A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57187542A JPS5978399A (en) 1982-10-27 1982-10-27 Recognition of word voice

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57187542A JPS5978399A (en) 1982-10-27 1982-10-27 Recognition of word voice

Publications (2)

Publication Number Publication Date
JPS5978399A JPS5978399A (en) 1984-05-07
JPS6338720B2 true JPS6338720B2 (en) 1988-08-01

Family

ID=16207904

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57187542A Granted JPS5978399A (en) 1982-10-27 1982-10-27 Recognition of word voice

Country Status (1)

Country Link
JP (1) JPS5978399A (en)

Also Published As

Publication number Publication date
JPS5978399A (en) 1984-05-07

Similar Documents

Publication Publication Date Title
JPS6338720B2 (en)
JPH0126080B2 (en)
JPS6411959B2 (en)
JPS6310439B2 (en)
JPH0158519B2 (en)
JPS6281699A (en) Forming and updating method for dictoinary for voice word processor
JPH0158520B2 (en)
JPS6310438B2 (en)
JP3033132B2 (en) Language processor
JP2656239B2 (en) Speech recognition learning method
JPS6320359B2 (en)
JPS59185400A (en) Monosyllable sound recognition system
JPS61149997A (en) Voice recognition equipment
JPS62111292A (en) Voice recognition equipment
JPS62245295A (en) Specified speaker's voice recognition equipment
JPS6335998B2 (en)
JPS62217297A (en) Word voice recognition equipment
JPS60159798A (en) Voice recognition equipment
JPS60115993A (en) Monosyllabic voice recognition equipment
JPS60159899A (en) Voice recognition equipment with learning function
JPS60149099A (en) Voice recognition
JPS60241097A (en) Voice recognition applying equipment
JPH04180097A (en) Word voice recognition device
JPS5939759B2 (en) voice recognition device
JPS62218997A (en) Word voice recognition equipment