JPS6335998B2

JPS6335998B2 -

Info

Publication number: JPS6335998B2
Application number: JP57178481A
Authority: JP
Inventors: Hisanori Kanezashi; Kunio Akiba; Takao Irumano
Original assignee: DENSHI KEISANKI KIPPON GIJUTSU KENKYU KUMIAI
Current assignee: DENSHI KEISANKI KIPPON GIJUTSU KENKYU KUMIAI
Priority date: 1982-10-13
Filing date: 1982-10-13
Publication date: 1988-07-18
Also published as: JPS5968795A

Description

[Detailed description of the invention]

（産業上の利用分野）本発明は、入力音声に対し、まず音素認識を行
ない、この認識音素系列を音素表記された単語辞
書中の各単語と照合して単語を認識する単語音声
認識方法に関するものである。（従来例の構成とその問題点）従来のこの種の単語音声認識方法について第１
図とともに説明する。第１図に示すように、入力単語音声を分析し、
この入力単語音声の特徴を抽出して入力単語音声
を構成する音素を認識し、この認識された音素系
列を音素表記された単語辞書中の各単語とコンフ
ユージヨンマトリツクス（以下、C.M.と略記す
る）を用いて照合し、尤度を計算し、尤度の大き
い単語を認識単語とするものである。ここで、第
１表は音素表記された単語辞書の一例を示してお
り、また、第２表は単語辞書の音素表記法の一例
を示している。第１表に示したように、単語辞書における無声
化母音に関する音素表記は、日本語の性質によ
り、無声子音に挾まれた場合の母音は、無声化す
るとして、無声子音に挾まれた母音のみを無声化
母音として表記し、その他の場合の母音は有声母
音として表記していた。従来例では、音素認識段
階に (Industrial Application Field) The present invention relates to a word speech recognition method that first performs phoneme recognition on input speech and then recognizes the word by comparing the recognized phoneme sequence with each word in a word dictionary in which phonemes are expressed. It is something. (Structure of conventional example and its problems) First, regarding this type of conventional word speech recognition method
This will be explained with figures. As shown in Figure 1, the input word speech is analyzed,
The features of this input word sound are extracted to recognize the phonemes that make up the input word sound, and this recognized phoneme sequence is combined with each word in the word dictionary in which the phoneme is expressed and a confusion matrix (hereinafter abbreviated as CM). ), the likelihood is calculated, and the word with the highest likelihood is selected as the recognized word. Here, Table 1 shows an example of a word dictionary in phoneme notation, and Table 2 shows an example of the phoneme notation method of the word dictionary. As shown in Table 1, due to the nature of the Japanese language, the phoneme representation of unvoiced vowels in word dictionaries is such that vowels that are sandwiched between voiceless consonants are devoiced, and only vowels that are sandwiched between voiceless consonants are devoiced. were written as unvoiced vowels, and vowels in other cases were written as voiced vowels. In the conventional example, at the phoneme recognition stage,

【表】【table】

Claims

[Claims]

1 Perform phoneme recognition on the input speech to obtain a recognized phoneme sequence, compare this recognized phoneme sequence with each word in the word dictionary in which the phoneme is expressed, calculate the likelihood, and recognize the word. When correcting the devoicing of vowels in a phoneme series, the notation of the devoiced vowel to be inserted is distinguished between cases where there are consecutive unvoiced consonants and cases where a voiceless consonant and a voiced consonant follow, and the vowels of each word in the word dictionary are A word speech recognition method characterized in that a phoneme signal to be written is divided into three categories: a normally uttered voiced vowel, a devoiced vowel, and a vowel that is easily devoiced.