KR960030078A

KR960030078A - Speech Recognition Method in a Hidden Markov Modeling (HMM) Speech Recognition System

Info

Publication number: KR960030078A
Application number: KR1019950001401A
Authority: KR
Inventors: 구명완
Original assignee: 조백제; 한국전기통신공사
Priority date: 1995-01-26
Filing date: 1995-01-26
Publication date: 1996-08-17
Also published as: KR0136426B1

Abstract

The present invention relates to a speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system that reduces repeated computation when implementing the Viterbi algorithm, which is essential to the speech recognition process. To provide a speech recognition method that reduces the amount of Viterbi computation by splitting it into a subword-level first calculation and a second calculation, the method comprises: a first step (401 to 404) of determining, after initialization, whether the current frame is the last frame, outputting the recognition result if it is the last frame, and performing the Viterbi first calculation in subword units if it is not; and a second step (405, 406) of performing the Viterbi second calculation in word units to obtain the Viterbi values, then performing language processing, and repeating the last-frame determination of the first step (401 to 404). This greatly reduces the amount of Viterbi computation, so that speech can be recognized in real time.
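The frame loop described in the abstract can be sketched as follows. This is a minimal sketch, not the patent's implementation: the function name recognize and its four callback parameters (first_calc, second_calc, language_step, result) are hypothetical, initialization is assumed to have happened before the call, and the "last frame" test is read here as "no frames remain".

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

Frame = TypeVar("Frame")


def recognize(
    frames: Sequence[Frame],
    first_calc: Callable[[int, Frame], None],
    second_calc: Callable[[int], None],
    language_step: Callable[[int], None],
    result: Callable[[], str],
) -> str:
    """Hypothetical frame loop: first step (401 to 404), then second step (405, 406)."""
    for t, o_t in enumerate(frames):
        # First step: not yet past the last frame, so run the
        # subword-level Viterbi first calculation for this frame.
        first_calc(t, o_t)
        # Second step: word-level Viterbi second calculation,
        # followed by language processing.
        second_calc(t)
        language_step(t)
    # Past the last frame: output the recognition result.
    return result()


# Usage with stand-in callbacks that only record the call order.
trace: List[Tuple] = []
out = recognize(
    ["o0", "o1"],
    first_calc=lambda t, o: trace.append(("first", t, o)),
    second_calc=lambda t: trace.append(("second", t)),
    language_step=lambda t: trace.append(("language", t)),
    result=lambda: "done",
)
```

The trace shows the per-frame ordering: the first calculation always precedes the second calculation, and language processing closes each frame.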

Description

Speech Recognition Method in a Hidden Markov Modeling (HMM) Speech Recognition System

Because this publication discloses only the essential parts, the full text is not included.

FIG. 3 is a configuration diagram of an HMM speech recognition system to which the present invention is applied; FIG. 4 is a flowchart of the speech recognition method according to the present invention; and FIG. 5 is a detailed flowchart of the Viterbi first calculation method according to the present invention.

Claims

1. A speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system, the method being applied to a speech recognition system having: feature extraction means (301) for receiving speech and extracting features; word modeling means (303) for modeling words using a subword model (304) according to the information in a pronunciation dictionary (305); word recognition means (302) for receiving the speech features from the feature extraction means (301) and the word model information from the word modeling means (303) and performing a Viterbi calculation to recognize words; and sentence recognition means (306) for receiving the output of the word recognition means (302) and recognizing a sentence according to the information of a language model (309), the method comprising: a first step (401 to 404) of determining, after initialization, whether the current frame is the last frame, outputting the recognition result if it is the last frame, and performing the Viterbi first calculation in subword units if it is not; and a second step (405, 406) of, after the first step (401 to 404), performing the Viterbi second calculation in word units to obtain the Viterbi values, then performing language processing, and repeating the last-frame determination of the first step (401 to 404).

2. The speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system according to claim 1, wherein the Viterbi first calculation of the first step (401 to 404) is configured to be affected only by the speech feature output value O_t of each frame t and by the corresponding subword sub.

3. The speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system according to claim 1, wherein the Viterbi first calculation of the first step (401 to 404) is performed according to an equation, not reproduced in this text, whose terms are defined as follows: sub is the subword; t is the frame; ji is the state transition; the observation probability is the probability that the speech feature O_t is emitted in frame t on the transition from state j to state i; and the transition probability is the probability of a transition from state j to state i.
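The equation for this claim is not reproduced in the text. One reading consistent with the term definitions above would be the following, assuming log-domain scores (suggested by the later claim that the second calculation is obtained "by adding") and the conventional HMM symbols a for the transition probability and b for the observation probability. The symbols a and b, the superscript sub, and the arguments of First are all assumptions, not the patent's notation.

```latex
\mathrm{First}_{sub}(t, j, i) \;=\; \log a^{sub}_{ji} \;+\; \log b^{sub}_{ji}(O_t)
```

Under this reading, nothing on the right-hand side depends on the word that contains the subword or on earlier frames, which would be consistent with claim 2: the value is fixed once the subword sub and the frame's feature O_t are known.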

4. The speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system according to claim 1, wherein the Viterbi second calculation of the second step (405, 406) is obtained by adding the Viterbi first value to the result of the Viterbi first calculation.

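5. The speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system according to claim 1, wherein the Viterbi second calculation of the second step (405, 406) is performed according to an equation, not reproduced in this text, whose terms are defined as follows: sub is the subword; t is the frame; i is the state; ji is the state transition; the Viterbi value is the value in subword sub, frame t, state i; and First_ denotes the result of the Viterbi first calculation.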
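The split between the two calculations can be illustrated numerically. The sketch below is an assumption-laden reading, not the patent's equations: it works in the log domain, takes the first calculation as log a[j][i] + log b[j][i][o_t], and takes the second calculation as the usual Viterbi maximization over predecessor states j of (previous Viterbi value + first-calculation result). Interpreting the "Viterbi first value" of claim 4 as the previous frame's Viterbi value is also an assumption. The function names, the two-state model, and all probabilities are hypothetical, and transitions between the subwords of a word are left out.

```python
import math
from typing import List

NEG_INF = float("-inf")


def lg(p: float) -> float:
    """Natural log that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0.0 else NEG_INF


def first_calc(log_a: List[List[float]],
               log_b: List[List[List[float]]],
               o_t: int) -> List[List[float]]:
    """first[j][i] = log a[j][i] + log b[j][i][o_t]; uses only the subword and o_t."""
    n = len(log_a)
    return [[log_a[j][i] + log_b[j][i][o_t] for i in range(n)] for j in range(n)]


def second_calc(prev_delta: List[float],
                first: List[List[float]]) -> List[float]:
    """delta[i] = max over j of (prev_delta[j] + first[j][i])."""
    n = len(prev_delta)
    return [max(prev_delta[j] + first[j][i] for j in range(n)) for i in range(n)]


# Hypothetical two-state, left-to-right subword with two discrete symbols.
a = [[0.5, 0.5],
     [0.0, 1.0]]
b = [[[0.8, 0.2], [0.5, 0.5]],   # b[j][i] is a distribution over symbols
     [[0.5, 0.5], [0.1, 0.9]]]
log_a = [[lg(p) for p in row] for row in a]
log_b = [[[lg(p) for p in dist] for dist in row] for row in b]

prev_delta = [0.0, NEG_INF]               # all probability mass starts in state 0
first = first_calc(log_a, log_b, o_t=0)   # computed once per (subword, frame)
delta = second_calc(prev_delta, first)    # cheap add-and-max per word
```

In this reading, first_calc holds the logarithm-and-lookup work and can be reused by every word that contains the subword, while second_calc is only an addition and a maximum per state.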

6. The speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system, wherein the Viterbi first calculation of the first step (401 to 404) comprises: a third step (501) of obtaining the first candidate word among the candidate words corresponding to the current frame; a fourth step (502 to 506) of, after the third step (501), performing the first calculation on every possible subword of the current candidate word based on the output value of the current frame, storing the result in the current subword, and setting the Viterbi first calculation completion flag; and a fifth step (508, 509) of, after the fourth step (502 to 506), repeating the fourth step (502 to 506) up to the last candidate word.

7. The speech recognition method in a Hidden Markov Modeling (HMM) speech recognition system according to claim 6, wherein the fourth step (502 to 506) comprises: a sixth step (502, 503) of obtaining the first subword of the current candidate word and then checking the flag that indicates whether the Viterbi first calculation has been performed for that subword; an eighth step (505 to 507) of, after the sixth step (502, 503), if the first calculation has not been performed, performing the Viterbi first calculation on the current subword, storing the result in the current subword, setting the Viterbi first calculation completion flag of the current subword, and then determining whether the current subword is the last subword; and a ninth step (504) of, after the eighth step (505 to 507), obtaining the next subword and repeating the flag check of the sixth step (502, 503).
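The flag-guarded loop of claims 6 and 7 can be sketched as below. It is a minimal sketch under stated assumptions: the Subword class, the first_pass and reset_flags functions, the toy lexicon, and the toy_score stand-in for the first calculation are all hypothetical. Clearing the flags before each new frame is also an assumption, since the text does not say where the flags are reset.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional


@dataclass
class Subword:
    name: str
    first_done: bool = False              # Viterbi first calculation completion flag
    first_value: Optional[float] = None   # result stored in the subword


def reset_flags(subwords: Iterable[Subword]) -> None:
    """Assumed per-frame reset: the stored result is only valid for one frame."""
    for sub in subwords:
        sub.first_done = False


def first_pass(candidates: List[List[Subword]],
               compute: Callable[[str, float], float],
               o_t: float) -> int:
    """Run the first calculation once per distinct subword; return the call count."""
    calls = 0
    for word in candidates:          # steps 501, 508, 509: walk the candidate words
        for sub in word:             # steps 502, 504: walk the word's subwords
            if sub.first_done:       # step 503: already computed for this frame
                continue
            # Steps 505 to 507: compute, store in the subword, set the flag.
            sub.first_value = compute(sub.name, o_t)
            sub.first_done = True
            calls += 1
    return calls


def toy_score(name: str, o_t: float) -> float:
    """Stand-in for the real first calculation (hypothetical)."""
    return float(len(name)) + o_t


# Three candidate words sharing four subwords (hypothetical lexicon).
inventory = {name: Subword(name) for name in ("k", "ae", "t", "p")}
lexicon = {"cat": ["k", "ae", "t"], "cap": ["k", "ae", "p"], "tap": ["t", "ae", "p"]}
candidates = [[inventory[n] for n in subs] for subs in lexicon.values()]

calls_first = first_pass(candidates, toy_score, o_t=1.0)   # one call per distinct subword
calls_again = first_pass(candidates, toy_score, o_t=1.0)   # flags set, nothing recomputed
reset_flags(inventory.values())
calls_next_frame = first_pass(candidates, toy_score, o_t=2.0)
```

In this toy lexicon the three words contain nine subword positions, but only four first calculations run per frame, because words that share a subword object reuse its stored result.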

※ Note: The disclosure is based on the initial application.