KR950034054A

KR950034054A - Candidate Word Extractor and Extraction Method for Large Vocabulary Speech Recognition

Info

Publication number: KR950034054A
Application number: KR1019940010682A
Authority: KR
Inventors: 안영목; 김회린; 황규웅
Original assignee: 양승택; 재단법인 한국전자통신연구소; 조백제; 한국전기통신공사
Priority date: 1994-05-16
Filing date: 1994-05-16
Publication date: 1995-12-26

Abstract

The present invention relates to a candidate word extraction apparatus and extraction method for large vocabulary speech recognition. By excluding in advance the words that are unlikely to be the recognized word, the candidate word extractor greatly reduces the computation time required for large vocabulary recognition. This improves the speed of the speech recognition system and greatly reduces the search space of the final recognized word extraction unit.

Description

Candidate Word Extraction Apparatus and Extraction Method for Large Vocabulary Speech Recognition.

Because this publication discloses only the essential parts, the full text is not included.

FIG. 1 is an overall configuration diagram of a speech recognition system; FIG. 2 is an overall configuration diagram of a speech recognition system to which a candidate word extraction unit is added; FIG. 3 is an overall flowchart of the candidate word extraction unit.

Claims

1. A candidate word extraction apparatus for large vocabulary speech recognition, comprising: band filtering means (1) for receiving speech converted into an electrical signal by a microphone and passing only the signal in the human audible frequency band; A/D conversion means (2) for sampling the signal output by the band filtering means (1) at a sampling rate; control means (3) connected to the A/D conversion means (2) and comprising a feature extraction unit that analyzes the sampled signal and extracts feature vectors of the signal corresponding to the speech section, a codeword extraction unit that determines the reference vector (codeword) in the codebook closest to each feature vector, a candidate word extraction unit that extracts only a small number of words with a high probability of occurrence, using the codeword distributions of all candidate words generated in a training stage and a candidate word limit, and a final recognized word extraction unit (recognition unit) that selects the final recognized word by comparing the small number of words selected by the candidate word extraction unit with reference patterns; I/O decoding means (7) for outputting the final recognized word of the control means (3); address decoding means (4) for transmitting control signals of the control means (3); a data ROM (6) that receives the control signal of the control means (3) via the address decoding means (4) and stores the codebook, the codeword distribution of each word, and the candidate word limit; and a program ROM (5) that receives the control signal of the control means (3) via the address decoding means (4) and holds a feature extraction program, a candidate word extraction program, and a final recognition program.
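The codeword extraction unit in the apparatus claim picks, for each feature vector, the closest reference vector in the codebook. A minimal Python sketch of that lookup follows. The squared Euclidean distance, the function names, and the two-dimensional example codebook are assumptions made for illustration; the claim only requires "closest" and does not name a distance measure.

```python
import math


def nearest_codeword(feature, codebook):
    """Return the index of the codebook reference vector closest to `feature`.

    Squared Euclidean distance is an assumption; the claim does not
    specify the distance measure.
    """
    best_index, best_dist = -1, math.inf
    for index, reference in enumerate(codebook):
        dist = sum((f - r) ** 2 for f, r in zip(feature, reference))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def quantize(features, codebook):
    """Map a sequence of feature vectors to a sequence of codeword indices."""
    return [nearest_codeword(f, codebook) for f in features]


# Hypothetical codebook with three reference vectors and four input frames.
codebook = [(0.0, 0.0), (1.0, 1.0), (4.0, 0.0)]
frames = [(0.1, -0.2), (0.9, 1.2), (3.5, 0.3), (3.9, -0.1)]
codewords = quantize(frames, codebook)  # one codeword index per frame
```

In the claimed apparatus the codebook would be read from the data ROM (6); here it is an in-memory list only to keep the sketch self-contained.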

2. A candidate word extraction method applied to a candidate word extraction apparatus for large vocabulary speech recognition, comprising: a first step in which the feature vectors from the feature extraction unit are input to a quantizer, the closest reference vector in the codebook is found for each feature vector, and the result is divided into time regions; a second step of recording, after the first step, the codewords that occur in each region and performing a codeword comparison; a third step of calculating, after the second step, the probability of each word using the codeword distributions of all candidate words to be recognized, and sending the values to the candidate word extraction unit; and a fourth step of, after the third step, selecting candidate words according to the candidate word limit ratio and passing the resulting list to the final recognition extraction unit.
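The method claim can be sketched in Python as follows, starting from an already quantized codeword sequence. Several details are assumptions, since the published text does not give them: the score is taken as a sum of log-probabilities of the observed codewords under each word's per-region distribution, unseen codewords get a small floor probability, and the limit ratio is read as the fraction of the vocabulary to keep. The word names and distributions are hypothetical.

```python
import math


def split_into_regions(codewords, num_regions):
    """Assign each frame's codeword to one of `num_regions` time regions."""
    regions = [[] for _ in range(num_regions)]
    for t, codeword in enumerate(codewords):
        regions[t * num_regions // len(codewords)].append(codeword)
    return regions


def word_log_score(regions, distribution, floor=1e-3):
    """Log-probability of the observed codewords under one word's
    per-region codeword distribution (scoring rule is an assumption)."""
    score = 0.0
    for region_index, region in enumerate(regions):
        for codeword in region:
            score += math.log(distribution[region_index].get(codeword, floor))
    return score


def extract_candidates(codewords, distributions, num_regions, limit_ratio):
    """Return the best-scoring fraction `limit_ratio` of the vocabulary."""
    regions = split_into_regions(codewords, num_regions)
    scores = {w: word_log_score(regions, d) for w, d in distributions.items()}
    keep = max(1, math.ceil(limit_ratio * len(scores)))
    return sorted(scores, key=scores.get, reverse=True)[:keep]


# Hypothetical vocabulary: word -> one {codeword: probability} dict per region.
distributions = {
    "seoul": [{0: 0.8, 1: 0.2}, {2: 0.9, 1: 0.1}],
    "busan": [{2: 0.7, 0: 0.3}, {1: 0.6, 0: 0.4}],
    "daegu": [{1: 0.9, 0: 0.1}, {2: 0.5, 0: 0.5}],
    "incheon": [{2: 1.0}, {0: 1.0}],
}
candidates = extract_candidates([0, 1, 2, 2], distributions, 2, 0.5)
```

With a limit ratio of 0.5, only half of the four words reach the final recognition stage, which is where the reduction in search space described in the abstract would come from.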

3. The method of claim 2, wherein the third step comprises: a first process of selecting and outputting the one reference feature vector of the codebook nearest to the experimental vector; a second process of dividing the symbol sequence into a predetermined number of time regions so that all words have the same number of regions; and a third process of recording, for each time region, the codewords that occur for each word, repeated until all data have been input, whereby the codeword distributions of all candidate words are learned.
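The learning of per-word codeword distributions described in claim 3 might look like the Python sketch below. It assumes the training utterances are already quantized into codeword indices and that the distribution is a relative frequency per time region; the function name, the data layout, and the example word are hypothetical and not taken from the patent.

```python
from collections import Counter


def train_distributions(training_data, num_regions):
    """Build per-region codeword distributions for every word.

    training_data maps each word to a list of codeword sequences,
    one sequence per training utterance (layout is an assumption).
    """
    distributions = {}
    for word, utterances in training_data.items():
        counts = [Counter() for _ in range(num_regions)]
        for codewords in utterances:
            # Every utterance is split into the same number of time regions,
            # whatever its length in frames.
            for t, codeword in enumerate(codewords):
                counts[t * num_regions // len(codewords)][codeword] += 1
        distributions[word] = [
            {cw: n / sum(region.values()) for cw, n in region.items()}
            for region in counts
        ]
    return distributions


# Hypothetical training set: two utterances of one word, two time regions.
training_data = {"yes": [[0, 0, 1, 1], [0, 1, 1, 1]]}
learned = train_distributions(training_data, 2)
```

In the first region the two utterances contribute codeword 0 three times and codeword 1 once, so the learned relative frequencies are 0.75 and 0.25; the second region contains only codeword 1.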

Note: The disclosure is based on the original application.