KR940015968A - Speech duration modeling method of speech recognizer - Google Patents
Speech duration modeling method of speech recognizer
- Publication number
- KR940015968A
- Authority
- KR
- South Korea
- Prior art keywords
- speech
- probability
- model
- max
- duration
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The present invention relates to a speaker-independent speech recognizer, and in particular to a speech duration modeling method for such a recognizer. To reduce recognition errors, the method models speech duration in a way that does not depend on variation between speakers, and uses the modeled duration information to improve the recognition rate.
The invention models speech duration probabilistically over n consecutive states. It first finds the states in which speech duration can be modeled by deciding whether two state sequences should be combined. Then, for the states selected in this way, it computes the duration probability from the probability distribution, given the state sequence of the input speech.
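The two steps named in the abstract can be illustrated with a minimal sketch. The published record gives neither the rule for deciding whether to combine states nor the form of the probability distribution, so everything specific here is an assumption made for illustration: neighbouring states are merged when their summed duration varies less across training utterances than either duration alone (coefficient of variation), and each group's duration is modeled with a Gaussian. All function names are hypothetical and are not taken from the patent.

```python
import math
from itertools import groupby
from statistics import mean, pvariance


def state_durations(alignment, n_states):
    """Count how many frames a frame-level state sequence spends in each state."""
    durations = [0] * n_states
    for state, run in groupby(alignment):
        durations[state] += len(list(run))
    return durations


def rel_spread(values):
    """Coefficient of variation: standard deviation divided by the mean."""
    m = mean(values)
    return math.sqrt(pvariance(values)) / m if m > 0 else float("inf")


def choose_groups(training_durations):
    """Step 1 (assumed criterion): greedily merge a state into the previous
    group when the merged duration is more stable across utterances."""
    n_states = len(training_durations[0])
    groups = [[0]]
    for s in range(1, n_states):
        current = [sum(d[i] for i in groups[-1]) for d in training_durations]
        nxt = [d[s] for d in training_durations]
        merged = [a + b for a, b in zip(current, nxt)]
        if rel_spread(merged) < min(rel_spread(current), rel_spread(nxt)):
            groups[-1].append(s)
        else:
            groups.append([s])
    return groups


def fit_gaussians(training_durations, groups):
    """Fit (mean, variance) of the total duration of each state group.
    The variance floor of 1e-3 is an arbitrary illustrative choice."""
    params = []
    for g in groups:
        totals = [sum(d[i] for i in g) for d in training_durations]
        params.append((mean(totals), max(pvariance(totals), 1e-3)))
    return params


def log_duration_prob(alignment, n_states, groups, params):
    """Step 2: log duration probability of an input state sequence under
    the assumed per-group Gaussian duration models."""
    durations = state_durations(alignment, n_states)
    total = 0.0
    for g, (mu, var) in zip(groups, params):
        d = sum(durations[i] for i in g)
        total += -0.5 * math.log(2 * math.pi * var) - (d - mu) ** 2 / (2 * var)
    return total
```

In use, `training_durations` would hold one per-state duration list for each training utterance, for example obtained by applying `state_durations` to frame-level state alignments. The resulting log probability could then be added to the acoustic score of a recognition hypothesis; how the patent actually combines the two is not stated in the published record.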
Description
This record is a publication of the essential parts only, so the full text is not included.
FIG. 1 is a state block diagram of a probability distribution showing the structure of a speech recognition system that uses a general hidden Markov model (HMM). FIG. 2 is an explanatory diagram showing, for speech uttered by different speakers, the time spent in each state and the corresponding states. FIG. 3 is a control flowchart showing the speech duration modeling method of the speech recognizer according to the present invention.
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019920023405A KR950010020B1 (en) | 1992-12-05 | 1992-12-05 | Voice continuating time modeling method of voice recognizer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019920023405A KR950010020B1 (en) | 1992-12-05 | 1992-12-05 | Voice continuating time modeling method of voice recognizer |
Publications (2)
Publication Number | Publication Date |
---|---|
KR940015968A (en) | 1994-07-22 |
KR950010020B1 (en) | 1995-09-04 |
Family
ID=19344812
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1019920023405A KR950010020B1 (en) | Voice continuating time modeling method of voice recognizer | 1992-12-05 | 1992-12-05 |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR950010020B1 (en) |
- 1992-12-05: KR application KR1019920023405A (patent KR950010020B1), not active: IP right cessation
Also Published As
Publication number | Publication date |
---|---|
KR950010020B1 (en) | 1995-09-04 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US5268990A (en) | Method for recognizing speech using linguistically-motivated hidden Markov models | |
Bahl et al. | A maximum likelihood approach to continuous speech recognition | |
JP3049259B2 (en) | Voice recognition method | |
US5241619A (en) | Word dependent N-best search method | |
JP2964507B2 (en) | HMM device | |
CN107680597A (en) | Audio recognition method, device, equipment and computer-readable recording medium | |
CN104978963A (en) | Speech recognition apparatus, method and electronic equipment | |
JP2012037619A (en) | Speaker-adaptation device, speaker-adaptation method and program for speaker-adaptation | |
CN103337241B (en) | Voice recognition method and device | |
EP0903730B1 (en) | Search and rescoring method for a speech recognition system | |
US20040019483A1 (en) | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes | |
JP2002358097A (en) | Voice recognition device | |
JPS5852696A (en) | Voice recognition unit | |
CN111554270A (en) | Training sample screening method and electronic equipment | |
JPH0296800A (en) | Continuous voice recognizing device | |
JP2013182261A (en) | Adaptation device, voice recognition device and program | |
KR940015968A (en) | Speech duration modeling method of speech recognizer | |
JP2002215184A (en) | Speech recognition device and program for the same | |
CN117456999B (en) | Audio identification method, audio identification device, vehicle, computer device, and medium | |
JP3316352B2 (en) | Voice recognition method | |
JP3532248B2 (en) | Speech recognition device using learning speech pattern model | |
JP3144341B2 (en) | Voice recognition device | |
JP3368989B2 (en) | Voice recognition method | |
JPH08314490A (en) | Word spotting type method and device for recognizing voice | |
JP2003022091A (en) | Method, device, and program for voice recognition |
Legal Events
Code | Title | Description |
---|---|---|
A201 | Request for examination | ||
G160 | Decision to publish patent application | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
LAPS | Lapse due to unpaid annual fee |