CN1157712C - Speech recognition method and apparatus - Google Patents
Speech recognition method and apparatus
- Publication number
- CN1157712C (application numbers CNB018007368A, CN01800736A)
- Authority
- CN
- China
- Prior art keywords
- speech
- score value
- voice
- node
- acoustics
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L2015/085—Methods for reducing search complexity, pruning
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2000051466 | 2000-02-28 | | |
JP51466/00 | 2000-02-28 | | |
JP51466/2000 | 2000-02-28 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1365488A (zh) | 2002-08-21 |
CN1157712C (zh) | 2004-07-14 |
Family
ID=18573116
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB018007368A (published as CN1157712C (zh), Expired - Fee Related) | 2000-02-28 | 2001-02-16 | Speech recognition method and apparatus |
Country Status (5)
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR0012694A (pt) | 1999-07-22 | 2002-04-09 | Procter & Gamble | Protease conjugate, cleaning composition and personal care composition |
US20030115169A1 (en) * | 2001-12-17 | 2003-06-19 | Hongzhuan Ye | System and method for management of transcribed documents |
US20030220788A1 (en) * | 2001-12-17 | 2003-11-27 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
US7324940B1 (en) | 2003-02-28 | 2008-01-29 | Lumen Vox, Llc | Speech recognition concept confidence measurement |
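JP4301102B2 (ja) * | 2004-07-22 | 2009-07-22 | Sony Corporation | Speech processing apparatus, speech processing method, program, and recording medium |
JP2007041988A (ja) * | 2005-08-05 | 2007-02-15 | Sony Corp | Information processing apparatus and method, and program |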
US20070124147A1 (en) * | 2005-11-30 | 2007-05-31 | International Business Machines Corporation | Methods and apparatus for use in speech recognition systems for identifying unknown words and for adding previously unknown words to vocabularies and grammars of speech recognition systems |
US9245526B2 (en) * | 2006-04-25 | 2016-01-26 | General Motors Llc | Dynamic clustering of nametags in an automated speech recognition system |
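JP4188989B2 (ja) * | 2006-09-15 | 2008-12-03 | Honda Motor Co., Ltd. | Speech recognition apparatus, speech recognition method, and speech recognition program |
KR100897554B1 (ko) * | 2007-02-21 | 2009-05-15 | Samsung Electronics Co., Ltd. | Distributed speech recognition system and method, and terminal for distributed speech recognition |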
US9129599B2 (en) * | 2007-10-18 | 2015-09-08 | Nuance Communications, Inc. | Automated tuning of speech recognition parameters |
US9582805B2 (en) | 2007-10-24 | 2017-02-28 | Invention Science Fund I, Llc | Returning a personalized advertisement |
US20090113297A1 (en) * | 2007-10-24 | 2009-04-30 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Requesting a second content based on a user's reaction to a first content |
US9513699B2 (en) | 2007-10-24 | 2016-12-06 | Invention Science Fund I, LL | Method of selecting a second content based on a user's reaction to a first content |
US8229921B2 (en) * | 2008-02-25 | 2012-07-24 | Mitsubishi Electric Research Laboratories, Inc. | Method for indexing for retrieving documents using particles |
US8255224B2 (en) | 2008-03-07 | 2012-08-28 | Google Inc. | Voice recognition grammar selection based on context |
DE102008049129A1 (de) | 2008-09-26 | 2010-04-08 | Gea Niro Gmbh | Coupling closure, and fastening module and docking device each containing this coupling closure |
US8301446B2 (en) * | 2009-03-30 | 2012-10-30 | Adacel Systems, Inc. | System and method for training an acoustic model with reduced feature space variation |
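KR20110006004A (ko) * | 2009-07-13 | 2011-01-20 | Samsung Electronics Co., Ltd. | Apparatus and method for optimizing combined recognition units |
JP2011203434A (ja) * | 2010-03-25 | 2011-10-13 | Fujitsu Ltd | Speech recognition apparatus and speech recognition method |
TWI420510B (zh) * | 2010-05-28 | 2013-12-21 | Ind Tech Res Inst | Speech recognition system and method with adjustable memory usage |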
US9601107B2 (en) * | 2011-08-19 | 2017-03-21 | Asahi Kasei Kabushiki Kaisha | Speech recognition system, recognition dictionary registration system, and acoustic model identifier series generation apparatus |
US8914288B2 (en) | 2011-09-01 | 2014-12-16 | At&T Intellectual Property I, L.P. | System and method for advanced turn-taking for interactive spoken dialog systems |
US9741342B2 (en) * | 2014-11-26 | 2017-08-22 | Panasonic Intellectual Property Corporation Of America | Method and apparatus for recognizing speech by lip reading |
WO2016134331A1 (en) * | 2015-02-19 | 2016-08-25 | Tertl Studos Llc | Systems and methods for variably paced real-time translation between the written and spoken forms of a word |
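CN106033669B (zh) * | 2015-03-18 | 2019-06-07 | Spreadtrum Communications (Shanghai) Co., Ltd. | Speech recognition method and apparatus |
KR102423302B1 (ko) * | 2015-10-06 | 2022-07-19 | Samsung Electronics Co., Ltd. | Apparatus and method for calculating acoustic scores in speech recognition, and apparatus and method for training an acoustic model |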
US20170229124A1 (en) * | 2016-02-05 | 2017-08-10 | Google Inc. | Re-recognizing speech with external data sources |
JP7103763B2 (ja) * | 2017-07-20 | 2022-07-20 | Hitachi, Ltd. | Information processing system and information processing method |
US10665228B2 (en) | 2018-05-23 | 2020-05-26 | Bank of America Corporation | Quantum technology for use with extracting intents from linguistics |
Family Cites Families (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5786899A (en) * | 1980-11-18 | 1982-05-31 | Mitsubishi Electric Corp | Voice recognition apparatus |
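JPS5852696A (ja) * | 1981-09-25 | 1983-03-28 | Dai Nippon Printing Co., Ltd. | Speech recognition apparatus |
JPS58111989A (ja) * | 1981-12-25 | 1983-07-04 | Sharp Corporation | Speech recognition apparatus |
JPS59204896A (ja) * | 1983-05-09 | 1984-11-20 | Casio Computer Co., Ltd. | Candidate selection method in speech recognition |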
US5218668A (en) * | 1984-09-28 | 1993-06-08 | Itt Corporation | Keyword recognition system and method using template concantenation model |
US4882757A (en) * | 1986-04-25 | 1989-11-21 | Texas Instruments Incorporated | Speech recognition system |
US4837831A (en) * | 1986-10-15 | 1989-06-06 | Dragon Systems, Inc. | Method for creating and using multiple-word sound models in speech recognition |
US5349645A (en) * | 1991-12-31 | 1994-09-20 | Matsushita Electric Industrial Co., Ltd. | Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches |
US5386492A (en) * | 1992-06-29 | 1995-01-31 | Kurzweil Applied Intelligence, Inc. | Speech recognition system utilizing vocabulary model preselection |
WO1994014270A1 (en) * | 1992-12-17 | 1994-06-23 | Bell Atlantic Network Services, Inc. | Mechanized directory assistance |
DE4306508A1 (de) * | 1993-03-03 | 1994-09-08 | Philips Patentverwaltung | Method and arrangement for determining words in a speech signal |
DE4412930A1 (de) * | 1994-04-15 | 1995-10-19 | Philips Patentverwaltung | Method for determining a sequence of words |
US5729656A (en) * | 1994-11-30 | 1998-03-17 | International Business Machines Corporation | Reduction of search space in speech recognition using phone boundaries and phone ranking |
US5710864A (en) * | 1994-12-29 | 1998-01-20 | Lucent Technologies Inc. | Systems, methods and articles of manufacture for improving recognition confidence in hypothesized keywords |
US5710866A (en) * | 1995-05-26 | 1998-01-20 | Microsoft Corporation | System and method for speech recognition using dynamically adjusted confidence measure |
US5677991A (en) * | 1995-06-30 | 1997-10-14 | Kurzweil Applied Intelligence, Inc. | Speech recognition system using arbitration between continuous speech and isolated word modules |
US5960447A (en) * | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition |
US5937383A (en) * | 1996-02-02 | 1999-08-10 | International Business Machines Corporation | Apparatus and methods for speech recognition including individual or speaker class dependent decoding history caches for fast word acceptance or rejection |
US5991720A (en) * | 1996-05-06 | 1999-11-23 | Matsushita Electric Industrial Co., Ltd. | Speech recognition system employing multiple grammar networks |
US5963903A (en) * | 1996-06-28 | 1999-10-05 | Microsoft Corporation | Method and system for dynamically adjusted training for speech recognition |
US5764851A (en) * | 1996-07-24 | 1998-06-09 | Industrial Technology Research Institute | Fast speech recognition method for mandarin words |
US6757652B1 (en) * | 1998-03-03 | 2004-06-29 | Koninklijke Philips Electronics N.V. | Multiple stage speech recognizer |
US6146147A (en) * | 1998-03-13 | 2000-11-14 | Cognitive Concepts, Inc. | Interactive sound awareness skills improvement system and method |
US6233559B1 (en) * | 1998-04-01 | 2001-05-15 | Motorola, Inc. | Speech control of multiple applications using applets |
ITTO980383A1 (it) * | 1998-05-07 | 1999-11-07 | Cselt Centro Studi Lab Telecom | Method and device for speech recognition with a double recognition step, neural and Markovian. |
US6374220B1 (en) * | 1998-08-05 | 2002-04-16 | Texas Instruments Incorporated | N-best search for continuous speech recognition using viterbi pruning for non-output differentiation states |
US6178401B1 (en) * | 1998-08-28 | 2001-01-23 | International Business Machines Corporation | Method for reducing search complexity in a speech recognition system |
US6138095A (en) * | 1998-09-03 | 2000-10-24 | Lucent Technologies Inc. | Speech recognition |
US6502072B2 (en) * | 1998-11-20 | 2002-12-31 | Microsoft Corporation | Two-tier noise rejection in speech recognition |
JP3252815B2 (ja) * | 1998-12-04 | 2002-02-04 | NEC Corporation | Continuous speech recognition apparatus and method |
US6275802B1 (en) * | 1999-01-07 | 2001-08-14 | Lernout & Hauspie Speech Products N.V. | Search algorithm for large vocabulary speech recognition |
US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
US6539353B1 (en) * | 1999-10-12 | 2003-03-25 | Microsoft Corporation | Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition |
Date | Office | Application Number | Publication | Status |
---|---|---|---|---|
2001-02-16 | WO | PCT/JP2001/001127 | WO2001065541A1 (ja) | Application Filing (active) |
2001-02-16 | CN | CNB018007368A | CN1157712C (zh) | Expired - Fee Related (not active) |
2001-02-16 | EP | EP01904512A | EP1215662A4 (en) | Withdrawn (not active) |
2001-02-16 | US | US10/019,125 | US7881935B2 (en) | Expired - Fee Related (not active) |
2001-02-16 | JP | JP2001564146A | JP4802434B2 (ja) | Expired - Fee Related (not active) |
Also Published As
Publication number | Publication date |
---|---|
EP1215662A1 (en) | 2002-06-19 |
EP1215662A4 (en) | 2005-09-21 |
US20020173958A1 (en) | 2002-11-21 |
JP4802434B2 (ja) | 2011-10-26 |
CN1365488A (zh) | 2002-08-21 |
WO2001065541A1 (fr) | 2001-09-07 |
US7881935B2 (en) | 2011-02-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
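CN1157712C (zh) | | Speech recognition method and apparatus |
CN1169116C (zh) | | Speech recognition apparatus and recognition method |
CN1199148C (zh) | | Speech recognition apparatus and speech recognition method |
CN1296886C (zh) | | Speech recognition system and method |
US8768700B1 (en) | | Voice search engine interface for scoring search hypotheses |
CN1143263C (zh) | | System and method for recognizing tonal languages |
US7219055B2 (en) | | Speech recognition apparatus and method adapting best transformation function to transform one of the input speech and acoustic model |
US8280733B2 (en) | | Automatic speech recognition learning using categorization and selective incorporation of user-initiated corrections |
US7725319B2 (en) | | Phoneme lattice construction and its application to speech recognition and keyword spotting |
CN1160699C (zh) | | Speech recognition system |
US6961701B2 (en) | | Voice recognition apparatus and method, and recording medium |
US20160140957A1 (en) | | Speech Recognition Semantic Classification Training |
JP2002507010A (ja) | | Apparatus and method for simultaneous multimode dictation |
CN1573924A (zh) | | Speech recognition apparatus, speech recognition method, conversation control apparatus, and conversation control method |
JP4515054B2 (ja) | | Method for speech recognition and method for decoding a speech signal |
CN1920948A (zh) | | Speech recognition system and speech processing system |
US20100153366A1 (en) | | Assigning an indexing weight to a search term |
CN1781102A (zh) | | Low-speed memory decision tree |
CN1692405A (zh) | | Speech processing apparatus, language processing method, storage medium, and program |
JP2003515778A (ja) | | Speech recognition method and apparatus using separate language models |
JP2001092496A (ja) | | Continuous speech recognition apparatus and recording medium |
CN1534597A (zh) | | Speech recognition method using variational inference with switching state space models |
CN110164416B (zh) | | Speech recognition method and apparatus, device, and storage medium |
KR101122591B1 (ko) | | Apparatus and method for speech recognition by keyword recognition |
JP2002082691A (ja) | | Method for automatically recognizing company names contained in utterances |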
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | C10 | Entry into substantive examination | |
 | SE01 | Entry into force of request for substantive examination | |
 | C06 | Publication | |
 | PB01 | Publication | |
 | C14 | Grant of patent or utility model | |
 | GR01 | Patent grant | |
 | C17 | Cessation of patent right | |
 | CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 2004-07-14; termination date: 2014-02-16 |