KR20030076661A - 음성 인식을 위한 방법, 모듈, 디바이스 및 서버 - Google Patents
음성 인식을 위한 방법, 모듈, 디바이스 및 서버 Download PDFInfo
- Publication number
- KR20030076661A KR20030076661A KR10-2003-7010428A KR20037010428A KR20030076661A KR 20030076661 A KR20030076661 A KR 20030076661A KR 20037010428 A KR20037010428 A KR 20037010428A KR 20030076661 A KR20030076661 A KR 20030076661A
- Authority
- KR
- South Korea
- Prior art keywords
- unrecognized
- terminal
- language model
- representation
- data
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000012937 correction Methods 0.000 claims abstract description 49
- 230000014509 gene expression Effects 0.000 claims description 64
- 230000005540 biological transmission Effects 0.000 claims description 14
- 238000010586 diagram Methods 0.000 description 13
- 230000000875 corresponding effect Effects 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 238000004891 communication Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000007619 statistical method Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (7)
- 적어도 하나의 단말(114)에서 구현되며 언어 모델(311)을 사용하는 음성 인식 방법으로서,- 상기 단말 중 하나의 단말에서 적어도 하나의 미인식된 표현을 검출(502)하는 단계와,- 상기 미인식된 표현(309)을 나타내는 데이터를 상기 단말에 리코딩(503)하는 단계와,- 제 1 송신 채널(121)을 통해, 상기 단말에 의해 상기 리코드된 데이터를 리모트 서버(116)로 송신(603)하는 단계와,- 상기 리코드된 데이터를 상기 리모트 서버의 레벨에서 분석(803)하며, 그리고 상기 미인식된 표현의 적어도 하나의 부분을 고려하여 상기 언어 모델을 정정하기 위한 정보를 생성(805)하는 단계와,- 상기 미인식된 표현 중 적어도 특정 표현을 차후 인식 가능하게 하기 위하여, 상기 정정 정보를 상기 서버로부터 제 2 송신 채널(115, 119, 120)을 통해 적어도 하나의 단말(114, 117, 118)로 송신(806)하는 단계를 포함하는 것을 특징으로 하는, 음성 인식 방법.
- 제 1 항에 있어서, 상기 미인식된 표현(309)을 나타내는 상기 데이터는 음향 신호(acoustic signal)를 묘사하는 파라미터를 나타내는 압축된 음성 리코딩을 포함하는 것을 특징으로 하는, 음성 인식 방법.
- 제 1 항 또는 제 2 항에 있어서, 상기 단말에 의한 상기 송신 단계 동안, 상기 단말은- 어느 표현이 인식되지 못하였을 때 상기 음성 인식 방법의 사용에 관한 문맥 정보와,- 미인식된 표현을 말한 화자에 관한 정보를 포함하는 그룹 중 일부를 형성하는 정보의 적어도 하나의 아이템을 상기 서버로 더 송신하는 것을 특징으로 하는, 음성 인식 방법.
- 제 1 항 내지 제 3 항 중 어느 한 항에 있어서, 상기 리코드된 데이터 및/또는 상기 정정 정보의 암호화 및/또는 스크램블링을 구현하는 것을 특징으로 하는, 음성 인식 방법.
- 언어 모델을 사용하는 음성 인식 모듈(102)로서,- 미인식된 표현을 검출하는 분석기와,- 적어도 하나의 미인식된 표현을 나타내는 데이터를 리코드하는 리코더와,- 상기 리코드된 데이터를 리모트 서버로 송신하는 송신기와,- 상기 모듈에 의해 상기 미인식된 표현 중 적어도 특정 표현을 차후 인식 가능하게 하도록, 상기 모듈에 송신된 상기 언어 모델의 정정을 가능하게 하는 정정 정보를 수신하는 수신기로서, 상기 정정 정보는, 상기 데이터를 상기 리모트 서버의 레벨에서 분석한 후 그리고 상기 미인식된 표현의 적어도 하나의 부분을 고려하여 상기 언어 모델을 정정하기 위한 정보를 생성한 후, 상기 리모트 서버에 의해 송신되는, 수신기를 포함하는 것을 특징으로 하는, 음성 인식 모듈.
- 언어 모델을 사용하는 음성 인식 디바이스(102)로서,- 미인식된 표현을 검출하는 분석기와,- 적어도 하나의 미인식된 표현을 나타내는 데이터를 리코드하는 리코더와,- 상기 리코드된 데이터를 리모트 서버로 송신하는 송신기와,- 상기 디바이스에 의한 상기 미인식된 표현 중 적어도 특정 표현을 차후 인식 가능하게 하도록, 상기 디바이스에 송신된 상기 언어 모델의 정정을 가능하게 하는 정정 정보를 수신하는 수신기로서, 상기 정정 정보는, 상기 데이터를 상기 리모트 서버의 레벨에서 분석한 후 그리고 상기 미인식된 표현 중 적어도 하나의 부분을 고려하여 상기 언어 모델을 정정하기 위한 정보를 생성한 후, 상기 리모트 서버에 의해 송신되는, 수신기를 포함하는 것을 특징으로 하는, 음성 인식 디바이스.
- 언어 모델을 사용하여 음성 인식이 적어도 하나의 리모트 단말의 세트에서 구현되는 음성 인식 서버(116)로서,- 상기 단말 세트의 부분을 형성하고, 음성 인식 동작 동안 상기 미인식된 표현을 검출한 적어도 하나의 단말에 의해 상기 미인식된 적어도 하나의 표현을 나타내는 데이터를 수신하는 수신기와,- 상기 서버의 레벨에서 수신된 상기 데이터의 분석에 기초하여 획득된 정정 정보를 적어도 하나의 리모트 단말의 상기 단말 세트로 송신하는 송신기로서, 상기 정정 정보는 상기 단말 세트의 각 단말에 의해 상기 미인식된 표현의 적어도 하나의 부분을 차후 인식 가능하게 하도록 상기 언어 모델의 정정을 가능하게 하는, 송신기를 포함하는 것을 특징으로 하는, 음성 인식 서버.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0101910A FR2820872B1 (fr) | 2001-02-13 | 2001-02-13 | Procede, module, dispositif et serveur de reconnaissance vocale |
FR01/01910 | 2001-02-13 | ||
PCT/FR2002/000518 WO2002065454A1 (fr) | 2001-02-13 | 2002-02-12 | Procede, module, dispositif et serveur de reconnaissance vocale |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20030076661A true KR20030076661A (ko) | 2003-09-26 |
KR100908358B1 KR100908358B1 (ko) | 2009-07-20 |
Family
ID=8859932
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020037010428A KR100908358B1 (ko) | 2001-02-13 | 2002-02-12 | 음성 인식을 위한 방법, 모듈, 디바이스 및 서버 |
Country Status (10)
Country | Link |
---|---|
US (1) | US7983911B2 (ko) |
EP (1) | EP1362343B1 (ko) |
JP (1) | JP4751569B2 (ko) |
KR (1) | KR100908358B1 (ko) |
CN (1) | CN1228762C (ko) |
DE (1) | DE60222093T2 (ko) |
ES (1) | ES2291440T3 (ko) |
FR (1) | FR2820872B1 (ko) |
MX (1) | MXPA03007178A (ko) |
WO (1) | WO2002065454A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20210075815A (ko) * | 2019-12-13 | 2021-06-23 | 주식회사 소리자바 | 음성 인식 힌트 적용 장치 및 방법 |
Families Citing this family (64)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030182113A1 (en) * | 1999-11-22 | 2003-09-25 | Xuedong Huang | Distributed speech recognition for mobile communication devices |
JP4267385B2 (ja) | 2003-06-30 | 2009-05-27 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 統計的言語モデル生成装置、音声認識装置、統計的言語モデル生成方法、音声認識方法、およびプログラム |
US8954325B1 (en) * | 2004-03-22 | 2015-02-10 | Rockstar Consortium Us Lp | Speech recognition in automated information services systems |
US7542904B2 (en) * | 2005-08-19 | 2009-06-02 | Cisco Technology, Inc. | System and method for maintaining a speech-recognition grammar |
EP1760566A1 (en) * | 2005-08-29 | 2007-03-07 | Top Digital Co., Ltd. | Voiceprint-lock system for electronic data |
US20070136069A1 (en) * | 2005-12-13 | 2007-06-14 | General Motors Corporation | Method and system for customizing speech recognition in a mobile vehicle communication system |
US8510109B2 (en) | 2007-08-22 | 2013-08-13 | Canyon Ip Holdings Llc | Continuous speech transcription performance indication |
US8117268B2 (en) | 2006-04-05 | 2012-02-14 | Jablokov Victor R | Hosted voice recognition system for wireless devices |
US8214213B1 (en) * | 2006-04-27 | 2012-07-03 | At&T Intellectual Property Ii, L.P. | Speech recognition based on pronunciation modeling |
WO2007147077A2 (en) | 2006-06-14 | 2007-12-21 | Personics Holdings Inc. | Earguard monitoring system |
TWI321313B (en) * | 2007-03-03 | 2010-03-01 | Ind Tech Res Inst | Apparatus and method to reduce recognization errors through context relations among dialogue turns |
US9973450B2 (en) | 2007-09-17 | 2018-05-15 | Amazon Technologies, Inc. | Methods and systems for dynamically updating web service profile information by parsing transcribed message strings |
US8352264B2 (en) | 2008-03-19 | 2013-01-08 | Canyon IP Holdings, LLC | Corrective feedback loop for automated speech recognition |
US11683643B2 (en) | 2007-05-04 | 2023-06-20 | Staton Techiya Llc | Method and device for in ear canal echo suppression |
US11856375B2 (en) | 2007-05-04 | 2023-12-26 | Staton Techiya Llc | Method and device for in-ear echo suppression |
US8335829B1 (en) | 2007-08-22 | 2012-12-18 | Canyon IP Holdings, LLC | Facilitating presentation by mobile device of additional content for a word or phrase upon utterance thereof |
US9053489B2 (en) | 2007-08-22 | 2015-06-09 | Canyon Ip Holdings Llc | Facilitating presentation of ads relating to words of a message |
US9129599B2 (en) * | 2007-10-18 | 2015-09-08 | Nuance Communications, Inc. | Automated tuning of speech recognition parameters |
US8326631B1 (en) * | 2008-04-02 | 2012-12-04 | Verint Americas, Inc. | Systems and methods for speech indexing |
JP5327838B2 (ja) * | 2008-04-23 | 2013-10-30 | Necインフロンティア株式会社 | 音声入力分散処理方法及び音声入力分散処理システム |
US8600067B2 (en) | 2008-09-19 | 2013-12-03 | Personics Holdings Inc. | Acoustic sealing analysis system |
US8374872B2 (en) * | 2008-11-04 | 2013-02-12 | Verizon Patent And Licensing Inc. | Dynamic update of grammar for interactive voice response |
US20120215528A1 (en) | 2009-10-28 | 2012-08-23 | Nec Corporation | Speech recognition system, speech recognition request device, speech recognition method, speech recognition program, and recording medium |
US9842591B2 (en) * | 2010-05-19 | 2017-12-12 | Sanofi-Aventis Deutschland Gmbh | Methods and systems for modifying operational data of an interaction process or of a process for determining an instruction |
US20110307250A1 (en) * | 2010-06-10 | 2011-12-15 | Gm Global Technology Operations, Inc. | Modular Speech Recognition Architecture |
US9484018B2 (en) * | 2010-11-23 | 2016-11-01 | At&T Intellectual Property I, L.P. | System and method for building and evaluating automatic speech recognition via an application programmer interface |
US9472185B1 (en) | 2011-01-05 | 2016-10-18 | Interactions Llc | Automated recognition system for natural language understanding |
US9245525B2 (en) | 2011-01-05 | 2016-01-26 | Interactions Llc | Automated speech recognition proxy system for natural language understanding |
JP5837341B2 (ja) * | 2011-06-24 | 2015-12-24 | 株式会社ブリヂストン | 路面状態判定方法とその装置 |
GB2493413B (en) | 2011-07-25 | 2013-12-25 | Ibm | Maintaining and supplying speech models |
JP2013127536A (ja) * | 2011-12-19 | 2013-06-27 | Sharp Corp | 音声出力装置、当該音声出力装置を備える通信端末、当該音声出力装置を備える補聴器、音声出力装置を制御するためのプログラム、音声出力装置の使用者に応じた音声を提供するための方法、および、音声出力装置の変換データを更新するためのシステム |
AU2018202888B2 (en) * | 2013-01-17 | 2020-07-02 | Samsung Electronics Co., Ltd. | Image processing apparatus, control method thereof, and image processing system |
JP6025785B2 (ja) * | 2013-07-08 | 2016-11-16 | インタラクションズ リミテッド ライアビリティ カンパニー | 自然言語理解のための自動音声認識プロキシシステム |
DE102013216427B4 (de) * | 2013-08-20 | 2023-02-02 | Bayerische Motoren Werke Aktiengesellschaft | Vorrichtung und Verfahren zur fortbewegungsmittelbasierten Sprachverarbeitung |
EP3040985B1 (en) * | 2013-08-26 | 2023-08-23 | Samsung Electronics Co., Ltd. | Electronic device and method for voice recognition |
EP2851896A1 (en) | 2013-09-19 | 2015-03-25 | Maluuba Inc. | Speech recognition using phoneme matching |
DE102013219649A1 (de) * | 2013-09-27 | 2015-04-02 | Continental Automotive Gmbh | Verfahren und System zum Erstellen oder Ergänzen eines benutzerspezifischen Sprachmodells in einem mit einem Endgerät verbindbaren lokalen Datenspeicher |
US10043534B2 (en) | 2013-12-23 | 2018-08-07 | Staton Techiya, Llc | Method and device for spectral expansion for an audio signal |
DE102014200570A1 (de) * | 2014-01-15 | 2015-07-16 | Bayerische Motoren Werke Aktiengesellschaft | Verfahren und System zur Erzeugung eines Steuerungsbefehls |
US9601108B2 (en) | 2014-01-17 | 2017-03-21 | Microsoft Technology Licensing, Llc | Incorporating an exogenous large-vocabulary model into rule-based speech recognition |
CN103956168A (zh) * | 2014-03-29 | 2014-07-30 | 深圳创维数字技术股份有限公司 | 一种语音识别方法、装置及终端 |
US10749989B2 (en) | 2014-04-01 | 2020-08-18 | Microsoft Technology Licensing Llc | Hybrid client/server architecture for parallel processing |
KR102225404B1 (ko) * | 2014-05-23 | 2021-03-09 | 삼성전자주식회사 | 디바이스 정보를 이용하는 음성인식 방법 및 장치 |
JP2016009193A (ja) * | 2014-06-23 | 2016-01-18 | ハーマン インターナショナル インダストリーズ インコーポレイテッド | ユーザ適合音声認識 |
US10163453B2 (en) | 2014-10-24 | 2018-12-25 | Staton Techiya, Llc | Robust voice activity detector system for use with an earphone |
CN107077843A (zh) * | 2014-10-30 | 2017-08-18 | 三菱电机株式会社 | 对话控制装置和对话控制方法 |
US9711141B2 (en) * | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
KR102325724B1 (ko) * | 2015-02-28 | 2021-11-15 | 삼성전자주식회사 | 다수의 기기에서 텍스트 데이터 동기화 |
US20160274864A1 (en) * | 2015-03-20 | 2016-09-22 | Google Inc. | Systems and methods for enabling user voice interaction with a host computing device |
CN104758075B (zh) * | 2015-04-20 | 2016-05-25 | 郑洪� | 基于语音识别控制的家用口腔护理工具 |
US10325590B2 (en) * | 2015-06-26 | 2019-06-18 | Intel Corporation | Language model modification for local speech recognition systems using remote sources |
US10616693B2 (en) | 2016-01-22 | 2020-04-07 | Staton Techiya Llc | System and method for efficiency among devices |
US9858918B2 (en) * | 2016-03-15 | 2018-01-02 | GM Global Technology Operations LLC | Root cause analysis and recovery systems and methods |
US9761227B1 (en) | 2016-05-26 | 2017-09-12 | Nuance Communications, Inc. | Method and system for hybrid decoding for enhanced end-user privacy and low latency |
US10971157B2 (en) * | 2017-01-11 | 2021-04-06 | Nuance Communications, Inc. | Methods and apparatus for hybrid speech recognition processing |
US10229682B2 (en) | 2017-02-01 | 2019-03-12 | International Business Machines Corporation | Cognitive intervention for voice recognition failure |
US10636423B2 (en) | 2018-02-21 | 2020-04-28 | Motorola Solutions, Inc. | System and method for managing speech recognition |
CN108683937B (zh) * | 2018-03-09 | 2020-01-21 | 百度在线网络技术(北京)有限公司 | 智能电视的语音交互反馈方法、系统及计算机可读介质 |
US10951994B2 (en) | 2018-04-04 | 2021-03-16 | Staton Techiya, Llc | Method to acquire preferred dynamic range function for speech enhancement |
KR102544250B1 (ko) | 2018-07-03 | 2023-06-16 | 삼성전자주식회사 | 소리를 출력하는 디바이스 및 그 방법 |
US11087739B1 (en) * | 2018-11-13 | 2021-08-10 | Amazon Technologies, Inc. | On-device learning in a hybrid speech processing system |
CN110473530B (zh) * | 2019-08-21 | 2021-12-07 | 北京百度网讯科技有限公司 | 指令分类方法、装置、电子设备及计算机可读存储介质 |
CN113052191A (zh) * | 2019-12-26 | 2021-06-29 | 航天信息股份有限公司 | 一种神经语言网络模型的训练方法、装置、设备及介质 |
US11552966B2 (en) | 2020-09-25 | 2023-01-10 | International Business Machines Corporation | Generating and mutually maturing a knowledge corpus |
Family Cites Families (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5384892A (en) * | 1992-12-31 | 1995-01-24 | Apple Computer, Inc. | Dynamic language model for speech recognition |
ZA948426B (en) * | 1993-12-22 | 1995-06-30 | Qualcomm Inc | Distributed voice recognition system |
JPH07222248A (ja) | 1994-02-08 | 1995-08-18 | Hitachi Ltd | 携帯型情報端末における音声情報の利用方式 |
US5852801A (en) * | 1995-10-04 | 1998-12-22 | Apple Computer, Inc. | Method and apparatus for automatically invoking a new word module for unrecognized user input |
US6058363A (en) * | 1997-01-02 | 2000-05-02 | Texas Instruments Incorporated | Method and system for speaker-independent recognition of user-defined phrases |
US6173259B1 (en) * | 1997-03-27 | 2001-01-09 | Speech Machines Plc | Speech to text conversion |
US6078886A (en) * | 1997-04-14 | 2000-06-20 | At&T Corporation | System and method for providing remote automatic speech recognition services via a packet network |
US5953700A (en) * | 1997-06-11 | 1999-09-14 | International Business Machines Corporation | Portable acoustic interface for remote access to automatic speech/speaker recognition server |
WO1999018556A2 (en) * | 1997-10-08 | 1999-04-15 | Koninklijke Philips Electronics N.V. | Vocabulary and/or language model training |
US5937385A (en) * | 1997-10-20 | 1999-08-10 | International Business Machines Corporation | Method and apparatus for creating speech recognition grammars constrained by counter examples |
US6195641B1 (en) * | 1998-03-27 | 2001-02-27 | International Business Machines Corp. | Network universal spoken language vocabulary |
US6157910A (en) * | 1998-08-31 | 2000-12-05 | International Business Machines Corporation | Deferred correction file transfer for updating a speech file by creating a file log of corrections |
US6185535B1 (en) * | 1998-10-16 | 2001-02-06 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice control of a user interface to service applications |
US6275803B1 (en) * | 1999-02-12 | 2001-08-14 | International Business Machines Corp. | Updating a language model based on a function-word to total-word ratio |
US6195636B1 (en) * | 1999-02-19 | 2001-02-27 | Texas Instruments Incorporated | Speech recognition over packet networks |
WO2000058946A1 (en) * | 1999-03-26 | 2000-10-05 | Koninklijke Philips Electronics N.V. | Client-server speech recognition |
US6408272B1 (en) * | 1999-04-12 | 2002-06-18 | General Magic, Inc. | Distributed voice user interface |
US6463413B1 (en) * | 1999-04-20 | 2002-10-08 | Matsushita Electrical Industrial Co., Ltd. | Speech recognition training for small hardware devices |
US6360201B1 (en) * | 1999-06-08 | 2002-03-19 | International Business Machines Corp. | Method and apparatus for activating and deactivating auxiliary topic libraries in a speech dictation system |
JP2001013985A (ja) | 1999-07-01 | 2001-01-19 | Meidensha Corp | 音声認識システムの辞書管理方式 |
US6484136B1 (en) * | 1999-10-21 | 2002-11-19 | International Business Machines Corporation | Language model adaptation via network of similar users |
US20030182113A1 (en) * | 1999-11-22 | 2003-09-25 | Xuedong Huang | Distributed speech recognition for mobile communication devices |
JP3728177B2 (ja) * | 2000-05-24 | 2005-12-21 | キヤノン株式会社 | 音声処理システム、装置、方法及び記憶媒体 |
JP2003036088A (ja) * | 2001-07-23 | 2003-02-07 | Canon Inc | 音声変換の辞書管理装置 |
US7016849B2 (en) * | 2002-03-25 | 2006-03-21 | Sri International | Method and apparatus for providing speech-driven routing between spoken language applications |
-
2001
- 2001-02-13 FR FR0101910A patent/FR2820872B1/fr not_active Expired - Fee Related
-
2002
- 2002-02-12 DE DE60222093T patent/DE60222093T2/de not_active Expired - Lifetime
- 2002-02-12 ES ES02703691T patent/ES2291440T3/es not_active Expired - Lifetime
- 2002-02-12 JP JP2002565299A patent/JP4751569B2/ja not_active Expired - Fee Related
- 2002-02-12 WO PCT/FR2002/000518 patent/WO2002065454A1/fr active IP Right Grant
- 2002-02-12 CN CNB028049195A patent/CN1228762C/zh not_active Expired - Fee Related
- 2002-02-12 MX MXPA03007178A patent/MXPA03007178A/es active IP Right Grant
- 2002-02-12 KR KR1020037010428A patent/KR100908358B1/ko active IP Right Grant
- 2002-02-12 EP EP02703691A patent/EP1362343B1/fr not_active Expired - Lifetime
- 2002-02-12 US US10/467,586 patent/US7983911B2/en not_active Expired - Fee Related
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20210075815A (ko) * | 2019-12-13 | 2021-06-23 | 주식회사 소리자바 | 음성 인식 힌트 적용 장치 및 방법 |
Also Published As
Publication number | Publication date |
---|---|
FR2820872A1 (fr) | 2002-08-16 |
CN1491412A (zh) | 2004-04-21 |
ES2291440T3 (es) | 2008-03-01 |
EP1362343A1 (fr) | 2003-11-19 |
JP4751569B2 (ja) | 2011-08-17 |
MXPA03007178A (es) | 2003-12-04 |
DE60222093D1 (de) | 2007-10-11 |
EP1362343B1 (fr) | 2007-08-29 |
CN1228762C (zh) | 2005-11-23 |
KR100908358B1 (ko) | 2009-07-20 |
DE60222093T2 (de) | 2008-06-05 |
WO2002065454A1 (fr) | 2002-08-22 |
FR2820872B1 (fr) | 2003-05-16 |
US20050102142A1 (en) | 2005-05-12 |
US7983911B2 (en) | 2011-07-19 |
JP2004530149A (ja) | 2004-09-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100908358B1 (ko) | 음성 인식을 위한 방법, 모듈, 디바이스 및 서버 | |
US11437041B1 (en) | Speech interface device with caching component | |
CN110473531B (zh) | 语音识别方法、装置、电子设备、系统及存储介质 | |
KR101183344B1 (ko) | 사용자 정정들을 이용한 자동 음성 인식 학습 | |
CN1667700B (zh) | 把字的语音或声学描述、发音添加到语音识别词典的方法 | |
US7412387B2 (en) | Automatic improvement of spoken language | |
US7848926B2 (en) | System, method, and program for correcting misrecognized spoken words by selecting appropriate correction word from one or more competitive words | |
EP0965978B1 (en) | Non-interactive enrollment in speech recognition | |
KR101247578B1 (ko) | 자동 음성 인식 음향 모델들의 적응 | |
CN110047481B (zh) | 用于语音识别的方法和装置 | |
WO2000049599A1 (fr) | Traducteur de sons vocaux, procede de traduction de sons vocaux et support d'enregistrement sur lequel est enregistre un programme de commande de traduction de sons vocaux | |
JP5149107B2 (ja) | 音響処理装置およびプログラム | |
US7076422B2 (en) | Modelling and processing filled pauses and noises in speech recognition | |
JP5271299B2 (ja) | 音声認識装置、音声認識システム、及び音声認識プログラム | |
WO2023109129A1 (zh) | 语音数据的处理方法及装置 | |
JP4689032B2 (ja) | シンタックス上の置換規則を実行する音声認識装置 | |
US20030105632A1 (en) | Syntactic and semantic analysis of voice commands | |
US7206738B2 (en) | Hybrid baseform generation | |
Odell et al. | Architecture, user interface, and enabling technology in Windows Vista's speech systems | |
JP2001013992A (ja) | 音声理解装置 | |
Nguyen et al. | Progress in transcription of Vietnamese broadcast news | |
Ju et al. | Spontaneous Mandarin speech understanding using Utterance Classification: A case study | |
GB2465384A (en) | A speech recognition based method and system for retrieving data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20130620 Year of fee payment: 5 |
|
FPAY | Annual fee payment |
Payment date: 20140630 Year of fee payment: 6 |
|
FPAY | Annual fee payment |
Payment date: 20150619 Year of fee payment: 7 |
|
FPAY | Annual fee payment |
Payment date: 20160616 Year of fee payment: 8 |
|
FPAY | Annual fee payment |
Payment date: 20170616 Year of fee payment: 9 |
|
FPAY | Annual fee payment |
Payment date: 20190711 Year of fee payment: 11 |