KR100800367B1 - 음성 인식 시스템의 작동 방법, 컴퓨터 시스템 및 프로그램을 갖춘 컴퓨터 판독 가능 저장 매체 - Google Patents
음성 인식 시스템의 작동 방법, 컴퓨터 시스템 및 프로그램을 갖춘 컴퓨터 판독 가능 저장 매체 Download PDFInfo
- Publication number
- KR100800367B1 KR100800367B1 KR1020057009735A KR20057009735A KR100800367B1 KR 100800367 B1 KR100800367 B1 KR 100800367B1 KR 1020057009735 A KR1020057009735 A KR 1020057009735A KR 20057009735 A KR20057009735 A KR 20057009735A KR 100800367 B1 KR100800367 B1 KR 100800367B1
- Authority
- KR
- South Korea
- Prior art keywords
- speech recognition
- recognizer
- combination
- selection
- sensor
- Prior art date
Links
- 230000006978 adaptation Effects 0.000 title description 7
- 238000000034 method Methods 0.000 claims abstract description 43
- 239000013598 vector Substances 0.000 claims description 18
- 230000006870 function Effects 0.000 claims description 9
- 238000012545 processing Methods 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 8
- 238000003066 decision tree Methods 0.000 claims description 3
- 238000011156 evaluation Methods 0.000 claims description 3
- 238000000528 statistical test Methods 0.000 claims description 3
- 239000012634 fragment Substances 0.000 claims description 2
- 238000013507 mapping Methods 0.000 claims description 2
- 230000007613 environmental effect Effects 0.000 abstract description 4
- 230000007246 mechanism Effects 0.000 abstract description 4
- 230000008859 change Effects 0.000 abstract description 3
- 238000012549 training Methods 0.000 description 8
- 230000003044 adaptive effect Effects 0.000 description 5
- 238000002372 labelling Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000009466 transformation Effects 0.000 description 4
- 238000013459 approach Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000009423 ventilation Methods 0.000 description 2
- 230000002411 adverse Effects 0.000 description 1
- 239000000956 alloy Substances 0.000 description 1
- 229910045601 alloy Inorganic materials 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 238000012417 linear regression Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000013138 pruning Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
Description
미국 특허 출원 제 2002/0065584 A1호에는, 내장형 시스템을 위해 서로 다른 유형의 환경적 잡음을 적응시키도록 구성되는 음성 인식 시스템이 개시되어 있다. 불리하게도, 이러한 종래 기술의 시스템은 상대적으로 낮은 인식율을 가지며, 낮은 정도의 계산 리소스를 갖는 시스템과 상대적으로 높은 정도의 계산 리소스를 갖는 시스템간에 적응시키기 위한 크기 조정이 불가능하다.
이고, 단어 가설 조합의 경우에, 신뢰 측정값은, 각각의 인식기가 음성 신호의 임의의 주어진 간격동안에 상이한 결과를 생성하면 발생할 수 있는 타이(tie)를 해결하는데 사용될 수 있다. 이 경우에, 최적의 스코어 인식기로부터 획득되는 전사(transcription)를 고려중인 음성 신호의 일부에 할당하는 것이 제시되어 있다.
Claims (9)
- 음성 인식 시스템을 작동시키는 방법으로서,프로그램 제어식 인식기가, 음성 신호를 프레임으로 분해하고 각각의 프레임에 대한 임의 유형의 특징 벡터를 계산하는 단계와, 프레임을 음소(phoneme)마다 다수의 라벨을 생성하는 문자 또는 문자 그룹에 의해 라벨링(labelling)하는 단계와, 사전결정된 음향 모델에 따라 상기 라벨을 디코딩하여 하나 이상의 워드 또는 하나의 워드의 단편을 구성하는 단계를 수행하며, 복수의 인식기가 음성 인식을 위해 활성화되도록 액세스 가능하며, 하나의 인식기에 의해 수행되는 음성 인식의 결과를 밸런싱하기 위해 결합하되,a) 센서 수단을 이용하여 음성 인식 경계 조건을 특징화하는 선택 베이스 데이터를 수집하는 단계와,b) 상기 수집된 데이터를 평가하는 프로그램 제어식 아비터 수단을 이용하는 단계와,c) 상기 평가에 따라 복수의 이용 가능한 인식기 중에서, 최상의 적합한 인식기 또는 인식기들의 조합을 선택하는 단계를 포함하는 음성 인식 시스템의 작동 방법.
- 제 1 항에 있어서,상기 센서 수단은 소프트웨어 프로그램을 포함하는 결정 로직, 물리적 센서 또는 이들의 조합 중 하나 이상인 음성 인식 시스템의 작동 방법.
- 제 1 항에 있어서,상기 프로그램 제어식 아비터 수단을 이용하는 단계는a) 통계적 테스트, 결정 트리와 퍼지 멤버쉽 함수 중 하나 이상을 구현하는 결정 로직에서의 물리적 센서 출력을 처리하는 단계와,b) 상기 센서 선택/조합 결정에 사용될 신뢰값을 상기 처리 단계로부터 리턴하는 단계를 포함하는 음성 인식 시스템의 작동 방법.
- 제 1 항에 있어서,인식기 선택 결정으로 된 선택 베이스 데이터는, 인식기의 고속 선택을 얻기 위해서, 인식기의 반복되는 고속 액세스를 위한 데이터베이스에 저장되는 음성 인식 시스템의 작동 방법.
- 제 1 항에 있어서,현재의 프로세서 부하에 따라 인식기의 개수 및/또는 조합을 선택하는 단계를 더 포함하는 음성 인식 시스템의 작동 방법.
- 제 1 항에 있어서,하나의 음향 모델이 다른 하나의 음향 모델로 변환되는 방법에 관한 매핑 규칙(7)을 저장하는 단계를 더 포함하는 음성 인식 시스템의 작동 방법.
- 제 1 항 내지 제 6 항 중 어느 한 항에 따른 방법의 단계를 수행하는 수단을 구비한 컴퓨터 시스템.
- 컴퓨터 프로그램 코드 부분이 컴퓨터상에서 실행될 때, 제 1 항 내지 제 6 항 중 어느 한 항에 따른 방법의 각각의 단계를 수행하는 컴퓨터 프로그램 코드 부분을 포함하며 데이터 처리 시스템에서 실행되는 프로그램을 갖춘 컴퓨터 판독 가능 저장 매체.
- 삭제
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP02102875.8 | 2002-12-20 | ||
EP02102875 | 2002-12-20 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20050090389A KR20050090389A (ko) | 2005-09-13 |
KR100800367B1 true KR100800367B1 (ko) | 2008-02-04 |
Family
ID=32668901
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020057009735A KR100800367B1 (ko) | 2002-12-20 | 2003-10-31 | 음성 인식 시스템의 작동 방법, 컴퓨터 시스템 및 프로그램을 갖춘 컴퓨터 판독 가능 저장 매체 |
Country Status (9)
Country | Link |
---|---|
US (1) | US7302393B2 (ko) |
EP (1) | EP1576581B1 (ko) |
JP (1) | JP2006510933A (ko) |
KR (1) | KR100800367B1 (ko) |
CN (1) | CN100552773C (ko) |
AU (1) | AU2003293646A1 (ko) |
CA (1) | CA2507999C (ko) |
TW (1) | TWI245259B (ko) |
WO (1) | WO2004057574A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9626962B2 (en) | 2014-05-02 | 2017-04-18 | Samsung Electronics Co., Ltd. | Method and apparatus for recognizing speech, and method and apparatus for generating noise-speech recognition model |
Families Citing this family (101)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
JP4352790B2 (ja) * | 2002-10-31 | 2009-10-28 | セイコーエプソン株式会社 | 音響モデル作成方法および音声認識装置ならびに音声認識装置を有する乗り物 |
CN100369113C (zh) * | 2004-12-31 | 2008-02-13 | 中国科学院自动化研究所 | 利用增益自适应提高语音识别率的方法 |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
DE602006010505D1 (de) | 2005-12-12 | 2009-12-31 | Gregory John Gadbois | Mehrstimmige Spracherkennung |
US8380506B2 (en) * | 2006-01-27 | 2013-02-19 | Georgia Tech Research Corporation | Automatic pattern recognition using category dependent feature selection |
KR100770896B1 (ko) | 2006-03-07 | 2007-10-26 | 삼성전자주식회사 | 음성 신호에서 음소를 인식하는 방법 및 그 시스템 |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US20080071540A1 (en) * | 2006-09-13 | 2008-03-20 | Honda Motor Co., Ltd. | Speech recognition method for robot under motor noise thereof |
US8996379B2 (en) | 2007-03-07 | 2015-03-31 | Vlingo Corporation | Speech recognition text entry for software applications |
US8886545B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Dealing with switch latency in speech recognition |
US8886540B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Using speech recognition results based on an unstructured language model in a mobile communication facility application |
US10056077B2 (en) | 2007-03-07 | 2018-08-21 | Nuance Communications, Inc. | Using speech recognition results based on an unstructured language model with a music system |
US8635243B2 (en) | 2007-03-07 | 2014-01-21 | Research In Motion Limited | Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application |
US8838457B2 (en) | 2007-03-07 | 2014-09-16 | Vlingo Corporation | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
US8949130B2 (en) * | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Internal and external speech recognition use with a mobile communication facility |
US8949266B2 (en) | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Multiple web-based content category searching in mobile search application |
US20090071315A1 (en) * | 2007-05-04 | 2009-03-19 | Fortuna Joseph A | Music analysis and generation method |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US8019608B2 (en) * | 2008-08-29 | 2011-09-13 | Multimodal Technologies, Inc. | Distributed speech recognition using one way communication |
KR101239318B1 (ko) * | 2008-12-22 | 2013-03-05 | 한국전자통신연구원 | 음질 향상 장치와 음성 인식 시스템 및 방법 |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US9858925B2 (en) * | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US9798653B1 (en) * | 2010-05-05 | 2017-10-24 | Nuance Communications, Inc. | Methods, apparatus and data structure for cross-language speech adaptation |
US8442835B2 (en) * | 2010-06-17 | 2013-05-14 | At&T Intellectual Property I, L.P. | Methods, systems, and products for measuring health |
US8666768B2 (en) | 2010-07-27 | 2014-03-04 | At&T Intellectual Property I, L. P. | Methods, systems, and products for measuring health |
TWI412019B (zh) | 2010-12-03 | 2013-10-11 | Ind Tech Res Inst | 聲音事件偵測模組及其方法 |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US20120253784A1 (en) * | 2011-03-31 | 2012-10-04 | International Business Machines Corporation | Language translation based on nearby devices |
US20150149167A1 (en) * | 2011-03-31 | 2015-05-28 | Google Inc. | Dynamic selection among acoustic transforms |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
JP5978027B2 (ja) * | 2012-06-28 | 2016-08-24 | 本田技研工業株式会社 | 移動ロボットの制御装置 |
JP5966689B2 (ja) * | 2012-07-04 | 2016-08-10 | 日本電気株式会社 | 音響モデル適応装置、音響モデル適応方法および音響モデル適応プログラム |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
CN103903616B (zh) * | 2012-12-25 | 2017-12-29 | 联想(北京)有限公司 | 一种信息处理的方法及电子设备 |
US20140195233A1 (en) * | 2013-01-08 | 2014-07-10 | Spansion Llc | Distributed Speech Recognition System |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
EP3008641A1 (en) | 2013-06-09 | 2016-04-20 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
CN104700832B (zh) | 2013-12-09 | 2018-05-25 | 联发科技股份有限公司 | 语音关键字检测系统及方法 |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
KR102272453B1 (ko) | 2014-09-26 | 2021-07-02 | 삼성전자주식회사 | 음성 신호 전처리 방법 및 장치 |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
CN105355199B (zh) * | 2015-10-20 | 2019-03-12 | 河海大学 | 一种基于gmm噪声估计的模型组合语音识别方法 |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
JP6568813B2 (ja) * | 2016-02-23 | 2019-08-28 | Nttテクノクロス株式会社 | 情報処理装置、音声認識方法及びプログラム |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US10163437B1 (en) * | 2016-06-02 | 2018-12-25 | Amazon Technologies, Inc. | Training models using voice tags |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179588B1 (en) | 2016-06-09 | 2019-02-22 | Apple Inc. | INTELLIGENT AUTOMATED ASSISTANT IN A HOME ENVIRONMENT |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
KR102565274B1 (ko) * | 2016-07-07 | 2023-08-09 | 삼성전자주식회사 | 자동 통역 방법 및 장치, 및 기계 번역 방법 및 장치 |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US9959861B2 (en) * | 2016-09-30 | 2018-05-01 | Robert Bosch Gmbh | System and method for speech recognition |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
JP6226353B1 (ja) * | 2017-06-27 | 2017-11-08 | 株式会社ナレロー | リアルタイム習熟支援システム |
US11087766B2 (en) * | 2018-01-05 | 2021-08-10 | Uniphore Software Systems | System and method for dynamic speech recognition selection based on speech rate or business domain |
WO2019246314A1 (en) * | 2018-06-20 | 2019-12-26 | Knowles Electronics, Llc | Acoustic aware voice user interface |
CN108986811B (zh) * | 2018-08-31 | 2021-05-28 | 北京新能源汽车股份有限公司 | 一种语音识别的检测方法、装置和设备 |
US11438452B1 (en) | 2019-08-09 | 2022-09-06 | Apple Inc. | Propagating context information in a privacy preserving manner |
CN111144259B (zh) * | 2019-12-18 | 2022-12-23 | 重庆特斯联智慧科技股份有限公司 | 一种基于hmm模型的社区污染物处理方法和系统 |
CN111128141B (zh) * | 2019-12-31 | 2022-04-19 | 思必驰科技股份有限公司 | 音频识别解码方法和装置 |
US20210201928A1 (en) * | 2019-12-31 | 2021-07-01 | Knowles Electronics, Llc | Integrated speech enhancement for voice trigger application |
CN111461901B (zh) * | 2020-03-31 | 2023-05-12 | 德联易控科技(北京)有限公司 | 车辆保险理赔信息的输出方法和装置 |
US12002451B1 (en) * | 2021-07-01 | 2024-06-04 | Amazon Technologies, Inc. | Automatic speech recognition |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0094449A1 (en) * | 1982-05-19 | 1983-11-23 | Nissan Motor Co., Ltd. | Speech recognition system for an automotive vehicle |
US5081707A (en) * | 1989-08-08 | 1992-01-14 | Motorola, Inc. | Knowledge based radio |
EP0881625A2 (en) | 1997-05-27 | 1998-12-02 | AT&T Corp. | Multiple models integration for multi-environment speech recognition |
KR100336994B1 (ko) | 1999-07-23 | 2002-05-17 | 이계철 | 다단계 음성인식을 이용한 음성인식 포탈서비스 시스템 및 그 방법 |
US20020065584A1 (en) * | 2000-08-23 | 2002-05-30 | Andreas Kellner | Method of controlling devices via speech signals, more particularly, in motorcars |
US6418411B1 (en) | 1999-03-12 | 2002-07-09 | Texas Instruments Incorporated | Method and system for adaptive speech recognition in a noisy environment |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5730913A (en) * | 1980-08-01 | 1982-02-19 | Nissan Motor Co Ltd | Speech recognition response device for automobile |
JPH0573088A (ja) * | 1991-09-13 | 1993-03-26 | Toshiba Corp | 認識辞書の作成方法、認識辞書作成装置及び音声認識装置 |
JP3257832B2 (ja) * | 1992-09-04 | 2002-02-18 | 富士通テン株式会社 | 音声認識装置用騒音低減回路 |
JPH1011085A (ja) * | 1996-06-21 | 1998-01-16 | Matsushita Electric Ind Co Ltd | 音声認識方法 |
US6076056A (en) * | 1997-09-19 | 2000-06-13 | Microsoft Corporation | Speech recognition system for recognizing continuous and isolated speech |
JP2000075889A (ja) * | 1998-09-01 | 2000-03-14 | Oki Electric Ind Co Ltd | 音声認識システム及び音声認識方法 |
JP2000276188A (ja) * | 1999-03-24 | 2000-10-06 | Sony Corp | 音声認識装置、音声認識方法、音声認識用制御プログラムを記録した記録媒体、通信端末装置、通信方法、音声認識通信の制御用プログラムを記録した記録媒体、サーバ装置、音声認識用データの送受信方法及び音声認識用データの送受信制御プログラムを記録した記録媒体 |
US6789061B1 (en) * | 1999-08-25 | 2004-09-07 | International Business Machines Corporation | Method and system for generating squeezed acoustic models for specialized speech recognizer |
US6856956B2 (en) * | 2000-07-20 | 2005-02-15 | Microsoft Corporation | Method and apparatus for generating and displaying N-best alternatives in a speech recognition system |
DE60111329T2 (de) * | 2000-11-14 | 2006-03-16 | International Business Machines Corp. | Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung |
JP2002358093A (ja) * | 2001-05-31 | 2002-12-13 | Matsushita Electric Ind Co Ltd | 音声認識方法及び音声認識装置及びその記憶媒体 |
-
2003
- 2003-10-30 TW TW092130316A patent/TWI245259B/zh not_active IP Right Cessation
- 2003-10-31 US US10/539,454 patent/US7302393B2/en active Active
- 2003-10-31 WO PCT/EP2003/012168 patent/WO2004057574A1/en active Application Filing
- 2003-10-31 JP JP2004561151A patent/JP2006510933A/ja active Pending
- 2003-10-31 AU AU2003293646A patent/AU2003293646A1/en not_active Abandoned
- 2003-10-31 EP EP03788992.0A patent/EP1576581B1/en not_active Expired - Lifetime
- 2003-10-31 CN CNB200380106508XA patent/CN100552773C/zh not_active Expired - Fee Related
- 2003-10-31 CA CA2507999A patent/CA2507999C/en not_active Expired - Fee Related
- 2003-10-31 KR KR1020057009735A patent/KR100800367B1/ko not_active IP Right Cessation
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0094449A1 (en) * | 1982-05-19 | 1983-11-23 | Nissan Motor Co., Ltd. | Speech recognition system for an automotive vehicle |
US5081707A (en) * | 1989-08-08 | 1992-01-14 | Motorola, Inc. | Knowledge based radio |
EP0881625A2 (en) | 1997-05-27 | 1998-12-02 | AT&T Corp. | Multiple models integration for multi-environment speech recognition |
US6418411B1 (en) | 1999-03-12 | 2002-07-09 | Texas Instruments Incorporated | Method and system for adaptive speech recognition in a noisy environment |
KR100336994B1 (ko) | 1999-07-23 | 2002-05-17 | 이계철 | 다단계 음성인식을 이용한 음성인식 포탈서비스 시스템 및 그 방법 |
US20020065584A1 (en) * | 2000-08-23 | 2002-05-30 | Andreas Kellner | Method of controlling devices via speech signals, more particularly, in motorcars |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9626962B2 (en) | 2014-05-02 | 2017-04-18 | Samsung Electronics Co., Ltd. | Method and apparatus for recognizing speech, and method and apparatus for generating noise-speech recognition model |
Also Published As
Publication number | Publication date |
---|---|
CA2507999C (en) | 2013-09-03 |
JP2006510933A (ja) | 2006-03-30 |
CN1726532A (zh) | 2006-01-25 |
EP1576581A1 (en) | 2005-09-21 |
KR20050090389A (ko) | 2005-09-13 |
WO2004057574A1 (en) | 2004-07-08 |
US7302393B2 (en) | 2007-11-27 |
AU2003293646A1 (en) | 2004-07-14 |
US20060173684A1 (en) | 2006-08-03 |
CA2507999A1 (en) | 2004-07-08 |
CN100552773C (zh) | 2009-10-21 |
EP1576581B1 (en) | 2013-11-20 |
TWI245259B (en) | 2005-12-11 |
TW200421264A (en) | 2004-10-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100800367B1 (ko) | 음성 인식 시스템의 작동 방법, 컴퓨터 시스템 및 프로그램을 갖춘 컴퓨터 판독 가능 저장 매체 | |
EP1515305B1 (en) | Noise adaption for speech recognition | |
JP3581401B2 (ja) | 音声認識方法 | |
EP0966736B1 (en) | Method for discriminative training of speech recognition models | |
EP1557823B1 (en) | Method of setting posterior probability parameters for a switching state space model | |
US8515758B2 (en) | Speech recognition including removal of irrelevant information | |
US20080077404A1 (en) | Speech recognition device, speech recognition method, and computer program product | |
EP1465154B1 (en) | Method of speech recognition using variational inference with switching state space models | |
EP1385147A2 (en) | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes | |
JPWO2007105409A1 (ja) | 標準パタン適応装置、標準パタン適応方法および標準パタン適応プログラム | |
KR20110010233A (ko) | 진화 학습에 의한 화자 적응 장치 및 방법과 이를 이용한 음성인식 시스템 | |
JP3920749B2 (ja) | 音声認識用音響モデル作成方法、その装置、そのプログラムおよびその記録媒体、上記音響モデルを用いる音声認識装置 | |
JP2938866B1 (ja) | 統計的言語モデル生成装置及び音声認識装置 | |
EP1369847A1 (en) | Speech recognition method and system | |
WO2021106047A1 (ja) | 検知装置、その方法、およびプログラム | |
JPH10254485A (ja) | 話者正規化装置、話者適応化装置及び音声認識装置 | |
CN114446283A (zh) | 语音处理方法、装置、电子设备及存储介质 | |
WO2009122780A1 (ja) | 適応話者選択装置および適応話者選択方法並びに記録媒体 | |
JP2003005784A (ja) | 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体 | |
JPH10333697A (ja) | 音声認識方法及び音声認識装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E90F | Notification of reason for final refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20130107 Year of fee payment: 6 |
|
FPAY | Annual fee payment |
Payment date: 20140106 Year of fee payment: 7 |
|
FPAY | Annual fee payment |
Payment date: 20150106 Year of fee payment: 8 |
|
FPAY | Annual fee payment |
Payment date: 20160104 Year of fee payment: 9 |
|
FPAY | Annual fee payment |
Payment date: 20170123 Year of fee payment: 10 |
|
FPAY | Annual fee payment |
Payment date: 20180117 Year of fee payment: 11 |
|
LAPS | Lapse due to unpaid annual fee |