ATE465485T1 - Verbesserung der spracherkennung von mobilgeräten - Google Patents

Verbesserung der spracherkennung von mobilgeräten

Info

Publication number
ATE465485T1
ATE465485T1 AT03739083T AT03739083T ATE465485T1 AT E465485 T1 ATE465485 T1 AT E465485T1 AT 03739083 T AT03739083 T AT 03739083T AT 03739083 T AT03739083 T AT 03739083T AT E465485 T1 ATE465485 T1 AT E465485T1
Authority
AT
Austria
Prior art keywords
location information
mobile devices
voice recognition
information
improved voice
Prior art date
Application number
AT03739083T
Other languages
English (en)
Inventor
Michael Deisher
Robert Knauerhase
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Application granted granted Critical
Publication of ATE465485T1 publication Critical patent/ATE465485T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72457User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to geographic location
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means
AT03739083T 2002-06-20 2003-06-10 Verbesserung der spracherkennung von mobilgeräten ATE465485T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/176,326 US7224981B2 (en) 2002-06-20 2002-06-20 Speech recognition of mobile devices
PCT/US2003/018408 WO2004001719A1 (en) 2002-06-20 2003-06-10 Improving speech recognition of mobile devices

Publications (1)

Publication Number Publication Date
ATE465485T1 true ATE465485T1 (de) 2010-05-15

Family

ID=29734126

Family Applications (1)

Application Number Title Priority Date Filing Date
AT03739083T ATE465485T1 (de) 2002-06-20 2003-06-10 Verbesserung der spracherkennung von mobilgeräten

Country Status (9)

Country Link
US (1) US7224981B2 (de)
EP (1) EP1514259B1 (de)
KR (2) KR20070065893A (de)
CN (1) CN1692407B (de)
AT (1) ATE465485T1 (de)
AU (1) AU2003245443A1 (de)
DE (1) DE60332236D1 (de)
TW (1) TWI229984B (de)
WO (1) WO2004001719A1 (de)

Families Citing this family (79)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2003279994A1 (en) * 2002-10-21 2004-05-13 John P. Sinisi System and method for mobile data collection
GB2409560B (en) * 2003-12-23 2007-07-25 Ibm Interactive speech recognition model
US8589156B2 (en) * 2004-07-12 2013-11-19 Hewlett-Packard Development Company, L.P. Allocation of speech recognition tasks and combination of results thereof
US20060074660A1 (en) 2004-09-29 2006-04-06 France Telecom Method and apparatus for enhancing speech recognition accuracy by using geographic data to filter a set of words
US7522065B2 (en) * 2004-10-15 2009-04-21 Microsoft Corporation Method and apparatus for proximity sensing in a portable electronic device
US20060095266A1 (en) * 2004-11-01 2006-05-04 Mca Nulty Megan Roaming user profiles for speech recognition
US7440894B2 (en) * 2005-08-09 2008-10-21 International Business Machines Corporation Method and system for creation of voice training profiles with multiple methods with uniform server mechanism using heterogeneous devices
US20070041589A1 (en) * 2005-08-17 2007-02-22 Gennum Corporation System and method for providing environmental specific noise reduction algorithms
US8214208B2 (en) * 2006-09-28 2012-07-03 Reqall, Inc. Method and system for sharing portable voice profiles
US20080147411A1 (en) * 2006-12-19 2008-06-19 International Business Machines Corporation Adaptation of a speech processing system from external input that is not directly related to sounds in an operational acoustic environment
US8345832B2 (en) * 2009-01-09 2013-01-01 Microsoft Corporation Enhanced voicemail usage through automatic voicemail preview
EP4318463A3 (de) 2009-12-23 2024-02-28 Google LLC Multimodale eingabe in eine elektronische vorrichtung
US11416214B2 (en) 2009-12-23 2022-08-16 Google Llc Multi-modal input on an electronic device
US9112989B2 (en) 2010-04-08 2015-08-18 Qualcomm Incorporated System and method of smart audio logging for mobile devices
US8265928B2 (en) 2010-04-14 2012-09-11 Google Inc. Geotagged environmental audio for enhanced speech recognition accuracy
US8468012B2 (en) * 2010-05-26 2013-06-18 Google Inc. Acoustic model adaptation using geographic information
US8359020B2 (en) 2010-08-06 2013-01-22 Google Inc. Automatically monitoring for voice input based on context
KR101165537B1 (ko) * 2010-10-27 2012-07-16 삼성에스디에스 주식회사 사용자 장치 및 그의 사용자의 상황 인지 방법
US8352245B1 (en) 2010-12-30 2013-01-08 Google Inc. Adjusting language models
KR101791907B1 (ko) * 2011-01-04 2017-11-02 삼성전자주식회사 위치 기반의 음향 처리 장치 및 방법
US8296142B2 (en) 2011-01-21 2012-10-23 Google Inc. Speech recognition using dock context
US9298287B2 (en) 2011-03-31 2016-03-29 Microsoft Technology Licensing, Llc Combined activation for natural user interface systems
US9842168B2 (en) 2011-03-31 2017-12-12 Microsoft Technology Licensing, Llc Task driven user intents
US9244984B2 (en) 2011-03-31 2016-01-26 Microsoft Technology Licensing, Llc Location based conversational understanding
KR101922744B1 (ko) * 2011-03-31 2018-11-27 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 위치-기반 대화 해석 기법
US9858343B2 (en) 2011-03-31 2018-01-02 Microsoft Technology Licensing Llc Personalization of queries, conversations, and searches
US9760566B2 (en) 2011-03-31 2017-09-12 Microsoft Technology Licensing, Llc Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof
US10642934B2 (en) 2011-03-31 2020-05-05 Microsoft Technology Licensing, Llc Augmented conversational understanding architecture
US9454962B2 (en) 2011-05-12 2016-09-27 Microsoft Technology Licensing, Llc Sentence simplification for spoken language understanding
US9064006B2 (en) 2012-08-23 2015-06-23 Microsoft Technology Licensing, Llc Translating natural language utterances to keyword search queries
US9245254B2 (en) 2011-12-01 2016-01-26 Elwha Llc Enhanced voice conferencing with history, language translation and identification
US10875525B2 (en) 2011-12-01 2020-12-29 Microsoft Technology Licensing Llc Ability enhancement
US8934652B2 (en) 2011-12-01 2015-01-13 Elwha Llc Visual presentation of speaker-related information
US9159236B2 (en) 2011-12-01 2015-10-13 Elwha Llc Presentation of shared threat information in a transportation-related context
US9107012B2 (en) 2011-12-01 2015-08-11 Elwha Llc Vehicular threat detection based on audio signals
US9368028B2 (en) 2011-12-01 2016-06-14 Microsoft Technology Licensing, Llc Determining threats based on information from road-based devices in a transportation-related context
US9064152B2 (en) 2011-12-01 2015-06-23 Elwha Llc Vehicular threat detection based on image analysis
US8811638B2 (en) * 2011-12-01 2014-08-19 Elwha Llc Audible assistance
US9053096B2 (en) 2011-12-01 2015-06-09 Elwha Llc Language translation based on speaker-related information
CN104025188B (zh) * 2011-12-29 2016-09-07 英特尔公司 声学信号修改
US9502029B1 (en) * 2012-06-25 2016-11-22 Amazon Technologies, Inc. Context-aware speech processing
EP2867890B1 (de) * 2012-06-28 2018-04-25 Nuance Communications, Inc. Metadateneingaben in die vorverarbeitung für automatische spracherkennung
US8831957B2 (en) 2012-08-01 2014-09-09 Google Inc. Speech recognition models based on location indicia
US9734819B2 (en) 2013-02-21 2017-08-15 Google Technology Holdings LLC Recognizing accented speech
US9401749B2 (en) 2013-03-08 2016-07-26 Google Technology Holdings LLC Method for codebook enhancement for multi-user multiple-input multiple-output systems
US9237225B2 (en) 2013-03-12 2016-01-12 Google Technology Holdings LLC Apparatus with dynamic audio signal pre-conditioning and methods therefor
US20140278415A1 (en) * 2013-03-12 2014-09-18 Motorola Mobility Llc Voice Recognition Configuration Selector and Method of Operation Therefor
US20140278393A1 (en) 2013-03-12 2014-09-18 Motorola Mobility Llc Apparatus and Method for Power Efficient Signal Conditioning for a Voice Recognition System
US9185199B2 (en) 2013-03-12 2015-11-10 Google Technology Holdings LLC Method and apparatus for acoustically characterizing an environment in which an electronic device resides
US20140270249A1 (en) 2013-03-12 2014-09-18 Motorola Mobility Llc Method and Apparatus for Estimating Variability of Background Noise for Noise Suppression
CN103258533B (zh) * 2013-05-27 2015-05-13 重庆邮电大学 远距离语音识别中的模型域补偿新方法
US9282096B2 (en) 2013-08-31 2016-03-08 Steven Goldstein Methods and systems for voice authentication service leveraging networking
US10405163B2 (en) * 2013-10-06 2019-09-03 Staton Techiya, Llc Methods and systems for establishing and maintaining presence information of neighboring bluetooth devices
US9299340B2 (en) * 2013-10-07 2016-03-29 Honeywell International Inc. System and method for correcting accent induced speech in an aircraft cockpit utilizing a dynamic speech database
CN104575494A (zh) * 2013-10-16 2015-04-29 中兴通讯股份有限公司 一种语音处理的方法和终端
CN104601764A (zh) * 2013-10-31 2015-05-06 中兴通讯股份有限公司 移动终端的噪音处理方法、装置及系统
CN103632666B (zh) * 2013-11-14 2016-09-28 华为技术有限公司 语音识别方法、语音识别设备和电子设备
CN103680493A (zh) * 2013-12-19 2014-03-26 百度在线网络技术(北京)有限公司 区分地域性口音的语音数据识别方法和装置
US9842592B2 (en) 2014-02-12 2017-12-12 Google Inc. Language models using non-linguistic context
US9412365B2 (en) 2014-03-24 2016-08-09 Google Inc. Enhanced maximum entropy models
KR102257910B1 (ko) 2014-05-02 2021-05-27 삼성전자주식회사 음성 인식 장치 및 방법, 잡음-음성 인식 모델 생성 장치 및 방법
US9904851B2 (en) 2014-06-11 2018-02-27 At&T Intellectual Property I, L.P. Exploiting visual information for enhancing audio signals via source separation and beamforming
US9257120B1 (en) 2014-07-18 2016-02-09 Google Inc. Speaker verification using co-location information
US11942095B2 (en) 2014-07-18 2024-03-26 Google Llc Speaker verification using co-location information
US11676608B2 (en) 2021-04-02 2023-06-13 Google Llc Speaker verification using co-location information
US10134394B2 (en) 2015-03-20 2018-11-20 Google Llc Speech recognition using log-linear model
US9801219B2 (en) 2015-06-15 2017-10-24 Microsoft Technology Licensing, Llc Pairing of nearby devices using a synchronized cue signal
US10044798B2 (en) 2016-02-05 2018-08-07 International Business Machines Corporation Context-aware task offloading among multiple devices
US10484484B2 (en) 2016-02-05 2019-11-19 International Business Machines Corporation Context-aware task processing for multiple devices
US9978367B2 (en) 2016-03-16 2018-05-22 Google Llc Determining dialog states for language models
KR102565274B1 (ko) 2016-07-07 2023-08-09 삼성전자주식회사 자동 통역 방법 및 장치, 및 기계 번역 방법 및 장치
US10832664B2 (en) 2016-08-19 2020-11-10 Google Llc Automated speech recognition using language models that selectively use domain-specific model components
US9972320B2 (en) 2016-08-24 2018-05-15 Google Llc Hotword detection on multiple devices
US10429817B2 (en) 2016-12-19 2019-10-01 Honeywell International Inc. Voice control of components of a facility
US10311860B2 (en) 2017-02-14 2019-06-04 Google Llc Language model biasing system
US10522137B2 (en) 2017-04-20 2019-12-31 Google Llc Multi-user authentication on a device
KR102424514B1 (ko) 2017-12-04 2022-07-25 삼성전자주식회사 언어 처리 방법 및 장치
CN110047478B (zh) * 2018-01-16 2021-06-08 中国科学院声学研究所 基于空间特征补偿的多通道语音识别声学建模方法及装置
TWI698857B (zh) 2018-11-21 2020-07-11 財團法人工業技術研究院 語音辨識系統及其方法、與電腦程式產品

Family Cites Families (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5263019A (en) * 1991-01-04 1993-11-16 Picturetel Corporation Method and apparatus for estimating the level of acoustic feedback between a loudspeaker and microphone
GB2252023B (en) * 1991-01-21 1995-01-18 Mitsubishi Electric Corp Acoustic system
US5297183A (en) * 1992-04-13 1994-03-22 Vcs Industries, Inc. Speech recognition system for electronic switches in a cellular telephone or personal communication network
JP2602158B2 (ja) * 1992-12-04 1997-04-23 株式会社エクォス・リサーチ 音声出力装置
US5384892A (en) * 1992-12-31 1995-01-24 Apple Computer, Inc. Dynamic language model for speech recognition
US5524169A (en) * 1993-12-30 1996-06-04 International Business Machines Incorporated Method and system for location-specific speech recognition
US5835667A (en) * 1994-10-14 1998-11-10 Carnegie Mellon University Method and apparatus for creating a searchable digital video library and a system and method of using such a library
DE4440598C1 (de) * 1994-11-14 1996-05-23 Siemens Ag Durch gesprochene Worte steuerbares Hypertext-Navigationssystem, Hypertext-Dokument für dieses Navigationssystem und Verfahren zur Erzeugung eines derartigen Dokuments
US6978159B2 (en) * 1996-06-19 2005-12-20 Board Of Trustees Of The University Of Illinois Binaural signal processing using multiple acoustic sensors and digital filtering
KR20000022231A (ko) * 1996-06-27 2000-04-25 조나단 피. 메이어 통신 시스템에서의 위치 결정
US6072881A (en) * 1996-07-08 2000-06-06 Chiefs Voice Incorporated Microphone noise rejection system
US6236365B1 (en) * 1996-09-09 2001-05-22 Tracbeam, Llc Location of a mobile station using a plurality of commercial wireless infrastructures
US6272457B1 (en) * 1996-09-16 2001-08-07 Datria Systems, Inc. Spatial asset management system that time-tags and combines captured speech data and captured location data using a predifed reference grammar with a semantic relationship structure
JPH10143191A (ja) * 1996-11-13 1998-05-29 Hitachi Ltd 音声認識システム
US5897616A (en) * 1997-06-11 1999-04-27 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US5953700A (en) * 1997-06-11 1999-09-14 International Business Machines Corporation Portable acoustic interface for remote access to automatic speech/speaker recognition server
US5991385A (en) * 1997-07-16 1999-11-23 International Business Machines Corporation Enhanced audio teleconferencing with sound field effect
US5970446A (en) * 1997-11-25 1999-10-19 At&T Corp Selective noise/channel/coding models and recognizers for automatic speech recognition
JP4154015B2 (ja) * 1997-12-10 2008-09-24 キヤノン株式会社 情報処理装置およびその方法
US6125115A (en) * 1998-02-12 2000-09-26 Qsound Labs, Inc. Teleconferencing method and apparatus with three-dimensional sound positioning
JP3722335B2 (ja) * 1998-02-17 2005-11-30 ヤマハ株式会社 残響付加装置
US6223156B1 (en) * 1998-04-07 2001-04-24 At&T Corp. Speech recognition of caller identifiers using location information
US6184829B1 (en) 1999-01-08 2001-02-06 Trueposition, Inc. Calibration for wireless location system
US6574601B1 (en) * 1999-01-13 2003-06-03 Lucent Technologies Inc. Acoustic speech recognizer system and method
US20030060211A1 (en) * 1999-01-26 2003-03-27 Vincent Chern Location-based information retrieval system for wireless communication device
EP1119158A1 (de) * 1999-07-28 2001-07-25 Mitsubishi Denki Kabushiki Kaisha Zellulares telefon
JP2001075594A (ja) * 1999-08-31 2001-03-23 Pioneer Electronic Corp 音声認識システム
US6937977B2 (en) * 1999-10-05 2005-08-30 Fastmobile, Inc. Method and apparatus for processing an input speech signal during presentation of an output audio signal
JP4415432B2 (ja) * 1999-10-08 2010-02-17 トヨタ自動車株式会社 手動バルブ
JP3376487B2 (ja) * 1999-10-27 2003-02-10 独立行政法人産業技術総合研究所 言い淀み検出方法及び装置
US6449593B1 (en) * 2000-01-13 2002-09-10 Nokia Mobile Phones Ltd. Method and system for tracking human speakers
US6850766B2 (en) * 2000-04-26 2005-02-01 Wirenix, Inc. Voice activated wireless locator service
KR20010106799A (ko) * 2000-05-23 2001-12-07 류정열 자동차용 음성 인식 장치
US6624922B1 (en) * 2000-06-02 2003-09-23 Northrop Grumman Corporation Electro-optic device for adding/subtracting optical signals
US7047196B2 (en) * 2000-06-08 2006-05-16 Agiletv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
US6230138B1 (en) * 2000-06-28 2001-05-08 Visteon Global Technologies, Inc. Method and apparatus for controlling multiple speech engines in an in-vehicle speech recognition system
KR20020006357A (ko) 2000-07-12 2002-01-19 유영욱 구역별 정보 제공 서비스 방법 및 시스템
JP4283984B2 (ja) * 2000-10-12 2009-06-24 パイオニア株式会社 音声認識装置ならびに方法
US20020072917A1 (en) * 2000-12-11 2002-06-13 Irvin David Rand Method and apparatus for speech recognition incorporating location information
US20020097884A1 (en) * 2001-01-25 2002-07-25 Cairns Douglas A. Variable noise reduction algorithm based on vehicle conditions
US6810380B1 (en) * 2001-03-28 2004-10-26 Bellsouth Intellectual Property Corporation Personal safety enhancement for communication devices
US6785647B2 (en) * 2001-04-20 2004-08-31 William R. Hutchison Speech recognition system with network accessible speech processing resources
US7209881B2 (en) 2001-12-20 2007-04-24 Matsushita Electric Industrial Co., Ltd. Preparing acoustic models by sufficient statistics and noise-superimposed speech data
US6853907B2 (en) * 2002-03-21 2005-02-08 General Motors Corporation Method and system for communicating vehicle location information
EP1505571A4 (de) * 2002-04-12 2007-02-21 Mitsubishi Electric Corp Autonavigationssystem und spracherkennungseinrichtung dafür

Also Published As

Publication number Publication date
KR100830251B1 (ko) 2008-05-16
DE60332236D1 (de) 2010-06-02
KR20070065893A (ko) 2007-06-25
CN1692407A (zh) 2005-11-02
TWI229984B (en) 2005-03-21
AU2003245443A1 (en) 2004-01-06
US20030236099A1 (en) 2003-12-25
US7224981B2 (en) 2007-05-29
EP1514259B1 (de) 2010-04-21
WO2004001719A1 (en) 2003-12-31
CN1692407B (zh) 2012-04-04
EP1514259A1 (de) 2005-03-16
TW200412730A (en) 2004-07-16
KR20050007429A (ko) 2005-01-17

Similar Documents

Publication Publication Date Title
ATE465485T1 (de) Verbesserung der spracherkennung von mobilgeräten
ATE531033T1 (de) System und verfahren zur verteilung einer spracherkennungsgrammatik
WO2004100638A3 (en) Source-dependent text-to-speech system
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
WO2002035746A3 (en) Method and arrangement for enabling disintermediation, and receiver for use thereby
ATE541287T1 (de) Rechnerisch effizienter hintergrundrauschunterdrücker für die sprachcodierung und spracherkennung
ATE312398T1 (de) Sprecheranpassung für die spracherkennung
IT1319318B1 (it) Procedimento e sistema di attivazione di telefono portatile mediantericonoscimento di voce
DE60302407D1 (de) Umgebungs- und sprecheradaptierte Spracherkennung
RU2012144640A (ru) Поддержание контекстной информации между пользовательскими взаимодействиями с голосовым помощником
DE602004026357D1 (de) Verwendung des öffentlichen fernsprechwählnetzes zur erfassung elektronischer signaturen bei online-transaktionen
DE602004030021D1 (de) System und verfahren zur netzwerk-verwaltung auf voice-basis
DE60020660D1 (de) Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung
WO2005072336A3 (en) Method for aiding and enhancing verbal communications
GB2454143A (en) A method and arrangement for providing location information on a communication terminal
ATE347162T1 (de) Rauschunterdrückung zur robusten spracherkennung
ATE514162T1 (de) Dynamische erzeugung von kontexten zur spracherkennung
ATE433181T1 (de) Verteiltes spracherkennungsverfahren
DE602006019099D1 (de) Sprachanalysesystem
DE60015383D1 (de) Tragbare Kommunikationsvorrichtung und Verfahren
ATE539404T1 (de) Mobilteil mit fehlertoleranter aktualisierung
ATE441918T1 (de) Sprachdialogverfahren und -system
ATE486453T1 (de) Verfahren und system zur implementierung von fernsprechdiensten unter verwendung von sprach- xml
ATE332827T1 (de) Kommunikationsplattform in einem kraftfahrzeug
DE60206658D1 (de) Interaktive sprachdiensten

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties