DE69523531D1 - Verfahren und Vorrichtung zur Analyse von Audioeingabevorgängen in einem Spracherkennungssystem - Google Patents

Verfahren und Vorrichtung zur Analyse von Audioeingabevorgängen in einem Spracherkennungssystem

Info

Publication number
DE69523531D1
DE69523531D1 DE69523531T DE69523531T DE69523531D1 DE 69523531 D1 DE69523531 D1 DE 69523531D1 DE 69523531 T DE69523531 T DE 69523531T DE 69523531 T DE69523531 T DE 69523531T DE 69523531 D1 DE69523531 D1 DE 69523531D1
Authority
DE
Germany
Prior art keywords
speech recognition
audio input
recognition system
input processes
analyzing audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69523531T
Other languages
English (en)
Other versions
DE69523531T2 (de
Inventor
Marvin L Williams
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Application granted granted Critical
Publication of DE69523531D1 publication Critical patent/DE69523531D1/de
Publication of DE69523531T2 publication Critical patent/DE69523531T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)
DE69523531T 1994-08-16 1995-08-08 Verfahren und Vorrichtung zur Analyse von Audioeingabevorgängen in einem Spracherkennungssystem Expired - Lifetime DE69523531T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/291,372 US5764852A (en) 1994-08-16 1994-08-16 Method and apparatus for speech recognition for distinguishing non-speech audio input events from speech audio input events

Publications (2)

Publication Number Publication Date
DE69523531D1 true DE69523531D1 (de) 2001-12-06
DE69523531T2 DE69523531T2 (de) 2002-05-23

Family

ID=23120039

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69523531T Expired - Lifetime DE69523531T2 (de) 1994-08-16 1995-08-08 Verfahren und Vorrichtung zur Analyse von Audioeingabevorgängen in einem Spracherkennungssystem

Country Status (3)

Country Link
US (1) US5764852A (de)
EP (1) EP0702351B1 (de)
DE (1) DE69523531T2 (de)

Families Citing this family (67)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL112513A (en) * 1995-02-01 1999-05-09 Ald Advanced Logistics Dev Ltd System and method for failure reporting and collection
JP2952223B2 (ja) 1997-10-09 1999-09-20 オリンパス光学工業株式会社 コードイメージ記録装置
JP2945887B2 (ja) * 1997-10-09 1999-09-06 オリンパス光学工業株式会社 コードイメージ記録装置
JPH11143485A (ja) * 1997-11-14 1999-05-28 Oki Electric Ind Co Ltd 音声認識方法及び音声認識装置
JP4438028B2 (ja) * 1998-07-27 2010-03-24 キヤノン株式会社 情報処理装置及びその方法、及びそのプログラムを記憶した記憶媒体
US6594632B1 (en) * 1998-11-02 2003-07-15 Ncr Corporation Methods and apparatus for hands-free operation of a voice recognition system
JP3157788B2 (ja) * 1998-11-12 2001-04-16 埼玉日本電気株式会社 携帯型情報端末
US6185527B1 (en) 1999-01-19 2001-02-06 International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
US6324499B1 (en) * 1999-03-08 2001-11-27 International Business Machines Corp. Noise recognizer for speech recognition systems
US7283953B2 (en) * 1999-09-20 2007-10-16 International Business Machines Corporation Process for identifying excess noise in a computer system
US7330815B1 (en) 1999-10-04 2008-02-12 Globalenglish Corporation Method and system for network-based speech recognition
US6415258B1 (en) * 1999-10-06 2002-07-02 Microsoft Corporation Background audio recovery system
US7123166B1 (en) 2000-11-17 2006-10-17 Haynes Michael N Method for managing a parking lot
US6816085B1 (en) 2000-01-14 2004-11-09 Michael N. Haynes Method for managing a parking lot
US20020032691A1 (en) * 2000-05-26 2002-03-14 Infolibria, Inc. High performance efficient subsystem for data object storage
US7165134B1 (en) * 2000-06-28 2007-01-16 Intel Corporation System for selectively generating real-time interrupts and selectively processing associated data when it has higher priority than currently executing non-real-time operation
US7020292B1 (en) 2001-12-27 2006-03-28 Bellsouth Intellectual Property Corporation Apparatuses and methods for recognizing an audio input and muting an audio device
US7392183B2 (en) * 2002-12-27 2008-06-24 Intel Corporation Schedule event context for speech recognition
US9704502B2 (en) * 2004-07-30 2017-07-11 Invention Science Fund I, Llc Cue-aware privacy filter for participants in persistent communications
US9779750B2 (en) * 2004-07-30 2017-10-03 Invention Science Fund I, Llc Cue-aware privacy filter for participants in persistent communications
US8677377B2 (en) 2005-09-08 2014-03-18 Apple Inc. Method and apparatus for building an intelligent automated assistant
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US8996376B2 (en) 2008-04-05 2015-03-31 Apple Inc. Intelligent text-to-speech conversion
JPWO2009150894A1 (ja) * 2008-06-10 2011-11-10 日本電気株式会社 音声認識システム、音声認識方法および音声認識用プログラム
CN101877223A (zh) * 2009-04-29 2010-11-03 鸿富锦精密工业(深圳)有限公司 影音编辑系统、方法及具有该影音编辑系统的电子设备
US10241644B2 (en) 2011-06-03 2019-03-26 Apple Inc. Actionable reminder entries
US10241752B2 (en) 2011-09-30 2019-03-26 Apple Inc. Interface for a virtual digital assistant
US9431006B2 (en) 2009-07-02 2016-08-30 Apple Inc. Methods and apparatuses for automatic speech recognition
US8682667B2 (en) 2010-02-25 2014-03-25 Apple Inc. User profiling for selecting user specific voice input processing information
WO2011140221A1 (en) * 2010-05-04 2011-11-10 Shazam Entertainment Ltd. Methods and systems for synchronizing media
US9721563B2 (en) 2012-06-08 2017-08-01 Apple Inc. Name recognition system
US9547647B2 (en) 2012-09-19 2017-01-17 Apple Inc. Voice-based media searching
US9263059B2 (en) * 2012-09-28 2016-02-16 International Business Machines Corporation Deep tagging background noises
DE212014000045U1 (de) 2013-02-07 2015-09-24 Apple Inc. Sprach-Trigger für einen digitalen Assistenten
AU2015101078B4 (en) * 2013-02-07 2016-04-14 Apple Inc. Voice trigger for a digital assistant
WO2014197334A2 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9430463B2 (en) 2014-05-30 2016-08-30 Apple Inc. Exemplar-based natural language processing
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
CN104267922B (zh) * 2014-09-16 2019-05-31 联想(北京)有限公司 一种信息处理方法及电子设备
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9865256B2 (en) 2015-02-27 2018-01-09 Storz Endoskop Produktions Gmbh System and method for calibrating a speech recognition system to an operating environment
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
DK179309B1 (en) 2016-06-09 2018-04-23 Apple Inc Intelligent automated assistant in a home environment
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10586535B2 (en) 2016-06-10 2020-03-10 Apple Inc. Intelligent digital assistant in a multi-tasking environment
DK179415B1 (en) 2016-06-11 2018-06-14 Apple Inc Intelligent device arbitration and control
DK179049B1 (en) 2016-06-11 2017-09-18 Apple Inc Data driven natural language event detection and classification
DK179343B1 (en) 2016-06-11 2018-05-14 Apple Inc Intelligent task discovery
DK201670540A1 (en) 2016-06-11 2018-01-08 Apple Inc Application integration with a digital assistant
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
DK179745B1 (en) 2017-05-12 2019-05-01 Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK201770431A1 (en) 2017-05-15 2018-12-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
CN110990534B (zh) * 2019-11-29 2024-02-06 北京搜狗科技发展有限公司 一种数据处理方法、装置和用于数据处理的装置

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4383135A (en) * 1980-01-23 1983-05-10 Scott Instruments Corporation Method and apparatus for speech recognition
JPS5870292A (ja) * 1981-10-22 1983-04-26 日産自動車株式会社 車両用音声認識装置
US4852181A (en) * 1985-09-26 1989-07-25 Oki Electric Industry Co., Ltd. Speech recognition for recognizing the catagory of an input speech pattern
US4797924A (en) * 1985-10-25 1989-01-10 Nartron Corporation Vehicle voice recognition method and apparatus
US4827520A (en) * 1987-01-16 1989-05-02 Prince Corporation Voice actuated control system for use in a vehicle
JP3088739B2 (ja) * 1989-10-06 2000-09-18 株式会社リコー 音声認識システム
JP2964518B2 (ja) * 1990-01-30 1999-10-18 日本電気株式会社 音声制御方式
US5274739A (en) * 1990-05-22 1993-12-28 Rockwell International Corporation Product code memory Itakura-Saito (MIS) measure for sound recognition
US5209695A (en) * 1991-05-13 1993-05-11 Omri Rothschild Sound controllable apparatus particularly useful in controlling toys and robots
JPH04362698A (ja) * 1991-06-11 1992-12-15 Canon Inc 音声認識方法及び装置
WO1994002936A1 (en) * 1992-07-17 1994-02-03 Voice Powered Technology International, Inc. Voice recognition apparatus and method
CA2115210C (en) * 1993-04-21 1997-09-23 Joseph C. Andreshak Interactive computer system recognizing spoken commands

Also Published As

Publication number Publication date
US5764852A (en) 1998-06-09
EP0702351A2 (de) 1996-03-20
DE69523531T2 (de) 2002-05-23
EP0702351A3 (de) 1997-10-22
EP0702351B1 (de) 2001-10-31

Similar Documents

Publication Publication Date Title
DE69523531T2 (de) Verfahren und Vorrichtung zur Analyse von Audioeingabevorgängen in einem Spracherkennungssystem
DE69518705D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69717899T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69324629T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69817844D1 (de) Verfahren und vorrichtung zur spracherkennungscomputereingabe
DE69726235D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69625950T2 (de) Verfahren und Vorrichtung zur Spracherkennung und Übersetzungssystem
DE59707384D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69732041D1 (de) Verfahren und vorrichtung zur analyse von bitströmen
DE69828141D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69519887T2 (de) Verfahren und Vorrichtung zur Verarbeitung von Sprachinformation
DE69410753D1 (de) Vorrichtung und Verfahren zur Analyse eines Verarbeitungsystems
DE69806557D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69923253D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69830017D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69520559T2 (de) Verfahren und Vorrichtung für die Identifikation eines Anästhetikums in einem anästhetischen System
DE69726685T2 (de) Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung
DE69715071D1 (de) Verfahren und Vorrichtung zur Sprachverarbeitung
DE69618408D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69517829T2 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69715281T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69620304D1 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69431520T2 (de) Verfahren und vorrichtung zur verminderung von audiosignalverschlechterungen in einem kommunikationssystem
DE69631467D1 (de) Verfahren und vorrichtung zur abtrennung von argon

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8320 Willingness to grant licences declared (paragraph 23)
8328 Change in the person/name/address of the agent

Representative=s name: DUSCHER, R., DIPL.-PHYS. DR.RER.NAT., PAT.-ANW., 7