ATE429697T1 - Method of producing alternate utterance hypotheses using auxiliary information on close competitors

Info

Publication number
ATE429697T1
Authority
AT
Austria
Prior art keywords
partial
histories
hypotheses
close call
history
Application number
AT04713366T
Other languages
English (en)
Inventor
Robert Roth
Arkady Khasin
Laurence Gillick
Original Assignee
Voice Signal Technologies Inc
Priority date
2003-02-21
Filing date
2004-02-20
Publication date
2009-05-15
Application filed by Voice Signal Technologies Inc

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • G10L2015/085 Methods for reducing search complexity, pruning

Landscapes

  • Engineering & Computer Science
  • Computational Linguistics
  • Health & Medical Sciences
  • Audiology, Speech & Language Pathology
  • Human Computer Interaction
  • Physics & Mathematics
  • Acoustics & Sound
  • Multimedia
  • Telephone Function
  • Stereophonic System
  • Circuits Of Receivers In General
  • Devices For Checking Fares Or Tickets At Control Points
  • Debugging And Monitoring
  • User Interface Of Digital Computer
AT04713366T 2003-02-21 2004-02-20 Method of producing alternate utterance hypotheses using auxiliary information on close competitors ATE429697T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US44919503P 2003-02-21 2003-02-21
PCT/US2004/005187 WO2004077404A1 (en) 2003-02-21 2004-02-20 Method of producing alternate utterance hypotheses using auxiliary information on close competitors

Publications (1)

Publication Number Publication Date
ATE429697T1 (de) 2009-05-15

Family

ID=32927501

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04713366T ATE429697T1 (de) 2003-02-21 2004-02-20 Method of producing alternate utterance hypotheses using auxiliary information on close competitors

Country Status (5)

Country Link
US (1) US7676367B2 (de)
EP (1) EP1595245B1 (de)
AT (1) ATE429697T1 (de)
DE (1) DE602004020738D1 (de)
WO (1) WO2004077404A1 (de)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7831428B2 (en) * 2005-11-09 2010-11-09 Microsoft Corporation Speech index pruning
US7831425B2 (en) * 2005-12-15 2010-11-09 Microsoft Corporation Time-anchored posterior indexing of speech
US20100070263A1 (en) * 2006-11-30 2010-03-18 National Institute Of Advanced Industrial Science And Technology Speech data retrieving web site system
US8149999B1 (en) * 2006-12-22 2012-04-03 Tellme Networks, Inc. Generating reference variations
US20090030685A1 (en) * 2007-03-07 2009-01-29 Cerra Joseph P Using speech recognition results based on an unstructured language model with a navigation system
US20090030697A1 (en) * 2007-03-07 2009-01-29 Cerra Joseph P Using contextual information for delivering results generated from a speech recognition facility using an unstructured language model
US8949130B2 (en) * 2007-03-07 2015-02-03 Vlingo Corporation Internal and external speech recognition use with a mobile communication facility
US8838457B2 (en) * 2007-03-07 2014-09-16 Vlingo Corporation Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility
US20110054897A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Transmitting signal quality information in mobile dictation application
US20110054898A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Multiple web-based content search user interface in mobile search application
US8886545B2 (en) * 2007-03-07 2014-11-11 Vlingo Corporation Dealing with switch latency in speech recognition
US20090030687A1 (en) * 2007-03-07 2009-01-29 Cerra Joseph P Adapting an unstructured language model speech recognition system based on usage
US8635243B2 (en) * 2007-03-07 2014-01-21 Research In Motion Limited Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application
US20080221880A1 (en) * 2007-03-07 2008-09-11 Cerra Joseph P Mobile music environment speech processing facility
US8886540B2 (en) * 2007-03-07 2014-11-11 Vlingo Corporation Using speech recognition results based on an unstructured language model in a mobile communication facility application
US20090030688A1 (en) * 2007-03-07 2009-01-29 Cerra Joseph P Tagging speech recognition results based on an unstructured language model for use in a mobile communication facility application
US20110054899A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Command and control utilizing content information in a mobile voice-to-speech application
US10056077B2 (en) * 2007-03-07 2018-08-21 Nuance Communications, Inc. Using speech recognition results based on an unstructured language model with a music system
US8880405B2 (en) * 2007-03-07 2014-11-04 Vlingo Corporation Application text entry in a mobile environment using a speech processing facility
US20110060587A1 (en) * 2007-03-07 2011-03-10 Phillips Michael S Command and control utilizing ancillary information in a mobile voice-to-speech application
US20110054896A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Sending a communications header with voice recording to send metadata for use in speech recognition and formatting in mobile dictation application
US20110054895A1 (en) * 2007-03-07 2011-03-03 Phillips Michael S Utilizing user transmitted text to improve language model in mobile dictation application
US8949266B2 (en) * 2007-03-07 2015-02-03 Vlingo Corporation Multiple web-based content category searching in mobile search application
US8712774B2 (en) * 2009-03-30 2014-04-29 Nuance Communications, Inc. Systems and methods for generating a hybrid text string from two or more text strings generated by multiple automated speech recognition systems
US8676580B2 (en) * 2011-08-16 2014-03-18 International Business Machines Corporation Automatic speech and concept recognition
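CN103035243B (zh) * 2012-12-18 2014-12-24 中国科学院自动化研究所 Method and system for continuous recognition of long speech and real-time feedback of recognition results
WO2016006038A1 (ja) * 2014-07-08 2016-01-14 三菱電機株式会社 Speech recognition system and speech recognition method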
US10854192B1 (en) * 2016-03-30 2020-12-01 Amazon Technologies, Inc. Domain specific endpointing
US10607601B2 (en) * 2017-05-11 2020-03-31 International Business Machines Corporation Speech recognition by selecting and refining hot words
US12002451B1 (en) * 2021-07-01 2024-06-04 Amazon Technologies, Inc. Automatic speech recognition
US12033618B1 (en) * 2021-11-09 2024-07-09 Amazon Technologies, Inc. Relevant context determination

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5369727A (en) * 1991-05-16 1994-11-29 Matsushita Electric Industrial Co., Ltd. Method of speech recognition with correlation of similarities
US5805772A (en) * 1994-12-30 1998-09-08 Lucent Technologies Inc. Systems, methods and articles of manufacture for performing high resolution N-best string hypothesization
US5680511A (en) * 1995-06-07 1997-10-21 Dragon Systems, Inc. Systems and methods for word recognition
US5712957A (en) * 1995-09-08 1998-01-27 Carnegie Mellon University Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists
US5855000A (en) 1995-09-08 1998-12-29 Carnegie Mellon University Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input
US5822728A (en) * 1995-09-08 1998-10-13 Matsushita Electric Industrial Co., Ltd. Multistage word recognizer based on reliably detected phoneme similarity regions
JP3535292B2 (ja) * 1995-12-27 2004-06-07 Kddi株式会社 Speech recognition system
DE19639844A1 (de) * 1996-09-27 1998-04-02 Philips Patentverwaltung Method for deriving at least one sequence of words from a speech signal
US5829000A (en) * 1996-10-31 1998-10-27 Microsoft Corporation Method and system for correcting misrecognized spoken words or phrases
US5950160A (en) * 1996-10-31 1999-09-07 Microsoft Corporation Method and system for displaying a variable number of alternative words during speech recognition
US6137863A (en) * 1996-12-13 2000-10-24 At&T Corp. Statistical database correction of alphanumeric account numbers for speech recognition and touch-tone recognition
EP0961781A1 (de) * 1997-10-03 1999-12-08 Smithkline Beecham Corporation AMA polypeptides
US6243680B1 (en) 1998-06-15 2001-06-05 Nortel Networks Limited Method and apparatus for obtaining a transcription of phrases through text and spoken utterances
US6684185B1 (en) 1998-09-04 2004-01-27 Matsushita Electric Industrial Co., Ltd. Small footprint language and vocabulary independent word recognizer using registration by word spelling
KR100310339B1 (ko) * 1998-12-30 2002-01-17 윤종용 Voice recognition dialing method for a mobile phone terminal
US6766069B1 (en) * 1999-12-21 2004-07-20 Xerox Corporation Text selection from images of documents using auto-completion
US6389394B1 (en) 2000-02-09 2002-05-14 Speechworks International, Inc. Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations
WO2001084535A2 (en) * 2000-05-02 2001-11-08 Dragon Systems, Inc. Error correction in speech recognition
US7149970B1 (en) * 2000-06-23 2006-12-12 Microsoft Corporation Method and system for filtering and selecting from a candidate list generated by a stochastic input method
US6856956B2 (en) 2000-07-20 2005-02-15 Microsoft Corporation Method and apparatus for generating and displaying N-best alternatives in a speech recognition system
US6754625B2 (en) * 2000-12-26 2004-06-22 International Business Machines Corporation Augmentation of alternate word lists by acoustic confusability criterion
TW495736B (en) * 2001-02-21 2002-07-21 Ind Tech Res Inst Method for generating candidate strings in speech recognition
US6910012B2 (en) * 2001-05-16 2005-06-21 International Business Machines Corporation Method and system for speech recognition using phonetically similar word alternatives
US20020184019A1 (en) * 2001-05-31 2002-12-05 International Business Machines Corporation Method of using empirical substitution data in speech recognition
WO2004023455A2 (en) * 2002-09-06 2004-03-18 Voice Signal Technologies, Inc. Methods, systems, and programming for performing speech recognition
US7809574B2 (en) * 2001-09-05 2010-10-05 Voice Signal Technologies Inc. Word recognition using choice lists

Also Published As

Publication number Publication date
US20050108012A1 (en) 2005-05-19
EP1595245A1 (de) 2005-11-16
WO2004077404A1 (en) 2004-09-10
US7676367B2 (en) 2010-03-09
DE602004020738D1 (de) 2009-06-04
EP1595245B1 (de) 2009-04-22

Similar Documents

Publication Publication Date Title
ATE429697T1 (de) Method of producing alternate utterance hypotheses using auxiliary information on close competitors
GB2480569A (en) Methods and systems for matching records and normalizing names
WO2007029002A3 (en) Music analysis
GB2604752A (en) Generating acoustic sequences via neural networks using combined prosody info
DK1523219T3 (da) Method for subsequent training and operation of a hearing aid, and a corresponding hearing aid
CN101441527B (zh) Method and device for prompting the correct pronunciation in pinyin input
CN105373896A (zh) Project management system
RU2009119491A (ru) Method and device for coding transition frames in speech signals
ATE557093T1 (de) Method for protecting plants against pathogenic fungi
WO2009016729A1 (ja) Matching rule learning system for speech recognition, matching rule learning program for speech recognition, and matching rule learning method for speech recognition
CN105956529A (zh) Chinese sign language recognition method based on LSTM-type RNN
Yang et al. Gigaspeech 2: An evolving, large-scale and multi-domain asr corpus for low-resource languages with automated crawling, transcription and refinement
DK1600796T3 (da) Method for reconstructing a stochastic model in order to improve its fit to the production data
DE60324585D1 (de) Method, procedure and computer program for finding point correspondences in point sets
ATE458219T1 (de) Method for data processing with modular exponentiation, and associated device
Morell Confusion in Earliest America: An emerging consensus that the Americas were inhabited earlier than has been thought has undone a neat synthesis of linguistic, dental, and archeological evidence
SA123447152B1 (ar) Machine-learning-based real-time virtual gas metering
RU2008126226A (ru) Method and device for generating a recommendation for at least one content item
FI20030864A0 (fi) Synchronization arrangement
TWI264702B (en) Method for constructing acoustic model
CN113409630A (zh) Method and system for assisting English word memorization based on associated vocabulary generation
DE602004007429D1 (de) Method for cell selection
DE602004014416D1 (de) Speech recognition by contextual modeling of speech units
Zhou China's first recital and recording pianist Ding Shande: A critical examination of his performing career and performance style
Scharenborg et al. Recognising 'real-life' speech with SpeM: A speech-based computational model of human speech recognition

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties