DE602006001764D1 - Verfahren zur Spracherkennung - Google Patents
Verfahren zur SpracherkennungInfo
- Publication number
- DE602006001764D1 DE602006001764D1 DE602006001764T DE602006001764T DE602006001764D1 DE 602006001764 D1 DE602006001764 D1 DE 602006001764D1 DE 602006001764 T DE602006001764 T DE 602006001764T DE 602006001764 T DE602006001764 T DE 602006001764T DE 602006001764 D1 DE602006001764 D1 DE 602006001764D1
- Authority
- DE
- Germany
- Prior art keywords
- speech
- speech recognition
- recognized
- user
- importation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Electrically Operated Instructional Devices (AREA)
- Machine Translation (AREA)
- User Interface Of Digital Computer (AREA)
- Electric Clocks (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2005065355A JP4667082B2 (ja) | 2005-03-09 | 2005-03-09 | 音声認識方法 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| DE602006001764D1 true DE602006001764D1 (de) | 2008-08-28 |
Family
ID=36250777
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| DE602006001764T Active DE602006001764D1 (de) | 2005-03-09 | 2006-02-17 | Verfahren zur Spracherkennung |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US7634401B2 (enExample) |
| EP (1) | EP1701338B1 (enExample) |
| JP (1) | JP4667082B2 (enExample) |
| KR (1) | KR100742888B1 (enExample) |
| CN (1) | CN100587806C (enExample) |
| AT (1) | ATE401644T1 (enExample) |
| DE (1) | DE602006001764D1 (enExample) |
| ES (1) | ES2310893T3 (enExample) |
Families Citing this family (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4282704B2 (ja) * | 2006-09-27 | 2009-06-24 | 株式会社東芝 | 音声区間検出装置およびプログラム |
| JP4950930B2 (ja) * | 2008-04-03 | 2012-06-13 | 株式会社東芝 | 音声/非音声を判定する装置、方法およびプログラム |
| KR20130133629A (ko) | 2012-05-29 | 2013-12-09 | 삼성전자주식회사 | 전자장치에서 음성명령을 실행시키기 위한 장치 및 방법 |
| US8577671B1 (en) | 2012-07-20 | 2013-11-05 | Veveo, Inc. | Method of and system for using conversation state information in a conversational interaction system |
| US9465833B2 (en) | 2012-07-31 | 2016-10-11 | Veveo, Inc. | Disambiguating user intent in conversational interaction system for large corpus information retrieval |
| US9799328B2 (en) * | 2012-08-03 | 2017-10-24 | Veveo, Inc. | Method for using pauses detected in speech input to assist in interpreting the input during conversational interaction for information retrieval |
| CN103971685B (zh) * | 2013-01-30 | 2015-06-10 | 腾讯科技(深圳)有限公司 | 语音命令识别方法和系统 |
| PT2994908T (pt) * | 2013-05-07 | 2019-10-18 | Veveo Inc | Interface de entrada incremental de discurso com retorno em tempo real |
| WO2014183035A1 (en) | 2013-05-10 | 2014-11-13 | Veveo, Inc. | Method and system for capturing and exploiting user intent in a conversational interaction based information retrieval system |
| US20160063990A1 (en) * | 2014-08-26 | 2016-03-03 | Honeywell International Inc. | Methods and apparatus for interpreting clipped speech using speech recognition |
| US9852136B2 (en) | 2014-12-23 | 2017-12-26 | Rovi Guides, Inc. | Systems and methods for determining whether a negation statement applies to a current or past query |
| US9854049B2 (en) | 2015-01-30 | 2017-12-26 | Rovi Guides, Inc. | Systems and methods for resolving ambiguous terms in social chatter based on a user profile |
| JP6804909B2 (ja) * | 2016-09-15 | 2020-12-23 | 東芝テック株式会社 | 音声認識装置、音声認識方法及び音声認識プログラム |
| JP6972287B2 (ja) * | 2016-09-15 | 2021-11-24 | 東芝テック株式会社 | 音声認識装置、音声認識方法及び音声認識プログラム |
| US10283117B2 (en) * | 2017-06-19 | 2019-05-07 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for identification of response cue at peripheral device |
| US10586529B2 (en) | 2017-09-14 | 2020-03-10 | International Business Machines Corporation | Processing of speech signal |
| JP7092708B2 (ja) * | 2019-05-20 | 2022-06-28 | ヤフー株式会社 | 情報処理プログラム、情報処理装置及び情報処理方法 |
| JP7404664B2 (ja) * | 2019-06-07 | 2023-12-26 | ヤマハ株式会社 | 音声処理装置及び音声処理方法 |
| US12118984B2 (en) | 2020-11-11 | 2024-10-15 | Rovi Guides, Inc. | Systems and methods to resolve conflicts in conversations |
| KR102826343B1 (ko) | 2021-03-19 | 2025-06-30 | 삼성전자주식회사 | 개인화 tts 모듈을 포함하는 전자 장치 및 이의 제어 방법 |
| US11545143B2 (en) | 2021-05-18 | 2023-01-03 | Boris Fridman-Mintz | Recognition or synthesis of human-uttered harmonic sounds |
| US12394405B2 (en) * | 2023-03-24 | 2025-08-19 | Verizon Patent And Licensing Inc. | Systems and methods for reconstructing video data using contextually-aware multi-modal generation during signal loss |
Family Cites Families (35)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4761815A (en) * | 1981-05-01 | 1988-08-02 | Figgie International, Inc. | Speech recognition system based on word state duration and/or weight |
| US4712242A (en) * | 1983-04-13 | 1987-12-08 | Texas Instruments Incorporated | Speaker-independent word recognizer |
| US5774851A (en) * | 1985-08-15 | 1998-06-30 | Canon Kabushiki Kaisha | Speech recognition apparatus utilizing utterance length information |
| US4882757A (en) * | 1986-04-25 | 1989-11-21 | Texas Instruments Incorporated | Speech recognition system |
| JP2882791B2 (ja) * | 1986-10-03 | 1999-04-12 | 株式会社リコー | パターン比較方式 |
| JP2829014B2 (ja) | 1989-01-12 | 1998-11-25 | 株式会社東芝 | 音声認識装置及び方法 |
| JP2708566B2 (ja) * | 1989-09-06 | 1998-02-04 | 株式会社日立製作所 | 音声認識制御装置 |
| DE4031421C2 (de) * | 1989-10-05 | 1995-08-24 | Ricoh Kk | Musteranpassungssystem für eine Spracherkennungseinrichtung |
| JP3004749B2 (ja) * | 1990-05-14 | 2000-01-31 | 株式会社リコー | 標準パターン登録方法 |
| EP0474496B1 (en) * | 1990-09-07 | 1998-03-04 | Kabushiki Kaisha Toshiba | Speech recognition apparatus |
| US5692104A (en) * | 1992-12-31 | 1997-11-25 | Apple Computer, Inc. | Method and apparatus for detecting end points of speech activity |
| DE4306508A1 (de) * | 1993-03-03 | 1994-09-08 | Philips Patentverwaltung | Verfahren und Anordnung zum Ermitteln von Wörtern in einem Sprachsignal |
| US5765130A (en) * | 1996-05-21 | 1998-06-09 | Applied Language Technologies, Inc. | Method and apparatus for facilitating speech barge-in in connection with voice recognition systems |
| US5835890A (en) * | 1996-08-02 | 1998-11-10 | Nippon Telegraph And Telephone Corporation | Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon |
| JP3588929B2 (ja) | 1996-08-27 | 2004-11-17 | 日産自動車株式会社 | 音声認識装置 |
| US6167374A (en) * | 1997-02-13 | 2000-12-26 | Siemens Information And Communication Networks, Inc. | Signal processing method and system utilizing logical speech boundaries |
| EP0867856B1 (fr) | 1997-03-25 | 2005-10-26 | Koninklijke Philips Electronics N.V. | "Méthode et dispositif de detection d'activité vocale" |
| JPH10319991A (ja) * | 1997-05-20 | 1998-12-04 | Sony Corp | 電子機器の音声認識起動方法及び装置 |
| EP1083545A3 (en) * | 1999-09-09 | 2001-09-26 | Xanavi Informatics Corporation | Voice recognition of proper names in a navigation apparatus |
| JP4520555B2 (ja) * | 1999-09-09 | 2010-08-04 | クラリオン株式会社 | 音声認識装置および音声認識ナビゲーション装置 |
| US6389394B1 (en) * | 2000-02-09 | 2002-05-14 | Speechworks International, Inc. | Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations |
| JP4880136B2 (ja) | 2000-07-10 | 2012-02-22 | パナソニック株式会社 | 音声認識装置および音声認識方法 |
| US7277853B1 (en) * | 2001-03-02 | 2007-10-02 | Mindspeed Technologies, Inc. | System and method for a endpoint detection of speech for improved speech recognition in noisy environments |
| US7308404B2 (en) * | 2001-09-28 | 2007-12-11 | Sri International | Method and apparatus for speech recognition using a dynamic vocabulary |
| JP2003330491A (ja) * | 2002-05-10 | 2003-11-19 | Nec Corp | 音声認識装置および音声認識方法ならびにプログラム |
| KR100474253B1 (ko) * | 2002-12-12 | 2005-03-10 | 한국전자통신연구원 | 단어의 첫 자음 발성을 이용한 음성인식 방법 및 이를 저장한 기록 매체 |
| US7024360B2 (en) * | 2003-03-17 | 2006-04-04 | Rensselaer Polytechnic Institute | System for reconstruction of symbols in a sequence |
| US7343289B2 (en) * | 2003-06-25 | 2008-03-11 | Microsoft Corp. | System and method for audio/video speaker detection |
| CA2473195C (en) * | 2003-07-29 | 2014-02-04 | Microsoft Corporation | Head mounted multi-sensory audio input system |
| US20050033571A1 (en) * | 2003-08-07 | 2005-02-10 | Microsoft Corporation | Head mounted multi-sensory audio input system |
| KR100577387B1 (ko) | 2003-08-06 | 2006-05-10 | 삼성전자주식회사 | 음성 대화 시스템에서의 음성 인식 오류 처리 방법 및 장치 |
| JP3890326B2 (ja) * | 2003-11-07 | 2007-03-07 | キヤノン株式会社 | 情報処理装置、情報処理方法ならびに記録媒体、プログラム |
| JP4516863B2 (ja) * | 2005-03-11 | 2010-08-04 | 株式会社ケンウッド | 音声合成装置、音声合成方法及びプログラム |
| TWI319152B (en) * | 2005-10-04 | 2010-01-01 | Ind Tech Res Inst | Pre-stage detecting system and method for speech recognition |
| JP4282704B2 (ja) * | 2006-09-27 | 2009-06-24 | 株式会社東芝 | 音声区間検出装置およびプログラム |
-
2005
- 2005-03-09 JP JP2005065355A patent/JP4667082B2/ja not_active Expired - Fee Related
-
2006
- 2006-02-17 DE DE602006001764T patent/DE602006001764D1/de active Active
- 2006-02-17 AT AT06250864T patent/ATE401644T1/de not_active IP Right Cessation
- 2006-02-17 ES ES06250864T patent/ES2310893T3/es active Active
- 2006-02-17 EP EP06250864A patent/EP1701338B1/en not_active Not-in-force
- 2006-03-06 US US11/368,986 patent/US7634401B2/en not_active Expired - Fee Related
- 2006-03-08 KR KR1020060021863A patent/KR100742888B1/ko not_active Expired - Fee Related
- 2006-03-09 CN CN200610057222A patent/CN100587806C/zh not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| EP1701338B1 (en) | 2008-07-16 |
| CN100587806C (zh) | 2010-02-03 |
| US7634401B2 (en) | 2009-12-15 |
| EP1701338A1 (en) | 2006-09-13 |
| ATE401644T1 (de) | 2008-08-15 |
| KR100742888B1 (ko) | 2007-07-25 |
| JP2006251147A (ja) | 2006-09-21 |
| US20060206326A1 (en) | 2006-09-14 |
| KR20060097647A (ko) | 2006-09-14 |
| CN1831939A (zh) | 2006-09-13 |
| JP4667082B2 (ja) | 2011-04-06 |
| ES2310893T3 (es) | 2009-01-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE602006001764D1 (de) | Verfahren zur Spracherkennung | |
| GB2443753A (en) | Spoken language proficiency assessment by computer | |
| TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
| DE602005001125D1 (de) | Erlernen der Aussprache neuer Worte unter Verwendung eines Aussprachegraphen | |
| EP0865032A3 (en) | Speech recognizing performing noise adaptation | |
| WO2011133766A3 (en) | Methods and systems for training dictation-based speech-to-text systems using recorded samples | |
| EP1533694A3 (en) | System and method for providing context to an input method | |
| TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
| ATE398325T1 (de) | Synchrones verstehen von semantischen objekten, implementiert unter verwendung von sprachanwendungsmarkierungen | |
| DE60309142D1 (de) | Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells | |
| WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
| WO2008042511A3 (en) | Personalizing a voice dialogue system | |
| ATE405919T1 (de) | Spracherkennungssystem und verfahren auf phonetischer basis | |
| EP1752911A3 (en) | Information processing method and information processing device | |
| ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
| ATE391985T1 (de) | Verfahren und vorrichtung zur modellierung eines spracherkennungssystems und zur schätzung einer wort-fehlerrate basierend auf einem text | |
| WO2004049305A3 (en) | Discriminative training of hidden markov models for continuous speech recognition | |
| ATE342566T1 (de) | Verfahren zur spracheingabe eines zielortes mit hilfe eines definierten eingabedialogs in ein zielführungssystem | |
| ATE487212T1 (de) | Verstekte bedingte zufallfeldermodelle für phonetische klassifizierung und spracherkennung | |
| WO2008005711A3 (en) | Non-enrolled continuous dictation | |
| ATE394773T1 (de) | Verfahren zur spracherkennung mit zeitabhängiger interpolation und verborgenen dynamischen wertklassen | |
| ATE433180T1 (de) | Vorrichtung und verfahren zur spracherkennung | |
| DE602008005641D1 (de) | Verfahren, vorrichtung und programmcode zur umwandlung von stimmen | |
| ATE339756T1 (de) | Verfahren und vorrichtung zur bestimmung von formanten unter benutzung eines restsignalmodells | |
| ATE454676T1 (de) | Vorrichtung und verfahren zur handschrifterkennung |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 8364 | No opposition during term of opposition |