DE69720134T2 - Speech recognizer using fundamental frequency intensity data - Google Patents
Speech recognizer using fundamental frequency intensity data
- Publication number
- DE69720134T2
- Authority
- DE
- Germany
- Prior art keywords
- fundamental frequency
- intensity data
- speech recognizer
- frequency intensity
- recognizer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1807—Speech classification or search using natural language modelling using prosody or stress
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP8284827A JP3006677B2 (ja) | 1996-10-28 | 1996-10-28 | Speech recognition device |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69720134D1 (de) | 2003-04-30 |
DE69720134T2 (de) | 2003-12-04 |
Family
ID=17683529
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE69720134T Expired - Lifetime DE69720134T2 (de) | 1996-10-28 | 1997-10-28 | Speech recognizer using fundamental frequency intensity data |
Country Status (4)
Country | Link |
---|---|
US (1) | US5907826A (de) |
EP (1) | EP0838805B1 (de) |
JP (1) | JP3006677B2 (de) |
DE (1) | DE69720134T2 (de) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6202046B1 (en) * | 1997-01-23 | 2001-03-13 | Kabushiki Kaisha Toshiba | Background noise/speech classification method |
US6795807B1 (en) | 1999-08-17 | 2004-09-21 | David R. Baraff | Method and means for creating prosody in speech regeneration for laryngectomees |
KR20010089811A (ko) * | 1999-11-11 | 2001-10-08 | 요트.게.아. 롤페즈 | Speech recognition system |
US7043430B1 (en) * | 1999-11-23 | 2006-05-09 | Infotalk Corporation Limitied | System and method for speech recognition using tonal modeling |
JP4054507B2 (ja) * | 2000-03-31 | 2008-02-27 | キヤノン株式会社 | Speech information processing method and apparatus, and storage medium |
TW521266B (en) * | 2000-07-13 | 2003-02-21 | Verbaltek Inc | Perceptual phonetic feature speech recognition system and method |
US7233899B2 (en) * | 2001-03-12 | 2007-06-19 | Fain Vitaliy S | Speech recognition system using normalized voiced segment spectrogram analysis |
KR20030060593A (ko) * | 2002-01-10 | 2003-07-16 | 주식회사 현대오토넷 | Speech recognition method using pitch values |
KR100533601B1 (ko) * | 2002-12-05 | 2005-12-06 | 베스티안파트너스(주) | Gender discrimination method for speaker-independent speech recognition on mobile phones |
JP4447857B2 (ja) * | 2003-06-20 | 2010-04-07 | 株式会社エヌ・ティ・ティ・ドコモ | Speech detection device |
KR100571831B1 (ko) * | 2004-02-10 | 2006-04-17 | 삼성전자주식회사 | Speech identification apparatus and method |
JP4264841B2 (ja) * | 2006-12-01 | 2009-05-20 | ソニー株式会社 | Speech recognition device, speech recognition method, and program |
JP4882899B2 (ja) | 2007-07-25 | 2012-02-22 | ソニー株式会社 | Speech analysis device, speech analysis method, and computer program |
JP5282737B2 (ja) * | 2007-08-22 | 2013-09-04 | 日本電気株式会社 | Speech recognition device and speech recognition method |
JP5495858B2 (ja) * | 2010-03-02 | 2014-05-21 | 三菱電機株式会社 | Pitch estimation device and method for music acoustic signals |
US8725498B1 (en) * | 2012-06-20 | 2014-05-13 | Google Inc. | Mobile speech recognition with explicit tone features |
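JP6546070B2 (ja) * | 2015-11-10 | 2019-07-17 | 日本電信電話株式会社 | Acoustic model training device, speech recognition device, acoustic model training method, speech recognition method, and program |
JP6943158B2 (ja) * | 2017-11-28 | 2021-09-29 | トヨタ自動車株式会社 | Response sentence generation device, method, and program, and spoken dialogue system |
CN110648686B (zh) * | 2018-06-27 | 2023-06-23 | 达发科技股份有限公司 | Method for adjusting voice frequency and sound playback device thereof |
CN109036408A (zh) * | 2018-08-23 | 2018-12-18 | 重庆加河科技有限公司 | Speech recognition control device and control method for VR demonstration teaching |
CN109448749B (zh) * | 2018-12-19 | 2022-02-15 | 中国科学院自动化研究所 | Speech extraction method, system, and device based on supervised-learning auditory attention |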
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4667340A (en) * | 1983-04-13 | 1987-05-19 | Texas Instruments Incorporated | Voice messaging system with pitch-congruent baseband coding |
KR950013552B1 (ko) * | 1990-05-28 | 1995-11-08 | 마쯔시다덴기산교 가부시기가이샤 | Speech signal processing apparatus |
US5657418A (en) * | 1991-09-05 | 1997-08-12 | Motorola, Inc. | Provision of speech coder gain information using multiple coding modes |
FI92535C (fi) * | 1992-02-14 | 1994-11-25 | Nokia Mobile Phones Ltd | Noise suppression system for speech signals |
JP3450411B2 (ja) * | 1994-03-22 | 2003-09-22 | キヤノン株式会社 | Speech information processing method and apparatus |
JPH0876789A (ja) * | 1994-09-02 | 1996-03-22 | Toshiba Corp | Speaker-independent word speech recognition system and speaker-independent word speech recognition method |
JP3591068B2 (ja) * | 1995-06-30 | 2004-11-17 | ソニー株式会社 | Noise reduction method for speech signals |
1996
- 1996-10-28: JP application JP8284827A, patent JP3006677B2 (ja), not active (Expired - Fee Related)

1997
- 1997-10-28: US application US08/959,464, patent US5907826A (en), not active (Expired - Fee Related)
- 1997-10-28: EP application EP97118746A, patent EP0838805B1 (de), not active (Expired - Lifetime)
- 1997-10-28: DE application DE69720134T, patent DE69720134T2 (de), not active (Expired - Lifetime)
Also Published As
Publication number | Publication date |
---|---|
EP0838805A3 (de) | 1998-12-23 |
JPH10133693A (ja) | 1998-05-22 |
DE69720134D1 (de) | 2003-04-30 |
US5907826A (en) | 1999-05-25 |
JP3006677B2 (ja) | 2000-02-07 |
EP0838805A2 (de) | 1998-04-29 |
EP0838805B1 (de) | 2003-03-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69720134T2 (de) | | Speech recognizer using fundamental frequency intensity data |
DE69719270D1 (de) | | Speech synthesis using auxiliary information |
DE69814589D1 (de) | | Speech recognition using multiple speech recognizers |
DE69719576D1 (de) | | Fluorodeoxyglucose synthesizer using columns |
DE69630355D1 (de) | | Dynamic device matching using driver candidate lists |
DE69613907D1 (de) | | Modified fundamental frequency delay on loss of data frames |
NO974097D0 (no) | | Speech recognition |
DK0789901T3 (da) | | Speech recognition |
DE69718553D1 (de) | | Face recognition using DCT-based feature vectors |
DE59602336D1 (de) | | Optical frequency generator |
DE19882098T1 (de) | | Adaptive frequency reuse plan |
DE69720822D1 (de) | | Use of voice activity detection for efficient speech coding |
DK0749109T3 (da) | | Speech recognition for tonal languages |
DE69421596D1 (de) | | Speech recognition using biosignals |
FI973873A (fi) | | Speech coding |
DE69628195D1 (de) | | Use of xyloglucan endotransglycosylase |
DE69808936T2 (de) | | Increasing the density of coded speech signals |
DE69708365D1 (de) | | Character recognition method |
DE69732435D1 (de) | | Use of 1-nonen-3-one as a flavoring agent |
NO20000574D0 (no) | | Step-controlled frequency synthesizer |
DE69707617D1 (de) | | Optical frequency stabilizer |
DE69703177D1 (de) | | Optical parametric oscillator |
DE69421595T2 (de) | | Speech recognition using biosignals |
DE59510451D1 (de) | | Echo canceller using short-time spectral analysis |
DE69924769D1 (de) | | Speech pattern recognition using average covariance matrices |