DE69720134D1 - Speech recognizer using fundamental frequency intensity data - Google Patents

Speech recognizer using fundamental frequency intensity data

Info

Publication number
DE69720134D1
Authority
DE
Germany
Prior art keywords
fundamental frequency
intensity data
speech recognizer
frequency intensity
recognizer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69720134T
Other languages
English (en)
Other versions
DE69720134T2 (de)
Inventor
Keizaburo Takagi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp
Publication of DE69720134D1
Application granted
Publication of DE69720134T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1807 Speech classification or search using natural language modelling using prosody or stress
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Telephonic Communication Services (AREA)
DE69720134T 1996-10-28 1997-10-28 Speech recognizer using fundamental frequency intensity data Expired - Lifetime DE69720134T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP8284827A JP3006677B2 (ja) 1996-10-28 Speech recognition device

Publications (2)

Publication Number Publication Date
DE69720134D1 (de) 2003-04-30
DE69720134T2 (de) 2003-12-04

Family

ID=17683529

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69720134T Expired - Lifetime DE69720134T2 (de) 1996-10-28 1997-10-28 Speech recognizer using fundamental frequency intensity data

Country Status (4)

Country Link
US (1) US5907826A (de)
EP (1) EP0838805B1 (de)
JP (1) JP3006677B2 (de)
DE (1) DE69720134T2 (de)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6202046B1 (en) * 1997-01-23 2001-03-13 Kabushiki Kaisha Toshiba Background noise/speech classification method
US6795807B1 (en) 1999-08-17 2004-09-21 David R. Baraff Method and means for creating prosody in speech regeneration for laryngectomees
KR20010089811A (ko) * 1999-11-11 2001-10-08 요트.게.아. 롤페즈 Speech recognition system
US7043430B1 (en) * 1999-11-23 2006-05-09 Infotalk Corporation Limited System and method for speech recognition using tonal modeling
JP4054507B2 (ja) * 2000-03-31 2008-02-27 キヤノン株式会社 Speech information processing method and apparatus, and storage medium
TW521266B (en) * 2000-07-13 2003-02-21 Verbaltek Inc Perceptual phonetic feature speech recognition system and method
US7233899B2 (en) * 2001-03-12 2007-06-19 Fain Vitaliy S Speech recognition system using normalized voiced segment spectrogram analysis
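KR20030060593A (ko) * 2002-01-10 2003-07-16 주식회사 현대오토넷 Speech recognition method using pitch values
KR100533601B1 (ko) * 2002-12-05 2005-12-06 베스티안파트너스(주) Gender classification method for speaker-independent speech recognition on mobile phones
JP4447857B2 (ja) * 2003-06-20 2010-04-07 株式会社エヌ・ティ・ティ・ドコモ Voice detection device
KR100571831B1 (ko) * 2004-02-10 2006-04-17 삼성전자주식회사 Voice identification apparatus and method
JP4264841B2 (ja) * 2006-12-01 2009-05-20 ソニー株式会社 Speech recognition device, speech recognition method, and program
JP4882899B2 (ja) 2007-07-25 2012-02-22 ソニー株式会社 Speech analysis device, speech analysis method, and computer program
JP5282737B2 (ja) * 2007-08-22 2013-09-04 日本電気株式会社 Speech recognition device and speech recognition method
JP5495858B2 (ja) * 2010-03-02 2014-05-21 三菱電機株式会社 Apparatus and method for estimating the pitch of music audio signals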
US8725498B1 (en) * 2012-06-20 2014-05-13 Google Inc. Mobile speech recognition with explicit tone features
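JP6546070B2 (ja) * 2015-11-10 2019-07-17 日本電信電話株式会社 Acoustic model training device, speech recognition device, acoustic model training method, speech recognition method, and program
JP6943158B2 (ja) * 2017-11-28 2021-09-29 トヨタ自動車株式会社 Response sentence generation device, method, and program, and spoken dialogue system
CN110648686B (zh) * 2018-06-27 2023-06-23 达发科技股份有限公司 Method for adjusting voice frequency and sound playback device thereof
CN109036408A (zh) * 2018-08-23 2018-12-18 重庆加河科技有限公司 Speech recognition control device and control method for VR display teaching
CN109448749B (zh) * 2018-12-19 2022-02-15 中国科学院自动化研究所 Speech extraction method, system, and device based on supervised-learning auditory attention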

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4667340A (en) * 1983-04-13 1987-05-19 Texas Instruments Incorporated Voice messaging system with pitch-congruent baseband coding
KR950013552B1 (ko) * 1990-05-28 1995-11-08 마쯔시다덴기산교 가부시기가이샤 Speech signal processing device
US5657418A (en) * 1991-09-05 1997-08-12 Motorola, Inc. Provision of speech coder gain information using multiple coding modes
FI92535C (fi) * 1992-02-14 1994-11-25 Nokia Mobile Phones Ltd Noise attenuation system for speech signals
JP3450411B2 (ja) * 1994-03-22 2003-09-22 キヤノン株式会社 Speech information processing method and apparatus
JPH0876789A (ja) * 1994-09-02 1996-03-22 Toshiba Corp Speaker-independent word speech recognition system and speaker-independent word speech recognition method
JP3591068B2 (ja) * 1995-06-30 2004-11-17 ソニー株式会社 Noise reduction method for speech signals

Also Published As

Publication number Publication date
EP0838805A3 (de) 1998-12-23
JPH10133693A (ja) 1998-05-22
DE69720134T2 (de) 2003-12-04
US5907826A (en) 1999-05-25
JP3006677B2 (ja) 2000-02-07
EP0838805A2 (de) 1998-04-29
EP0838805B1 (de) 2003-03-26

Similar Documents

Publication Publication Date Title
DE69720134T2 (de) Speech recognizer using fundamental frequency intensity data
DE69719270T2 (de) Speech synthesis using auxiliary information
DE69814589D1 (de) Speech recognition using multiple speech recognizers
DE69719576T2 (de) Fluoro-deoxy-glucose synthesizer using columns
DE69630355D1 (de) Dynamic device matching using driver candidate lists
DE69613907D1 (de) Modified fundamental frequency delay upon loss of data frames
NO974097D0 (no) Speech recognition
DK0789901T3 (da) Speech recognition
DE69718553T2 (de) Face recognition using DCT-based feature vectors
DE59602336D1 (de) Optical frequency generator
DE19882098T1 (de) Adaptive frequency reuse plan
DE69720822D1 (de) Use of voice activity detection for efficient speech coding
DK0749109T3 (da) Speech recognition for tonal languages
DE69421596T2 (de) Speech recognition using biosignals
FI973873A (fi) Speech coding
DE69628195D1 (de) Use of xyloglucan endotransglycosylase
DE69808936T2 (de) Increasing the density of coded speech signals
DE69708365D1 (de) Character recognition method
DE69732435D1 (de) Use of 1-nonen-3-one as a flavoring agent
NO20000574D0 (no) Step-controlled frequency synthesizer
DE69707617D1 (de) Optical frequency stabilizer
DE69703177T2 (de) Optical parametric oscillator
DE69421595D1 (de) Speech recognition using biosignals
DE59510451D1 (de) Echo canceller using short-time spectral analysis
DE69924769D1 (de) Speech pattern recognition using average covariance matrices