DE69432943T2 - Verfahren und Vorrichtung zur Sprachdetektion - Google Patents

Method and apparatus for speech detection (Verfahren und Vorrichtung zur Sprachdetektion)

Info

Publication number
DE69432943T2
DE69432943T2 DE69432943T DE69432943T DE69432943T2 DE 69432943 T2 DE69432943 T2 DE 69432943T2 DE 69432943 T DE69432943 T DE 69432943T DE 69432943 T DE69432943 T DE 69432943T DE 69432943 T2 DE69432943 T2 DE 69432943T2
Authority
DE
Germany
Prior art keywords
speech detection
speech
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE69432943T
Other languages
English (en)
Other versions
DE69432943D1 (de)
Inventor
Yoshihisa Nakatoh
Takeshi Norimatsu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1993-05-19
Filing date
1994-05-19
Publication date
2003-12-24
Application filed by Matsushita Electric Industrial Co Ltd
Application granted
Publication of DE69432943D1
Publication of DE69432943T2
Anticipated expiration
Expired - Fee Related (current legal status)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L2025/783 Detection of presence or absence of voice signals based on threshold decision
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)
DE69432943T 1993-05-19 1994-05-19 Verfahren und Vorrichtung zur Sprachdetektion Expired - Fee Related DE69432943T2 (de)

Applications Claiming Priority (1)

Application Number Publication Priority Date Filing Date Title
JP5116980A JPH06332492A (ja) 1993-05-19 1993-05-19 Speech detection method and detection apparatus (音声検出方法および検出装置)

Publications (2)

Publication Number Publication Date
DE69432943D1 DE69432943D1 (de) 2003-08-14
DE69432943T2 DE69432943T2 (de) 2003-12-24

Family

ID=14700517

Family Applications (3)

Application Number Legal Status Publication Priority Date Filing Date Title
DE69432943T Expired - Fee Related DE69432943T2 (de) 1993-05-19 1994-05-19 Verfahren und Vorrichtung zur Sprachdetektion
DE69430082T Expired - Fee Related DE69430082T2 (de) 1993-05-19 1994-05-19 Verfahren und Vorrichtung zur Sprachdetektion
DE69433254T Expired - Fee Related DE69433254T2 (de) 1993-05-19 1994-05-19 Verfahren und Vorrichtung zur Sprachdetektion

Country Status (4)

Country Link
US (1) US5611019A (de)
EP (3) EP1083541B1 (de)
JP (1) JPH06332492A (de)
DE (3) DE69432943T2 (de)

Families Citing this family (76)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU707896B2 (en) * 1995-02-15 1999-07-22 British Telecommunications Public Limited Company Voice activity detection
DE19508711A1 (de) * 1995-03-10 1996-09-12 Siemens Ag Verfahren zur Erkennung einer Signalpause zwischen zwei Mustern, welche in einem zeitvarianten Meßsignal vorhanden sind
AU720511B2 (en) * 1995-08-24 2000-06-01 British Telecommunications Public Limited Company Pattern recognition
JP3536471B2 (ja) * 1995-09-26 2004-06-07 ソニー株式会社 識別装置および識別方法、並びに音声認識装置および音声認識方法
US5768263A (en) * 1995-10-20 1998-06-16 Vtel Corporation Method for talk/listen determination and multipoint conferencing system using such method
US5774849A (en) * 1996-01-22 1998-06-30 Rockwell International Corporation Method and apparatus for generating frame voicing decisions of an incoming speech signal
US5778082A (en) * 1996-06-14 1998-07-07 Picturetel Corporation Method and apparatus for localization of an acoustic source
US6708146B1 (en) 1997-01-03 2004-03-16 Telecommunications Research Laboratories Voiceband signal classifier
JP3255584B2 (ja) * 1997-01-20 2002-02-12 ロジック株式会社 有音検知装置および方法
US6076055A (en) * 1997-05-27 2000-06-13 Ameritech Speaker verification method
US7630895B2 (en) * 2000-01-21 2009-12-08 At&T Intellectual Property I, L.P. Speaker verification method
DE59812167D1 (de) * 1997-09-12 2004-12-02 Siemens Ag Verfahren zur Zurückweisung unbekannter Wörter bei der Spracherkennung von Einzelworten
US6055499A (en) * 1998-05-01 2000-04-25 Lucent Technologies Inc. Use of periodicity and jitter for automatic speech recognition
US6226606B1 (en) * 1998-11-24 2001-05-01 Microsoft Corporation Method and apparatus for pitch tracking
US6556967B1 (en) * 1999-03-12 2003-04-29 The United States Of America As Represented By The National Security Agency Voice activity detector
JP4438127B2 (ja) * 1999-06-18 2010-03-24 ソニー株式会社 音声符号化装置及び方法、音声復号装置及び方法、並びに記録媒体
FI116992B (fi) * 1999-07-05 2006-04-28 Nokia Corp Menetelmät, järjestelmä ja laitteet audiosignaalin koodauksen ja siirron tehostamiseksi
US7072833B2 (en) * 2000-06-02 2006-07-04 Canon Kabushiki Kaisha Speech processing system
US6954745B2 (en) * 2000-06-02 2005-10-11 Canon Kabushiki Kaisha Signal processing system
US7035790B2 (en) * 2000-06-02 2006-04-25 Canon Kabushiki Kaisha Speech processing system
US20020026253A1 (en) * 2000-06-02 2002-02-28 Rajan Jebu Jacob Speech processing apparatus
US7010483B2 (en) * 2000-06-02 2006-03-07 Canon Kabushiki Kaisha Speech processing system
JP4201471B2 (ja) * 2000-09-12 2008-12-24 パイオニア株式会社 音声認識システム
JP4201470B2 (ja) * 2000-09-12 2008-12-24 パイオニア株式会社 音声認識システム
US20020147585A1 (en) * 2001-04-06 2002-10-10 Poulsen Steven P. Voice activity detection
JP3812887B2 (ja) * 2001-12-21 2006-08-23 富士通株式会社 信号処理システムおよび方法
US20030216909A1 (en) * 2002-05-14 2003-11-20 Davis Wallace K. Voice activity detection
KR100440973B1 (ko) * 2002-08-01 2004-07-21 삼성전자주식회사 신호간 상관계수 결정 장치 및 방법과 이를 이용한 신호피치 결정 장치 및 방법
US8793127B2 (en) * 2002-10-31 2014-07-29 Promptu Systems Corporation Method and apparatus for automatically determining speaker characteristics for speech-directed advertising or other enhancement of speech-controlled devices or services
JP4348970B2 (ja) 2003-03-06 2009-10-21 ソニー株式会社 情報検出装置及び方法、並びにプログラム
US20050015244A1 (en) * 2003-07-14 2005-01-20 Hideki Kitao Speech section detection apparatus
EP1661124A4 (de) * 2003-09-05 2008-08-13 Stephen D Grody Verfahren und vorrichtungen zur bereitstellung von diensten durch verwendung von spracherkennung
WO2005034395A2 (en) * 2003-09-17 2005-04-14 Nielsen Media Research, Inc. Methods and apparatus to operate an audience metering device with voice commands
KR100571831B1 (ko) * 2004-02-10 2006-04-17 삼성전자주식회사 음성 식별 장치 및 방법
CN100592386C (zh) * 2004-07-01 2010-02-24 日本电信电话株式会社 特定音响信号含有区间检测系统及其方法
DE102004049347A1 (de) * 2004-10-08 2006-04-20 Micronas Gmbh Schaltungsanordnung bzw. Verfahren für Sprache enthaltende Audiosignale
CN100399419C (zh) * 2004-12-07 2008-07-02 腾讯科技(深圳)有限公司 一种检测静音帧的方法
KR100682909B1 (ko) * 2004-12-23 2007-02-15 삼성전자주식회사 음성 인식 방법 및 장치
FR2864319A1 (fr) * 2005-01-19 2005-06-24 France Telecom Procede et dispositif de detection de parole dans un signal audio
US8175877B2 (en) * 2005-02-02 2012-05-08 At&T Intellectual Property Ii, L.P. Method and apparatus for predicting word accuracy in automatic speech recognition systems
KR100714721B1 (ko) * 2005-02-04 2007-05-04 삼성전자주식회사 음성 구간 검출 방법 및 장치
US20060241937A1 (en) * 2005-04-21 2006-10-26 Ma Changxue C Method and apparatus for automatically discriminating information bearing audio segments and background noise audio segments
US20070033042A1 (en) * 2005-08-03 2007-02-08 International Business Machines Corporation Speech detection fusing multi-class acoustic-phonetic, and energy features
US7962340B2 (en) * 2005-08-22 2011-06-14 Nuance Communications, Inc. Methods and apparatus for buffering data for use in accordance with a speech recognition system
JP2007114413A (ja) * 2005-10-19 2007-05-10 Toshiba Corp 音声非音声判別装置、音声区間検出装置、音声非音声判別方法、音声区間検出方法、音声非音声判別プログラムおよび音声区間検出プログラム
US8175868B2 (en) * 2005-10-20 2012-05-08 Nec Corporation Voice judging system, voice judging method and program for voice judgment
US9015740B2 (en) 2005-12-12 2015-04-21 The Nielsen Company (Us), Llc Systems and methods to wirelessly meter audio/visual devices
CA2633577C (en) * 2005-12-12 2016-04-05 Nielsen Media Research, Inc. Systems and methods to wirelessly meter audio/visual devices
US8521537B2 (en) * 2006-04-03 2013-08-27 Promptu Systems Corporation Detection and use of acoustic signal quality indicators
US8364492B2 (en) * 2006-07-13 2013-01-29 Nec Corporation Apparatus, method and program for giving warning in connection with inputting of unvoiced speech
US20080033583A1 (en) * 2006-08-03 2008-02-07 Broadcom Corporation Robust Speech/Music Classification for Audio Signals
US8015000B2 (en) * 2006-08-03 2011-09-06 Broadcom Corporation Classification-based frame loss concealment for audio signals
KR100774800B1 (ko) * 2006-09-06 2007-11-07 한국정보통신대학교 산학협력단 포아송 폴링 기법을 이용한 세그먼트 단위의 음성/비음성분류 방법 및 장치
JP4282704B2 (ja) * 2006-09-27 2009-06-24 株式会社東芝 音声区間検出装置およびプログラム
CN101165779B (zh) * 2006-10-20 2010-06-02 索尼株式会社 信息处理装置和方法、程序及记录介质
JP4239109B2 (ja) 2006-10-20 2009-03-18 ソニー株式会社 情報処理装置および方法、プログラム、並びに記録媒体
KR100964402B1 (ko) * 2006-12-14 2010-06-17 삼성전자주식회사 오디오 신호의 부호화 모드 결정 방법 및 장치와 이를 이용한 오디오 신호의 부호화/복호화 방법 및 장치
JP4950930B2 (ja) * 2008-04-03 2012-06-13 株式会社東芝 音声/非音声を判定する装置、方法およびプログラム
KR20100006492A (ko) 2008-07-09 2010-01-19 삼성전자주식회사 부호화 방식 결정 방법 및 장치
US9124769B2 (en) 2008-10-31 2015-09-01 The Nielsen Company (Us), Llc Methods and apparatus to verify presentation of media content
CN102667927B (zh) 2009-10-19 2013-05-08 瑞典爱立信有限公司 语音活动检测的方法和背景估计器
US20140207456A1 (en) * 2010-09-23 2014-07-24 Waveform Communications, Llc Waveform analysis of speech
CN102629470B (zh) * 2011-02-02 2015-05-20 Jvc建伍株式会社 辅音区间检测装置及辅音区间检测方法
JP6047922B2 (ja) * 2011-06-01 2016-12-21 ヤマハ株式会社 音声合成装置および音声合成方法
US20140329511A1 (en) * 2011-12-20 2014-11-06 Nokia Corporation Audio conferencing
US8892046B2 (en) * 2012-03-29 2014-11-18 Bose Corporation Automobile communication system
CN104409080B (zh) * 2014-12-15 2018-09-18 北京国双科技有限公司 语音端点检测方法和装置
CN105118520B (zh) * 2015-07-13 2017-11-10 腾讯科技(深圳)有限公司 一种音频开头爆音的消除方法及装置
EP3301950B1 (de) * 2016-04-29 2020-11-04 Huawei Technologies Co., Ltd. Verfahren und vorrichtung zur bestimmung von spracheingabeanomalien, endgerät und speichermedium
US10235993B1 (en) * 2016-06-14 2019-03-19 Friday Harbor Llc Classifying signals using correlations of segments
GB201617016D0 (en) 2016-09-09 2016-11-23 Continental automotive systems inc Robust noise estimation for speech enhancement in variable noise conditions
US9978392B2 (en) * 2016-09-09 2018-05-22 Tata Consultancy Services Limited Noisy signal identification from non-stationary audio signals
CN112397093B (zh) * 2020-12-04 2024-02-27 中国联合网络通信集团有限公司 一种语音检测方法与装置
US20220180206A1 (en) * 2020-12-09 2022-06-09 International Business Machines Corporation Knowledge distillation using deep clustering
CN113345472B (zh) * 2021-05-08 2022-03-25 北京百度网讯科技有限公司 语音端点检测方法、装置、电子设备及存储介质
CN114743541B (zh) * 2022-04-24 2023-03-17 广东海洋大学 一种英语听说学习用互动系统

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4284846A (en) * 1978-05-08 1981-08-18 John Marley System and method for sound recognition
JPS59226400A (ja) * 1983-06-07 1984-12-19 松下電器産業株式会社 音声認識装置
US5131043A (en) * 1983-09-05 1992-07-14 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for speech recognition wherein decisions are made based on phonemes
US4991216A (en) * 1983-09-22 1991-02-05 Matsushita Electric Industrial Co., Ltd. Method for speech recognition
US4920568A (en) * 1985-07-16 1990-04-24 Sharp Kabushiki Kaisha Method of distinguishing voice from noise
US5027408A (en) * 1987-04-09 1991-06-25 Kroeker John P Speech-recognition circuitry employing phoneme estimation
US4910784A (en) * 1987-07-30 1990-03-20 Texas Instruments Incorporated Low cost speech recognition system and method
DE68910859T2 (de) * 1988-03-11 1994-12-08 British Telecommunications P.L.C., London Detektion für die Anwesenheit eines Sprachsignals.
JPH01277899A (ja) * 1988-04-30 1989-11-08 Oki Electric Ind Co Ltd 音声帯域内信号検出方式
KR950013553B1 (ko) * 1990-05-28 1995-11-08 마쯔시다덴기산교 가부시기가이샤 음성신호처리장치

Also Published As

Publication number Publication date
EP1083541A2 (de) 2001-03-14
DE69430082T2 (de) 2002-10-31
DE69433254T2 (de) 2004-08-12
EP0625774B1 (de) 2002-03-13
EP1083541A3 (de) 2002-02-20
EP0625774A2 (de) 1994-11-23
DE69430082D1 (de) 2002-04-18
EP1083542A3 (de) 2002-01-23
DE69433254D1 (de) 2003-11-20
DE69432943D1 (de) 2003-08-14
US5611019A (en) 1997-03-11
EP0625774A3 (de) 1996-10-30
JPH06332492A (ja) 1994-12-02
EP1083541B1 (de) 2003-07-09
EP1083542B1 (de) 2003-10-15
EP1083542A2 (de) 2001-03-14

Similar Documents

Publication Publication Date Title
DE69430082T2 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69831991D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69633524D1 (de) Verfahren und Gerät zur Objekterfassung
DE69420400D1 (de) Verfahren und gerät zur sprechererkennung
DE69324629T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69518705D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69430426D1 (de) Vorrichtung und verfahren zur feuerbekämpfung
DE69332459D1 (de) Verfahren und Vorrichtung zur Zeichenerkennung
DE59308717D1 (de) Verfahren und Vorrichtung zur Film-Mode-Detektion
DE69309557D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69331044T2 (de) Vorrichtung und Verfahren zur syntaktischen Signalanalyse
DE69422845T2 (de) Vorrichtung und Verfahren zur Koordinateneingabe
DE69431445D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69629538D1 (de) Vorrichtung und Verfahren zur Trennung
DE69715071D1 (de) Verfahren und Vorrichtung zur Sprachverarbeitung
DE69412457D1 (de) Verfahren und vorrichtung zur atem-erkennung
DE69623879D1 (de) Vorrichtung und Verfahren zur Volumenermittlung
DE69803202D1 (de) Verfahren und vorrichtung zur sprachdetektion
DE69517829T2 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69419970T2 (de) Verfahren und Vorrichtung zur Elektroplattierung
DE69417273D1 (de) Verfahren und Vorrichtung zur Mustererkennung
DE69620304T2 (de) Vorrichtung und Verfahren zur Spracherkennung
DE59505720D1 (de) Vorrichtung und verfahren zur erkennung von objekten
DE69419978T2 (de) Verfahren und Vorrichtung zur Fehlstellenfeststellung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee