DE69920047T2 - Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) - Google Patents

Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) Download PDF

Info

Publication number
DE69920047T2
DE69920047T2 DE69920047T DE69920047T DE69920047T2 DE 69920047 T2 DE69920047 T2 DE 69920047T2 DE 69920047 T DE69920047 T DE 69920047T DE 69920047 T DE69920047 T DE 69920047T DE 69920047 T2 DE69920047 T2 DE 69920047T2
Authority
DE
Germany
Prior art keywords
speech
audio signal
detection
pure
energy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69920047T
Other languages
German (de)
English (en)
Other versions
DE69920047D1 (de
Inventor
Chuang Gu
Ming-Chieh Lee
Wei-Ge Chen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of DE69920047D1 publication Critical patent/DE69920047D1/de
Application granted granted Critical
Publication of DE69920047T2 publication Critical patent/DE69920047T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Monitoring And Testing Of Exchanges (AREA)
  • Machine Translation (AREA)
DE69920047T 1998-11-30 1999-11-30 Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) Expired - Lifetime DE69920047T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201705 1988-06-02
US09/201,705 US6205422B1 (en) 1998-11-30 1998-11-30 Morphological pure speech detection using valley percentage
PCT/US1999/028401 WO2000033294A1 (en) 1998-11-30 1999-11-30 Pure speech detection using valley percentage

Publications (2)

Publication Number Publication Date
DE69920047D1 DE69920047D1 (de) 2004-10-14
DE69920047T2 true DE69920047T2 (de) 2005-01-20

Family

ID=22746956

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69920047T Expired - Lifetime DE69920047T2 (de) 1998-11-30 1999-11-30 Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)

Country Status (6)

Country Link
US (1) US6205422B1 (enExample)
EP (1) EP1141938B1 (enExample)
JP (1) JP4652575B2 (enExample)
AT (1) ATE275750T1 (enExample)
DE (1) DE69920047T2 (enExample)
WO (1) WO2000033294A1 (enExample)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6801895B1 (en) * 1998-12-07 2004-10-05 At&T Corp. Method and apparatus for segmenting a multi-media program based upon audio events
KR100429896B1 (ko) * 2001-11-22 2004-05-03 한국전자통신연구원 잡음 환경에서의 음성신호 검출방법 및 그 장치
WO2005124722A2 (en) * 2004-06-12 2005-12-29 Spl Development, Inc. Aural rehabilitation system and method
US20070011001A1 (en) * 2005-07-11 2007-01-11 Samsung Electronics Co., Ltd. Apparatus for predicting the spectral information of voice signals and a method therefor
KR100713366B1 (ko) * 2005-07-11 2007-05-04 삼성전자주식회사 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치
KR100800873B1 (ko) 2005-10-28 2008-02-04 삼성전자주식회사 음성 신호 검출 시스템 및 방법
KR100790110B1 (ko) * 2006-03-18 2008-01-02 삼성전자주식회사 모폴로지 기반의 음성 신호 코덱 방법 및 장치
KR100762596B1 (ko) * 2006-04-05 2007-10-01 삼성전자주식회사 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
KR100860830B1 (ko) * 2006-12-13 2008-09-30 삼성전자주식회사 음성 신호의 스펙트럼 정보 추정 장치 및 방법
US8935158B2 (en) 2006-12-13 2015-01-13 Samsung Electronics Co., Ltd. Apparatus and method for comparing frames using spectral information of audio signal
US8355511B2 (en) * 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) * 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
EP2724340B1 (en) * 2011-07-07 2019-05-15 Nuance Communications, Inc. Single channel suppression of impulsive interferences in noisy speech signals
US9286907B2 (en) * 2011-11-23 2016-03-15 Creative Technology Ltd Smart rejecter for keyboard click noise
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
DE112015003945T5 (de) 2014-08-28 2017-05-11 Knowles Electronics, Llc Mehrquellen-Rauschunterdrückung
US20170264942A1 (en) * 2016-03-11 2017-09-14 Mediatek Inc. Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction
US12016098B1 (en) 2019-09-12 2024-06-18 Renesas Electronics America System and method for user presence detection based on audio events

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4063033A (en) * 1975-12-30 1977-12-13 Rca Corporation Signal quality evaluator
US4281218A (en) * 1979-10-26 1981-07-28 Bell Telephone Laboratories, Incorporated Speech-nonspeech detector-classifier
US4628529A (en) * 1985-07-01 1986-12-09 Motorola, Inc. Noise suppression system
US4630304A (en) * 1985-07-01 1986-12-16 Motorola, Inc. Automatic background noise estimator for a noise suppression system
JPH01158499A (ja) * 1987-12-16 1989-06-21 Hitachi Ltd 定常雑音除去方式
US5208864A (en) * 1989-03-10 1993-05-04 Nippon Telegraph & Telephone Corporation Method of detecting acoustic signal
US4975657A (en) * 1989-11-02 1990-12-04 Motorola Inc. Speech detector for automatic level control systems
US5323337A (en) * 1992-08-04 1994-06-21 Loral Aerospace Corp. Signal detector employing mean energy and variance of energy content comparison for noise detection
US5479560A (en) * 1992-10-30 1995-12-26 Technology Research Association Of Medical And Welfare Apparatus Formant detecting device and speech processing apparatus
JP3626492B2 (ja) * 1993-07-07 2005-03-09 ポリコム・インコーポレイテッド 会話の品質向上のための背景雑音の低減
US5826230A (en) 1994-07-18 1998-10-20 Matsushita Electric Industrial Co., Ltd. Speech detection device
US6037988A (en) 1996-03-22 2000-03-14 Microsoft Corp Method for generating sprites for object-based coding sytems using masks and rounding average
US6075875A (en) 1996-09-30 2000-06-13 Microsoft Corporation Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results
JP3607450B2 (ja) * 1997-03-05 2005-01-05 Kddi株式会社 オーディオ情報分類装置
JP3160228B2 (ja) * 1997-04-30 2001-04-25 日本放送協会 音声区間検出方法およびその装置

Also Published As

Publication number Publication date
EP1141938A1 (en) 2001-10-10
DE69920047D1 (de) 2004-10-14
ATE275750T1 (de) 2004-09-15
EP1141938B1 (en) 2004-09-08
WO2000033294A9 (en) 2001-07-05
JP2002531882A (ja) 2002-09-24
JP4652575B2 (ja) 2011-03-16
WO2000033294A1 (en) 2000-06-08
US6205422B1 (en) 2001-03-20

Similar Documents

Publication Publication Date Title
DE69920047T2 (de) Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)
DE3236832C2 (de) Verfahren und Gerät zur Sprachanalyse
DE3236834C2 (de) Verfahren und Gerät zur Sprachanalyse
DE69432943T2 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69811310T2 (de) Verfahren und Vorrichtung zur Detektion und Endpunkt-Detektion von Vordergrund-Sprachsignalen
DE69926851T2 (de) Verfahren und Vorrichtung zur Sprachaktivitätsdetektion
DE69518705T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69326044T2 (de) Verfahren zur Erkennung von Sprachsignalen
DE60124842T2 (de) Rauschrobuste Mustererkennung
DE69720087T2 (de) Verfahren und Vorrichtung zur Unterdrückung von Hintergrundmusik oder -geräuschen im Eingangssignal eines Spracherkenners
DE60108373T2 (de) Verfahren zur Detektion von Emotionen in Sprachsignalen unter Verwendung von Sprecheridentifikation
DE3101851C2 (de) Vorrichtung zum Erkennen von Sprache
DE69121145T2 (de) Spektralbewertungsverfahren zur verbesserung der widerstandsfähigkeit gegen rauschen bei der spracherkennung
DE602004003439T2 (de) Rauschunterdrückung zur robusten Spracherkennung
DE112018006885B4 (de) Trainingsvorrichtung,sprachaktivitätsdetektor und verfahren zur erfassung einer sprachaktivität
DE2825110A1 (de) Verfahren zur erkennung kontinuierlicher sprachsignale
EP1388145B1 (de) Vorrichtung und verfahren zum analysieren eines audiosignals hinsichtlich von rhythmusinformationen
DE602004008666T2 (de) Verfolgen von Vokaltraktresonanzen unter Verwendung eines nichtlinearen Prädiktors
DE69616724T2 (de) Verfahren und System für die Spracherkennung
DE60023851T2 (de) Verfahren und vorrichtung zur erzeugung von zufallszahlen für mit 1/8 bitrate arbeitenden sprachkodierer
DE69106588T2 (de) Vorrichtung um Sprachgeräusch zu trennen.
EP0076233A1 (de) Verfahren und Vorrichtung zur redundanzvermindernden digitalen Sprachverarbeitung
DE60307965T2 (de) Vorrichtung und Verfahren zum Ändern der Wiedergabegeschwindigkeit von gespeicherten Sprachsignalen
DE602005004464T2 (de) Sprachverbesserung
DE19581667C2 (de) Spracherkennungssystem und Verfahren zur Spracherkennung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition