DE69920047D1 - Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) - Google Patents

Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)

Info

Publication number
DE69920047D1
DE69920047D1 DE69920047T DE69920047T DE69920047D1 DE 69920047 D1 DE69920047 D1 DE 69920047D1 DE 69920047 T DE69920047 T DE 69920047T DE 69920047 T DE69920047 T DE 69920047T DE 69920047 D1 DE69920047 D1 DE 69920047D1
Authority
DE
Germany
Prior art keywords
speech
audio signal
pure
detection
valley
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69920047T
Other languages
English (en)
Other versions
DE69920047T2 (de
Inventor
Chuang Gu
Ming-Chieh Lee
Wei-Ge Chen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of DE69920047D1 publication Critical patent/DE69920047D1/de
Publication of DE69920047T2 publication Critical patent/DE69920047T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Machine Translation (AREA)
  • Monitoring And Testing Of Exchanges (AREA)
DE69920047T 1998-11-30 1999-11-30 Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) Expired - Lifetime DE69920047T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/201,705 US6205422B1 (en) 1998-11-30 1998-11-30 Morphological pure speech detection using valley percentage
US201705 1998-11-30
PCT/US1999/028401 WO2000033294A1 (en) 1998-11-30 1999-11-30 Pure speech detection using valley percentage

Publications (2)

Publication Number Publication Date
DE69920047D1 true DE69920047D1 (de) 2004-10-14
DE69920047T2 DE69920047T2 (de) 2005-01-20

Family

ID=22746956

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69920047T Expired - Lifetime DE69920047T2 (de) 1998-11-30 1999-11-30 Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)

Country Status (6)

Country Link
US (1) US6205422B1 (de)
EP (1) EP1141938B1 (de)
JP (1) JP4652575B2 (de)
AT (1) ATE275750T1 (de)
DE (1) DE69920047T2 (de)
WO (1) WO2000033294A1 (de)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6801895B1 (en) * 1998-12-07 2004-10-05 At&T Corp. Method and apparatus for segmenting a multi-media program based upon audio events
KR100429896B1 (ko) * 2001-11-22 2004-05-03 한국전자통신연구원 잡음 환경에서의 음성신호 검출방법 및 그 장치
WO2005124722A2 (en) * 2004-06-12 2005-12-29 Spl Development, Inc. Aural rehabilitation system and method
US20070011001A1 (en) * 2005-07-11 2007-01-11 Samsung Electronics Co., Ltd. Apparatus for predicting the spectral information of voice signals and a method therefor
KR100713366B1 (ko) * 2005-07-11 2007-05-04 삼성전자주식회사 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치
KR100800873B1 (ko) 2005-10-28 2008-02-04 삼성전자주식회사 음성 신호 검출 시스템 및 방법
KR100790110B1 (ko) * 2006-03-18 2008-01-02 삼성전자주식회사 모폴로지 기반의 음성 신호 코덱 방법 및 장치
KR100762596B1 (ko) * 2006-04-05 2007-10-01 삼성전자주식회사 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
US8935158B2 (en) 2006-12-13 2015-01-13 Samsung Electronics Co., Ltd. Apparatus and method for comparing frames using spectral information of audio signal
KR100860830B1 (ko) * 2006-12-13 2008-09-30 삼성전자주식회사 음성 신호의 스펙트럼 정보 추정 장치 및 방법
US8355511B2 (en) * 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) * 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
US9858942B2 (en) * 2011-07-07 2018-01-02 Nuance Communications, Inc. Single channel suppression of impulsive interferences in noisy speech signals
US9286907B2 (en) * 2011-11-23 2016-03-15 Creative Technology Ltd Smart rejecter for keyboard click noise
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
WO2016033364A1 (en) 2014-08-28 2016-03-03 Audience, Inc. Multi-sourced noise suppression
US20170264942A1 (en) * 2016-03-11 2017-09-14 Mediatek Inc. Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction
US12016098B1 (en) 2019-09-12 2024-06-18 Renesas Electronics America System and method for user presence detection based on audio events

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4063033A (en) * 1975-12-30 1977-12-13 Rca Corporation Signal quality evaluator
US4281218A (en) * 1979-10-26 1981-07-28 Bell Telephone Laboratories, Incorporated Speech-nonspeech detector-classifier
US4628529A (en) * 1985-07-01 1986-12-09 Motorola, Inc. Noise suppression system
US4630304A (en) * 1985-07-01 1986-12-16 Motorola, Inc. Automatic background noise estimator for a noise suppression system
JPH01158499A (ja) * 1987-12-16 1989-06-21 Hitachi Ltd 定常雑音除去方式
DE69011709T2 (de) * 1989-03-10 1994-12-15 Nippon Telegraph & Telephone Einrichtung zur Feststellung eines akustischen Signals.
US4975657A (en) * 1989-11-02 1990-12-04 Motorola Inc. Speech detector for automatic level control systems
US5323337A (en) * 1992-08-04 1994-06-21 Loral Aerospace Corp. Signal detector employing mean energy and variance of energy content comparison for noise detection
US5479560A (en) * 1992-10-30 1995-12-26 Technology Research Association Of Medical And Welfare Apparatus Formant detecting device and speech processing apparatus
JP3626492B2 (ja) * 1993-07-07 2005-03-09 ポリコム・インコーポレイテッド 会話の品質向上のための背景雑音の低減
JP3604393B2 (ja) 1994-07-18 2004-12-22 松下電器産業株式会社 音声検出装置
US6037988A (en) 1996-03-22 2000-03-14 Microsoft Corp Method for generating sprites for object-based coding sytems using masks and rounding average
US6075875A (en) 1996-09-30 2000-06-13 Microsoft Corporation Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results
JP3607450B2 (ja) * 1997-03-05 2005-01-05 Kddi株式会社 オーディオ情報分類装置
JP3160228B2 (ja) * 1997-04-30 2001-04-25 日本放送協会 音声区間検出方法およびその装置

Also Published As

Publication number Publication date
EP1141938B1 (de) 2004-09-08
DE69920047T2 (de) 2005-01-20
JP4652575B2 (ja) 2011-03-16
US6205422B1 (en) 2001-03-20
EP1141938A1 (de) 2001-10-10
WO2000033294A1 (en) 2000-06-08
WO2000033294A9 (en) 2001-07-05
ATE275750T1 (de) 2004-09-15
JP2002531882A (ja) 2002-09-24

Similar Documents

Publication Publication Date Title
DE69920047D1 (de) Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage)
US6993481B2 (en) Detection of speech activity using feature model adaptation
Singh et al. Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination
US8046215B2 (en) Method and apparatus to detect voice activity by adding a random signal
KR20140031790A (ko) 잡음 환경에서 강인한 음성 구간 검출 방법 및 장치
CN102194452A (zh) 复杂背景噪声中的语音激活检测方法
Kwon et al. Speaker change detection using a new weighted distance measure.
Kumar et al. Classification of voiced and non-voiced speech signals using empirical wavelet transform and multi-level local patterns
JPH0462398B2 (de)
Song et al. Feature extraction and classification for audio information in news video
Ravindran et al. Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing
Hong et al. Detection of dynamic structures of speech fundamental frequency in tonal languages
CN110299133A (zh) 基于关键字判定非法广播的方法
Abu-Shikhah et al. A novel pitch estimation technique using the Teager energy function
Pencak et al. The NP speech activity detection algorithm
Hidayat Frequency domain analysis of MFCC feature extraction in children’s speech recognition system
Torre et al. Noise robust model-based voice activity detection
KR100835993B1 (ko) 마스킹 확률을 이용한 음성 인식 전처리 방법 및 전처리장치
Benincasa et al. Voicing state determination of co-channel speech
Sudhakar et al. Automatic speech segmentation to improve speech synthesis performance
Vavrek et al. Audio classification utilizing a rule-based approach and the support vector machine classifier
Pasad et al. Voice activity detection for children's read speech recognition in noisy conditions
Hidayat et al. Analysis of Amplitude Threshold on Speech Recognition System
Ali et al. Automatic detection and classification of stop consonants using an acoustic-phonetic feature-based system
Vini Voice Activity Detection Techniques-A Review

Legal Events

Date Code Title Description
8364 No opposition during term of opposition