DE69920047D1 - Detection of pure speech in an audio signal using a detection measure (valley percentage) - Google Patents
Info
- Publication number
- DE69920047D1
- Authority
- DE
- Germany
- Prior art keywords
- speech
- audio signal
- pure
- detection
- valley
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Machine Translation (AREA)
- Monitoring And Testing Of Exchanges (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/201,705 US6205422B1 (en) | 1998-11-30 | 1998-11-30 | Morphological pure speech detection using valley percentage |
US201705 | 1998-11-30 | | |
PCT/US1999/028401 WO2000033294A1 (en) | 1998-11-30 | 1999-11-30 | Pure speech detection using valley percentage |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69920047D1 | 2004-10-14 |
DE69920047T2 | 2005-01-20 |
Family
ID=22746956
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69920047T (Expired - Lifetime; published as DE69920047T2) | Detection of pure speech in an audio signal using a detection measure (valley percentage) | 1998-11-30 | 1999-11-30 |
Country Status (6)
Country | Link |
---|---|
US (1) | US6205422B1 (de) |
EP (1) | EP1141938B1 (de) |
JP (1) | JP4652575B2 (de) |
AT (1) | ATE275750T1 (de) |
DE (1) | DE69920047T2 (de) |
WO (1) | WO2000033294A1 (de) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6801895B1 (en) * | 1998-12-07 | 2004-10-05 | At&T Corp. | Method and apparatus for segmenting a multi-media program based upon audio events |
KR100429896B1 (ko) * | 2001-11-22 | 2004-05-03 | Electronics and Telecommunications Research Institute (ETRI) | Method and apparatus for detecting speech signals in a noisy environment |
WO2005124722A2 (en) * | 2004-06-12 | 2005-12-29 | Spl Development, Inc. | Aural rehabilitation system and method |
US20070011001A1 (en) * | 2005-07-11 | 2007-01-11 | Samsung Electronics Co., Ltd. | Apparatus for predicting the spectral information of voice signals and a method therefor |
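KR100713366B1 (ko) * | 2005-07-11 | 2007-05-04 | Samsung Electronics Co., Ltd. | Method and apparatus for extracting pitch information from an audio signal using morphology |
KR100800873B1 (ko) | 2005-10-28 | 2008-02-04 | Samsung Electronics Co., Ltd. | System and method for detecting speech signals |
KR100790110B1 (ko) * | 2006-03-18 | 2008-01-02 | Samsung Electronics Co., Ltd. | Morphology-based speech signal codec method and apparatus |
KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | Samsung Electronics Co., Ltd. | Speech signal preprocessing system and method for extracting speech signal feature information |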
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
US8935158B2 (en) | 2006-12-13 | 2015-01-13 | Samsung Electronics Co., Ltd. | Apparatus and method for comparing frames using spectral information of audio signal |
KR100860830B1 (ko) * | 2006-12-13 | 2008-09-30 | Samsung Electronics Co., Ltd. | Apparatus and method for estimating spectral information of a speech signal |
US8355511B2 (en) * | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
US8521530B1 (en) * | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
US9858942B2 (en) * | 2011-07-07 | 2018-01-02 | Nuance Communications, Inc. | Single channel suppression of impulsive interferences in noisy speech signals |
US9286907B2 (en) * | 2011-11-23 | 2016-03-15 | Creative Technology Ltd | Smart rejecter for keyboard click noise |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
WO2016033364A1 (en) | 2014-08-28 | 2016-03-03 | Audience, Inc. | Multi-sourced noise suppression |
US20170264942A1 (en) * | 2016-03-11 | 2017-09-14 | Mediatek Inc. | Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction |
US12016098B1 (en) | 2019-09-12 | 2024-06-18 | Renesas Electronics America | System and method for user presence detection based on audio events |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4063033A (en) * | 1975-12-30 | 1977-12-13 | Rca Corporation | Signal quality evaluator |
US4281218A (en) * | 1979-10-26 | 1981-07-28 | Bell Telephone Laboratories, Incorporated | Speech-nonspeech detector-classifier |
US4628529A (en) * | 1985-07-01 | 1986-12-09 | Motorola, Inc. | Noise suppression system |
US4630304A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
JPH01158499A (ja) * | 1987-12-16 | 1989-06-21 | Hitachi Ltd | Stationary noise removal system |
DE69011709T2 (de) * | 1989-03-10 | 1994-12-15 | Nippon Telegraph & Telephone | Device for detecting an acoustic signal. |
US4975657A (en) * | 1989-11-02 | 1990-12-04 | Motorola Inc. | Speech detector for automatic level control systems |
US5323337A (en) * | 1992-08-04 | 1994-06-21 | Loral Aerospace Corp. | Signal detector employing mean energy and variance of energy content comparison for noise detection |
US5479560A (en) * | 1992-10-30 | 1995-12-26 | Technology Research Association Of Medical And Welfare Apparatus | Formant detecting device and speech processing apparatus |
JP3626492B2 (ja) * | 1993-07-07 | 2005-03-09 | Polycom, Inc. | Reduction of background noise for improving conversation quality |
JP3604393B2 (ja) | 1994-07-18 | 2004-12-22 | Matsushita Electric Industrial Co., Ltd. | Voice detection device |
US6037988A (en) | 1996-03-22 | 2000-03-14 | Microsoft Corp | Method for generating sprites for object-based coding sytems using masks and rounding average |
US6075875A (en) | 1996-09-30 | 2000-06-13 | Microsoft Corporation | Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results |
JP3607450B2 (ja) * | 1997-03-05 | 2005-01-05 | KDDI Corporation | Audio information classification device |
JP3160228B2 (ja) * | 1997-04-30 | 2001-04-25 | Japan Broadcasting Corporation (NHK) | Speech interval detection method and apparatus |
1998
- 1998-11-30: US application US09/201,705, patent US6205422B1 (not active, Expired - Lifetime)
1999
- 1999-11-30: EP application EP99968458A, patent EP1141938B1 (not active, Expired - Lifetime)
- 1999-11-30: WO application PCT/US1999/028401, publication WO2000033294A1 (active, IP Right Grant)
- 1999-11-30: DE application DE69920047T, patent DE69920047T2 (not active, Expired - Lifetime)
- 1999-11-30: AT application AT99968458T, patent ATE275750T1 (not active, IP Right Cessation)
- 1999-11-30: JP application JP2000585861A, patent JP4652575B2 (not active, Expired - Fee Related)
Also Published As
Publication number | Publication date |
---|---|
EP1141938B1 (de) | 2004-09-08 |
DE69920047T2 (de) | 2005-01-20 |
JP4652575B2 (ja) | 2011-03-16 |
US6205422B1 (en) | 2001-03-20 |
EP1141938A1 (de) | 2001-10-10 |
WO2000033294A1 (en) | 2000-06-08 |
WO2000033294A9 (en) | 2001-07-05 |
ATE275750T1 (de) | 2004-09-15 |
JP2002531882A (ja) | 2002-09-24 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
DE69920047D1 (de) | Detection of pure speech in an audio signal using a detection measure (valley percentage) | |
US6993481B2 (en) | Detection of speech activity using feature model adaptation | |
Singh et al. | Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination | |
US8046215B2 (en) | Method and apparatus to detect voice activity by adding a random signal | |
KR20140031790A (ko) | Method and apparatus for robust speech interval detection in noisy environments | |
CN102194452A (zh) | Voice activity detection method in complex background noise | |
Kwon et al. | Speaker change detection using a new weighted distance measure. | |
Kumar et al. | Classification of voiced and non-voiced speech signals using empirical wavelet transform and multi-level local patterns | |
JPH0462398B2 (de) | | |
Song et al. | Feature extraction and classification for audio information in news video | |
Ravindran et al. | Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing | |
Hong et al. | Detection of dynamic structures of speech fundamental frequency in tonal languages | |
CN110299133A (zh) | Method for determining illegal broadcasts based on keywords | |
Abu-Shikhah et al. | A novel pitch estimation technique using the Teager energy function | |
Pencak et al. | The NP speech activity detection algorithm | |
Hidayat | Frequency domain analysis of MFCC feature extraction in children’s speech recognition system | |
Torre et al. | Noise robust model-based voice activity detection | |
KR100835993B1 (ko) | Preprocessing method and preprocessing apparatus for speech recognition using masking probability | |
Benincasa et al. | Voicing state determination of co-channel speech | |
Sudhakar et al. | Automatic speech segmentation to improve speech synthesis performance | |
Vavrek et al. | Audio classification utilizing a rule-based approach and the support vector machine classifier | |
Pasad et al. | Voice activity detection for children's read speech recognition in noisy conditions | |
Hidayat et al. | Analysis of Amplitude Threshold on Speech Recognition System | |
Ali et al. | Automatic detection and classification of stop consonants using an acoustic-phonetic feature-based system | |
Vini | Voice Activity Detection Techniques-A Review |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 8364 | No opposition during term of opposition | |