ATE275750T1 - Detection of pure speech in an audio signal using a detection measure (valley percentage)
- Publication number
- ATE275750T1
- Authority
- AT
- Austria
- Prior art keywords
- speech
- audio signal
- pure
- detection
- valley
- Prior art date
Concepts
| Concept | Type | Sections | Count |
|---|---|---|---|
| sound signal | Effects | title, abstract | 11 |
| detection method | Methods | title, abstract | 7 |
| method | Methods | abstract | 4 |
| morphologic effect | Effects | abstract | 3 |
| aberrant effect | Effects | abstract | 1 |
| adaptive effect | Effects | abstract | 1 |
| measurement | Methods | abstract | 1 |
| mixture | Substances | abstract | 1 |
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Monitoring And Testing Of Exchanges (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/201,705 US6205422B1 (en) | 1998-11-30 | 1998-11-30 | Morphological pure speech detection using valley percentage |
| PCT/US1999/028401 WO2000033294A1 (en) | 1998-11-30 | 1999-11-30 | Pure speech detection using valley percentage |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE275750T1 (de) | 2004-09-15 |
Family
ID=22746956
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT99968458T ATE275750T1 (de) | 1998-11-30 | 1999-11-30 | Detection of pure speech in an audio signal using a detection measure (valley percentage) |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US6205422B1 |
| EP (1) | EP1141938B1 |
| JP (1) | JP4652575B2 |
| AT (1) | ATE275750T1 |
| DE (1) | DE69920047T2 |
| WO (1) | WO2000033294A1 |
Families Citing this family (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6801895B1 (en) * | 1998-12-07 | 2004-10-05 | At&T Corp. | Method and apparatus for segmenting a multi-media program based upon audio events |
| KR100429896B1 (ko) * | 2001-11-22 | 2004-05-03 | 한국전자통신연구원 | Method and apparatus for detecting speech signals in a noisy environment |
| WO2005124722A2 (en) * | 2004-06-12 | 2005-12-29 | Spl Development, Inc. | Aural rehabilitation system and method |
| US20070011001A1 (en) * | 2005-07-11 | 2007-01-11 | Samsung Electronics Co., Ltd. | Apparatus for predicting the spectral information of voice signals and a method therefor |
| KR100713366B1 (ko) * | 2005-07-11 | 2007-05-04 | 삼성전자주식회사 | Method and apparatus for extracting pitch information from an audio signal using morphology |
| KR100800873B1 (ko) | 2005-10-28 | 2008-02-04 | 삼성전자주식회사 | Voice signal detection system and method |
| KR100790110B1 (ko) * | 2006-03-18 | 2008-01-02 | 삼성전자주식회사 | Morphology-based voice signal codec method and apparatus |
| KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | 삼성전자주식회사 | Speech signal preprocessing system and method for extracting speech signal feature information |
| US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
| KR100860830B1 (ko) * | 2006-12-13 | 2008-09-30 | 삼성전자주식회사 | Apparatus and method for estimating spectral information of a speech signal |
| US8935158B2 (en) | 2006-12-13 | 2015-01-13 | Samsung Electronics Co., Ltd. | Apparatus and method for comparing frames using spectral information of audio signal |
| US8355511B2 (en) * | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
| US8521530B1 (en) * | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
| US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
| EP2724340B1 (en) * | 2011-07-07 | 2019-05-15 | Nuance Communications, Inc. | Single channel suppression of impulsive interferences in noisy speech signals |
| US9286907B2 (en) * | 2011-11-23 | 2016-03-15 | Creative Technology Ltd | Smart rejecter for keyboard click noise |
| US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
| US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
| DE112015003945T5 (de) | 2014-08-28 | 2017-05-11 | Knowles Electronics, Llc | Multi-source noise suppression |
| US20170264942A1 (en) * | 2016-03-11 | 2017-09-14 | Mediatek Inc. | Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction |
| US12016098B1 (en) | 2019-09-12 | 2024-06-18 | Renesas Electronics America | System and method for user presence detection based on audio events |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4063033A (en) * | 1975-12-30 | 1977-12-13 | Rca Corporation | Signal quality evaluator |
| US4281218A (en) * | 1979-10-26 | 1981-07-28 | Bell Telephone Laboratories, Incorporated | Speech-nonspeech detector-classifier |
| US4628529A (en) * | 1985-07-01 | 1986-12-09 | Motorola, Inc. | Noise suppression system |
| US4630304A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
| JPH01158499A (ja) * | 1987-12-16 | 1989-06-21 | Hitachi Ltd | Stationary noise removal system |
| US5208864A (en) * | 1989-03-10 | 1993-05-04 | Nippon Telegraph & Telephone Corporation | Method of detecting acoustic signal |
| US4975657A (en) * | 1989-11-02 | 1990-12-04 | Motorola Inc. | Speech detector for automatic level control systems |
| US5323337A (en) * | 1992-08-04 | 1994-06-21 | Loral Aerospace Corp. | Signal detector employing mean energy and variance of energy content comparison for noise detection |
| US5479560A (en) * | 1992-10-30 | 1995-12-26 | Technology Research Association Of Medical And Welfare Apparatus | Formant detecting device and speech processing apparatus |
| JP3626492B2 (ja) * | 1993-07-07 | 2005-03-09 | ポリコム・インコーポレイテッド | Reduction of background noise for improving conversation quality |
| US5826230A (en) | 1994-07-18 | 1998-10-20 | Matsushita Electric Industrial Co., Ltd. | Speech detection device |
| US6037988A (en) | 1996-03-22 | 2000-03-14 | Microsoft Corp | Method for generating sprites for object-based coding systems using masks and rounding average |
| US6075875A (en) | 1996-09-30 | 2000-06-13 | Microsoft Corporation | Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results |
| JP3607450B2 (ja) * | 1997-03-05 | 2005-01-05 | Kddi株式会社 | Audio information classification device |
| JP3160228B2 (ja) * | 1997-04-30 | 2001-04-25 | 日本放送協会 | Speech interval detection method and apparatus |
1998
- 1998-11-30 US US09/201,705 patent/US6205422B1/en not_active Expired - Lifetime
1999
- 1999-11-30 EP EP99968458A patent/EP1141938B1/en not_active Expired - Lifetime
- 1999-11-30 WO PCT/US1999/028401 patent/WO2000033294A1/en not_active Ceased
- 1999-11-30 JP JP2000585861A patent/JP4652575B2/ja not_active Expired - Fee Related
- 1999-11-30 AT AT99968458T patent/ATE275750T1/de not_active IP Right Cessation
- 1999-11-30 DE DE69920047T patent/DE69920047T2/de not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| EP1141938A1 (en) | 2001-10-10 |
| DE69920047D1 (de) | 2004-10-14 |
| EP1141938B1 (en) | 2004-09-08 |
| DE69920047T2 (de) | 2005-01-20 |
| WO2000033294A9 (en) | 2001-07-05 |
| JP2002531882A (ja) | 2002-09-24 |
| JP4652575B2 (ja) | 2011-03-16 |
| WO2000033294A1 (en) | 2000-06-08 |
| US6205422B1 (en) | 2001-03-20 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| ATE275750T1 (de) | Detection of pure speech in an audio signal using a detection measure (valley percentage) | |
| Singh et al. | Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination | |
| US20090076814A1 (en) | Apparatus and method for determining speech signal | |
| US20090125304A1 (en) | Method and apparatus to detect voice activity | |
| CN102930870A (zh) | Bird sound recognition method using noise-robust power-normalized cepstral coefficients | |
| US5101434A (en) | Voice recognition using segmented time encoded speech | |
| Kwon et al. | Speaker change detection using a new weighted distance measure. | |
| Pourhomayoun et al. | Bioacoustic signal classification based on continuous region processing, grid masking and artificial neural network | |
| CN113889134B (zh) | Noise cancellation device and detection method thereof | |
| Chandra et al. | Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual | |
| Kumar et al. | Classification of voiced and non-voiced speech signals using empirical wavelet transform and multi-level local patterns | |
| JPH0462398B2 | | |
| Hu et al. | Separation of stop consonants | |
| Song et al. | Feature extraction and classification for audio information in news video | |
| RU94014278A (ru) | Method for recognizing isolated spoken words with speaker adaptation | |
| CN110299133B (zh) | Method for identifying illegal broadcasts based on keywords | |
| Pencak et al. | The NP speech activity detection algorithm | |
| Abu-Shikhah et al. | A novel pitch estimation technique using the Teager energy function | |
| JP4201204B2 (ja) | Audio information classification device | |
| de la Torre et al. | Noise robust model-based voice activity detection. | |
| Benincasa et al. | Voicing state determination of co-channel speech | |
| Iyer et al. | Structural usable speech measure using LPC residual | |
| Vavrek et al. | Audio classification utilizing a rule-based approach and the support vector machine classifier | |
| KR100835993B1 (ko) | Preprocessing method and apparatus for speech recognition using masking probability | |
| Ali et al. | Automatic detection and classification of stop consonants using an acoustic-phonetic feature-based system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |