DE69920047T2 - Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) - Google Patents
Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) Download PDFInfo
- Publication number
- DE69920047T2 DE69920047T2 DE69920047T DE69920047T DE69920047T2 DE 69920047 T2 DE69920047 T2 DE 69920047T2 DE 69920047 T DE69920047 T DE 69920047T DE 69920047 T DE69920047 T DE 69920047T DE 69920047 T2 DE69920047 T2 DE 69920047T2
- Authority
- DE
- Germany
- Prior art keywords
- speech
- audio signal
- detection
- pure
- energy
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 105
- 238000001514 detection method Methods 0.000 title claims abstract description 61
- 238000000034 method Methods 0.000 claims abstract description 51
- 230000000877 morphologic effect Effects 0.000 claims abstract description 28
- 239000000203 mixture Substances 0.000 claims abstract description 7
- 238000004364 calculation method Methods 0.000 claims description 23
- 238000001914 filtration Methods 0.000 claims description 11
- 230000003628 erosive effect Effects 0.000 claims description 9
- 230000010339 dilation Effects 0.000 claims description 7
- 230000000717 retained effect Effects 0.000 claims 1
- 238000012549 training Methods 0.000 abstract description 7
- 230000003044 adaptive effect Effects 0.000 abstract description 2
- 230000001594 aberrant effect Effects 0.000 abstract 1
- 238000005259 measurement Methods 0.000 abstract 1
- 238000012545 processing Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 8
- 238000004140 cleaning Methods 0.000 description 6
- 238000005070 sampling Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000002093 peripheral effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 2
- KWYUFKZDYYNOTN-UHFFFAOYSA-M potassium hydroxide Substances [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000006837 decompression Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000001020 rhythmical effect Effects 0.000 description 1
- 239000000264 sodium ferrocyanide Substances 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Monitoring And Testing Of Exchanges (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201705 | 1988-06-02 | ||
| US09/201,705 US6205422B1 (en) | 1998-11-30 | 1998-11-30 | Morphological pure speech detection using valley percentage |
| PCT/US1999/028401 WO2000033294A1 (en) | 1998-11-30 | 1999-11-30 | Pure speech detection using valley percentage |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| DE69920047D1 DE69920047D1 (de) | 2004-10-14 |
| DE69920047T2 true DE69920047T2 (de) | 2005-01-20 |
Family
ID=22746956
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| DE69920047T Expired - Lifetime DE69920047T2 (de) | 1998-11-30 | 1999-11-30 | Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US6205422B1 (enExample) |
| EP (1) | EP1141938B1 (enExample) |
| JP (1) | JP4652575B2 (enExample) |
| AT (1) | ATE275750T1 (enExample) |
| DE (1) | DE69920047T2 (enExample) |
| WO (1) | WO2000033294A1 (enExample) |
Families Citing this family (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6801895B1 (en) * | 1998-12-07 | 2004-10-05 | At&T Corp. | Method and apparatus for segmenting a multi-media program based upon audio events |
| KR100429896B1 (ko) * | 2001-11-22 | 2004-05-03 | 한국전자통신연구원 | 잡음 환경에서의 음성신호 검출방법 및 그 장치 |
| WO2005124722A2 (en) * | 2004-06-12 | 2005-12-29 | Spl Development, Inc. | Aural rehabilitation system and method |
| US20070011001A1 (en) * | 2005-07-11 | 2007-01-11 | Samsung Electronics Co., Ltd. | Apparatus for predicting the spectral information of voice signals and a method therefor |
| KR100713366B1 (ko) * | 2005-07-11 | 2007-05-04 | 삼성전자주식회사 | 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치 |
| KR100800873B1 (ko) | 2005-10-28 | 2008-02-04 | 삼성전자주식회사 | 음성 신호 검출 시스템 및 방법 |
| KR100790110B1 (ko) * | 2006-03-18 | 2008-01-02 | 삼성전자주식회사 | 모폴로지 기반의 음성 신호 코덱 방법 및 장치 |
| KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | 삼성전자주식회사 | 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법 |
| US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
| KR100860830B1 (ko) * | 2006-12-13 | 2008-09-30 | 삼성전자주식회사 | 음성 신호의 스펙트럼 정보 추정 장치 및 방법 |
| US8935158B2 (en) | 2006-12-13 | 2015-01-13 | Samsung Electronics Co., Ltd. | Apparatus and method for comparing frames using spectral information of audio signal |
| US8355511B2 (en) * | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
| US8521530B1 (en) * | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
| US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
| EP2724340B1 (en) * | 2011-07-07 | 2019-05-15 | Nuance Communications, Inc. | Single channel suppression of impulsive interferences in noisy speech signals |
| US9286907B2 (en) * | 2011-11-23 | 2016-03-15 | Creative Technology Ltd | Smart rejecter for keyboard click noise |
| US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
| US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
| DE112015003945T5 (de) | 2014-08-28 | 2017-05-11 | Knowles Electronics, Llc | Mehrquellen-Rauschunterdrückung |
| US20170264942A1 (en) * | 2016-03-11 | 2017-09-14 | Mediatek Inc. | Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction |
| US12016098B1 (en) | 2019-09-12 | 2024-06-18 | Renesas Electronics America | System and method for user presence detection based on audio events |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4063033A (en) * | 1975-12-30 | 1977-12-13 | Rca Corporation | Signal quality evaluator |
| US4281218A (en) * | 1979-10-26 | 1981-07-28 | Bell Telephone Laboratories, Incorporated | Speech-nonspeech detector-classifier |
| US4628529A (en) * | 1985-07-01 | 1986-12-09 | Motorola, Inc. | Noise suppression system |
| US4630304A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
| JPH01158499A (ja) * | 1987-12-16 | 1989-06-21 | Hitachi Ltd | 定常雑音除去方式 |
| US5208864A (en) * | 1989-03-10 | 1993-05-04 | Nippon Telegraph & Telephone Corporation | Method of detecting acoustic signal |
| US4975657A (en) * | 1989-11-02 | 1990-12-04 | Motorola Inc. | Speech detector for automatic level control systems |
| US5323337A (en) * | 1992-08-04 | 1994-06-21 | Loral Aerospace Corp. | Signal detector employing mean energy and variance of energy content comparison for noise detection |
| US5479560A (en) * | 1992-10-30 | 1995-12-26 | Technology Research Association Of Medical And Welfare Apparatus | Formant detecting device and speech processing apparatus |
| JP3626492B2 (ja) * | 1993-07-07 | 2005-03-09 | ポリコム・インコーポレイテッド | 会話の品質向上のための背景雑音の低減 |
| US5826230A (en) | 1994-07-18 | 1998-10-20 | Matsushita Electric Industrial Co., Ltd. | Speech detection device |
| US6037988A (en) | 1996-03-22 | 2000-03-14 | Microsoft Corp | Method for generating sprites for object-based coding sytems using masks and rounding average |
| US6075875A (en) | 1996-09-30 | 2000-06-13 | Microsoft Corporation | Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results |
| JP3607450B2 (ja) * | 1997-03-05 | 2005-01-05 | Kddi株式会社 | オーディオ情報分類装置 |
| JP3160228B2 (ja) * | 1997-04-30 | 2001-04-25 | 日本放送協会 | 音声区間検出方法およびその装置 |
-
1998
- 1998-11-30 US US09/201,705 patent/US6205422B1/en not_active Expired - Lifetime
-
1999
- 1999-11-30 EP EP99968458A patent/EP1141938B1/en not_active Expired - Lifetime
- 1999-11-30 WO PCT/US1999/028401 patent/WO2000033294A1/en not_active Ceased
- 1999-11-30 JP JP2000585861A patent/JP4652575B2/ja not_active Expired - Fee Related
- 1999-11-30 AT AT99968458T patent/ATE275750T1/de not_active IP Right Cessation
- 1999-11-30 DE DE69920047T patent/DE69920047T2/de not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| EP1141938A1 (en) | 2001-10-10 |
| DE69920047D1 (de) | 2004-10-14 |
| ATE275750T1 (de) | 2004-09-15 |
| EP1141938B1 (en) | 2004-09-08 |
| WO2000033294A9 (en) | 2001-07-05 |
| JP2002531882A (ja) | 2002-09-24 |
| JP4652575B2 (ja) | 2011-03-16 |
| WO2000033294A1 (en) | 2000-06-08 |
| US6205422B1 (en) | 2001-03-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE69920047T2 (de) | Detektion von reiner sprache in einem audio signal, mit hilfe einer detektionsgrösse (valley percentage) | |
| DE3236832C2 (de) | Verfahren und Gerät zur Sprachanalyse | |
| DE3236834C2 (de) | Verfahren und Gerät zur Sprachanalyse | |
| DE69432943T2 (de) | Verfahren und Vorrichtung zur Sprachdetektion | |
| DE69811310T2 (de) | Verfahren und Vorrichtung zur Detektion und Endpunkt-Detektion von Vordergrund-Sprachsignalen | |
| DE69926851T2 (de) | Verfahren und Vorrichtung zur Sprachaktivitätsdetektion | |
| DE69518705T2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
| DE69326044T2 (de) | Verfahren zur Erkennung von Sprachsignalen | |
| DE60124842T2 (de) | Rauschrobuste Mustererkennung | |
| DE69720087T2 (de) | Verfahren und Vorrichtung zur Unterdrückung von Hintergrundmusik oder -geräuschen im Eingangssignal eines Spracherkenners | |
| DE60108373T2 (de) | Verfahren zur Detektion von Emotionen in Sprachsignalen unter Verwendung von Sprecheridentifikation | |
| DE3101851C2 (de) | Vorrichtung zum Erkennen von Sprache | |
| DE69121145T2 (de) | Spektralbewertungsverfahren zur verbesserung der widerstandsfähigkeit gegen rauschen bei der spracherkennung | |
| DE602004003439T2 (de) | Rauschunterdrückung zur robusten Spracherkennung | |
| DE112018006885B4 (de) | Trainingsvorrichtung,sprachaktivitätsdetektor und verfahren zur erfassung einer sprachaktivität | |
| DE2825110A1 (de) | Verfahren zur erkennung kontinuierlicher sprachsignale | |
| EP1388145B1 (de) | Vorrichtung und verfahren zum analysieren eines audiosignals hinsichtlich von rhythmusinformationen | |
| DE602004008666T2 (de) | Verfolgen von Vokaltraktresonanzen unter Verwendung eines nichtlinearen Prädiktors | |
| DE69616724T2 (de) | Verfahren und System für die Spracherkennung | |
| DE60023851T2 (de) | Verfahren und vorrichtung zur erzeugung von zufallszahlen für mit 1/8 bitrate arbeitenden sprachkodierer | |
| DE69106588T2 (de) | Vorrichtung um Sprachgeräusch zu trennen. | |
| EP0076233A1 (de) | Verfahren und Vorrichtung zur redundanzvermindernden digitalen Sprachverarbeitung | |
| DE60307965T2 (de) | Vorrichtung und Verfahren zum Ändern der Wiedergabegeschwindigkeit von gespeicherten Sprachsignalen | |
| DE602005004464T2 (de) | Sprachverbesserung | |
| DE19581667C2 (de) | Spracherkennungssystem und Verfahren zur Spracherkennung |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 8364 | No opposition during term of opposition |