JP4652575B2 - Pure speech detection using valley percentage - Google Patents
- Publication number
- JP4652575B2 (application JP2000585861A)
- Authority
- JP
- Japan
- Prior art keywords
- audio signal
- window
- speech
- audio
- energy level
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Monitoring And Testing Of Exchanges (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/201,705 | 1998-11-30 | | |
| US09/201,705 US6205422B1 (en) | 1998-11-30 | 1998-11-30 | Morphological pure speech detection using valley percentage |
| PCT/US1999/028401 WO2000033294A1 (en) | 1998-11-30 | 1999-11-30 | Pure speech detection using valley percentage |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2002531882A (ja) | 2002-09-24 |
| JP2002531882A5 | 2007-01-25 |
| JP4652575B2 (ja) | 2011-03-16 |
Family
ID=22746956
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2000585861A, Expired - Fee Related, JP4652575B2 (ja) | Pure speech detection using valley percentage | 1998-11-30 | 1999-11-30 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US6205422B1 |
| EP (1) | EP1141938B1 |
| JP (1) | JP4652575B2 |
| AT (1) | ATE275750T1 |
| DE (1) | DE69920047T2 |
| WO (1) | WO2000033294A1 |
Families Citing this family (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6801895B1 (en) * | 1998-12-07 | 2004-10-05 | At&T Corp. | Method and apparatus for segmenting a multi-media program based upon audio events |
| KR100429896B1 (ko) * | 2001-11-22 | 2004-05-03 | Electronics and Telecommunications Research Institute | Method and apparatus for detecting speech signals in a noisy environment |
| WO2005124722A2 (en) * | 2004-06-12 | 2005-12-29 | Spl Development, Inc. | Aural rehabilitation system and method |
| US20070011001A1 (en) * | 2005-07-11 | 2007-01-11 | Samsung Electronics Co., Ltd. | Apparatus for predicting the spectral information of voice signals and a method therefor |
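| KR100713366B1 (ko) * | 2005-07-11 | 2007-05-04 | Samsung Electronics Co., Ltd. | Method and apparatus for extracting pitch information from an audio signal using morphology |
| KR100800873B1 (ko) | 2005-10-28 | 2008-02-04 | Samsung Electronics Co., Ltd. | System and method for detecting speech signals |
| KR100790110B1 (ko) * | 2006-03-18 | 2008-01-02 | Samsung Electronics Co., Ltd. | Morphology-based speech signal codec method and apparatus |
| KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | Samsung Electronics Co., Ltd. | Speech signal preprocessing system and method for extracting speech signal feature information |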
| US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
| KR100860830B1 (ko) * | 2006-12-13 | 2008-09-30 | Samsung Electronics Co., Ltd. | Apparatus and method for estimating spectral information of a speech signal |
| US8935158B2 (en) | 2006-12-13 | 2015-01-13 | Samsung Electronics Co., Ltd. | Apparatus and method for comparing frames using spectral information of audio signal |
| US8355511B2 (en) * | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
| US8521530B1 (en) * | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
| US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
| EP2724340B1 (en) * | 2011-07-07 | 2019-05-15 | Nuance Communications, Inc. | Single channel suppression of impulsive interferences in noisy speech signals |
| US9286907B2 (en) * | 2011-11-23 | 2016-03-15 | Creative Technology Ltd | Smart rejecter for keyboard click noise |
| US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
| US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
| DE112015003945T5 (de) | 2014-08-28 | 2017-05-11 | Knowles Electronics, Llc | Multi-source noise suppression |
| US20170264942A1 (en) * | 2016-03-11 | 2017-09-14 | Mediatek Inc. | Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction |
| US12016098B1 (en) | 2019-09-12 | 2024-06-18 | Renesas Electronics America | System and method for user presence detection based on audio events |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4063033A (en) * | 1975-12-30 | 1977-12-13 | Rca Corporation | Signal quality evaluator |
| US4281218A (en) * | 1979-10-26 | 1981-07-28 | Bell Telephone Laboratories, Incorporated | Speech-nonspeech detector-classifier |
| US4628529A (en) * | 1985-07-01 | 1986-12-09 | Motorola, Inc. | Noise suppression system |
| US4630304A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
| JPH01158499A (ja) * | 1987-12-16 | 1989-06-21 | Hitachi Ltd | Stationary noise removal system |
| US5208864A (en) * | 1989-03-10 | 1993-05-04 | Nippon Telegraph & Telephone Corporation | Method of detecting acoustic signal |
| US4975657A (en) * | 1989-11-02 | 1990-12-04 | Motorola Inc. | Speech detector for automatic level control systems |
| US5323337A (en) * | 1992-08-04 | 1994-06-21 | Loral Aerospace Corp. | Signal detector employing mean energy and variance of energy content comparison for noise detection |
| US5479560A (en) * | 1992-10-30 | 1995-12-26 | Technology Research Association Of Medical And Welfare Apparatus | Formant detecting device and speech processing apparatus |
| JP3626492B2 (ja) * | 1993-07-07 | 2005-03-09 | Polycom, Inc. | Background noise reduction for improving conversation quality |
| US5826230A (en) | 1994-07-18 | 1998-10-20 | Matsushita Electric Industrial Co., Ltd. | Speech detection device |
| US6037988A (en) | 1996-03-22 | 2000-03-14 | Microsoft Corp | Method for generating sprites for object-based coding sytems using masks and rounding average |
| US6075875A (en) | 1996-09-30 | 2000-06-13 | Microsoft Corporation | Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results |
| JP3607450B2 (ja) * | 1997-03-05 | 2005-01-05 | KDDI Corporation | Audio information classification device |
| JP3160228B2 (ja) * | 1997-04-30 | 2001-04-25 | Japan Broadcasting Corporation (NHK) | Speech interval detection method and apparatus |
1998
- 1998-11-30: US application US09/201,705, publication US6205422B1 (en), not active (Expired - Lifetime)
1999
- 1999-11-30: EP application EP99968458A, publication EP1141938B1 (en), not active (Expired - Lifetime)
- 1999-11-30: WO application PCT/US1999/028401, publication WO2000033294A1 (en), not active (Ceased)
- 1999-11-30: JP application JP2000585861A, publication JP4652575B2 (ja), not active (Expired - Fee Related)
- 1999-11-30: AT application AT99968458T, publication ATE275750T1 (de), not active (IP Right Cessation)
- 1999-11-30: DE application DE69920047T, publication DE69920047T2 (de), not active (Expired - Lifetime)
Also Published As
| Publication number | Publication date |
|---|---|
| EP1141938A1 (en) | 2001-10-10 |
| DE69920047D1 (de) | 2004-10-14 |
| ATE275750T1 (de) | 2004-09-15 |
| EP1141938B1 (en) | 2004-09-08 |
| DE69920047T2 (de) | 2005-01-20 |
| WO2000033294A9 (en) | 2001-07-05 |
| JP2002531882A (ja) | 2002-09-24 |
| WO2000033294A1 (en) | 2000-06-08 |
| US6205422B1 (en) | 2001-03-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP4652575B2 (ja) | | Pure speech detection using valley percentage |
| Kinnunen | | Voice activity detection using MFCC features and support vector machine |
| US7117148B2 (en) | | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization |
| JP4797342B2 (ja) | | Method and apparatus for automatically recognizing audio data |
| Kiranyaz et al. | | A generic audio classification and segmentation approach for multimedia indexing and retrieval |
| JP2003177778A (ja) | | Audio excerpt extraction method, audio data excerpt extraction system, audio excerpt extraction system, program, and audio excerpt selection method |
| Wang et al. | | A fast and robust speech/music discrimination approach |
| KR100745976B1 (ko) | | Method and apparatus for distinguishing speech from non-speech using an acoustic model |
| US8838452B2 (en) | | Effective audio segmentation and classification |
| Thiruvengatanadhan | | Music genre classification using GMM |
| EP1531457B1 (en) | | Apparatus and method for segmentation of audio data into meta patterns |
| Wu et al. | | Robust speech/non-speech detection in adverse conditions using the fuzzy polarity correlation method |
| JP3607450B2 (ja) | | Audio information classification device |
| Ravindran et al. | | Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing |
| KR100714721B1 (ko) | | Method and apparatus for detecting speech intervals |
| JP4201204B2 (ja) | | Audio information classification device |
| CN115762551A (zh) | | Snore detection method and apparatus, computer device, and storage medium |
| JPH01255000A (ja) | | Apparatus and method for selectively adding noise to templates used in a speech recognition system |
| Benincasa et al. | | Voicing state determination of co-channel speech |
| CN112927700A (zh) | | Blind audio watermark embedding and extraction method and system |
| JP4645866B2 (ja) | | Digital signal processing method, learning method, apparatuses therefor, and program storage medium |
| Ntalampiras et al. | | Speech/music discrimination based on discrete wavelet transform |
| Cai et al. | | Wavelet-based multi-feature voiced/unvoiced speech classification algorithm |
| SU1781701A1 (en) | | Method of separation of speech and nonstationary noise signals |
| Thiruvengatanadhan et al. | | Speech/music classification using SVM |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 2006-11-30 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
| 2006-11-30 | A621 | Written request for application examination | JAPANESE INTERMEDIATE CODE: A621 |
| 2006-11-30 | RD04 | Notification of resignation of power of attorney | JAPANESE INTERMEDIATE CODE: A7424 |
| 2009-11-24 | A131 | Notification of reasons for refusal | JAPANESE INTERMEDIATE CODE: A131 |
| 2010-02-24 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
| 2010-07-20 | RD13 | Notification of appointment of power of sub attorney | JAPANESE INTERMEDIATE CODE: A7433 |
| 2010-07-20 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A821 |
| 2010-10-22 | A131 | Notification of reasons for refusal | JAPANESE INTERMEDIATE CODE: A131 |
| 2010-11-12 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
| | TRDD | Decision of grant or rejection written | |
| 2010-12-10 | A01 | Written decision to grant a patent or to grant a registration (utility model) | JAPANESE INTERMEDIATE CODE: A01 |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | JAPANESE INTERMEDIATE CODE: A01 |
| 2010-12-16 | A61 | First payment of annual fees (during grant procedure) | JAPANESE INTERMEDIATE CODE: A61 |
| | R150 | Certificate of patent or registration of utility model | JAPANESE INTERMEDIATE CODE: R150 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | PAYMENT UNTIL: 2013-12-24; Year of fee payment: 3 |
| | R250 | Receipt of annual fees | JAPANESE INTERMEDIATE CODE: R250 |
| | LAPS | Cancellation because of no payment of annual fees | |
| | S111 | Request for change of ownership or part of ownership | JAPANESE INTERMEDIATE CODE: R313113 |
| | R350 | Written notification of registration of transfer | JAPANESE INTERMEDIATE CODE: R350 |