JP4652575B2 - バレーパーセンテージを使用した純粋音声の検出 - Google Patents

バレーパーセンテージを使用した純粋音声の検出 Download PDF

Info

Publication number
JP4652575B2
JP4652575B2 JP2000585861A JP2000585861A JP4652575B2 JP 4652575 B2 JP4652575 B2 JP 4652575B2 JP 2000585861 A JP2000585861 A JP 2000585861A JP 2000585861 A JP2000585861 A JP 2000585861A JP 4652575 B2 JP4652575 B2 JP 4652575B2
Authority
JP
Japan
Prior art keywords
audio signal
window
speech
audio
energy level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2000585861A
Other languages
English (en)
Japanese (ja)
Other versions
JP2002531882A (ja
JP2002531882A5 (enExample
Inventor
グ チゥアン
リー ミン−チエフ
チェン ウエイ−ジ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of JP2002531882A publication Critical patent/JP2002531882A/ja
Publication of JP2002531882A5 publication Critical patent/JP2002531882A5/ja
Application granted granted Critical
Publication of JP4652575B2 publication Critical patent/JP4652575B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Monitoring And Testing Of Exchanges (AREA)
  • Machine Translation (AREA)
JP2000585861A 1998-11-30 1999-11-30 バレーパーセンテージを使用した純粋音声の検出 Expired - Fee Related JP4652575B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/201,705 1998-11-30
US09/201,705 US6205422B1 (en) 1998-11-30 1998-11-30 Morphological pure speech detection using valley percentage
PCT/US1999/028401 WO2000033294A1 (en) 1998-11-30 1999-11-30 Pure speech detection using valley percentage

Publications (3)

Publication Number Publication Date
JP2002531882A JP2002531882A (ja) 2002-09-24
JP2002531882A5 JP2002531882A5 (enExample) 2007-01-25
JP4652575B2 true JP4652575B2 (ja) 2011-03-16

Family

ID=22746956

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2000585861A Expired - Fee Related JP4652575B2 (ja) 1998-11-30 1999-11-30 バレーパーセンテージを使用した純粋音声の検出

Country Status (6)

Country Link
US (1) US6205422B1 (enExample)
EP (1) EP1141938B1 (enExample)
JP (1) JP4652575B2 (enExample)
AT (1) ATE275750T1 (enExample)
DE (1) DE69920047T2 (enExample)
WO (1) WO2000033294A1 (enExample)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6801895B1 (en) * 1998-12-07 2004-10-05 At&T Corp. Method and apparatus for segmenting a multi-media program based upon audio events
KR100429896B1 (ko) * 2001-11-22 2004-05-03 한국전자통신연구원 잡음 환경에서의 음성신호 검출방법 및 그 장치
WO2005124722A2 (en) * 2004-06-12 2005-12-29 Spl Development, Inc. Aural rehabilitation system and method
US20070011001A1 (en) * 2005-07-11 2007-01-11 Samsung Electronics Co., Ltd. Apparatus for predicting the spectral information of voice signals and a method therefor
KR100713366B1 (ko) * 2005-07-11 2007-05-04 삼성전자주식회사 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치
KR100800873B1 (ko) 2005-10-28 2008-02-04 삼성전자주식회사 음성 신호 검출 시스템 및 방법
KR100790110B1 (ko) * 2006-03-18 2008-01-02 삼성전자주식회사 모폴로지 기반의 음성 신호 코덱 방법 및 장치
KR100762596B1 (ko) * 2006-04-05 2007-10-01 삼성전자주식회사 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
KR100860830B1 (ko) * 2006-12-13 2008-09-30 삼성전자주식회사 음성 신호의 스펙트럼 정보 추정 장치 및 방법
US8935158B2 (en) 2006-12-13 2015-01-13 Samsung Electronics Co., Ltd. Apparatus and method for comparing frames using spectral information of audio signal
US8355511B2 (en) * 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) * 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
EP2724340B1 (en) * 2011-07-07 2019-05-15 Nuance Communications, Inc. Single channel suppression of impulsive interferences in noisy speech signals
US9286907B2 (en) * 2011-11-23 2016-03-15 Creative Technology Ltd Smart rejecter for keyboard click noise
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
DE112015003945T5 (de) 2014-08-28 2017-05-11 Knowles Electronics, Llc Mehrquellen-Rauschunterdrückung
US20170264942A1 (en) * 2016-03-11 2017-09-14 Mediatek Inc. Method and Apparatus for Aligning Multiple Audio and Video Tracks for 360-Degree Reconstruction
US12016098B1 (en) 2019-09-12 2024-06-18 Renesas Electronics America System and method for user presence detection based on audio events

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4063033A (en) * 1975-12-30 1977-12-13 Rca Corporation Signal quality evaluator
US4281218A (en) * 1979-10-26 1981-07-28 Bell Telephone Laboratories, Incorporated Speech-nonspeech detector-classifier
US4628529A (en) * 1985-07-01 1986-12-09 Motorola, Inc. Noise suppression system
US4630304A (en) * 1985-07-01 1986-12-16 Motorola, Inc. Automatic background noise estimator for a noise suppression system
JPH01158499A (ja) * 1987-12-16 1989-06-21 Hitachi Ltd 定常雑音除去方式
US5208864A (en) * 1989-03-10 1993-05-04 Nippon Telegraph & Telephone Corporation Method of detecting acoustic signal
US4975657A (en) * 1989-11-02 1990-12-04 Motorola Inc. Speech detector for automatic level control systems
US5323337A (en) * 1992-08-04 1994-06-21 Loral Aerospace Corp. Signal detector employing mean energy and variance of energy content comparison for noise detection
US5479560A (en) * 1992-10-30 1995-12-26 Technology Research Association Of Medical And Welfare Apparatus Formant detecting device and speech processing apparatus
JP3626492B2 (ja) * 1993-07-07 2005-03-09 ポリコム・インコーポレイテッド 会話の品質向上のための背景雑音の低減
US5826230A (en) 1994-07-18 1998-10-20 Matsushita Electric Industrial Co., Ltd. Speech detection device
US6037988A (en) 1996-03-22 2000-03-14 Microsoft Corp Method for generating sprites for object-based coding sytems using masks and rounding average
US6075875A (en) 1996-09-30 2000-06-13 Microsoft Corporation Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results
JP3607450B2 (ja) * 1997-03-05 2005-01-05 Kddi株式会社 オーディオ情報分類装置
JP3160228B2 (ja) * 1997-04-30 2001-04-25 日本放送協会 音声区間検出方法およびその装置

Also Published As

Publication number Publication date
EP1141938A1 (en) 2001-10-10
DE69920047D1 (de) 2004-10-14
ATE275750T1 (de) 2004-09-15
EP1141938B1 (en) 2004-09-08
DE69920047T2 (de) 2005-01-20
WO2000033294A9 (en) 2001-07-05
JP2002531882A (ja) 2002-09-24
WO2000033294A1 (en) 2000-06-08
US6205422B1 (en) 2001-03-20

Similar Documents

Publication Publication Date Title
JP4652575B2 (ja) バレーパーセンテージを使用した純粋音声の検出
Kinnunen Voice activity detection using MFCC features and support vector machine
US7117148B2 (en) Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
JP4797342B2 (ja) オーディオデータを自動的に認識する方法及び装置
Kiranyaz et al. A generic audio classification and segmentation approach for multimedia indexing and retrieval
JP2003177778A (ja) 音声抄録抽出方法、音声データ抄録抽出システム、音声抄録抽出システム、プログラム、及び、音声抄録選択方法
Wang et al. A fast and robust speech/music discrimination approach
KR100745976B1 (ko) 음향 모델을 이용한 음성과 비음성의 구분 방법 및 장치
US8838452B2 (en) Effective audio segmentation and classification
Thiruvengatanadhan Music genre classification using GMM
EP1531457B1 (en) Apparatus and method for segmentation of audio data into meta patterns
Wu et al. Robust speech/non-speech detection in adverse conditions using the fuzzy polarity correlation method
JP3607450B2 (ja) オーディオ情報分類装置
Ravindran et al. Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing
KR100714721B1 (ko) 음성 구간 검출 방법 및 장치
JP4201204B2 (ja) オーディオ情報分類装置
CN115762551A (zh) 鼾声检测方法、装置、计算机设备及存储介质
JPH01255000A (ja) 音声認識システムに使用されるテンプレートに雑音を選択的に付加するための装置及び方法
Benincasa et al. Voicing state determination of co-channel speech
CN112927700A (zh) 一种盲音频水印嵌入和提取方法及系统
JP4645866B2 (ja) ディジタル信号処理方法、学習方法及びそれらの装置並びにプログラム格納媒体
Ntalampiras et al. Speech/music discrimination based on discrete wavelet transform
Cai et al. Wavelet-based multi-feature voiced/unvoiced speech classification algorithm
SU1781701A1 (en) Method of separation of speech and nonstationary noise signals
Thiruvengatanadhan et al. Speech/music classification using SVM

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20061130

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20061130

RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20061130

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20091124

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100224

RD13 Notification of appointment of power of sub attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7433

Effective date: 20100720

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A821

Effective date: 20100720

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20101022

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20101112

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20101210

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20101216

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20131224

Year of fee payment: 3

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees
S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350