ATE540398T1 - Sprachaktivitätsdetektionseinrichtung und verfahren - Google Patents

Sprachaktivitätsdetektionseinrichtung und verfahren

Info

Publication number
ATE540398T1
ATE540398T1 AT08734254T AT08734254T ATE540398T1 AT E540398 T1 ATE540398 T1 AT E540398T1 AT 08734254 T AT08734254 T AT 08734254T AT 08734254 T AT08734254 T AT 08734254T AT E540398 T1 ATE540398 T1 AT E540398T1
Authority
AT
Austria
Prior art keywords
vad
output
threshold
background noise
vad threshold
Prior art date
Application number
AT08734254T
Other languages
English (en)
Inventor
Zhe Wang
Original Assignee
Huawei Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Tech Co Ltd filed Critical Huawei Tech Co Ltd
Application granted granted Critical
Publication of ATE540398T1 publication Critical patent/ATE540398T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Noise Elimination (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Geophysics And Detection Of Objects (AREA)
AT08734254T 2007-06-07 2008-05-07 Sprachaktivitätsdetektionseinrichtung und verfahren ATE540398T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2007101084080A CN101320559B (zh) 2007-06-07 2007-06-07 一种声音激活检测装置及方法
PCT/CN2008/070899 WO2008148323A1 (en) 2007-06-07 2008-05-07 A voice activity detecting device and method

Publications (1)

Publication Number Publication Date
ATE540398T1 true ATE540398T1 (de) 2012-01-15

Family

ID=40093178

Family Applications (1)

Application Number Title Priority Date Filing Date
AT08734254T ATE540398T1 (de) 2007-06-07 2008-05-07 Sprachaktivitätsdetektionseinrichtung und verfahren

Country Status (7)

Country Link
US (1) US8275609B2 (de)
EP (1) EP2159788B1 (de)
JP (1) JP5089772B2 (de)
KR (1) KR101158291B1 (de)
CN (1) CN101320559B (de)
AT (1) ATE540398T1 (de)
WO (1) WO2008148323A1 (de)

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011033680A (ja) * 2009-07-30 2011-02-17 Sony Corp 音声処理装置及び方法、並びにプログラム
US8571231B2 (en) * 2009-10-01 2013-10-29 Qualcomm Incorporated Suppressing noise in an audio signal
CN102044241B (zh) * 2009-10-15 2012-04-04 华为技术有限公司 一种实现通信系统中背景噪声的跟踪的方法和装置
CN102044246B (zh) * 2009-10-15 2012-05-23 华为技术有限公司 一种音频信号检测方法和装置
CN102044243B (zh) * 2009-10-15 2012-08-29 华为技术有限公司 语音激活检测方法与装置、编码器
WO2011049515A1 (en) * 2009-10-19 2011-04-28 Telefonaktiebolaget Lm Ericsson (Publ) Method and voice activity detector for a speech encoder
ES2489472T3 (es) * 2010-12-24 2014-09-02 Huawei Technologies Co., Ltd. Método y aparato para una detección adaptativa de la actividad vocal en una señal de audio de entrada
WO2012083554A1 (en) * 2010-12-24 2012-06-28 Huawei Technologies Co., Ltd. A method and an apparatus for performing a voice activity detection
CN102741918B (zh) * 2010-12-24 2014-11-19 华为技术有限公司 用于话音活动检测的方法和设备
US8650029B2 (en) * 2011-02-25 2014-02-11 Microsoft Corporation Leveraging speech recognizer feedback for voice activity detection
CN102148030A (zh) * 2011-03-23 2011-08-10 同济大学 一种语音识别的端点检测方法
KR20140123071A (ko) 2012-01-18 2014-10-21 루카 로사토 안정성 정보 및 트랜션트/확률적 정보의 구별되는 인코딩 및 디코딩
JP5936378B2 (ja) * 2012-02-06 2016-06-22 三菱電機株式会社 音声区間検出装置
CN103325386B (zh) 2012-03-23 2016-12-21 杜比实验室特许公司 用于信号传输控制的方法和系统
US20140278389A1 (en) * 2013-03-12 2014-09-18 Motorola Mobility Llc Method and Apparatus for Adjusting Trigger Parameters for Voice Recognition Processing Based on Noise Characteristics
CN103839544B (zh) * 2012-11-27 2016-09-07 展讯通信(上海)有限公司 语音激活检测方法和装置
CN103903634B (zh) * 2012-12-25 2018-09-04 中兴通讯股份有限公司 激活音检测及用于激活音检测的方法和装置
CN103077723B (zh) * 2013-01-04 2015-07-08 鸿富锦精密工业(深圳)有限公司 音频传输系统
CN103971680B (zh) * 2013-01-24 2018-06-05 华为终端(东莞)有限公司 一种语音识别的方法、装置
CN103065631B (zh) 2013-01-24 2015-07-29 华为终端有限公司 一种语音识别的方法、装置
US9697831B2 (en) 2013-06-26 2017-07-04 Cirrus Logic, Inc. Speech recognition
CN106409313B (zh) 2013-08-06 2021-04-20 华为技术有限公司 一种音频信号分类方法和装置
KR102172149B1 (ko) * 2013-12-03 2020-11-02 주식회사 케이티 컨텐츠 재생 방법, 대사 구간 데이터 제공 방법 및 동영상 컨텐츠 재생 단말
US8990079B1 (en) * 2013-12-15 2015-03-24 Zanavox Automatic calibration of command-detection thresholds
US9524735B2 (en) 2014-01-31 2016-12-20 Apple Inc. Threshold adaptation in two-channel noise estimation and voice activity detection
CN104916292B (zh) 2014-03-12 2017-05-24 华为技术有限公司 检测音频信号的方法和装置
US10770075B2 (en) 2014-04-21 2020-09-08 Qualcomm Incorporated Method and apparatus for activating application by speech input
US9467779B2 (en) 2014-05-13 2016-10-11 Apple Inc. Microphone partial occlusion detector
CN104269178A (zh) * 2014-08-08 2015-01-07 华迪计算机集团有限公司 对语音信号进行自适应谱减和小波包消噪处理的方法和装置
CN110895930B (zh) * 2015-05-25 2022-01-28 展讯通信(上海)有限公司 语音识别方法及装置
CN106328169B (zh) 2015-06-26 2018-12-11 中兴通讯股份有限公司 一种激活音修正帧数的获取方法、激活音检测方法和装置
US10121471B2 (en) * 2015-06-29 2018-11-06 Amazon Technologies, Inc. Language model speech endpointing
CN104997014A (zh) * 2015-08-15 2015-10-28 黄佩霞 一种可调理贫血的药膳配方及其制作方法
CN105261368B (zh) * 2015-08-31 2019-05-21 华为技术有限公司 一种语音唤醒方法及装置
US10482899B2 (en) 2016-08-01 2019-11-19 Apple Inc. Coordination of beamformers for noise estimation and noise suppression
US9978392B2 (en) * 2016-09-09 2018-05-22 Tata Consultancy Services Limited Noisy signal identification from non-stationary audio signals
US11150866B2 (en) * 2018-11-13 2021-10-19 Synervoz Communications Inc. Systems and methods for contextual audio detection and communication mode transactions
CN110738986B (zh) * 2019-10-24 2022-08-05 数据堂(北京)智能科技有限公司 一种长语音标注装置及方法
CN111540342B (zh) * 2020-04-16 2022-07-19 浙江大华技术股份有限公司 一种能量阈值调整方法、装置、设备及介质
CN111739542B (zh) * 2020-05-13 2023-05-09 深圳市微纳感知计算技术有限公司 一种特征声音检测的方法、装置及设备
TWI756817B (zh) * 2020-09-08 2022-03-01 瑞昱半導體股份有限公司 語音活動偵測裝置與方法
CN112185426B (zh) * 2020-09-30 2022-12-27 青岛信芯微电子科技股份有限公司 一种语音端点检测设备及方法
CN113571072B (zh) * 2021-09-26 2021-12-14 腾讯科技(深圳)有限公司 一种语音编码方法、装置、设备、存储介质及产品
CN120048268B (zh) * 2025-04-23 2026-02-03 森丽康科技(北京)有限公司 一种基于声纹识别的自适应vad参数调节方法及系统
CN120388565B (zh) * 2025-06-25 2025-09-05 长江龙新媒体有限公司 一种基于3d虚拟的语音交互方法及系统

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6216103B1 (en) * 1997-10-20 2001-04-10 Sony Corporation Method for implementing a speech recognition system to determine speech endpoints during conditions with background noise
US6480823B1 (en) * 1998-03-24 2002-11-12 Matsushita Electric Industrial Co., Ltd. Speech detection for noisy conditions
FI118359B (fi) * 1999-01-18 2007-10-15 Nokia Corp Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin
US6453291B1 (en) * 1999-02-04 2002-09-17 Motorola, Inc. Apparatus and method for voice activity detection in a communication system
US6324509B1 (en) 1999-02-08 2001-11-27 Qualcomm Incorporated Method and apparatus for accurate endpointing of speech in the presence of noise
US6618701B2 (en) * 1999-04-19 2003-09-09 Motorola, Inc. Method and system for noise suppression using external voice activity detection
US6910011B1 (en) * 1999-08-16 2005-06-21 Haman Becker Automotive Systems - Wavemakers, Inc. Noisy acoustic signal enhancement
US20030179888A1 (en) * 2002-03-05 2003-09-25 Burnett Gregory C. Voice activity detection (VAD) devices and methods for use with noise suppression systems
CN1123863C (zh) * 2000-11-10 2003-10-08 清华大学 基于语音识别的信息校核方法
KR100992656B1 (ko) 2001-05-30 2010-11-05 앨리프컴 음향 및 비음향 센서를 이용한 유성음 및 무성음 감지시스템 및 방법
US7031916B2 (en) * 2001-06-01 2006-04-18 Texas Instruments Incorporated Method for converging a G.729 Annex B compliant voice activity detection circuit
BR0315179A (pt) 2002-10-11 2005-08-23 Nokia Corp Método e dispositivo para codificar um sinal de fala amostrado compreendendo quadros de fala
EP1443498B1 (de) * 2003-01-24 2008-03-19 Sony Ericsson Mobile Communications AB Rauschreduzierung und audiovisuelle Sprachaktivitätsdetektion
US8073689B2 (en) * 2003-02-21 2011-12-06 Qnx Software Systems Co. Repetitive transient noise removal
SG119199A1 (en) * 2003-09-30 2006-02-28 Stmicroelectronics Asia Pacfic Voice activity detector
CA2454296A1 (en) * 2003-12-29 2005-06-29 Nokia Corporation Method and device for speech enhancement in the presence of background noise
CN100456356C (zh) * 2004-11-12 2009-01-28 中国科学院声学研究所 一种应用于语音识别系统的语音端点检测方法
ATE523874T1 (de) * 2005-03-24 2011-09-15 Mindspeed Tech Inc Adaptive stimmenmodus-erweiterung für einen stimmenaktivitäts-detektor

Also Published As

Publication number Publication date
US8275609B2 (en) 2012-09-25
CN101320559A (zh) 2008-12-10
KR101158291B1 (ko) 2012-06-20
JP2010529494A (ja) 2010-08-26
EP2159788B1 (de) 2012-01-04
CN101320559B (zh) 2011-05-18
EP2159788A1 (de) 2010-03-03
KR20100012035A (ko) 2010-02-04
JP5089772B2 (ja) 2012-12-05
US20100088094A1 (en) 2010-04-08
WO2008148323A1 (en) 2008-12-11
EP2159788A4 (de) 2010-09-01

Similar Documents

Publication Publication Date Title
ATE540398T1 (de) Sprachaktivitätsdetektionseinrichtung und verfahren
EP4379711A3 (de) Verfahren und vorrichtung zur adaptiven detektion einer stimmaktivität in einem audioeingangssignal
ATE497236T1 (de) Vorrichtung und verfahren zur audiosignalverarbeitung
DK2011234T3 (da) Audioforstærkningskontrol anvendende specifik-lydstyrke-baseret auditiv hændelsesdetektering
ATE539434T1 (de) Vorrichtung und verfahren für mehrkanalparameterumwandlung
ATE506890T1 (de) Vorrichtung und verfahren zur vorhersage eines kontrollverlustes über einen muskel
NO20075732L (no) Flersensorisk taleforbedring ved bruk av sannsynligheten for ren tale
TW200630765A (en) Fault detection through feedback
TW200951426A (en) Apparatus for detecting periodic defect and method therefor
RU2012108872A (ru) Устройство отображения и способ управления
NO20071802L (no) Mikrostrukturinspesjonsapparat og mikrostrukturinspeksjonsfremgangsmate
MX376265B (es) Automatizacion de la perforacion utilizando control optimo estocastico.
ATE491262T1 (de) Verfahren und system zum verringern der auswirkungen von geräuschproduzierenden artefakten
DE602006015787D1 (de) Verfahren und vorrichtung zum umkonfigurieren eines gemeinsamen kanals
ATE523874T1 (de) Adaptive stimmenmodus-erweiterung für einen stimmenaktivitäts-detektor
DE502005002824D1 (de) Vorrichtung und verfahren zum ermitteln einer quantisierer-schrittweite
BRPI0911440A2 (pt) método e dispositivo para reconhecer um estado de uma máquina geradora de ruído a ser investigada
EP2329399A4 (de) Verfahren zur analyse eines tonsignals
DE602006012370D1 (de) Einrichtung und verfahren zum verarbeiten eines audio-datenstroms
MX355828B (es) Método y aparato para la detección de señales de audio.
FI20071018L (fi) Järjestelmät ja menetelmät äänisignaalin analysoimiseksi ja modifioimiseksi
EP2664062A4 (de) Verfahren und vorrichtung für verbesserte sprachqualität
ATE458311T1 (de) Messverstärkungsvorrichtung und -verfahren
ATE528749T1 (de) Verfahren zur verarbeitung eines akustischen eingangssignals zweck sendung eines ausgangssignals mit reduzierter lautstärke
GB201108577D0 (en) Intelligent rehabilitation (i-rehab)