CN115699173A - 语音活动检测方法和装置 - Google Patents

语音活动检测方法和装置 Download PDF

Info

Publication number
CN115699173A
CN115699173A CN202080101920.6A CN202080101920A CN115699173A CN 115699173 A CN115699173 A CN 115699173A CN 202080101920 A CN202080101920 A CN 202080101920A CN 115699173 A CN115699173 A CN 115699173A
Authority
CN
China
Prior art keywords
audio data
frames
path
vad
paths
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080101920.6A
Other languages
English (en)
Inventor
柯波
任博
鄢展鹏
王纪会
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN115699173A publication Critical patent/CN115699173A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

一种语音活动检测方法和装置,涉及语音检测领域,用于提高VAD的准确率。语音活动检测方法包括:按帧获取N路音频数据,其中,N为大于或等于2的整数(S501);针对每一帧,计算每路音频数据在高频子带的自相关系数(S502);针对每一帧,根据N路音频数据的自相关系数,选择对N路音频数据中的至少一路音频数据进行VAD(S503)。

Description

PCT国内申请,说明书已公开。

Claims (20)

  1. PCT国内申请,权利要求书已公开。
CN202080101920.6A 2020-06-16 2020-06-16 语音活动检测方法和装置 Pending CN115699173A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/096392 WO2021253235A1 (zh) 2020-06-16 2020-06-16 语音活动检测方法和装置

Publications (1)

Publication Number Publication Date
CN115699173A true CN115699173A (zh) 2023-02-03

Family

ID=79269055

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080101920.6A Pending CN115699173A (zh) 2020-06-16 2020-06-16 语音活动检测方法和装置

Country Status (2)

Country Link
CN (1) CN115699173A (zh)
WO (1) WO2021253235A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115862685A (zh) * 2023-02-27 2023-03-28 全时云商务服务股份有限公司 一种实时语音活动的检测方法、装置和电子设备

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5742734A (en) * 1994-08-10 1998-04-21 Qualcomm Incorporated Encoding rate selection in a variable rate vocoder
CN101010722A (zh) * 2004-08-30 2007-08-01 诺基亚公司 音频信号中话音活动的检测
US20080147389A1 (en) * 2006-12-15 2008-06-19 Motorola, Inc. Method and Apparatus for Robust Speech Activity Detection
US20110264447A1 (en) * 2010-04-22 2011-10-27 Qualcomm Incorporated Systems, methods, and apparatus for speech feature detection
CN108039182A (zh) * 2017-12-22 2018-05-15 西安烽火电子科技有限责任公司 一种语音激活检测方法
CN108597498A (zh) * 2018-04-10 2018-09-28 广州势必可赢网络科技有限公司 一种多麦克风语音采集方法及装置
CN109360585A (zh) * 2018-12-19 2019-02-19 晶晨半导体(上海)股份有限公司 一种语音激活检测方法
CN109686378A (zh) * 2017-10-13 2019-04-26 华为技术有限公司 语音处理方法和终端
CN111128244A (zh) * 2019-12-31 2020-05-08 西安烽火电子科技有限责任公司 基于过零率检测的短波通信语音激活检测方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8954324B2 (en) * 2007-09-28 2015-02-10 Qualcomm Incorporated Multiple microphone voice activity detector
EP2297727B1 (en) * 2008-06-30 2016-05-11 Dolby Laboratories Licensing Corporation Multi-microphone voice activity detector
CN103456305B (zh) * 2013-09-16 2016-03-09 东莞宇龙通信科技有限公司 终端和基于多个声音采集单元的语音处理方法
KR101711302B1 (ko) * 2015-10-26 2017-03-02 한양대학교 산학협력단 변별적 가중치 학습기법을 이용한 2 채널 마이크 기반의 음성 검출 장치 및 그 방법
CN108986833A (zh) * 2018-08-21 2018-12-11 广州市保伦电子有限公司 基于麦克风阵列的拾音方法、系统、电子设备及存储介质

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5742734A (en) * 1994-08-10 1998-04-21 Qualcomm Incorporated Encoding rate selection in a variable rate vocoder
CN101010722A (zh) * 2004-08-30 2007-08-01 诺基亚公司 音频信号中话音活动的检测
US20080147389A1 (en) * 2006-12-15 2008-06-19 Motorola, Inc. Method and Apparatus for Robust Speech Activity Detection
US20110264447A1 (en) * 2010-04-22 2011-10-27 Qualcomm Incorporated Systems, methods, and apparatus for speech feature detection
CN109686378A (zh) * 2017-10-13 2019-04-26 华为技术有限公司 语音处理方法和终端
CN108039182A (zh) * 2017-12-22 2018-05-15 西安烽火电子科技有限责任公司 一种语音激活检测方法
CN108597498A (zh) * 2018-04-10 2018-09-28 广州势必可赢网络科技有限公司 一种多麦克风语音采集方法及装置
CN109360585A (zh) * 2018-12-19 2019-02-19 晶晨半导体(上海)股份有限公司 一种语音激活检测方法
CN111128244A (zh) * 2019-12-31 2020-05-08 西安烽火电子科技有限责任公司 基于过零率检测的短波通信语音激活检测方法

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JIAYUE TANG ET AL.: "An Evaluation of Keyword Detection Using ACF of Pitch for Robust Speech Recognition", 2018 18TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 30 September 2018 (2018-09-30) *
郭丽惠;何昕;张亚昕;吕岳;: "基于顺序统计滤波的实时语音端点检测算法", 自动化学报, no. 04, 15 April 2008 (2008-04-15) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115862685A (zh) * 2023-02-27 2023-03-28 全时云商务服务股份有限公司 一种实时语音活动的检测方法、装置和电子设备
CN115862685B (zh) * 2023-02-27 2023-09-15 全时云商务服务股份有限公司 一种实时语音活动的检测方法、装置和电子设备

Also Published As

Publication number Publication date
WO2021253235A1 (zh) 2021-12-23

Similar Documents

Publication Publication Date Title
CN110176226B (zh) 一种语音识别、及语音识别模型训练方法及装置
US11138992B2 (en) Voice activity detection based on entropy-energy feature
US9418651B2 (en) Method and apparatus for mitigating false accepts of trigger phrases
CN107742523B (zh) 语音信号处理方法、装置以及移动终端
US20160196838A1 (en) Utilizing Digital Microphones for Low Power Keyword Detection and Noise Suppression
US9799215B2 (en) Low power acoustic apparatus and method of operation
US9251804B2 (en) Speech recognition
US10629226B1 (en) Acoustic signal processing with voice activity detector having processor in an idle state
CN107393548B (zh) 多个语音助手设备采集的语音信息的处理方法及装置
CN111554321B (zh) 降噪模型训练方法、装置、电子设备及存储介质
CN108712566B (zh) 一种语音助手唤醒方法及移动终端
CN109672775B (zh) 调节唤醒灵敏度的方法、装置及终端
CN107993672B (zh) 频带扩展方法及装置
CN111477243B (zh) 音频信号处理方法及电子设备
WO2018118744A1 (en) Methods and systems for reducing false alarms in keyword detection
US9508345B1 (en) Continuous voice sensing
CN108492837B (zh) 音频突发白噪声的检测方法、装置及存储介质
CN110136733B (zh) 一种音频信号的解混响方法和装置
CN115699173A (zh) 语音活动检测方法和装置
EP3493200B1 (en) Voice-controllable device and method of voice control
US20180277134A1 (en) Key Click Suppression
CN106782614B (zh) 音质检测方法及装置
CN111863006A (zh) 一种音频信号处理方法、音频信号处理装置和耳机
WO2022057476A1 (zh) 一种校准信号的生成方法、电子设备和计算机存储介质
CN117153186A (zh) 声音信号处理方法、装置、电子设备和存储介质

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination