CN112005300A - 语音信号的处理方法和移动设备 - Google Patents

语音信号的处理方法和移动设备 Download PDF

Info

Publication number
CN112005300A
CN112005300A CN201880092454.2A CN201880092454A CN112005300A CN 112005300 A CN112005300 A CN 112005300A CN 201880092454 A CN201880092454 A CN 201880092454A CN 112005300 A CN112005300 A CN 112005300A
Authority
CN
China
Prior art keywords
frequency
low
voice
frames
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201880092454.2A
Other languages
English (en)
Other versions
CN112005300B (zh
Inventor
赵月娇
李向东
杨霖
尹朝阳
于雪松
张晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN112005300A publication Critical patent/CN112005300A/zh
Application granted granted Critical
Publication of CN112005300B publication Critical patent/CN112005300B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

一种语音信号的处理方法和移动设备,方法包括:对接收到的编码后的语音信号解码后得到m组低频语音参数;m组低频语音参数为语音信号的m个语音帧的低频语音参数;基于m组低频语音参数确定m个语音帧的类型,并重构m个语音帧对应的低频语音信号;根据n个清音帧的低频语音参数和混合高斯模型算法,得到n个清音帧对应的n个高频语音信号,并根据k个浊音帧的低频语音参数和神经网络算法,得到k个浊音帧对应的k个高频语音信号,n和k的和等于m;对每个语音帧的低频语音信号和高频语音信号进行合成,得到宽带语音信号。降低了噪声引入的概率,保留了原始语音的情感度,可精确的再现原始语音。

Description

PCT国内申请,说明书已公开。

Claims (12)

  1. PCT国内申请,权利要求书已公开。
CN201880092454.2A 2018-05-11 2018-05-11 语音信号的处理方法和移动设备 Active CN112005300B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/086596 WO2019213965A1 (zh) 2018-05-11 2018-05-11 语音信号的处理方法和移动设备

Publications (2)

Publication Number Publication Date
CN112005300A true CN112005300A (zh) 2020-11-27
CN112005300B CN112005300B (zh) 2024-04-09

Family

ID=68466641

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880092454.2A Active CN112005300B (zh) 2018-05-11 2018-05-11 语音信号的处理方法和移动设备

Country Status (2)

Country Link
CN (1) CN112005300B (zh)
WO (1) WO2019213965A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112992167A (zh) * 2021-02-08 2021-06-18 歌尔科技有限公司 音频信号的处理方法、装置及电子设备

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111415674A (zh) * 2020-05-07 2020-07-14 北京声智科技有限公司 语音降噪方法及电子设备
CN111710327B (zh) * 2020-06-12 2023-06-20 百度在线网络技术(北京)有限公司 用于模型训练和声音数据处理的方法、装置、设备和介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101996640A (zh) * 2009-08-31 2011-03-30 华为技术有限公司 频带扩展方法及装置
CN103026408A (zh) * 2010-07-19 2013-04-03 华为技术有限公司 音频信号产生装置
US20130151255A1 (en) * 2011-12-07 2013-06-13 Gwangju Institute Of Science And Technology Method and device for extending bandwidth of speech signal
CN104517610A (zh) * 2013-09-26 2015-04-15 华为技术有限公司 频带扩展的方法及装置
CN104637489A (zh) * 2015-01-21 2015-05-20 华为技术有限公司 声音信号处理的方法和装置
US20170194013A1 (en) * 2016-01-06 2017-07-06 JVC Kenwood Corporation Band expander, reception device, band expanding method for expanding signal band

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101996640A (zh) * 2009-08-31 2011-03-30 华为技术有限公司 频带扩展方法及装置
CN103026408A (zh) * 2010-07-19 2013-04-03 华为技术有限公司 音频信号产生装置
US20130151255A1 (en) * 2011-12-07 2013-06-13 Gwangju Institute Of Science And Technology Method and device for extending bandwidth of speech signal
CN104517610A (zh) * 2013-09-26 2015-04-15 华为技术有限公司 频带扩展的方法及装置
CN104637489A (zh) * 2015-01-21 2015-05-20 华为技术有限公司 声音信号处理的方法和装置
US20170194013A1 (en) * 2016-01-06 2017-07-06 JVC Kenwood Corporation Band expander, reception device, band expanding method for expanding signal band

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112992167A (zh) * 2021-02-08 2021-06-18 歌尔科技有限公司 音频信号的处理方法、装置及电子设备

Also Published As

Publication number Publication date
CN112005300B (zh) 2024-04-09
WO2019213965A1 (zh) 2019-11-14

Similar Documents

Publication Publication Date Title
CN110136731B (zh) 空洞因果卷积生成对抗网络端到端骨导语音盲增强方法
CN107358966B (zh) 基于深度学习语音增强的无参考语音质量客观评估方法
CN107680611B (zh) 基于卷积神经网络的单通道声音分离方法
US20220172708A1 (en) Speech separation model training method and apparatus, storage medium and computer device
CN108447495B (zh) 一种基于综合特征集的深度学习语音增强方法
US20130024191A1 (en) Audio communication device, method for outputting an audio signal, and communication system
CN1750124B (zh) 带限音频信号的带宽扩展
JP2022529641A (ja) 音声処理方法、装置、電子機器及びコンピュータプログラム
CN110085245B (zh) 一种基于声学特征转换的语音清晰度增强方法
EP1995723B1 (en) Neuroevolution training system
CN108597496A (zh) 一种基于生成式对抗网络的语音生成方法及装置
CN106782497B (zh) 一种基于便携式智能终端的智能语音降噪算法
CN112005300B (zh) 语音信号的处理方法和移动设备
JP2022547525A (ja) 音声信号を生成するためのシステム及び方法
CN111292762A (zh) 一种基于深度学习的单通道语音分离方法
Morgan et al. Real-time adaptive linear prediction using the least mean square gradient algorithm
WO2015154397A1 (zh) 一种噪声信号的处理和生成方法、编解码器和编解码系统
CN114338623B (zh) 音频的处理方法、装置、设备及介质
CN109328380A (zh) 具有噪声模型适配的递归噪声功率估计
US6701291B2 (en) Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis
WO2022213825A1 (zh) 基于神经网络的端到端语音增强方法、装置
Iser et al. Bandwidth extension of telephony speech
CN114708876B (zh) 音频处理方法、装置、电子设备及存储介质
CN114863942B (zh) 音质转换的模型训练方法、提升语音音质的方法及装置
Shin et al. Audio coding based on spectral recovery by convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant