CN112005300A - Speech signal processing method and mobile device - Google Patents
- Publication number
- CN112005300A (application number CN201880092454.2A)
- Authority
- CN
- China
- Prior art keywords
- frequency
- low
- voice
- frames
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
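A speech signal processing method and a mobile device. The method includes: decoding a received encoded speech signal to obtain m groups of low-frequency speech parameters, where the m groups are the low-frequency speech parameters of m speech frames of the speech signal; determining the types of the m speech frames based on the m groups of low-frequency speech parameters, and reconstructing the low-frequency speech signals corresponding to the m speech frames; obtaining n high-frequency speech signals corresponding to n unvoiced frames from the low-frequency speech parameters of the n unvoiced frames and a Gaussian mixture model algorithm, and obtaining k high-frequency speech signals corresponding to k voiced frames from the low-frequency speech parameters of the k voiced frames and a neural network algorithm, where n + k = m; and synthesizing the low-frequency speech signal and the high-frequency speech signal of each speech frame to obtain a wideband speech signal. The method reduces the probability of introducing noise, preserves the emotional character of the original speech, and reproduces the original speech accurately.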
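The per-frame dispatch described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and does not come from the patent text: the names `is_voiced`, `extend_bandwidth`, `reconstruct_low`, `gmm_high`, and `nn_high`, the use of the first parameter as a voicing measure, the 0.5 threshold, and the sample-wise sum standing in for the synthesis step. The callables stand in for the decoder's low-band reconstruction, the Gaussian mixture model estimator, and the neural network estimator.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Params = Sequence[float]   # one group of decoded low-frequency parameters
Signal = List[float]       # one frame of samples


def is_voiced(params: Params, threshold: float = 0.5) -> bool:
    # Hypothetical frame-type rule: treat params[0] as a voicing measure.
    # The actual classification criterion is not given in the abstract.
    return params[0] >= threshold


def extend_bandwidth(
    param_groups: Sequence[Params],
    reconstruct_low: Callable[[Params], Signal],
    gmm_high: Callable[[Params], Signal],
    nn_high: Callable[[Params], Signal],
) -> Tuple[List[Signal], Dict[str, int]]:
    """Return one wideband frame per parameter group, plus frame-type counts."""
    wideband: List[Signal] = []
    counts = {"unvoiced": 0, "voiced": 0}
    for params in param_groups:
        low = reconstruct_low(params)          # low-band signal for this frame
        if is_voiced(params):
            high = nn_high(params)             # voiced frame: neural network path
            counts["voiced"] += 1
        else:
            high = gmm_high(params)            # unvoiced frame: GMM path
            counts["unvoiced"] += 1
        # Placeholder synthesis: sample-wise sum of the two bands.
        wideband.append([l + h for l, h in zip(low, high)])
    return wideband, counts


if __name__ == "__main__":
    # Stub estimators, purely for demonstration.
    groups = [[0.9, 1.0], [0.1, 2.0], [0.7, 3.0]]
    frames, counts = extend_bandwidth(
        groups,
        reconstruct_low=lambda p: [p[1], p[1]],
        gmm_high=lambda p: [0.5, 0.5],
        nn_high=lambda p: [0.25, 0.25],
    )
    print(counts, frames)
```

Because every frame takes exactly one of the two paths, the unvoiced and voiced counts always sum to the number of frames, which mirrors the n + k = m condition in the abstract.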
Description
PCT national-phase application; the description has been published.
Claims (12)
- PCT national-phase application; the claims have been published.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2018/086596 WO2019213965A1 (zh) | 2018-05-11 | 2018-05-11 | Speech signal processing method and mobile device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112005300A (zh) | 2020-11-27 |
CN112005300B (zh) | 2024-04-09 |
Family
ID=68466641
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880092454.2A Active CN112005300B (zh) | Speech signal processing method and mobile device | 2018-05-11 | 2018-05-11 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112005300B (zh) |
WO (1) | WO2019213965A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112992167A (zh) * | 2021-02-08 | 2021-06-18 | Goertek Technology Co., Ltd. | Audio signal processing method and apparatus, and electronic device |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
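CN111415674A (zh) * | 2020-05-07 | 2020-07-14 | Beijing SoundAI Technology Co., Ltd. | Speech noise reduction method and electronic device |
CN111710327B (zh) * | 2020-06-12 | 2023-06-20 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method, apparatus, device, and medium for model training and sound data processing |
CN114880734B (zh) * | 2020-12-21 | 2024-10-15 | Changsha University of Science and Technology | BP-LSTM-based method for predicting the temperature field and temperature effects of a steel-concrete composite bridge deck system |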
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101996640A (zh) * | 2009-08-31 | 2011-03-30 | Huawei Technologies Co., Ltd. | Band extension method and apparatus |
CN103026408A (zh) * | 2010-07-19 | 2013-04-03 | Huawei Technologies Co., Ltd. | Audio signal generating apparatus |
US20130151255A1 (en) * | 2011-12-07 | 2013-06-13 | Gwangju Institute Of Science And Technology | Method and device for extending bandwidth of speech signal |
CN104517610A (zh) * | 2013-09-26 | 2015-04-15 | Huawei Technologies Co., Ltd. | Method and apparatus for band extension |
CN104637489A (zh) * | 2015-01-21 | 2015-05-20 | Huawei Technologies Co., Ltd. | Method and apparatus for sound signal processing |
US20170194013A1 (en) * | 2016-01-06 | 2017-07-06 | JVC Kenwood Corporation | Band expander, reception device, band expanding method for expanding signal band |
2018
- 2018-05-11 CN CN201880092454.2A patent/CN112005300B/zh active Active
- 2018-05-11 WO PCT/CN2018/086596 patent/WO2019213965A1/zh active Application Filing
Also Published As
Publication number | Publication date |
---|---|
CN112005300B (zh) | 2024-04-09 |
WO2019213965A1 (zh) | 2019-11-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
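CN112005300B (zh) | Speech signal processing method and mobile device | |
CN110136731B (zh) | End-to-end blind enhancement method for bone-conducted speech using a dilated causal convolution generative adversarial network | |
US20220172708A1 (en) | Speech separation model training method and apparatus, storage medium and computer device | |
CN107358966B (zh) | No-reference objective speech quality assessment method based on deep-learning speech enhancement | |
CN107680611B (zh) | Single-channel sound separation method based on a convolutional neural network | |
US20130024191A1 (en) | Audio communication device, method for outputting an audio signal, and communication system | |
CN1750124B (zh) | Bandwidth extension of band-limited audio signals | |
CN110085245B (zh) | Speech intelligibility enhancement method based on acoustic feature conversion | |
EP1995723B1 (en) | Neuroevolution training system | |
CN106782497B (zh) | Intelligent speech noise reduction algorithm based on a portable smart terminal | |
JP2022547525A (ja) | System and method for generating an audio signal | |
Morgan et al. | Real-time adaptive linear prediction using the least mean square gradient algorithm | |
CN114338623B (zh) | Audio processing method, apparatus, device, and medium | |
CN109328380A (zh) | Recursive noise power estimation with noise model adaptation | |
CN114863942B (zh) | Model training method for sound quality conversion, and method and apparatus for improving speech sound quality | |
US6701291B2 (en) | Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis | |
WO2022213825A1 (zh) | Neural-network-based end-to-end speech enhancement method and apparatus | |
CN117476031A (zh) | Method and system for enhancing headset call speech in noisy environments | |
Iser et al. | Bandwidth extension of telephony speech | |
CN103971697B (zh) | Speech enhancement method based on non-local means filtering | |
Shin et al. | Audio coding based on spectral recovery by convolutional neural network | |
CN113571079A (zh) | Speech enhancement method, apparatus, device, and storage medium | |
CN114708876B (zh) | Audio processing method, apparatus, electronic device, and storage medium | |
CN109215635B (zh) | Method for reconstructing wideband speech spectral tilt feature parameters for speech intelligibility enhancement | |
CN114582361B (zh) | High-resolution audio encoding and decoding method and system based on a generative adversarial network |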
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||