CN112820315B - 音频信号处理方法、装置、计算机设备及存储介质 - Google Patents

音频信号处理方法、装置、计算机设备及存储介质 Download PDF

Info

Publication number
CN112820315B
CN112820315B CN202010670626.9A CN202010670626A CN112820315B CN 112820315 B CN112820315 B CN 112820315B CN 202010670626 A CN202010670626 A CN 202010670626A CN 112820315 B CN112820315 B CN 112820315B
Authority
CN
China
Prior art keywords
audio signal
signal
domain signal
frequency domain
power spectrum
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010670626.9A
Other languages
English (en)
Chinese (zh)
Other versions
CN112820315A (zh
Inventor
梁俊斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN202010670626.9A priority Critical patent/CN112820315B/zh
Publication of CN112820315A publication Critical patent/CN112820315A/zh
Priority to PCT/CN2021/097663 priority patent/WO2022012195A1/fr
Application granted granted Critical
Publication of CN112820315B publication Critical patent/CN112820315B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)
CN202010670626.9A 2020-07-13 2020-07-13 音频信号处理方法、装置、计算机设备及存储介质 Active CN112820315B (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202010670626.9A CN112820315B (zh) 2020-07-13 2020-07-13 音频信号处理方法、装置、计算机设备及存储介质
PCT/CN2021/097663 WO2022012195A1 (fr) 2020-07-13 2021-06-01 Procédé de traitement de signal audio et appareil associé

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010670626.9A CN112820315B (zh) 2020-07-13 2020-07-13 音频信号处理方法、装置、计算机设备及存储介质

Publications (2)

Publication Number Publication Date
CN112820315A CN112820315A (zh) 2021-05-18
CN112820315B true CN112820315B (zh) 2023-01-06

Family

ID=75853211

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010670626.9A Active CN112820315B (zh) 2020-07-13 2020-07-13 音频信号处理方法、装置、计算机设备及存储介质

Country Status (2)

Country Link
CN (1) CN112820315B (fr)
WO (1) WO2022012195A1 (fr)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112820315B (zh) * 2020-07-13 2023-01-06 腾讯科技(深圳)有限公司 音频信号处理方法、装置、计算机设备及存储介质
CN113612808B (zh) * 2021-10-09 2022-01-25 腾讯科技(深圳)有限公司 音频处理方法、相关设备、存储介质及程序产品
CN114866856B (zh) * 2022-05-06 2024-01-02 北京达佳互联信息技术有限公司 音频信号的处理方法、音频生成模型的训练方法及装置
CN114822567B (zh) * 2022-06-22 2022-09-27 天津大学 一种基于能量算子的病理嗓音频谱重构方法
CN114974299B (zh) * 2022-08-01 2022-10-21 腾讯科技(深圳)有限公司 语音增强模型的训练、增强方法、装置、设备、介质
CN116248229B (zh) * 2022-12-08 2023-12-01 南京龙垣信息科技有限公司 一种面向实时语音通讯的丢包补偿方法
CN117395181B (zh) * 2023-12-12 2024-02-13 方图智能(深圳)科技集团股份有限公司 基于物联网的低延时多媒体音频传输检测方法及系统

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101512938A (zh) * 2006-08-01 2009-08-19 Dts(英属维尔京群岛)有限公司 用于补偿音频变换器的线性和非-线性失真的神经网络滤波技术
CN107112025A (zh) * 2014-09-12 2017-08-29 美商楼氏电子有限公司 用于恢复语音分量的系统和方法
CN109147805A (zh) * 2018-06-05 2019-01-04 安克创新科技股份有限公司 基于深度学习的音频音质增强

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107895571A (zh) * 2016-09-29 2018-04-10 亿览在线网络技术(北京)有限公司 无损音频文件识别方法及装置
EP3474280B1 (fr) * 2017-10-19 2021-07-07 Goodix Technology (HK) Company Limited Processeur de signal pour l'amélioration du signal de parole
US11341983B2 (en) * 2018-09-17 2022-05-24 Honeywell International Inc. System and method for audio noise reduction
CN112820315B (zh) * 2020-07-13 2023-01-06 腾讯科技(深圳)有限公司 音频信号处理方法、装置、计算机设备及存储介质

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101512938A (zh) * 2006-08-01 2009-08-19 Dts(英属维尔京群岛)有限公司 用于补偿音频变换器的线性和非-线性失真的神经网络滤波技术
CN107112025A (zh) * 2014-09-12 2017-08-29 美商楼氏电子有限公司 用于恢复语音分量的系统和方法
CN109147805A (zh) * 2018-06-05 2019-01-04 安克创新科技股份有限公司 基于深度学习的音频音质增强

Also Published As

Publication number Publication date
CN112820315A (zh) 2021-05-18
WO2022012195A1 (fr) 2022-01-20

Similar Documents

Publication Publication Date Title
CN112820315B (zh) 音频信号处理方法、装置、计算机设备及存储介质
JP6903611B2 (ja) 信号生成装置、信号生成システム、信号生成方法およびプログラム
CN110473567A (zh) 基于深度神经网络的音频处理方法、装置及存储介质
WO2021229197A1 (fr) Traitement d'audio variable dans le temps et non linéaire à l'aide de réseaux neuronaux profonds
CN113643714B (zh) 音频处理方法、装置、存储介质及计算机程序
Steinmetz et al. Efficient neural networks for real-time analog audio effect modeling
CN113921022B (zh) 音频信号分离方法、装置、存储介质和电子设备
CN113345460B (zh) 音频信号处理方法、装置、设备及存储介质
CN114974299B (zh) 语音增强模型的训练、增强方法、装置、设备、介质
JP2023548707A (ja) 音声強調方法、装置、機器及びコンピュータプログラム
Braun et al. Effect of noise suppression losses on speech distortion and ASR performance
US6701291B2 (en) Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis
CN114792524B (zh) 音频数据处理方法、装置、程序产品、计算机设备和介质
CN111883154A (zh) 回声消除方法及装置、计算机可读的存储介质、电子装置
CN114530160A (zh) 模型训练方法、回声消除方法、系统、设备及存储介质
CN114333893A (zh) 一种语音处理方法、装置、电子设备和可读介质
Saeki et al. SelfRemaster: Self-supervised speech restoration with analysis-by-synthesis approach using channel modeling
CN111009259B (zh) 一种音频处理方法和装置
Schröter et al. CLC: complex linear coding for the DNS 2020 challenge
Chen et al. CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
CN116013343A (zh) 语音增强方法、电子设备和存储介质
EP4283618A1 (fr) Procédé et appareil d'amélioration de parole, dispositif et support de stockage
CN112201227B (zh) 语音样本生成方法及装置、存储介质、电子装置
Rund et al. An evaluation of click detection algorithms against the results of listening tests
CN114302301A (zh) 频响校正方法及相关产品

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40043824

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant