WO2022161475A1 - Procédé et appareil de traitement audio et dispositif électronique - Google Patents

Procédé et appareil de traitement audio et dispositif électronique Download PDF

Info

Publication number
WO2022161475A1
WO2022161475A1 PCT/CN2022/074795 CN2022074795W WO2022161475A1 WO 2022161475 A1 WO2022161475 A1 WO 2022161475A1 CN 2022074795 W CN2022074795 W CN 2022074795W WO 2022161475 A1 WO2022161475 A1 WO 2022161475A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
frequency
audio
signals
processing
Prior art date
Application number
PCT/CN2022/074795
Other languages
English (en)
Chinese (zh)
Inventor
张勇
Original Assignee
维沃移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信有限公司 filed Critical 维沃移动通信有限公司
Publication of WO2022161475A1 publication Critical patent/WO2022161475A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

La présente demande divulgue un procédé et un appareil de traitement audio et un dispositif électronique. Le procédé consiste : à effectuer un traitement d'augmentation de résolution sur un premier signal audio pour obtenir un second signal audio ; à effectuer un traitement de filtrage passe-bas sur le second signal audio pour obtenir un second signal audio traité ; à effectuer un traitement de signal sur le second signal audio traité pour obtenir Y premiers signaux de sous-bande ayant la même largeur de bande ; selon les signaux de sous-bande basse fréquence parmi les Y premiers signaux de sous-bande, à générer M signaux de sous-bande haute fréquence ; sur la base des informations de caractéristique haute fréquence du premier signal audio, à effectuer un ajustement de spectre sur les M signaux de sous-bande haute fréquence pour obtenir M signaux de sous-bande haute fréquence cibles ; et à synthétiser les M signaux de sous-bande haute fréquence cibles pour obtenir un signal audio cible ; Y et M étant des nombres entiers positifs.
PCT/CN2022/074795 2021-01-28 2022-01-28 Procédé et appareil de traitement audio et dispositif électronique WO2022161475A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110121348.6 2021-01-28
CN202110121348.6A CN113299313B (zh) 2021-01-28 2021-01-28 音频处理方法、装置及电子设备

Publications (1)

Publication Number Publication Date
WO2022161475A1 true WO2022161475A1 (fr) 2022-08-04

Family

ID=77318871

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/074795 WO2022161475A1 (fr) 2021-01-28 2022-01-28 Procédé et appareil de traitement audio et dispositif électronique

Country Status (2)

Country Link
CN (1) CN113299313B (fr)
WO (1) WO2022161475A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113299313B (zh) * 2021-01-28 2024-03-26 维沃移动通信有限公司 音频处理方法、装置及电子设备
CN115547350A (zh) * 2022-09-23 2022-12-30 维沃移动通信有限公司 音频信号处理方法、装置、电子设备及可读存储介质

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105280189A (zh) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 带宽扩展编码和解码中高频生成的方法和装置
CN105513601A (zh) * 2016-01-27 2016-04-20 武汉大学 一种音频编码带宽扩展中频带复制的方法及装置
CN105745706A (zh) * 2013-11-29 2016-07-06 索尼公司 用于扩展频带的装置、方法和程序
EP3382704A1 (fr) * 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Appareil et procédé permettant de déterminer une caractéristique liée à un traitement d'amélioration spectrale d'un signal audio
CN110556121A (zh) * 2019-09-18 2019-12-10 腾讯科技(深圳)有限公司 频带扩展方法、装置、电子设备及计算机可读存储介质
US20200051579A1 (en) * 2010-12-29 2020-02-13 Samsung Electronics Co., Ltd. Apparatus and method for encoding/decoding for high frequency bandwidth extension
CN113299313A (zh) * 2021-01-28 2021-08-24 维沃移动通信有限公司 音频处理方法、装置及电子设备

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4313993B2 (ja) * 2002-07-19 2009-08-12 パナソニック株式会社 オーディオ復号化装置およびオーディオ復号化方法
US7069212B2 (en) * 2002-09-19 2006-06-27 Matsushita Elecric Industrial Co., Ltd. Audio decoding apparatus and method for band expansion with aliasing adjustment
CN101471072B (zh) * 2007-12-27 2012-01-25 华为技术有限公司 高频重建方法、编码装置和解码装置
US9666202B2 (en) * 2013-09-10 2017-05-30 Huawei Technologies Co., Ltd. Adaptive bandwidth extension and apparatus for the same
CN106057220B (zh) * 2016-05-19 2020-01-03 Tcl集团股份有限公司 一种音频信号的高频扩展方法和音频播放器
CN107221334B (zh) * 2016-11-01 2020-12-29 武汉大学深圳研究院 一种音频带宽扩展的方法及扩展装置

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200051579A1 (en) * 2010-12-29 2020-02-13 Samsung Electronics Co., Ltd. Apparatus and method for encoding/decoding for high frequency bandwidth extension
CN105745706A (zh) * 2013-11-29 2016-07-06 索尼公司 用于扩展频带的装置、方法和程序
CN105280189A (zh) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 带宽扩展编码和解码中高频生成的方法和装置
CN105513601A (zh) * 2016-01-27 2016-04-20 武汉大学 一种音频编码带宽扩展中频带复制的方法及装置
EP3382704A1 (fr) * 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Appareil et procédé permettant de déterminer une caractéristique liée à un traitement d'amélioration spectrale d'un signal audio
CN110556121A (zh) * 2019-09-18 2019-12-10 腾讯科技(深圳)有限公司 频带扩展方法、装置、电子设备及计算机可读存储介质
CN113299313A (zh) * 2021-01-28 2021-08-24 维沃移动通信有限公司 音频处理方法、装置及电子设备

Also Published As

Publication number Publication date
CN113299313B (zh) 2024-03-26
CN113299313A (zh) 2021-08-24

Similar Documents

Publication Publication Date Title
WO2022161475A1 (fr) Procédé et appareil de traitement audio et dispositif électronique
EP3291231B1 (fr) Suréchantillonnage dans un banc de filtres de transposition combiné
TWI556227B (zh) 從訊號的低頻成份產生該訊號之高頻成份的系統與方法,及其機上盒、電腦程式產品、軟體程式及儲存媒體
US8971551B2 (en) Virtual bass synthesis using harmonic transposition
CN104318930B (zh) 子带处理单元以及生成合成子带信号的方法
CN106658284A (zh) 频域中的虚拟低音的相加
WO2021052287A1 (fr) Procédé d'extension de bande de fréquences, appareil, dispositif électronique et support de stockage lisible par ordinateur
CN107705801A (zh) 语音带宽扩展模型的训练方法及语音带宽扩展方法
CN112259116B (zh) 一种音频数据的降噪方法、装置、电子设备及存储介质
Wang et al. Denoising speech based on deep learning and wavelet decomposition
EP2720477B1 (fr) Synthèse virtuelle de graves à l'aide de transposition harmonique
CN106653049A (zh) 时域中的虚拟低音的相加
Nakamura et al. Time-domain audio source separation based on Wave-U-Net combined with discrete wavelet transform
CN116705056A (zh) 音频生成方法、声码器、电子设备及存储介质
JP7421827B2 (ja) 音声変換装置、音声変換方法及び音声変換プログラム
US11404055B2 (en) Simultaneous dereverberation and denoising via low latency deep learning
Goodwin et al. Frequency-domain algorithms for audio signal enhancement based on transient modification
Lan et al. Research on improved DNN and MultiResU_Net network speech enhancement effect
AU2019201296B2 (en) Efficient combined harmonic transposition
Sueur et al. Introduction to Frequency Analysis: The Fourier Transformation
Vanambathina et al. Real time speech enhancement using densely connected neural networks and Squeezed temporal convolutional modules
Srinivasarao Speech signal analysis and enhancement using combined wavelet Fourier transform with stacked deep learning architecture
WO2024102983A1 (fr) Reconstruction de signal audio pleine bande activée par sortie en provenance d'un modèle d'apprentissage automatique
Wang et al. Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments
CN117079623A (zh) 音频降噪模型训练方法、歌唱作品处理方法、设备和介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22745347

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22745347

Country of ref document: EP

Kind code of ref document: A1