WO2022161475A1 - Procédé et appareil de traitement audio et dispositif électronique - Google Patents
Procédé et appareil de traitement audio et dispositif électronique Download PDFInfo
- Publication number
- WO2022161475A1 WO2022161475A1 PCT/CN2022/074795 CN2022074795W WO2022161475A1 WO 2022161475 A1 WO2022161475 A1 WO 2022161475A1 CN 2022074795 W CN2022074795 W CN 2022074795W WO 2022161475 A1 WO2022161475 A1 WO 2022161475A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio signal
- frequency
- audio
- signals
- processing
- Prior art date
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 25
- 230000005236 sound signal Effects 0.000 claims abstract description 337
- 238000012545 processing Methods 0.000 claims abstract description 146
- 238000000034 method Methods 0.000 claims abstract description 55
- 238000001228 spectrum Methods 0.000 claims abstract description 49
- 238000001914 filtration Methods 0.000 claims abstract description 31
- 230000002194 synthesizing effect Effects 0.000 claims abstract description 6
- 238000005070 sampling Methods 0.000 claims description 54
- 230000003595 spectral effect Effects 0.000 claims description 24
- 230000015572 biosynthetic process Effects 0.000 claims description 16
- 238000003786 synthesis reaction Methods 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 13
- 238000004891 communication Methods 0.000 claims description 7
- 238000003062 neural network model Methods 0.000 claims description 5
- 238000009432 framing Methods 0.000 claims description 3
- 230000010076 replication Effects 0.000 claims description 3
- 230000000875 corresponding effect Effects 0.000 description 20
- 238000010586 diagram Methods 0.000 description 15
- 230000008569 process Effects 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 238000013528 artificial neural network Methods 0.000 description 12
- 230000000694 effects Effects 0.000 description 10
- 230000006870 function Effects 0.000 description 10
- 230000009466 transformation Effects 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 6
- 239000013598 vector Substances 0.000 description 4
- 238000012952 Resampling Methods 0.000 description 3
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 238000005311 autocorrelation function Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- IPNDWAYKBVVIBI-UHFFFAOYSA-N 2-hydroxy-3,5-bis(morpholin-4-ium-4-ylmethyl)-7-propan-2-ylcyclohepta-2,4,6-trien-1-one;dichloride Chemical compound [Cl-].[Cl-].C=1C(C[NH+]2CCOCC2)=C(O)C(=O)C(C(C)C)=CC=1C[NH+]1CCOCC1 IPNDWAYKBVVIBI-UHFFFAOYSA-N 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- MOXZPMYMMBOUJY-UHFFFAOYSA-N n-[2-(2-aminoethylsulfanyl)ethyl]-5-(dimethylamino)naphthalene-1-sulfonamide Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(=O)(=O)NCCSCCN MOXZPMYMMBOUJY-UHFFFAOYSA-N 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
La présente demande divulgue un procédé et un appareil de traitement audio et un dispositif électronique. Le procédé consiste : à effectuer un traitement d'augmentation de résolution sur un premier signal audio pour obtenir un second signal audio ; à effectuer un traitement de filtrage passe-bas sur le second signal audio pour obtenir un second signal audio traité ; à effectuer un traitement de signal sur le second signal audio traité pour obtenir Y premiers signaux de sous-bande ayant la même largeur de bande ; selon les signaux de sous-bande basse fréquence parmi les Y premiers signaux de sous-bande, à générer M signaux de sous-bande haute fréquence ; sur la base des informations de caractéristique haute fréquence du premier signal audio, à effectuer un ajustement de spectre sur les M signaux de sous-bande haute fréquence pour obtenir M signaux de sous-bande haute fréquence cibles ; et à synthétiser les M signaux de sous-bande haute fréquence cibles pour obtenir un signal audio cible ; Y et M étant des nombres entiers positifs.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110121348.6 | 2021-01-28 | ||
CN202110121348.6A CN113299313B (zh) | 2021-01-28 | 2021-01-28 | 音频处理方法、装置及电子设备 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022161475A1 true WO2022161475A1 (fr) | 2022-08-04 |
Family
ID=77318871
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2022/074795 WO2022161475A1 (fr) | 2021-01-28 | 2022-01-28 | Procédé et appareil de traitement audio et dispositif électronique |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN113299313B (fr) |
WO (1) | WO2022161475A1 (fr) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113299313B (zh) * | 2021-01-28 | 2024-03-26 | 维沃移动通信有限公司 | 音频处理方法、装置及电子设备 |
CN115547350A (zh) * | 2022-09-23 | 2022-12-30 | 维沃移动通信有限公司 | 音频信号处理方法、装置、电子设备及可读存储介质 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105280189A (zh) * | 2015-09-16 | 2016-01-27 | 深圳广晟信源技术有限公司 | 带宽扩展编码和解码中高频生成的方法和装置 |
CN105513601A (zh) * | 2016-01-27 | 2016-04-20 | 武汉大学 | 一种音频编码带宽扩展中频带复制的方法及装置 |
CN105745706A (zh) * | 2013-11-29 | 2016-07-06 | 索尼公司 | 用于扩展频带的装置、方法和程序 |
EP3382704A1 (fr) * | 2017-03-31 | 2018-10-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Appareil et procédé permettant de déterminer une caractéristique liée à un traitement d'amélioration spectrale d'un signal audio |
CN110556121A (zh) * | 2019-09-18 | 2019-12-10 | 腾讯科技(深圳)有限公司 | 频带扩展方法、装置、电子设备及计算机可读存储介质 |
US20200051579A1 (en) * | 2010-12-29 | 2020-02-13 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding/decoding for high frequency bandwidth extension |
CN113299313A (zh) * | 2021-01-28 | 2021-08-24 | 维沃移动通信有限公司 | 音频处理方法、装置及电子设备 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4313993B2 (ja) * | 2002-07-19 | 2009-08-12 | パナソニック株式会社 | オーディオ復号化装置およびオーディオ復号化方法 |
US7069212B2 (en) * | 2002-09-19 | 2006-06-27 | Matsushita Elecric Industrial Co., Ltd. | Audio decoding apparatus and method for band expansion with aliasing adjustment |
CN101471072B (zh) * | 2007-12-27 | 2012-01-25 | 华为技术有限公司 | 高频重建方法、编码装置和解码装置 |
US9666202B2 (en) * | 2013-09-10 | 2017-05-30 | Huawei Technologies Co., Ltd. | Adaptive bandwidth extension and apparatus for the same |
CN106057220B (zh) * | 2016-05-19 | 2020-01-03 | Tcl集团股份有限公司 | 一种音频信号的高频扩展方法和音频播放器 |
CN107221334B (zh) * | 2016-11-01 | 2020-12-29 | 武汉大学深圳研究院 | 一种音频带宽扩展的方法及扩展装置 |
-
2021
- 2021-01-28 CN CN202110121348.6A patent/CN113299313B/zh active Active
-
2022
- 2022-01-28 WO PCT/CN2022/074795 patent/WO2022161475A1/fr active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200051579A1 (en) * | 2010-12-29 | 2020-02-13 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding/decoding for high frequency bandwidth extension |
CN105745706A (zh) * | 2013-11-29 | 2016-07-06 | 索尼公司 | 用于扩展频带的装置、方法和程序 |
CN105280189A (zh) * | 2015-09-16 | 2016-01-27 | 深圳广晟信源技术有限公司 | 带宽扩展编码和解码中高频生成的方法和装置 |
CN105513601A (zh) * | 2016-01-27 | 2016-04-20 | 武汉大学 | 一种音频编码带宽扩展中频带复制的方法及装置 |
EP3382704A1 (fr) * | 2017-03-31 | 2018-10-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Appareil et procédé permettant de déterminer une caractéristique liée à un traitement d'amélioration spectrale d'un signal audio |
CN110556121A (zh) * | 2019-09-18 | 2019-12-10 | 腾讯科技(深圳)有限公司 | 频带扩展方法、装置、电子设备及计算机可读存储介质 |
CN113299313A (zh) * | 2021-01-28 | 2021-08-24 | 维沃移动通信有限公司 | 音频处理方法、装置及电子设备 |
Also Published As
Publication number | Publication date |
---|---|
CN113299313B (zh) | 2024-03-26 |
CN113299313A (zh) | 2021-08-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022161475A1 (fr) | Procédé et appareil de traitement audio et dispositif électronique | |
EP3291231B1 (fr) | Suréchantillonnage dans un banc de filtres de transposition combiné | |
TWI556227B (zh) | 從訊號的低頻成份產生該訊號之高頻成份的系統與方法,及其機上盒、電腦程式產品、軟體程式及儲存媒體 | |
US8971551B2 (en) | Virtual bass synthesis using harmonic transposition | |
CN104318930B (zh) | 子带处理单元以及生成合成子带信号的方法 | |
CN106658284A (zh) | 频域中的虚拟低音的相加 | |
WO2021052287A1 (fr) | Procédé d'extension de bande de fréquences, appareil, dispositif électronique et support de stockage lisible par ordinateur | |
CN107705801A (zh) | 语音带宽扩展模型的训练方法及语音带宽扩展方法 | |
CN112259116B (zh) | 一种音频数据的降噪方法、装置、电子设备及存储介质 | |
Wang et al. | Denoising speech based on deep learning and wavelet decomposition | |
EP2720477B1 (fr) | Synthèse virtuelle de graves à l'aide de transposition harmonique | |
CN106653049A (zh) | 时域中的虚拟低音的相加 | |
Nakamura et al. | Time-domain audio source separation based on Wave-U-Net combined with discrete wavelet transform | |
CN116705056A (zh) | 音频生成方法、声码器、电子设备及存储介质 | |
JP7421827B2 (ja) | 音声変換装置、音声変換方法及び音声変換プログラム | |
US11404055B2 (en) | Simultaneous dereverberation and denoising via low latency deep learning | |
Goodwin et al. | Frequency-domain algorithms for audio signal enhancement based on transient modification | |
Lan et al. | Research on improved DNN and MultiResU_Net network speech enhancement effect | |
AU2019201296B2 (en) | Efficient combined harmonic transposition | |
Sueur et al. | Introduction to Frequency Analysis: The Fourier Transformation | |
Vanambathina et al. | Real time speech enhancement using densely connected neural networks and Squeezed temporal convolutional modules | |
Srinivasarao | Speech signal analysis and enhancement using combined wavelet Fourier transform with stacked deep learning architecture | |
WO2024102983A1 (fr) | Reconstruction de signal audio pleine bande activée par sortie en provenance d'un modèle d'apprentissage automatique | |
Wang et al. | Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments | |
CN117079623A (zh) | 音频降噪模型训练方法、歌唱作品处理方法、设备和介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22745347 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 22745347 Country of ref document: EP Kind code of ref document: A1 |