CN114067832B - Method, apparatus, and electronic device for predicting a head-related transfer function - Google Patents
- Publication number: CN114067832B
- Application number: CN202111332717.2A
- Authority: CN (China)
- Prior art keywords: HRTF, layer, encoder, automatic encoder, amplitude spectrum
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111332717.2A CN114067832B (zh) | 2021-11-11 | 2021-11-11 | Method, apparatus, and electronic device for predicting a head-related transfer function |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111332717.2A CN114067832B (zh) | 2021-11-11 | 2021-11-11 | Method, apparatus, and electronic device for predicting a head-related transfer function |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114067832A (zh) | 2022-02-18 |
CN114067832B (zh) | 2024-05-14 |
Family
ID=80275011
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111332717.2A Active CN114067832B (zh) | Method, apparatus, and electronic device for predicting a head-related transfer function | 2021-11-11 | 2021-11-11 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114067832B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114662663B (zh) * | 2022-03-25 | 2023-04-07 | South China Normal University | Method for acquiring sound playback data of a virtual auditory system, and computer device |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
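CN108038291A (zh) * | 2017-12-05 | 2018-05-15 | Wuhan University | Personalized head-related transfer function generation system and method based on a human body parameter adaptation algorithm |
CN108805104A (zh) * | 2018-06-29 | 2018-11-13 | China National Aeronautical Radio Electronics Research Institute | Personalized HRTF acquisition system |
CN112328676A (zh) * | 2020-11-27 | 2021-02-05 | Jianghan University | Method for estimating a personalized head-related transfer function, and related device |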
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR3059191B1 (fr) * | 2016-11-21 | 2019-08-02 | Institut Mines Telecom | Improved audio headset device |
Non-Patent Citations (1)
Title |
---|
Riccardo Miccini et al., "A hybrid approach to structural modeling of individualized HRTFs," 2021 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW), 2021-05-06, pp. 80-85 * |
Also Published As
Publication number | Publication date |
---|---|
CN114067832A (zh) | 2022-02-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110074813B (zh) | | Ultrasound image reconstruction method and system |
US10313818B2 (en) | | HRTF personalization based on anthropometric features |
CN110335587B (zh) | | Speech synthesis method, system, terminal device, and readable storage medium |
US9681250B2 (en) | | Statistical modelling, interpolation, measurement and anthropometry based prediction of head-related transfer functions |
Miccini et al. | | HRTF individualization using deep learning |
CN112526451B (zh) | | Compressive beamforming based on microphone array imaging, and system |
Li et al. | | GAN-based spatial image steganography with cross feedback mechanism |
CN108596016B (zh) | | Personalized head-related transfer function modeling method based on a deep neural network |
CN114067832B (zh) | | Method, apparatus, and electronic device for predicting a head-related transfer function |
CN113284046A (zh) | | Remote sensing image enhancement and restoration method and network without high-resolution reference images |
CN113935240A (zh) | | Artificial seismic wave simulation method based on a generative adversarial network algorithm |
CN114469132A (zh) | | Model training method, apparatus, electronic device, and storage medium |
Xia et al. | | Domain fingerprints for no-reference image quality assessment |
CN110428848B (zh) | | Speech enhancement method based on common-space speech model prediction |
CN115565548A (zh) | | Abnormal sound detection method, apparatus, storage medium, and electronic device |
Li et al. | | High-capacity coverless image steganographic scheme based on image synthesis |
JP2019168608A (ja) | | Learning device, sound generation device, method, and program |
Yang et al. | | GAN-based radar spectrogram augmentation via diversity injection strategy |
CN110415722A (zh) | | Speech signal processing method, storage medium, computer program, and electronic device |
Zhang et al. | | Personalized HRTF modeling using DNN-augmented BEM |
Zhang et al. | | HRTF field: Unifying measured HRTF magnitude representation with neural fields |
Xi et al. | | Magnitude modelling of individualized HRTFs using DNN based spherical harmonic analysis |
Gao et al. | | Improved Convolutional Neural Network–Time-Delay Neural Network Structure with Repeated Feature Fusions for Speaker Verification |
CN114998137A (zh) | | Ground-penetrating radar image clutter suppression method based on a generative adversarial network |
KR20210137665A (ko) | | Heatmap auralization method and apparatus for analyzing a neural network model |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
CB03 | Change of inventor or designer information | Inventors after: Yao Dingding, Zhao Jiale, Li Junfeng, Guo Xiaochao, Liu Qingfeng, Yan Yonghong. Inventors before: Yao Dingding, Zhao Jiale, Li Junfeng, Yan Yonghong |
TA01 | Transfer of patent application right | Effective date of registration: 2024-01-31. Applicant after: AIR FORCE SPECIALTY MEDICAL CENTER OF PLA, No. 28 Fu Cheng Road, Haidian District, Beijing 100142, China. Applicant before: INSTITUTE OF ACOUSTICS, CHINESE ACADEMY OF SCIENCES, No. 21 West Fourth Ring Road, Haidian District, Beijing 100190, China |
GR01 | Patent grant | |