CN110493701B - HRTF (head related transfer function) personalization method based on sparse principal component analysis - Google Patents

HRTF (head related transfer function) personalization method based on sparse principal component analysis Download PDF

Info

Publication number
CN110493701B
CN110493701B CN201910642057.4A CN201910642057A CN110493701B CN 110493701 B CN110493701 B CN 110493701B CN 201910642057 A CN201910642057 A CN 201910642057A CN 110493701 B CN110493701 B CN 110493701B
Authority
CN
China
Prior art keywords
principal component
hrtf
component analysis
physiological parameters
sparse
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910642057.4A
Other languages
Chinese (zh)
Other versions
CN110493701A (en
Inventor
曾向阳
路东东
王海涛
周治宇
马慧颖
晋安其
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northwestern Polytechnical University
Original Assignee
Northwestern Polytechnical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northwestern Polytechnical University filed Critical Northwestern Polytechnical University
Priority to CN201910642057.4A priority Critical patent/CN110493701B/en
Publication of CN110493701A publication Critical patent/CN110493701A/en
Application granted granted Critical
Publication of CN110493701B publication Critical patent/CN110493701B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2135Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2136Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on sparsity criteria, e.g. with an overcomplete basis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention discloses a method for realizing HRTF personalization based on a sparse principal component analysis, which comprises the steps of firstly, using a principal component analysis method to reduce the dimension of the HRTF; then, reducing the dimension of the physiological parameters by using a sparse principal component analysis method; and finally, taking the physiological parameters subjected to dimension reduction as input, taking the HRTF subjected to dimension reduction as output, and applying a generalized regression neural network to perform nonlinear fitting. The invention applies sparse principal component analysis to realize the dimension reduction of three-dimensional physiological parameters, and reduces original dozens of dimensions of physiological parameters into several dimensions without influencing the structure of data. Local convergence of data when the correlation method is applied to dimension reduction processing is avoided, and the dimension reduction effect is improved. And the data subjected to dimensionality reduction is applied to carry out regression on the HRTF principal component coefficients, so that the regression time is reduced, and the overfitting risk is avoided. The HRTF personalization can be better realized by applying the three-dimensional physiological parameters subjected to dimension reduction.

Description

HRTF (head related transfer function) personalization method based on sparse principal component analysis
Technical Field
The patent relates to an HRTF personalization method, in particular to an HRTF personalization method based on sparse principal component analysis.
Background
A Head Related Transfer Function (HRTF) is a frequency-domain acoustic Transfer Function for describing that sound emitted from a free-field sound source reaches both ears after being scattered and reflected by physiological structures such as the Head, pinna, and trunk. Each sound source spatial position corresponds to a pair of HRTFs, which are generally functions of the distance of the sound source to the center of the head, the azimuth, elevation, and frequency of the sound source. Since the physiological structure and size of different individuals are different, and HRTFs are closely related to the physiological structure and size, they are physical quantities with distinct personalized features.
The filtering effect of physiological structures such as the trunk, the head and the auricle has more significant influence on the acoustic signal, that is, the physiological structures have more significant influence on the HRTF.
The literature "Martens W L.principles components and reconstruction of spectral documents to temporal direction. in: Proceedings of 1987International computer Music Conference,1987, 274-281". A method for personalizing head-related transfer functions by using human body morphological characteristics is disclosed, namely, head-related transfer functions are approximately estimated by using the correlation between basis functions and human body measurement items. In this paper, the authors select empirically the physiological parameters needed to build the model. Document "method of personalization of head-related transfer functions". A new method for constructing physiological parameters required by a model is disclosed. According to the method, all physiological parameters provided by the CIPIC database are subjected to correlation analysis, and irrelevant physiological parameters are selected as the physiological parameters required by the construction model. The physiological parameters obtained through experience are not persuasive in theory, and compared with the physiological parameters obtained through correlation analysis, the physiological parameters obtained through the correlation analysis theoretically have certain theoretical basis, however, the correlation analysis only considers the correlation between the physiological parameters, and does not consider the correlation between the physiological parameters and the HRTF. Meanwhile, the physiological parameters aimed at by the method are two-position physiological parameters, the physiological parameters are widely distributed and have weak local correlation, and the physiological parameters are all correlated when the three-dimensional physiological parameters are subjected to dimension reduction by applying the correlation due to relatively dense aggregation, so that the method is not applicable to the three-dimensional physiological parameters. Therefore, in order to realize HRTF personalization by directly applying three-dimensional physiological parameters, a sparse principal component analysis-based method is provided herein to realize HRTF personalization.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides an HRTF personalization method based on sparse principal component analysis.
The patent proposes a sparse principal component analysis-based method to achieve HRTF personalization. The invention comprises three steps: (1) reducing the dimension of the HRTF by using a Principal Component Analysis (PCA); (2) reducing the dimension of the physiological parameter by applying a Sparse Principal Component Analysis (SPCA); (3) taking the physiological parameters after dimension reduction as input, taking the HRTF after dimension reduction as output, and applying a Generalized Regression Neural Network (GRNN) to perform nonlinear fitting.
Technical scheme
The technical scheme adopted by the invention for solving the technical problem is that the HRTF personalization method based on sparse principal component analysis comprises the following steps:
step 1: selecting all tested HRTFs in a certain determined direction, performing principal component analysis on the HRTFs in the direction to obtain principal components of the direction:
(1) selecting all tested HRTFs with azimuth angle (theta, phi) to form a vector HijWherein i is a tested sequence, j is a frequency serial number, the tested number is m, theta is an azimuth angle, and phi is a pitch angle;
(2) to HijThe following normalization process was performed:
Figure GDA0002604441870000021
wherein the content of the first and second substances,
Figure GDA0002604441870000022
Figure GDA0002604441870000023
is HijA normalized head-related transfer function;
(3) performing principal component analysis on the normalized head-related transfer function:
Figure GDA0002604441870000024
wherein, Pm×nIs a score matrix of m × n for principal component analysis; w is a load matrix of n × n of principal component analysis, T represents the transpose of the matrix;
step two: and reducing the dimension of the three-dimensional physiological parameters by using an SPCA method.
(1) All tested physiological parameters are formed into vectors of X ═ { X ═ X1,x2,…,xn}∈Rp×nWhere p represents the physiological parameter dimension of a single test subject and n represents the number of test subjects. Singular Value Decomposition (SVD) is performed on X.
X=UDVT(3)
Where Z ═ UD is the principal component of vector X, and V is the load matrix of vector X.
(2) In this patent, we introduce sparse loadings to estimate the principal components. The sparse load is regressed by an elastic network. For the ith principal component, define Zi=UiDii
Figure GDA0002604441870000031
Wherein the content of the first and second substances,
Figure GDA0002604441870000032
is the first order norm of β; λ, λ1Is a resilient penalty function.
(4) Vector alpha corresponding to first L principal components of X principal component analysisjForming a new initialization matrix A; setting a random initialization matrix B ═ beta at the same time1,…,βL]。
(5) In the given case of a, the elastic net regression problem is solved:
Figure GDA0002604441870000033
wherein λ is1,λ1,jAre all elastic penalty functions
(6) Update B ═ β1,…,βL]Calculating XTXB=UDVTSimultaneously update A ═ UVT
(7) Repeating the steps (5) and (6) until the convergence of B
(8) Normalized betajAnd obtaining the sparse load matrix.
(9) And obtaining main components representing the physiological parameters according to the sparse load matrix.
Step three: and (3) taking the obtained physiological parameters as input, taking the principal component coefficients subjected to HRTF dimensionality reduction as output, and performing regression by applying GRNN. And restoring the regressed main component to obtain the restored HRTF.
Advantageous effects
The dimension reduction of three-dimensional physiological parameters can be realized by applying sparse principal component analysis, and original dozens of dimensions of physiological parameters are reduced into dozens of dimensions without influencing the structure of data. The three-dimensional physiological parameters are processed by applying sparse principal component analysis, so that the local convergence of data is avoided when a correlation method is applied to dimension reduction processing, and the dimension reduction effect is improved. And the data subjected to dimensionality reduction is applied to carry out regression on the HRTF principal component coefficients, so that the regression time is reduced, and the overfitting risk is avoided. The HRTF personalization can be better realized by applying the three-dimensional physiological parameters subjected to dimension reduction.
Drawings
FIG. 1 is a flowchart of HRTF personalization based on sparse principal component analysis;
fig. 2. experimental HRTFs were regressed by physiological parameters selected from sparse principal components, 063 (0 ° ).
Detailed description of the invention
Refer to fig. 1. The HRTF personalization process is specifically described for HRTFs with an azimuth angle θ equal to 0 ° and a pitch angle Φ equal to 0 °.
Step 1: selecting all tested HRTFs in the direction (0 degrees and 0 degrees), and performing principal component analysis on the HRTFs in the direction to obtain principal components of the direction:
(1) selecting all tested HRTFs with the azimuth of (0 degrees and 0 degrees) to form a vector HijWherein i is a tested sequence, j is a frequency serial number, the tested number is m, the azimuth angle theta is 0 DEG, and the pitch angle phi is 0 DEG;
(2) to HijThe following normalization process was performed:
Figure GDA0002604441870000041
wherein the content of the first and second substances,
Figure GDA0002604441870000042
Figure GDA0002604441870000043
is HijA normalized head-related transfer function;
(3) performing principal component analysis on the normalized head-related transfer function:
Figure GDA0002604441870000044
wherein, Pm×nIs a score matrix of m × n for principal component analysis; w is a load matrix of n × n of principal component analysis, T represents the transpose of the matrix;
step two: and reducing the dimension of the three-dimensional physiological parameters by using an SPCA method.
(1) Forming a tested physiological parameter with the number of 001-062 into a vector X ═ { X ═ X1,x2,…,xn}∈Rp×nWhere p represents the physiological parameter dimension of a single test subject and n represents the number of test subjects. Singular Value Decomposition (SVD) is performed on X.
X=UDVT(3)
Where Z ═ UD is the principal component of vector X, and V is the load matrix of vector X.
(2) In this patent, we introduce sparse loadings to estimate the principal components. The sparse load is regressed by an elastic network. For the ith principal component, define Zi=UiDii
Figure GDA0002604441870000045
Wherein the content of the first and second substances,
Figure GDA0002604441870000051
is the first order norm of β; λ, λ1Is a resilient penalty function.
(3) Vector omega corresponding to first L principal components of X principal component analysisjForming a new initialization matrix; setting a random initialization matrix B ═ beta at the same time1,…,βL]。
(4) In the given case of a, the elastic net regression problem is solved:
Figure GDA0002604441870000052
wherein λ is1,λ1,jAre all elastic penalty functions
(5) Update B ═ β1,…,βL]Calculating XTXB=UDVTSimultaneously update A ═ UVT
(6) Repeating the steps (5) and (6) until the convergence of B
(7) Normalized betajAnd obtaining the sparse load matrix.
(8) And obtaining main components representing the physiological parameters according to the sparse load matrix.
Step three: and (3) taking the obtained physiological parameters with the sparse principal component accumulation percentage of 95% as input, taking the principal component coefficients subjected to HRTF dimensionality reduction as output, and applying GRNN to carry out regression. And restoring the regressed main component to obtain the restored HRTF.
The tested sample with the number 063 is taken as a test sample, the physiological parameter of 063 with the sparse principal component accumulation percentage of 95% is selected as input, the trained GRNN network is used for training, and the training result is shown in figure 2. As can be seen from fig. 2, the trained HRTF is not much different from the original HRTF, and the result is better than that in the comparative paper.

Claims (1)

1. A head-related transfer function (HRTF) personalization method based on sparse principal component analysis comprises the following steps:
the method comprises the following steps: selecting all tested HRTFs in a certain determined direction, performing principal component analysis on the HRTFs in the direction to obtain principal components of the direction:
(1) selecting all tested HRTFs with azimuth angle (theta, phi) to form a vector HijWherein i is a tested sequence, j is a frequency serial number, the tested number is m, theta is an azimuth angle, and phi is a pitch angle;
(2) to HijThe following normalization process was performed:
Figure FDA0002622100800000011
wherein the content of the first and second substances,
Figure FDA0002622100800000012
Figure FDA0002622100800000013
is HijA normalized head-related transfer function;
(3) performing principal component analysis on the normalized head-related transfer function:
Figure FDA0002622100800000014
wherein, Pm×nIs a score matrix of m × n for principal component analysis; w is a load matrix of n × n of principal component analysis, T represents the transpose of the matrix;
step two: applying a sparse principal component analysis method, namely an SPCA method to reduce the dimension of the three-dimensional physiological parameters:
(1) all tested physiological parameters are formed into vectors of X ═ { X ═ X1,x2,…,xn}∈Rp×nWherein p represents a single physiological parameter dimension of the subject and n represents the number of subjects; performing Singular Value Decomposition (SVD) on X;
X=UDVT(3)
wherein, Z ═ UD is the principal component of vector X, V is the load matrix of vector X;
(2) in this patent, we introduce sparse loads to estimate the principal components; the sparse load is regressed through an elastic network; for the ith principal component, define Zi=UiDii
Figure FDA0002622100800000015
Wherein the content of the first and second substances,
Figure FDA0002622100800000016
is the first order norm of β; λ, λ1Is a resilient penalty function;
(4) analysis of X principal ComponentsVector α corresponding to the first L principal components of (a)jForming a new initialization matrix A; setting a random initialization matrix B ═ beta at the same time1,…,βL];
(5) In the given case of a, the elastic net regression problem is solved:
Figure FDA0002622100800000021
wherein λ is1,λ1,jAre all elastic penalty functions
(6) Update B ═ β1,…,βL]Calculating XTXB=UDVTSimultaneously update A ═ UVT
(7) Repeating the steps (5) and (6) until the convergence of B
(8) Normalized betajObtaining a sparse load matrix;
(9) acquiring main components representing physiological parameters according to the sparse load matrix;
step three: taking the obtained physiological parameters as input, taking principal component coefficients subjected to HRTF dimensionality reduction as output, and applying a generalized regression neural network GRNN to carry out regression; and restoring the regressed main component to obtain the restored HRTF.
CN201910642057.4A 2019-07-16 2019-07-16 HRTF (head related transfer function) personalization method based on sparse principal component analysis Active CN110493701B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910642057.4A CN110493701B (en) 2019-07-16 2019-07-16 HRTF (head related transfer function) personalization method based on sparse principal component analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910642057.4A CN110493701B (en) 2019-07-16 2019-07-16 HRTF (head related transfer function) personalization method based on sparse principal component analysis

Publications (2)

Publication Number Publication Date
CN110493701A CN110493701A (en) 2019-11-22
CN110493701B true CN110493701B (en) 2020-10-27

Family

ID=68547382

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910642057.4A Active CN110493701B (en) 2019-07-16 2019-07-16 HRTF (head related transfer function) personalization method based on sparse principal component analysis

Country Status (1)

Country Link
CN (1) CN110493701B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114662663B (en) * 2022-03-25 2023-04-07 华南师范大学 Sound playing data acquisition method of virtual auditory system and computer equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109166592A (en) * 2018-08-08 2019-01-08 西北工业大学 HRTF frequency-division section linear regression method based on physiological parameter
CN109769195A (en) * 2018-07-26 2019-05-17 西北工业大学 A kind of HRTF middle vertical plane orientation Enhancement Method
CN109791768A (en) * 2016-09-30 2019-05-21 冠状编码股份有限公司 For being converted to three-dimensional sound signal, stereo coding, decoding and transcoding process
CN110012384A (en) * 2018-01-04 2019-07-12 音科有限公司 A kind of method, system and the equipment of portable type measuring head related transfer function (HRTF) parameter

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100730297B1 (en) * 2005-05-31 2007-06-19 한국과학기술원 Sound source localization method using Head Related Transfer Function database
KR100954385B1 (en) * 2007-12-18 2010-04-26 한국전자통신연구원 Apparatus and method for processing three dimensional audio signal using individualized hrtf, and high realistic multimedia playing system using it
CN106231528B (en) * 2016-08-04 2017-11-10 武汉大学 Personalized head related transfer function generation system and method based on segmented multiple linear regression
CN107909653B (en) * 2017-11-15 2021-03-30 电子科技大学 Heart soft tissue three-dimensional reconstruction method based on sparse principal component analysis
CN108038291B (en) * 2017-12-05 2021-09-03 武汉大学 Personalized head-related transfer function generation system and method based on human body parameter adaptation algorithm
CN109299489A (en) * 2017-12-13 2019-02-01 中航华东光电(上海)有限公司 A kind of scaling method obtaining individualized HRTF using interactive voice

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109791768A (en) * 2016-09-30 2019-05-21 冠状编码股份有限公司 For being converted to three-dimensional sound signal, stereo coding, decoding and transcoding process
CN110012384A (en) * 2018-01-04 2019-07-12 音科有限公司 A kind of method, system and the equipment of portable type measuring head related transfer function (HRTF) parameter
CN109769195A (en) * 2018-07-26 2019-05-17 西北工业大学 A kind of HRTF middle vertical plane orientation Enhancement Method
CN109166592A (en) * 2018-08-08 2019-01-08 西北工业大学 HRTF frequency-division section linear regression method based on physiological parameter

Also Published As

Publication number Publication date
CN110493701A (en) 2019-11-22

Similar Documents

Publication Publication Date Title
Choi et al. Blind source separation and independent component analysis: A review
Sawada et al. Multichannel extensions of non-negative matrix factorization with complex-valued data
Zdunek et al. Non-negative matrix factorization with quasi-newton optimization
Kim et al. Blind source separation exploiting higher-order frequency dependencies
Palmer et al. Newton method for the ICA mixture model
Fu et al. Blind source separation by entropy rate minimization
KR102229218B1 (en) Signal translation system and signal translation method
Zhang et al. Self-adaptive blind source separation based on activation functions adaptation
DE102021107050A1 (en) GRADIENT COMPRESSION FOR DISTRIBUTED TRAINING
CN110032704B (en) Data processing method, device, terminal and storage medium
Xian-Chuan et al. A new blind image source separation algorithm based on feedback sparse component analysis
CN110493701B (en) HRTF (head related transfer function) personalization method based on sparse principal component analysis
Rodriguez et al. General non-orthogonal constrained ICA
Lu et al. Eliminating indeterminacy in ICA
Grijalva et al. Anthropometric-based customization of head-related transfer functions using Isomap in the horizontal plane
Mohanaprasad et al. Noise reduction in speech signals using adaptive independent component analysis (ICA) for hands free communication devices
Redif Fetal electrocardiogram estimation using polynomial eigenvalue decomposition
Einizade et al. A unified approach for simultaneous graph learning and blind separation of graph signal sources
Luo Independent vector analysis: Model, applications, challenges
Huang et al. An experimental study: on reducing RBF input dimension by ICA and PCA
Ben et al. An adaptive neural networks formulation for the two-dimensional principal component analysis
Zhang et al. Extended gaussianization method for blind separation of post-nonlinear mixtures
Pearlmutter et al. Sparse separation: Principles and tricks
Calhoun et al. Complex infomax: convergence and approximation of infomax with complex nonlinearities
Birlutiu et al. Multi-task preference learning with gaussian processes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant