CN104503963A - Method for processing head-related impulse response data set - Google Patents

Method for processing head-related impulse response data set Download PDF

Info

Publication number
CN104503963A
CN104503963A CN201410512710.2A CN201410512710A CN104503963A CN 104503963 A CN104503963 A CN 104503963A CN 201410512710 A CN201410512710 A CN 201410512710A CN 104503963 A CN104503963 A CN 104503963A
Authority
CN
China
Prior art keywords
hrir
point
dimensional
sample point
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410512710.2A
Other languages
Chinese (zh)
Other versions
CN104503963B (en
Inventor
陈喆
殷福亮
周文颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University of Technology
Original Assignee
Dalian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University of Technology filed Critical Dalian University of Technology
Priority to CN201410512710.2A priority Critical patent/CN104503963B/en
Publication of CN104503963A publication Critical patent/CN104503963A/en
Application granted granted Critical
Publication of CN104503963B publication Critical patent/CN104503963B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08Feature extraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions

Abstract

The embodiment of the invention provides a method for processing a head-related impulse response data set. The method comprises the following steps: a processor groups the head-related impulse response (HRIR) data set according to an angle of elevation and/or a horizontal angle; the processor selects the HRIR sample point with a minimum reconstruction error value in the grouped HRIR data set as a high-dimensional HRIR sample point through locally linear embedding (LLE); the processor reduces the dimension of the HRIR sample point to a low-dimensional space to obtain low-dimension HRIR sample points; the processor adopts sectional straight line fitting to select characteristic HRIR sample points in the low-dimension HRIR sample points; and the processor combines the characteristic HRIR sample points into a three-dimensional matrix according to an angle-of-elevation sequence and/or a horizontal angle sequence. The embodiment of the invention solves the problem of excessive errors of a characteristic HRIR selection method.

Description

Head coherent pulse response data sets disposal route
Technical field
The embodiment of the present invention relates to computer science, particularly relates to a kind of head coherent pulse response data sets disposal route.
Background technology
Human auditory can not only tone, loudness of a sound, the tone color of perceived sounds, can also differentiate Sounnd source direction and distance.In order to make not audience at the scene also can obtain " telepresenc " true to nature, people have invented three dimensional audio technology.Three dimensional audio technology can reproduce the acoustic information of original sound field truly, makes hearer produce the listening experience of " on the spot in person ".Three-dimensional audio realizes by " binaural sound " technology, namely head related transfer function (Head Related Transfer function is adopted, hereinafter referred to as: HRTF) filtering is carried out to single channel sound-source signal, by the sound with sense of direction obtained, then reset with earphone or a pair loudspeaker.HRTF time-domain representation be called the response of coherent pulse (Head Related Impulse Responses, hereinafter referred to as: HRIR).Binaural sound learns a skill and is widely used in blind person's sense of hearing navigation, flight training, virtual reality.HRTF describes the process that sound propagates into from sound source in ear, and the trunk, head, ear etc. of people have diffraction to voice signal, reflect, the effect such as to block, and can be equivalent to filter action.Usually the HRTF database obtained through accurate experiment measurement is adopted in reality.As shown in Figure 1, this database is a three-dimensional matrice, stores the HRTF of thousands of sound source locus, and the shock response duration of each HRTF is longer, and Equivalent FIR Filter exponent number is very high, and everyone HRTF is not quite similar.Therefore, the HRTF data of the person that store multiple-position measurement, need to consume huge storage space.When adopting embedded chip synthesis three-dimensional sound, the more aobvious scarcity of its storage space, becomes the bottleneck of three dimensional audio technology application.Now, if can with auditory localization performance loss inconspicuous for cost, exchange the very large storage space reduction of HRTF database for, just better can solve HRTF database and expend the large problem of memory space, reduce power consumption and the cost of the chip used in practical application.
In prior art, adopt manifold learning LLE algorithm to carry out dimensionality reduction to one group of HRIR, extract feature HRIR and represent all HRIR of this group, and other non-feature HRIR carries out interpolation by the feature HRIR of this group and obtains.The all HRIR being about to be positioned at same horizontal angle are divided into one group, and to often organizing HRIR, by dimensionality reduction module, by higher-dimension HRIR sample point dimensionality reduction to lower dimensional space, obtain the HRIR sample point of low-dimensional, this dimensionality reduction module uses LLE algorithm; Then, choose module by unique point, carrying out cluster to often organizing low-dimensional sample point, obtaining the low-dimensional sample point corresponding to cluster centre, this higher-dimension HRIR corresponding to low-dimensional sample point is the feature HRIR selected by this group, and what unique point chose that module adopts is K-means clustering algorithm.When reconstructing all HRIR according to feature HRIR, adopt two step interpolation: the first, adopt linear interpolation, according to the feature HRIR of this group, reconstruct all non-feature HRIR of this group; The second, the Neighborhood matrix neighbor obtained in LLE module in basis and weight matrix W, again carries out interpolation to non-feature HRIR, obtains the HRIR after all reconstruct of this group.
Due in feature HRIR leaching process, adopt K-mean cluster module extract the feature HRIR obtained and be weighted reconstruct, reconstructed error is bigger than normal.And reconstructing in all HRIR processes according to feature HRIR, the reconstructed error that the two step interpolation adopted in weighting reconstructed module obtain is bigger than normal, when this is owing to carrying out second step interpolation, non-feature HRIR is through linear interpolation, itself is with error, if use these HRIR to be reconstructed other orientation HRIR, can cause deviation accumulation.
Summary of the invention
The embodiment of the present invention provides a kind of head coherent pulse response data sets disposal route, to overcome technical matters large for the reconstructed error of HRIR in prior art.
Embodiments provide a kind of head coherent pulse response data sets disposal route, comprising:
Head coherent pulse response HRIR data set divides into groups according to the elevation angle and/or horizontal angle by processor;
Described processor is by choosing the HRIR of reconstructed error minimum value as higher-dimension HRIR sample point in Local Liner Prediction LLE after grouping HRIR;
Described HRIR sample point dimensionality reduction to lower dimensional space, is obtained low-dimensional HRIR by described processor;
Described processor adopts sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point;
Described feature HRIR sample is combined into three-dimensional matrice according to elevation angle order and/or horizontal angle order by described processor.
Further, described processor, by choosing the HRIR of reconstructed error minimum value in Local Liner Prediction LLE after grouping HRIR as HRIR sample point, comprising:
Processor chooses a HRIR sample point in HRIR after grouping, calculates the Euclidean distance between described HRIR sample point and other HRIR sample points, and chooses the point of proximity of described HRIR sample point according to described Euclidean distance;
Construct the local covariance matrix of described HRIR sample point, determine the conditional value of reconstructed error
Σ j = 1 K w j i = 1 - - - ( 1 )
Wherein, described K is and HRIR sample point x ineighbor point number, described j is the subscript of a jth neighbor point, described in for using a jth neighbor point x jreconstruct x iweights;
By described local covariance matrix Q iand described conditional value, adopt formula
w j i = Σ k = 1 K ( Q i ) jk - 1 Σ j = 1 K Σ k = 1 K ( Q i ) jk - 1 - - - ( 2 )
Calculate the weights reconstruction matrix w of described HRIR sample point ij, wherein, described K is local covariance matrix Q idimension, described i is the subscript of i-th sample point, and described k is local covariance matrix Q ikth dimension, described j is the subscript of a jth neighbor point;
By described w ijsubstitute into reconstructed error function formula
min ϵ ( W ) = Σ i = 1 I | x i - Σ j = 1 K w j i x j | 2 - - - ( 3 )
Determine that the HRIR sample point corresponding to reconstructed error minimum value is the higher-dimension HRIR sample point that can carry out dimension-reduction treatment, wherein, described I is that sample is always counted, and described i is the subscript of i-th sample, and described K is setting and described HRIR sample point x ithe number of the sample point that Euclidean distance is minimum, described x ifor the described HRIR sample point chosen, described x jfor described x ia jth neighbor point, described min ε (W) is minimal reconstruction error amount;
Described HRIR sample point dimensionality reduction to lower dimensional space, is obtained low-dimensional HRIR, comprises by described processor:
Formula will be met
Σ i = 1 N y i = 0 , 1 N Σ i = 1 N y i y i T = I - - - ( 4 )
Described HRIR sample point substitute into formula
min ϵ ( Y ) = Σ i = 1 I | y i - Σ j = 1 K w j i y j | 2 - - - ( 5 )
Obtain loss function
min ϵ ( Y ) = Σ j = 1 N Σ j = 1 N m ij y i T y j - - - ( 6 )
Choose formula
M=(I-W) T(I-W) (7)
Minimum d the nonzero eigenvalue of middle M is updated to described loss function, obtains with the proper vector of Y as the corresponding sample point mapping to lower dimensional space, wherein, and described y ifor described x ilow-dimensional map vector, described y jfor described y ia jth neighbor point, described min ε (Y) is loss function, and described i is the subscript of i-th sample, and described j is the subscript of a jth neighbor point, and described I is that sample is always counted, and described k is and y icontiguous counts.
Further, described processor adopts sectional straight line fitting selected characteristic HRIR sample point in the HRIR sample point of described low-dimensional, comprising:
Any two starting points for the one-dimensional manifold of low-dimensional HRIR group adopt formula
y 1 d y 2 d · · · y Nd = y 11 y 12 · · · y 1 , d - 1 y 21 y 22 · · · y 2 , d - 1 · · · · · · · · · y N 1 y N 2 · · · y N , d - 1 a 0 a 1 · · · a d - 1 - - - ( 8 )
Calculated line coefficient (a 0, a 1..., a d-1), wherein, described y is the point of the one-dimensional manifold of low-dimensional HRIR group, and described d is space dimensionality, and described a is straight line coefficient;
According to the bearing of trend of one-dimensional manifold, add N point, adopt formula
e N = Σ i = 1 N [ | | y id - y ^ id | | 2 ] 2 - - - ( 9 )
Calculate N point error of fitting e n, wherein, described N counts for described any two starting point institute structures are straight, and described i is that i-th section of straight line is counted, and described y is the point of the one-dimensional manifold of low-dimensional HRIR group, and described d is space dimensionality, described in for the actual point of the one-dimensional manifold of low-dimensional HRIR group;
If the described error of fitting e of N point nbe less than maximum error e mAX, then N point is joined in the point range between two starting points; If the described error of fitting e of N point nbe greater than maximum error e mAXthen using the end points of N-1 point as this section of straight line, then the higher-dimension HRIR corresponding to described end points is feature HRIR.
Further, described e mAX=ENER × ratio, wherein, described ratio is error of fitting ratio, and described ENER is the normalized energy that low-dimensional embeds stream shape.
The embodiment of the present invention, HRIR data set divides into groups according to the elevation angle and/or horizontal angle by processor, described processor is by choosing the HRIR of reconstructed error minimum value as higher-dimension HRIR sample point in Local Liner Prediction LLE after grouping HRIR, described processor by described HRIR sample point dimensionality reduction to lower dimensional space, obtain low-dimensional HRIR, adopt sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point, according to elevation angle order and/or horizontal angle order, described feature HRIR sample is combined into three-dimensional matrice, solve the problem that in prior art, extraordinary HRIR choosing method error is excessive.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is HRTF three-dimensional data base schematic diagram in prior art;
Fig. 2 is HRIR data set process flow figure of the present invention;
Fig. 3 is HRIR 3-D data set process flow figure of the present invention;
Fig. 4 is Local Liner Prediction LLE schematic diagram of the present invention;
Fig. 5 is the present invention 2 dimensional feature HRIR position view;
Fig. 6 is head relative coordinate system schematic diagram of the present invention.
Embodiment
For making the object of the embodiment of the present invention, technical scheme and advantage clearly, below in conjunction with the accompanying drawing in the embodiment of the present invention, technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
Fig. 2 is head coherent pulse response data sets process flow figure of the present invention, and as shown in Figure 2, the method for the present embodiment can comprise:
Head coherent pulse response HRIR data set divides into groups according to the elevation angle and/or horizontal angle by step 101, processor;
Step 102, described processor are by choosing the HRIR of reconstructed error minimum value as higher-dimension HRIR sample point in Local Liner Prediction LLE after grouping HRIR;
Described HRIR sample point dimensionality reduction to lower dimensional space, is obtained low-dimensional HRIR by step 103, described processor;
Step 104, described processor adopt sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point;
Described feature HRIR sample is combined into three-dimensional matrice according to elevation angle order and/or horizontal angle order by step 105, described processor.
Specifically, in HRIR database, each orientation for the HRIR of a plane should be equally distributed.Illustrate, in the HRIR database of CIPIC, the measurement orientation in horizontal angle is respectively-80 ° ,-65 ° ,-55 ° ,-45 ° ~ 45 °, 55 °, 65 °, 80 °.Wherein, between [-45 °, 45 °], uniform sampling is carried out with 5 ° of intervals.Therefore, the HRIR between [-45 °, 45 °] of each horizontal angle plane should be compiled is one group.And-80 of two ends ° ,-65 ° ,-55 ° ,-45 ° ~ 45 °, 55 °, 65 °, the HRIR in the 80 ° of orientation feature HRIR that will be directly brought into concentrates.Fig. 3 a is each elevation plane of HRIR three-dimensional data after grouping, and Fig. 3 b is each horizontal angle plane of HRIR three-dimensional data after grouping.For the HRIR after grouping by Local Liner Prediction (Locally Linear Embedding, hereinafter referred to as: LLE) calculate reconstructed error, and the HRIR choosing reconstructed error minimum value is as higher-dimension HRIR sample point, by this higher-dimension HRIR sample point dimensionality reduction to lower dimensional space, corresponding low-dimensional HRIR is obtained in lower dimensional space, adopt sectional straight line fitting selected characteristic HRIR sample point from low-dimensional HRIR sample point, finally according to the elevation angle and/or horizontal angle order, described feature HRIR sample is combined into three-dimensional matrice.
The embodiment of the present invention, HRIR data set divides into groups according to the elevation angle and/or horizontal angle by processor, described processor is by choosing the HRIR of reconstructed error minimum value as higher-dimension HRIR sample point in Local Liner Prediction LLE after grouping HRIR, described processor by described HRIR sample point dimensionality reduction to lower dimensional space, obtain low-dimensional HRIR, adopt sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point, according to elevation angle order and/or horizontal angle order, described feature HRIR sample is combined into three-dimensional matrice, solve the problem that in prior art, extraordinary HRIR choosing method error is excessive.
Fig. 4 is Local Liner Prediction LLE schematic diagram of the present invention, and as shown in Figure 4, the method for the present embodiment can comprise:
Step 201, processor choose a HRIR sample point in HRIR after grouping, calculate the Euclidean distance between described HRIR sample point and other HRIR sample points, and choose the point of proximity of described HRIR sample point according to described Euclidean distance;
Step 202, construct the local covariance matrix of described HRIR sample point, determine the conditional value of reconstructed error
Σ j = 1 K w j i = 1 - - - ( 1 )
Wherein, described K is and HRIR sample point x ineighbor point number, described j is the subscript of a jth neighbor point, described in for using a jth neighbor point x jreconstruct x iweights;
Step 203, by described local covariance matrix Q iand described conditional value, adopt formula
w j i = Σ k = 1 K ( Q i ) jk - 1 Σ j = 1 K Σ k = 1 K ( Q i ) jk - 1 - - - ( 2 )
Calculate the weights reconstruction matrix w of described HRIR sample point ij, wherein, described K is local covariance matrix Q idimension, described i is the subscript of i-th sample point, and described k is local covariance matrix Q ikth dimension, described in, j is the subscript of a jth neighbor point;
Step 204, by described w ijsubstitute into reconstructed error function formula
min ϵ ( W ) = Σ i = 1 I | x i - Σ j = 1 K w j i x j | 2 - - - ( 3 )
Determine that the HRIR sample point corresponding to reconstructed error minimum value is the higher-dimension HRIR sample point that can carry out dimension-reduction treatment, wherein, described I is that sample is always counted, and described i is the subscript of i-th sample, and described K is setting and described HRIR sample point x ithe number of the sample point that Euclidean distance is minimum, described x ifor the described HRIR sample point chosen, described x jfor described x ia jth neighbor point, described min ε (W) is minimal reconstruction error amount;
Specifically, in LLE algorithm, each HRIR can think a sample point in higher dimensional space, is designated as x i, i=1,2 ..., I.By calculating each sample point x iand Euclidean distance between all the other sample points, and using apart from a minimum K sample point as x ineighbor Points.K value can preset, and is taken as 12 in the present embodiment.The calculating of following calculating through type (1) rebuilds weight matrix W.
min ϵ ( W ) = Σ i = 1 I | x i - Σ j = 1 K w j i x j | 2 - - - ( 4 )
Wherein, x jfor x ia jth Neighbor Points; for using x jreconstruct x iweights, need satisfy condition construct the local covariance matrix Q of this sample point i, its element calculated by publicity (2),
q i jk = ( x i - x j ) T ( x i - x k ) - - - ( 5 )
Wherein, described T is matrix transpose computing;
By formula (5) with in conjunction with, use formula (6) to calculate w ij
w j i = Σ k = 1 K ( Q i ) jk - 1 Σ j = 1 K Σ k = 1 K ( Q i ) jk - 1 - - - ( 6 )
Wherein, Q i∈ R k × K.For preventing due to Q ibe singular matrix and cannot calculate, must to Q icarry out regularization,
Q i=Q i+rI (7)
Wherein, r is regularization coefficient; I ∈ R k × Kfor unit matrix.
Higher-dimension HRIR sample point dimensionality reduction to the concrete steps of lower dimensional space, comprises by described processor:
Higher-dimension HRIR sample point need meet
min ϵ ( Y ) = Σ i = 1 I | y i - Σ j = 1 K w j i y j | 2 - - - ( 8 )
Wherein, ε (Y) is loss function, y ix ilow-dimensional map vector, y jy ja jth Neighbor Points, y ifollowing condition need be met,
Σ i = 1 I y i = 0 , 1 N Σ i = 1 I y i y i T = I - - - ( 9 )
Use sparse matrix W ∈ R i × Istore work as x jbelong to x iduring Neighbor Points, the element in W otherwise, w ij=0.Loss function is written as
min ϵ ( Y ) = Σ i = 1 I Σ j = 1 I m ij y i T y j - - - ( 10 )
Wherein, m ijit is the element of matrix M.Symmetric matrix M ∈ R i × Icalculate by formula (11),
M=(I-W) T(I-W) (11)
For making formula (10) medial error minimum, then choose the proper vector corresponding to minimum d nonzero eigenvalue of M as output Y.Because minimum eigenwert is close to zero, generally get the individual minimum eigenwert characteristic of correspondence vector of front 2 ~ d+1 as Y.Wherein, d is the embedding dimension, is set to 3 in the present embodiment.
In the present embodiment, can adopt other manifold learnings by higher-dimension HRIR sample point dimensionality reduction to lower dimensional space as Isometric Maps algorithm (Isomap), local retaining projection (Locality PreservingProjection, hereinafter referred to as LPP) etc.
The detailed process of sectional straight line fitting is:
Get initial two points of one-dimensional manifold, use formula
y 1 d y 2 d · · · y Nd = y 11 y 12 · · · y 1 , d - 1 y 21 y 22 · · · y 2 , d - 1 · · · · · · · · · y N 1 y N 2 · · · y N , d - 1 a 0 a 1 · · · a d - 1 - - - ( 12 )
Least square fitting is utilized to obtain straight line coefficient (a 0, a 1..., a d-1);
According to the bearing of trend of one-dimensional manifold, often add a bit, use formula (12) and formula
e N = Σ i = 1 N [ | | y id - y ^ id | | 2 ] 2 - - - ( 13 )
Calculate (a 0, a 1..., a d-1) and e n, by e nwith maximum error e given in advance mAXrelatively, if e n<e mAX, then this point is joined in point range above, until e n>=e mAX, using the end points of N-1 point as this section of straight line.For the error of fitting threshold value e in sectional straight line fitting mAX, adopt formula
e MAX=ENER×ratio (14)
Calculate, wherein ENER is the normalized energy that low-dimensional embeds stream shape, shown in (15); Ratio is error of fitting ratio, manually need set, is set to 5 × 10 in the present embodiment -10.Wherein,
ENER = &Sigma; i = 1 I [ | | h i | | 2 ] 2 / max | h i | - - - ( 15 )
Using N point and N+1 point as initial 2 points of next section of fitting a straight line, go to step 1., until all sample point matching is complete.
After obtaining the end points of all segmented linear, using the higher-dimension HRIR corresponding to these cut-points as feature HRIR.
This programme feature HRIR collection chooses result as shown in Figure 5.In the present embodiment, feature samples clicks delivery block can adopt other subsection curve drafting, as secondary, trilinear matching.
In order to verify validity of the present invention, carry out computer simulation experiment.The database adopted during experiment is the left ear HRIR data measuring individual subject_003 in the HRIR database in CIPIC laboratory, and three-dimensional HRIR matrix size is D1 × D2 × L, and numerical value is 50 × 25 × 200.
Adopt subjective listen test:
Test HRIR collection: choose the HRIR collection data that two groups are respectively the subject_003 of CIPIC, and compression reconfiguration data set.
Test subject: 4 normal testers of the sense of hearing: 2 women, 2 male sex;
Test angle (φ, θ): 7 orientation are tested, be respectively (0 °,-35 °), (0 °, 0 °), (0 °, 35 °), (-22.5 °, 0 °), (33.8 °, 0 °), (61.9 °, 0 °), (90 °, 0 °).There are front and back owing to using general HRIR and obscure this possibility, these 7 orientation therefore chosen all are positioned at front half-plane, and head relative coordinate system as shown in Figure 6.
Testing audio: be two groups of HRIR convolution (rebuilding HRIR after original HRIR, compression) respectively that 0.4 second white noise of 44.1kHz is identical with orientation by sampling rate, obtain the binaural sound of 0.4 second.Repeated 10 times, do interval with 0.1 second quiet, obtain the 5 seconds testing audios in two groups, this orientation.Perception angular distance option table is as shown in table 1,
Table 1
Difference angle Describe Affinity score
Identical 5
0°~5° Just perceptible difference 3
5°~15° Significant difference 2
15°~30° Very different 1
>30° Complete difference 0
Two groups, each orientation testing audio is played to 4 testers respectively, carries out angle difference between audition alternative two groups.Then search reserved portion according to difference angle in table 2.The score in 7 orientation is done on average, obtains the listening test results shown in table 3.Listening test results as known from Table 2, three kinds of scheme HRIR after overcompression, reconstruct distinguish with original HRIR in audition.
Table 2
Scenario Name YXH LZL SJW LD
This programme 5.00 4.71 5.00 5.00
Calculated amount: when synthesizing binaural sound, the HRIR in required orientation is divided into three kinds of situations:
The first situation, when required HRIR belongs to feature HRIR collection, can directly use, and calculated amount is 0;
The second situation, when the both sides of required HRIR are feature HRIR, as horizontal angle index in Fig. 5 be 1, elevation angle index be 2 time, in elevation plane, the HRIR of these both sides, position belongs to feature HRIR collection, the HRIR of its both sides is used to carry out linear interpolation, reconstruct this orientation and need 200 real multiplications, 200 real additions.
The third situation, when required HRIR both sides not existing characteristics HRIR time, if horizontal angle index in Fig. 5 is 5, when elevation angle index is 2, the HRIR of this position needs by the feature HRIR linear interpolation of 4 around it, and now reconstructing this orientation HRIR needs 200 real multiplications, 600 real additions.
Comprehensive three kinds of situations, that average computational load of the method is 133.4 real multiplications, 202.6 real additions.
The present embodiment, head coherent pulse response HRIR data set divides into groups according to the elevation angle and/or horizontal angle by processor, by choosing the HRIR of reconstructed error minimum value in Local Liner Prediction LLE after grouping HRIR as higher-dimension HRIR sample point, by described HRIR sample point dimensionality reduction to lower dimensional space to low-dimensional HRIR, adopt sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point, according to elevation angle order and/or horizontal angle order, described feature HRIR sample is combined into three-dimensional matrice.Solve the technical matters that reconstructed error is little in HRIR selected characteristic HRIR process.The feature HRIR quantity of carrying out dividing into groups obtained according to the elevation angle and/or horizontal angle is less, thus make feature HRIR have more representativeness, adopt sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point, selected feature HRIR is more accurate.
Last it is noted that above each embodiment is only in order to illustrate technical scheme of the present invention, be not intended to limit; Although with reference to foregoing embodiments to invention has been detailed description, those of ordinary skill in the art is to be understood that: it still can be modified to the technical scheme described in foregoing embodiments, or carries out equivalent replacement to wherein some or all of technical characteristic; And these amendments or replacement, do not make the essence of appropriate technical solution depart from the scope of various embodiments of the present invention technical scheme.

Claims (4)

1. a head coherent pulse response data sets disposal route, is characterized in that, comprising:
Head coherent pulse response HRIR data set divides into groups according to the elevation angle and/or horizontal angle by processor;
Described processor is by choosing the HRIR of reconstructed error minimum value as higher-dimension HRIR sample point in Local Liner Prediction LLE after grouping HRIR;
Described HRIR sample point dimensionality reduction to lower dimensional space, is obtained low-dimensional HRIR by described processor;
Described processor adopts sectional straight line fitting selected characteristic HRIR sample point in described low-dimensional HRIR sample point;
Described feature HRIR sample is combined into three-dimensional matrice according to elevation angle order and/or horizontal angle order by described processor.
2. method according to claim 1, is characterized in that, described processor, by choosing the HRIR of reconstructed error minimum value in Local Liner Prediction LLE after grouping HRIR as HRIR sample point, comprising:
Processor chooses a HRIR sample point in HRIR after grouping, calculates the Euclidean distance between described HRIR sample point and other HRIR sample points, and chooses the point of proximity of described HRIR sample point according to described Euclidean distance;
Construct the local covariance matrix of described HRIR sample point, determine the conditional value of reconstructed error
Wherein, described K is and HRIR sample point x ineighbor point number, described j is the subscript of a jth neighbor point, described in for using a jth neighbor point x jreconstruct x iweights;
By described local covariance matrix Q iand described conditional value, adopt formula
Calculate the weights reconstruction matrix w of described HRIR sample point ij, wherein, described K is local covariance matrix Q idimension, described i is the subscript of i-th sample point, and described k is local covariance matrix Q ikth dimension, described j is the subscript of a jth neighbor point;
By described w ijsubstitute into reconstructed error function formula
Determine that the HRIR sample point corresponding to reconstructed error minimum value is the higher-dimension HRIR sample point that can carry out dimension-reduction treatment, wherein, described I is that sample is always counted, and described i is the subscript of i-th sample, and described K is setting and described HRIR sample point x ithe number of the sample point that Euclidean distance is minimum, described x ifor the described HRIR sample point chosen, described x jfor described x ia jth neighbor point, described min ε (W) is minimal reconstruction error amount;
Described HRIR sample point dimensionality reduction to lower dimensional space, is obtained low-dimensional HRIR, comprises by described processor:
Formula will be met
Described HRIR sample point substitute into formula
Obtain loss function
Choose formula
M=(I-W) T(I-W) (7)
Minimum d the nonzero eigenvalue of middle M is updated to described loss function, obtains with the proper vector of Y as the corresponding sample point mapping to lower dimensional space, wherein, and described y ifor described x ilow-dimensional map vector, described y jfor described y ia jth neighbor point, described min ε (Y) is loss function, and described i is the subscript of i-th sample, and described j is the subscript of a jth neighbor point, and described I is that sample is always counted, and described k is and y icontiguous counts.
3. method according to claim 1 and 2, is characterized in that, described processor adopts sectional straight line fitting selected characteristic HRIR sample point in the HRIR sample point of described low-dimensional, comprising:
Any two starting points for the one-dimensional manifold of low-dimensional HRIR group adopt formula
Calculated line coefficient (a 0, a 1..., a d-1), wherein, described y is the point of the one-dimensional manifold of low-dimensional HRIR group, and described d is space dimensionality, and described a is straight line coefficient;
Add N point according to the bearing of trend of one-dimensional manifold, adopt formula
Calculate N point error of fitting e n, wherein, described N counts for described any two starting point institute structures are straight, and described i is that i-th section of straight line is counted, and described y is the point of the one-dimensional manifold of low-dimensional HRIR group, and described d is space dimensionality, described in for the actual point of the one-dimensional manifold of low-dimensional HRIR group;
If the described error of fitting e of N point nbe less than maximum error e mAX, then N point is joined in the point range between two starting points; If the described error of fitting e of N point nbe greater than maximum error e mAXthen using the end points of N-1 point as this section of straight line, then the higher-dimension HRIR corresponding to described end points is feature HRIR.
4. method according to claim 3, is characterized in that, described e mAX=ENER × ratio, wherein, described ratio is error of fitting ratio, and described ENER is the normalized energy that low-dimensional embeds stream shape.
CN201410512710.2A 2014-09-26 2014-09-26 Head coherent pulse response data sets processing method Active CN104503963B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410512710.2A CN104503963B (en) 2014-09-26 2014-09-26 Head coherent pulse response data sets processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410512710.2A CN104503963B (en) 2014-09-26 2014-09-26 Head coherent pulse response data sets processing method

Publications (2)

Publication Number Publication Date
CN104503963A true CN104503963A (en) 2015-04-08
CN104503963B CN104503963B (en) 2018-01-09

Family

ID=52945361

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410512710.2A Active CN104503963B (en) 2014-09-26 2014-09-26 Head coherent pulse response data sets processing method

Country Status (1)

Country Link
CN (1) CN104503963B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105959877A (en) * 2016-07-08 2016-09-21 北京时代拓灵科技有限公司 Sound field processing method and apparatus in virtual reality device
CN106780141A (en) * 2016-12-15 2017-05-31 贵州电网有限责任公司电力科学研究院 A kind of power transmission lines overhauling plan optimization method and system based on manifold learning
CN108805104A (en) * 2018-06-29 2018-11-13 中国航空无线电电子研究所 Personalized HRTF obtains system
CN108885690A (en) * 2016-03-15 2018-11-23 自环绕公司 For generating the arrangement of head related transfer function filter
US11562471B2 (en) 2018-03-29 2023-01-24 Apple Inc. Arrangement for generating head related transfer function filters

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101938686A (en) * 2010-06-24 2011-01-05 中国科学院声学研究所 Measurement system and measurement method for head-related transfer function in common environment
US20130041648A1 (en) * 2008-10-27 2013-02-14 Sony Computer Entertainment Inc. Sound localization for user in motion
CN103209381A (en) * 2013-04-15 2013-07-17 清华大学 Multi-channel electro-acoustic transfer function measurement method
CN103731796A (en) * 2013-10-10 2014-04-16 华南理工大学 Multi-sound-source automatic measurement system for head related transfer function of distant field and near field

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130041648A1 (en) * 2008-10-27 2013-02-14 Sony Computer Entertainment Inc. Sound localization for user in motion
CN101938686A (en) * 2010-06-24 2011-01-05 中国科学院声学研究所 Measurement system and measurement method for head-related transfer function in common environment
CN103209381A (en) * 2013-04-15 2013-07-17 清华大学 Multi-channel electro-acoustic transfer function measurement method
CN103731796A (en) * 2013-10-10 2014-04-16 华南理工大学 Multi-sound-source automatic measurement system for head related transfer function of distant field and near field

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
汤永清: ""一种局部线性嵌入的空间听觉重建方法"", 《上海大学学报(自然科学版)》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108885690A (en) * 2016-03-15 2018-11-23 自环绕公司 For generating the arrangement of head related transfer function filter
US11557055B2 (en) 2016-03-15 2023-01-17 Apple Inc. Arrangement for producing head related transfer function filters
US11823472B2 (en) 2016-03-15 2023-11-21 Apple Inc. Arrangement for producing head related transfer function filters
CN105959877A (en) * 2016-07-08 2016-09-21 北京时代拓灵科技有限公司 Sound field processing method and apparatus in virtual reality device
CN105959877B (en) * 2016-07-08 2020-09-01 北京时代拓灵科技有限公司 Method and device for processing sound field in virtual reality equipment
CN106780141A (en) * 2016-12-15 2017-05-31 贵州电网有限责任公司电力科学研究院 A kind of power transmission lines overhauling plan optimization method and system based on manifold learning
US11562471B2 (en) 2018-03-29 2023-01-24 Apple Inc. Arrangement for generating head related transfer function filters
CN108805104A (en) * 2018-06-29 2018-11-13 中国航空无线电电子研究所 Personalized HRTF obtains system
CN108805104B (en) * 2018-06-29 2022-03-08 中国航空无线电电子研究所 Personalized HRTF acquisition system

Also Published As

Publication number Publication date
CN104503963B (en) 2018-01-09

Similar Documents

Publication Publication Date Title
US9369818B2 (en) Filtering with binaural room impulse responses with content analysis and weighting
US8489371B2 (en) Method and device for determining transfer functions of the HRTF type
CN104503963A (en) Method for processing head-related impulse response data set
CN105917407A (en) Identifying codebooks to use when coding spatial components of a sound field
CN107820158B (en) Three-dimensional audio generation device based on head-related impulse response
CN104408040A (en) Head related function three-dimensional data compression method and system
CN110619887B (en) Multi-speaker voice separation method based on convolutional neural network
CN106412793B (en) The sparse modeling method and system of head-position difficult labor based on spheric harmonic function
Bertet et al. 3D sound field recording with higher order ambisonics-objective measurements and validation of spherical microphone
Salvador et al. Design theory for binaural synthesis: Combining microphone array recordings and head-related transfer function datasets
Hu et al. Head related transfer function personalization based on multiple regression analysis
Miccini et al. A hybrid approach to structural modeling of individualized HRTFs
Spagnol Auditory model based subsetting of head-related transfer function datasets
CN104318064A (en) Three-dimensional head-related impulse response data compressing method based on canonical multi-decomposition
Lu et al. Personalization of head-related transfer function based on sparse principle component analysis and sparse representation of 3D anthropometric parameters
Lu et al. Head-related transfer function reconstruction with anthropometric parameters and the direction of the sound source: Deep learning-based head-related transfer function personalization
Barumerli et al. Localization in elevation with non-individual head-related transfer functions: comparing predictions of two auditory models
Chen et al. HRTF Representation with Convolutional Auto-encoder
CN109031202A (en) indoor environment area positioning system and method based on auditory scene analysis
O'Dwyer et al. A machine learning approach to detecting sound-source elevation in adverse environments
Chen et al. Interpolation method of head-related transfer functions based on common-pole/zero modeling
Algazi et al. Subject dependent transfer functions in spatial hearing
Grijalva et al. A recommender system for improving median plane sound localization performance based on a nonlinear representation of HRTFs
CN113449255B (en) Improved method and device for estimating phase angle of environmental component under sparse constraint and storage medium
O’Dwyer et al. Hrtf clustering for robust training of a dnn for sound source localization

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant