CN107105384B

CN107105384B - Method for synthesizing a near-field virtual sound image on the median plane

Info

Publication number: CN107105384B
Application number: CN201710347867.8A
Authority: CN
Inventors: 钟小丽
Original assignee: South China University of Technology SCUT
Current assignee: South China University of Technology SCUT
Priority date: 2017-05-17
Filing date: 2017-05-17
Publication date: 2018-11-02
Anticipated expiration: 2037-05-17
Also published as: CN107105384A

Abstract

The invention relates to a method for synthesizing a near-field virtual sound image on the median plane, comprising the following steps: a far-field head-related transfer function HRTF_far is given, for which the distance between the sound source and the head center is D; the head radius a and the distance d between the near-field target sound image on the median plane and the head center are determined; the correction angle θ of the right ear R is calculated according to a formula; the far-field HRTF_far(θ, R) is extracted as the right-ear HRTF of the near-field target sound image; by the left-right symmetry about the median plane, the far-field HRTF_far(-θ, L) is extracted as the left-ear HRTF of the near-field target sound image; the binaural HRTFs of the near-field target sound image are convolved with the input sound signal, realizing headphone-based 3D virtual playback of a near-field sound image on the median plane. The method derives near-field HRTFs from far-field HRTFs, so that near-field virtual sound images on the median plane can be synthesized easily and the cumbersome measurement of near-field HRTFs is avoided.

Description

Method for synthesizing a near-field virtual sound image on the median plane

Technical field

The present invention relates to the technical field of 3D virtual sound, and in particular to a method for synthesizing near-field virtual sound on the median plane based on far-field information.

Background technology

Virtual sound reproduction uses acoustic signal processing to simulate a virtual sound image at a specific spatial position, giving the listener a sense of immersion and presence. Because the reproduction device (typically headphones) is simple and portable, virtual sound reproduction is widely used in multimedia computers, personal mobile audio terminals, and similar fields. The head-related transfer function (HRTF) is the key to virtual sound signal processing. The HRTF is the acoustic transfer function from a sound source to the two ears under free-field conditions, and it depends on the source direction, the source distance, the sound frequency, the listener's anatomy, and so on. When the distance from the source to the listener's head is greater than 1.0 m, the HRTF changes little with distance and is called a far-field HRTF; when the distance is less than 1.0 m, the HRTF changes significantly with distance and is called a near-field HRTF. The near-field HRTF carries information for localizing the distance of the source, so it is of great significance in research on the mechanisms of binaural distance localization; meanwhile, with the spread of hand-held playback devices (such as mobile phones), near-field HRTFs are also needed for the signal processing of near-field virtual sound reproduction.

At present, near-field HRTF data are obtained mainly by measurement. Actual near-field HRTF measurement faces three main difficulties: (1) because the measurement source is close to the subject, multiple scattering and diffraction of the sound wave between them are unavoidable, so a dedicated small-sized sound source is required; (2) because the near-field HRTF depends on distance, HRTFs must be measured separately at different distances, which is a very heavy workload; (3) the HRTF depends on the subject's anatomy (such as head size and the fine structure of the pinna), so in principle, to obtain the best reproduction effect, the near-field HRTFs of each listener (i.e., subject) must be measured. Several research groups have established far-field HRTF databases and published them on the Internet, such as the MIT and CIPIC HRTF databases. If near-field HRTFs at different distances could be derived from known far-field HRTFs, the above measurement difficulties could be avoided and near-field HRTFs could be obtained conveniently.

Spatial localization of a sound image is divided mainly into horizontal localization and vertical localization. Horizontal localization depends mainly on the differences between the left-ear and right-ear HRTFs, whereas vertical localization (especially sound image localization on the median plane) relies mainly on the spectral features of the HRTF. The accuracy of the HRTF spectral features therefore determines the accuracy of sound image localization on the median plane; when deriving near-field HRTFs from far-field HRTFs on the median plane, special attention must be paid to accurately reproducing the spectral features of the near-field HRTF.

HRTF research (including existing databases) generally uses a head-centered coordinate system, in which the angle of an HRTF is defined as the angle of the sound source relative to the head center. However, the HRTF itself is defined as the transfer function from the source to the two ears, and HRTF data are picked up at the ears. Therefore, when deriving near-field HRTFs from far-field HRTFs, the angle of the source relative to the ear (rather than the head center) must be considered in order to ensure the accuracy of the near-field HRTF spectral features. Calculations show that for a far-field source the angles relative to the head center and relative to the ear are roughly equal, whereas for a near-field source the two angles differ considerably, by as much as 30°. This means that in the commonly used head-centered coordinate system, a far-field HRTF cannot be used directly to derive the near-field HRTF at the same angle; the angular deviation of the near-field source between the ear-centered and head-centered coordinate systems must be taken into account.
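The size of this deviation can be illustrated with a minimal sketch. It assumes a spherical head with the ear on the interaural axis, perpendicular to the median plane, and a source on the median plane; the function name and the sample distances are hypothetical and are not taken from the patent. Under these assumptions the angle between the ear-to-source and center-to-source directions is arctan(a/d).

```python
import math

def ear_vs_center_deviation_deg(d, a=0.0875):
    """Angle in degrees between the directions ear->source and
    head-center->source, for a source on the median plane.

    d: distance from head center to source, in meters
    a: head radius in meters (8.75 cm is the average value given in the text)
    Assumption (not from the source): the ear lies on the interaural axis,
    which is perpendicular to the median plane.
    """
    if d <= 0 or a <= 0:
        raise ValueError("d and a must be positive")
    return math.degrees(math.atan2(a, d))

# Illustrative distances only: the deviation grows quickly as the source
# approaches the head and stays small at far-field distances.
for d in (0.15, 0.25, 0.5, 1.0, 1.4):
    print(f"d = {d:4.2f} m -> deviation = {ear_vs_center_deviation_deg(d):5.1f} deg")
```

With these assumptions the deviation is only a few degrees at 1 m and beyond, and it passes 30° at about 15 cm, which is in line with the magnitude quoted above.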

Summary of the invention

To address the angular deviation that arises when deriving near-field HRTFs from far-field HRTFs, the present invention provides a method for obtaining median-plane near-field HRTFs based on angle correction. Starting from the premise that the far and near sources have the same angle relative to the ear, the invention derives the angle (i.e., the correction angle) of the far-field HRTF in the head-centered coordinate system that corresponds to a median-plane near-field HRTF, and then extracts the corresponding binaural HRTF data from an existing far-field HRTF database in the head-centered coordinate system for subsequent synthesis of the near-field virtual sound image on the median plane. The method can accurately and conveniently synthesize a virtual sound image at any source distance on the median plane.

The object of the present invention is achieved by the following technical solution.

A method for synthesizing a near-field virtual sound image on the median plane, comprising the steps of:

Step 1: input the known data, including a far-field HRTF baseline database and three distance parameters, the three distance parameters being the distance D between the far-field sound source and the head center, the head radius a, and the distance d between the near-field target sound image on the median plane and the head center;

Step 2: calculate the correction angle θ of the right ear R according to the formula:

where, by varying the distance d between the near-field target sound image on the median plane and the head center, the correction angles for different near-field distances can be obtained;

Step 3: extract the far-field HRTF_far(θ, R) as the right-ear HRTF of the near-field target sound image, i.e., HRTF_near(R); extract the far-field HRTF_far(-θ, L) as the left-ear HRTF of the near-field target sound image, i.e., HRTF_near(L);

Step 4: convolve the binaural HRTF_near of the near-field target sound image with the input sound signal, and feed the result to headphones or loudspeakers for playback.
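Steps 2 and 3 can be sketched in Python as follows. The formula of step 2 is not reproduced in this text, so the expression for θ used here is only a candidate reconstructed from the Fig. 2 geometry (OA = a, OB = d, OC = D, with A, B, C collinear), under the added assumption that OA is perpendicular to OB. The database layout (a dict keyed by angle and ear), the nearest-angle selection, and all function names are hypothetical; the sign convention for θ must be matched to whichever database is actually used.

```python
import math

def correction_angle_deg(d, D, a=0.0875):
    """Candidate correction angle theta (degrees) for the right ear.

    Reconstructed from the Fig. 2 geometry, not copied from the patent:
    theta = arctan(a/d) - arcsin(a*d / (D*sqrt(a^2 + d^2))).
    d, D, a are in meters; a defaults to the 8.75 cm average head radius.
    """
    if not (a > 0 and 0 < d <= D):
        raise ValueError("expected a > 0 and 0 < d <= D (all in meters)")
    alpha = math.atan2(a, d)                          # angle OBA
    beta = math.asin(a * d / (D * math.hypot(a, d)))  # angle OCB
    return math.degrees(alpha - beta)

def pick_binaural_pair(far_db, theta_deg):
    """Select far-field entries for (theta, 'R') and (-theta, 'L').

    far_db is a hypothetical dict {(angle_deg, ear): data}. The nearest
    available angle is used, which is an illustrative choice only.
    """
    def nearest(target, ear):
        angles = [ang for (ang, e) in far_db if e == ear]
        return min(angles, key=lambda ang: abs(ang - target))
    right = far_db[(nearest(theta_deg, "R"), "R")]
    left = far_db[(nearest(-theta_deg, "L"), "L")]
    return left, right

# Illustrative values: D = 1.0 m measurement distance, target at d = 0.2 m.
theta = correction_angle_deg(d=0.2, D=1.0)
print(f"theta = {theta:.2f} deg")
```

Under these assumptions θ is zero when d equals D and grows as the target moves closer to the head, so a single far-field database can serve every near-field distance by changing only the lookup angle.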

Further, the far field is the region more than 1 meter from the head center, and the near field is the region within 1 meter of the head center.

Further, the features of the far-field HRTF do not change with distance, whereas the features of the near-field HRTF change with distance.

Further, the HRTF baseline database in step 1 is any of the far-field HRTF databases currently publicly available, including the MIT HRTF database of the Massachusetts Institute of Technology and the CIPIC HRTF database of the University of California, Davis.

Further, the head radius a in step 1 takes the average value of 8.75 cm.

Further, the head radius a in step 1 is obtained by direct measurement.

Further, the binaural signals synthesized in step 4 are fed directly to headphones for playback; when loudspeakers are used for playback, a crosstalk cancellation algorithm needs to be inserted.

The principle of the present invention is as follows. A sound wave emitted by any sound source in space reaches the two ears after interacting with the listener's anatomy (such as the head, pinnae, and torso). The HRTF reflects the changes the sound wave undergoes during this propagation. Whether in the far field or the near field, as long as the azimuth of the source relative to the ear is the same, the corresponding propagation process is the same, so far-field and near-field HRTFs at the same angle (relative to the ear) have highly similar spectral structures. This is the theoretical basis for deriving near-field HRTFs from far-field HRTFs. However, far-field HRTF research and databases usually use a head-centered coordinate system, and studies have found that a near-field source shows an obvious angular deviation between its angle relative to the head center and its angle relative to the ear. This means that far-field and near-field HRTFs at the same angle relative to the ear lie at different angles in the head-centered coordinate system. If a far-field HRTF from an existing head-centered coordinate system is used to derive a near-field HRTF, an angle correction must be applied to ensure that the extracted far-field HRTF and the near-field target HRTF are at the same angle relative to the ear. Because the accuracy of the HRTF spectrum is vital to the accurate reproduction of a median-plane virtual sound image, the present invention focuses on deriving the angle correction formula on the median plane. A similar method can also be extended beyond the median plane.

Compared with the prior art, the advantages of the present invention are:

(1) near-field HRTF data are derived from existing far-field HRTF data, which completely avoids complicated near-field HRTF measurement;

(2) the angular deviation of the near-field HRTF relative to the ear versus relative to the head center is taken into account and a corresponding correction formula is used, which improves the accuracy of deriving near-field HRTFs from far-field HRTFs;

(3) the present invention can be implemented on a multimedia computer as software written in an algorithmic language, or implemented with a signal-processing chip circuit or a dedicated integrated circuit, and can be used for close-range virtual sound reproduction on multimedia computers and various hand-held portable audio playback devices (such as mobile phones).

Description of the drawings

Fig. 1 is a schematic diagram of the implementation principle of the present invention.

Fig. 2 is a schematic diagram of the angle correction of the present invention.

Fig. 3 is a signal flow diagram of the multimedia computer implementation of an embodiment of the present invention.

Detailed description of the embodiments

The present invention is further described below with reference to the drawings, but the scope of protection claimed is not limited to the scope shown by the embodiments.

Fig. 1 is a schematic diagram of the implementation principle of the method for synthesizing a near-field virtual sound image on the median plane according to the present invention. The method takes into account the angular deviation, in the near-field case, of the sound source relative to the head center versus relative to the ear, and derives the corresponding angle correction formula. Using the angle correction formula, near-field target HRTF data can be accurately extracted from a known far-field HRTF baseline database, which ensures the accuracy of near-field virtual sound image synthesis on the median plane. The method avoids complicated near-field HRTF measurement and makes it easy to obtain a virtual sound image at any near-field distance.

Step 2: calculate the correction angle θ of the right ear R according to the formula:

Specifically, the far field is the region more than 1 meter from the head center, and the near field is the region within 1 meter of the head center.

Specifically, the features of the far-field HRTF do not change with distance, whereas the features of the near-field HRTF change with distance.

Specifically, the HRTF baseline database in step 1 is any of the far-field HRTF databases currently publicly available, including the MIT HRTF database of the Massachusetts Institute of Technology and the CIPIC HRTF database of the University of California, Davis. Publicly available HRTF baseline databases all state the measurement distance of their data, i.e., the distance D between the far-field sound source and the head center.

Specifically, the head radius a in step 1 takes the average value of 8.75 cm, or it can be obtained by direct measurement. For example, a motion-sensing device (such as the Kinect produced by Microsoft) can be used to capture the three-dimensional dimensions of the listener's head and thereby determine the head radius of a specific listener.

Specifically, the binaural signals synthesized in step 4 can be fed directly to headphones for playback; if loudspeakers are used for playback, a crosstalk cancellation algorithm needs to be inserted. The specific mathematical form of the crosstalk cancellation algorithm depends on the number and arrangement of the loudspeakers.
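The headphone branch of step 4 can be sketched as a time-domain convolution. This is a minimal illustration, assuming the selected HRTFs are available as equal-length head-related impulse responses (HRIRs, the time-domain counterpart of the HRTF) sampled at the same rate as the input signal; the function name is hypothetical, and crosstalk cancellation for loudspeaker playback is not covered.

```python
import numpy as np

def render_binaural(mono, hrir_left, hrir_right):
    """Convolve a mono input with the left/right near-field HRIRs.

    mono, hrir_left, hrir_right: 1-D arrays at a common sample rate.
    Returns an array of shape (len(mono) + len(hrir) - 1, 2) whose
    columns are the left and right headphone signals.
    """
    mono = np.asarray(mono, dtype=float)
    hrir_left = np.asarray(hrir_left, dtype=float)
    hrir_right = np.asarray(hrir_right, dtype=float)
    if hrir_left.shape != hrir_right.shape:
        raise ValueError("left and right HRIRs must have the same length")
    left = np.convolve(mono, hrir_left)    # full linear convolution
    right = np.convolve(mono, hrir_right)
    return np.stack([left, right], axis=1)

# Toy data only: a unit impulse returns the HRIRs themselves, zero-padded.
out = render_binaural([1.0, 0.0, 0.0], [0.5, 0.25], [0.25, 0.5])
print(out.shape)
```

For long signals an FFT-based or block-wise convolution would usually be preferred for speed; the direct form is used here only because it is the shortest correct instance of the operation.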

Fig. 2 is a schematic diagram of the angle correction. Because the figure is viewed from above the head looking down, the median plane (defined as the section that divides the human body into equal left and right halves) appears as a dashed line bisecting the head into left and right. Although a real human head is not an ideal sphere, existing research has confirmed that a sphere is a good approximation of the head; moreover, because human hearing has limited resolution, this approximation does not cause audible distortion. In the figure, point O represents the head center, point A represents the right ear, point B represents the near-field target sound image to be synthesized, ∠BOC = θ is the correction angle, and point C represents the known far-field sound image determined from the correction angle; OA = a, OB = d, OC = D. As the figure shows, AB and AC lie on the same straight line, so the near-field sound image B and the far-field sound image C are at the same angle relative to the right ear, and the corresponding far-field and near-field HRTFs have similar spectral structures. It is therefore reasonable to extract the HRTF at far-field point C as the HRTF at near-field point B, i.e., to derive the HRTF at B from the HRTF at C. Fig. 3 is the flow chart of the computer implementation, which can be programmed in MATLAB or C++.
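The formula for θ itself is not reproduced in this text. As a hedged reconstruction from the geometry just described, and under the added assumptions that OA is perpendicular to OB (the ear lies on the interaural axis of a spherical head) and that B lies between A and C, the sine rule in triangle OBC gives one candidate expression. The auxiliary angles α and β and the sign convention for θ are introduced here for illustration only and are not taken from the patent.

```latex
% Assumptions (not from the source text): OA \perp OB, and B lies between A and C.
\begin{align*}
\alpha &= \angle OBA = \arctan\frac{a}{d}, \qquad \angle OBC = \pi - \alpha,\\
\frac{\sin\beta}{d} &= \frac{\sin(\pi-\alpha)}{D}
  \;\Rightarrow\; \beta = \angle OCB
  = \arcsin\!\left(\frac{a\,d}{D\sqrt{a^{2}+d^{2}}}\right),\\
\theta &= \angle BOC = \pi - (\pi-\alpha) - \beta
  = \arctan\frac{a}{d} - \arcsin\!\left(\frac{a\,d}{D\sqrt{a^{2}+d^{2}}}\right).
\end{align*}
```

As a sanity check, this expression vanishes when d = D (B and C coincide), and for a = 8.75 cm, D = 1 m, and d = 0.2 m it gives a correction angle of roughly 19°.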

The above embodiments of the present invention are merely examples given to illustrate the invention clearly and do not limit its embodiments. Those of ordinary skill in the art can make other variations or changes in different forms on the basis of the above description. It is neither necessary nor possible to list all embodiments exhaustively. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the claims of the present invention.

Claims

1. A method for synthesizing a near-field virtual sound image on the median plane, characterized by comprising the steps of:

Step 1: input the known data, including a far-field HRTF baseline database and three distance parameters, the three distance parameters being the distance D between the far-field sound source and the head center, the head radius a, and the distance d between the near-field target sound image on the median plane and the head center;

Step 3: extract the far-field HRTF_far(θ, R) as the right-ear HRTF of the near-field target sound image, i.e., HRTF_near(R); extract the far-field HRTF_far(-θ, L) as the left-ear HRTF of the near-field target sound image, i.e., HRTF_near(L);

Step 4: convolve the binaural HRTF_near of the near-field target sound image with the input sound signal, and feed the result to headphones or loudspeakers for playback.

2. The method for synthesizing a near-field virtual sound image on the median plane according to claim 1, characterized in that: the far field is the region more than 1 meter from the head center, and the near field is the region within 1 meter of the head center.

3. The method for synthesizing a near-field virtual sound image on the median plane according to claim 2, characterized in that: the features of the far-field HRTF do not change with distance, whereas the features of the near-field HRTF change with distance.

4. The method for synthesizing a near-field virtual sound image on the median plane according to claim 1, characterized in that: the HRTF baseline database in step 1 is any of the far-field HRTF databases currently publicly available, the publicly available far-field HRTF databases including the MIT HRTF database of the Massachusetts Institute of Technology and the CIPIC HRTF database of the University of California, Davis.

5. The method for synthesizing a near-field virtual sound image on the median plane according to claim 1, characterized in that: the head radius a in step 1 takes the average value of 8.75 cm.

6. The method for synthesizing a near-field virtual sound image on the median plane according to claim 1, characterized in that: the head radius a in step 1 is obtained by direct measurement.

7. The method for synthesizing a near-field virtual sound image on the median plane according to claim 1, characterized in that: the binaural signals synthesized in step 4 are fed directly to headphones for playback; when loudspeakers are used for playback, a crosstalk cancellation algorithm needs to be inserted.