CN1236652C - Method for producing stereo sound effect - Google Patents

Method for producing stereo sound effect Download PDF

Info

Publication number
CN1236652C
CN1236652C CN 02140249 CN02140249A CN1236652C CN 1236652 C CN1236652 C CN 1236652C CN 02140249 CN02140249 CN 02140249 CN 02140249 A CN02140249 A CN 02140249A CN 1236652 C CN1236652 C CN 1236652C
Authority
CN
China
Prior art keywords
hrtf
monaural
sound effect
stereo sound
itd
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 02140249
Other languages
Chinese (zh)
Other versions
CN1466401A (en
Inventor
林柏廷
吴奇峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Silicon Integrated Systems Corp
Original Assignee
Silicon Integrated Systems Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Silicon Integrated Systems Corp filed Critical Silicon Integrated Systems Corp
Priority to CN 02140249 priority Critical patent/CN1236652C/en
Publication of CN1466401A publication Critical patent/CN1466401A/en
Application granted granted Critical
Publication of CN1236652C publication Critical patent/CN1236652C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Stereophonic System (AREA)

Abstract

The present invention discloses a method for generating stereo sound effects, which can use the technology of a monaural head corresponding conversion algorithm (HRTF) to synthesize a sound effect of double sound tracks in a monaural sound source. The method comprises the steps that a group of HRTF coefficients of one ear are recorded through the measurement operation of the monaural head corresponding conversion algorithm (HRTF) and used for establishing a data base of the monaural head corresponding conversion algorithm (HRTF) and a compensation curve of the time difference of binaural sound reception (ITD), and in this way, an external input monaural signal is converted into a stereo sound effect signal according to the established monaural HRTF data base and the ITD compensation curve.

Description

The production method of stereo sound effect
Technical field
The present invention relates to the processing of stereo sound effect, especially a kind of production method (implementation method of 3D audio) that utilizes monaural head corresponding conversion algorithm (HRTF) technology with the stereo sound effect of a synthetic dual track audio in a monaural source of sound.
Background technology
One typically uses some simple delay circuits and the phase filter data mixed processing with left and right acoustic channels around sound effect system, to simulate a stereo sound effect.Yet this mode can consequently cause the distortion of firsthand information because mixed processing is obscured the data of left and right acoustic channels usually.Again, above-mentioned typically around sound effect system can't from one listen the hearer forward and backward, and time four direction produce stereo sound effect, especially be not positioned at so-called " during dessert (sweet spot) when listening the hearer.Therefore, said system needs on the position of front and back sound filter to be installed, and an electrical system is provided again, when spending from left-hand rotation to the right side to allow listening hearer's rotary head 180, the sound of front can be heard via the filter of front, and when turning right, the sound of back can be heard via the filter of back to a left side.According to system's meter data that numerical analysis is derived, the output that scaler (scaler) is used for filter is gone into, and to adjust scope and the position that source of sound produces, ITD and two hears of sound density contrast (11D) (aftermentioned) for example to eliminate.Yet this will need a large amount of circuit elements and filtered power, so that audio when participating in the cintest to be provided.And then develop and the HRTF technology that is usually used in producing stereo sound effect at present therefore.Above-mentioned HRTF provides a data bank, build when measuring each sound that had both been sent on the allocation in the 360 degree spaces and being sent to two ears the frequency response characteristic that ear and brain are experienced in the data bank in.Utilize a digital signal processor (DSP) that above-mentioned frequency multiple response characteristic is done computing, can calculate and have stereo sound effect and hold little sound wave, this sound wave is sent to ear, just can make brain experience the senses of reality of 360 degree and produces the effect of three-dimensional sound.Fig. 1 shows that typical 360 degree HRTF measure.As shown in Figure 1, put a headform 12 in the center of a circle 10 and circle 10 is divided into 360 sections, specify an azimuth Control Parameter (360 degree control) and represent a unduplicated anchor point for every section.So, when the position of a light source successfully goes to next section by one section, the continuity that headform 12 perceives light source position moves.Measure according to above-mentioned HRTF, assumed position 0 is being aimed at a left ear of 12, to number 14 representatives.Position 90 is in 12 dead ahead, to number 16 representatives.Similarly, an auris dextra of 12 is being aimed in position 180, and to number 18 representatives, position 270 is in 12 dead astern, to number 20 representatives.Because the azimuth position parameter is around circle 10 configurations, therefore, position 0 and 360 can overlap the place of numbering 14.Use a range parameter to represent the scope of speaking or the distance of sound source.Central point with 12 is initial point (position 0), and 10 are divided into 360 circles and represent the sound source distance from initial point to circle.Wherein position 0 to 19 is a scope of 12, numbers 22 as shown.All the other positions 20 to 359 are with 24 representatives of diagram numbering.Wherein above-mentioned position 359 is for hearing the limit on the range of sounds.Certainly, the above-mentioned limit can adjust according to actual demand.
Aforementioned arrangements must framework in an anechoic chamber (echoic chamber), under each sound source in the different coordinate space of 360 degree (section), carry out the ears measurement respectively, with write down 48 or the 44.IKHZ sampling rate under 20HZ to the sound wave of 20KHZ.Be about sound wave and walk 20 centimeters that is scope 0-19 because sound wave is sent to standard time difference between two ears) time so that cause two to hear of the sound time difference (ITD).Again, the head of different people, shoulder, arm can absorb different acoustic wave energies, and at this moment the big small voice amplitude heard of two ears is different, thereby causes sound density contrast (11D) between two ears.So, utilize a simple sund source respectively left and right sides ear to be HRTF and measure and record, to finish the foundation of HRTF data bank.But, this mode is known as the personage who knows this technology, sound wave is run into the effect that object all can have reflection, the sound wave of the different wave length that changes through people's body, shoulder, head and external ear produces diffraction (diffractive) phenomenon in the auricular concha outside, and then to make the frequency response of the sound that the sound drum experienced also be different.These variations that are recorded in the HRTF data bank not only become along with the elevation angle, point sound source azimuth, scope and frequency, also easy because of the people, therefore quite arduous usually if will set up this class HRTF data bank, in general, add that back and forth the complete HRTF parameter of adjustment one cover will spend the time of several months and just can finish.
Summary of the invention
In view of this, a purpose of the present invention provides a kind of production method of stereo sound effect, and it uses head corresponding conversion algorithm (HRTF), with a synthetic dual track audio in a monaural source of sound.
The production method of stereo sound effect of the present invention comprises the following step:
Ears are measured to obtain the conversion function of two ears;
Via one group of HRTF coefficient of an ear of monaural HRTF measuring operation record, in order to set up monaural head corresponding conversion algorithm (HRTF) data bank;
According to the conversion function of described two ears to calculate sound time difference (ITD) compensated curve between above-mentioned two ears; And
According to above-mentioned two groups of monaural HRTF data bank and the ITD compensated curve set up, convert an outside monaural signal of importing to a stereo sound effect signal.
The production method of above-mentioned stereo sound effect further comprises the following step is adjusted this stereo sound effect signal according to this ITD compensated curve ITD module.The production method of above-mentioned stereo sound effect further comprises provides an adjustment instrument to a user, to set the head shadow parameter to the stereo sound effect surrounding effect of being expected.
The production method of the another kind of stereo sound effect that the present invention points out is measured to obtain the conversion function of two ears ears;
Write down one group of HRTF coefficient of an ear via a monaural HRTF measuring operation, in order to set up monaural head corresponding conversion algorithm (HRTF) data bank;
According to the above-mentioned conversion function of two above-mentioned ears to calculate sound time difference (ITD) compensated curve between above-mentioned two ears;
Coming separately according to the HRTF data bank of being set up, an outside input monaural signal becomes a near-end binaural signal and a far-end binaural signal;
Adjust an ITD module of said distal ends binaural signal according to the HRTF data bank of being set up and ITD compensated curve;
Provide an adjustment instrument to a user, to set the stereo sound effect surrounding effect that above-mentioned near-end binaural signal and above-mentioned both head shadow parameters of far-end binaural signal with adjusted ITD module are extremely expected.
The present invention can utilize monaural head corresponding conversion algorithm (HRTF) technology with a synthetic dual track audio in a monaural source of sound, thereby makes the setting of stereo sound effect more convenient, and has saved the adjustment time.
Description of drawings
Fig. 1 shows that one has the typical HRTF survey map of 360 degree;
Fig. 2 shows a monaural HRTF survey map of the present invention;
Fig. 3 shows one according to dual track composite structure figure of the present invention;
Fig. 4 shows that one is used for the ITD compensated curve figure of the ITD ripple device of Fig. 3 according to the present invention;
Fig. 5 be one according to the present invention the operational flowchart of Fig. 3.
Embodiment
In full, the like function is with the representative of similar elements numbering.
Fig. 2 shows a monaural HRTF survey map of the present invention.Above-mentioned monaural HRTF measures the survey data (one group of filter number) that only writes down left ear or auris dextra in Fig. 2, and it is different from typical HRTF measures, and must write down left ear and both survey datas of auris dextra respectively.As shown in Figure 2, a loudspeaker 21 is placed in apart from 1.4 meters places far away of head 22 left ears (as receiving Mike's wind action), in order to transmit sound to left ear.Because people's face presents symmetry haply, carrying out HRTF when measuring, can only measure right-half plane or left demifacet, (for example, have azimuth angle theta presetting at each measuring point ACAnd elevation angle theta ABA point position on) sound that broadcasts from loudspeaker with 44.1kHZ sampling by microphone of record, record one group of filter number (impulse response) and obtain conversion function (transfer function) (being transformed into the time field from frequency field) between them in frequency field can be finished the HRTF data bank.Again, can use a time eqalizing cricuit (not shown) to eliminate the influence of measuring instrument, for example the frequency response of loudspeaker 21, head 22 and reception microphone (measuring the ear of side) itself.So, the measurement that is stored in the HRTF data bank will be and comprise the conversion function that head has nothing to do at interior measuring instrument itself.In addition, above-mentioned data bank is actually because of difficulty if think directly to implement, because it must be just possible with very long FIR filter, and the above journey in normally hundreds of rank (tap) time, we must do further abbreviation again.Because to the conversion function of left ear with can comprise into to the phase difference between the conversion function of auris dextra that the ITD module compensates, so we do the minimum phase conversion to the conversion function of two ears, most energy that found that of observing again after changing all concentrates on the place on 32 rank, front (tap).So, to the conversion function that survey data produces via minimum phase conversion back, can the effectively stereoeffect that simulates 3-D as long as store preceding 32 rank of this conversion function.The above-mentioned HRTF data bank that comprises the FIR filter number on preceding 32 rank just can utilize one 32 rank FIR filters to carry out simply.Just, everyone capitiform all is not quite similar when considering practical application, for the influence of 22 pairs of conversion functions of head is removed, with reference to figure 3, no matter be to left ear or the sound of auris dextra all can compensate the different feeling of different capitiforms to the 3-D stereo sound effect via the compensating filter 32,34 of head shadow effect.In addition, according to acousticoppsychology, the near-end loudspeaker is little of can being left in the basket to the delay of left and right sides ear, but the time difference between far-end loudspeaker to two ear can't be left in the basket, so need to use an ITD filter do compensation.Like this, utilize very simple filter framework, just can reach the impression of different individually 3-D stereo sound effects, to save the cost and the time of calculating.Fig. 4 shows that one is used for the ITD compensated curve figure of the ITD filter of Fig. 3 according to the present invention.In Fig. 4, the ITD module of above-mentioned compensated curve when the 0 degree elevation angle (the C axle among Fig. 2 just).As shown in Figure 4, take out on the same measurement position (azimuth) in the former survey data storehouse, the decay part of conversion function that is recorded by left ear and the conversion function that recorded by auris dextra is done cross correlation computing (crosscorrelation) (can utilize Gaussian function formula (Gaussian function) to multiply by SIN function (sine) obtains), to find out maximum length of delay, this value is the ITD reference value that we will compensate.Then, with the ITD compensated curve of deriving among Fig. 4 with the ITD filter configuration that shows among Fig. 3 between monaural HRTF data bank and distal end I IR filter.Again, based on above-mentioned everyone different capitiforms, simulate the stereo sound effect except setting up 32 common rank FIR filters, also provide an adjustment instrument to the user, to set the head shadow parameter of near-end and far-end shadow effect wave-wave device, provide the method for adjusting individual difference by this to reach the stereo sound effect surrounding effect.This adjustment instrument provides two parameter setting apparatus, allows the user obviously experience the degree of stereo sound effect with the limit of adjusting each iir filter and value at zero point (pole and zero values) to one.
Comprehensively above-mentioned, the operating process of Fig. 3 structure is shown among Fig. 5.In Fig. 5, this operating process comprises the following step: set up a monaural HRTF data bank and an ITD compensated curve (S1); Reach according to monaural HRTF data bank of being set up and ITD compensated curve and carry out ITD adjustment and shadow effect adjustment (S2).As shown in Figure 5, the above-mentioned monaural HRTF data bank of setting up comprises 32 rank FIR filter coefficients and utilizes one 32 rank FIR filters to implement this monaural HRTF data bank.Though be illustrative examples with 32 rank, yet on the practice, be used to dispose the required exponent number of using of above-mentioned data bank, be not limited to 32 rank herein can decide according to the actual requirements.The slope of the ITD compensated curve of being set up can present the curve (as shown in Figure 4) of an intimate proportionality constant, so, the user can oneself think the most tangible setting of 3-D effect by adjusting ITD module of the present invention and far and near termination portion's shadow effect filter (iir filter), finding easily.Therefore,, must not make the HRTF measurement and/or change the filter number that is stored in the data bank, just can produce the stereo sound effect that makes the user satisfied individual difference via stereo sound effect production method of the present invention.
Though the present invention discloses as above with preferred embodiment; right its is not in order to qualification the present invention, any those who are familiar with this art, without departing from the spirit and scope of the invention; when can doing a little change and retouching, so the present invention's protection range is as the criterion when looking claims scope person of defining.

Claims (8)

1. the production method of a stereo sound effect is characterized in that: may further comprise the steps:
Ears are measured to obtain the conversion function of two ears;
Via one group of HRTF coefficient of an ear of monaural HRTF measuring operation record, in order to set up monaural head corresponding conversion algorithm (HRTF) data bank;
According to the conversion function of described two ears to calculate sound time difference (ITD) compensated curve between above-mentioned two ears; And
According to above-mentioned two groups of monaural HRTF data bank and the ITD compensated curve set up, convert an outside monaural signal of importing to a stereo sound effect signal.
2. the production method of stereo sound effect as claimed in claim 1, it is characterized in that: this is set up monaural head corresponding conversion algorithm (HRTF) data bank and comprises finite impulse response (FIR) (FIR) filter coefficient, and above-mentioned finite impulse response (FIR) (FIR) filter coefficient is implemented with a FIR filter.
3. the production method of stereo sound effect as claimed in claim 1 is characterized in that: comprise that further a step is for adjusting the ITD module of this stereo sound effect signal according to this ITD compensated curve.
4. the production method of stereo sound effect as claimed in claim 1 is characterized in that: comprise that further a step is for providing an adjustment instrument to a user, to set the head shadow parameter to the stereo sound effect surrounding effect of being expected.
5. the production method of stereo sound effect as claimed in claim 4 is characterized in that: this head shadow parameter comprises that unlimited pulse rings (IIRs).
6. the production method of a stereo sound effect is characterized in that: comprise the following steps:
Ears are measured to obtain the conversion function of two ears;
Write down one group of HRTF coefficient of an ear via a monaural HRTF measuring operation, in order to set up monaural head corresponding conversion algorithm (HRTF) data bank;
According to the above-mentioned conversion function of two above-mentioned ears to calculate sound time difference (ITD) compensated curve between above-mentioned two ears;
Coming separately according to the HRTF data bank of being set up, an outside input monaural signal becomes a near-end binaural signal and a far-end binaural signal;
Adjust an ITD module of said distal ends binaural signal according to the HRTF data bank of being set up and ITD compensated curve;
Provide an adjustment instrument to a user, to set the stereo sound effect surrounding effect that above-mentioned near-end binaural signal and above-mentioned both head shadow parameters of far-end binaural signal with adjusted ITD module are extremely expected.
7. the production method of stereo sound effect as claimed in claim 6, it is characterized in that: this is set up a monaural head corresponding conversion algorithm HRTF data bank and comprises finite impulse response (FIR) (FIR) filter coefficient, and above-mentioned finite impulse response (FIR) (FIR) filter coefficient is implemented with a FIR filter.
8. the production method of stereo sound effect as claimed in claim 6 is characterized in that: this head shadow parameter comprises that unlimited pulse rings (IIRs).
CN 02140249 2002-07-02 2002-07-02 Method for producing stereo sound effect Expired - Fee Related CN1236652C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 02140249 CN1236652C (en) 2002-07-02 2002-07-02 Method for producing stereo sound effect

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 02140249 CN1236652C (en) 2002-07-02 2002-07-02 Method for producing stereo sound effect

Publications (2)

Publication Number Publication Date
CN1466401A CN1466401A (en) 2004-01-07
CN1236652C true CN1236652C (en) 2006-01-11

Family

ID=34147533

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 02140249 Expired - Fee Related CN1236652C (en) 2002-07-02 2002-07-02 Method for producing stereo sound effect

Country Status (1)

Country Link
CN (1) CN1236652C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101938686A (en) * 2010-06-24 2011-01-05 中国科学院声学研究所 Measurement system and measurement method for head-related transfer function in common environment

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR3040807B1 (en) * 2015-09-07 2022-10-14 3D Sound Labs METHOD AND SYSTEM FOR DEVELOPING A TRANSFER FUNCTION RELATING TO THE HEAD ADAPTED TO AN INDIVIDUAL
CN105611481B (en) * 2015-12-30 2018-04-17 北京时代拓灵科技有限公司 A kind of man-machine interaction method and system based on spatial sound
CN107889044B (en) * 2017-12-19 2019-10-15 维沃移动通信有限公司 The processing method and processing device of audio data
CN109005496A (en) * 2018-07-26 2018-12-14 西北工业大学 A kind of HRTF middle vertical plane orientation Enhancement Method
CN109754825B (en) * 2018-12-26 2021-02-19 广州方硅信息技术有限公司 Audio processing method, device, equipment and computer readable storage medium
CN114866948B (en) * 2022-04-26 2024-07-05 北京奇艺世纪科技有限公司 Audio processing method, device, electronic equipment and readable storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101938686A (en) * 2010-06-24 2011-01-05 中国科学院声学研究所 Measurement system and measurement method for head-related transfer function in common environment
CN101938686B (en) * 2010-06-24 2013-08-21 中国科学院声学研究所 Measurement system and measurement method for head-related transfer function in common environment

Also Published As

Publication number Publication date
CN1466401A (en) 2004-01-07

Similar Documents

Publication Publication Date Title
Brinkmann et al. A high resolution and full-spherical head-related transfer function database for different head-above-torso orientations
Brown et al. An efficient HRTF model for 3-D sound
KR100964353B1 (en) Method for processing audio data and sound acquisition device therefor
US6078669A (en) Audio spatial localization apparatus and methods
US6611603B1 (en) Steering of monaural sources of sound using head related transfer functions
CN104010264B (en) The method and apparatus of binaural audio signal process
US9237398B1 (en) Motion tracked binaural sound conversion of legacy recordings
CN1937854A (en) Apparatus and method of reproduction virtual sound of two channels
JP2012004668A (en) Head transmission function generation device, head transmission function generation method, and audio signal processing apparatus
US20110026745A1 (en) Distributed signal processing of immersive three-dimensional sound for audio conferences
Akeroyd et al. The binaural performance of a cross-talk cancellation system with matched or mismatched setup and playback acoustics
Masiero Individualized binaural technology: measurement, equalization and perceptual evaluation
US7921016B2 (en) Method and device for providing 3D audio work
Pörschmann et al. How wearing headgear affects measured head-related transfer functions
Rothbucher et al. Comparison of head-related impulse response measurement approaches
CN1236652C (en) Method for producing stereo sound effect
Braren et al. A high-resolution head-related transfer function data set and 3D-scan of KEMAR
US11032660B2 (en) System and method for realistic rotation of stereo or binaural audio
Pausch et al. SCaLAr–A surrounding spherical cap loudspeaker array for flexible generation and evaluation of virtual acoustic environments
Kuntz et al. Sound field synthesis for psychoacoustic research: In situ evaluation of auralized sound pressure level
Jakka Binaural to multichannel audio upmix
Horbach et al. Real-time rendering of dynamic scenes using wave field synthesis
US11589181B1 (en) System and method for realistic rotation of stereo or binaural audio
Gardner Spatial audio reproduction: Towards individualized binaural sound
TW546982B (en) Implementation method of 3D audio

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20060111

Termination date: 20100702