CN101419801A - Method for subband measuring correlation sensing characteristic between ears and device thereof - Google Patents

Method for subband measuring correlation sensing characteristic between ears and device thereof Download PDF

Info

Publication number
CN101419801A
CN101419801A CNA2008102366464A CN200810236646A CN101419801A CN 101419801 A CN101419801 A CN 101419801A CN A2008102366464 A CNA2008102366464 A CN A2008102366464A CN 200810236646 A CN200810236646 A CN 200810236646A CN 101419801 A CN101419801 A CN 101419801A
Authority
CN
China
Prior art keywords
ears
frequency
sensing characteristic
subband
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2008102366464A
Other languages
Chinese (zh)
Other versions
CN101419801B (en
Inventor
胡瑞敏
陈水仙
涂卫平
刘雨田
陈文琴
杭波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN2008102366464A priority Critical patent/CN101419801B/en
Publication of CN101419801A publication Critical patent/CN101419801A/en
Application granted granted Critical
Publication of CN101419801B publication Critical patent/CN101419801B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Abstract

The invention discloses a method for measuring the correlation perceptive characteristics between ears by dividing frequency band and a device thereof. In spatial audio coding, the correlation between ears is an important parameter for controlling sound field width. The invention adopts the method of equal loudness and analysis by dividing the frequency band, and the perceptive characteristics of the correlation parameter between ears of the people are tested. On this basis, all the frequency bands independently judge whether the correlation parameter between ears is needed to be extracted to take part in coding; compared with the existing method of full frequency band extraction or no full frequency band extraction, the invention effectively eliminates the subjective redundancy of the correlation between ears and improves the coding efficiency of the parameter.

Description

The method of subband measuring correlation sensing characteristic between ears and device thereof
Technical field:
The present invention relates to a kind of measuring method and device thereof of correlation sensing characteristic between ears, more specifically, relate to the apperceive characteristic measuring method and the device thereof of degree of correlation parameter between the ear that extracts in the spatial audio coding method.
Background technology:
In the spatial audio coding technology, spatial parameter has been expressed the spatial information that contains in multichannel (the containing stereo) audio frequency.The degree of correlation between ear (Interaural Correlation is hereinafter to be referred as IC) is of outbalance in numerous spatial parameters, is controlling the sound field width, is defined as the related coefficient between sound channel in actual coding is used, and its numerical value is between 0 and 1.Shown in the IC sense of hearing perception synoptic diagram of accompanying drawing 1: when IC approached 0 value, signal correlation was less between sound channel, and what ears perceived is area source of sound, gave a kind of open auditory perception of people; When IC approached 1 value, signal correlation was bigger between sound channel, and what ears perceived is point sound source, gave a kind of concentrated auditory perception of people.
When the psychoacoustic research of front space shows that IC has difference in perception in the different operating unit, that is to say that under different frequency, to identical IC numerical value, the sound field width of ears perception is different; Under this external same frequency condition, have only when the IC value is enough little, ears just can be experienced sound field and have certain width, otherwise ears think hear this moment remain point sound source.
Yet in the existing space audio coding, the disposal route of IC is comprised two classes: a class is that the IC to each operating unit (as the subband of each division) encodes in whole signal frequency range; Another kind of is for reducing the parameter code check of coding, ignores IC the influence of coding tonequality is not encoded to IC.The disposal route of this two class, ignored the difference in perception that IC has in the different operating unit, wasted the parameter coding code check, do not handle the perception information of ears with coding the dropping to cost of tonequality to dimension, thereby make to have the subjective redundant information that to remove in a large number in the coded signal, reduce encoding compression efficient.
Summary of the invention:
The present invention is intended to propose a kind of method and device thereof of subband measuring IC apperceive characteristic, and subjective redundant for effectively removing the IC that exists in the multi-channel signal, improving encoding compression efficient provides effective solution.
Solution of the present invention may further comprise the steps:
The method of subband measuring correlation sensing characteristic between ears and device thereof.It is characterized in that, comprise following steps,
Step 1 by frequency band division unit design measured signal, is done the frequency division tape handling to it, each frequency band independent test;
Step 2 by the loudness setup unit, is arranged to identical loudness with all band signals;
Step 3, by the method that parameter set unit adopts left and right acoustic channels to mix mutually, degree of correlation size between the ear of change signal;
Step 4 is made with reference to sound by the IC test cell, and progressively increases the IC value and make test tone simultaneously, will carry out subjective testing relatively with reference to sound and test tone, obtains critical IC value.
The present invention proposes a kind of method of subband measuring IC apperceive characteristic.The frequency bandwidth of 20Hz~15500Hz is divided into 24 frequency bands.Under the condition that waits loudness, a plurality of different IC are set on each frequency band, carry out subjective testing respectively, according to IC size that can be perceived, but judge proper perception (the Just noticeable difference of IC in each frequency band, hereinafter to be referred as JND) point, connection can obtain the JND curve of IC.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, in the described step 1, measured signal adopts two incoherent white noises, perhaps adopts the combination of a white noise and a single frequency tone signal.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, in the described step 1, described measured signal adopts narrow band noise, and its frequency is positioned at the centre frequency position of each subband; Perhaps adopt the white noise that covers full sub-bands of frequencies.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, in the described step 1,, then remove the single frequency tone frequency content in the white noise if measured signal adopts the combination of a white noise and a single frequency tone signal.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, described frequency partition method is drawn for the Bark division methods, and frequency is divided into 24 frequency bands.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, in the described step 3, the method that left and right acoustic channels mixes mutually is that L channel adds the R channel composition, and R channel adds the L channel composition.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, in the described step 4, progressively increase in the process of IC value, ITD and ILD are fixed value, the variation range of test tone IC is between 0 to 1.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, described making parameter value with reference to sound is IC=1, ITD=0, ILD=0.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, after described step 4 is finished, the IC critical value of all frequencies and loudness is done statistical study by statistical analysis unit, obtain JND family of curves.
Method at above-mentioned subband measuring correlation sensing characteristic between ears, it comprises frequency band division unit and the loudness setup unit and the parameter set unit that link to each other with the frequency partition unit respectively, described parameter set unit also links to each other with the IC test cell, and the IC test cell also links to each other with statistical analysis unit.
Description of drawings
Fig. 1 is the auditory perception model of IC among the present invention
Fig. 2 is subjective testing process flow diagram among the present invention
Fig. 3 is for revising cycle tests IC size schematic diagram among the present invention
Fig. 4 is JND expected results figure among the present invention
The structure principle chart that Fig. 5 uses for the present invention
Embodiment:
The method of subband measuring correlation sensing characteristic between ears and device thereof.It is characterized in that, comprise following steps,
Step 1 by frequency band division unit design measured signal, is done the frequency division tape handling to it, and the frequency partition method is drawn for the Bark division methods, and frequency is divided into 24 frequency bands.Each frequency band independent test; Measured signal adopts two incoherent white noises, perhaps adopts the combination of a white noise and a single frequency tone signal.Measured signal adopts narrow band noise, and its frequency is positioned at the centre frequency position of each subband; Perhaps adopt the white noise that covers full sub-bands of frequencies.If measured signal adopts the combination of a white noise and a single frequency tone signal, then remove the single frequency tone frequency content in the white noise.
Select two section audio sequences (as white noise or single frequency tone) respectively as left and right sound track signals, guarantee that they are uncorrelated mutually on mathematical meaning, promptly the degree of correlation is zero.By the frequency band division unit, it is carried out the filtering of branch frequency band.Divide the frequency band rule with reference to Bark frequency band division method, totally 24 frequency bands.Promptly obtain 24 mutual incoherent subband noises of left and right acoustic channels after the filtering, as basic cycle tests.
Step 2 by the loudness setup unit, is arranged to identical loudness with all band signals; The loudness of loudness setup unit correction basic test sequence.Set a loudness value such as rational,, calculate the energy size of each frequency band, the basic test sequence is adjusted on the corresponding energy value in this loudness according to contour of equal loudness.
Step 3, by the method that parameter set unit adopts left and right acoustic channels to mix mutually, degree of correlation size between the ear of change signal; The method that left and right acoustic channels mixes mutually is that L channel adds the R channel composition, and R channel adds the L channel composition.
When the IC size of cycle tests was set, by the cycle tests that previous step obtains, its IC must be 0.With the method that left and right acoustic channels is sneaked into mutually, can be adjusted to specific I C size.Synoptic diagram as shown in Figure 3, formula is as follows:
L=s 1?cos?α+s 2?sin?α
R=s 1?sin?α+s 2?cos?α (1)
S1 wherein, s2 represents 2) in the left and right sound track signals of each frequency band; After L, R represent to adjust IC, the left and right sound track signals of formation sequence.α is a mathematic parameter of control IC size, and corresponding one by one with IC, its pass is:
IC=sin(2α) (2)
Get a series of IC sizes, calculate corresponding α value.Generate corresponding sequence with programming tool.Promptly generate the standard pronunciation sequence.Generally speaking, to get IC be 1 to the standard pronunciation sequence.Calculating α is 45 °.Generate standard pronunciation sequence, totally 24 with programming tool.
Step 4 is made with reference to sound by the IC test cell, and progressively increases the IC value and make test tone simultaneously, will carry out subjective testing relatively with reference to sound and test tone, obtains critical IC value.In the present embodiment, be with reference to the making parameter value of sound, IC=1, ITD=0, ILD=0, in the process that progressively increases the IC value, ITD and ILD are fixed value, the variation range of test tone IC is between 0 to 1.Subjective testing divides 24 frequency bands to carry out respectively, and each frequency band contains several cycle testss and 1 standard pronunciation sequence, to its audiometry.The tester need be when each cycle tests of audiometry, and all the standard pronunciation sequence with this frequency band compares, and whether the sound field effect of each cycle tests of mark is identical with the sound field effect of standard pronunciation sequence.
In the method for above-mentioned subband measuring correlation sensing characteristic between ears, after described step 4 is finished, the IC critical value of all frequencies and loudness is done statistical study by statistical analysis unit, obtain JND family of curves.The statistical analysis unit analytical test is as a result the time, the data of first analyzing personal, and in 24 frequency bands, two kinds of different marks in the distinguishing tests find their boundary.The definition boundary perceived effect IC size different with standard pronunciation sequence effect is the JND value.Then each has the JND value of 24 branch frequency bands per capita.Analyzing total volume data is again done sums on average to all testers' result, obtains 24 JND values after average.After the broken line connection, promptly become the JND curve.
The frequency band grouping is according to the characteristic of JND curve, and with identical characteristics, and adjacent frequency band is divided into one group.When encoding and decoding, they can adopt identical spatial parameter to accept or reject strategy.
A kind of device that is used for the method for subband measuring correlation sensing characteristic between ears, it comprises frequency band division unit and the loudness setup unit and the parameter set unit that link to each other with the frequency partition unit respectively, described parameter set unit also links to each other with the IC test cell, and the IC test cell also links to each other with statistical analysis unit.
This device has been stipulated a kind of operation steps of the IC of measurement parameter apperceive characteristic.Measurement result will be directly used in stereo coding, determine the threshold value of each frequency band extraction IC parameter in the scrambler.Threshold value then when stereo coding, only processes the IC that surpasses the threshold value size in case determine: quantize and write code stream.Be left intact for the IC that is lower than the threshold value size, regard default value (being generally 1) as, needn't quantize and write code stream.This device in the effect of stereo coding as shown in Figure 5.
Be example with stereo below, utilize designed method of testing of the present invention and device, measure the apperceive characteristic of the degree of correlation between one group of ear.Wherein, loudness value is got people's ear and is felt 20 comparatively comfortable sides, and IC magnitude range to be measured gets 0.5 to 0.9, and step-length is 0.1, and it is 1 that standard pronunciation is got IC, and other spatial stereo parameter I LD gets 0, and ITD gets 0.The audition tester chooses 20 experienced testers.Concrete implementation step is as follows:
1) generates cycle tests.Select two sections different white noise sequences, get 3 second duration, as stereosonic left and right acoustic channels to be measured.With Audition it is divided into 24 Bark frequency bands, obtains 24 basic cycle testss.
2) loudness of correction basic test sequence.Setting loudness is 20 sides, according to contour of equal loudness, calculates the energy size that each frequency band should be set.Adjust the energy size of sequence on each frequency band respectively.
3) the IC size of cycle tests is set.Setting the IC size is 0.5,0.6,0.7,0.8,0.9, calculates α and is respectively 15 °, 18.43 °, 22.21 °, 26.57 °, 32.08 °.With MATLAB instrument coding, generate 5 sequences that satisfy above IC size.24 frequency bands are totally 120 sequences.
4) generate the standard pronunciation sequence.The standard pronunciation sequence is got IC and is equaled 1, and promptly α equals 45 °.Generate standard pronunciation sequence, totally 24 with the MATLAB instrument.
5) carry out subjective testing.Select 20 test mans that the subjective testing experience is arranged, divide the frequency band audiometric sequence.When requiring each cycle tests of audiometry, all compare with the standard pronunciation sequence, whether the sound field effect of each cycle tests of mark is identical with the sound field effect of standard pronunciation sequence.Label symbol is set at: be all " 0 " mutually, inequality is " 1 ".
6) analytical test result.At first the analyzing personal data find in each frequency band, the boundary position of " 0 " and " 1 ", and " 1 " pairing IC size is the JND value on this frequency band in the record delimitation position.On frequency band, all tester JND values average is overall JND value.24 JND values are linked to be broken line, are the JND curve.
7) frequency band grouping.According to the trend of JND curve, with identical characteristics, and adjacent frequency band is divided into one group, and when encoding and decoding, they can adopt identical spatial parameter to accept or reject strategy.

Claims (10)

1. the method for a subband measuring correlation sensing characteristic between ears and device thereof.It is characterized in that, comprise following steps,
Step 1 by frequency band division unit design measured signal, is done the frequency division tape handling to it, each frequency band independent test;
Step 2 by the loudness setup unit, is arranged to identical loudness with all band signals;
Step 3, by the method that parameter set unit adopts left and right acoustic channels to mix mutually, degree of correlation size between the ear of change signal;
Step 4 is made with reference to sound by the IC test cell, and progressively increases the IC value and make test tone simultaneously, will carry out subjective testing relatively with reference to sound and test tone, obtains critical IC value.
2. the method for subband measuring correlation sensing characteristic between ears according to claim 1 is characterized in that: in the described step 1, measured signal adopts two incoherent white noises, perhaps adopts the combination of a white noise and a single frequency tone signal.
3. the method for subband measuring correlation sensing characteristic between ears according to claim 2 is characterized in that: in the described step 1, described measured signal adopts narrow band noise, and its frequency is positioned at the centre frequency position of each subband; Perhaps adopt the white noise that covers full sub-bands of frequencies.
4. according to the method for claim 1 or 2 or 3 described subband measuring correlation sensing characteristic between ears, it is characterized in that: in the described step 1, if measured signal adopts the combination of a white noise and a single frequency tone signal, then remove the single frequency tone frequency content in the white noise.
5. the method for subband measuring correlation sensing characteristic between ears according to claim 1, it is characterized in that: described frequency partition method is drawn for the Bark division methods, and frequency is divided into 24 frequency bands.
6. the method for subband measuring correlation sensing characteristic between ears according to claim 1 is characterized in that: in the described step 3, the method that left and right acoustic channels mixes mutually is that L channel adds the R channel composition, and R channel adds the L channel composition.
7. the method for subband measuring correlation sensing characteristic between ears according to claim 1, it is characterized in that: in the described step 4, progressively increase in the process of IC value, ITD and ILD are fixed value, and the variation range of test tone IC is between 0 to 1.
8. the method for subband measuring correlation sensing characteristic between ears according to claim 1, it is characterized in that: described making parameter value with reference to sound is IC=1, ITD=0, ILD=0.
9. the method for subband measuring correlation sensing characteristic between ears according to claim 1 is characterized in that: after described step 4 is finished, by statistical analysis unit the IC critical value of all frequencies and loudness is done statistical study, obtain JND family of curves.
10. device that is used for the described method of claim 1, it is characterized in that: it comprises frequency band division unit and the loudness setup unit and the parameter set unit that link to each other with the frequency partition unit respectively, described parameter set unit also links to each other with the IC test cell, and the IC test cell also links to each other with statistical analysis unit.
CN2008102366464A 2008-12-03 2008-12-03 Method for subband measuring correlation sensing characteristic between ears and device thereof Expired - Fee Related CN101419801B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008102366464A CN101419801B (en) 2008-12-03 2008-12-03 Method for subband measuring correlation sensing characteristic between ears and device thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2008102366464A CN101419801B (en) 2008-12-03 2008-12-03 Method for subband measuring correlation sensing characteristic between ears and device thereof

Publications (2)

Publication Number Publication Date
CN101419801A true CN101419801A (en) 2009-04-29
CN101419801B CN101419801B (en) 2011-08-17

Family

ID=40630566

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008102366464A Expired - Fee Related CN101419801B (en) 2008-12-03 2008-12-03 Method for subband measuring correlation sensing characteristic between ears and device thereof

Country Status (1)

Country Link
CN (1) CN101419801B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101604526B (en) * 2009-07-07 2011-11-16 武汉大学 Weight-based system and method for calculating audio frequency attention
CN102637432A (en) * 2012-03-20 2012-08-15 武汉大学 Self-adaptive measuring method for dual-aural cue perceptual characteristic in three-dimensional audio coding
CN104240712A (en) * 2014-09-30 2014-12-24 武汉大学深圳研究院 Three-dimensional audio multichannel grouping and clustering coding method and three-dimensional audio multichannel grouping and clustering coding system
US10362427B2 (en) 2014-09-04 2019-07-23 Dolby Laboratories Licensing Corporation Generating metadata for audio object

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3805946A1 (en) * 1988-02-25 1989-09-07 Fraunhofer Ges Forschung DEVICE FOR DETERMINING CHARACTERISTIC PARAMETERS FROM THE INPUT AND OUTPUT SIGNALS OF A SYSTEM FOR AUDIO SIGNAL PROCESSING
JP2007085734A (en) * 2005-09-16 2007-04-05 Research Organization Of Information & Systems Sound source direction detection device and method
CN101149925B (en) * 2007-11-06 2011-02-16 武汉大学 Space parameter selection method for parameter stereo coding
CN101188878B (en) * 2007-12-05 2010-06-02 武汉大学 A space parameter quantification and entropy coding method for 3D audio signals and its system architecture

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101604526B (en) * 2009-07-07 2011-11-16 武汉大学 Weight-based system and method for calculating audio frequency attention
CN102637432A (en) * 2012-03-20 2012-08-15 武汉大学 Self-adaptive measuring method for dual-aural cue perceptual characteristic in three-dimensional audio coding
CN102637432B (en) * 2012-03-20 2013-12-18 武汉大学 Self-adaptive measuring method for dual-aural cue perceptual characteristic in three-dimensional audio coding
US10362427B2 (en) 2014-09-04 2019-07-23 Dolby Laboratories Licensing Corporation Generating metadata for audio object
CN104240712A (en) * 2014-09-30 2014-12-24 武汉大学深圳研究院 Three-dimensional audio multichannel grouping and clustering coding method and three-dimensional audio multichannel grouping and clustering coding system
CN104240712B (en) * 2014-09-30 2018-02-02 武汉大学深圳研究院 A kind of three-dimensional audio multichannel grouping and clustering coding method and system

Also Published As

Publication number Publication date
CN101419801B (en) 2011-08-17

Similar Documents

Publication Publication Date Title
US8612237B2 (en) Method and apparatus for determining audio spatial quality
CN104995677B (en) Use programme information or the audio coder of subflow structural metadata and decoder
CN100571043C (en) A kind of space parameter stereo coding/decoding method and device thereof
CN105164749B (en) The hybrid coding of multichannel audio
CN101149925B (en) Space parameter selection method for parameter stereo coding
CN101188878B (en) A space parameter quantification and entropy coding method for 3D audio signals and its system architecture
EP1979900B1 (en) Apparatus for estimating sound quality of audio codec in multi-channel and method therefor
CN105531763B (en) Uneven parameter for advanced coupling quantifies
CA2558161A1 (en) Device and method for processing a multi-channel signal
KR101600082B1 (en) Method and appratus for a evaluation of audio signal quality
CN101419801B (en) Method for subband measuring correlation sensing characteristic between ears and device thereof
CN103489449A (en) Audio signal decoder, method for providing upmix signal representation state
CN103262158B (en) The multi-channel audio signal of decoding or stereophonic signal are carried out to the apparatus and method of aftertreatment
CN103151049B (en) A kind of QoS guarantee method towards Mobile audio frequency and system
CN103026406A (en) Device and method for postprocessing decoded multi-channel audio signal or decoded stereo signal
WO2020084170A1 (en) Directional loudness map based audio processing
CN107170468A (en) A kind of multichannel audio quality evaluating method based on two-layer model
CN101408615B (en) Method and device for measuring binaural sound time difference ILD critical apperceive characteristic
CN102063905A (en) Blind noise filling method and device for audio decoding
CN101408614B (en) Method and device for measuring binaural sound strong difference ILD critical apperceive characteristic
Watanabe Objective perceptual audio quality measurement method
CN101504835B (en) Measurement method for spacial sensed information content in acoustic field and application thereof
van Dorp Schuitman et al. An artificial listener for assessing content-specific objective parameters related to room acoustical quality
Wang et al. Study of Dynamic Spectrum Access Scheme in HD Radio
KR20210146980A (en) Determination of Significance of Spatial Audio Parameters and Associated Encoding

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110817

Termination date: 20141203

EXPY Termination of patent right or utility model