CN102446508B - Voice audio uniform coding window type selection method and device - Google Patents

Voice audio uniform coding window type selection method and device Download PDF

Info

Publication number
CN102446508B
CN102446508B CN2010105026796A CN201010502679A CN102446508B CN 102446508 B CN102446508 B CN 102446508B CN 2010105026796 A CN2010105026796 A CN 2010105026796A CN 201010502679 A CN201010502679 A CN 201010502679A CN 102446508 B CN102446508 B CN 102446508B
Authority
CN
China
Prior art keywords
frequency energy
sudden change
subframe
described subframe
threshold value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2010105026796A
Other languages
Chinese (zh)
Other versions
CN102446508A (en
Inventor
金剑
刘贵忠
顿玉洁
杜正中
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN2010105026796A priority Critical patent/CN102446508B/en
Publication of CN102446508A publication Critical patent/CN102446508A/en
Application granted granted Critical
Publication of CN102446508B publication Critical patent/CN102446508B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The embodiment of the invention discloses a voice audio uniform coding window type selection method and device, relates to the technical field of communication, and aims to improve the accuracy for selecting a window type for a code signal. The method comprises the following steps of: dividing a current frame code signal into at least two sub-frames; detecting high-frequency energy characteristics of the sub-frames by using a preset coding bit rate; and selecting coding window types for the sub-frames according to the high-frequency energy characteristics. The embodiment of the invention is mainly used in a voice audio uniform coding technology.

Description

The type system of selection of speech audio Unified coding window and device
Technical field
The present invention relates to communication technical field, relate in particular to the type system of selection of a kind of speech audio Unified coding window and device in the speech audio Unified coding technology.
Background technology
The core algorithm of speech audio Unified coding module is to revise discrete cosine transform.And in revising the process of discrete cosine transform, need be to the coded signal piecemeal windowing of input.Two types of window types are provided in the most speech audio coding standard, are respectively long window and short window.
In the prior art, can whether undergo mutation to judge that the coded signal of input need use the window type of which kind of type by judging the signal high-frequency energy, and when definite described coded signal need use short window, described coded signal be divided into groups.
But, the method for the window type of judgement coded signal in the prior art, its judged result is inaccurate.
Summary of the invention
The embodiment of the invention provides the type system of selection of a kind of speech audio Unified coding window and device, selects the accuracy of window type to rise to coded signal.
The embodiment of the invention adopts following technical scheme:
The system of selection of a kind of speech audio Unified coding window type comprises:
The present frame coded signal is divided at least two subframes;
Utilize the high-frequency energy characteristic of the described subframe of preset coding bit-rate detection;
Be that described subframe is selected coding window type according to described high-frequency energy characteristic.
A kind of speech audio Unified coding window type selecting arrangement comprises:
Division unit is used for the present frame coded signal is divided at least two subframes;
Detecting unit is for the high-frequency energy characteristic of utilizing the described subframe of preset coding bit-rate detection;
Selected cell, being used for according to described high-frequency energy characteristic is that described subframe is selected coding window type.
The type system of selection of speech audio Unified coding window and device that the embodiment of the invention provides, parameter with coding bit rate during as the high-frequency energy characteristic of the subframe that detects the present frame coded signal also is the influence of having considered in the embodiment of the invention that coding bit rate is selected coded signal window type.Therefore, utilize method and the device of the embodiment of the invention, can improve the accuracy of the window type of selecting coded signal.
Description of drawings
In order to be illustrated more clearly in the technical scheme of the embodiment of the invention, the accompanying drawing of required use is done an introduction simply in will describing embodiment below, apparently, accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the process flow diagram of embodiment of the invention speech audio Unified coding window type system of selection;
Fig. 2 is the process flow diagram of further embodiment of this invention speech audio Unified coding window type system of selection;
Fig. 3 is the process flow diagram of further embodiment of this invention speech audio Unified coding window type system of selection;
Fig. 4 is the algorithm synoptic diagram of subframe grouping in the embodiment of the invention;
Fig. 5 is the synoptic diagram of embodiment of the invention speech audio Unified coding window type selecting arrangement;
Fig. 6 is the structural drawing of embodiment of the invention speech audio Unified coding window type selecting arrangement.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that obtains under the creative work prerequisite.
As shown in Figure 1, the embodiment of the invention provides the system of selection of a kind of speech audio Unified coding window type, comprises the steps:
Step 11, the present frame coded signal is divided at least two subframes.
Be that 1024 sampling points are example with a coded signal frame, this coded signal frame can be divided into 8 subframes.
Step 12, utilize the high-frequency energy characteristic of the described subframe of preset coding bit-rate detection.
Wherein said coding bit rate was set arbitrarily before coding according to demand by the user, and for example this demand can be the size of coding back voice document, quality of voice signal etc.Described high-frequency energy characteristic can comprise: whether the high-frequency energy of subframe first order sudden change takes place; The perhaps actual high-frequency energy summation of described subframe and the relation between the described codified high-frequency energy.First order sudden change can be understood as bigger sudden change, also can be understood as less relatively sudden change, and the quality of the quality of the voice signal that the degree of concrete sudden change obtains according to the final expectation of user determines.For example if the user expects to obtain high-quality voice signal, he can the less sudden change of sudden change degree be set to first order sudden change so.In general, the actual high-frequency energy of the subframe that step 22 calculates below utilizing can be thought more than 4 times and exists the first order to suddenly change with predicting to differ between the high-frequency energy.
Step 13, be that described subframe is selected coding window type according to described high-frequency energy characteristic.
As seen from the above, the speech audio Unified coding window type system of selection that the embodiment of the invention provides, parameter with coding bit rate during as the high-frequency energy characteristic of the subframe that detects the present frame coded signal also is the influence of having considered in the embodiment of the invention that coding bit rate is selected coded signal window type.Therefore, utilize the method for the embodiment of the invention, can improve the accuracy of the window type of selecting coded signal.
Below in conjunction with Fig. 2-embodiment shown in Figure 4, describe the detailed process of embodiment of the invention speech audio Unified coding window type system of selection in detail.
As shown in Figure 2, the system of selection of further embodiment of this invention speech audio Unified coding window type comprises the steps:
Step 21, the present frame coded signal is divided at least two subframes.
In this embodiment and embodiment shown in Figure 4, all the present frame coded signal is 1024 sampling points, and it is example that this coded signal frame is divided into 8 subframes.Comprise 128 sampling points so, accordingly in each subframe.
Step 22, the actual high-frequency energy that calculates each subframe and prediction high-frequency energy.
In this step, at first use an I IR wave filter to this current coded signal frame signal iCarry out high-pass filtering, picked up signal hpsignal after the filtering i, filtering method as shown in Equation (1).In high-pass filtering, the high-pass equipment cutoff frequency of choosing is pi/2, and exponent number is 2 rank.
Hpsignal i=0.7548signal i-0.7548signal I-1+ 0.5095hpsignal I-1, wherein, 0≤i≤1023 (1)
Then, according to the actual high-frequency energy subEnergy of each subframe after the mode calculation of filtered shown in the formula (2) w
subEnergy w = Σ i = w * 128 w * 128 + 127 hpsignal i , Wherein, 0≤w≤7 (2)
At last, recycling formula (3) calculates the prediction prediction high-frequency energy preSubEnergy of each subframe w,
PreSubEnergy w=0.65preSubEnergy W-1+ 0.35subEnergy W-1, 0≤w≤7 wherein.(3)
Wherein, i represents sampling point call number in the subframe, and w represents subframe index number.
Step 23, utilize preset coding bit rate bitRate to calculate first high-frequency energy sudden change threshold value attackRatio.
Its account form is as shown in Equation (4):
attackRatio = 0.1 × bitRate / 24000 - 0.01 ; nChannels = 1 0.1 × bitRate / nChannles / 16000 - 0.01 ; nChannels > 1 - - - ( 4 )
attackRatio = MinAttackRatio , attackRatio < MinAttackRatio MaxAttackRatio , attackRatio > MaxAttackRatio
Wherein, nChannels represents the channel number that the core encoder module will be encoded, and MinAttackRatio and MaxAttackRatio represent minimum value and the maximal value of attackRatio respectively.The value of nChannels can get 1,2 etc., and the value of MinAttackRatio and MaxAttackRatio rule of thumb is worth setting by the user.Wherein said first high-frequency energy sudden change threshold value is less than 1.In this embodiment, the value of MinAttackRatio and MaxAttackRatio can rule of thumb be worth and is set at 0.05 and 0.25 respectively.
For example, bit rate commonly used is 12000,16000,20000,24000, when nChannels=1, can calculate corresponding " first high-frequency energy sudden change threshold value " according to formula (4) and be respectively 0.04,0.057,0.073,0.09.
Step 24, utilize described actual high-frequency energy, described prediction high-frequency energy and described first high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe first order sudden change takes place.
Concrete, in the process that detects, ratio and the first high-frequency energy sudden change threshold value of actual high-frequency energy with described prediction high-frequency energy can be compared, to judge the sudden change degree of each subframe high-frequency energy.When the ratio of described actual high-frequency energy and described prediction high-frequency energy less than the ratio of described first high-frequency energy sudden change threshold value or described prediction high-frequency energy and described actual high-frequency energy during less than described first high-frequency energy sudden change threshold value, determine the high-frequency energy generation first order sudden change of described subframe, otherwise determine that first order sudden change does not take place the high-frequency energy of described subframe.
Step 25, according to testing result, be that described subframe selects coding window type.
Concrete, select short window for the subframe of high-frequency energy generation first order sudden change, the subframe that first order sudden change does not take place for high-frequency energy is selected long window.
Step 26, according to described high-frequency energy characteristic be described subframe to select coding window type be after described subframe is selected short window, with described subframe grouping.
In this embodiment, by judging whether subframe second level sudden change takes place and its sudden change situation is identified to divide into groups.Wherein, second level sudden change is the littler sudden change of sudden change degree than first order sudden change.
That is to say, in embodiments of the present invention, be to determine grouping under the subframe by the judgement of two-stage sudden change.
Concrete, at first utilize described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, wherein said second high-frequency energy sudden change threshold value is less than 1, concrete computing formula as shown in Equation (5):
mulRatio=attackRatio×2 (5)
Wherein, mulRatio represents second high-frequency energy sudden change threshold value, and attackRatio represents first high-frequency ability sudden change threshold value.
Then, utilize the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for subframe arranges the energy jump sign.
Concrete detection method is: in detecting process, ratio and the second high-frequency energy sudden change threshold value of actual high-frequency energy with described prediction high-frequency energy can be compared.When the ratio of described actual high-frequency energy and described prediction high-frequency energy less than the ratio of described second high-frequency energy sudden change threshold value or described prediction high-frequency energy and described actual high-frequency energy during less than described second high-frequency energy sudden change threshold value, second level sudden change takes place in the high-frequency energy of determining described subframe, otherwise determines that second level sudden change does not take place the high-frequency energy of described subframe.Whether take place to identify each subframe in the process of second level sudden change at the high-frequency energy that detects each subframe, to determine whether to take place second level sudden change.For example the subframe of second level sudden change takes place in available 1 sign, and the subframe of second level sudden change does not take place with 0 sign.Otherwise perhaps.
, in general, between actual high-frequency energy and prediction high-frequency energy, differ and to think more than 2 times and exist the second level to suddenly change the describing for example of first order sudden change according to above-mentioned.
At last, according to described energy jump sign described subframe is divided into groups.
Suppose to utilize mulIndex[w] identify each subframe whether second level sudden change takes place, work as mulIndex[w]=second level sudden change takes place, works as mulIndex[w in 1 expression]=second level sudden change do not take place in 0 expression.For a subframe, suppose mulIndex[w]=(0,0,1,1,0,0,1,0), wherein w=0...7.Its algorithm synoptic diagram as shown in Figure 3.
At first, when initialization, organized into groups number engrolen=0, subframe index w=1, group number g=0, sequence number n=8.For second subframe, mulIndex[1]=0, then the value of w being added 1 is w=2, and because w<8, so then judge second subframe.For the 3rd subframe, mulIndex[1]=0, group[0 then]=w-engroLen=2-0=2, namely the 1st group has 2 subframes; The value that the value that the engroLen value changes 2, g into adds 1, w adds 1, i.e. g=1, w=3.For the 4th subframe, mulIndex[3]=1, group[1]=w-engroLen=3-2=1, namely the 2nd group has 1 subframe, and the value that the value that the engroLen value changes 3, g into adds 1, w adds 1, i.e. g=2, w=4.For the 5th subframe, mulIndex[4]=0, then the value of w being added 1 is w=5, and because w<8, so then judge the 6th subframe.For the 6th subframe, mulIndex[5]=0, then the value of w being added 1 is w=6, and because w<8, so then judge the 7th subframe.For the 7th subframe, mulIndex[6]=1, group[2]=w-engroLen=6-3=3, namely the 3rd group has 3 subframes, and the value that the value that the engroLen value changes 6, g into adds 1, w adds 1, i.e. g=3, w=7.Then judge the 8th subframe, because mulIndex[7]=0, then the value of w being added 1 is w=8.And because w=8, so deterministic process finishes.Group[3]=2, namely the 4th group has 2 subframes.
By above-mentioned deterministic process, as can be seen, be that 8 subframes have been divided into four groups, its group result group[]={ 2,1,3,2} has 2,1,3,2 subframes respectively in every group.Therefore, can be with one group of the 0th, 1 subframe, one group of the 2nd subframe, one group of the 3rd, 4,5 subframe, one group of the 6th, 7 subframe.
Step 27, according to described high-frequency energy characteristic be described subframe to select coding window type be described subframe when selecting long window, adjust the long window type of described subframe.
Multiple concrete different long window type is provided in speech audio Unified coding standard, comprise short window (EIGHT_SHORT_SEQUENCE), long window (ONLY_LONG_SEQUENCE), long beginning window (LONG_START_SEQUENCE), longly finish window (LONG_STOP_SEQUENCE), shortly begin to finish window (STOP_START_SEQUENCE), 1152 long end windows (STOP_1152_SEQUENCE), 1152 weak points and begin to finish totally seven kinds of windows (STOP_START_1152_SEQUENCE); Coding mode has two kinds of time domain codings (CORE_MODE_TD), Frequency Domain Coding (CORE_MODE_FD).In the cataloged procedure, need adjust the window type of each frame according to the employed window type of front and back frame.
The detailed process that the window type is adjusted is with of the prior art identical, at this its regulation rule done and briefly introduced.Its regulation rule is:
If the coding mode of next frame coded signal is time domain coding, then the window type of next frame coded signal changes short window into; If the coding mode of present frame coded signal is the coding mode of time domain coding and next frame coded signal is Frequency Domain Coding, then the window type of next frame coded signal changes 1152 long end windows into; If the window type of next frame coded signal is 1152 long end windows for the window type of short window and present frame coded signal, then the window type of next frame coded signal changes 1152 weak points into and begins to finish window; If the window type of next frame coded signal is long window for the window type of short window and present frame coded signal, then the window type of next frame coded signal changes long beginning window into; If the window type of next frame coded signal is the long window that finishes for the window type of short window and present frame coded signal, then the window type of next frame coded signal changes the short window that begins to finish into; If the window type of next frame coded signal is short window for the window type of long window and present frame coded signal, then the window type of next frame coded signal changes the long window that finishes into.
By above description as can be seen, because first high-frequency energy sudden change threshold value attackRadio value is the number between 0 and 1, coding bit rate is more high, the value of attackRadio is more big, so can detected jump signal just more many, thereby use the possibility of short window more big, final coding efficiency is more good.On the contrary, coding bit rate is more low, and the value of attackRadio is more little, and testing conditions is stricter, and the detected jump signal of energy is more few, uses the possibility of short window to reduce, final coding efficiency reduction.
Therefore, the method for the embodiment of the invention is owing to taken into full account influence thereby accuracy of judgement that coding bit rate is selected the window type.Computation complexity is low simultaneously.When lacking the window grouping, a kind of finer method for grouping short windows is provided, better adapt to characteristics of signals, be conducive to quantization encoding.
As shown in Figure 4, the system of selection of further embodiment of this invention speech audio Unified coding window type comprises the steps:
Step 31, the present frame coded signal is divided at least two subframes.
This step is identical with the description of step 21.
Step 32, the actual high-frequency energy that calculates each subframe and actual high-frequency energy summation.
Wherein the account form of the actual high-frequency energy of each subframe can be with reference to describing accordingly in the step 22.The computing formula of actual high-frequency energy summation sumSubEnergy is as shown in Equation (6):
sumSubEnergy = &Sigma; w = 0 w = 7 subEnergy w - - - ( 6 )
Step 33, utilize the preset coding bit rate to calculate codified high-frequency energy minSumEnergy.Computing formula is as shown in Equation (7):
minSumEnergy=MinEnergyKbits×bitRate
Wherein MinEnergyKbits can be set to 500, bitRate presentation code bit rate.
Step 34, actual high-frequency energy summation and the described codified high-frequency energy of described subframe compared, obtain the actual high-frequency energy summation of described subframe and the relation between the described codified high-frequency energy, and select the window type according to comparative result.
In detailed process, if described actual high-frequency energy summation is less than described codified high-frequency energy, for described subframe is selected long window; If described actual high-frequency energy summation is greater than described codified high-frequency energy, for described subframe is selected short window.
Step 35, according to described high-frequency energy characteristic be described subframe to select coding window type be after described subframe is selected short window, with described subframe grouping.
Different with a last embodiment is in this embodiment, before subframe being lacked the window grouping, at first will calculate actual high-frequency energy and the prediction high-frequency energy of described subframe and utilize described coding bit rate to calculate first high-frequency energy sudden change threshold value.That describes in its computation process and step 23 and 24 is identical.Then, lack the window grouping according to the mode of describing in 26 again.
Step 36, according to described high-frequency energy characteristic be described subframe to select coding window type be described subframe when selecting long window, adjust the long window type of described subframe.
This step can be with reference to the description of step 27.
Wherein, need to prove that minSumEnergy is the parameter at some lower signals of high-frequency energy.Coding bit rate is more high, and the value of minSumEnergy is more big, and long window codified high-frequency energy is more big, uses the possibility of long window more big like this.For the signal with the relatively lower feature of high-frequency energy, final coding efficiency is more good.
Therefore, the method for the embodiment of the invention is owing to taken into full account influence thereby accuracy of judgement that coding bit rate is selected the window type.Computation complexity is low simultaneously.When lacking the window grouping, a kind of finer method for grouping short windows is provided, better adapt to characteristics of signals, be conducive to quantization encoding.
As shown in Figure 5, the embodiment of the invention also provides a kind of speech audio Unified coding window type selecting arrangement, comprising: division unit 51 is used for the present frame coded signal is divided at least two subframes; Detecting unit 52 is for the high-frequency energy characteristic of utilizing the described subframe of preset coding bit-rate detection; Selected cell 53, being used for according to described high-frequency energy characteristic is that described subframe is selected coding window type.
As the description among the method embodiment, described high-frequency energy characteristic can comprise: whether the high-frequency energy of subframe first order sudden change takes place; The perhaps actual high-frequency energy summation of described subframe and the relation between the described codified high-frequency energy.
Accordingly, when described high-frequency energy characteristic can comprise whether the high-frequency energy of subframe first order sudden change takes place, described detecting unit 52 can comprise: first computing module is used for calculating the actual high-frequency energy of described subframe and predicts high-frequency energy; Second computing module is used for utilizing described coding bit rate to calculate first high-frequency energy sudden change threshold value, and wherein said first high-frequency energy sudden change threshold value is less than 1; First detection module is used for utilizing described actual high-frequency energy, described prediction high-frequency energy and described first high-frequency energy sudden change threshold value, and whether the high-frequency energy that detects described subframe first order sudden change takes place.
In concrete application process, described first detection module specifically is used for: when the ratio of described actual high-frequency energy and described prediction high-frequency energy less than the ratio of described first high-frequency energy sudden change threshold value or described prediction high-frequency energy and described actual high-frequency energy during less than described first high-frequency energy sudden change threshold value, determine the high-frequency energy generation first order sudden change of described subframe, otherwise determine that first order sudden change does not take place the high-frequency energy of described subframe.
At this moment, described selected cell 53 specifically is used for: select short window for the subframe of high-frequency energy generation first order sudden change, the subframe that first order sudden change does not take place for high-frequency energy is selected long window.
In the case, the principle of work of described speech audio Unified coding window type selecting arrangement can be with reference to the description of embodiment shown in Figure 2.
When described high-frequency energy characteristic can comprise concerning between the actual high-frequency energy summation of described subframe and the described codified high-frequency energy, described detecting unit 52 can comprise: the 3rd computing module is used for calculating the actual high-frequency energy of described subframe; The 4th computing module is for the actual high-frequency energy summation of calculating described subframe according to the actual high-frequency energy of described subframe; The 5th computing module is used for utilizing described coding bit rate to calculate described codified high-frequency energy; Second detection module is used for actual high-frequency energy summation and the described codified high-frequency energy of described subframe are compared, and obtains the actual high-frequency energy summation of described subframe and the relation between the described codified high-frequency energy.At this moment, described selected cell 53 specifically is used for: if described actual high-frequency energy summation is less than described codified high-frequency energy, for described subframe is selected long window; If described actual high-frequency energy summation is greater than described codified high-frequency energy, for described subframe is selected short window.
In the case, the principle of work of described speech audio Unified coding window type selecting arrangement can be with reference to the description of embodiment shown in Figure 4.
In addition, as shown in Figure 6, the described device of the embodiment of the invention also can comprise:
Grouped element 54 is used for described subframe grouping; Window type adjustment unit 55 is for the long window type of adjusting described present frame coded signal.
According to the different situations of high-frequency energy characterisitic parameter, described grouped element 54 can comprise: first computing module, be used for utilizing described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, and wherein said second high-frequency energy sudden change threshold value is less than 1; First processing module, be used for utilizing the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for described subframe arranges the energy jump sign; First grouping module is used for according to described energy jump sign described subframe grouping.
Perhaps, described grouped element 54 can comprise: second computing module is used for calculating the actual high-frequency energy of described subframe and predicts high-frequency energy; The 3rd computing module is used for utilizing described coding bit rate to calculate first high-frequency energy sudden change threshold value, and wherein said first high-frequency energy sudden change threshold value is less than 1; The 4th computing module is used for utilizing described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, and wherein said second high-frequency energy sudden change threshold value is less than 1; Second processing module, be used for utilizing the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for subframe arranges the energy jump sign; Second grouping module is used for according to described energy jump sign described subframe grouping.
But the corresponding description among the concrete course of work reference method embodiment of wherein said grouped element.
As seen from the above, the speech audio Unified coding window type selecting arrangement that the embodiment of the invention provides, parameter with coding bit rate during as the high-frequency energy characteristic of the subframe that detects the present frame coded signal also is the influence of having considered in the embodiment of the invention that coding bit rate is selected coded signal window type.Therefore, utilize the device of the embodiment of the invention, can improve the accuracy of the window type of selecting coded signal.
In addition, utilize the device of the embodiment of the invention when subframe being lacked the window grouping, can better adapt to characteristics of signals, be conducive to quantization encoding.
One of ordinary skill in the art will appreciate that all or part of flow process that realizes in above-described embodiment method, be to instruct relevant hardware to finish by computer program, described program can be stored in the computer read/write memory medium, this program can comprise the flow process as the embodiment of above-mentioned each side method when carrying out.Wherein, described storage medium can be magnetic disc, CD, read-only storage memory body (Read-Only Memory, ROM) or at random store memory body (Random Access Memory, RAM) etc.
The above; only be the specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, anyly is familiar with those skilled in the art in the technical scope that the present invention discloses; can expect easily changing or replacing, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of described claim.

Claims (17)

1. speech audio Unified coding window type system of selection is characterized in that, comprising:
The present frame coded signal is divided at least two subframes;
Utilize the high-frequency energy of the described subframe of preset coding bit-rate detection whether first order sudden change takes place, perhaps, utilize the actual high-frequency energy summation of the described subframe of preset coding bit-rate detection whether greater than the codified high-frequency energy;
According to testing result, for described subframe is selected coding window type;
Be after described subframe selects coding window type to be the short window of described subframe selection according to testing result, whether the high-frequency energy that detects described subframe again second level sudden change takes place, and according to the result who detects again described subframe is divided into groups, wherein, described second level sudden change is the littler sudden change of sudden change degree than first order sudden change;
Being after described subframe selects coding window type to be the long window of described subframe selection according to testing result, adjust the long window type of described present frame coded signal.
2. method according to claim 1 is characterized in that, utilizes the high-frequency energy of the described subframe of preset coding bit-rate detection whether first order sudden change takes place, and comprising:
Calculate actual high-frequency energy and the prediction high-frequency energy of described subframe;
Utilize described coding bit rate to calculate first high-frequency energy sudden change threshold value, wherein said first high-frequency energy sudden change threshold value is less than 1;
Utilize described actual high-frequency energy, described prediction high-frequency energy and described first high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe first order sudden change takes place.
3. method according to claim 2 is characterized in that, described described actual high-frequency energy, described prediction high-frequency energy and described first high-frequency energy sudden change threshold value utilized, and whether the high-frequency energy that detects described subframe first order sudden change takes place comprises:
When the ratio of described actual high-frequency energy and described prediction high-frequency energy less than the ratio of described first high-frequency energy sudden change threshold value or described prediction high-frequency energy and described actual high-frequency energy during less than described first high-frequency energy sudden change threshold value, determine the high-frequency energy generation first order sudden change of described subframe, otherwise determine that first order sudden change does not take place the high-frequency energy of described subframe.
4. method according to claim 3 is characterized in that, and is described according to testing result, selects coding window type for described subframe and comprises:
Select short window for the subframe of high-frequency energy generation first order sudden change, the subframe that first order sudden change does not take place for high-frequency energy is selected long window.
5. whether method according to claim 1 is characterized in that, utilize the actual high-frequency energy summation of the described subframe of preset coding bit-rate detection greater than the codified high-frequency energy; Comprise:
Calculate the actual high-frequency energy of described subframe;
Calculate the actual high-frequency energy summation of described subframe according to the actual high-frequency energy of described subframe;
Utilize described coding bit rate to calculate described codified high-frequency energy;
Actual high-frequency energy summation and the described codified high-frequency energy of described subframe are compared, obtain the actual high-frequency energy summation of described subframe and the relation between the described codified high-frequency energy.
6. method according to claim 5 is characterized in that, and is described according to testing result, selects coding window type for described subframe and comprises:
If described actual high-frequency energy summation is less than described codified high-frequency energy, for described subframe is selected long window; If described actual high-frequency energy summation is greater than described codified high-frequency energy, for described subframe is selected short window.
7. method according to claim 4 is characterized in that, whether the described high-frequency energy that detects described subframe again second level sudden change takes place, and according to the result who detects again described subframe is divided into groups, and comprising:
Utilize described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, wherein said second high-frequency energy sudden change threshold value is less than 1;
Utilize the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for described subframe arranges the energy jump sign;
According to described energy jump sign described subframe is divided into groups.
8. method according to claim 6 is characterized in that, whether the described high-frequency energy that detects described subframe again second level sudden change takes place, and according to the result who detects again described subframe is divided into groups, and comprising:
Calculate actual high-frequency energy and the prediction high-frequency energy of described subframe;
Utilize described coding bit rate to calculate first high-frequency energy sudden change threshold value, wherein said first high-frequency energy sudden change threshold value is less than 1;
Utilize described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, wherein said second high-frequency energy sudden change threshold value is less than 1;
Utilize the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for subframe arranges the energy jump sign;
According to described energy jump sign described subframe is divided into groups.
9. according to claim 7 or 8 described methods, it is characterized in that whether the high-frequency energy of the described subframe of described detection second level sudden change takes place comprises:
When the ratio of described actual high-frequency energy and described prediction high-frequency energy less than the ratio of described second high-frequency energy sudden change threshold value or described prediction high-frequency energy and described actual high-frequency energy during less than described second high-frequency energy sudden change threshold value, second level sudden change takes place in the high-frequency energy of determining described subframe, otherwise determines that second level sudden change does not take place the high-frequency energy of described subframe.
10. a speech audio Unified coding window type selecting arrangement is characterized in that, comprising:
Division unit is used for the present frame coded signal is divided at least two subframes;
Whether detecting unit is used for utilizing the high-frequency energy of the described subframe of preset coding bit-rate detection whether first order sudden change takes place, perhaps, be used for utilizing the actual high-frequency energy summation of the described subframe of preset coding bit-rate detection greater than the codified high-frequency energy;
Selected cell is used for according to testing result, for described subframe is selected coding window type;
Grouped element, be used for is being after described subframe selects coding window type to be the short window of described subframe selection according to testing result, whether the high-frequency energy that detects described subframe again second level sudden change takes place, and according to the result who detects again described subframe is divided into groups, wherein, described second level sudden change is the littler sudden change of sudden change degree than first order sudden change;
Window type adjustment unit is used for adjusting the long window type of described present frame coded signal being after described subframe selects coding window type to be the long window of described subframe selection according to testing result.
11. device according to claim 10 is characterized in that, described detecting unit is used for utilizing the high-frequency energy of the described subframe of preset coding bit-rate detection whether first order sudden change takes place; Described detecting unit comprises:
First computing module is used for calculating the actual high-frequency energy of described subframe and predicts high-frequency energy;
Second computing module is used for utilizing described coding bit rate to calculate first high-frequency energy sudden change threshold value, and wherein said first high-frequency energy sudden change threshold value is less than 1;
First detection module is used for utilizing described actual high-frequency energy, described prediction high-frequency energy and described first high-frequency energy sudden change threshold value, and whether the high-frequency energy that detects described subframe first order sudden change takes place.
12. device according to claim 11 is characterized in that, described first detection module specifically is used for:
When the ratio of described actual high-frequency energy and described prediction high-frequency energy less than the ratio of described first high-frequency energy sudden change threshold value or described prediction high-frequency energy and described actual high-frequency energy during less than described first high-frequency energy sudden change threshold value, determine the high-frequency energy generation first order sudden change of described subframe, otherwise determine that first order sudden change does not take place the high-frequency energy of described subframe.
13. device according to claim 12 is characterized in that, described selected cell specifically is used for:
Select short window for the subframe of high-frequency energy generation first order sudden change, the subframe that first order sudden change does not take place for high-frequency energy is selected long window.
14. device according to claim 10 is characterized in that, whether described detecting unit is used for utilizing the actual high-frequency energy summation of the described subframe of preset coding bit-rate detection greater than the codified high-frequency energy; Described detecting unit comprises:
The 3rd computing module is for the actual high-frequency energy that calculates described subframe;
The 4th computing module is for the actual high-frequency energy summation of calculating described subframe according to the actual high-frequency energy of described subframe;
The 5th computing module is used for utilizing described coding bit rate to calculate described codified high-frequency energy;
Second detection module is used for actual high-frequency energy summation and the described codified high-frequency energy of described subframe are compared, and obtains the actual high-frequency energy summation of described subframe and the relation between the described codified high-frequency energy.
15. device according to claim 14 is characterized in that, described selected cell specifically is used for:
If described actual high-frequency energy summation is less than described codified high-frequency energy, for described subframe is selected long window; If described actual high-frequency energy summation is greater than described codified high-frequency energy, for described subframe is selected short window.
16. device according to claim 11 is characterized in that, described grouped element comprises:
First computing module is used for utilizing described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, and wherein said second high-frequency energy sudden change threshold value is less than 1;
First processing module, be used for utilizing the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for described subframe arranges the energy jump sign;
First grouping module is used for according to described energy jump sign described subframe grouping.
17. according to claim 11 or 14 described devices, it is characterized in that described grouped element comprises:
Second computing module is used for calculating the actual high-frequency energy of described subframe and predicts high-frequency energy;
The 3rd computing module is used for utilizing described coding bit rate to calculate first high-frequency energy sudden change threshold value, and wherein said first high-frequency energy sudden change threshold value is less than 1;
The 4th computing module is used for utilizing described first high-frequency energy sudden change threshold calculations, second high-frequency energy sudden change threshold value, and wherein said second high-frequency energy sudden change threshold value is less than 1;
Second processing module, be used for utilizing the actual high-frequency energy of described subframe, described prediction high-frequency energy and described second high-frequency energy sudden change threshold value, whether the high-frequency energy that detects described subframe second level sudden change takes place, and according to testing result, for subframe arranges the energy jump sign;
Second grouping module is used for according to described energy jump sign described subframe grouping.
CN2010105026796A 2010-10-11 2010-10-11 Voice audio uniform coding window type selection method and device Expired - Fee Related CN102446508B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010105026796A CN102446508B (en) 2010-10-11 2010-10-11 Voice audio uniform coding window type selection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010105026796A CN102446508B (en) 2010-10-11 2010-10-11 Voice audio uniform coding window type selection method and device

Publications (2)

Publication Number Publication Date
CN102446508A CN102446508A (en) 2012-05-09
CN102446508B true CN102446508B (en) 2013-09-11

Family

ID=46008960

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010105026796A Expired - Fee Related CN102446508B (en) 2010-10-11 2010-10-11 Voice audio uniform coding window type selection method and device

Country Status (1)

Country Link
CN (1) CN102446508B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103794218B (en) * 2014-01-27 2017-02-01 东南大学 DVFS method for AAC
CN112992164A (en) * 2014-07-28 2021-06-18 日本电信电话株式会社 Encoding method, apparatus, program, and recording medium

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6311154B1 (en) * 1998-12-30 2001-10-30 Nokia Mobile Phones Limited Adaptive windows for analysis-by-synthesis CELP-type speech coding
KR100472442B1 (en) * 2002-02-16 2005-03-08 삼성전자주식회사 Method for compressing audio signal using wavelet packet transform and apparatus thereof
US20030215013A1 (en) * 2002-04-10 2003-11-20 Budnikov Dmitry N. Audio encoder with adaptive short window grouping
CN100493056C (en) * 2006-08-18 2009-05-27 清华大学 Frequency domain channel estimation method of crossing frequency division multiplexing system with time-domain enveloping weighting
KR20080053739A (en) * 2006-12-11 2008-06-16 삼성전자주식회사 Apparatus and method for encoding and decoding by applying to adaptive window size
CN101308655B (en) * 2007-05-16 2011-07-06 展讯通信(上海)有限公司 Audio coding and decoding method and layout design method of static discharge protective device and MOS component device
CN101694773B (en) * 2009-10-29 2011-06-22 北京理工大学 Self-adaptive window switching method based on TDA domain

Also Published As

Publication number Publication date
CN102446508A (en) 2012-05-09

Similar Documents

Publication Publication Date Title
EP2159788B1 (en) A voice activity detecting device and method
CN101964189B (en) Audio signal switching method and device
CN102436820B (en) High frequency band signal coding and decoding methods and devices
CN102446506B (en) Classification identifying method and equipment of audio signals
CN101149921B (en) Mute test method and device
CN107331386B (en) Audio signal endpoint detection method and device, processing system and computer equipment
CN101488344B (en) Quantitative noise leakage control method and apparatus
EP2702585B1 (en) Frame based audio signal classification
CN101609678A (en) Compression method and compression set thereof
CN104021796A (en) Voice enhancement processing method and device
CN101197133A (en) Decoding method and device
US20200135218A1 (en) Signal Processing Method and Device
CN103915097B (en) Voice signal processing method, device and system
CN101521010A (en) Coding and decoding method for voice frequency signals and coding and decoding device
CN102446508B (en) Voice audio uniform coding window type selection method and device
EP3147900B1 (en) Method and device for processing audio signal
CN105225668A (en) Coding method and equipment
CN102419977B (en) Method for discriminating transient audio signals
CN101770778B (en) Pre-emphasis filter, perception weighted filtering method and system
CN102231274B (en) Fundamental tone period estimated value correction method, fundamental tone estimation method and related apparatus
CN102307323B (en) Method for modifying sound channel delay parameter of multi-channel signal
CN106463140B (en) Modified frame loss correction with voice messaging
CN105096958A (en) Audio coding method and related device
CN105336338A (en) Audio coding method and apparatus
CN101895373B (en) Channel decoding method, system and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130911

Termination date: 20191011