CN101288115A - Method and apparatus for signal processing - Google Patents
Method and apparatus for signal processing Download PDFInfo
- Publication number
- CN101288115A CN101288115A CNA2006800380589A CN200680038058A CN101288115A CN 101288115 A CN101288115 A CN 101288115A CN A2006800380589 A CNA2006800380589 A CN A2006800380589A CN 200680038058 A CN200680038058 A CN 200680038058A CN 101288115 A CN101288115 A CN 101288115A
- Authority
- CN
- China
- Prior art keywords
- data
- coding
- pilot
- value
- entropy
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The present invention relates to a method and apparatus for processing a signal. An object of the present invention devised to solve the problem lies on a method and apparatus for processing a signal, which allows a signal having optimized signal transmission efficiency to be transmitted/ received. According to an aspect of the present invention, there is provided a method of processing a signal including receiving a broadcasting signal including audio data coded using a pilot reference value and a pilot difference value, demodulating the broadcasting signal in consideration of a scattered pilot which varies over time and a continual pilot which is fixed over time in a frame of the received broadcasting signal and decoding the demodulated signal to obtain a broadcasting transmission stream,; demultiplexing the broadcasting transmission stream to obtain coded audio data in an Internet protocol (IP) packet and an identifier for identifying a method of decoding the audio data, obtaining the pilot reference value corresponding to a plurality of data and the pilot difference value corresponding to the pilot reference value from the coded audio data and obtaining the audio data using the pilot reference value and the pilot difference value.
Description
Technical field
The present invention relates to be used for the method and apparatus of processing signals, more particularly, relate to a kind of method and apparatus that makes it possible to carry out the processing signals of signal compression or recovery with the voice data treatment effeciency.
Background technology
Up to the present, people have proposed the multiple technology that relates to signal compression and recovery, and are applied to comprising the several data of sound signal and vision signal usually.Signal compression and the recovery technology of improving picture quality and sound quality and increase compressibility have simultaneously been developed.In order to adapt to multiple communication environment, people are increasing the effort of transfer efficiency.
Typically say, utilized multiple broadcasting method that the content that comprises sound signal, vision signal and extraneous information is provided.Recently, utilized digital broadcast signal that the plurality of kinds of contents that comprises sound signal is provided.The sound signal that sends by DMB can be compressed and sends subsequently according to multiple compression method.When receiving sound signal, by the compression method of the sound signal that comprises in the broadcast singal this sound signal of decoding.Along with the use of the Internet sharply increases and for the increase in demand based on the business of Internet protocol (IP), people are also in the method for considering to provide according to IP scenario broadcast singal.Yet, the broadcast singal that comprises sound signal is effectively being compressed and, is not being proposed to provide the processing of compression method by broadcast singal according to comprising that the multiple delivery plan of IP scenario provides under the situation of compressed broadcast signal.For example, effective compressed broadcast signal is being sent under the situation of IP-based terminal or mobile broadcasting terminal, is not proposing to handle the method for broadcast singal.Therefore, when utilizing broadcast singal that broadcasting service is provided, may go wrong.
Summary of the invention
For addressing this problem a kind of method and apparatus that is used for processing signals that an object of the present invention is to provide that designs, its allow broadcast transmission/receptions form with digital video broadcast-terrestrial (DVB-T) system or hand-held digital video broadcast (DVB-H) system send/the received signal transmission efficiency carried out the signal of optimization.
Provide a kind of method and apparatus that is used for deal with data for addressing this problem the another object of the present invention that designs, it can send/receive the broadcast singal that comprises the voice data that compresses according to Internet protocol (IP) scheme.
Provide a kind of method and apparatus that data is carried out efficient coding for addressing this problem the another object of the present invention that designs.
Provide and a kind of data are carried out the method and apparatus of Code And Decode for addressing this problem the another object of the present invention that designs, it can make the maximise transmission efficiency of the control data that audio frequency uses in recovering.
Make a kind of medium that comprises coded data is provided for addressing this problem the another object of the present invention that designs.
Provide a kind of data structure that is used for effectively sending coded data for addressing this problem the another object of the present invention that designs.
Provide a kind of system that comprises decoding device for addressing this problem the another object of the present invention that designs.
For realizing these and other advantage, and according to concrete enforcement and broadly described purpose of the present invention, a kind of method and apparatus of processing signals is provided, and it can send/receive the broadcast singal that comprises compressing audio signal with the broadcast singal form of digital video broadcast-terrestrial (DVB-T) system or hand-held digital video broadcast (DVB-H) system.
One aspect of the present invention provides a kind of method that is used for processing signals, and this method may further comprise the steps: receive the broadcast singal that comprises voice data, this voice data utilizes pilot frequency benchmark value and pilot tone difference to encode; Time-varying discrete (scattered) pilot tone in one frame of the broadcast singal that consideration receives and fixing in time continuous pilot are come this broadcast singal of demodulation and the broadcast singal after the demodulation are decoded to obtain broadcast transmission stream; This broadcast transmission stream is carried out the identifier of demultiplexing to obtain the coding audio data in Internet protocol (IP) packet and to be used to identify the method for this voice data of decoding; From this coding audio data obtain with corresponding this pilot frequency benchmark value of a plurality of data and with corresponding this pilot tone difference of this pilot frequency benchmark value; And utilize this pilot frequency benchmark value and this pilot tone difference to obtain this voice data.This method also comprises at least one step of decoding in this pilot frequency benchmark value and this pilot tone difference.Parameter comprise channel grade poor (channel level difference) (CLD), inter-channel correlation (ICC) and fall at least one of mixing in the gain (arbitrary downmix gain) (ADG) arbitrarily.
In the mean value that this pilot frequency benchmark value can be these a plurality of data, intermediate value, the most frequently used value and the default value one.This pilot frequency benchmark value can be a value of extracting from table.This method can also may further comprise the steps: select the highest data of code efficiency as final pilot frequency benchmark value after being provided with this pilot frequency benchmark value in these a plurality of data each.This IP packet can comprise the real-time transport protocol (rtp) packet, and this RTP packet comprises the core encoder voice data and according to the pilot frequency benchmark value and the pilot tone difference of this core encoder voice data.
This IP packet can comprise a RTP packet and the 2nd RTP packet, the one RTP packet comprises this core encoder voice data, and the 2nd RTP packet comprises pilot frequency benchmark value and pilot tone difference according to this core encoder voice data, and a RTP packet has identical timestamp information with the 2nd RTP packet.Obtain in the configuration information of the audio object that this identifier can comprise from this audio stream.
One aspect of the present invention provides a kind of device that is used for processing signals, and this device comprises: tuner, be used for the tuning broadcast singal that comprises voice data, and this voice data utilizes pilot frequency benchmark value and pilot tone difference to encode; Demodulation section is used for considering coming this broadcast singal of demodulation through the time-varying scattered pilot and the fixing in time continuous pilot of a frame of tuning broadcast singal; Demultiplexing portion, be used for the signal after the demodulation is decoded, the broadcast transmission stream that comprises voice data is carried out demultiplexing, and in by the audio stream of demultiplexing, parse when utilizing this pilot frequency benchmark value and this pilot tone difference to carry out the identifier of this voice data of coding, demultiplexing goes out to utilize this pilot frequency benchmark value and this pilot tone difference to carry out this voice data and the core encoder voice data of coding from this audio stream; Core codec portion is used for this core encoder voice data is decoded; The spatial information lsb decoder is used for the voice data that utilizes this pilot frequency benchmark value and this pilot tone difference to carry out coding is decoded; And the multichannel generating unit, be used for and will export with the form of multi-channel audio from the sound signal of this core codec portion and the output of this spatial information lsb decoder.
The audio stream that this demultiplexing portion demultiplexing goes out can comprise the IP packet, and this IP packet comprises this core encoder voice data and utilizes this pilot frequency benchmark value and this pilot tone difference and this core encoder voice data to carry out the voice data of encoding.
This IP packet can comprise the RTP packet, and this RTP packet comprises according to this pilot frequency benchmark value of this core encoder voice data and this pilot tone difference.This IP packet can comprise a RTP packet and the 2nd RTP packet, the one RTP packet comprises this core encoder voice data, and the 2nd RTP packet comprises pilot frequency benchmark value and pilot tone difference according to this core encoder voice data, and a RTP packet has identical timestamp information with the 2nd RTP packet.Obtain in the configuration information of the audio object that this identifier can comprise from this audio stream.
Description of drawings
Fig. 1 and Fig. 2 are the block diagrams according to system of the present invention;
Fig. 3 and Fig. 4 are used for explanation according to PBC coding of the present invention;
Fig. 5 is used to illustrate the type according to DIFF coding of the present invention;
Fig. 6 to 8 is examples of having used the DIFF encoding scheme;
Fig. 9 is the block diagram that is used to illustrate according to the relation of one of at least three encoding schemes of selection of the present invention;
Figure 10 is the block diagram that is used to illustrate the relation of one of at least three encoding schemes of selection according to prior art;
Figure 11 and 12 is respectively the process flow diagram according to digital coding selection scheme of the present invention;
Figure 13 is used for explanation according to inner grouping of the present invention;
Figure 14 is used for explanation according to external packet of the present invention;
Figure 15 is used for explanation according to many groupings of the present invention;
Figure 16 and Figure 17 are respectively applied for the mixing grouping of explanation another embodiment according to the present invention;
Figure 18 is the exemplary view according to 1D of the present invention and 2D entropy (entropy) table;
Figure 19 is the exemplary view according to two kinds of methods of the 2D of being used for entropy coding of the present invention;
Figure 20 is according to the entropy coding scheme at the PBC coding result of the present invention;
Figure 21 is according to the entropy coding scheme at the DIFF coding result of the present invention;
Figure 22 is used to illustrate the method according to selective entropy table of the present invention;
Figure 23 is layering (hierarchical) figure according to data structure of the present invention;
Figure 24 is the block diagram according to the device that is used for audio compression and recovery of one embodiment of the present invention;
Figure 25 is the detailed diagram according to the spatial information encoding section of one embodiment of the present invention;
Figure 26 is the detailed diagram according to the spatial information lsb decoder of one embodiment of the present invention;
Figure 27 is the block diagram of the device that is used for processing signals according to an embodiment of the present invention;
Figure 28 is the example according to the signal structure according to transmission/method of reseptance of the present invention;
Figure 29 is the device that is used for processing signals of another embodiment according to the present invention;
Figure 30 is the IP packet structure;
Figure 31 is the example according to the transmission/received signal of the method and apparatus that is used for processing signals of the present invention;
Figure 32 is the example that is used to resolve the grammer of audio object;
Figure 33 is the example of the identifier of audio object shown in Figure 32;
Figure 34 is through overview and grade according to the voice data of the method and apparatus coding/decoding that is used for processing signals of the present invention;
Figure 35 is another example that is used to resolve the grammer of audio object; And
Figure 36 is the example of the identifier of audio object shown in Figure 35.
Embodiment
To coding method according to an embodiment of the present invention be described at first, below.This coding method can be used for the signal that sends according to the business based on IP (Internet protocol) is carried out coding/decoding.For example, audio coder/audio decoder can utilize MPEG disclosed herein to come signal is carried out coding/decoding around (surround) coding method.
In the present invention, the implication of coding comprises encoding process and decoding processing.Yet, it will be apparent to those skilled in the art that the specific coding processing is only applicable to coding or decoding processing, will be distinguished it in the explanation of appropriate section below.And, coding can also be called encoding and decoding (codec).
In the present invention, the step that signal is encoded will be divided into digital coding and entropy coding is illustrated.Yet, there is correlativity between digital coding and the entropy coding, this will describe in detail after a while.In the present invention, explanation is divided into groups to carry out the several different methods of digital coding and entropy coding effectively to data.Group technology has had the independent and efficient technical concept, and irrelevant with concrete data or entropy coding scheme.In the present invention, will be that example describes audio coding scheme with spatial information (for example, ISO/IEC 23003, MPEG around) in detail with the situation that adopts digital coding and entropy coding.
Fig. 1 and Fig. 2 are according to system of the present invention.Fig. 1 shows code device 1, and Fig. 2 shows decoding device 2.
With reference to Fig. 1, code device 1 according to the present invention comprises at least one packet portion 10, the first digital coding portion 20, the second digital coding portion 31, the 3rd digital coding portion 32, entropy coding portion 40 and bit stream multiplexing unit 50.
Optionally, the second digital coding portion 31 and the 3rd digital coding portion 32 can be integrated into a data encoding section 30.For example, carry out variable length code by 40 pairs in entropy coding portion through the second digital coding portion 31 and the 3rd digital coding portion 32 coded datas.Said modules is done following detailed description.
For example, data are distinguished according to data type by packet portion 10.And in the digital coding portion 20,31 and 32 one encodes to the data that distinguish.At data-handling efficiency, packet portion 10 is distinguished at least one group with in the data some.And, one in the digital coding portion 20,31 and 32 integrated data is encoded.In addition, the back is elaborated with reference to Figure 13 to the 17 pair of group technology of the operation of packet portion 10 that comprises according to the present invention.
In the digital coding portion 20,31 and 32 each is all encoded to the input data according to corresponding coding scheme.In the digital coding portion 20,31 and 32 each all adopts at least a in PCM (pulse code modulation (PCM)) scheme and the differential coding scheme.Specifically, the first digital coding portion 20 adopts the PCM scheme, and the second digital coding portion 31 adopts the first differential coding scheme of utilizing the pilot frequency benchmark value, and the 3rd digital coding portion 32 adopts the second differential coding scheme of the difference of utilization and for example adjacent data.
Hereinafter, for ease of explanation, the first differential coding scheme is called coding (PBC) based on pilot tone, and the second differential coding scheme is called differential coding (DIFF).And the operation with reference to Fig. 3 to 8 pair of data encoding section 20,31 and 32 after a while is elaborated.
Simultaneously, entropy coding portion 40 is with reference to entropy table 41, carries out variable length code according to the statistical nature of data.And the operation with reference to Figure 18 to 22 pair of entropy coding portion 40 after a while is elaborated.
Data behind 50 pairs of codings of bit stream multiplexing unit are arranged and/or are changed, and make it corresponding to transmitting standard, and the form with bit stream transmits through arrangement/data converted then.Yet if adopt particular system of the present invention not use bit stream multiplexing unit 50, those skilled in the art are with clear, and this system can be constituted as does not need bit stream multiplexer 50.
Simultaneously, decoding device 2 is constituted as corresponding with above-mentioned code device 1.
For example, with reference to Fig. 2, the bit stream of potential flow solution multiplexing unit 60 reception inputs and the default form of basis make an explanation to various information included in the bit stream that receives and classify.
First data decoding part 80, second data decoding part 91 and the 3rd data decoding part 92 are decoded accordingly with aforementioned first to the 3rd digital coding portion 20,31 and 32 respectively.
Specifically, carry out under the situation of differential decoding in second data decoding part 91 and the 3rd data decoding part 92, the overlapping decoding processing that can handle is incorporated in the decoding processing.
Like this, the present invention has used at least two kinds of encoding schemes simultaneously in order effectively to carry out digital coding, therefore is desirable to provide a kind of efficient coding scheme of utilizing the correlativity between the encoding scheme.
And the present invention aims to provide the multiple data coded data grouping scheme that is used for effectively carrying out.
And the present invention aims to provide a kind of data structure that comprises feature of the present invention.
When technical concept of the present invention is applied to various system, it will be apparent to those skilled in the art that and to use various other structures together with assembly illustrated in figures 1 and 2.For example, need carry out data-measuring or need controller to control above-mentioned processing.
[digital coding]
Below PCM (pulse code modulation (PCM)), PBC (based on the coding of pilot tone) and the DIFF (differential coding) that can be used as data coding scheme of the present invention is elaborated.In addition, subsequently, effective selection and correlativity that also can the logarithm encoding scheme describe.
1, PCM (pulse code modulation (PCM))
PCM is a kind of encoding scheme that analog signal conversion is become digital signal.PCM, quantizes accordingly result analog signal sampling then with predetermined interval.May there be shortcoming in PCM aspect code efficiency, but can be used to be not suitable for the back effectively with the PBC of explanation or the data of DIFF encoding scheme.
In the present invention, when carrying out digital coding, PCM and PBC or DIFF encoding scheme are used together, describe with reference to Fig. 9 to 12 after a while.
2, PBC (based on the coding of pilot tone)
The notion of 2-1, PBC
PBC be a kind of in the data set of distinguishing, determine special datum and use between the data relation as the coding target and the encoding scheme of definite benchmark.
Can be reference value, pilot tone, pilot frequency benchmark value or pilot value with value defined as the benchmark of using PBC.Hereinafter, for ease of explanation, it is called the pilot frequency benchmark value.
And, the difference between the data in pilot frequency benchmark value and a group can be called difference or pilot tone poor.
And, represented to have been used final group of specific cluster scheme as the data set of unit in order to use PBC by aforementioned data grouping portion 10.Packet can be carried out according to multiple mode, after a while it is elaborated.
In the present invention, will be for having the parameter of data definition that specific meanings is divided into groups in a manner described for illustrating.This is for convenience of explanation, and can replace with different terms.
PBC processing according to the present invention comprises following at least two steps.
At first, select and the corresponding pilot frequency benchmark value of a plurality of parameters.In this case, decide this pilot frequency benchmark value with reference to parameter as the PBC target.
For example, the pilot frequency benchmark value is arranged to from as the mean value of the parameter of PBC target, as the approximate value of the mean value of the parameter of target, with as the corresponding intermediate value of intergrade of the parameter of target and as the value of selecting in the most frequently used value the parameter of target.And, the pilot frequency benchmark value can also be arranged to default default value.And, can be by in default table, selecting to decide pilot value.
Alternatively, in the present invention, interim pilot frequency benchmark value is arranged to the pilot frequency benchmark value selected by at least two kinds in the system of selection of multiple pilot frequency benchmark value, at each situation calculation code efficient, will be chosen as final pilot frequency benchmark value with the corresponding interim pilot frequency benchmark value of the situation with optimum coding efficient then.
When mean value was P, the approximate value of mean value was Ceil[P] or Floor[P].In this case, Ceil[x] be the maximum integer that is no more than x, and Floor[P] for surpassing the smallest positive integral of x.
Yet, can also select fixing arbitrarily default value, and not need with reference to parameter as the PBC target.
Again for example, as previously mentioned, after selecting the several values that may be selected to be pilot tone at random and a plurality ofly, the value that shows optimum coding efficient can be chosen as optimum pilot tone.
Then, seek difference between the parameter in selected pilot tone and one group.For example, come calculated difference by from parameter value, deducting the pilot frequency benchmark value as the PBC target.With reference to Fig. 2 and Fig. 4 this is explained as follows.
Fig. 3 and 4 is used for explanation according to PBC coding of the present invention.
For example, supposing has a plurality of parameters (for example, 10 parameters) in the group, have following parameter value: X[n respectively]=11,12,9,12,10,8,12,9,10,9.
If selected the PBC scheme to come the parameter in this group is encoded, then should at first select the pilot frequency benchmark value.In this embodiment, can see that in Fig. 4, the pilot frequency benchmark value is configured to 10.
As previously mentioned, can select the pilot frequency benchmark value by the whole bag of tricks of selecting the pilot frequency benchmark value.
Calculate the difference of PBC gained according to formula 1.
[formula 1]
D[n]=x[n]-P, wherein, n=0,1,, 9.
In this case, P represents pilot frequency benchmark value (=10) and x[n] be the target component of digital coding.
According to the result of the PBC of formula 1 corresponding to d[n]=1,2 ,-1,2,0 ,-2,2 ,-1,0 ,-1.That is, the result of PBC coding comprises selected pilot frequency benchmark value and the d[n that calculates].And these values are as after a while with the target of entropy coding of explanation.In addition, PBC is more effective under the whole less situation of the deviation of target component value.
2-2, PBC object
The target of PBC coding is not designated as one.Can be by the encode numerical data of various signals of PBC, for example, can be applicable to after a while audio coding with explanation.In the present invention, will be elaborated with the target of the simultaneously treated extra control data of voice data as the PBC coding.
Except falling of audio frequency mixed (downmixed) signal, go back communications of control data and be used for the reconstruct audio frequency subsequently.In the following description, control data is defined as spatial information or spatial parameter.
Spatial information comprises various spatial parameters, as channel grade poor (hereinafter being abbreviated as CLD), inter-channel correlation (hereinafter being abbreviated as ICC), channel prediction coefficient (hereinafter, being abbreviated as CPC) etc.
Specifically, CLD is the parameter of the energy difference between two different channels of expression.For example, the value of CLD 15 and+change between 15.ICC is the parameter of the correlativity between two different channels of expression.For example, the value of ICC changes between 0 and 7.And CPC is the parameter that expression is used for generating according to two channels the prediction coefficient of three channels.For example, the value of CPC changes between 20 and 30.
As the target of PBC coding, can comprise the yield value of the gain that is used for conditioning signal, for example, ADC (fall arbitrarily and mix gain).
And, be applied to fall audio mixing frequently the ATD (setting data arbitrarily) of any channel switch frame of signal can be used as PBC coding target.Specifically, ADG is the parameter that is different from CLD, ICC or CPC.That is, ADG makes it to be different from the parameter of the spatial information (as CLD, ICC, CPC etc.) that extracts from the channel of sound signal corresponding to the gain that is used for regulating audio frequency.Yet, for example, can handle ADG or ATD, thereby improve the efficient of audio coding by the mode identical with aforementioned CLD.
As another target of PBC coding, can consider local parameter.In the present invention, local parameter is meant the part of parameter.
For example, suppose that special parameter shows as the n position, is divided at least two parts with the n position.And, can respectively these two parts be defined as first local parameter and second local parameter.Situation for attempting carrying out the PBC coding can find the difference between first local parameter values and the pilot frequency benchmark value.Yet second local parameter that is left out in asking difference calculating should transmit as independent value.
More particularly, for example, for the situation of being represented parameter value by the n position, (LSB) is defined as second local parameter with least significant bit (LSB), and the individual high-order parameter value that constitutes of residue (n-1) can be defined as first local parameter.In this case, only can carry out PBC to first local parameter.This is because code efficiency may strengthen because of the less deviation between (n-1) individual high-order first local parameter values that constitutes.
Be transmitted in individually and ask second local parameter that is left out in the difference calculating, by lsb decoder reconstruct final argument the time, take in then.Alternatively, can also obtain second local parameter rather than transmit second local parameter individually by predetermined scheme.
Utilize the use of PBC coding of the characteristic of local parameter to be restricted according to the characteristic of target component.
For example, as previously mentioned, the deviation between first local parameter should be less.If this deviation is bigger, then do not need to utilize local parameter.It in addition may deterioration code efficiency.
According to experimental result, the CPC parameter of aforesaid space information is suitable for the application of PBC scheme.Yet it is not preferred that the CPC parameter is applied to the rudenss quantization scheme.For the rough situation of quantization scheme, the deviation between first local parameter increases.
In addition, utilize the digital coding of local parameter also to be applicable to DIFF scheme and PBC scheme.
Situation for the local parameter notion being applied to the CPC parameter is explained as follows signal processing method and the device that is used for reconstruct.
For example, the method for utilizing local parameter to come processing signals according to the present invention comprises that utilization obtains the step of first local parameter and utilizes this first local parameter and second local parameter to decide the step of parameter with the corresponding reference value of first local parameter with the corresponding difference of this reference value.
In this case, reference value is pilot frequency benchmark value or difference reference value.And first local parameter comprises the part position of this parameter, and second local parameter comprises all the other positions of this parameter.And second local parameter comprises the least significant bit (LSB) of this parameter.
The parameter that this signal processing method also comprises utilization and determined is come the step of reconstructed audio signal.
This parameter is at least one the spatial information that comprises among CLD, ICC, CPC and the ADG.
Not rough, then can obtain second local parameter if if this parameter is the quantization scale of CPC and this parameter.
And, by doubly taking advantage of this local parameter and the multiplied result and the second local parameter addition being decided final argument.
The device that utilizes local parameter to come processing signals according to the present invention comprises utilizing and obtains the first parameter acquisition portion of first local parameter with the corresponding reference value of first local parameter with the corresponding difference of this reference value and utilize first local parameter and second local parameter decides the parameter determination unit of parameter.
This signal processing apparatus also comprises by receiving the second parameter acquisition portion that second local parameter obtains second local parameter.
And the first parameter acquisition portion, parameter determination unit and the second local parameter acquisition portion are included in aforementioned data lsb decoder 91 or 92.
The method of utilizing local parameter to come processing signals according to the present invention comprises parameter is divided into the step of first local parameter and second local parameter and utilizes the step that generates difference with the corresponding reference value of first local parameter and first local parameter.
And this signal processing method also comprises the step that transmits this difference and second local parameter.
The device that utilizes local parameter to come processing signals according to the present invention comprises parameter is divided into the parameter division portion of first local parameter and second local parameter and utilizes the difference generating unit that generates difference with the corresponding reference value of first local parameter and first local parameter.
And this signal processing apparatus also comprises the parameter efferent that transmits this difference and second local parameter.
And parameter division portion and difference generating unit are included in aforementioned data encoding section 31 or 32.
2-3, PBC condition
Select independent pilot frequency benchmark value then selected pilot frequency benchmark value to be included under the situation in the bit stream at PBC of the present invention coding, the transfer efficiency of PBC coding becomes probably less than after a while with the DIFF encoding scheme of explanation.
Thereby the present invention aims to provide a kind of top condition that is used to carry out the PBC coding.
If the quantity as the data of the target of the digital coding in a group in experiment is at least three or higher, then be suitable for the PBC coding.This result during corresponding to the efficient of considering digital coding.This means that if only there are two data in one group, then DIFF or pcm encoder are more effective than the PBC coding.
Although the PBC coding is applicable to a three or more at least data, preferably, the PBC coding is applied to exist in one group at least five data conditions.In other words, PBC coding the situation of effective application be to have at least five as the data of the target of digital coding and the less situation of deviation between this at least five data.And the minimum number that is suitable for carrying out the PBC coded data will depend on system and coding environment.
Data have been specified at each data tape as the target of digital coding.This will be illustrated by the packet transaction that will illustrate after a while.Thereby for example, the present invention proposes, and after a while the mpeg audio of explanation being encoded around application PBC in encoding needs at least five data tapes.
Hereinafter, signal processing method and the device that the condition of PBC is carried out in utilization is explained as follows.
In signal processing method according to one embodiment of the present invention, if obtained quantity with the corresponding data of pilot frequency benchmark value, if and the quantity of data tape satisfy pre-conditioned, then obtain the pilot frequency benchmark value and with the corresponding pilot tone difference of this pilot frequency benchmark value.Subsequently, utilize this pilot frequency benchmark value and pilot tone difference to obtain data.Specifically, utilization comprises that the quantity of the frequency ranges of data of these data obtains the quantity of data.
In the signal processing method of another embodiment, utilize the quantity of data to decide a kind of in the several data encoding scheme, and these data are decoded according to the data coding scheme that is determined according to the present invention.The several data encoding scheme comprises the pilot codes scheme at least.If it is pre-conditioned that the quantity of data satisfies, then the data coding scheme decision is the pilot codes scheme.
And this data decode is handled and is comprised and obtaining with the corresponding pilot frequency benchmark value of a plurality of data with the step of the corresponding pilot tone difference of this pilot frequency benchmark value with utilize this pilot frequency benchmark value and the pilot tone difference obtains the step of data.
And in this signal processing method, data are parameters.And, utilize parameter to recover sound signal.Do the corresponding identification information of quantity of reception and parameter and the identification information that utilization receives generate the quantity (numbering of parameter in this signal processing method?).By the quantity (numbering of considering data?), hierarchically extract the identification information of representing a plurality of data.
In the step of extracting identification information, extract first identification information of expression first data coding scheme, utilize the quantity (numbering of this first identification information and data then?) extract the expression second data coding scheme second identification information.In this case, first identification information has represented whether it is the DIFF encoding scheme.And second identification information has represented that it is pilot codes scheme or PCM grouping scheme.
In the signal processing method of another embodiment according to the present invention, pre-conditioned if the quantity of a plurality of data satisfies, then utilize with the corresponding pilot frequency benchmark value of a plurality of data and these data and generate the pilot tone difference.Transmit the pilot tone difference that is generated then.In this signal processing method, transmit the pilot frequency benchmark value.
In the signal processing method of another embodiment, decide data coding scheme according to the quantity of a plurality of data according to the present invention.Come data are encoded according to the data coding scheme that is determined then.In this case, the several data encoding scheme comprises the pilot codes scheme at least.If the quantity of data satisfies preset standard, then the data coding scheme decision is the pilot codes scheme.
According to the device that is used for processing signals of one embodiment of the present invention comprise obtain with the quantity acquisition portion of the quantity of the corresponding data of pilot frequency benchmark value, when the quantity of data satisfy obtain when pre-conditioned the pilot frequency benchmark value and with the value acquisition portion of the corresponding pilot tone difference of this pilot frequency benchmark value, and utilize this pilot frequency benchmark value and pilot tone difference to obtain the data acquisition portion of data.In this case, quantity acquisition portion, value acquisition portion and data acquisition portion are included in aforementioned data lsb decoder 91 or 92.
The device that is used for processing signals of another embodiment comprises that quantity according to a plurality of data decides a kind of scheme determination section of several data encoding scheme according to the present invention, and the lsb decoder of data being decoded according to the data coding scheme that is determined.In this case, the several data encoding scheme comprises the pilot codes scheme at least.
The device that is used for processing signals of another embodiment comprises that quantity when a plurality of data satisfies and utilizes value generating unit that generates the pilot tone difference with the corresponding pilot frequency benchmark value of a plurality of data and these data and the efferent that transmits the pilot tone difference that is generated when pre-conditioned according to the present invention.In this case, the value generating unit is included in aforementioned data encoding section 31 or 32.
The device that is used for processing signals of another embodiment comprises that quantity according to a plurality of data decides the scheme determination section of data coding scheme according to the present invention, and the encoding section of data being encoded according to the data coding scheme that is determined.In this case, the several data encoding scheme comprises the pilot codes scheme at least.
2-4, PBC signal processing method
Signal processing method and device to the PBC of utilization coding characteristic according to the present invention are explained as follows.
In signal processing method according to one embodiment of the present invention, obtain with the corresponding pilot frequency benchmark value of a plurality of data and with the corresponding pilot tone difference of this pilot frequency benchmark value.Subsequently, utilize this pilot frequency benchmark value and pilot tone difference to obtain data.And this method can also comprise at least one step of decoding in pilot frequency benchmark value and the pilot tone difference.In this case, the data of application PBC are parameters.And this method can also comprise that parameter that utilization obtains comes the step of reconstructed audio signal.
According to the device that is used for processing signals of one embodiment of the present invention comprise with the corresponding pilot frequency benchmark value of a plurality of data and with the corresponding pilot tone difference of this pilot frequency benchmark value, and utilize this pilot frequency benchmark value and this pilot tone difference to obtain the data acquisition portion of data.In this case, value acquisition portion and data acquisition portion are included in aforementioned data encoding section 91 or 92.
The signal processing method of another embodiment comprises that utilization and the corresponding pilot frequency benchmark value of a plurality of data and these data generate the step of pilot tone difference and the step of the pilot tone difference that output is generated according to the present invention.
The device that is used for processing signals of another embodiment comprises that utilization and the corresponding pilot frequency benchmark value of a plurality of data and these data generate the value generating unit of pilot tone difference according to the present invention, and the efferent of exporting the pilot tone difference that is generated.
According to the present invention the method for the processing signals of another embodiment comprise obtain with the corresponding pilot frequency benchmark value of a plurality of gains and with the step of the corresponding pilot tone difference of this pilot frequency benchmark value, and the step of utilizing this pilot frequency benchmark value and this pilot tone difference to obtain to gain.And this method can also comprise at least one step of decoding in pilot frequency benchmark value and the pilot tone difference.And this method can also comprise that gain that utilization obtains comes the step of reconstructed audio signal.
In this case, the pilot frequency benchmark value can be the most frequently used value of the average intermediate value of the mean value of a plurality of gains, a plurality of gains, a plurality of gains, a value that is configured to default value or extracts from table.And this method can also be included in in a plurality of gains each and be provided with the gain of selecting to have high coding efficiency after the pilot frequency benchmark value step as final pilot frequency benchmark value.
According to the present invention the device that is used for processing signals of another embodiment comprise obtain with the corresponding pilot frequency benchmark value of a plurality of gains and with the value acquisition portion of the corresponding pilot tone difference of this pilot frequency benchmark value, and the gain acquisition portion that utilizes this pilot frequency benchmark value and this pilot tone difference to obtain to gain.
The method of the processing signals of another embodiment comprises that utilization and the corresponding pilot frequency benchmark value of a plurality of gains and these gains generate the step of pilot tone difference according to the present invention, and the step of exporting the pilot tone difference that is generated.
And the device that is used for processing signals of another embodiment comprises that utilization and the corresponding pilot frequency benchmark value of a plurality of gains and these gains generate the value calculating part of pilot tone difference according to the present invention, and the efferent of exporting the pilot tone difference that is generated.
3, DIFF (differential coding)
DIFF coding is the encoding scheme of the relation between a plurality of data that exist in the data set that distinguishes of a kind of utilization, and it can be called as differential coding.In this case, the data set as the unit of using DIFF is meant that aforementioned data grouping portion 10 uses final group of specific cluster scheme.In the present invention, the data definition with specific meanings with grouping in a manner described is the parameter that will illustrate.And, this with for illustrated the same of PBC.
Specifically, the DIFF encoding scheme is a kind of difference of using between a plurality of parameters that exist in same group, more particularly, uses the difference between the adjacent parameter, encoding scheme.
With reference to Fig. 5 to 8, the type and the detailed applications example of DIFF encoding scheme is explained as follows.
3-1, DIFF type
Fig. 5 is used to illustrate the type according to DIFF coding of the present invention.DIFF distinguishes according to the direction of the difference between searching and the adjacent parameter.
For example, the DIFF type of coding can be divided into along frequency direction DIFF (hereinafter being abbreviated as DIFF_FREQ or DF) and along the DIFF (hereinafter being abbreviated as DIFF_TIME or DT) of time orientation.
With reference to Fig. 5, group 1 expression is along the DIFF (DF) of frequency axis calculated difference, and organizes 2 or organize 3 and come calculated difference along time shaft.
In Fig. 5, as can be seen, redistrict DIFF (DT) along the time shaft calculated difference to seek difference according to the direction of time shaft.
For example, be applied to organize 2 DIFF (DT) corresponding to the scheme (for example, group 1) of between the parameter value of the parameter value of current time and previous moment, seeking difference.This is called as the back to time D IFF (DT) (hereinafter being abbreviated as DT-BACKWARD).
For example, be applied to organize 3 DIFF (DT) corresponding to the scheme (for example, group 4) of between the parameter value of current time and next parameter value constantly, seeking difference.This is called as forward direction time D IFF (DT) (hereinafter being abbreviated as DT-FORWARD).
Therefore, as shown in Figure 5, group 1 is DIFF (DF) encoding scheme, and group 2 is DIFF (DT-BACKWARD) encoding schemes, is DIFF (DT-FORWARD) encoding schemes and organize 3.Yet the encoding scheme of group 4 is not decision also.
In the present invention, although just will be defined as a kind of encoding scheme (for example, DIFF (DF)) along the DIFF of frequency axis, equally can be by it being distinguished into DIFF (DF-TOP) and DIFF (DF-BOTTM) defines.
3-2, DIFF examples of applications
Fig. 6 to 8 is examples of using the DIFF encoding scheme.
In Fig. 6,, be example with group shown in Figure 51 and group 2 for ease of explanation.Group 1 is abideed by DIFF (DF) encoding scheme, and its parameter value is x[n]=11,12,9,12,10,8,12,9,10,9.Group 2 is followed DIFF (DF-BACKWARD) encoding scheme, and its parameter is y[n]=10,13,8,11,10,7,14,8,10,8.
Fig. 7 shows the result of the difference of calculating group 1.Because organize 1 according to DIFF (DF) encoding scheme coding, so come calculated difference by formula 2.Formula 2 is illustrated in the difference of seeking on the frequency axis with last parameter.
[formula 2]
d[0]=x[0]
D[n]=x[n] x[n-1], n=1 wherein, 2,, 9.
Specifically, group 1 is d[n through DIFF (DF) result of formula 2]=-11,1 ,-3,3 ,-2 ,-2,4 ,-3,1 ,-1.
Fig. 8 shows the result of the difference of calculating group 2.Because organize 2 according to DIFF (DF-BACKWARD) encoding scheme coding, so come calculated difference by formula 3.Formula 3 is illustrated in the difference of seeking on the time shaft with last parameter.
[formula 3]
D[n]=y[n] x[n], n=1 wherein, 2,, 9.
Specifically, group 2 is d[n through DIFF (DF-BACKWARD) result of formula 3]=-1,1 ,-1 ,-1,0,01,2 ,-1,0 ,-1.
4, for the selection of data coding scheme
The invention is characterized in, compress or reconstruct data by mixing various data coding schemes.Thereby, when particular group is encoded, must from least three kinds or more kinds of data coding scheme, select an encoding scheme.And, the identification information at selected encoding scheme should be delivered to lsb decoder via bit stream.
To the method for selection data coding scheme according to the present invention with utilize the coding method of this data coding scheme and device to be explained as follows.
Method according to the processing signals of one embodiment of the present invention comprises step that obtains the digital coding identification information and the step of data being carried out data decode according to the represented data coding scheme of this digital coding identification information.
In this case, this data coding scheme comprises the PBC encoding scheme at least.And corresponding pilot frequency benchmark value of the utilization of PBC encoding scheme and a plurality of data and pilot tone difference come data are decoded.And this pilot tone difference utilizes these data to generate with this pilot frequency benchmark value.
Data coding scheme also comprises the DIFF encoding scheme.The DIFF encoding scheme is corresponding to one in DIFF-DF scheme and the DIFF-DT scheme.And the DIFF-DT scheme is in time D IFF-DT (BACKWARD) one corresponding to forward direction time D IFF-DT (FORWARD) scheme and back.
This signal processing method also comprises the step that obtains the entropy coding identification information, and utilizes the represented entropy coding scheme of this entropy coding identification information data to be carried out the step of entropy decoding.
In the data decode step, the data of decoding through entropy are carried out data decode by this data coding scheme.
And this signal processing method comprises that also with these data be the step that parameter decodes sound signal.
The device that is used for processing signals according to one embodiment of the present invention comprises
Obtain the identification information acquisition portion of digital coding identification information, and the lsb decoder that data is carried out data decode according to the represented data coding scheme of this digital coding identification information.
In this case, data coding scheme comprises the PBC encoding scheme at least.And corresponding pilot frequency benchmark value of the utilization of PBC encoding scheme and a plurality of data and pilot tone difference come data are decoded.And the pilot tone difference is utilized data and pilot frequency benchmark value and is generated.
The method of the processing signals of another embodiment comprises the step of data being carried out digital coding according to data coding scheme according to the present invention, and the step that generates and transmit the digital coding identification information of this data coding scheme of expression.
In this case, data coding scheme comprises the PBC encoding scheme at least.Corresponding pilot frequency benchmark value of the utilization of PBC encoding scheme and a plurality of data and pilot tone difference come data are encoded.And the pilot tone difference is utilized data and pilot frequency benchmark value and is generated.
The device that is used for processing signals of another embodiment comprises the encoding section of data being carried out digital coding according to data coding scheme according to the present invention, and the efferent that generates and transmit the digital coding identification information of this data coding scheme of expression.
In this case, data coding scheme comprises the PBC encoding scheme at least.Corresponding pilot frequency benchmark value of the utilization of PBC encoding scheme and a plurality of data and pilot tone difference come data are encoded.And the pilot tone difference is utilized data and pilot frequency benchmark value and is generated.
To the method for selection data coding scheme according to the present invention with transmit coding by optimum transmission efficiency and select the method for identification information to be explained as follows.
4-1, considered the digital coding identification method of frequency of utilization
Fig. 9 is used for illustrating the block diagram of selecting one relation at least three kinds of encoding schemes according to of the present invention.
With reference to Fig. 9, suppose that the frequency of utilization of first to the 3rd digital coding portion 53,52 of existence and 51, the first digital coding portions 53 is minimum, and the frequency of utilization of the 3rd digital coding portion 51 is the highest.
For ease of explanation, for sum 100, the frequency of utilization of supposing the first digital coding portion 53 is that the frequency of utilization of 10, the second digital coding portions 52 is 30, and the frequency of utilization of the 3rd digital coding portion 51 is 60.Specifically, for 100 data sets, can think that the PCM scheme used 10 times, the PBC scheme has been used 30 times, and the DIFF scheme has been used 60 times.
Based on above-mentioned supposition, calculate the figure place of three kinds of required identification informations of encoding scheme of sign by following mode.
For example, according to Fig. 9, because used 1 first information, so identify the encoding scheme of 100 groups altogether as the first information with 100.Because identify the 3rd the highest digital coding portion 51 of frequency of utilization by 100, so the remainder of 1 second information only utilizes 40 just can distinguish the first digital coding portion 53 and the second digital coding portion 52.
Therefore, be used to the identification information of 100 every group coding types of data group selection altogether to need 140 altogether, get by the first information (100)+second information (40).
Figure 10 is the block diagram that is used for illustrating according to a kind of relation of at least three kinds of encoding schemes of selection of prior art.
As Fig. 9, for ease of explanation, for sum 100, the frequency of utilization of supposing the first digital coding portion 53 is that the frequency of utilization of 10, the second digital coding portions 52 is 30, and the frequency of utilization of the 3rd digital coding portion 51 is 60.
In Figure 10, calculate the figure place of the required identification information of three kinds of encoding scheme types of sign by following mode.
At first, according to Figure 10, because used 1 first information, so identify the encoding scheme of 100 groups altogether as the first information with 100.
The first digital coding portion 53 that frequency of utilization is minimum preferentially identifies by 100.Thereby the remainder of 1 second information 90 multidigits is altogether distinguished the second digital coding portion 52 and the 3rd digital coding portion 51.
Therefore, be used to the identification information of 100 every group coding types of data group selection altogether to need 190 altogether, get by the first information (100)+second information (90).
Situation more shown in Figure 9 and situation shown in Figure 10 as can be seen, digital coding shown in Figure 9 selects identification information more favourable aspect transfer efficiency.
That is,, the invention is characterized in, utilize different identification informations, rather than distinguishing similar each other two kinds of encoding scheme types aspect the frequency of utilization by identical identification information for there being three or more data encoding schemes.
For example, for the situation that as shown in figure 10 the first digital coding portion 53 and the second digital coding portion 52 are classified as same identification information, the data transmission position increases, and has reduced transfer efficiency.
Situation for there being at least three kinds of digital coding types the invention is characterized in, distinguishes the highest data coding scheme of frequency of utilization by the first information.Thereby, distinguish all the other lower two kinds of encoding schemes of frequency of utilization by second information.
Figure 11 and Figure 12 are respectively the process flow diagrams according to digital coding selection scheme of the present invention.
In Figure 11, suppose that the DIFF coding is to use the highest data coding scheme of frequency.In Figure 12, suppose that the PBC coding is to use the highest data coding scheme of frequency.
With reference to Figure 11, check whether there is the minimum pcm encoder of frequency of utilization (S10).As previously mentioned, carry out this inspection by the first information that is used to identify.
As check result, if pcm encoder then checks whether be PBC coding (S20).This is to carry out by second information that is used to identify.
For the frequency of utilization of DIFF coding be 60 times situation in 100 times altogether, is used to 140 altogether of the identification information needs of 100 identical every group coding types of data group selection, that is, and and the first information (100)+second information (40).
With reference to Figure 12,, check whether there is the minimum pcm encoder of frequency of utilization (S30) as Figure 11.As previously mentioned, carry out this inspection by the first information that is used to identify.
As check result, if pcm encoder then checks whether be DIFF coding (S40).This is to carry out by second information that is used to identify.
For the frequency of utilization of DIFF coding be 80 times situation in 100 times altogether, and the identification information needs that are used to 100 identical every group coding types of data group selection 120 are altogether, that is, and and the first information (100)+second information (20).
To the method for a plurality of data coding schemes of sign according to the present invention with utilize the signal processing method of this method and device to be explained as follows.
Comprise the step of the identification information that hierarchically extracts a plurality of data coding schemes of expression according to the method for the processing signals of one embodiment of the present invention, and according to the step of data being decoded with the corresponding data coding scheme of this identification information.
In this case, the PBC encoding scheme that from different layers, comprises in a plurality of data coding scheme of extraction expression and the identification information of DIFF encoding scheme.
In decoding step, utilize the difference that generates with utilizing these data with the corresponding reference value of a plurality of data, obtain these data according to data coding scheme.In this case, reference value is pilot frequency benchmark value or difference reference value.
The method of the processing signals of another embodiment comprises the step of the identification information that hierarchically extracts three or more at least data encoding schemes of expression according to the present invention.In this case, from different layers, extract the identification information of two high data encoding schemes of expression frequency of utilization.
The method of the processing signals of another embodiment comprises the step of hierarchically extracting identification information according to the frequency of utilization of identification information of expression data coding scheme according to the present invention, and according to the step of data being decoded with the corresponding data decode scheme of this identification information.
In this case, extract this identification information in the mode of hierarchically extracting first identification information and second identification information.First identification information represents whether be first data coding scheme, and second identification information represents whether be second data coding scheme.
First identification information represents whether be the DIFF encoding scheme.And second identification information is represented pilot codes scheme or PCM grouping scheme.
First data coding scheme can be the pcm encoder scheme.And second data coding scheme can be PBC encoding scheme or DIFF encoding scheme.
Data are a plurality of parameters, and this signal processing method also comprises the step of utilizing these parameter reconstructs to go out sound signal.
According to the device of the processing signals of one embodiment of the present invention comprise hierarchically extract the identification information of distinguishing a plurality of data coding schemes the identifier extraction unit (for example, among Figure 13 710), and according to the lsb decoder of data being decoded with the corresponding data coding scheme of this identification information.
The method of the processing signals of another embodiment comprises the step of data being encoded according to data coding scheme according to the present invention, and the step that generates the identification information be used to distinguish the different data coding scheme of the frequency of utilization each other used when data are encoded.
In this case, identification information is distinguished from each other out pcm encoder scheme and PBC encoding scheme.Specifically, identification information distinguishes pcm encoder scheme and DIFF encoding scheme.
And, the device of the processing signals of another embodiment comprises the encoding section of data being encoded according to data coding scheme according to the present invention, and the identification information generating unit (for example, 400 among Figure 11) that generates the identification information be used to distinguish the different data coding scheme of the frequency of utilization each other when data are encoded, used.
Concern between 4-2, digital coding
At first, there are independence and/or dependence mutually between PCM of the present invention, PBC and the DIFF.For example, can choose at random a kind of in three kinds of type of codings for each group as the target of digital coding.Thereby whole digital coding has utilized the result of three kinds of encoding scheme types with having produced combination with one another.Yet,, first and foremost select a kind of in the DIFF encoding scheme of frequency of utilization optimum and all the other two kinds of encoding schemes (for example, PCM and PBC) by considering the frequency of utilization of three kinds of encoding scheme types.Subsequently, a kind of among PCM and the PBC selected in inferior strategic point.Yet as previously mentioned, this is in order to consider the transfer efficiency of identification information, but not owing to the similarity of basic coding scheme.
With regard to the similarity of encoding scheme, PBC and DIFF are similar each other aspect calculated difference.Thereby the encoding process of PBC and DIFF significantly overlaps each other.Specifically, come the step of reconstruct initial parameter to be defined as the δ decoding according to difference and can be designed in same step, handle in when decoding.
In the process of carrying out PBC coding or DIFF coding, may there be the parameter that departs from its scope.In this case, it is necessary encoding and transmit relevant parameters by independent PCM.
[grouping]
1, Fen Zu notion
The present invention proposes grouping, promptly when coding, consider by specified data is bundled deal with data for efficient.Specifically,,, the pilot frequency benchmark value selects, so packet transaction need be finished as carrying out PBC coding step before because being unit with the group for the situation of PBC coding.This grouping is applied to the DIFF coding in an identical manner.And, also being applicable to entropy coding according to some schemes of grouping of the present invention, can be described in corresponding description part after a while.
The manner of execution of can just dividing into groups is divided into external packet and inner grouping with packet type of the present invention.
Alternatively, the target of can just dividing into groups is divided into territory grouping, packet and channel packet with packet type of the present invention.
Alternatively, can just divide into groups execution sequence with packet type of the present invention be divided into first the grouping, second the grouping and the 3rd grouping.
Alternatively, can just divide into groups to carry out counting (count) packet class of the present invention is divided into single grouping and many groupings.
Yet above-mentioned packet class proposes for ease of changing notion of the present invention, and it does not apply restriction to the term that uses.
Grouping according to the present invention is to finish according to the mode that various grouping schemes in use overlap each other or combination with one another is used.
In the following description, be described by the mode that will packet zone according to the present invention be divided into inner grouping and external packet.To many groupings of multiple packet type coexistence be described subsequently.And, will the notion of territory grouping and packet be described.
2, inner grouping
Inner grouping is meant that the execution of grouping is that carry out inside.If carry out the inside grouping usually, then carry out inside and divide into groups again, with (divided) group that generates new group or cut apart to last group.
Figure 13 is used for explanation according to inner grouping of the present invention.
With reference to Figure 13, inner grouping according to the present invention is for example undertaken by frequency domain unit's (hereinafter being called band (band)).Thereby inner grouping scheme sometimes may be corresponding to the grouping of a kind of territory.
If sampled data is through specific filter, for example, QMF (quadrature mirror filter) has then generated a plurality of subbands.Under the subband pattern, carry out the first frequency grouping, to generate first group of band that can be called the parameter band.The first frequency grouping can generate a plurality of parameter bands by brokenly subband being bundled.Thereby, can be not etc. ground constitute the size of parameter band.Yet,, can constitute the parameter band according to the coding purpose with being equal to.And, the step that generates subband can be categorized as a kind of grouping.
Subsequently, the parameter band that generates is carried out the second frequency grouping, to generate second group of band that can be called as data tape.The second frequency grouping can generate a plurality of data tapes by unifying the parameter band with unified numbering (uniform number).
According to the purpose of encoding after finishing grouping, can with first group of band corresponding parameter band be the unit or with second group of corresponding data tape of band be that coding is carried out in the unit.
For example, when application of aforementioned PBC encodes, can be considered as a group by the parameter band that will be grouped into or be considered as a group selecting pilot frequency benchmark value (a kind of group of reference value) by the data tape that will be grouped into.Explanation is identical in utilizing detail operations that selected pilot frequency benchmark value carries out PBC and PBC and the front being described.
Again for example, when application of aforementioned DIFF encodes, be considered as a group by the parameter band that will be grouped into and decide group reference value, calculated difference then.Alternatively, can also be considered as a group by the data tape that will be grouped into and decide the group reference value, and calculated difference.And explanation was identical during the detail operations of DIFF and front were described.
If with first and/or frequency grouping be applied to actual coding, then must transmit corresponding information, be described with reference to Figure 23 after a while.
3, external packet
External packet is meant that the execution of grouping is that carry out the outside.If carry out external packet usually, then carry out the outside and divide into groups again, to generate (combined) group of new group or combination to last group.
Figure 14 is used for explanation according to external packet of the present invention.
With reference to Figure 14, for example carry out by time domain unit's (hereinafter being called time slot) according to external packet of the present invention.Thereby external packet sometimes may be corresponding to the grouping of a kind of territory.
The frame that comprises sampled data is carried out very first time grouping, to generate first group of time slot.Figure 14 exemplarily shows the situation that generates eight time slots.Very first time grouping also has the implication that a frame is divided into onesize time slot.
In the time slot that selection generates by very first time grouping at least one.Figure 14 shows the selected situation of time slot 1,4,5 and 8.According to encoding scheme, can select to select in the step whole time slots at this.
Then, selected time slot 1,4,5 is rearranged into time slot 1,2,3 and 4 with 8.Yet the purpose according to coding can partly rearrange selected time slot 1,4,5 and 8.In this case, because the time slot that is excluded beyond rearranging will be excluded beyond final group forms, so it is got rid of from PBC or DIFF coded object.
Selected time slot is carried out second time packet, to be formed in the group that is handled simultaneously on the final time shaft.
For example, can with time slot 1 and 2 or time slot 3 and 4 constitute a group, it is right that it is called as time slot.Again for example, time slot 1,2 and 3 can constitute a group, and it is called as three time slot groups.And, may there be the single time slot that does not constitute a group with another time slot.
Situation for the grouping of first and second time slots being applied to actual coding needs to transmit corresponding information, is explained with reference to Figure 23 after a while.
4, many groupings
Many groupings are meant the grouping scheme by inside grouping, external packet and various other packet assembling are generated final group together.As previously mentioned, each grouping scheme according to the present invention can overlap each other or combination with one another ground is used.And, be in order to improve the efficient of various encoding schemes with dividing into groups as a kind of scheme more.
4-1, blend interior grouping and external packet
Figure 15 is used for explanation according to many groupings of the present invention, has wherein mixed inner grouping and external packet.
With reference to Figure 15, in frequency domain, finish generated final grouping after the inner grouping be with 64.And, in time domain, finish external packet and generated final time slot 61,62 and 63 afterwards.
An independent time slot of finishing after the grouping is called data set.In Figure 15, label 61a, 61b, 62a, 62b and 63 represent data set respectively.
Specifically, two data set 61a and 61b or two data set 62a and 62b can constitute a pair of by external packet in addition.It is right that the paired data collection is called data.
After finishing many groupings, carry out PBC or DIFF coding and use.
For example, for the situation of carrying out the PBC coding, at the data of finally finishing to 61 or 62 or do not have composition data each right data set 63 is selected pilot frequency benchmark value P1, P2 or P3.Utilize selected pilot frequency benchmark value to carry out the PBC coding then.
For example, for the situation of carrying out the DIFF coding, at each the decision DIFF type of coding among data set 61a, 61b, 62a, the 62b and 63.As previously mentioned, should be each data set decision DIFF direction and decision and be one among DIFF-DF and the DIFF-DT.It is identical that the processing that is used for carrying out DIFF coding according to the DIFF encoding scheme that is determined and aforementioned description are mentioned.
In order to come composition data right, should carry out identical inside grouping to each right data set of composition data by carrying out external packet with many packet modes.
For example, each among the right data set 61a of composition data and the 61b all has identical data reel number.And each among the right data set 62a of composition data and the 62b all has identical data reel number.Yet this can not cause any problem, may be different on the data reel number each other to the data set of (for example, 61a and 62a) because belong to different pieces of information respectively.This means can be to each data to using different inside groupings.
For composition data concerning situation, can by inside divide into groups to carry out first the grouping and by external packet carry out second the grouping.
For example, the data reel number after second grouping is corresponding to the specified multiple of the data reel number after first grouping.This is because each right data set of composition data all has identical data reel number.
4-2, blend interior grouping and inner grouping
Figure 16 and Figure 17 are respectively applied for the mixing grouping of explanation another embodiment according to the present invention.Specifically, Figure 16 and Figure 17 concentrated area (intensively) show the mixing of inner grouping.Thereby, be clear that very much, in Figure 16 or Figure 17 or can in Figure 16 or Figure 17, carry out external packet.
For example, Figure 16 shows at generated the situation of carrying out inner grouping once more under the situation of data tape after finishing the second frequency grouping.Specifically, will be divided into low-frequency band and high frequency band by the data tape that the second frequency grouping generates.For the situation of specific coding, must use low-frequency band or high frequency band separately.Specifically, with separately low-frequency band and high frequency bring the situation of use to be called dual (dual) pattern.
Thereby, for the situation of double-mode, be considered as a group by the low-frequency band that will finally generate or high frequency band and carry out digital coding.For example, generate pilot frequency benchmark value P1 and P2 at low-frequency band and high frequency band respectively, in frequency band, carry out the PBC coding then.
Double-mode can be used according to every characteristic of channel.Thereby this is called as channel packet.And double-mode can also differently be used according to data type.
For example, Figure 17 shows at generated the situation of carrying out inner grouping once more under the situation of data tape after finishing aforementioned second frequency grouping.That is, will be divided into low-frequency band and high frequency band by the data tape that the second frequency grouping generates.For the situation of specific coding, only use low-frequency band, and need abandon high frequency band.Specifically, be called low frequency channel (LFE) pattern with only telling the situation that low frequency brings use.
Under low frequency channel (LFE) pattern, be considered as a group by the low-frequency band that will finally generate and carry out digital coding.
For example, generate pilot frequency benchmark value P1, in corresponding low-frequency band, carry out the PBC coding then at low-frequency band.Yet, can generate new data tape by selected low-frequency band being carried out inner grouping.This is for the low-frequency band that will show is concentrated grouping.
And therefore low frequency channel (LFE) pattern will be used according to the low frequency channel characteristic can be called as channel packet.
5, territory grouping and packet
Just the object of grouping is divided into territory grouping and packet with grouping.
The territory grouping is meant in the last scheme that the unit in territory is made up of special domain (for example, frequency domain or time domain).And the territory grouping can be carried out by aforementioned inner grouping and/or external packet.
And packet is meant the scheme that data itself are divided into groups.Packet can be carried out by aforementioned inner grouping and/or external packet.
Under the particular case of packet, thereby can divide into groups and to be used in the entropy coding.For example, use this packet when under finally finishing Packet State, True Data being carried out entropy coding shown in Figure 15.That is, come deal with data on one of frequency direction and time orientation according to the mode that two data adjacent one another are are bundled.
Yet,, the data division ground in final group is divided into groups again for the situation of carrying out packet in the manner described above.Thereby, only the group (for example, two data) that forms through packet is not used PBC or DIFF coding.In addition, after a while will be to describing with the corresponding entropy coding scheme of packet.
6, utilize the signal processing method of grouping
6-1, the signal processing method that utilizes inside to divide into groups at least
The signal processing method of aforementioned groupings scheme and the device of utilizing according to the present invention is explained as follows.
According to the method for the processing signals of one embodiment of the present invention comprise obtain with a group that obtains by first grouping with at the inside grouping of first grouping in included corresponding group of reference value of a plurality of data and with the step of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the step of these data.
The invention is characterized in, divide into groups the quantity of the data that greater than the quantity of the data of dividing into groups by inside by first.In this case, the group reference value can be pilot frequency benchmark value or difference reference value.
This method according to one embodiment of the present invention also comprises at least one step of decoding in group reference value and the difference.In this case, the pilot frequency benchmark value decides at every group.
And, set in advance the quantity of data included in the inside group that obtains by inside grouping respectively.In this case, the quantity of the data that comprise in the inner group differs from one another.
On frequency domain, data are carried out first grouping and inner grouping.What in this case, frequency domain can be corresponding in hybrid domain, parameter band territory, data tape territory and the channel domain is a kind of.
And, the invention is characterized in by first first group of dividing into groups to obtain to comprise a plurality of inner group that obtains by the inside grouping.
Frequency domain of the present invention is distinguished according to frequency band.Frequency band is by inside grouping becoming sub-band.Subband is by inside grouping becoming parameter band.The parameter band becomes data tape by the inside grouping.In this case, the quantity of parameter band can be defined as maximum 28.And the parameter band is grouped into a data tape with 2,5 or 10.
According to the device of the processing signals of one embodiment of the present invention comprise obtain with a group that obtains by first grouping with at the inside grouping of first grouping in included corresponding group of reference value of a plurality of data and with the value acquisition portion of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the data acquisition portion of these data.
According to the present invention the method for the processing signals of another embodiment comprise utilize with a group that obtains by first grouping with at the inside grouping of first grouping in included corresponding group of reference value of a plurality of data and these data step of generating difference, and the step that transmits the difference that is generated.
And, according to the present invention the device of the processing signals of another embodiment comprise utilize with a group that obtains by first grouping with at the inside grouping of first grouping in included corresponding group of reference value of a plurality of data and these data value generating unit of generating difference, and the efferent that transmits the difference that is generated.
The signal processing methods of 6-2, the many groupings of utilization
The signal processing method of aforementioned groupings scheme and the device of utilizing according to the present invention is explained as follows.
According to the method for the processing signals of one embodiment of the present invention comprise obtain with a group that obtains by grouping in included corresponding group of reference value of a plurality of data and with the step of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the step of these data.
In this case, the group reference value can be in the reference value one of pilot frequency benchmark value and difference.
And grouping can be corresponding in external packet and the inner grouping.
And grouping can be corresponding in territory grouping and the packet.
The territory group is carried out packet.And included time domain comprises at least one in time slot territory, parameter set territory and the data set territory in the grouping of territory.
Included frequency domain can comprise at least one in sample territory, subband domain, hybrid domain, parameter band territory, data tape territory and the channel domain in the grouping of territory.
According to a plurality of data that comprise in the group a poor reference value is set.And, the decision classified counting, divide class range and whether have in the grouping at least one.
According to the device of the processing signals of one embodiment of the present invention comprise obtain with a group that obtains by grouping in included corresponding group of reference value of a plurality of data and with the value acquisition portion of the corresponding difference of this group reference value, and the data acquisition portion that obtains these data with this group reference value and this difference.
According to the present invention the method for the processing signals of another embodiment comprise utilize with a group that obtains by grouping in included corresponding group of reference value of a plurality of data and these data step of generating difference, and the step that transmits the difference that is generated.
According to the present invention the device of the processing signals of another embodiment comprise utilize with a group that obtains by grouping in included corresponding group of reference value of a plurality of data and these data value generating unit of generating difference, and the efferent that transmits the difference that is generated.
According to the present invention the method for the processing signals of another embodiment comprise obtain with a group that obtains by the grouping that comprises first grouping and second grouping in included corresponding group of reference value of a plurality of data and with the step of corresponding first difference of this group reference value, and utilize this group reference value and this first difference to obtain the step of these data.
In this case, the group reference value can comprise pilot frequency benchmark value or difference reference value.
This method also comprises at least one step of decoding in the group reference value and first difference.And the first pilot frequency benchmark value decides at every group.
This method also comprise obtain with the corresponding second pilot frequency benchmark value of a plurality of first pilot frequency benchmark values and with the step of corresponding second difference of this second pilot frequency benchmark value, and utilize this second pilot frequency benchmark value and this second difference to obtain the step of the first pilot frequency benchmark value.
In this case, second grouping can comprise external packet and the inner grouping at first grouping.
On in time domain and frequency domain at least one data are carried out this grouping.Specifically, this grouping is at least one territory of the dividing into groups grouping in time domain and the frequency domain.
Time domain can comprise time slot territory, parameter set territory or data set territory.
Frequency domain can comprise sample territory, subband domain, hybrid domain, parameter band territory, data tape territory or channel domain.And the data of dividing into groups are index or parameter.
Utilization is carried out the entropy decoding by the represented entropy table of the index that comprises in first group of dividing into groups to obtain to first difference.And utilization group reference value and first difference of decoding through entropy obtain these data.
Utilization is carried out the entropy decoding by the represented entropy table of the index that comprises in first group of dividing into groups to obtain to first difference and group reference value.And, utilize through the group reference value of entropy decoding and first difference of decoding and obtain these data through entropy.
According to the present invention the device of the processing signals of another embodiment comprise obtain with a group that obtains by the grouping that comprises first grouping and second grouping in included corresponding group of reference value of a plurality of data and with the value acquisition portion of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the data acquisition portion of these data.
According to the present invention the method for the processing signals of another embodiment comprise utilize with a group that obtains by the grouping that comprises first grouping and second grouping in included corresponding group of reference value of a plurality of data and these data step of generating difference, and the step of the difference that generated of transmission.
According to the present invention the device of the processing signals of another embodiment comprise utilize with a group that obtains by the grouping that comprises first grouping and second grouping in included corresponding group of reference value of a plurality of data and these data value generating unit of generating difference, and the efferent of the difference that generated of transmission.
According to the present invention the method for the processing signals of another embodiment comprise obtain with a group that obtains by first grouping with at the external packet of first grouping in included corresponding group of reference value of a plurality of data and with the step of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the step of these data.
In this case, with by first divide into groups the data that corresponding first data bulk of quantity less than with corresponding second data bulk of the quantity of the data of dividing into groups by external packet.And, there is relation at double between first data bulk and second data bulk.
The group reference value can comprise pilot frequency benchmark value or difference reference value.
This method also comprises at least one the step in decision group reference value and the difference.
The pilot frequency benchmark value decides at every group.
On in time domain and frequency domain at least one data are divided into groups.Time domain can comprise time slot territory, parameter set territory or data set territory.And frequency domain can comprise sample territory, subband domain, hybrid domain, parameter band territory, data tape territory or channel domain.
This method also comprises the step of coming the reconstruct voice data as parameter with the data that obtain.And external packet can comprise paired parameter.
According to the present invention the device of the processing signals of another embodiment comprise obtain with a group that obtains by first grouping with at the external packet of first grouping in included corresponding group of reference value of a plurality of data and with the value acquisition portion of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the data acquisition portion of these data.
According to the present invention the method for the processing signals of another embodiment comprise utilize with a group that obtains by first grouping with at the external packet of first grouping in included corresponding group of reference value of a plurality of data and these data step of generating difference, and the step of transmitting the difference that is generated.
And, according to the present invention the device of the processing signals of another embodiment comprise utilize with a group that obtains by first grouping with at the external packet of first grouping in included corresponding group of reference value of a plurality of data and these data value generating unit of generating difference, and the efferent that transmits the difference that is generated.
6-3, utilize the signal processing method of packet at least
The signal processing method of aforementioned groupings scheme and the device of utilizing according to the present invention is explained as follows.
According to the method for the processing signals of one embodiment of the present invention comprise obtain with a group that obtains by packet with at the inside grouping of packet in included corresponding group of reference value of a plurality of data and with the step of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the step of these data.
In this case, the quantity of the data that comprise in the inner grouping is less than the quantity of the data that comprise in the packet.And data are corresponding to parameter.
A plurality of data that go out through packet are integrally carried out the inside grouping.In this case, can carry out the inside grouping in every parameter band ground.
Can partly carry out the inside grouping to a plurality of data that go out through packet.In this case, channel ground that can whenever a plurality of each in the data that packet goes out carries out the inside grouping.
The group reference value can comprise pilot frequency benchmark value or difference reference value.
This method can also comprise at least one step of decoding in group reference value and the difference reference value.In this case, the pilot frequency benchmark value decides at every group.
On frequency domain, data are carried out packet and inner grouping.
Frequency domain can comprise in sample territory, subband domain, hybrid domain, parameter band territory, data tape territory or the channel domain.When obtaining data, the grouping information of at least one during use is divided into groups at packet and inside.
This grouping information comprises the coding and decoding scheme of the numbering of position, each group of each group, the quantity that whether has every group of set of applications reference value, group reference value, group reference value and whether has at least a in the acquisition group reference value.
According to the device of the processing signals of one embodiment of the present invention comprise obtain with a group that obtains by packet with at the inside grouping of packet in included corresponding group of reference value of a plurality of data and with the value acquisition portion of the corresponding difference of this group reference value, and utilize this group reference value and this difference to obtain the data acquisition portion of these data.
According to the present invention the method for the processing signals of another embodiment comprise utilize with a group that obtains by packet with at the inside grouping of packet in included corresponding group of reference value of a plurality of data and these data step of generating difference, and the step that transmits the difference that is generated.
And, according to the present invention the device of the processing signals of another embodiment comprise utilize with a group that obtains by packet with at the inside grouping of packet in included corresponding group of reference value of a plurality of data and these data value generating unit of generating difference, and the efferent that transmits the difference that is generated.
[entropy coding]
1, the notion of entropy coding
Entropy coding according to the present invention is meant that the result to the data coding carries out the processing of variable length code.
In general, entropy coding is handled the probability of occurrence of particular data in the mode of statistics.For example, transfer efficiency integrally improves in such a way: be that frequency of occurrences higher data is distributed less bits on the probability, and be that the frequency of occurrences is lower on the probability data allocations is than multidigit.
And the present invention is intended to propose the practical entropy coding method that is different from general entropy coding a kind of and PBC coding and the interconnection of DIFF coding.
1-1, entropy table
At first, carry out the entropy table that entropy coding need be scheduled to.The entropy table is defined as code book.And encoding section and lsb decoder use same table.
The present invention proposes a kind of various digital coding results' entropy coding method and a kind of entropy table of uniqueness handled effectively.
1-2, entropy coding type (1D/2D)
Entropy coding of the present invention is divided into two types.A kind of is to derive an index (index 1) by an entropy table, and another kind is to derive two continuity indexs (index 1 and index 2) by an entropy table.The former is called as 1D (one dimension) entropy coding, and the latter is called as 2D (two dimension) entropy coding.
Figure 18 is the exemplary view according to 1D of the present invention and 2D entropy table.With reference to Figure 18, entropy table of the present invention consists essentially of index (Index) field, length (Length) field and code word (Codeword) field.
For example, if particular data (for example, pilot frequency benchmark value, difference reference value etc.) calculates by the aforementioned data coding, then corresponding data (corresponding with index) has by this entropy table and the code word of appointment.This code word becomes bit stream, is transferred into lsb decoder then.
The entropy lsb decoder decision that receives this code word has been used for the entropy table of corresponding data, and the bit length of the code word that the entropy table that utilizes corresponding codewords and formation to be determined then is interior is derived index value.In this case, the present invention is shown sexadecimal with codeword table.
The positive sign (+) and the negative sign (-) of the index value of deriving have been omitted according to 1D or 2D entropy coding.Thereby, must finish 1D or 2D entropy coding designated symbols afterwards.
In the present invention, come differently designated symbols according to 1D or 2D.
For example, for the situation of 1D entropy coding,, then distribute 1 independent bit sign position (for example, bsSign) and transmit if respective index is not 0.
For the situation of 2D entropy coding,, determine whether distributing sign bit by the mode that the relation between two index that extracted is programmed because extracted two index continuously.In this case, program use two that extract index and values, two extract the difference of index and the maximum value (lav) in the relative entropy table.With under the situation of simple 2D, all distribute the situation of sign bit to compare for each index, can reduce the quantity of transmission.
The 1D entropy table that an index derived in one of them index can be used for all digital coding results.Yet wherein the 2D entropy table of two index of each index derivation has the limited purposes at particular case.
For example, if digital coding is not to handle obtain a pair of by aforementioned groupings, then 2D entropy matrix section ground has limited purposes.And the use of 2D entropy table is confined to the result who encodes as PBC and the pilot frequency benchmark value that calculates.
Therefore, as previously mentioned, entropy coding of the present invention is characterised in that, utilizes the most effective entropy coding scheme according to the mode of result's interconnection of entropy coding and digital coding.This situation is described in detail as follows.
1-3,2D method (time pairing/frequency pairing)
Figure 19 is the exemplary view according to two kinds of methods at the 2D entropy coding of the present invention.The 2D entropy coding is the processing that is used to derive two index adjacent one another are.Thereby the 2D entropy coding can be distinguished according to the direction of two continuity indexs.
For example, two index are called 2D frequency pairing (hereinafter being abbreviated as 2D-FP) in frequency direction situation adjacent one another are.And being called the 2D time in time orientation situation adjacent one another are, two index match (hereinafter being abbreviated as 2D-TP).
With reference to Figure 19,2D-FP and 2D-TP can constitute independent concordance list respectively.Scrambler must decide the most effective entropy coding scheme according to the result of data decode.
In the following description, the method with the entropy coding of digital coding interconnection of decision is effectively described.
1-4, entropy coding signal processing method
Come the method for processing signals to be explained as follows to the entropy coding that utilizes according to the present invention.
In method according to the processing signals of one embodiment of the present invention, obtain with the corresponding reference value of a plurality of data and with the corresponding difference of this reference value.Subsequently, this difference is carried out the entropy decoding.Then, utilize this reference value and obtain these data through the difference of entropy decoding.
This method also comprises the step of reference value being carried out the entropy decoding.And this method can also comprise that utilization is through the reference value of entropy decoding with obtain the step of these data through the difference of entropy decoding.
This method can also comprise the step that obtains the entropy coding identification information.And, carry out entropy coding according to the entropy coding scheme that the entropy coding identification information is represented.
In this case, the entropy coding scheme is a kind of in 1D encoding scheme and the multidimensional coding scheme (for example, 2D encoding scheme).And the multidimensional coding scheme is a frequency to a kind of in (TP) encoding scheme of (FP) encoding scheme and time.
Reference value can comprise in pilot frequency benchmark value and the difference reference value.
And this signal processing method can also comprise the step of coming reconstructed audio signal with these data as parameter.
Comprise according to the device of the processing signals of one embodiment of the present invention obtaining with the corresponding reference value of a plurality of data with the value acquisition portion of the corresponding difference of this reference value, this difference being carried out the entropy lsb decoder of entropy decoding, and utilize this reference value and obtain the data acquisition portion of these data through the difference of entropy decoding.
In this case, value acquisition portion is included in the aforementioned potential flow solution multiplexing unit 60, and data acquisition portion is included in aforementioned data lsb decoder 91 or 92.
The method of the processing signals of another embodiment comprises that utilization and the corresponding reference value of a plurality of data and these data generate the step of difference, the reference value that generates is carried out the step of entropy coding according to the present invention, and exports the step through the difference of entropy coding.
In this case, reference value is carried out entropy coding.Reference value through entropy coding is transmitted.
This method also comprises the entropy coding scheme that is used for entropy coding that generates.And, the entropy coding scheme that generates is transmitted.
The device of the processing signals of another embodiment comprises that utilization and the corresponding reference value of a plurality of data and these data generate the value generating unit of difference, the difference that generates is carried out the entropy coding portion that entropy is decoded according to the present invention, and exports the efferent through the difference of entropy coding.
In this case, the value generating unit is included in aforementioned data encoding section 31 or 32.And efferent is included in the aforementioned bit stream multiplexing unit 50.
The method of the processing signals of another embodiment comprises and obtains with the step of the corresponding data of a plurality of data coding schemes, utilizes and decide the step of entropy table for the unique entropy table identifier of data coding scheme at pilot frequency benchmark value included in the data and at least one in the pilot tone difference according to the present invention, and utilizes this entropy table that in pilot frequency benchmark value and the pilot tone difference at least one carried out the step of entropy decoding.
In this case, the entropy table identifier is unique for a kind of in pilot codes scheme, frequency differential encoding scheme and the time difference encoding scheme.
And the entropy table identifier all is unique for pilot frequency benchmark value and pilot tone difference.
The entropy table is unique for the entropy table identifier and comprises in pilot tone table, difference on the frequency submeter and the mistiming submeter one.
Alternatively, the entropy table be not unique for the entropy table identifier and can the shared frequencies table of difference and the mistiming submeter in one.
With the corresponding entropy table of pilot frequency benchmark value can the frequency of utilization table of difference.In this case, by 1D entropy coding scheme the pilot frequency benchmark value is carried out the entropy decoding.
The entropy coding scheme comprises 1D entropy coding scheme and 2D entropy coding scheme.Specifically, 2D entropy coding scheme comprise frequency to (2D-FP) encoding scheme and time to (2D-TP) encoding scheme.
And this method can be come reconstructed audio signal as parameter with data.
According to the present invention the device of the processing signals of another embodiment comprise obtain with the corresponding pilot frequency benchmark value of a plurality of data and with the value acquisition portion of the corresponding pilot tone difference of this pilot frequency benchmark value, and the entropy lsb decoder that this pilot tone difference is carried out the entropy decoding.And this device comprises the data acquisition portion that utilizes this pilot frequency benchmark value and obtain these data through the pilot tone difference of entropy decoding.
The method of the processing signals of another embodiment comprises that utilization and the corresponding pilot frequency benchmark value of a plurality of data and these data generate the step of pilot tone difference, the pilot tone difference that generates is carried out the step of entropy coding according to the present invention, and transmits the step through the pilot tone difference of entropy coding.
In this case, the table that is used for entropy coding can comprise the pilot tone special table.
This method also comprises the step of this pilot frequency benchmark value being carried out entropy coding.And, the pilot frequency benchmark value through entropy coding is transmitted.
This method also comprises the step that generates the entropy coding scheme that is used for entropy coding.And, the entropy coding scheme that generates is transmitted.
The device of the processing signals of another embodiment comprises that utilization and the corresponding pilot frequency benchmark value of a plurality of data and these data generate the value generating unit of pilot tone difference, the pilot tone difference that generates is carried out the entropy coding portion of entropy coding according to the present invention, and transmits the efferent through the pilot tone difference of entropy coding.
2, about digital coding
As previously mentioned, the present invention has proposed three kinds of data coding schemes.Yet,, data are not carried out entropy coding according to the PCM scheme.In the following description, respectively the relation between the relation between PBC coding and the entropy coding and DIFF coding and the entropy coding is described.
2-1, PBC coding and entropy coding
Figure 20 is according to the entropy coding scheme at the PBC coding result of the present invention.
As previously mentioned, after finishing the PBC coding, calculate a pilot frequency benchmark value and a plurality of pilot tone difference.And all pilot frequency benchmark values and difference are all as the object of entropy coding.
For example, according to the aforementioned groupings method, decision will be used the group of PBC coding.In Figure 20,, be example with the situation of non-paired (non-pair) on the situation of paired (pair) on the time shaft and the time shaft for ease of explanation.The entropy coding of finishing after PBC encodes is explained as follows.
At first, the situation 83 of carrying out the PBC coding in pairs to non-is described.A pilot frequency benchmark value as the entropy coding object is carried out the 1D entropy coding, and can carry out 1D entropy coding or 2D-FP entropy coding all the other differences.
Specifically, because under non-paired situation, there is a group, so can not carry out the 2D-TP entropy coding for a data set on the time shaft.Even after exporting to right index, carry out 2D-FP, also should carry out the 1D entropy coding to the parameter value of failing to constitute in last a pair of band 81a.In case determined every data entropy coding scheme, just utilized corresponding entropy table to generate code word.
Because the present invention relates to generate a pilot frequency benchmark value, so should carry out the 1D entropy coding at for example group.Yet, in another embodiment of the present invention,, can carry out the 2D entropy coding to the continuous pilot reference value if generated at least two pilot frequency benchmark values from a group.
Then, illustrate carrying out the situation 84 of PBC coding in pairs.
A pilot frequency benchmark value as the entropy coding object is carried out the 1D entropy coding, and can carry out 1D entropy coding, 2D-FP entropy coding or 2D-TP entropy coding all the other differences.
Specifically, because under paired situation, there is a group, so can carry out the 2D-TP entropy coding at two data sets adjacent one another are on the time shaft.Even after exporting to right index, carry out 2D-FP, also should be with the parameter value in 81b or the 81c to carry out the 1D entropy coding to failing to constitute a pair of last.Yet,, under the situation of using the 2D-TP entropy coding, do not exist and fail to constitute last a pair of band as can in Figure 20, confirming.
2-2, DIFF coding and entropy coding
Figure 21 is according to the entropy coding scheme for the DIFF coding result of the present invention.
As previously mentioned, after finishing the DIFF coding, calculate a pilot frequency benchmark value and a plurality of difference.And all pilot frequency benchmark values and difference are all as the object of entropy coding.Yet, under the situation of DIFF-DT, may not have reference value.
For example, according to the aforementioned groupings method, decision will be used the group of DIFF coding, in Figure 21, for ease of explanation, is example with non-paired situation on situation paired on the time shaft and the time shaft.And Figure 21 shows according to the DIFF coding staff to being distinguished into as the data set of digital coding unit along the DIFF-DT of time-axis direction with along the situation of the axial DIFF-DF of frequency.
Be explained as follows finishing DIFF coding entropy coding afterwards.
At first, the situation of carrying out the DIFF coding in pairs to non-is described.Under non-paired situation, there is a data set on the time shaft.And this data set can be according to the DIFF coding staff to becoming DIFF-DF or DIFF-DT.
For example, if a non-paired data set is DIFF-DF (85), then reference value becomes the parameter value in the first band 82a.This reference value is carried out the 1D entropy coding, and can carry out 1D entropy coding or 2D-FP entropy coding all the other differences.
That is,, on time shaft, there is a group at a data set for DIFF-DF and non-paired situation.Thereby, can not carry out the 2D-TP entropy coding.Even after exporting to right index, carry out 2D-FP, also should carry out the 1D entropy coding to the parameter value of failing to constitute in last a pair of parameter band 83a.In case determined the encoding scheme of each data, just utilized corresponding entropy table to generate code word.
For example, for a non-paired situation that data set is DIFF-DT (86), because there is not reference value in the corresponding data collection, so do not carry out first tape handling.Thereby, can carry out 1D entropy coding or 2D-FP entropy coding to difference.
For DIFF-DT and non-paired situation, the data set that seek difference can be to fail data set in the right adjacent data collection of composition data or another audio frame.
That is,, on time shaft, there is a group at a data set for DIFF-DT and non-paired situation (86).Thereby, can not carry out the 2D-TP entropy coding.Even after exporting to right index, carry out the 2D-FP entropy coding, also should carry out the 1D entropy coding to the parameter value of failing to constitute in last a pair of parameter band.Yet Figure 21 only shows the situation of failing to constitute last a pair of band that for example do not exist.
In case determined the encoding scheme of each data, just utilized corresponding entropy table to generate code word.
Secondly, illustrate carrying out the situation of DIFF coding in pairs.To carrying out in pairs under the situation of digital coding, two data sets constitute a group on time shaft.And each data set in this group can be according to the DIFF coding staff to becoming DIFF-DF or DIFF-DT.Thereby, it can be divided into situation (87), two a pair of data sets of formation that two a pair of data sets of formation all are DIFF-DF all is the situation of DIFF-DT, and constitute the situation (for example, DIFF-DF/DT or DIFF-DT/DF) (88) that two a pair of data sets have the different coding direction respectively.
For example, for constitute two a pair of data sets all be DIFF-DF (that is, and situation DIFF-DF/DF) (87), if each data set is not DIFF-DF in pairs and all, then all available entropy coding schemes all are feasible.
For example, each reference value in the corresponding data collection is all as the parameter value in the first band 82b or the 82c, and reference value is carried out the 1D entropy coding.And, can carry out 1D entropy coding or 2D-FP entropy coding to all the other differences.
Even after exporting to right index, in the corresponding data collection, carry out 2D-FP, also should be with the parameter value in 83b or the 83c to carry out the 1D entropy coding to failing to constitute a pair of last.Because it is a pair of that two data sets have constituted, so can carry out the 2D-TP entropy coding.In this case, the band that from first band 82b or the 82c in the corresponding data collection next taken to last band sequentially carries out the 2D-TP entropy coding.
If carry out the 2D-TP entropy coding, then do not generate and fail to constitute last a pair of band.
In case determined the entropy coding scheme of each data, just utilized corresponding entropy table to generate code word.
For example, all be that (that is, situation DIFF-DT/DT) (89) is not because exist reference value, so do not carry out first tape handling in the corresponding data collection for DIFF-DT for constituting two a pair of data sets.And, can carry out 1D entropy coding or 2D-FP entropy coding to all differences in each data set.
Even after exporting to right index, in the corresponding data collection, carry out 2D-FP, also should carry out the 1D entropy coding to the parameter value of failing to constitute in last a pair of band.Yet Figure 21 shows and does not have the example of failing to constitute last a pair of band.
Because two data set formations are a pair of, so can carry out the 2D-TP entropy coding.In this case, to sequentially carrying out the 2D-TP entropy coding from first band that takes last band in the corresponding data collection.
If carry out the 2D-TP entropy coding, then do not generate and fail to constitute last a pair of band.
In case determined the entropy coding scheme of each data, just utilized corresponding entropy table to generate code word.
For example, may exist two a pair of data sets of formation to have the situation (88) of different coding direction (that is, DIFF-DF/DT or DIFF-DT/DF) respectively.Figure 21 shows the example of DIFF-DF/DT.In this case, can carry out according to the corresponding encoded type and applicable all entropy coding schemes each data set basically.
For example, constituting the DIFF-DF data centralization that two a pair of data are concentrated, the parameter value in the first band 82d with reference value in the corresponding data collection (DIFF-DF) is being carried out the 1D entropy coding.And, can carry out 1D entropy coding or 2D-FP entropy coding to other difference.
Even after exporting to right index, in corresponding data collection (DIFF-DF), carry out 2D-FP, also should carry out the 1D entropy coding to the parameter value of failing to constitute in last a pair of band 83d.
For example, constituting the DIFF-DT data centralization that two a pair of data are concentrated, because there is not reference value, so do not carry out first tape handling.And, can carry out 1D entropy coding or 2D-FP entropy coding to all differences in the corresponding data collection (DIFF-DT).
Even after exporting to right index, in corresponding data collection (DIFF-DT), carry out 2D-FP, also should carry out the 1D entropy coding to the parameter value of failing to constitute in last a pair of band.Yet Figure 21 shows and does not have the example of failing to constitute last a pair of band.
Because constitute two a pair of data sets have respectively the coding staff that differs from one another to, so can carry out the 2D-TP entropy coding.In this case, to sequentially carrying out the 2D-TP entropy coding from next band that takes last band to except first band that comprises the first band 82d.
If carry out the 2D-TP entropy coding, then do not generate and fail to constitute last a pair of band.
In case determined the entropy coding scheme of each data, just utilized corresponding entropy table to generate code word.
2-3, entropy coding and grouping
As previously mentioned, for the situation of 2D-FP or 2D-TP entropy coding, utilize a code word to extract two index.Thereby, this means at entropy coding and adopted the grouping scheme.And this can be called as time packet or frequency grouping.
For example, encoding section is divided into groups to two index that extract in the data coding step along frequency direction or time orientation.
Subsequently, encoding section utilizes the entropy table to select to represent a code word of two index through dividing into groups, and transmits selected code word in the bit stream by it is included in then.
Lsb decoder receives the included code word that two index are divided into groups to obtain passed through in the bit stream, utilizes applied entropy table to extract two index values then.
2-4, utilize the signal processing method that concerns between digital coding and entropy coding
Feature to the signal processing method that concerns between relation between the PBC of utilization according to the present invention coding and entropy coding and DIFF coding and entropy coding is explained as follows.
Comprise the step that obtains difference information, this difference information carried out the step that entropy is decoded according to the method for the processing signals of one embodiment of the present invention according to the entropy coding scheme that comprises the grouping of time packet and frequency, and according to the step that comprises that pilot tone is poor, the data decode scheme of mistiming and difference on the frequency is carried out data decode to this difference information.And what illustrate in detailed relation between digital coding and the entropy coding and the aforementioned description is identical.
The step that the method for the processing signals of another embodiment comprises the step that obtains digital signal, this digital signal carried out the entropy decoding according to the entropy coding scheme according to the present invention, and according to one in a plurality of data coding schemes that the comprise the pilot codes scheme at least step to carrying out data decode through the digital signal of entropy decoding.In this case, can decide the entropy coding scheme according to data coding scheme.
The entropy lsb decoder that the device of the processing signals of another embodiment comprises the signal acquisition portion that obtains digital signal, this digital signal carried out the entropy decoding according to the entropy coding scheme according to the present invention, and according to one in a plurality of data coding schemes that the comprise the pilot codes scheme at least data decoding part to carrying out data decode through the digital signal of entropy decoding.
According to the present invention the method for the processing signals of another embodiment comprise according to data coding scheme to digital signal carry out the step of digital coding, according to the entropy coding scheme to carrying out the step of entropy coding through the digital signal of digital coding, and transmit step through the digital signal of entropy coding.In this case, can decide the entropy coding scheme according to data coding scheme.
And, the device of the processing signals of another embodiment comprises the digital coding portion that digital signal is carried out digital coding according to data coding scheme according to the present invention, and according to the entropy coding scheme to carry out the entropy coding portion of entropy coding through the digital signal of digital coding.And this device can also comprise the efferent of transmission through the digital signal of entropy coding.
3, to the selection of entropy table
The entropy table that is used for entropy coding is according to data coding scheme and as the data type of entropy coding object and automatically decision.
For example, if data type be CLD parameter and entropy coding to as if the pilot frequency benchmark value, then use table name to carry out entropy coding as the 1D entropy table of hcodPilot_CLD.
For example, if data type is CPC parameter, digital coding be DIFF-DF and entropy coding to as if the first band value, then use table name to carry out entropy coding as the 1D entropy table of hcodFirstband_CPC.
For example, if data type is ICC parameter, data coding scheme to be PBC and to carry out entropy coding according to 2D-TP, then use table name to carry out entropy coding as the 2D-PC/TP entropy table of hcod2D_ICC_PC_TP_LL.In this case, the LL in the 2D table name represents the maximum value (hereinafter being abbreviated as LAV) in this table.And, will describe maximum value (LAV) after a while.
For example, if data type is ICC parameter, data coding scheme to be DIFF-DF and to carry out entropy coding according to 2D-FP, then use table name to carry out entropy coding as the 2D-FP entropy table of hcod2D_ICC_DF_FP_LL.
That is, to use in a plurality of entropy tables which to carry out entropy coding be very important in decision.And, preferably, constitute the entropy table be suitable for as the characteristic of each data of each entropy target individually.
Yet, at attribute each other the entropy table of similar data can be shared use.For a representational example,, then can use CLD entropy table if data type is ADG or ATD.And, the first band entropy table can be applied to the pilot frequency benchmark value that PBC encodes.
To utilizing maximum value (LAV) to come the method for selective entropy table to do following detailed description.
The maximum value of 3-1, entropy table (LAV)
Figure 22 is used to illustrate the method according to selective entropy table of the present invention.
A plurality of entropy tables have been shown in Figure 22 (a), and in Figure 22 (b) table that is used for the selective entropy table have been shown.
As previously mentioned, there are a plurality of entropy tables according to digital coding and data type.
For example, the entropy table can comprise be applicable to data type be xxx situation the entropy table (for example, table 1 is to 4), be applicable to data type be yyy situation the entropy table (for example, table 5 is to 8), the special-purpose entropy table of PBC (for example, the table k to k+1), escape (escape) entropy table (for example, table n-2~n-1), and LAV index entropy table (for example, table n).
Specifically, although preferably by for each index that may appear in the corresponding data provides code word to constitute table, if like this, the size of table will enlarge markedly.And, be not easy to manage index unnecessary or that just occurred.For the situation of 2D entropy table, these problems are because of occurring deriving inconvenience more too much.In order to address these problems, used maximum value (LAV).
For example, if at specific data type (for example, the scope of index value CLD)-X~+ X (X=15) between, then in this scope, select at least one high LAV of the frequency of occurrences on the probability, and it constituted independent table.
For example, when constituting CLD entropy table, can provide the table of LAV=3, the table of LAV=5, the table of LAV=7 or the table of LAV=9.
For example, in (a) of Figure 22, table 1 (91a) can be arranged to LAV=3 CLD table, with table 2 (91b) be arranged to LAV=5 the CLD table, table 3 (91c) is arranged to the CLD table of LAV=7, and table 4 (91d) is arranged to the table of LAV=9.
(for example, table n-2~n-1) handles the index that departs from the LAV scope in the LAV table according to escape entropy table.
For example, when the CLD table 91c that utilizes LAV=7 encodes, if depart from maximal value 7 index (for example, 8,9 ..., 15), then according to escape entropy table (for example, table n-2~n-1) respective index is carried out individual processing.
Equally, can at another data type (for example, ICC, CPC etc.) the LAV table be set according to the mode same with the CLD epiphase.Yet, have different values because of the scope of every data type is different at the LAV of each data.
For example, when constituting ICC entropy table, for example can provide the table of LAV=1, the table of LAV=3, the table of LAV=5 and the table of LAV=7.When constituting CPC entropy table, for example can provide the table of LAV=3, the table of LAV=6, the table of LAV=9 and the table of LAV=12.
3-2, at the entropy table of LAV index
The present invention adopts the LAV index to select to use the entropy table of LAV.That is, shown in Figure 22 (b), the LAV value of every data type is distinguished according to the LAV index.
Specifically,, confirm the LAV index of every corresponding data type, confirm and the corresponding LAV of this LAV index then for the entropy table of selecting finally will use.The final LAV value of confirming is corresponding to the LL in the formation of aforementioned entropy table name.
For example, be DIFF-DF, carry out entropy coding and LAV=3, then use table name to carry out entropy coding as the entropy table of hcod2D_CLD_DF_FP_03 according to 2D-FP if data type is CLD parameter, data coding scheme.
When confirming every data type LAV index, the invention is characterized in, use the entropy table individually at the LAV index.This means that the object that the LAV index itself is used as entropy coding handles.
For example, the table n in Figure 22 (a) is used as LAV index entropy table 91e.This is represented as table 1.
Table 1
LavIdx | Bit length | Code word [sexadecimal/scale-of-two] |
0 | 1 | 0×0(0b) |
1 | 2 | 0×2(10b) |
2 | 3 | 0×6(110b) |
3 | 3 | 0×7(111b) |
This table means LAV index value frequency of utilization difference statistically itself.
For example, because LAV index=0 frequency of utilization is the highest, institute thinks one of its distribution.And, for distributing two in frequency of utilization time high LAV index=1.At last, distribute three for the low LAV=2 or 3 of frequency of utilization.
Situation for not using LAV index entropy table 91e should transmit 2 bit-identify information, to distinguish four kinds of LAV index when using LAV entropy table at every turn.
Yet, if used LAV index entropy table 91e of the present invention, be at least the situation of 60% LAV index=0 for for example frequency of utilization, it is just enough to transmit 1 bit word.Thereby the present invention can improve transfer efficiency to such an extent that be higher than art methods.
In this case, the LAV index entropy table 91e in the table 1 is applied to the situation of four kinds of LAV index.And, should be understood that if there is more LAV index, it is more that transfer efficiency is improved.
3-3, the signal processing method that utilizes the entropy table to select
Signal processing method and the device that utilizes aforementioned entropy table to select is explained as follows.
The step that comprises the step that obtains index information, this index information is carried out the entropy decoding according to the method for the processing signals of one embodiment of the present invention, and the step of identification and the corresponding content of index information of decoding through entropy.
In this case, this index information is the information at the index with the frequency of utilization characteristic on the probability.
As previously mentioned, utilize the special-purpose entropy table of index 91e that this index information is carried out the entropy decoding.
This content is classified according to data type and is used for data decode.And this content can be used as grouping information.
This grouping information is to be used for information that a plurality of data are divided into groups.
And the index of entropy table is the maximum value (LAV) in the index that comprises in the entropy table.
And, when carrying out the decoding of 2D entropy based on parameter, use the entropy table.
The lsb decoder that comprises the information acquisition portion that obtains index information, this index information is carried out the entropy decoding according to the device of the processing signals of one embodiment of the present invention, and the mark part of identification and the corresponding content of index information of decoding through entropy.
The method of the processing signals of another embodiment comprises the step that generates the index information be used to discern content, this index information is carried out the step of entropy coding according to the present invention, and transmits the step through the index information of entropy coding.
The device of the processing signals of another embodiment comprises the information generating unit that generates the index information be used to discern content, this index information is carried out the encoding section of entropy coding according to the present invention, and transmits the information output part through the index information of entropy coding.
According to the present invention the method for the processing signals of another embodiment comprise the step that obtains difference and index information, the step, identification of this index information being carried out the entropy decoding and step through the corresponding entropy table of index information of entropy decoding, and utilize the entropy table of being discerned that this difference is carried out the step that entropy is decoded.
Subsequently, use obtains data with corresponding reference value of a plurality of data and decoded difference value.In this case, reference value can comprise pilot frequency benchmark value or difference reference value.
Utilize the special-purpose entropy table of index that index information is carried out the entropy decoding.And, according to each the type in a plurality of data the entropy table is classified.
Data are parameters, and this method also comprises and utilizes parameter to come the step of reconstructed audio signal.
Under the situation of difference being carried out the entropy decoding, utilize the entropy table that difference is carried out the decoding of 2D entropy.
And this method also comprises the step that obtains reference value and utilizes the entropy table that is exclusively used in this reference value that this reference value is carried out the step that entropy is decoded.
According to the present invention the device of the processing signals of another embodiment comprise the input part that obtains difference and index information, the index lsb decoder, identification that this index information are carried out the entropy decoding with through the table identification part of the corresponding entropy table of index information of entropy decoding, and utilize the entropy table of being discerned that this difference is carried out the data decoding part that entropy is decoded.
This device also comprises the data acquisition portion that obtains these data with corresponding reference value of a plurality of data and the difference through decoding that utilizes.
According to the present invention the method for the processing signals of another embodiment comprise utilize with the corresponding reference value of a plurality of data and these data generate difference step, utilize the entropy table that this difference is carried out the step of entropy coding, and the step that generates the index information that is used to discern the entropy table.
And this method also comprises carries out the step of entropy coding and transmits through the index information of entropy coding and the step of this difference this index information.
And, according to the present invention the device of the processing signals of another embodiment comprise utilize with the corresponding reference value of a plurality of data and these data generate difference the value generating unit, utilize the entropy table to this difference carry out entropy coding the value encoding section, generate the information generating unit of the index information that is used to discern the entropy table and the index encoding section of this index information being carried out entropy coding.And this device comprises that also transmission is through the index information of entropy coding and the information output part of this difference.
[data structure]
The data structure of the various information that are associated with aforementioned data coding, grouping and entropy coding that comprises according to the present invention is explained as follows.
Figure 23 is the slice map according to data structure of the present invention.
With reference to Figure 23, data structure according to the present invention comprises header 100 and a plurality of follow-up (off) frame 101 and 102.Common application is included in the header 100 in the configuration information of downstream frame 101 and 102.And this configuration information comprises the grouping information that is used for aforementioned groupings.
For example, this grouping information comprises very first time grouping information 100a, first frequency grouping information 100b and channel packet information 100c.
In addition, the configuration information in the header 100 is called as main configuration information, and the message part that is recorded in the frame is called as useful load.
Specifically, for example, in the following description the situation that data structure of the present invention is applied to audio space information is described.
At first, the very first time grouping information 100a in the header 100 is as the bsFrameLength field of specifying the number of timeslots in the frame.
First frequency grouping information 100b is as the bsFreqRes field of specifying the parameter band quantity in the frame.
In the frame 101 and 102 each all comprises frame information (Frame Info) 101a of all groups of common application in a frame, and a plurality of groups of 101b and 101c.
In detail, for example in the following description the situation that data structure of the present invention is applied to audio space information is described.
Selection of time information 103a in the frame information 101a comprises bsNumParamset field, bsParamslot field and bsDataMode field.
The bsNumParamset field is the information of the quantity of existing parameter set in the expression entire frame.
And the bsParamslot field is the information of specifying the position of the time slot that has parameter set.
In addition, the bsDataMode field is the information of specifying the Code And Decode disposal route of each parameter set.
For example, () situation for example, default mode, lsb decoder is replaced the relevant parameter collection with default value for the bsDataMode=0 of specific set of parameters.
() situation for example, preceding mode, lsb decoder is kept the decode value of previous parameter set for the bsDataMode=1 of specific set of parameters.
() situation for example, interpolative mode, lsb decoder calculates the relevant parameter collection by interpolation between parameter set for the bsDataMode=2 of specific set of parameters.
At last, (for example, read mode) situation this means the coded data of transmission at the relevant parameter collection for the bsDataMode=3 of specific set of parameters.Thereby a plurality of groups of 101b in the frame and 101c are the groups of utilizing the data that transmit under the situation of bsDataMode=3 (for example, read mode) to constitute.Therefore, encoding section is come decoded data at the coding type information in each group.
Signal processing method that utilizes the bsDataMode field and device according to one embodiment of the present invention are done following detailed description the in detail.
According to the method for the processing signals of utilizing the bsDataMode field of one embodiment of the present invention comprise acquisition model information step, according to the represented data attribute of this pattern information obtain with the corresponding pilot frequency benchmark value of a plurality of data and with the step of the corresponding pilot tone difference of this pilot frequency benchmark value, and utilize this pilot frequency benchmark value and this pilot tone difference to obtain the step of these data.
In this case, data are parameters, and this method also comprises and utilizes parameter to come the step of reconstructed audio signal.
If this pattern information is represented read mode, then obtain the pilot tone difference.
This pattern information also comprises at least a in default mode, preceding mode and the interpolative mode.
And every group of band ground obtains the pilot tone difference.
And this signal processing method uses first parameter (for example, dataset) to discern the quantity of read mode, and use second parameter (for example, setidx) to obtain the pilot tone difference based on first variable.
According to one embodiment of the present invention utilize the bsDataMode field come the device of processing signals comprise acquisition model information information acquisition portion, according to the represented data attribute of this pattern information obtain with the corresponding pilot frequency benchmark value of a plurality of data and with the value acquisition portion of the corresponding pilot tone difference of this pilot frequency benchmark value, and utilize this pilot frequency benchmark value and this pilot tone difference to obtain the data acquisition portion of these data.
And information acquisition portion, value acquisition portion and data acquisition portion are set in aforementioned data lsb decoder 91 or 92.
The bsDataMode field of utilizing of another embodiment comes the method for processing signals to comprise the corresponding pilot frequency benchmark value of step, utilization and a plurality of data of the pattern information that generates the attribute of representing data and the step that these data generate the pilot tone difference according to the present invention, and the step that transmits the difference that is generated.And this method also comprises the step that the difference that generates is encoded.
The bsDataMode field of utilizing of another embodiment comes the device of processing signals to comprise the corresponding pilot frequency benchmark value of information generating unit, utilization and a plurality of data of the pattern information that generates the attribute of representing data and the value generating unit that these data generate the pilot tone difference according to the present invention, and the efferent that transmits the difference that is generated.And this value generating unit is set in aforementioned data encoding section 31 or 32.
The second time packet information 103b in the frame information 101a comprises the bsDatapair field.Whether paired this bsDatapair field be the information of specifying by between the bsDataMode=3 data designated collection.Specifically, by the bsDatapair field two data sets are grouped into a group.
Second frequency grouping information in the frame information 101a comprises the bsFreqResStride field.This bsFreqResStride field is the information that is used for the parameter band through dividing into groups first as the bsFreqRes field of first frequency grouping information 100b is carried out the secondary grouping.That is, by bundling and generate data tape to being accumulated as parameter by the span (stride) of bsFreqResStride field appointment.Thereby, every data tape ground designated parameter value.
Among group 101b and the 101c each all comprises coding type information 104a, entropy coding type information 104b, code word 104c and side data (side data) 104d.
In detail, for example, the situation that data structure of the present invention is applied to audio space information is explained as follows.
At first, the digital coding type information 104a in each of group 101b and 101c comprises bsPCMCoding field, bsPilotCoding field, bsDiffType field and bdDifftimeDirection field.
The bsPCMCoding field is that the digital coding that is used to identify respective sets is the information of PCM scheme or DIFF scheme.
Have only when the bsPCMCoding field is appointed as the PCM scheme, just specify whether there is the PBC scheme by the bsPilotCoding field.
The bsDiifType field be used to specify under the situation of using DIFF scheme coding staff to information.And the bsDiffType field is specified DF:DIFF-FREQ or DT:DIFF-TIME.
And, the bdDifftimeDirection field be used for the bsDiffType field be on the axle of following fixed time of situation of DT coding staff to be forward direction or back to information.
Entropy coding type information 104b in each of group 101b and 101c comprises bsCodingScheme field and bsPairing field.
The bsCodingScheme field is that to be used to specify entropy coding be 1D or the information of 2D.
And the bsPairing field is to be illustrated in the bsCodingScheme field to specify the direction of extracting two index under the situation of 2D be frequency direction (FP: the frequency pairing) the still information of time orientation (TP: the time matches).
The bsLsb field be to be applied to the field of aforementioned local parameter and be only when data type be CPC and under the situation of non-rudenss quantization just the transmission side information.
And the bsSign field is the information that is used to specify the symbol of the index that extracts under the situation that adopts the 1D entropy coding.
And the data that transmit according to the PCM scheme are included among the side data 104d.
Signal Processing data structure according to the present invention is explained as follows.
At first, signal Processing data structure according to the present invention comprises that having every at least frame all comprises the digital coding information of pilot codes information and at least one the useful load portion in the entropy coding information, and has the header as the main configuration information of this useful load portion.
Main configuration information comprises the very first time information portion that has at the temporal information of entire frame, and has the first frequency information portion at the frequency information of entire frame.
And main configuration information also comprises having and is used for every frame is comprised that all the random groups of a plurality of data carries out the first inner grouping information portion of the information of inner grouping.
Frame comprises at least one first data portion that has in digital coding information and the entropy coding information, and has the frame information portion as the sub-configuration information of this first data portion.
Sub-configuration information comprises the second temporal information portion that has at whole group temporal information.And sub-configuration information also comprises having and is used for comprising all that to every group the random groups of a plurality of data carries out the external packet information portion of the information of external packet.And sub-configuration information also comprises having the second inner grouping information portion that is used for the random groups that comprises a plurality of data is carried out the information of inner grouping.
At last, group comprise the digital coding information with the information that is used for data coding scheme, entropy coding information with the information that is used for the entropy coding scheme, with the corresponding reference value of a plurality of data, and have the reference value utilized and these data and second data portion of the difference that generates.
[for the application of audio coding (MPEG around)]
The embodiment that has unified aforementioned concepts of the present invention and feature is explained as follows.
Figure 24 is the block diagram according to the device that is used for audio compression and recovery of one embodiment of the present invention.
With reference to Figure 24, comprise audio compression portion 105~400 and audio frequency recovery section 500~800 according to the device that is used for audio compression and recovery of one embodiment of the present invention.
And, fall mixed portion 105 and comprise that mixed portion 110 and spatial information generating unit 120 fall in channel.
In falling mixed portion 105, it is N channel X1 that the input that mixes portion 110 falls in channel, X2 ..., the sound signal of XN and sound signal.
Channel falls the portion of mixing 110 outputs and is fallen and blend together the signal of the number of channel less than the input channel number.
Fall the output that mixes portion 105 and blended together one or two channel by falling---mix the particular channel number of ordering according to falling separately, perhaps the particular channel number that realization is preset according to system.
200 pairs of channels of core encoder portion fall the output that mixes portion 110 (that is, fall after mixing sound signal) and carry out core encoder.In this case, core encoder is according to utilizing the mode of compressing input such as multiple conversion scheme such as discrete transform schemes to carry out.
Spatial information generating unit 120 is extracted spatial information from multi channel audio signal.Then, spatial information generating unit 120 is sent to spatial information encoding section 300 with the spatial information that extracts.
The spatial information of 300 pairs of inputs of spatial information encoding section carries out digital coding and entropy coding.Spatial information encoding section 300 is carried out at least a among PCM, PBC and the DIFF.In some cases, spatial information encoding section 300 is also carried out entropy coding.Can decide decoding scheme according to that a kind of data coding scheme that spatial information encoding section 300 is used according to spatial information lsb decoder 700.And, after a while with reference to Figure 25, spatial information encoding section 300 is elaborated.
The output of the output of core encoder portion 200 and spatial information encoding section 300 is input to multiplexing unit 400.
Multiplexing unit 400 is multiplexed into bit stream with these two inputs, then this bit stream is sent to audio frequency recovery section 500 to 800.
Audio frequency recovery section 500 to 800 comprises demultiplexing portion 500, core codec portion 600, spatial information lsb decoder 700 and multichannel generating unit 800.
The compressing audio signal that core codec portion 600 receives from demultiplexing portion 500.Core codec portion 600 generates by compressing audio signal is decoded falls audio mixing signal frequently.
The compression stroke information that spatial information lsb decoder 700 receives from demultiplexing portion 500.Spatial information lsb decoder 700 generates spatial information by compression stroke information is decoded.
During this period, the various grouping informations of expression that from the bit stream that receives, comprise in the extraction data structure shown in Figure 23 and the identification information of coded message.From at least one or a plurality of decoding scheme, select specific decoding scheme according to this identification information.And, generate spatial information by spatial information being decoded according to selected decoding scheme.In this case, can use any data coding scheme to decide the decoding scheme of spatial information lsb decoder 700 according to spatial information encoding section 300.And, after a while with reference to Figure 26, spatial information lsb decoder 700 is elaborated.
Simultaneously, audio compression portion 105~400 provides representation space information encoding section 300 to use the identifier of what data coding scheme to audio frequency recovery section 500~800.For tackling above-mentioned situation, audio frequency recovery section 500~800 comprises the device that is used to resolve identification information.
Thereby spatial information lsb decoder 700 decides decoding scheme with reference to the identification information that audio compression portion 105~400 provides.Preferably, provide the identification information device of resolving that is used for the expression encoding scheme for spatial information lsb decoder 700.
Figure 25 is that wherein, spatial information is called as spatial parameter according to the detailed diagram of the spatial information encoding section of one embodiment of the present invention.
With reference to Figure 25, comprise pcm encoder portion 310, DIFF (differential coding) portion 320 and Huffman (Huffman) encoding section 330 according to the encoding section of one embodiment of the present invention.Huffman encoding portion 330 is corresponding to an embodiment that carries out aforementioned entropy coding.
32 pairs of spatial parameters of DIFF portion carry out aforementioned DIFF.
Specifically, in the present invention, optionally operate, come spatial parameter is encoded for one in grouping pcm encoder portion 311, PBC portion 312 and the DIFF portion 320.And, its control device is not shown separately among this figure.
In aforementioned description, described the PBC that PBC portion 312 carries out in detail, so omitted its explanation in the following description.
For another example of PBC, spatial parameter is carried out PBC one time.And, can also to the first time PBC the result carry out (N>1) PBC N time.Specifically, to the result's of PBC pilot value or difference are carried out PBC at least one time for the first time as carrying out.In some cases, preferably from the second time PBC difference except that pilot value is only carried out PBC.
DIFF portion 320 comprises the DIFF_FREQ encoding section 321 of spatial parameter being carried out DIFF_FREQ, and the DIFF_TIME encoding section 322 and 323 of spatial parameter being carried out DIFF_TIME.
In DIFF portion 320, a spatial parameter to input of selecting from the group that is made of DIFF_FREQ encoding section 321 and DIFF_TIME encoding section 322 and 323 is handled.
In this case, with the DIFF_TIME coded portion for spatial parameter being carried out the DIFF_TIME_FORWARD portion 322 of DIFF_TIME_FORWARD and spatial parameter being carried out the DIFF_TIME_BACKWARD portion 323 of DIFF_TIME_BACKWARD.
In DIFF_TIME encoding section 322 and 323, selected from DIFF_TIME_FORWARD portion 322 and DIFF_TIME_BACKWARD portion 323 spatial parameter to input carries out the digital coding processing.In addition, in aforementioned description, described each DIFF that the carries out coding in the intraware 321,322 and 323 of DIFF portion 320 in detail, so omitted its explanation in the following description.
In the output of 330 pairs of PBC portions 312 of Huffman encoding portion and the output of DIFF portion 320 at least one carried out Huffman encoding.
Selected one in HUFF_1D portion 331 in the Huffman encoding portion 330 and HUFF_2D portion 332 and 333 is carried out Huffman encoding to input and handles.
In this case, with HUFF_2D portion 332 and 333 be divided into to the data that bundle based on frequency to the frequency of carrying out Huffman encoding to 2 dimension Huffman encoding portions (hereinafter being abbreviated as HUFF_2D_FREQ_PAIR portion) 332 and to the data that bundle based on the time to time of carrying out Huffman encoding to 2 dimension Huffman encoding portions (hereinafter being abbreviated as HUFF_2D_TIME_PAIR portion) 333.
In HUFF_2D portion 332 and 333, selected one in HUFF_2D_FREQ_PAIR portion 332 and the HUFF_2D_TIME_PAIR portion 333 is carried out Huffman encoding to input and handles.
In the following description, will each Huffman encoding that carries out in the intraware 331,332 and 333 of Huffman encoding portion 330 be elaborated.
After this, carry out multiplexing with the output of the grouping pcm encoder portion 311 that will transmit the output of Huffman encoding portion 330.
In spatial information encoding section according to the present invention, will be inserted into by the various identification informations that digital coding and entropy coding generate in the transmission bit stream.And, should transmit bit stream and be sent to spatial information lsb decoder shown in Figure 26.
Figure 26 is the detailed diagram according to the spatial information lsb decoder of one embodiment of the present invention.
With reference to Figure 26, the spatial information lsb decoder receives the transmission bit stream that comprises spatial information, generates this spatial information by the transmission bit stream that receives is decoded then.
Spatial information lsb decoder 700 comprises that identifier extracts (mark analysis unit) 710, PCM lsb decoder 720, Huffman decoding portion 730 and differential decoding portion 740.
The identifier analysis unit 710 of spatial information lsb decoder is extracted various identifiers from the transmission bit stream, then the identifier that extracts is resolved.This means the various information of mentioning in the aforementioned description of extracting Figure 23.
The spatial information lsb decoder can utilize the output of identifier analysis unit 710 to know which kind of encoding scheme spatial parameter has been used, then decision and the corresponding decoding scheme of discerning of encoding scheme.In addition, the processing carried out of identifier analysis unit 710 can be carried out by aforementioned demultiplexing portion 500 equally.
Grouping PCM lsb decoder 721 generates spatial parameter by the transmission bit stream is carried out the PCM decoding.In some cases, grouping PCM lsb decoder 721 is by the spatial parameter of portion in groups in next life that the transmission bit stream is decoded.
Based on the lsb decoder 722 of pilot tone by the output of Huffman decoding portion 730 is carried out generating spatial parameter based on the decoding of pilot tone.The situation that comprises pilot value in this output corresponding to Huffman decoding portion 730.For individual other example, can comprise pilot extraction portion (not shown) based on the lsb decoder 722 of pilot tone, be used for directly extracting pilot value from the transmission bit stream.Thereby, utilize pilot value that pilot extraction portion extracted and generate spatial parameter value as the difference of the output of Huffman decoding portion 730.
730 pairs of transport stream of Huffman decoding portion are carried out Huffman decoding.Huffman decoding portion 730 comprises by the transmission bit stream is carried out 1 dimension Huffman decoding 1 of output data value dimension Huffman decoding portion (hereinafter being abbreviated as the HUFF_1D lsb decoder) 731 and 2 dimension Huffman decoding portions (hereinafter being abbreviated as the HUFF_2D lsb decoder) 732 and 733 by exporting a pair of data value separately to transmitting bit stream to carry out 2 dimension Huffman decodings one by one.
For Huffman encoding scheme in the transmission bit stream is the situation of HUFF_2D, identifier analysis unit 710 also extract expression HUFF_2D scheme be HUFF_2D_FREQ_PAIR or the identifier of HUFF_2D_TIME_PAIR (for example, bsParsing), resolve the identifier that is extracted then.Thereby identifier analysis unit 710 can identify and constitute two a pair of data and be based on frequency and also be based on the time and be bundled in together.And, with frequency to 2 dimension Huffman decodings (hereinafter be abbreviated as HUFF_2D_FREQ_PAIR decoding) and time in the 2 dimension Huffman decodings (hereinafter being abbreviated as HUFF_2D_TIME_PAIR decodes) with the corresponding decision of corresponding situation be the Huffman decoding scheme.
In HUFF_2D lsb decoder 732 and 733, HUFF-2D_FREQ_PAIR portion 732 carries out the HUFF_2D_FREQ_PAIR decoding, and HUFF_2D_TIME_PAIR portion 733 carries out the HUFF_2D_FREQ_TIME decoding.
Based on the output of identifier analysis unit 710, the output of Huffman decoding portion 730 is sent to lsb decoder 722 or differential decoding portion 740 based on pilot tone.
For the DIFF scheme is the situation of DIFF_TIME, identifier analysis unit 710 also from the transmission bit stream, extract expression DIFF_TIME be DIFF_TIME_FORWARD or the identifier of DIFF_TIME_BACKWARD (for example, bsDiffTimeDirection), resolve the identifier that is extracted then.
Thereby the output that can discern Huffman decoding portion 730 is difference between current data and the last data or the difference between current data and next data.With among DIFF_TIME_FORWARD and the DIFF_TIME_BACKWARD with the corresponding decision of corresponding situation be the DIFF_TIME scheme.
In DIFF_TIME lsb decoder 742 and 743, DIFF_TIME_FORWARD portion 742 carries out the DIFF_TIME_FORWARD decoding, and DIFF_TIME_BACKWARD portion 743 carries out the DIFF_TIME_BACKWARD decoding.
Decide the process of Huffman decoding scheme and data decode scheme to be explained as follows to output based on the identifier analysis unit 710 in the spatial information lsb decoder.
For example, identifier analysis unit 710 read be illustrated in used when spatial parameter encoded among PCM and the DIFF which first identifier (for example, bsPCMCoding).
If first identifier is corresponding to the value of expression PCM, then identifier analysis unit 710 further read expression use among PCM and the PBC which spatial parameter has been carried out coding second identifier (for example, bsPilotCoding).
If second identifier is corresponding to the value of expression PBC, then the spatial information lsb decoder carries out the corresponding decoding with PBC.
If second identifier is corresponding to the value of expression PCM, then the spatial information lsb decoder carries out the corresponding decoding with PCM.
On the other hand, if first identifier corresponding to the expression DIFF value, then the spatial information lsb decoder carries out the corresponding decoding processing with DIFF.
To the method that send, receives the decode through above-mentioned audio coding scheme encoded signals be described below.
Figure 27 is the block diagram of device that is used to send sound signal according to embodiment of the present invention.With reference to Figure 27 the dispensing device of this embodiment according to the present invention is described.To convert signal as the audio/video signal of broadcast singal to and in multiplexer 1000, carry out multiplexing with mpeg 2 transport stream (TS) form.Sound signal can be to carry out encoded signals according to reference Figure 24 and 25 described coding methods.
1000 pairs of multiplexers comprise that the signal with MPEG-2TS form of the sound signal that is used for energy dispersal carries out multiplexing.External encoder 2100 and outer interleaver (interleaver) 2200 can be encoded and interweave multiplex data, thereby strengthens the transfer efficiency of multiplexed signals.Outer coding methods can comprise reed-solomon (Reed-Solomon) coding method and deinterleaving method can comprise the convolutional interleave method.
3200 pairs of transmission signals of internal encoder 3100 and inner interleaver are encoded and are interweaved, to prevent generation error in transmission signals.Internal encoder can come transmission signals is encoded based on punctured convolutional codes, and inner deinterleaving method can comprise this machine that uses based on storer according to transmission mode (as 2k pattern, 4k pattern or 8k pattern) or go deep into deinterleaving method.
Mapper 3500 is considered parameter signals (TPS) and pilot signal according to transmission mode, according to 16 quadrature amplitude modulation (16QAM), 64QAM or Quadrature Phase Shift Keying (QPSK) transmission signals with symbol is shone upon.Frame formation portion 4000 utilizes OFDM (OFDM) method that mapping signal is modulated, and is inserted with protection frame at interval in the data break of formation comprising this modulation signal.Each frame all comprises 68 OFDM symbols.Each symbol all comprises 6817 carrier waves under the 8k pattern, and all comprises 1705 carrier waves under the 2k pattern.Protection is the circulation continuity (cyclic continuation) of data trnascription in the data break at interval, and its length becomes with transmission mode.The OFDM frame comprises scattered pilot, continuous pilot and TPS carrier wave.After a while with reference to Figure 28, the structure of the frame that forms by frame formation portion shown in Figure 27 is elaborated.
D/A switch portion 4100 will have data break and protection digital broadcast signal at interval converts simulating signal to, and sending part 4200 sends the simulating signal that converts to the mode via the RF signal.Therefore, can send the sound signal of utilizing above-mentioned coding method coding by the DVB-T form.
The signal that Figure 28 shows in the formed frame of frame formation portion shown in Figure 27 is arranged.In Figure 28, Tu represents the quantity of effective available carrier wave, the scattered pilot distance on the Dt express time direction, and Df represents the scattered pilot distance on the frequency direction.Scattered pilot distance D f on the frequency direction has determined the delay scope of the ghost image that can estimate in the channel (ghost).Figure 28 shows the insertion position of pilot tone when receiving the signal that forms in frame formation portion.
In order to insert position when receiving signal temporarily, arrange these symbols according to the mode that per four incoming symbols present same pilot frequency mode in pilot tone.That is, have and the identical scattered pilot of symbol, and can be when receiving signal carry out at t=2 the interim insertion of 3 and 4 symbols of importing in the position of scattered pilot in t=5 input as the symbol of first input (t=1).
Have and the identical scattered pilot pattern of symbol at the symbol of t=6 input, and can carry out at t=3 the interim insertion of 4 and 5 symbols of importing in the position of the scattered pilot of the symbol of the symbol of t=2 input and t=6 input in the t=2 input.
Therefore, if imported symbol and when receiving signal, carried out interim insertion at t=7, then because have the scattered pilot of per four carrier waves at the symbol of t=4 input, so between the scattered pilot of the symbol of t=4 input, reduced into 1/4 of original gap between these scattered pilots, and had wherein per four patterns that carrier wave is positioned of scattered pilot at the symbol of t=4 input in the gap on the frequency direction.Therefore, can be in location, symbol place more pilot when reception.Therefore, when utilizing continuous pilot and scattered pilot to send signal, can be when received signal according to the state self-adaption ground compensate for channel of receive channel.
Figure 29 shows the dispensing device that is used to send coding audio signal of another embodiment according to the present invention.Below with reference to Figure 29, the dispensing device of another embodiment according to the present invention is described.
Another example as sending sound signal can use hand-held digital video broadcast (DVB-H) scheme.The DVB-H scheme expands to mobile terminal area with broadcast area, and can utilize IP datagram to send transmission information.IP datagram is represented to be used to utilize IP-based packet to send the treated signal of signal, and comprises data capsule (container) and the information that is used to send the header that comprises the IP address.In the IP datagram of packet unit, data capsule can comprise vision signal and sound signal.That is, the DVB-H scheme is used the signal after the internet protocol datagram broadcasting scheme is divided sound signal and vision signal and compression and sent division as unit with packet.After a while with reference to Figure 30, the structure of the IP data that are used for the IP data broadcasting is elaborated.
Can carry out the IP datagram that in signal conversion part 8000, is packaged into by the time slicing method according to MPE multiplexing, to reduce power consumption.Multiplexed signals is converted to transport stream and carries out multiplexing with the MPEG-2TS that comprises vision signal or sound signal.Modulation and encoding section 5000 can comprise the assembly by label shown in Figure 27 21 to 42 expressions.Can carry out the processing that the DVB-T broadcast singal is modulated and encoded described with reference to Figure 27 to the sound signal of multiplexing MPEG-2TS, and send by broadcast singal.
Figure 30 is the embodiment that is used for the packet structure of processing signals according to of the present invention.Specifically, although the RTP packet structure is described as the example of packet structure, the present invention also is applicable to other packets that are used for deal with data except that the RTP packet.
Can utilize host-host protocol to use realaudio data such as real-time transport protocol (rtp) or RTP Control Protocol (RTCP).RTP/RTCP is the example that can transmit the agreement of multimedia or broadcasted content in real time by Internet reliably.RTP carries out under UDP, and carries out many transmission, but does not comprise that transmission control function, connection are provided with function and frequency band reserved function.RTP can transmit end-to-end real time data such as interactive video or audio frequency by unicast or multicast channel.
With reference to Figure 30 (a), the IP header that the RTP packet comprises useful load, RTP header, UDP header and represents as the IP header to distinguish.
The RTP header comprises as version represents the field " Ver " distinguished, whether carried out the field " pad " in the zone of filling as expression, field " x " as the extension header district, represent the field " cc " distinguished as the coefficient of contribution source identifier (CSRC), the field " M " that the symbol that serves as a mark is distinguished, represent the field " PT " distinguished as PT Payload Type, field " Sequence number (sequence number) " as sequence of data packet number expression district, as the field of representing the effective time of packet to distinguish " time stamp (timestamp) ", represent the field " SSRC " distinguished as synchronous source identifier, and represent the field " CSRC " distinguished as the contribution source identifier.
Figure 30 (b) shows the example of UDP header, and UDP is that wherein transmitter side sends data unilaterally and need not inform the communication protocol that sends or received signal by the internet exchange message time.That is, UDP is that wherein transmitter side sends the data transmitter side agreement of not getting in touch with receiver side simultaneously unilaterally, therefore is called as connectionless protocol.
UDP header comprises that field " Source port address (source port address) ", the expression of address that expression is used to generate the application program of particular message is used to receive the field " Total length (total length) " of total length of field " Destination port addrss (destination port address) ", expression user datagram of address of the application program of particular message, and the field " Checksum (verification and) " that is used for error correction.
Figure 30 (c) shows the example of IP header.In the present invention, the packet in the IP packet is called as datagram.
The IP header comprises the field " VER " of the version number of expression IP header, the field " HLEN " of the length of expression IP header, expression is for the field " Service type (type of service) " of input that is used for coming according to defined rule the IP protocol apparatus of processing messages, expression comprises the field " Total length (total length) " of the length of data package of protocol header, be used for segmentation (fragmentation) so that the field " identification (sign) " of the segmentation of sign reorganization segmentation, the field " Flags (mark) " whether expression can carry out segmentation to datagram, field " Fragmentation offset (grading excursion) " as the pointer of the data-bias of expression original datagram when the segmentation, the expression packet is kept field how long " Time to live (time-to-live) " on network, the host-host protocol that expression is used for transmits data packets is TCP, UDP still is the field " Protocol (agreement) " of ICMP, thereby the integrality that is used to check header does not keep the field " header checksum (header check and) " of remainder data bag, the field " Sourceaddress (source address) " of the internet address of the original source of expression datagram, the field " Destination address (destination-address) " of the internet address of the final destination of expression datagram, and the field " Option (optional) " that is used for the additional functionality of IP datagram.
As shown in figure 24, core encoder voice data and through the voice data of spatial information coding, that is, and utilize pilot frequency benchmark value and pilot tone differential coding voice data can send with same RTP packet.Alternatively, the core encoder voice data can send with a RTP packet, and utilize pilot frequency benchmark value and pilot tone differential coding voice data can send with the 2nd RTP packet.In this case, identical timestamp information is inserted in the header of RTP packet, thereby can decodes simultaneously the voice data of a RTP packet and the voice data of the 2nd RTP data.If a RTP packet comprises multiframe core encoder voice data, then the 2nd RTP packet can comprise the voice data of equal number frame.Can with the RTP useful load be unit to the core encoder voice data or utilize the pilot frequency benchmark value and the pilot tone differential coding voice data (all being included in each RTP packet) interweave, thereby be easy to carry out error correction.
Send comprising utilize pilot frequency benchmark value and pilot tone differential coding the IP packet of voice data the time, expression can have been comprised utilize pilot frequency benchmark value and pilot tone differential coding the identifier of voice data be added in the broadcast signal streams.This identifier can be included in the RTP header, perhaps can be configured to resolve by AudioSpecificConfig (), thereby identify audio object as the MPEG-4 audio stream.With reference to Figure 32, AudioSpecificConfig () is elaborated below.
Figure 31 show transmission wherein to the DVB-H system applies example of the business that sends of time slicing method professional and the common channel by DVB-T system and DVB-H system.Can send program by each the channel in DVB-H system and the DVB-T system.When the channel by the DVB-H system sends program, can carry out time division multiplex to these business by the time slicing method and send then.Sound signal is included in the IP datagram of DVB-H system, is converted into MPE or MPE-FEC, and has the MEPG-2TS of MPE or MPEG-FEC to be sent out with wherein embedding.
Figure 32 show transmission/reception be illustrated in channel by DVB-T system or DVB-H system send utilize pilot frequency benchmark value and pilot tone differential coding voice data the time voice data carried out the example of the identifier of Methods for Coding.AudioSpecificConfig () can resolve the information that can identify the method that the audio object (audioObject) as the coding audio data that comprises in the MPEG-4 stream is compressed.That is, can be included in the MPEG-4 broadcast transmission stream and be sent out according to the identification information of audio coding of the present invention.The type of audio object (audioObject) has according to the identifier that audio object is carried out Methods for Coding.Figure 33 shows the identifier of audio object type.
With reference to Figure 33, when audio object be included in the core encoder voice data and utilize pilot frequency benchmark value and pilot tone differential coding the stream that sends of voice data in the time, identifier is arranged to the audio object type, thereby can send/receive the information relevant, as shown in figure 33 with the method for compressed audio object.In Figure 33, utilize using the coding method (MPS) of pilot frequency benchmark value and the pilot tone difference identifier of decoding is 30 audio object.Therefore, when receiving audio compressed data, utilizing AudioSpecificConfig () (case 30 in the branch statement) to come the analyzing audio object type is 30 audio object, can consider to utilize coding method (as utilizing the coding method of pilot frequency benchmark value and pilot tone difference) this audio object of decoding of spatial information.
As mentioned above, can send the core encoder audio object by the DVB-H form, utilize pilot frequency benchmark value and pilot tone differential coding audio object and be used to identify the identifier that audio object is carried out Methods for Coding.
Therefore, be used to receive and the device of processing signals can comprise be used for to comprise utilize pilot frequency benchmark value and pilot tone differential coding the broadcast singal of voice data carry out tuning tuner and be used for considering demodulating the demodulation section of broadcast singal through the time-varying scattered pilot of the frame of tuning broadcast singal and the continuous pilot of fixing in time.Can be input to demodulation multiplexer shown in Figure 24 from the signal of demodulation section output, and can come reconstructed audio signal by multichannel.According to example shown in Figure 24, when the voice data that received the core encoder voice data and utilize pilot frequency benchmark value and pilot tone differential coding, can utilize device shown in Figure 24 this data of decoding.As mentioned above, although utilize pilot frequency benchmark value and pilot tone differential coding voice data lost, only can decode to the core encoder data.
Figure 34 show can according to utilize pilot frequency benchmark value and pilot tone differential coding the overview and the grade of the sound signal that reconstructs of voice data.As shown in figure 34, can be when sending voice data according to the method and apparatus that is used for processing signals of the present invention by the RTP packet, the sound signal of having utilized all grades of overview to send/receive to utilize pilot frequency benchmark value and pilot tone differential coding.Yet,, have the bit stream of 515,525 tree configuration but still can decode although this device is only supported the arbitrary grade according to the function of receiving trap.For example, although 515 audio bit streams of grade 2 send with the bit stream through efficient Advanced Audio Coding version 2 (HE AAC v2) coding, but the embodiment of signal processing apparatus shown in Figure 24 to 26 can be to decoding through the bit stream of core encoder by HE AAC v2, fall mixed channel thereby generate monophony, and generate the 2 channels output of the binary decoding schema that utilizes MPS.
Figure 35 and 36 shows another example of transmission/reception identifier, this identifier be used for the channel by DVB-T system or DVB-H system send utilize pilot frequency benchmark value and pilot tone differential coding voice data time notice this voice data is carried out Methods for Coding.Figure 35 shows according to the configuration information of audio object (promptly, AudiospecificConfig ()) obtains to have carried out the payload information of the object of core encoder, and parse the grammer of the expansion useful load (extension_payload) of useful load based on AAC.Be 1100 (Figure 36) if expansion type is the value of EXT_SAC_DATA and EXT_SAC_DATA, then represent to be sent to the expansion useful load through the voice data of MPS coding according to the expansion useful load.Therefore, can identify the voice data that utilizes pilot frequency benchmark value and pilot tone differential coding according to the encode identification information of the voice data that comprises in the useful load.
Industrial usability
It will be apparent to those skilled in the art that preferred embodiment of the present invention only is exemplary, and in the situation that does not break away from the spirit or scope of the present invention, can carry out various improvement, modification, change or interpolation to embodiments of the present invention. For example, be applicable to multiple application and product according to grouping of the present invention, data encoding and entropy coding. In addition, can be provided for storing the medium of the data with at least one feature of the present invention.
Claims (15)
1, a kind of method that is used for processing signals, this method may further comprise the steps:
Reception comprises the broadcast singal of voice data, and this voice data utilizes pilot frequency benchmark value and pilot tone difference to encode;
Time-varying scattered pilot in one frame of the broadcast singal that consideration receives and fixing in time continuous pilot are come this broadcast singal of demodulation and the broadcast singal after the demodulation are decoded to obtain broadcast transmission stream;
This broadcast transmission stream is carried out the identifier of demultiplexing to obtain the coding audio data in Internet protocol (IP) packet and to be used to identify the method for this voice data of decoding;
From this coding audio data obtain with corresponding this pilot frequency benchmark value of a plurality of data and with corresponding this pilot tone difference of this pilot frequency benchmark value; And
Utilize this pilot frequency benchmark value and this pilot tone difference to obtain this voice data.
2, method according to claim 1, this method also comprise at least one step of decoding in this pilot frequency benchmark value and this pilot tone difference.
3, method according to claim 1, wherein, these data are parameters, and
Wherein, this method parameter of also comprising utilization and being obtained is come the step of reconstructed audio signal.
4, method according to claim 3, wherein, this parameter comprises channel grade poor (CLD), inter-channel correlation (ICC) and falls at least one of mixing in the gain (ADG) arbitrarily.
5, method according to claim 1, wherein, in the mean value that this pilot frequency benchmark value is these a plurality of data, intermediate value, the most frequently used value and the default value one.
6, method according to claim 1, wherein, this pilot frequency benchmark value is a value of extracting from table.
7, method according to claim 1, this method is further comprising the steps of: select the highest data of code efficiency as final pilot frequency benchmark value after being provided with this pilot frequency benchmark value in these a plurality of data each.
8, method according to claim 1, wherein, this IP packet comprises the real-time transport protocol (rtp) packet, this RTP packet comprises the core encoder voice data and according to the pilot frequency benchmark value and the pilot tone difference of this core encoder voice data.
9, method according to claim 1, wherein, this IP packet comprises a RTP packet and the 2nd RTP packet, the one RTP packet comprises the core encoder voice data, and the 2nd RTP packet comprises pilot frequency benchmark value and pilot tone difference according to this core encoder voice data, and a RTP packet has identical timestamp information with the 2nd RTP packet.
10, method according to claim 1, wherein, this identifier is to obtain in the configuration information of the audio object that comprises from this broadcast transmission stream.
11, a kind of device that is used for processing signals, this device comprises:
Tuner is used for the tuning broadcast singal that comprises voice data, and this voice data utilizes pilot frequency benchmark value and pilot tone difference to encode;
Demodulation section is used for considering coming this broadcast singal of demodulation through the time-varying scattered pilot and the fixing in time continuous pilot of a frame of tuning broadcast singal;
Demultiplexing portion, be used for the signal after the demodulation is decoded, the broadcast transmission stream that comprises voice data is carried out demultiplexing, and parse in by the audio stream of demultiplexing when utilizing this pilot frequency benchmark value and this pilot tone difference to carry out the identifier of this voice data of coding that demultiplexing goes out to utilize this pilot frequency benchmark value and this pilot tone difference to carry out this voice data and the core encoder voice data of coding from this audio stream;
Core codec portion is used for this core encoder voice data is decoded;
The spatial information lsb decoder is used for the voice data that utilizes this pilot frequency benchmark value and this pilot tone difference to carry out coding is decoded; And
The multichannel generating unit is used for and will exports with the form of multi-channel audio from the sound signal of this core codec portion and the output of this spatial information lsb decoder.
12, device according to claim 11, wherein, the audio stream that this demultiplexing portion demultiplexing goes out comprises the IP packet, and this IP packet comprises this core encoder voice data and utilizes this pilot frequency benchmark value and this pilot tone difference and this core encoder voice data to carry out the voice data of encoding.
13, device according to claim 12, wherein, this IP packet comprises the RTP packet, this RTP packet comprises according to the pilot frequency benchmark value of this core encoder voice data and pilot tone difference.
14, device according to claim 12, wherein, this IP packet comprises a RTP packet and the 2nd RTP packet, the one RTP packet comprises this core encoder voice data, and the 2nd RTP packet comprises pilot frequency benchmark value and pilot tone difference according to this core encoder voice data, and a RTP packet has identical timestamp information with the 2nd RTP packet.
15, device according to claim 11, wherein, this identifier is to obtain in the configuration information of the audio object that comprises from this audio stream.
Applications Claiming Priority (17)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US72565405P | 2005-10-13 | 2005-10-13 | |
US60/725,654 | 2005-10-13 | ||
US60/726,228 | 2005-10-14 | ||
US60/729,713 | 2005-10-25 | ||
US60/730,394 | 2005-10-27 | ||
US60/730,393 | 2005-10-27 | ||
US60/737,760 | 2005-11-18 | ||
US60/752,911 | 2005-12-23 | ||
US60/753,408 | 2005-12-27 | ||
US60/758,231 | 2006-01-12 | ||
US60/758,238 | 2006-01-12 | ||
KR10-2006-0004050 | 2006-01-13 | ||
KR10-2006-0004049 | 2006-01-13 | ||
KR10-2006-0030651 | 2006-04-04 | ||
KR10-2006-0079836 | 2006-08-23 | ||
KR10-2006-0079838 | 2006-08-23 | ||
KR10-2006-0079837 | 2006-08-23 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101288115A true CN101288115A (en) | 2008-10-15 |
Family
ID=40059364
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2006800380606A Pending CN101288116A (en) | 2005-10-13 | 2006-10-13 | Method and apparatus for signal processing |
CNA2006800380574A Pending CN101310328A (en) | 2005-10-13 | 2006-10-13 | Method and apparatus for signal processing |
CNA2006800380589A Pending CN101288115A (en) | 2005-10-13 | 2006-10-13 | Method and apparatus for signal processing |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2006800380606A Pending CN101288116A (en) | 2005-10-13 | 2006-10-13 | Method and apparatus for signal processing |
CNA2006800380574A Pending CN101310328A (en) | 2005-10-13 | 2006-10-13 | Method and apparatus for signal processing |
Country Status (1)
Country | Link |
---|---|
CN (3) | CN101288116A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105593930A (en) * | 2013-07-22 | 2016-05-18 | 弗朗霍夫应用科学研究促进协会 | Apparatus and method for enhanced spatial audio object coding |
CN107431829A (en) * | 2015-03-12 | 2017-12-01 | 索尼公司 | Information processor, communication system, information processing method and non-transitory computer-readable medium |
US10249311B2 (en) | 2013-07-22 | 2019-04-02 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Concept for audio encoding and decoding for audio channels and audio objects |
US10277998B2 (en) | 2013-07-22 | 2019-04-30 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for low delay object metadata coding |
WO2024016758A1 (en) * | 2022-07-20 | 2024-01-25 | 哲库科技(上海)有限公司 | Audio data transmission method and apparatus, chip, electronic device, and storage medium |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102142924B (en) * | 2010-02-03 | 2014-04-09 | 中兴通讯股份有限公司 | Versatile audio code (VAC) transmission method and device |
EP2830060A1 (en) | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Noise filling in multichannel audio coding |
US9922664B2 (en) | 2016-03-28 | 2018-03-20 | Nuance Communications, Inc. | Characterizing, selecting and adapting audio and acoustic training data for automatic speech recognition systems |
CN106483088A (en) * | 2016-12-27 | 2017-03-08 | 东南大学 | A kind of gas concentration measuring apparatus based on ultraviolet light modulation and method |
CN112198496B (en) * | 2020-09-29 | 2022-11-29 | 上海特金无线技术有限公司 | Signal processing method, device and equipment and storage medium |
CN112506916B (en) * | 2020-10-29 | 2024-04-09 | 望海康信(北京)科技股份公司 | Main data processing method, system, corresponding computer equipment and storage medium |
-
2006
- 2006-10-13 CN CNA2006800380606A patent/CN101288116A/en active Pending
- 2006-10-13 CN CNA2006800380574A patent/CN101310328A/en active Pending
- 2006-10-13 CN CNA2006800380589A patent/CN101288115A/en active Pending
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11463831B2 (en) | 2013-07-22 | 2022-10-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for efficient object metadata coding |
US11330386B2 (en) | 2013-07-22 | 2022-05-10 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for realizing a SAOC downmix of 3D audio content |
US10249311B2 (en) | 2013-07-22 | 2019-04-02 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Concept for audio encoding and decoding for audio channels and audio objects |
US10277998B2 (en) | 2013-07-22 | 2019-04-30 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for low delay object metadata coding |
US11984131B2 (en) | 2013-07-22 | 2024-05-14 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Concept for audio encoding and decoding for audio channels and audio objects |
US10659900B2 (en) | 2013-07-22 | 2020-05-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for low delay object metadata coding |
US11910176B2 (en) | 2013-07-22 | 2024-02-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for low delay object metadata coding |
US10701504B2 (en) | 2013-07-22 | 2020-06-30 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for realizing a SAOC downmix of 3D audio content |
CN105593930B (en) * | 2013-07-22 | 2019-11-08 | 弗朗霍夫应用科学研究促进协会 | The device and method that Spatial Audio Object for enhancing encodes |
US11227616B2 (en) | 2013-07-22 | 2022-01-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Concept for audio encoding and decoding for audio channels and audio objects |
US10715943B2 (en) | 2013-07-22 | 2020-07-14 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for efficient object metadata coding |
US11337019B2 (en) | 2013-07-22 | 2022-05-17 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for low delay object metadata coding |
CN105593930A (en) * | 2013-07-22 | 2016-05-18 | 弗朗霍夫应用科学研究促进协会 | Apparatus and method for enhanced spatial audio object coding |
CN107431829A (en) * | 2015-03-12 | 2017-12-01 | 索尼公司 | Information processor, communication system, information processing method and non-transitory computer-readable medium |
CN107431829B (en) * | 2015-03-12 | 2021-05-07 | 索尼公司 | Information processing apparatus, communication system, information processing method, and non-transitory computer readable medium |
WO2024016758A1 (en) * | 2022-07-20 | 2024-01-25 | 哲库科技(上海)有限公司 | Audio data transmission method and apparatus, chip, electronic device, and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN101288116A (en) | 2008-10-15 |
CN101310328A (en) | 2008-11-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101288115A (en) | Method and apparatus for signal processing | |
US8019611B2 (en) | Method of processing a signal and apparatus for processing a signal | |
US11184409B2 (en) | Broadcasting signal transmission device, broadcasting signal reception device, broadcasting signal transmission method, and broadcasting signal reception method | |
US11516556B2 (en) | Apparatus for transmitting broadcast signal, apparatus for receiving broadcast signal, method for transmitting broadcast signal and method for receiving broadcast signal | |
US8203930B2 (en) | Method of processing a signal and apparatus for processing a signal | |
US9860571B2 (en) | Apparatus for transmitting broadcast signal, apparatus for receiving broadcast signal, method for transmitting broadcast signal and method for receiving broadcast signal | |
US11888916B2 (en) | Broadcast signal tranmission device, broadcast signal reception device, broadcast signal tranmission method, and broadcast signal reception method | |
US9749372B2 (en) | Device for transmitting broadcast signal, device for receiving broadcast signal, method for transmitting broadcast signal, and method for receiving broadcast signal | |
US10499095B2 (en) | Apparatus and method for receiving/transmitting broadcast signal | |
US10341036B2 (en) | Broadcast signal transmission apparatus, broadcast signal reception apparatus, broadcast signal transmission method, and broadcast signal reception method | |
US8194754B2 (en) | Method for processing a signal and apparatus for processing a signal | |
CN101317215A (en) | Signal processing method and device | |
KR20070041398A (en) | Method and apparatus for processing a signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20081015 |