WO2005078703A1 - Verfahren und vorrichtung zum quantisieren eines informationssignals - Google Patents
Verfahren und vorrichtung zum quantisieren eines informationssignals Download PDFInfo
- Publication number
- WO2005078703A1 WO2005078703A1 PCT/EP2005/001343 EP2005001343W WO2005078703A1 WO 2005078703 A1 WO2005078703 A1 WO 2005078703A1 EP 2005001343 W EP2005001343 W EP 2005001343W WO 2005078703 A1 WO2005078703 A1 WO 2005078703A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- values
- value
- block
- threshold
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
- G10L19/265—Pre-filtering, e.g. high frequency emphasis prior to encoding
Definitions
- the present invention relates generally to quantizers or the quantization of information signals and, in exemplary embodiments, to the quantization of audio signals, as used, for example, for data compression of audio signals or for audio coding. In a specific embodiment, the present invention relates to audio coding with a short delay time.
- the currently best known audio compression process is the MPEG-1 Layer III.
- the sampling or audio values of an audio signal are encoded with loss in an encoded signal.
- irrelevance and redundancy of the original audio signal are reduced or ideally removed during the compression.
- a psychoacoustic model recognizes simultaneous and temporal masking, i.e. a temporally changing masking threshold which is dependent on the audio signal is calculated or determined, which indicates the volume at which tones of a certain frequency are only perceptible to the human ear.
- This information is in turn used to encode the signal by quantizing the spectral values of the audio signal more precisely, less precisely or not at all, depending on the masking threshold, and integrating them into the encoded signal.
- Audio compression methods such as the MP3 format, are then limited in their applicability when it comes to compressing audio data over a bit rate-limited transmission channel, on the one hand, but transmitting it with the shortest possible delay time.
- the delay time doesn't matter Role, such as in the archiving of audio information.
- audio coders with low delay sometimes also called “ultra low delay coders” are necessary where time-critical transmissions of audio signals are concerned, such as teleconferencing, wireless speakers or microphones.
- the article by Schuller G. etc. “Perceptual Audio Coding using Adaptive Pre- and Post-Filters and Lossless Compression", IEEE Transactions on Speech and Audio Processing, Vol. 10, No. 6, September 2002, pp. 379-390, a Audio coding proposed, in which the irrelevance reduction and the redundancy reduction are not carried out based on a single transformation, but on two separate transformations.
- the coding is based on an audio signal 902 which has already been sampled and is therefore already available as a sequence 904 of audio or sample values 906, the sequence in time of the audio values 906 being indicated by an arrow 908.
- a listening threshold is calculated using a psychoacoustic model. For example, FIG.
- FIG. 13 shows a diagram in which the spectrum is represented by the frequency f with the curve a a signal block from 128 audio values 906 and at b the masking threshold, as calculated by a psychoacoustic model, is plotted in logarithmic units, as already mentioned, the masking threshold indicates to what intensity frequencies are inaudible to the human ear All tones are below the masking threshold B.
- an irrelevance reduction is now achieved by controlling a parameterizable filter followed by a quantizer.
- parameterization is calculated such that the frequency response the same to the I nversen des Amount corresponds to the masking threshold. This parameterization is indicated in FIG. 12 by x # (i).
- quantization takes place with a constant step size, for example a rounding operation to the nearest integer.
- the resulting quantization noise is white noise.
- the filtered signal is "re-transformed" again with a parameterizable filter, the transfer function of which is set to the amount of the masking threshold itself. In this way, not only is the filtered signal decoded again, but also the quantizing noise on the decoder side is adapted to the shape of the masking threshold - If the masking threshold corresponds exactly, a gain value a # is also calculated on the encoder side for each parameter set or for each parameterization, which is applied to the filtered signal before quantization.
- the gain value becomes the decoder side so that the inverse transformation can be carried out a and the parameterization x are transmitted to the encoder as page information 910 in addition to the actual main data, namely the quantized, filtered audio values 912.
- this data ie page information 910 u nd main data 912, still subjected to lossless compression, namely entropy coding, whereby the coded signal is obtained.
- the article suggests a size of 128 samples 906 as the block size. This enables a relatively short delay of 8 ms at a sampling rate of 32 kHz.
- the article also describes that to increase the efficiency of the page information coding, the page information, namely the coefficients x # and a #, are only transmitted if there is sufficient change compared to a previously transmitted parameter set, ie if the Change exceeds a certain threshold. moreover describes that the implementation is preferably carried out so that a current parameter set is not applied directly to all samples belonging to the respective block, but that a linear interpolation of the filter coefficients x # is used in order to avoid audible artifacts.
- a lattice structure is proposed for the filter in order to prevent the occurrence of instabilities.
- the article still suggests to selectively multiply or weaken the filtered signal scaled with the time-dependent gain factor a by a factor not equal to 1, so that audible interference arise, but the bit rate can be reduced at difficult to encode locations of the audio signal.
- a problem with the above scheme is that due to the need to transmit the masking threshold or the transfer function of the encoder-side filter, hereinafter referred to as pre-filter to have to, the transmission channel is loaded relatively high, although the filter coefficients are only transmitted when a predetermined threshold is exceeded.
- Another disadvantage of the above coding scheme is that due to the fact that the masking threshold or the inverse thereof must be made available on the decoding side by the parameter set x # to be transmitted, a compromise between on the one hand the lowest possible bit rate or a high compression ratio and others - On the one hand, the most accurate approximation or parameterization of the masking threshold or the inverse of it must be made. It is therefore inevitable that the audio coding scheme given above to the masking threshold adapted quantization noise at some frequency ranges exceeds the masking threshold and therefore leads to audible audio interference for the listener. 13 shows, for example with curve c, the parameterized frequency response of the decoder-side parameterizable filter.
- the transfer function of the decoder-side filter hereinafter also referred to as the post filter
- the masking threshold b exceeds the masking threshold b.
- the problem is exacerbated by the fact that the parameterization is only transmitted intermittently if there is enough change between the parameterizations and is interpolated in between.
- An interpolation of the filter coefficients x # as proposed in the article, alone leads to audible disturbances if the amplification value a # is kept constant from support point to support point or from new parameterization to new parameterization.
- the interpolation proposed in the article is also applied to the side information value a # , ie the transmitted gain values, audible audio artifacts can remain in the audio signal arriving on the decoder side.
- a further problem with the audio coding scheme according to FIGS. 12 and 13 is that the filtered signal can take on an unforeseeable form due to the frequency-selective filtering, in which a single or individual audio values of the coded signal change due to a random superposition of many individual harmonics add up to very high values which, due to their rare occurrence, in turn lead to a poorer compression ratio in the subsequent redundancy reduction.
- the object of the present invention is to provide a method and a device for quantizing an information signal, so that with only a slight deterioration in the quality of the original information a higher data compression of the information signal can be realized.
- Quantizing an information signal of a sequence of information values in accordance with the invention comprises frequency-selective filtering of the sequence of information values in order to obtain a sequence of filtered information values, and quantizing the filtered information values in order to obtain a sequence of quantized information values by means of a quantization stage function which the filtered ones Maps information values to the quantized information values, and their course is steeper below a threshold information value than above the threshold information value.
- frequency-selective filtering of an audio signal results in artificially generated artifacts in the resulting filtered information signal, at which individual information values assume values that are significantly higher than the maximum values of the original signal due to a random constructive interference of all or many harmonics. such as more than twice as high.
- a truncation of the filtered information signal above a suitable threshold which is, for example, twice as large as the largest possible value of the original information signal to be filtered, so that the artifacts artificially generated by the frequency-selective filtering from the filtered Information signal are removed or smoothed, after back filtering hardly leads to an impairment of the quality of the information signal filtered back after quantization, but cutting off or increasing the quantization step size above a suitable one Threshold offers enormous savings in a bit representation of the filtered information signal.
- the information signal is an audio signal in which the selective quantization above or below a certain threshold leads to a barely audible reduction in audio quality with a simultaneous enormous reduction in the bit representation.
- the quantization level function can alternatively be provided to quantize all audio values to a highest quantization level above the threshold value, or a quantization level function is used which is flatter above the threshold value or has a larger quantization step size above the threshold value, so that the artificial generated artifacts can be quantized more coarsely.
- FIG. 1 is a block diagram of an audio encoder according to an embodiment of the present invention.
- FIG. 2 is a flow chart illustrating the operation of the audio encoder of FIG. 1 at the data input;
- FIG. 3 shows a flowchart to illustrate the mode of operation of the audio encoder from FIG. 1 with regard to the evaluation of the incoming audio signal by a psychoacoustic model
- FIG. 4 is a flowchart to illustrate the operation of the audio encoder of FIG. 1 with regard to the application of the psychoacoustic table model received parameters on the incoming audio signal;
- Fig. 5a is a schematic diagram illustrating the incoming audio signal, the sequence of audio values from which it is composed, and the working steps of Fig. 4 in relation to the audio values;
- 5b is a schematic diagram to illustrate the structure of the coded signal
- FIG. 6 shows a flowchart to illustrate the mode of operation of the audio encoder from FIG. 1 with regard to the final processing up to the coded signal;
- FIG. 7a shows a diagram in which an exemplary embodiment of a quantization level function is shown
- FIG. 7b shows a diagram in which a further exemplary embodiment for a quantization level function is shown
- FIG. 8 shows a block diagram of an audio decoder, which is capable of decoding an audio signal encoded by the audio encoder of FIG. 1, according to an exemplary embodiment of the present invention
- Fig. 9 is a flow chart illustrating the operation of the decoder of Fig. 8 at the data input;
- Fig. 10 is a flow chart illustrating the operation of the decoder of Fig. 8 with respect to the intermediate storage of the pre-decoded, quantized, filtered audio data and the Processing of audio blocks without associated page information;
- FIG. 11 shows a flow chart to illustrate the mode of operation of the decoder from FIG. 8 with regard to the actual back-filtering
- FIG. 12 is a schematic diagram illustrating a conventional audio coding scheme with a short delay time
- FIG. 13 shows a diagram in which, by way of example, a spectrum of an audio signal, a listening threshold thereof and the transfer function of the post filter are shown in the decoder.
- the audio encoder which is generally indicated at 10, first comprises a data input 12, at which it receives the audio signal to be encoded, which, as will be explained in more detail later with reference to FIG. 5a, consists of a sequence of audio values or samples there is, and a data output at which the coded signal is output, the information content of which is discussed in more detail with reference to FIG. 5b.
- the audio encoder 10 of FIG. 1 is divided into an irrelevance reduction part 16 and a redundancy reduction part 18.
- the irrelevance reduction part 16 comprises a device 20 for determining a monitoring threshold, a device 22 for calculating a gain value, a device 24 for calculating a parameterization, and a reference point comparison device 26, a quantizer 28 and a parameterizable pre-filter 30 as well as an input FIFO (first-in-first-out) buffer 32, a buffer or memory 38 and a multiplier or a multip Licensing device 40.
- the redundancy reduction part 18 comprises a compressor 34 and a bit rate controller 36.
- Irrelevance reduction part 16 and redundancy reduction part 18 are connected in series in this order between data input 12 and data output 14.
- the data input 12 is connected to a data input of the device 20 for determining a listening threshold and a data input of the input buffer 32.
- a data output of the device 20 for determining a monitoring threshold is connected to an input of the device 24 for calculating a parameterization and to a data input of the device 22 for calculating a gain value in order to forward a determined monitoring threshold to the same.
- the devices 22 and 24 calculate a parameterization or a gain value on the basis of the listening threshold and are connected to the reference point comparison device 26 in order to forward these results to the same.
- the reference point comparison device 26 forwards the results calculated by the devices 22 and 24 as input parameters or parameterization to the parameterizable pre-filter 30.
- the parameterizable pre-filter 30 is connected between a data output of the input buffer 32 and a data input of the buffer 38.
- the multiplier 40 is connected between a data output of the buffer 38 and the quantizer 28.
- the quantizer 28 forwards multiplied or scaled, but in any case quantized, filtered audio values to the redundancy reduction part 18, to be precise to a data input of the compressor 34.
- the reference point comparison device 26 forwards information to the redundancy reduction part 18, from which the data is transmitted the parameterizable pre-filter 30 forwarded input parameters can be derived, specifically to a further data input of the compressor 34.
- the bit rate control is connected via a control connection to a control input of the multiplier 40 in order to do so ensure that the quantized, filtered audio values, as they are obtained from the pre-filter 30, are multiplied by the multiplier 40 by a suitable multiplier, as will be discussed in more detail below.
- the bit rate controller 36 is connected between a data output of the compressor 34 and the data output 14 of the audio encoder 10 in order to suitably determine the multiplier for the multiplier 40.
- the multiplier is initially set to an appropriate scaling factor, such as 1.
- buffer 38 continues to store each filtered audio value to give bit rate controller 36 the opportunity to use the multiplier as described below change another iteration of a block of audio values. If such a change is not indicated by the bit rate controller 36, the buffer 38 can release the memory occupied by this block.
- the audio signal when it reaches the data input 12, has already been obtained from an analog audio signal by audio signal sampling 50.
- the audio signal sampling is carried out at a predetermined sampling frequency, which is usually between 32-48 kHz. Consequently, an audio signal is present at data input 12, which consists of a sequence of sampling or audio values.
- the coding of the audio signal does not take place in a block-based manner, the audio values at the data input 12 are first combined in a step 52 to form audio blocks.
- the summary of the audio blocks is only for the purpose of determining the listening threshold. le and takes place in an input stage of the device 20 for determining a listening threshold.
- 5a shows the sequence of samples at 54, each sample being illustrated by a rectangle 56.
- the samples are numbered for purposes of illustration, only some of the samples of the sequence 54 being shown for reasons of clarity.
- 128 consecutive samples are combined to form a block, the 128 samples immediately following forming the next block.
- the combination into blocks could also be carried out differently, for example by overlapping blocks or spaced blocks and blocks with a different block size, although the block size of 128 is again preferred because it is a good compromise between the one high audio quality and, on the other hand, the lowest possible delay time.
- the incoming audio values are buffered or buffered 54 in the input buffer 32 until the parameterizable pre-filter 30 is compared by the reference point comparison.
- direction 26 has received input parameters in order to carry out a prefiltration, as will be described below.
- the device 20 for determining a monitoring threshold begins its processing immediately after sufficient audio values have been received at the data input 12 to form an audio block or to form the next audio block, which the device 20 confirms by a check monitored in step 60. If a complete audio block that can be edited is not yet available, the device 20 waits.
- the device 20 calculates a monitoring threshold in a step 62 on the basis of a suitable psychoacoustic model in a step 62 based on a suitable psychoacoustic model.
- a suitable psychoacoustic model in a step 62 based on a suitable psychoacoustic model.
- the masking threshold which is determined in step 62, is a frequency-dependent function that can vary for successive audio blocks and can also vary significantly from audio signal to audio signal, such as from rock to classical music pieces.
- the listening threshold specifies a threshold value for each frequency, below which the human ear cannot perceive interference.
- amplification value a or a parameter set from N parameters x (i) (i 1, ... from the calculated listening threshold M (f) (where f is the frequency). , N).
- the device 24 now calculates the parameterization aj; such that the transfer function H (f) of the parameterizable pre-filter 30 is approximately equal to the inverse of the amount of the masking threshold M (f), i.e. so that
- the filter coefficients aj obtained as follows: the inverse discrete Fourier transform of
- the target autocorrelation function r ⁇ i) gives the frequency for the block at time t. Then the a are obtained by solving the linear system of equations:
- a lattice structure is preferably used for the filter 30, the filter coefficients for the lattice structure being reflected in parameters are re-parameterized.
- the filter coefficients for the lattice structure being reflected in parameters are re-parameterized.
- the device 22 calculates a noise power limit, namely a limit which indicates which noise power the quantizer 28 in may introduce the audio signal filtered by the pre-filter 30 so that the quantization noise after the back or post-filtering on the decoder side is below the listening threshold M (f) or exactly on the same.
- the device 22 calculates this noise power limit as the area below the square of the amount of the listening threshold M, i.e. as
- the quantization noise is the noise caused by the quantizer 28.
- the noise caused by the quantizer 28 is, as will be described later, white noise and is therefore frequency-independent.
- the quantization noise power is the power of the quantization noise.
- the device 22 calculates the noise power limit in addition to the gain value a. Furthermore, although it is possible for the reference point comparison device 26 to calculate the noise power limit again from the gain value a obtained from the device 22, it is also possible that In addition to the gain value a of the reference point comparison device 26, the device 22 also immediately transmits the determined noise power limit.
- the reference point comparison device 26 After calculating the gain value and the parameterization, the reference point comparison device 26 then checks in a step 66 whether the parameterization just calculated differs by more than a predetermined threshold from the current parameterization that was last passed on to the parameterizable pre-filter. If the check in step 66 shows that the parameterization just calculated differs from the current one by more than the predetermined threshold, the filter coefficients just calculated and the gain value or the noise power limit just calculated are temporarily stored in the reference point comparison device 26 for an interpolation which is still to be discussed and the reference point comparison device 26 transfers the filter coefficients that have just been calculated in a step 68 and the gain value that has just been calculated in a step 70 to the pre-filter 30.
- the node comparison device (26) transfers to the pre-filter 30 in step 72, instead of the parameterization just calculated, only the current node parameterization, ie the parameterization that last led to a positive result in step 66, that is to say that it differed from a previous interpolation point parameterization by more than a predetermined threshold.
- the process of Fig. 3 returns to processing the next audio block, i.e. to query 60, back.
- the pre-filter 30 applies this node parameterization to all samples of this audio block located in the FIFO 32, as will be described in more detail below, whereby this current block is removed from the FIFO 32 and the quantizer 28 produces a resultant one Receives audio block from pre-filtered audio values.
- FIG. 4 shows the mode of operation of the parameterizable pre-filter 30 in more detail in the event that it receives the parameterization and the gain value which have just been calculated because they differ sufficiently from the current reference point parameterization.
- processing according to FIG. 4 does not take place for each of the successive audio blocks, but rather only for audio blocks in which the associated parameterization differs sufficiently from the current node parameterization.
- the other audio blocks, as just described are pre-filtered by applying the current reference point parameterization and the associated current amplification value to all samples of these audio blocks.
- the parameterizable pre-filter 30 now monitors whether a transfer of just calculated filter coefficients from the Node comparison device 26 has taken place or from older node parameterizations. The pre-filter 30 carries out the monitoring 80 until such a transfer has taken place.
- the parameterizable pre-filter 30 begins processing the current audio block of audio values that is currently in the buffer memory 32, that is to say the one for which the parameterization has just been calculated.
- FIG. 5a has, for example, illustrated that all audio values 56 before the audio value with the number 0 have already been processed and therefore the memory 32 has already been processed have happened.
- the processing of the block of audio values before the audio value with the number 0 was triggered at the time because the parameterization that was calculated for the audio block before the block 0, namely x o (i), differs by more than the predetermined threshold from the previously defined to the pre-filter 30 base parameterization.
- the parameterization xo (i) is therefore a support parameterization, as it is referred to in the present invention.
- the processing of the audio values in the audio block before the audio value 0 was carried out based on the parameter set a 0 , xo (i).
- the pre-filter 30 determines in step 84 the noise power limit qi corresponding to the gain value ai. This can be done by the reference point comparison device 26 forwarding this value to the pre-filter 30, or by recalculating this value by the pre-filter 30, as was described with reference to step 64 above.
- an index j is then initialized to a sample value in a step 86 in order to point to the oldest sample value remaining in the FIFO memory 32 or the first sample value of the current audio block “block 1”, ie in the present example from FIG. 5a Sampling value 128.
- the parameterizable pre-filter carries out an interpolation between the filter coefficients x 0 and Xi, with parameterization x 0 as a base value at the base with audio value number 127 of the previous block 0 and parameterization xi as base value at the base Audio value number 255 of the current block 1 applies.
- These audio value positions 127 and 255 are also referred to below as support point 0 and support point 1, the support point parameterizations relating to the support points being indicated by the arrows 90 and 92.
- the parameterizable pre-filter 30 performs an interpolation between the noise power limit qi and qo in order to obtain an interpolated noise power limit at the scanning position j, ie q (t j ).
- the parameterizable pre-filter 30 then calculates the gain value for the sampling position j on the basis of the interpolated noise power limit and the quantization noise power, and preferably also the interpolated filter coefficient, namely, for example, depending on the root
- Quantization noise power . ,. , To which end the execution q (t 3) approximations to step 64 of FIG. 3 is referenced.
- the parameterizable pre-filter 30 then applies the calculated gain value and the interpolated filter coefficients to the sample at the sample position j in order to obtain a filtered sample value for this sample position, namely s' (t-,).
- the parameterizable pre-filter 30 checks whether the scanning position j has reached the current interpolation point, ie interpolation point 1, in the case of FIG. 5a the scanning position 255, ie the scanning value for which the parameterizable pre-filter 30 is transmitted. parameterization plus gain value should apply immediately, ie without interpolation. If this is not the case, the parameterizable pre-filter 30 increases or increments the index j by 1, the steps 88-96 being repeated again.
- step 100 the parameterizable pre-filter applies the gain value last transmitted by the reference point comparison device 26 and the filter coefficients last transmitted by the interpolation point comparator 26 immediately without interpolation to the sample value at the new interpolation point, whereupon the current block, ie in the present case block 1, has been processed and the process again at step 80 with respect to the subsequent block to be processed is carried out, which, depending on whether the parameterization of the next audio block block 2 differs sufficiently from the parameterization x ⁇ (i), may also be this next audio block block 2 or is a later audio block.
- the purpose of the filtering is to filter the audio signal at the input 12 with an adaptive filter, the transfer function of which is constantly adapted as optimally as possible to the inverse of the listening threshold, which also changes over time.
- the reason for this is that on the decoder side the back-filtering by an adaptive filter, the transfer function of which is accordingly constantly adapted to the listening threshold, the white quantization noise introduced by quantization of the filtered audio signal, i.e. the quantization noise, which is constant in frequency, shapes, namely adapts to the shape of the listening threshold.
- the application of the gain value in steps 94 and 100 in the pre-filter 30 consists in multiplying the audio signal or the filtered audio signal, ie the sample values s or the filtered sample values s', by the gain factor.
- the purpose is to thereby as far as possible the quantization noise, which is inserted into the filtered audio signal by the quantization described below, and which is adapted to the shape of the monitoring threshold on the decoder side set so high that it does not exceed the listening threshold. This can be illustrated by the Parseval 'see formula, according to which the square of a function is equal to the square of the Fourier transform.
- the quantization noise power is also reduced, namely by the factor a "2 , where a is the gain value. Consequently, by using the gain value in the pre-filter 30, the quantization noise power can be set optimally high, which is synonymous with the fact that the quantization step size increases and thus the number of quantization stages to be encoded is reduced, which in turn increases the compression in the subsequent redundancy reduction part.
- the effect of the prefilter can be viewed as a normalization of the signal to its masking threshold, so that the level of the quantization disturbances or the quantization noise can be kept constant both in time and in frequency. Since the audio signal is in the time domain, the quantization can therefore be carried out step by step with a uniform constant quantization, as will be described below. In this way, any irrelevance is ideally removed from the audio signal and a lossless compression scheme can be used to also remove the remaining redundancy in the pre-filtered and quantized audio signal, as will be described below.
- the filter coefficients and gain values a 0 , ai, x 0 , Xi used must of course be available as side information on the decoder side, but this reduces the transmission effort will mean that new filter coefficients and new gain values are not simply reused for each block. Rather, a threshold value check 66 takes place in order to transmit the parameterizations as page information only when there is a sufficient change in parameterization, and otherwise the page information or parameterizations are not transmitted. On the audio blocks for which the parameterizations have been transferred, an interpolation from the old to the new parameterization takes place over the area of these blocks. The filter coefficients are interpolated in the manner described above with reference to step 88.
- the interpolation with regard to the amplification takes place via a detour, namely via a linear interpolation 90 of the noise power limit q 0 , qi.
- a detour namely via a linear interpolation 90 of the noise power limit q 0 , qi.
- the linear interpolation leads to a better hearing result or less audible artifacts with regard to the noise power limit.
- the filtered sample values output by the parameterizable pre-filter 30 are stored in the buffer 38 and at the same time passed from the buffer 38 to the multiplier 40, where, since it is their first pass, they are initially unchanged, namely with a scaling factor of one are passed through the multiplier 40 to the quantizer 28.
- the filtered audio values above an upper barrier are cut off in a step 110 and then quantized in a step 112.
- the two steps 110 and 112 are carried out by the quantizer 28.
- the two steps 110 and 112 are preferably carried out by the quantizer 28 in one step in that the filtered audio values s' are quantized with a quantization stage function which For example, in a floating point representation, filtered sample values s' are mapped to a plurality of integer quantization level values or indices and, starting from a certain threshold value for the filtered sample values, run flat, so that filtered sample values that are larger than the threshold value are quantized to one and the same quantization level become.
- a quantization level function is shown in FIG. 7a.
- the quantized filtered samples are designated ⁇ 'in FIG. 7a.
- the quantization level function is preferably a quantization level function with a constant step size below the threshold, i.e. the jump to the next quantization level always takes place after a constant interval along the input values S '.
- the step size to the threshold value is set in such a way that the number of quantization stages preferably corresponds to a power of 2.
- the threshold value is smaller, so that a maximum value of the displayable range of the floating point representation exceeds the threshold value.
- the threshold value is that it has been observed that the filtered audio signal, which is output by the pre-filter 30, has individual audio values which add up to very large values due to an unfavorable accumulation of harmonics. It has also been observed that clipping these values, as achieved by the quantization level function shown in Fig. 7a, results in high data reduction, but only a minor degradation in audio quality. Rather, these isolated points in the filtered audio signal are created artificially by the frequency-selective filtering in the parameterizable filter 30, so that cutting them off only slightly impairs the audio quality. A somewhat more concrete example of the quantization level function shown in FIG.
- Fig. 7a would be one which rounds all filtered sample values s' to the nearest integer up to the threshold value, and from then on quantizes all the filtered sample values above it to the highest quantization level, such as eg 256. This case is shown in Fig. 7a.
- FIG. 7b Another example of a possible quantization level function would be that shown in Fig. 7b.
- the quantization level function of FIG. 7b corresponds to that of FIG. 7a.
- the quantization stage function continues with a slope that is smaller than the slope in the area below the threshold value.
- the quantization step size is larger above the threshold value. This achieves an effect similar to that of the quantization function of FIG. 7a, but with more effort on the one hand due to the different step sizes of the quantization level function above and below the threshold value and on the other hand an improved audio quality, since very high filtered audio values s' are not completely cut off , but only be quantized with a larger quantization step size.
- the compressor 34 therefore makes a first compression attempt and compresses side information including the gain values a o and ai at the interpolation points, such as 127 and 255, and the filter coefficients. ten x 0 and x ⁇ at the support points and the quantized, filtered sample values ⁇ 'in a preliminary filtered signal.
- the compressor 34 is a loss-free encoder, such as a Huffman or arithmetic encoder with or without prediction and / or adaptation.
- the memory 38 through which the sampled audio values ⁇ 'pass, serves as a buffer for a suitable block size, with which the compressor 34 processes the quantized, filtered and, if necessary, scaled audio values ⁇ ' output by the quantizer 28 as described below.
- the block size may differ from the block size of the audio blocks as used by the device 20.
- the bit rate controller 36 controlled the multiplier 40 with a multiplier of 1, so that the filtered audio values from the pre-filter 30 pass unchanged to the quantizer 28 and from there to the compressor 34 as quantized, filtered audio values.
- the compressor 34 monitors whether a certain compression block size, ie a certain number of quantized, sampled audio values, has been encoded in the preliminary coded signal or whether further quantized, filtered audio values ⁇ 'in the current preliminary coded signal are to be coded. If the compression block size has not been reached, the compressor 34 continues to perform the current compression 114.
- the bit rate controller 36 checks in a step 118 whether the bit quantity required for the compression is larger than a bit quantity prescribed by a desired bit rate. If this is not the case, the bit rate controller 36 checks in a step 120 whether the required bit quantity is smaller than the bit quantity prescribed by the desired bit rate. If this is the case, the bit rate controller 36 adds fill bits to the coded signal in step 122 until the prescribed bit quantity is reached by the desired bit rate. The coded signal is then output in step 124.
- bit rate controller 36 could send the compression block of filtered audio values ⁇ ', which is still stored in the memory 38 and is the basis for the compression, multiplied by a multiplier greater than 1 by the multiplier 40 to the quantizer 28 in order to repeat the steps 110-118 forward until the bit quantity prescribed by the desired bit rate is reached, as indicated by a dashed step 125.
- step 118 if the check in step 118 reveals that the required bit quantity is greater than that prescribed by the desired bit rate, the bit rate controller 36 changes the multiplier for the multiplier 40 to a factor between 0 and 1 exclusively. It does this in step 126.
- the bit rate controller 36 ensures that the memory 38 outputs the last compression block of the filtered audio values ⁇ 'on which the compression is based, the latter then being multiplied by the factor set in step 126 and fed again to the quantizer 28 steps 110-118 are then carried out again and the previously coded signal is rejected.
- step 114 the factor used in step 126 (or step 125) is of course also incorporated into the coded signal.
- step 126 The sense of the procedure after step 126 is that the factor increases the effective step size of the quantizer 28. This means that the resulting quantization noise is evenly above the masking threshold, which leads to audible interference or audible rem noise, but results in a reduced bit rate. If, after repeating steps 110-116 in step 118, it is again determined that the required bit quantity is greater than that prescribed by the desired bit rate, the factor is further reduced in step 126, etc.
- the next compression block is carried out by the subsequent quantized, filtered audio values ⁇ '.
- FIG. 5b once again illustrates the resulting encoded signal, indicated generally at 130.
- the encoded signal includes page information and intermediate main data.
- the side information includes information from which the value of the gain value and the value of the filter coefficients can be derived for special audio blocks, namely audio blocks in which there has been a significant change in the filter coefficients as a result of audio blocks. If necessary, the side information also includes further information relating to the gain value used for the bit control. Because of the mutual dependency between the gain value and the noise power limit q, the side information can optionally include the noise power limit q # or only the latter in addition to the gain value a # for a support point #.
- the side information is preferably arranged within the coded signal in such a way that the side information on filter coefficients and associated gain value or associated noise level are arranged in front of the main data for the audio block of quantized, filtered audio values ⁇ ', from which these filter coefficients with the associated gain value or associated noise power limit have been derived, i.e. the side information a 0 , x 0 (i) after the block -1 and the Side information ai, x x (i) after block 1.
- the main data ie the quantized, filtered audio values ⁇ '
- the main data are exclusive of an audio block of the type in which there is a significant change in the filter coefficients as a result of audio blocks up to and including the next audio block of this type, in Fig. 5b for example the audio values ⁇ '(t 0 ) - ⁇ ' (t 255 ), always between the side information block 132 to the former of these two audio blocks (block -1) and the another side information block 134 to the second of these two audio blocks (block 1).
- the audio values ⁇ '(t 0 ) - ⁇ ' (t ⁇ 27 ) are as in v 5a mentioned above was obtained or decodable solely by means of the side information 132, while the audio values ⁇ '(t ⁇ 28 ) - ⁇ ' (t 255 ) by interpolation using the side information 132 as reference values at the reference point with the sample number 127 and by means of the side information 134 as base values at the base with the sample number 255 have been obtained and can therefore only be decoded with both side information.
- the side information relating to the gain value or the noise power limit and the filter coefficients is not always integrated independently in each side information block 132 and 134. Rather, this page information is transmitted in differences from the previous page information block.
- the side information block 132 contains, for example, gain value a 0 and filter coefficient x 0 with respect to the support point at time t_ ⁇ . In the page information block 132, these values can be derived from the block itself. However, from page information block 134 are the Page information regarding the support point at time t 255 can no longer be derived from this block alone.
- the side information block 134 only contains information about differences of the gain value ai of the support point at time t 2 ss to the gain value of the support point at time t 0 and the differences of the filter coefficients Xi to the filter coefficients x 0 .
- the page information block 134 consequently only contains the information relating to ai-ao and x ⁇ (i) -xo (i).
- the filter coefficients and the gain value or the noise power limit should be transmitted in full and not just as a difference to the previous interpolation point, eg every second, in order to enable a receiver or decoder to latch into a running stream of coding data, as will be discussed below.
- an exemplary embodiment for an audio decoder is described below which is suitable for decoding the encoded signal generated by the audio encoder 10 of FIG. 1 into a decoded, playable or further processable audio signal.
- the decoder indicated generally at 210, includes a decompressor 212, a FIFO memory 214, a multiplier 216 and a parameterizable post filter 218.
- Decompressors 212, FIFO memory 214, multiplier 216 and parameterizable post filter 218 are connected in this order between a data input 220 and a data output 222 of the decoder 210, the coded signal is obtained at the data input 220 and the decoded audio signal is output at the data output 222, which only differs from the original audio signal at the data input 12 of the audio encoder 10 by the quantization noise generated by the quantizer 28 in the audio encoder 10.
- the decompressor 212 is connected at a further data output to a control input of the multiplier 216 in order to forward a multiplier to the same, and via a further data output to a parameterization input of the parameterizable post filter 218.
- the decompressor 212 first decompresses the compressed signal present at the data input 220 in a step 224 in order to supply the quantized, filtered audio data, namely the sample values ⁇ ′, and the associated page information in the page information blocks 132 , 134, which indicate the filter coefficients and gain values or, instead of the gain values, the noise power limits at the reference points.
- the decompressor 212 checks the decompressed signal in the order of its arrival whether it contains page information with filter coefficients, in a self-contained form without reference to a previous page information block. In other words, the decompressor 212 searches for the first page information block 132. As soon as the decompressor 212 has found what it is looking for, the quantized, filtered audio values ⁇ 'are stored in the FIFO memory 214 in a step 228. saved.
- step 228 If a complete audio block of quantized, filtered audio values ⁇ 'has been stored during step 228 without a side information block immediately following, then this is first of all step 228 by means of the information about parameterization and gain value contained in the page information received in step 226 post-filtered in the post-filter and amplified in multiplier 216, whereby it is decoded and thus the associated decoded audio block is obtained.
- the decompressor 212 monitors the decompressed signal for the appearance of any kind of page information block, namely with absolute filter coefficients or filter coefficient differences towards a previous page information block.
- the decompressor 212 would recognize the appearance of the side information block 134 upon detection of the side information block 132 in step 226 in step 230.
- the block of quantized, filtered audio values ⁇ '(t 0 ) - ⁇ ' (t ⁇ 27 ) would have been decoded in step 228, using the side information 132.
- Step 232 is freely omitted if the current page information block is a self-contained page information block without differences, which, as described above, is the case, for example, every second can.
- side information blocks 132 in which the parameter values can be derived absolutely, ie without relation to another side information block, are arranged at sufficiently small intervals so that the switch-on time or the dead time when Switching on the audio encoder 210 is not too great in the case of a radio transmission or radio transmission, for example.
- the number of intervening side information blocks 134 with the difference values are also arranged in a fixed predetermined number between the side information blocks 132 so that the decoder knows when a side information block of the type 132 is expected again in the encoded signal.
- the different types of page information blocks are indicated by corresponding flags.
- a sample index j is initially initialized to 0 in step 234. This value corresponds to the sample position of the first sample value in the audio block currently remaining in the FIFO 214, to which the current page information relates.
- Step 234 is performed by the parameterizable post filter 218.
- the post filter 218 then carries out a calculation of the noise power limit at the new support point in a step 236, this step corresponding to step 84 of FIG.
- the post filter 218 then performs interpolations with respect to the filter coefficients and noise power limits that correspond to the interpolations 88 and 90 of FIG. 4.
- the subsequent calculation of the gain value for the sample position j based on the interpolated noise power limit and the interpolated filter coefficients from steps 238 and 240 in step 242 corresponds to step 92 of FIG. 4.
- the post filter 218 then applies the gain value calculated in step 242 and the interpolated filter coefficients to the sample value at the sample position j. This step differs from step 94 of FIG.
- the interpolated filter coefficients are applied to the quantized, filtered sample values ⁇ 'in such a way that the transfer function of the parameterizable post filter does not correspond to the inverse of the monitoring threshold, but to the monitoring threshold itself.
- the post-filter does not perform a multiplication by the gain value, but a division by the gain value on the quantized, filtered sample value ⁇ 'or already filtered back, quantized, filtered sample value at position j.
- step 248 If the post filter 218 has not yet reached the current support point with the scanning position j, which checks the same in step 246, it increments the scanning position index j in step 248 and starts steps 238-246 again. Only when the support point is reached does it apply the gain value and the filter coefficients of the new support point to the sample value at the support point, namely in step 250.
- the application comprises, instead of multiplication, division by means of the gain value and filtering a transfer function equal to the listening threshold and not the inverse of the latter.
- the current audio block is decoded by interpolation between two reference point parameterizations.
- the filtering and the application of the gain value in steps 218 and 224 adapt the noise introduced by the quantization during the coding in steps 110 and 112 both in form and in height to the listening threshold. It should also be pointed out that in the event that the quantized, filtered audio values have been subjected to a further multiplication in step 126 due to the bit rate control prior to encoding in the encoded signal, this factor can also be taken into account in steps 218 and 224 , Alternatively, the audio values obtained by the process of FIG. 11 can of course be subjected to a further multiplication in order to amplify the audio values weakened again for the sake of a low bit rate.
- the parameterization intended for an audio block or the gain value determined for this audio block can also be applied directly to another value, such as the audio value in the middle of the audio block, such as the 64th audio value in the case of the above block size of 128 audio values.
- the present invention has been described above with reference to a specific audio coding scheme that enables short delay times, the present invention is of course also applicable to other audio codes.
- an audio coding scheme would also be conceivable in which the coded signal consists of the quantized, filtered audio values per se without any redundancy. danzreductions is carried out. Accordingly, it would also be conceivable to carry out the frequency-selective filtering differently from the manner described above, namely on the encoder side with a transfer function equal to the inverse of the listening threshold and on the decoder side with a transfer function equal to the listening threshold.
- the present invention is not limited to audio signals. It is also applicable to other information signals, namely for example video signals consisting of a sequence of frames, i.e. a sequence of pixel arrays.
- the above audio coding scheme provides a possibility for limiting the bit rate in an audio encoder with a very short delay time.
- the bit rate peaks that arise during coding depending on the audio signal are avoided by limiting the output value range of the pre-filter.
- an upper limit for the bit rate of the transmission can always be observed, which often for example exists in wireless transmission media.
- the change in the quantization level function above the threshold is a suitable means of limiting the bit rate to the permissible maximum.
- the encoder consisted of a pre-filter that appropriately shapes the audio signal, a quantizer with a quantizer level height, followed by an entropy encoder.
- the quantizer generates values, which are also called indices.
- higher indices also mean a higher bit rate associated therewith, which was avoided, however, by limiting the area of the indices (FIG. 7a) or thinning them out (FIG. 7b), but with the possibility of deteriorating the audio quality.
- the quantizer would respond to a signal, for example, in order to either keep the quantization stage function constant 7a or 7b, so that the quantizer could be informed by the signal to carry out the quantization step reduction above the threshold value or the cutting off above the threshold value with a slight deterioration in audio quality.
- the threshold value could be gradually be gradually reduced. In this case, the threshold reduction instead of the factor reduction of step 126 could be performed.
- the preliminarily compressed signal could only be subjected to a selective threshold quantization in a modified step 126 if the bit rate is still too high (118).
- the filtered audio values would then be quantized using the quantization level function, which has a flatter course above the audio threshold. Further bit rate reductions could be carried out in the modified step 126 by reducing the threshold value and thus a further modification of the quantization level function.
- the quantization scheme according to the invention can also be implemented in software.
- the implementation can take place on a digital storage medium, in particular a floppy disk or a CD with electronically readable control signals, which can cooperate with a programmable computer system in such a way that the corresponding method is carried out.
- the invention thus also consists in a computer program product with program code stored on a machine-readable carrier for carrying out the method according to the invention when the computer program product runs on a computer.
- the invention can thus be implemented as a computer program with a program code for carrying out the method if the computer program runs on a computer.
- the scheme according to the invention can also be implemented in software.
- the implementation can take place on a digital storage medium, in particular a floppy disk or a CD with electronically readable control signals, which can interact with a programmable computer system in such a way that the corresponding method is carried out.
- the invention thus also consists in a computer program product with program code stored on a machine-readable carrier for carrying out the method according to the invention when the computer program product runs on a computer.
- the invention can thus be implemented as a computer program with a program code for carrying out the method if the computer program runs on a computer.
Abstract
Description
Claims
Priority Applications (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE502005001821T DE502005001821D1 (de) | 2004-02-13 | 2005-02-10 | Verfahren und vorrichtung zum quantisieren eines informationssignals |
JP2006552545A JP4444295B2 (ja) | 2004-02-13 | 2005-02-10 | 情報信号を量子化するための方法および装置 |
BRPI0506627A BRPI0506627B1 (pt) | 2004-02-13 | 2005-02-10 | método e dispositivo para quantizar um sinal de informações |
EP05715289A EP1697929B1 (de) | 2004-02-13 | 2005-02-10 | Verfahren und vorrichtung zum quantisieren eines informationssignals |
AU2005213767A AU2005213767B2 (en) | 2004-02-13 | 2005-02-10 | Method and device for quantizing a data signal |
CN200580004688XA CN1918630B (zh) | 2004-02-13 | 2005-02-10 | 量化信息信号的方法和设备 |
CA2555639A CA2555639C (en) | 2004-02-13 | 2005-02-10 | Method and device for quantizing a data signal |
US11/460,433 US7464027B2 (en) | 2004-02-13 | 2006-07-27 | Method and device for quantizing an information signal |
IL177164A IL177164A (en) | 2004-02-13 | 2006-07-31 | Method and device for quantizing an information signal |
NO20064091A NO337836B1 (no) | 2004-02-13 | 2006-09-12 | Kvantisering av datasignaler |
HK07100911A HK1093814A1 (en) | 2004-02-13 | 2007-01-25 | Method and device for quantizing a data signal |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102004007184A DE102004007184B3 (de) | 2004-02-13 | 2004-02-13 | Verfahren und Vorrichtung zum Quantisieren eines Informationssignals |
DE102004007184.5 | 2004-02-13 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/460,433 Continuation US7464027B2 (en) | 2004-02-13 | 2006-07-27 | Method and device for quantizing an information signal |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005078703A1 true WO2005078703A1 (de) | 2005-08-25 |
Family
ID=34853461
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2005/001343 WO2005078703A1 (de) | 2004-02-13 | 2005-02-10 | Verfahren und vorrichtung zum quantisieren eines informationssignals |
Country Status (16)
Country | Link |
---|---|
US (1) | US7464027B2 (de) |
EP (1) | EP1697929B1 (de) |
JP (1) | JP4444295B2 (de) |
KR (1) | KR100813193B1 (de) |
CN (1) | CN1918630B (de) |
AT (1) | ATE377243T1 (de) |
AU (1) | AU2005213767B2 (de) |
BR (1) | BRPI0506627B1 (de) |
CA (1) | CA2555639C (de) |
DE (2) | DE102004007184B3 (de) |
ES (1) | ES2294685T3 (de) |
HK (1) | HK1093814A1 (de) |
IL (1) | IL177164A (de) |
NO (1) | NO337836B1 (de) |
RU (1) | RU2337413C2 (de) |
WO (1) | WO2005078703A1 (de) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007131564A1 (de) * | 2006-05-12 | 2007-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Informationssignalcodierung |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE102004009955B3 (de) | 2004-03-01 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Ermitteln einer Quantisierer-Schrittweite |
US7627481B1 (en) | 2005-04-19 | 2009-12-01 | Apple Inc. | Adapting masking thresholds for encoding a low frequency transient signal in audio data |
JP4640020B2 (ja) * | 2005-07-29 | 2011-03-02 | ソニー株式会社 | 音声符号化装置及び方法、並びに音声復号装置及び方法 |
EP1852848A1 (de) | 2006-05-05 | 2007-11-07 | Deutsche Thomson-Brandt GmbH | Verfahren und Vorrichtung für verlustfreie Kodierung eines Quellensignals unter Verwendung eines verlustbehafteten kodierten Datenstroms und eines verlustfreien Erweiterungsdatenstroms |
EP2122615B1 (de) * | 2006-10-20 | 2011-05-11 | Dolby Sweden AB | Vorrichtung und verfahren zum codieren eines informationssignals |
US7823092B1 (en) * | 2007-11-23 | 2010-10-26 | Altera Corporation | Method and apparatus for implementing a parameterizable filter block with an electronic design automation tool |
EP2830059A1 (de) | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Rauschfüllungsenergieanpassung |
PL3413307T3 (pl) * | 2014-07-25 | 2021-01-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Urządzenie do kodowania sygnałów audio, urządzenie do dekodowania sygnałów audio i ich sposoby |
DE102014220687A1 (de) * | 2014-10-13 | 2016-04-14 | Continental Automotive Gmbh | Kommunikationsvorrichtung für ein Fahrzeug und Verfahren zum Kommunizieren |
RU2754497C1 (ru) * | 2020-11-17 | 2021-09-02 | федеральное государственное автономное образовательное учреждение высшего образования "Казанский (Приволжский) федеральный университет" (ФГАОУ ВО КФУ) | Способ передачи речевых файлов по зашумленному каналу и устройство для его реализации |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000063886A1 (en) * | 1999-04-16 | 2000-10-26 | Dolby Laboratories Licensing Corporation | Using gain-adaptive quantization and non-uniform symbol lengths for audio coding |
EP1160770A2 (de) * | 2000-06-02 | 2001-12-05 | Lucent Technologies Inc. | Perzeptuelle Kodierung von Audiosignalen unter Verwendung von getrennter Reduzierung von Irrelevanz und Redundanz |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3506912A1 (de) | 1985-02-27 | 1986-08-28 | Telefunken Fernseh Und Rundfunk Gmbh, 3000 Hannover | Verfahren zur uebertragung eines audiosignals |
DE3820037A1 (de) | 1988-06-13 | 1989-12-14 | Ant Nachrichtentech | Bildcodierverfahren und einrichtung |
DE3820038A1 (de) * | 1988-06-13 | 1989-12-14 | Ant Nachrichtentech | Verfahren zur aufbereitung und uebertragung einer bildsequenz |
JP3186290B2 (ja) * | 1993-01-20 | 2001-07-11 | ソニー株式会社 | 符号化方法、符号化装置、復号化装置及び記録媒体 |
US5581653A (en) | 1993-08-31 | 1996-12-03 | Dolby Laboratories Licensing Corporation | Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder |
DE19549621B4 (de) | 1995-10-06 | 2004-07-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung zum Codieren von Audiosignalen |
GB2307833B (en) * | 1995-12-01 | 2000-06-07 | Geco As | A data compression method and apparatus for seismic data |
DE69724819D1 (de) * | 1996-07-05 | 2003-10-16 | Univ Manchester | Sprachkodier- und dekodiersystem |
US6370477B1 (en) | 1996-11-22 | 2002-04-09 | Schlumberger Technology Corporation | Compression method and apparatus for seismic data |
US6131084A (en) | 1997-03-14 | 2000-10-10 | Digital Voice Systems, Inc. | Dual subframe quantization of spectral magnitudes |
KR100335609B1 (ko) | 1997-11-20 | 2002-10-04 | 삼성전자 주식회사 | 비트율조절이가능한오디오부호화/복호화방법및장치 |
US6195633B1 (en) * | 1998-09-09 | 2001-02-27 | Sony Corporation | System and method for efficiently implementing a masking function in a psycho-acoustic modeler |
KR100440896B1 (ko) * | 1998-12-30 | 2004-09-18 | 주식회사 대우일렉트로닉스 | 양자화복호화장치 |
US6370502B1 (en) * | 1999-05-27 | 2002-04-09 | America Online, Inc. | Method and system for reduction of quantization-induced block-discontinuities and general purpose audio codec |
EP1228569A1 (de) | 1999-10-30 | 2002-08-07 | STMicroelectronics Asia Pacific Pte Ltd. | Verfahren zur frequenzkoeffizientenkodierung in einem ac- 3 kodierer |
EP1377070A4 (de) * | 2001-03-28 | 2010-08-11 | Sony Corp | Quantisierungsvorrichtung, quantisierungsverfahren, quantisierungsprogramm und aufzeichnungsmedium |
DE10217297A1 (de) * | 2002-04-18 | 2003-11-06 | Fraunhofer Ges Forschung | Vorrichtung und Verfahren zum Codieren eines zeitdiskreten Audiosignals und Vorrichtung und Verfahren zum Decodieren von codierten Audiodaten |
CA2388352A1 (en) * | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for frequency-selective pitch enhancement of synthesized speed |
-
2004
- 2004-02-13 DE DE102004007184A patent/DE102004007184B3/de not_active Expired - Lifetime
-
2005
- 2005-02-10 EP EP05715289A patent/EP1697929B1/de active Active
- 2005-02-10 BR BRPI0506627A patent/BRPI0506627B1/pt active IP Right Grant
- 2005-02-10 AU AU2005213767A patent/AU2005213767B2/en active Active
- 2005-02-10 JP JP2006552545A patent/JP4444295B2/ja active Active
- 2005-02-10 CN CN200580004688XA patent/CN1918630B/zh active Active
- 2005-02-10 DE DE502005001821T patent/DE502005001821D1/de active Active
- 2005-02-10 AT AT05715289T patent/ATE377243T1/de active
- 2005-02-10 RU RU2006132742/09A patent/RU2337413C2/ru active
- 2005-02-10 WO PCT/EP2005/001343 patent/WO2005078703A1/de active IP Right Grant
- 2005-02-10 ES ES05715289T patent/ES2294685T3/es active Active
- 2005-02-10 KR KR1020067015896A patent/KR100813193B1/ko active IP Right Grant
- 2005-02-10 CA CA2555639A patent/CA2555639C/en active Active
-
2006
- 2006-07-27 US US11/460,433 patent/US7464027B2/en active Active
- 2006-07-31 IL IL177164A patent/IL177164A/en active IP Right Grant
- 2006-09-12 NO NO20064091A patent/NO337836B1/no unknown
-
2007
- 2007-01-25 HK HK07100911A patent/HK1093814A1/xx unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000063886A1 (en) * | 1999-04-16 | 2000-10-26 | Dolby Laboratories Licensing Corporation | Using gain-adaptive quantization and non-uniform symbol lengths for audio coding |
EP1160770A2 (de) * | 2000-06-02 | 2001-12-05 | Lucent Technologies Inc. | Perzeptuelle Kodierung von Audiosignalen unter Verwendung von getrennter Reduzierung von Irrelevanz und Redundanz |
Non-Patent Citations (1)
Title |
---|
SCHULLER G D T ET AL: "PERCEPTUAL AUDIO CODING USING ADAPTIVE PRE- AND POST-FILTERS AND LOSSLESS COMPRESSION", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE INC. NEW YORK, US, vol. 10, no. 6, September 2002 (2002-09-01), pages 379 - 390, XP001219570, ISSN: 1063-6676 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007131564A1 (de) * | 2006-05-12 | 2007-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Informationssignalcodierung |
DE102006022346B4 (de) * | 2006-05-12 | 2008-02-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Informationssignalcodierung |
NO20084786L (no) * | 2006-05-12 | 2008-12-11 | Fraunhofer Ges Forschung | Koding av informasjonssignal |
JP2009537033A (ja) * | 2006-05-12 | 2009-10-22 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | 情報信号符号化 |
AU2007250308B2 (en) * | 2006-05-12 | 2010-05-06 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Information signal coding |
NO340674B1 (no) * | 2006-05-12 | 2017-05-29 | Fraunhofer Ges Forschung | Koding av informasjonssignal |
US9754601B2 (en) | 2006-05-12 | 2017-09-05 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Information signal encoding using a forward-adaptive prediction and a backwards-adaptive quantization |
US10446162B2 (en) | 2006-05-12 | 2019-10-15 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | System, method, and non-transitory computer readable medium storing a program utilizing a postfilter for filtering a prefiltered audio signal in a decoder |
Also Published As
Publication number | Publication date |
---|---|
CA2555639C (en) | 2012-07-10 |
IL177164A (en) | 2010-11-30 |
ATE377243T1 (de) | 2007-11-15 |
EP1697929B1 (de) | 2007-10-31 |
AU2005213767B2 (en) | 2008-04-10 |
CN1918630A (zh) | 2007-02-21 |
KR100813193B1 (ko) | 2008-03-13 |
NO20064091L (no) | 2006-11-10 |
JP4444295B2 (ja) | 2010-03-31 |
JP2007522509A (ja) | 2007-08-09 |
CN1918630B (zh) | 2010-04-14 |
KR20060113999A (ko) | 2006-11-03 |
BRPI0506627B1 (pt) | 2018-10-09 |
BRPI0506627A (pt) | 2007-05-02 |
NO337836B1 (no) | 2016-06-27 |
EP1697929A1 (de) | 2006-09-06 |
ES2294685T3 (es) | 2008-04-01 |
HK1093814A1 (en) | 2007-03-09 |
DE502005001821D1 (de) | 2007-12-13 |
RU2337413C2 (ru) | 2008-10-27 |
DE102004007184B3 (de) | 2005-09-22 |
US20070043557A1 (en) | 2007-02-22 |
AU2005213767A1 (en) | 2005-08-25 |
RU2006132742A (ru) | 2008-03-20 |
CA2555639A1 (en) | 2005-08-25 |
US7464027B2 (en) | 2008-12-09 |
IL177164A0 (en) | 2006-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1687808B1 (de) | Audiocodierung | |
EP1697929B1 (de) | Verfahren und vorrichtung zum quantisieren eines informationssignals | |
DE60214027T2 (de) | Kodiervorrichtung und dekodiervorrichtung | |
EP1697928B1 (de) | Audiokodierung | |
EP2022043B1 (de) | Informationssignalcodierung | |
EP0954909B1 (de) | Verfahren zum codieren eines audiosignals | |
DE60214599T2 (de) | Skalierbare audiokodierung | |
EP1025646B1 (de) | Verfahren und vorrichtung zum codieren von audiosignalen sowie verfahren und vorrichtungen zum decodieren eines bitstroms | |
EP1145227B1 (de) | Verfahren und vorrichtung zum verschleiern eines fehlers in einem codierten audiosignal und verfahren und vorrichtung zum decodieren eines codierten audiosignals | |
DE60307252T2 (de) | Einrichtungen, verfahren und programme zur kodierung und dekodierung | |
DE69726661T2 (de) | Verfahren und vorrichtung zur kodierung eines digitalen informationssignales | |
EP1687810B1 (de) | Vorrichtung und verfahren zum ermitteln einer quantisierer-schrittweite | |
DE19736669C1 (de) | Verfahren und Vorrichtung zum Erfassen eines Anschlags in einem zeitdiskreten Audiosignal sowie Vorrichtung und Verfahren zum Codieren eines Audiosignals | |
WO2008058754A2 (de) | Vorrichtung zum kodieren und dekodieren | |
EP1112621A1 (de) | Vorrichtung und verfahren zum entropie-codieren von informationswörtern und vorrichtung und verfahren zum decodieren von entropie-codierten informationswörtern | |
WO2005083678A1 (de) | Vorrichtung und verfahren zum verarbeiten eines multikanalsignals | |
EP0910928A1 (de) | Codieren und decodieren von audiosignalen unter verwendung von intensity-stereo und prädiktion | |
WO2001043503A2 (de) | Verfahren und vorrichtung zum verarbeiten eines stereoaudiosignals | |
DE102012009435A1 (de) | Vorrichtung und Verfahren zum Streaming hybridcodierter Audiodaten | |
DE69636272T2 (de) | Audiokodier- und -dekodiergerät | |
DE202004002269U1 (de) | Vorrichtung zum Quantisieren eines Informationssignals | |
DE202004002264U1 (de) | Audiocodierung | |
EP0905918A2 (de) | Verfahren und Vorrichtung zum Kodieren von Audiosignalen | |
DE19515805C2 (de) | Verfahren zur Datenreduktion eines digitalisierten Tonsignals | |
DE3440615C1 (de) | Verfahren zum Übertragen und Speichern von Tonsignalen und Einrichtung zur Durchführung des Verfahrens |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DPEN | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2005715289 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11460433 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 177164 Country of ref document: IL |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2005213767 Country of ref document: AU Ref document number: 2181/KOLNP/2006 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020067015896 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2555639 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: PA/a/2006/009110 Country of ref document: MX |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200580004688.X Country of ref document: CN Ref document number: 2006552545 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: DE |
|
ENP | Entry into the national phase |
Ref document number: 2005213767 Country of ref document: AU Date of ref document: 20050210 Kind code of ref document: A |
|
WWP | Wipo information: published in national office |
Ref document number: 2005213767 Country of ref document: AU |
|
WWP | Wipo information: published in national office |
Ref document number: 2005715289 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006132742 Country of ref document: RU |
|
WWP | Wipo information: published in national office |
Ref document number: 1020067015896 Country of ref document: KR |
|
WWP | Wipo information: published in national office |
Ref document number: 11460433 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: PI0506627 Country of ref document: BR |
|
WWG | Wipo information: grant in national office |
Ref document number: 2005715289 Country of ref document: EP |