EP2816556A1 - Method and a decoder for attenuation of signal regions reconstructed with low accuracy - Google Patents
Method and a decoder for attenuation of signal regions reconstructed with low accuracy
- Publication number
- EP2816556A1 (application number EP14184428.2A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- attenuation
- spectral
- region
- spectral region
- attenuated
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/035—Scalar quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
Definitions
- the embodiments of the present invention relate to a decoder, an encoder for audio signals, and methods thereof.
- the audio signals may comprise speech in various conditions, music and mixed speech and music content.
- the embodiments relate to attenuation of spectral regions which are poorly reconstructed. This may for instance apply to regions which are coded with a low number of bits or with no bits assigned.
- Traditionally, mobile networks are designed to handle speech signals at low bitrates. This has been realised by using designated speech codecs which show good performance for speech signals at low bitrates, but have poor performance for music and mixed content. There is an increasing demand that the networks should also handle these signals, e.g. for music-on-hold and ringback tones.
- Audio codecs normally operate using a higher bitrate than the speech codecs.
- certain spectral regions of the signal may be coded with a low number of bits, and the desired target quality of the reconstructed signal can therefore not be guaranteed.
- the spectral regions refer to frequency domain regions, e.g., certain subbands of the frequency transformed signal block. For simplicity "spectral regions" will be used throughout the specification with the meaning of "part of short-time signal spectra".
- at low and moderate bitrates there will be spectral regions with no bits assigned.
- Such spectral regions have to be reconstructed at the decoder, by reusing information from the available coded spectral regions (e.g., noise-fill or bandwidth extension). In all these cases some attenuation of energy of low accuracy reconstructed regions is desirable to avoid loud signal distortions.
- the signal regions coded with either insufficient number of bits or with no bits assigned will be reconstructed with low accuracy and accordingly it is desired to attenuate these spectral regions.
- the insufficient number of bits is defined as a number of bits which are too low to be able to represent the spectral region with perceptually plausible quality. Note that this number will be dependent on the sensitivity of the audio perception for that region as well as the complexity of the signal region at hand.
- Attenuation of low-accuracy coded spectral regions is not a trivial problem.
- strong attenuation is desired to mask unwanted distortion.
- attenuation might be perceived by listeners as loudness loss in the reconstructed signal, change of frequency characteristics, or change in signal dynamics, e.g. because over time the coding algorithm can select different signal regions to noise-fill.
- conventional audio coding systems apply very conservative, i.e. limited, attenuation, which on average achieves a certain balance between the different types of distortion listed above.
- the embodiments of the present invention improve conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.
- a method for a decoder for determining an attenuation to be applied to an audio signal is provided.
- spectral regions to be attenuated are identified, subsequent identified spectral regions are grouped to form a continuous spectral region, a width of the continuous spectral region is determined, and an attenuation of the continuous spectral region adaptive to the width is applied such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- an attenuation controller of a decoder for determining an attenuation to be applied to an audio signal comprises an identifier unit configured to identify spectral regions to be attenuated, a grouping unit configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit configured to determine a width of the continuous spectral region.
- an application unit is provided, wherein the application unit is configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- a mobile terminal comprises a decoder with an attenuation controller.
- the attenuation controller comprises an identifier unit configured to identify spectral regions to be attenuated, a grouping unit configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit configured to determine a width of the continuous spectral region.
- an application unit is provided, wherein the application unit is configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- a network node comprises a decoder with an attenuation controller.
- the attenuation controller comprises an identifier unit configured to identify spectral regions to be attenuated, a grouping unit configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit configured to determine a width of the continuous spectral region.
- an application unit is provided, wherein the application unit is configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- An advantage with embodiments of the present invention is that the proposed adaptive attenuation allows for a significant reduction of audible noise in the reconstructed audio signal compared to conventional systems, which have restrictive constant attenuation.
- the decoder according to embodiments of the present invention can be used in an audio codec, audio decoder, which can be used in end user devices such as mobile devices (e.g. a mobile phone) or stationary PCs, or in network nodes where decoding occurs.
- the solution of the embodiments of the invention relates to an adaptive attenuation that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics. That is achieved in the attenuation controller in the decoder, as illustrated in the flowchart of figure 2.
- the flowchart of figure 2 shows a method in a decoder according to one embodiment.
- spectral regions to be attenuated are identified 201. This step may involve an examination of the reconstructed subvectors 201a. Subsequent identified spectral regions are grouped 202 to form a continuous spectral region and a width of the continuous spectral region is determined 203. Then, an attenuation of the continuous spectral region is applied 204, wherein the attenuation is adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region.
- An attenuation controller can be implemented in an audio decoder in a mobile terminal or in a network node.
- the audio decoder can be used in a real-time communication scenario targeting primarily speech or in a streaming scenario targeting primarily music.
- the audio codec where the attenuation controller is being implemented is a transform domain audio codec e.g. employing a pulse-based vector quantization scheme.
- a Factorial Pulse Coding (FPC) type quantizer is used but it is understood by a person skilled in the art that any vector quantizing scheme may be used.
- FPC Factorial Pulse Coding
- a short audio segment (20-40 ms), denoted input audio 100, is transformed to the frequency domain by a Modified Discrete Cosine Transform (MDCT) 105.
- MDCT Modified Discrete Cosine Transform
- the MDCT vector X(k) 107 obtained by the MDCT 105 is split into multiple bands, i.e. subvectors.
- any other suitable frequency transform may be used instead of MDCT, such as DFT or DCT.
- the energy in each band is calculated in an envelope calculator 110, which gives an approximation of the spectrum envelope.
- the spectrum envelope is quantized by an envelope quantizer 120, and the quantization indices are sent to the bitstream multiplexer in order to be stored or transmitted to a decoder.
- a residual vector 117 is obtained by scaling of the MDCT vectors using the inverse of the quantized envelope gains, e.g., the residual in each band is scaled to have unit Root-Mean-Square (RMS) Energy.
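The envelope calculation and residual normalization described above can be sketched as follows. This is only an illustrative sketch: the function names, the `band_edges` layout, and the `eps` guard are assumptions, and the envelope quantization step is omitted, so the unquantized band RMS stands in for the quantized envelope gain.

```python
import math

def band_rms(coeffs):
    """Root-mean-square value of one band of transform coefficients."""
    return math.sqrt(sum(c * c for c in coeffs) / len(coeffs))

def normalize_bands(spectrum, band_edges, eps=1e-12):
    """Split a spectrum into bands and scale each band to unit RMS.

    band_edges is a hypothetical list of (start, end) index pairs.
    Returns (envelope_gains, residual_bands).
    """
    gains, residual = [], []
    for start, end in band_edges:
        band = spectrum[start:end]
        g = band_rms(band)
        gains.append(g)
        # Guard against all-zero bands; eps is an illustrative choice.
        residual.append([c / max(g, eps) for c in band])
    return gains, residual

# Made-up 8-coefficient spectrum split into two 4-coefficient bands.
spectrum = [3.0, -3.0, 3.0, -3.0, 0.5, 0.5, -0.5, -0.5]
gains, residual = normalize_bands(spectrum, [(0, 4), (4, 8)])
print(gains)     # [3.0, 0.5]
print(residual)  # every band now has unit RMS
```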
- RMS Root-Mean-Square
- Bits for a quantizer performing a quantization of different residual subvectors 125 are assigned by a bit allocator 130 based on quantized envelope energies. Due to a limited bit-budget, some of the subvectors receive no bits.
- the residual subvectors are quantized, and the quantization indices are transmitted to the decoder. Residual quantization is performed with a Factorial Pulse Coding (FPC) scheme.
- a multiplexer 135 multiplexes the quantization indices of the envelope and the subvector into a bitstream 140 which may be stored or transmitted to the decoder.
- residual subvectors with no bits assigned are not coded, but noise-filled at the decoder. This can be achieved by creating a virtual codebook from coded subvectors or any other noise-fill algorithm. The noise-fill creates content in the non-coded subvectors.
- the decoder receives the bitstream 140 from the encoder at a demultiplexer 145.
- the quantized envelope gains are reconstructed by the envelope decoder 160.
- the quantized envelope gains are used by the bit allocator 155 which produces a bit allocation which is used by the subvector decoder 150 to produce the decoded residual subvectors.
- the sequence of the decoded residual subvectors forms a normalized spectrum. Due to the restricted bit budget, some of the subvectors will not be represented and will yield zeroes or holes in the spectrum. These spectral holes are filled by a noise filling algorithm 165.
- the noise filling algorithm may also include a BWE algorithm, which may reconstruct the spectrum above the last encoded band.
- a fixed envelope attenuation is determined 175.
- the quantized envelope gains are modified using the determined attenuation and an MDCT spectrum is reconstructed by scaling the decoded residual subvectors using these gains 170.
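A minimal sketch of this reconstruction step is given below. The text does not specify how the attenuation modifies the gains, so the sketch assumes a linear multiplicative factor per subvector; the function name and the example values are hypothetical.

```python
def reconstruct_spectrum(residual_bands, envelope_gains, attenuation):
    """Scale each decoded residual band by its attenuated envelope gain.

    attenuation[b] is assumed to be a linear factor in (0, 1];
    1.0 leaves band b unchanged.
    """
    spectrum = []
    for band, gain, att in zip(residual_bands, envelope_gains, attenuation):
        spectrum.extend(gain * att * c for c in band)
    return spectrum

# Two made-up bands; the second one is attenuated by a factor of 0.5.
decoded = reconstruct_spectrum(
    residual_bands=[[1.0, -1.0], [1.0, 1.0]],
    envelope_gains=[2.0, 4.0],
    attenuation=[1.0, 0.5],
)
print(decoded)  # [2.0, -2.0, 2.0, 2.0]
```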
- a reconstructed audio frame 190 is produced by inverse MDCT 185.
- the embodiments of the present invention relate to the envelope attenuation step described above, where additional weighting of the envelope gains is added to control the energy of subvectors quantized with low precision, that is, subvectors coded with a low number of bits or non-coded noise-filled subvectors.
- the subvectors coded with a low number of bits imply that the number of bits is insufficient to achieve a desirable accuracy.
- the insufficient number of bits is defined as a number of bits which are too low to be able to represent the spectral region with perceptually plausible quality. Note that this number will be dependent on the sensitivity of the audio perception for that region as well as the complexity of the signal region at hand.
- An overview of a decoder in such a scheme with the algorithm according to embodiments is shown in figure 3a.
- the decoder of figure 3a corresponds to the decoder of figure 1 with the addition of an attenuation controller 300 according to embodiments of the present invention.
- the attenuation controller 300 controls the adaptive attenuation according to embodiments of the invention.
- the attenuation controller is configured to identify spectral regions to be attenuated, to group the identified spectral regions to form a continuous spectral region, to determine a width of the continuous spectral region, and to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region.
- the low precision spectral regions to be attenuated are according to the embodiments either coded with a low number of bits or with no bits assigned.
- the step of identifying low precision spectral regions may also comprise an analysis of the reconstructed subvectors.
- the first step 201 is to examine 201a the reconstructed subvectors to identify the spectral regions of the decoded frequency domain residual that are represented with low precision.
- the spectral region is said to be represented with low precision when the assigned number of bits for the said reconstructed subvector is below a predetermined threshold.
- a pulse coding scheme is employed to encode the spectral subvectors and a spectral region is said to be represented with low precision if it consists of one or more consecutive subvectors where the number of pulses P(b) is below a predetermined threshold.
- the low precision spectral region comprises one or more consecutive subvectors where the number of pulses P(b) used to quantize the subvector fulfills equation 1.
- $P(b) < \Theta, \quad b = 1, 2, \ldots, N_b \qquad (1)$
- $N_b$ is the number of subvectors and $\Theta$ is the predetermined threshold
- the number of pulses can be converted to a number of bits.
- more elaborate methods may be applied to identify the low precision regions, e.g. by using the bitrate in conjunction with analysis of the synthesized shape vector. Such a setup is illustrated in figure 3b, where the synthesized shape vector is input to the envelope attenuator.
- the analysis of the synthesized shape may e.g. involve measuring the peakiness of the synthesized shape, as a peaky synthesis for higher rates may indicate a peaky input signal and hence better input/synthesis coherence.
- the estimated accuracy of the decoded subvector may be used to identify the corresponding band as a low resolution band and decide a suitable attenuation.
- Subvectors that received zero bits in the bit allocation and are noise-filled may also be included in this category.
- the identified spectral regions are grouped 202 and the width of the grouped spectral region is determined 203 by e.g. counting the number of subvectors in the grouped region.
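The identification, grouping and width determination (steps 201 to 203) can be sketched as below. The function name, the threshold value and the pulse counts are made up for illustration; subvectors with zero pulses stand in for noise-filled subvectors.

```python
def find_low_precision_regions(pulses, threshold):
    """Group consecutive subvectors with fewer than `threshold` pulses.

    Returns a list of (first_index, width) tuples, width in subvectors.
    """
    regions = []
    start = None
    for b, p in enumerate(pulses):
        if p < threshold:
            if start is None:
                start = b          # a new low precision region begins
        elif start is not None:
            regions.append((start, b - start))
            start = None           # the region ended at subvector b - 1
    if start is not None:          # region running to the last subvector
        regions.append((start, len(pulses) - start))
    return regions

pulses = [5, 1, 6, 0, 1, 0, 7, 8]  # made-up pulse counts per subvector
regions = find_low_precision_regions(pulses, threshold=2)
print(regions)  # [(1, 1), (3, 3)]
```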
- the attenuation 204 is dependent on the width of the low precision spectral region: the attenuation should decrease as the width increases. That implies that a narrow region allows a larger attenuation than a wider region.
- the attenuation can be obtained in two steps. First, an initial attenuation factor A(b) is decided per subvector b . For noise filled subvectors, the attenuation factor is decided based on the number of consecutive noise filling subvectors. For the low precision coded vectors an accuracy function may be used to define the initial attenuation. When the low precision regions are identified, the attenuation level for each region is estimated using the bandwidth of the low precision region. The attenuation factors are adjusted to form A'(b) which take into consideration the low precision region bandwidth.
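The two-step procedure can be illustrated with the sketch below. The limiting function used here is purely hypothetical (a linear ramp with made-up parameters `c` and `t`), as is the choice to express A(b) as a linear gain factor where 1.0 means no attenuation; the sketch only demonstrates the stated property that wider regions receive less attenuation.

```python
def limit(width, c=3, t=8):
    """Hypothetical limiter: 0.0 for width <= c, ramping to 1.0 at c + t."""
    return min(1.0, max(0.0, (width - c) / t))

def adjust_attenuation(initial, regions, c=3, t=8):
    """Return A'(b): pull the initial gain A(b) toward 1.0 in wide regions.

    initial: list of linear gain factors A(b), 1.0 meaning no attenuation.
    regions: list of (first_index, width) low precision regions.
    """
    adjusted = list(initial)
    for start, width in regions:
        lam = limit(width, c, t)
        for b in range(start, start + width):
            # lam = 0 keeps the target attenuation, lam = 1 removes it.
            adjusted[b] = initial[b] + (1.0 - initial[b]) * lam
    return adjusted

# Regions of width 1, 7 and 3 (positions are made up), target gain 0.5.
regions = [(1, 1), (4, 7), (13, 3)]
initial = [1.0] * 16
for start, width in regions:
    for b in range(start, start + width):
        initial[b] = 0.5
adjusted = adjust_attenuation(initial, regions)
print(adjusted[1], adjusted[4], adjusted[13])  # 0.5 0.75 0.5
```

With these made-up parameters the narrow regions keep the full target attenuation, while the 7-subvector region is only attenuated to 0.75.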
- An example attenuation limiting function A(b) depending on the bandwidth b of the low precision region is shown in figure 4.
- Figure 5a shows an example of the first 16 subvectors and the number of pulses used to quantize each subvector together with the low precision regions identified by the algorithm and the region widths in subvectors. Subsequent low precision regions are grouped to form a continuous spectral region 501;502;503 and the width of the continuous spectral region is determined. The width of each region is used for determining the attenuation to be applied.
- Figure 5b shows the impact of the algorithm on the corresponding subvector energies. One can see how the algorithm limits the attenuation in the region 512 that has a width of 7 subvectors while it allows target attenuation of the regions 511 and 513 that are 1 and 3 subvectors wide respectively.
- the attenuation decreases with the width of the low precision spectral region. Since the bands are non-uniform with increasing bandwidth for higher frequencies and the width is defined in number of bands, the scheme will have an implicit frequency dependency. Since the bandwidths correspond to the perceptual frequency resolution, the perceived attenuation should be roughly constant across the spectrum. However, one could also consider making this frequency dependency explicit.
- ⁇ is L / 4, where L is the number of coefficients in the MDCT spectrum.
- the equation (4) will allow more attenuation for higher frequencies, similar to what is already obtained in this embodiment.
- One could also make the inverse relation w.r.t. frequency like so ⁇ w ⁇ f ⁇ 0 , w ⁇ C 1 , w ⁇ ⁇ ⁇ f - C / T > 1 w ⁇ ⁇ ⁇ f - C / T , otherwise where ⁇ denotes another tuning parameter.
- the attenuation will be restricted for higher frequencies. This may be desirable if it is found that there is less benefit of attenuation for higher frequencies.
- the concept described above can be restricted to the noise-filled regions only if, due to specifics of the quantizer, sub-bands with a low number of assigned bits are treated separately.
- the concept described in conjunction with the first embodiment can operate without noise-filled bands, e.g., if the codec operates at high-bitrate and noise-filled bands do not exist.
- the reconstructed spectrum also includes a region which is reconstructed using a bandwidth extension (BWE) algorithm.
- BWE bandwidth extension
- the concept of adaptive attenuation of low accuracy reconstructed signal regions can be used in combination with a BWE module.
- Modern BWE algorithms apply certain attenuation on reconstructed spectral regions that are detected to be very different from the corresponding regions in the target signal. Such attenuation can be also made adaptive according to the concept described above.
- the BWE algorithm may be an integral part of the noise-filling unit 310 as disclosed in figure 3a.
- the BWE algorithm modified according to the embodiments can be part of both time domain codecs and transform domain codecs.
- the decoder of an audio communication/compression system can implement the adaptive attenuation algorithm according to embodiments without explicitly accounting for regions that are noise-filled, bandwidth extended, or quantized with a low number of bits. Instead, candidate regions for attenuation can be selected based on an encoder side subvector analysis using a distance measure between the reconstructed subvector and the input subvector. The distance measure may also be calculated between the reconstruction and synthesis of the residual subvectors.
- A schematic overview of an encoder performing such analysis using a subvector analysis unit is illustrated in figure 6a. If the error in a certain frequency region is above a certain threshold, the region is a potential candidate for attenuation.
- the error measure can be for instance minimum mean squared error of the synthesized spectrum relative to the input spectrum, the energy error or a combination of error criteria.
- Such analysis can be used for identifying the regions for attenuation and/or deciding the attenuation for the identified regions.
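As an illustration of such an encoder side analysis, the sketch below flags bands whose mean squared error between input and synthesized spectrum exceeds a threshold. The function names, the band layout and the threshold are assumptions, and mean squared error is only one of the error criteria mentioned above.

```python
def band_mse(target, synth):
    """Mean squared error between one input band and its synthesis."""
    return sum((x - y) ** 2 for x, y in zip(target, synth)) / len(target)

def attenuation_candidates(target_bands, synth_bands, threshold):
    """Indices of bands whose mean squared error exceeds `threshold`."""
    return [
        b
        for b, (x, y) in enumerate(zip(target_bands, synth_bands))
        if band_mse(x, y) > threshold
    ]

# Made-up input and synthesized bands; only band 1 has a large error.
target_bands = [[1.0, -1.0], [1.0, 1.0], [0.5, 0.5]]
synth_bands = [[1.0, -1.0], [-1.0, 1.0], [0.5, 0.0]]
candidates = attenuation_candidates(target_bands, synth_bands, threshold=0.5)
print(candidates)  # [1]
```

In a codec following the text, the resulting indices (or a parameter derived from them) would have to be sent in the bitstream so the decoder can reproduce the region identification.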
- the encoder side analysis requires additional parameters to be added to the bitstream in order to reproduce the region identification and attenuation in the decoder.
- the decoder in such an embodiment would receive a result of the encoder side analysis via an encoded parameter through the bitstream and include the parameter in the attenuation control. Such a decoder is depicted in figure 6b.
- the attenuation controller which can be implemented in a decoder of e.g. a user equipment as shown in figure 7a comprises according to one embodiment an identifier unit 703 configured to identify spectral regions to be attenuated, a grouping unit 704 configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit 705 configured to determine a width of the continuous spectral region.
- an application unit 706 configured to apply an attenuation of the continuous spectral region adaptive to the width is provided in the attenuation controller 300. In this way an increased width decreases the attenuation of the continuous spectral region.
- the spectral regions to be attenuated are coded with either a low number of bits or with no bits assigned.
- the identifier unit 703 configured to identify spectral regions that are coded with either a low number of bits or no bits assigned may further be configured to examine reconstructed subvectors to identify the spectral regions of the decoded frequency domain residual that are represented with low precision.
- a spectral region may be said to be represented with low precision when the assigned number of bits for the said reconstructed subvector is below a predetermined threshold.
- a pulse coding scheme is employed to encode the spectral subvectors and a spectral region is said to be represented with low precision if it consists of one or more consecutive subvectors where the number of pulses P(b) is below a predetermined threshold.
- spectral regions that are coded with no bits assigned are identified and/or spectral regions that are coded with a low number of bits are identified.
- the reconstructed spectrum can also include a region which is reconstructed using a bandwidth extension algorithm.
- the attenuation controller 300 comprises an input/output unit 710 configured to receive an analysis from the encoder, and the identifier unit 703 is further configured to identify the spectral regions to be attenuated based on the received analysis.
- a distance measure between a reconstructed synthesis signal and an input target signal is used by the encoder. If the distance measure in a certain frequency region is above a certain threshold, the spectral region is a potential candidate for attenuation.
- the units of the attenuation controller 300 of the decoder can be implemented by a processor 700 configured to process software portions providing the functionality of the units as illustrated in figure 7b.
- the software portions are stored in a memory 701 and retrieved from the memory when being processed.
- the input/output unit 710 is configured to receive input parameters from e.g. bit allocation and envelope decoding and to send information to envelope shaping.
- a mobile device 800 comprising the attenuation controller 300 in a decoder according to the embodiments is provided as illustrated in figure 8.
- the attenuation controller 300 of the embodiments can also be implemented in a decoder in a network node as illustrated in figure 9.
Abstract
Description
- The embodiments of the present invention relate to a decoder, an encoder for audio signals, and methods thereof. The audio signals may comprise speech in various conditions, music and mixed speech and music content. In particular, the embodiments relate to attenuation of spectral regions which are poorly reconstructed. This may for instance apply to regions which are coded with a low number of bits or with no bits assigned.
- Traditionally, mobile networks are designed to handle speech signals at low bitrates. This has been realised by using designated speech codecs which show good performance for speech signals at low bitrates, but have poor performance for music and mixed content. There is an increasing demand that the networks should also handle these signals, e.g. for music-on-hold and ringback tones. Mobile internet applications further drive the need for low bitrate audio coding for streaming applications. Audio codecs normally operate using a higher bitrate than the speech codecs. When constraining the bit budget for the audio codec, certain spectral regions of the signal may be coded with a low number of bits, and the desired target quality of the reconstructed signal can therefore not be guaranteed. The spectral regions refer to frequency domain regions, e.g., certain subbands of the frequency transformed signal block. For simplicity "spectral regions" will be used throughout the specification with the meaning of "part of short-time signal spectra".
- Moreover, at low- and moderate bitrates there will be spectral regions with no bits assigned. Such spectral regions have to be reconstructed at the decoder, by reusing information from the available coded spectral regions (e.g., noise-fill or bandwidth extension). In all these cases some attenuation of energy of low accuracy reconstructed regions is desirable to avoid loud signal distortions.
- The signal regions coded with either insufficient number of bits or with no bits assigned will be reconstructed with low accuracy and accordingly it is desired to attenuate these spectral regions. Here, the insufficient number of bits is defined as a number of bits which are too low to be able to represent the spectral region with perceptually plausible quality. Note that this number will be dependent on the sensitivity of the audio perception for that region as well as the complexity of the signal region at hand.
- However, attenuation of low-accuracy coded spectral regions is not a trivial problem. On one hand, strong attenuation is desired to mask unwanted distortion. On the other hand, such attenuation might be perceived by listeners as loudness loss in the reconstructed signal, a change of frequency characteristics, or a change in signal dynamics, e.g. because the coding algorithm can select different signal regions to noise-fill over time. For these reasons conventional audio coding systems apply very conservative, i.e. limited, attenuation, which on average achieves a certain balance between the different types of distortion listed above.
- The embodiments of the present invention improve conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.
- According to a first aspect a method for a decoder for determining an attenuation to be applied to an audio signal is provided. In the method, spectral regions to be attenuated are identified, subsequent identified spectral regions are grouped to form a continuous spectral region, a width of the continuous spectral region is determined, and an attenuation of the continuous spectral region adaptive to the width is applied such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- According to a second aspect, an attenuation controller of a decoder for determining an attenuation to be applied to an audio signal is provided. The attenuation controller comprises an identifier unit configured to identify spectral regions to be attenuated, a grouping unit configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit configured to determine a width of the continuous spectral region. Further, an application unit is provided, wherein the application unit is configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- According to a third aspect, a mobile terminal is provided. The mobile terminal comprises a decoder with an attenuation controller. The attenuation controller comprises an identifier unit configured to identify spectral regions to be attenuated, a grouping unit configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit configured to determine a width of the continuous spectral region. Further, an application unit is provided, wherein the application unit is configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- According to a fourth aspect, a network node is provided. The network node comprises a decoder with an attenuation controller. The attenuation controller comprises an identifier unit configured to identify spectral regions to be attenuated, a grouping unit configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit configured to determine a width of the continuous spectral region. Further, an application unit is provided, wherein the application unit is configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- An advantage with embodiments of the present invention is that the proposed adaptive attenuation allows for a significant reduction of audible noise in the reconstructed audio signal compared to conventional systems, which have restrictive constant attenuation.
Fig. 1 illustrates schematically an overview of a MDCT transform based encoder and a decoder system. -
Fig. 2 is a flowchart of a method according to an embodiment of the present invention. -
Figs. 3a and 3b illustrate overviews of a decoder containing an attenuation control according to embodiments of the present invention. -
Fig. 4 shows an attenuation limit function which can be used by the embodiments and the resulting gain modification when applying the attenuation limiting function. -
Fig. 5a shows an example of 16 subvectors with pulse allocation, wherein low precision regions are identified and the width of the respective region is determined according to embodiments of the present invention. -
Fig. 5b shows the impact of the attenuation when the adaptive attenuation is applied according to embodiments of the present invention. -
Fig. 6a illustrates schematically an overview of an encoder containing a subvector analysis unit, wherein the result of the subvector analysis unit is used by the decoder according to embodiments of the present invention. -
Fig. 6b illustrates an overview of a decoder containing an attenuation control according to an embodiment which is done based on a parameter from the bitstream which corresponds to an encoder analysis. -
Fig. 7a and Fig. 7b illustrate schematically an attenuation controller according to embodiments of the present invention. -
Fig. 8 illustrates a mobile terminal with the attenuation controller of embodiments of the present invention. -
Fig. 9 illustrates a network node with the attenuation controller of embodiments of the present invention. -
- The decoder according to embodiments of the present invention can be used in an audio codec or audio decoder, which can be used in end user devices such as mobile devices (e.g. a mobile phone) or stationary PCs, or in network nodes where decoding occurs. The solution of the embodiments of the invention relates to an adaptive attenuation that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics. That is achieved in the attenuation controller in the decoder, as illustrated in the flowchart of figure 2.
- The flowchart of figure 2 shows a method in a decoder according to one embodiment. First, spectral regions to be attenuated are identified 201. This step may involve an examination of the reconstructed subvectors 201a. Subsequent identified spectral regions are grouped 202 to form a continuous spectral region and a width of the continuous spectral region is determined 203. Then, an attenuation of the continuous spectral region is applied 204, wherein the attenuation is adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region.
- An attenuation controller according to embodiments can be implemented in an audio decoder in a mobile terminal or in a network node. The audio decoder can be used in a real-time communication scenario targeting primarily speech or in a streaming scenario targeting primarily music.
- In one embodiment, the audio codec where the attenuation controller is implemented is a transform domain audio codec, e.g. employing a pulse-based vector quantization scheme. In this exemplary embodiment, a Factorial Pulse Coding (FPC) type quantizer is used, but it is understood by a person skilled in the art that any vector quantization scheme may be used. A schematic overview of such an audio codec is shown in figure 1 and a short description of the steps involved is given below.
- A short audio segment (20-40 ms), denoted input audio 100, is transformed to the frequency domain by a Modified Discrete Cosine Transform (MDCT) 105.
- The MDCT vector X(k) 107 obtained by the MDCT 105 is split into multiple bands, i.e. subvectors. Note that any other suitable frequency transform may be used instead of MDCT, such as DFT or DCT.
- The energy in each band is calculated in an envelope calculator 110, which gives an approximation of the spectrum envelope.
- The spectrum envelope is quantized by an envelope quantizer 120, and the quantization indices are sent to the bitstream multiplexer in order to be stored or transmitted to a decoder.
- A residual vector 117 is obtained by scaling of the MDCT vectors using the inverse of the quantized envelope gains, e.g., the residual in each band is scaled to have unit Root-Mean-Square (RMS) energy.
- Bits for a quantizer performing a quantization of different residual subvectors 125 are assigned by a bit allocator 130 based on quantized envelope energies. Due to a limited bit budget, some of the subvectors receive no bits.
- Based on the number of available bits, the residual subvectors are quantized, and the quantization indices are transmitted to the decoder. Residual quantization is performed with a Factorial Pulse Coding (FPC) scheme. A multiplexer 135 multiplexes the quantization indices of the envelope and the subvectors into a bitstream 140 which may be stored or transmitted to the decoder.
- It should be noted that residual subvectors with no bits assigned are not coded, but noise-filled at the decoder. This can be achieved by creating a virtual codebook from coded subvectors or by any other noise-fill algorithm. The noise-fill creates content in the non-coded subvectors.
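The envelope calculation and residual normalization steps listed above can be illustrated with a minimal sketch. It assumes equal-width bands and skips envelope quantization, so the unquantized RMS gains are used for scaling; the function names and the `eps` guard are hypothetical and not taken from the specification.

```python
import math

def split_bands(spectrum, band_size):
    """Split a coefficient vector into bands (assumed equal width for illustration)."""
    return [spectrum[i:i + band_size] for i in range(0, len(spectrum), band_size)]

def band_rms(band):
    """Root-mean-square value of one band (subvector)."""
    return math.sqrt(sum(x * x for x in band) / len(band))

def normalize_bands(bands, eps=1e-12):
    """Return per-band RMS gains and residual bands scaled to unit RMS.

    Assumption: the gains are used unquantized; a real codec would scale
    with the inverse of the quantized envelope gains instead.
    """
    gains = [band_rms(b) for b in bands]
    residuals = [[x / max(g, eps) for x in b] for b, g in zip(bands, gains)]
    return gains, residuals

# Two bands of four coefficients each (made-up values).
spectrum = [3.0, -3.0, 3.0, -3.0, 0.5, 0.5, -0.5, 0.5]
gains, residuals = normalize_bands(split_bands(spectrum, 4))
```

After normalization every residual band has an RMS of one, so the shape and the energy of each band can be coded separately.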
- With further reference to figure 1, the decoder receives the bitstream 140 from the encoder at a demultiplexer 145. The quantized envelope gains are reconstructed by the envelope decoder 160. The quantized envelope gains are used by the bit allocator 155, which produces a bit allocation that is used by the subvector decoder 150 to produce the decoded residual subvectors. The sequence of the decoded residual subvectors forms a normalized spectrum. Due to the restricted bit budget, some of the subvectors will not be represented and will yield zeroes or holes in the spectrum. These spectral holes are filled by a noise filling algorithm 165. The noise filling algorithm may also include a BWE algorithm, which may reconstruct the spectrum above the last encoded band. Using the bit allocation, a fixed envelope attenuation is determined 175. The quantized envelope gains are modified using the determined attenuation and an MDCT spectrum is reconstructed by scaling the decoded residual subvectors using these gains 170. Finally, a reconstructed audio frame 190 is produced by inverse MDCT 185.
- The embodiments of the present invention are related to the envelope attenuation described above, where additional weighting of the envelope gains is added to control the energy of subvectors quantized with low precision, that is, subvectors coded with a low number of bits or non-coded noise-filled subvectors. Subvectors coded with a low number of bits are those for which the number of bits is insufficient to achieve a desirable accuracy. Thus, an insufficient number of bits is defined as a number of bits which is too low to represent the spectral region with perceptually plausible quality. Note that this number will be dependent on the sensitivity of the audio perception for that region as well as the complexity of the signal region at hand.
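As a rough illustration of the envelope shaping step described above, the sketch below scales each decoded residual subvector by its envelope gain multiplied by a per-band attenuation factor. The function name and the representation of the attenuation as a multiplicative factor in (0, 1] are assumptions made for this example.

```python
def reconstruct_spectrum(residuals, envelope_gains, attenuation):
    """Scale unit-RMS residual bands by attenuated envelope gains.

    residuals:      list of bands (lists of coefficients)
    envelope_gains: one decoded gain per band
    attenuation:    one factor per band, 1.0 meaning no attenuation (assumed convention)
    """
    spectrum = []
    for band, gain, att in zip(residuals, envelope_gains, attenuation):
        spectrum.extend(x * gain * att for x in band)
    return spectrum

# Second band is attenuated to half of its decoded envelope gain.
decoded = reconstruct_spectrum(
    [[1.0, -1.0], [1.0, 1.0]],  # residual bands
    [2.0, 4.0],                 # envelope gains
    [1.0, 0.5],                 # attenuation factors
)
```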
- An overview of a decoder in such a scheme with the algorithm according to embodiments is shown in figure 3a. The decoder of figure 3a corresponds to the decoder of figure 1 with the addition of an attenuation controller 300 according to embodiments of the present invention. The attenuation controller 300 controls the adaptive attenuation according to embodiments of the invention.
- Accordingly, the attenuation controller is configured to identify spectral regions to be attenuated, to group the identified spectral regions to form a continuous spectral region, to determine a width of the continuous spectral region, and to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region.
- The low precision spectral regions to be attenuated are according to the embodiments either coded with a low number of bits or with no bits assigned. The step of identifying low precision spectral regions may also comprise an analysis of the reconstructed subvectors.
- With reference again to figure 2, which is a flowchart of a method according to an embodiment of the present invention, the first step 201 is to examine 201a the reconstructed subvectors to identify the spectral regions of the decoded frequency domain residual that are represented with low precision. According to one embodiment, the spectral region is said to be represented with low precision when the number of bits assigned to the reconstructed subvector is below a predetermined threshold.
- According to another embodiment, a pulse coding scheme is employed to encode the spectral subvectors and a spectral region is said to be represented with low precision if it consists of one or more consecutive subvectors where the number of pulses P(b) is below a predetermined threshold.
- Hence, it is determined whether the spectral region consists of one or more consecutive subvectors where the number of pulses P(b) used to quantize the subvector fulfills equation 1, i.e. is below the predetermined threshold.
- The accuracy of a decoded subvector may also be estimated by analysing the synthesized shape, as in figure 3b, where the synthesized shape vector is input to the envelope attenuator. The analysis of the synthesized shape may e.g. involve measuring the peakiness of the synthesized shape, as a peaky synthesis for higher rates may indicate a peaky input signal and hence better input/synthesis coherence. The estimated accuracy of the decoded subvector may be used to identify the corresponding band as a low resolution band and decide a suitable attenuation.
- Subvectors that received zero bits in the bit allocation and are noise-filled may also be included in this category.
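The pulse-count criterion above can be sketched as a simple per-subvector test. The threshold value and the pulse counts below are made up for illustration; the specification only states that the threshold is predetermined.

```python
def low_precision_flags(pulses, threshold):
    """Flag subvectors whose pulse count P(b) is below the threshold.

    Subvectors that received no bits (zero pulses, noise-filled at the
    decoder) are flagged as well, since 0 is below any positive threshold.
    """
    return [p < threshold for p in pulses]

# Hypothetical pulse allocation for eight subvectors and a hypothetical threshold.
pulses = [9, 0, 1, 7, 0, 0, 2, 8]
flags = low_precision_flags(pulses, threshold=3)
```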
- Returning to figure 2, for each identified low precision spectral region, the identified spectral regions are grouped 202 and the width of the grouped spectral region is determined 203, e.g. by counting the number of subvectors in the grouped region.
- To obtain the best possible audio quality, it is desirable to attenuate the low precision regions of the spectrum. According to embodiments, the attenuation 204 is dependent on the width of the low precision spectral region. Hence the attenuation should decrease as the width increases. That implies that a narrow region allows a larger attenuation than a wider region.
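The grouping and width determination steps can be sketched as a run-length scan over per-subvector flags. This is an illustrative sketch only; the function name and the `(start, width)` representation are assumptions, with the width counted in subvectors as suggested above.

```python
def group_regions(flags):
    """Group consecutive flagged subvectors into (start_index, width) regions."""
    regions = []
    start = None
    for b, flagged in enumerate(flags):
        if flagged and start is None:
            start = b                          # a new region begins here
        elif not flagged and start is not None:
            regions.append((start, b - start)) # region ended at b - 1
            start = None
    if start is not None:                      # region runs to the last subvector
        regions.append((start, len(flags) - start))
    return regions

# Sixteen subvectors with three low precision regions (made-up flags).
flags = [False, True, True, False, False, True, True, True,
         True, True, True, True, False, True, False, False]
regions = group_regions(flags)
```

Each region then carries the width that the attenuation step uses to decide how strongly the region may be attenuated.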
- As an example, the attenuation can be obtained in two steps. First, an initial attenuation factor A(b) is decided per subvector b. For noise-filled subvectors, the attenuation factor is decided based on the number of consecutive noise-filled subvectors. For the low precision coded vectors an accuracy function may be used to define the initial attenuation. Second, when the low precision regions are identified, the attenuation level for each region is estimated using the bandwidth of the low precision region. The attenuation factors are adjusted to form A'(b), which takes into consideration the low precision region bandwidth.
- An example attenuation limiting function A(b) depending on the bandwidth b of the low precision region is shown in figure 4. The resulting gain modification A'(b), also shown in figure 4, can be described using equation 2, where α(w) is defined in equation 3, where w denotes the bandwidth in number of subvectors of the low precision region, and C and T are constants which control the adjustment function α(w). In this example, it was found that suitable values were C = 6 and T = 5.
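The two-step adjustment can be sketched as follows. Equations 2 and 3 are not reproduced in this text, so the limiting function below is a purely hypothetical stand-in (a linear ramp with made-up parameters `floor_gain` and `full_width`) and does not use the constants C and T. Combining the initial factor and the limit with `max` is likewise an assumption, as is treating the attenuation factor as a multiplicative gain in (0, 1]. The sketch only demonstrates the stated property that a wider region receives less attenuation.

```python
def width_limit(width, floor_gain=0.5, full_width=8):
    """Hypothetical limit: the smallest gain allowed for a region of this width.

    Rises linearly from floor_gain at width 1 to 1.0 (no attenuation) at
    full_width. This is NOT the specification's alpha(w).
    """
    if width >= full_width:
        return 1.0
    step = (1.0 - floor_gain) / (full_width - 1)
    return floor_gain + step * (width - 1)

def adjust_gains(initial, regions):
    """Form adjusted factors A'(b) from initial factors A(b).

    initial: per-subvector gain factors in (0, 1], 1.0 meaning no attenuation
    regions: (start_index, width) pairs of low precision regions
    """
    adjusted = list(initial)
    for start, width in regions:
        limit = width_limit(width)
        for b in range(start, start + width):
            adjusted[b] = max(initial[b], limit)  # wider region, higher floor
    return adjusted

# A narrow region (width 2) and a wide region (width 7), both with A(b) = 0.5.
initial = [1.0, 0.5, 0.5, 1.0] + [0.5] * 7 + [1.0]
adjusted = adjust_gains(initial, [(1, 2), (4, 7)])
```

With these made-up parameters the narrow region keeps a gain close to its initial factor, while the wide region is pulled towards 1.0, i.e. it is attenuated less.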
Figure 5a shows an example of the first 16 subvectors and the number of pulses used to quantize each subvector, together with the low precision regions identified by the algorithm and the region widths in subvectors. Subsequent low precision regions are grouped to form a continuous spectral region 501; 502; 503 and the width of the continuous spectral region is determined. The width of each region is used for determining the attenuation to be applied. Figure 5b shows the impact of the algorithm on the corresponding subvector energies. One can see how the algorithm limits the attenuation in the region 512 that has a width of 7 subvectors while it allows target attenuation of the regions
where f denotes the frequency bin of the spectrum and β is a tuning parameter. One possible value for β is L/4, where L is the number of coefficients in the MDCT spectrum. The equation (4) will allow more attenuation for higher frequencies, similar to what is already obtained in this embodiment. One could also make the inverse relation w.r.t. frequency like so
where γ denotes another tuning parameter. In this case the attenuation will be restricted for higher frequencies. This may be desirable if it is found that there is less benefit of attenuation for higher frequencies.
- In a further embodiment, the concept described above can be restricted to the noise-filled regions only, if, due to specifics of the quantizer, sub-bands with a low number of assigned bits are treated separately.
- In an alternative embodiment, the concept described in conjunction with the first embodiment can operate without noise-filled bands, e.g., if the codec operates at high-bitrate and noise-filled bands do not exist.
- In a further embodiment, the reconstructed spectrum also includes a region which is reconstructed using a bandwidth extension (BWE) algorithm. The concept of adaptive attenuation of low accuracy reconstructed signal regions can be used in combination with a BWE module. Modern BWE algorithms apply certain attenuation on reconstructed spectral regions that are detected to be very different from the corresponding regions in the target signal. Such attenuation can also be made adaptive according to the concept described above. The BWE algorithm may be an integral part of the noise-filling unit 310 as disclosed in figure 3a. The BWE algorithm modified according to the embodiments can be part of both time domain codecs and transform domain codecs.
- In a further embodiment, the decoder of an audio communication/compression system can implement the adaptive attenuation algorithm according to embodiments without explicitly accounting for regions that are noise-filled, bandwidth extended, or quantized with a low number of bits. Instead, candidate regions for attenuation can be selected based on an encoder side subvector analysis using a distance measure between the reconstructed subvector and the input subvector. The distance measure may also be calculated between the reconstruction and synthesis of the residual subvectors. A schematic overview of an encoder performing such analysis using a subvector analysis unit is illustrated in figure 6a. If the error in a certain frequency region is above a certain threshold, the region is a potential candidate for attenuation. The error measure can be for instance minimum mean squared error of the synthesized spectrum relative to the input spectrum, the energy error or a combination of error criteria. Such analysis can be used for identifying the regions for attenuation and/or deciding the attenuation for the identified regions. The encoder side analysis requires additional parameters to be added to the bitstream in order to reproduce the region identification and attenuation in the decoder. The decoder in such an embodiment would receive a result of the encoder side analysis via an encoded parameter through the bitstream and include the parameter in the attenuation control. Such a decoder is depicted in figure 6b.
- The attenuation controller, which can be implemented in a decoder of e.g. a user equipment as shown in figure 7a, comprises according to one embodiment an identifier unit 703 configured to identify spectral regions to be attenuated, a grouping unit 704 configured to group subsequent identified spectral regions to form a continuous spectral region, and a determination unit 705 configured to determine a width of the continuous spectral region. Moreover, an application unit 706 configured to apply an attenuation of the continuous spectral region adaptive to the width is provided in the attenuation controller 300. In this way an increased width decreases the attenuation of the continuous spectral region.
- According to one embodiment, the spectral regions to be attenuated are coded with either a low number of bits or with no bits assigned. In addition, the identifier unit 703 configured to identify spectral regions that are coded with either a low number of bits or no bits assigned may further be configured to examine reconstructed subvectors to identify the spectral regions of the decoded frequency domain residual that are represented with low precision.
- A spectral region may be said to be represented with low precision when the assigned number of bits for the said reconstructed subvector is below a predetermined threshold.
- Alternatively, a pulse coding scheme is employed to encode the spectral subvectors and a spectral region is said to be represented with low precision if it consists of one or more consecutive subvectors where the number of pulses P(b) is below a predetermined threshold.
- According to a further embodiment, spectral regions that are coded with no bits assigned are identified and/or spectral regions that are coded with a low number of bits are identified.
- The reconstructed spectrum can also include a region which is reconstructed using a bandwidth extension algorithm.
- According to a yet further embodiment, the attenuation controller 300 comprises an input/output unit 710 configured to receive an analysis from the encoder, wherein the identifier unit 703 is further configured to identify the spectral regions to be attenuated based on the received analysis. In the received analysis, a distance measure between a reconstructed synthesis signal and an input target signal is used by the encoder. If the distance measure in a certain frequency region is above a certain threshold, the spectral region is a potential candidate for attenuation.
- It should be noted that the units of the attenuation controller 300 of the decoder can be implemented by a processor 700 configured to process software portions providing the functionality of the units as illustrated in figure 7b. The software portions are stored in a memory 701 and retrieved from the memory when being processed. The input/output unit 710 is configured to receive input parameters from e.g. bit allocation and envelope decoding and to send information to envelope shaping.
- According to a further aspect of the present invention, a mobile device 800 comprising the attenuation controller 300 in a decoder according to the embodiments is provided as illustrated in figure 8. It should be noted that the attenuation controller 300 of the embodiments also can be implemented in a network node in a decoder as illustrated in figure 9.
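The encoder side analysis described in the embodiments above, where a distance measure per frequency region is compared against a threshold, could look roughly like the sketch below. It uses the mean squared error as the distance measure, one of the options mentioned in the description; the function names, the threshold value and the band data are hypothetical.

```python
def mse(target, synthesis):
    """Mean squared error between an input band and its synthesized counterpart."""
    return sum((x - y) ** 2 for x, y in zip(target, synthesis)) / len(target)

def attenuation_candidates(input_bands, synth_bands, threshold):
    """Flag bands whose reconstruction error exceeds the threshold.

    The resulting flags stand in for the analysis result that the encoder
    would have to signal to the decoder through the bitstream.
    """
    return [mse(x, y) > threshold for x, y in zip(input_bands, synth_bands)]

# First band is reconstructed exactly, second band is lost entirely.
candidates = attenuation_candidates(
    [[1.0, 0.0], [1.0, -1.0]],  # input bands
    [[1.0, 0.0], [0.0, 0.0]],   # synthesized bands
    threshold=0.5,
)
```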
Claims (16)
- A method for a decoder for determining an attenuation to be applied to an audio signal, comprising:- identifying (201) spectral regions to be attenuated,- grouping (202) subsequent identified spectral regions to form a continuous spectral region,- determining (203) a width of the continuous spectral region, and- applying (204) an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- The method according to claim 1, wherein the step of identifying (201) spectral regions to be attenuated comprises examining (201a) reconstructed subvectors.
- The method according to claim 2, wherein a spectral region is said to be represented with low precision when the assigned number of bits for the said reconstructed subvector is below a predetermined threshold.
- The method according to claim 2, wherein a pulse coding scheme is employed to encode the spectral subvectors and a spectral region is said to be represented with low precision if it consists of one or more consecutive subvectors where the number of pulses P(b) is below a predetermined threshold.
- The method according to any of claims 1-4, wherein spectral regions that are coded with no bits assigned are identified.
- The method according to any of claims 1-5, where the reconstructed spectrum also includes a region which is reconstructed using a bandwidth extension algorithm.
- The method according to claim 1 or 6, wherein the spectral regions to be attenuated are identified based on an analysis received from the encoder wherein a distance measure between a reconstructed synthesis signal and an input target signal are used by the encoder, if the distance measure in certain frequency region is above a certain threshold, the spectral region is a potential candidate for attenuation.
- An attenuation controller (300) of a decoder for determining an attenuation to be applied to an audio signal, comprising an identifier unit (703) configured to identify spectral regions to be attenuated, a grouping unit (704) configured to group subsequent identified spectral regions to form a continuous spectral region, a determination unit (705) configured to determine a width of the continuous spectral region, and an application unit (706) configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region, wherein the spectral regions to be attenuated are coded with no bits assigned.
- The attenuation controller (300) according to claim 8, wherein the identifier unit (703) configured to identify spectral regions to be attenuated further is configured to examine reconstructed subvectors.
- The attenuation controller (300) according to claim 9, wherein a spectral region is said to be represented with low precision when the assigned number of bits for the said reconstructed subvector is below a predetermined threshold.
- The attenuation controller (300) according to claim 9, wherein a pulse coding scheme is employed to encode the spectral subvectors and a spectral region is said to be represented with low precision if it consists of one or more consecutive subvectors where the number of pulses P(b) is below a predetermined threshold.
- The attenuation controller (300) according to any of claims 8-11, wherein spectral regions that are coded with no bits assigned are identified.
- The attenuation controller (300) according to any of claims 8-12, where the reconstructed spectrum also includes a region which is reconstructed using a bandwidth extension algorithm.
- The attenuation controller (300) according to claim 8 or 13, wherein it comprises an input unit (710) configured to receive an analysis from the encoder and wherein the identifier unit (703) is further configured to identify the spectral regions to be attenuated based on the received analysis wherein a distance measure between a reconstructed synthesis signal and an input target signal are used by the encoder, if the distance measure in certain frequency region is above a certain threshold, the spectral region is a potential candidate for attenuation wherein the spectral regions to be attenuated are coded with no bits assigned.
- A mobile terminal comprising an attenuation controller (300) of a decoder for determining an attenuation to be applied to an audio signal, wherein the attenuation controller (300) comprises an identifier unit (703) configured to identify spectral regions to be attenuated, a grouping unit (704) configured to group subsequent identified spectral regions to form a continuous spectral region, a determination unit (705) configured to determine a width of the continuous spectral region, and an application unit (706) configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region wherein the spectral regions to be attenuated are coded with no bits assigned.
- A network node comprising an attenuation controller (300) of a decoder for determining an attenuation to be applied to an audio signal, wherein the attenuation controller (300) comprises an identifier unit (703) configured to identify spectral regions to be attenuated, a grouping unit (704) configured to group subsequent identified spectral regions to form a continuous spectral region, a determination unit (705) configured to determine a width of the continuous spectral region, and an application unit (706) configured to apply an attenuation of the continuous spectral region adaptive to the width such that an increased width decreases the attenuation of the continuous spectral region wherein the spectral regions to be attenuated are coded with no bits assigned.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DK16167229.0T DK3067888T3 (en) | 2011-04-15 | 2011-12-15 | Decoder for attenuation of signal regions reconstructed with low accuracy |
EP16167229.0A EP3067888B1 (en) | 2011-04-15 | 2011-12-15 | Decoder for attenuation of signal regions reconstructed with low accuracy |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161475711P | 2011-04-15 | 2011-04-15 | |
EP11801709.4A EP2697796B1 (en) | 2011-04-15 | 2011-12-15 | Method and a decoder for attenuation of signal regions reconstructed with low accuracy |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP11801709.4A Division-Into EP2697796B1 (en) | 2011-04-15 | 2011-12-15 | Method and a decoder for attenuation of signal regions reconstructed with low accuracy |
EP11801709.4A Division EP2697796B1 (en) | 2011-04-15 | 2011-12-15 | Method and a decoder for attenuation of signal regions reconstructed with low accuracy |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP16167229.0A Division EP3067888B1 (en) | 2011-04-15 | 2011-12-15 | Decoder for attenuation of signal regions reconstructed with low accuracy |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2816556A1 (en) | 2014-12-24 |
EP2816556B1 (en) | 2016-05-04 |
Family
ID=45406733
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP16167229.0A Active EP3067888B1 (en) | 2011-04-15 | 2011-12-15 | Decoder for attenuation of signal regions reconstructed with low accuracy |
EP14184428.2A Active EP2816556B1 (en) | 2011-04-15 | 2011-12-15 | Method and a decoder for attenuation of signal regions reconstructed with low accuracy |
EP11801709.4A Active EP2697796B1 (en) | 2011-04-15 | 2011-12-15 | Method and a decoder for attenuation of signal regions reconstructed with low accuracy |
Country Status (7)
Country | Link |
---|---|
US (4) | US8706509B2 (en) |
EP (3) | EP3067888B1 (en) |
KR (1) | KR101520212B1 (en) |
CN (1) | CN103503065B (en) |
DK (1) | DK3067888T3 (en) |
ES (2) | ES2540051T3 (en) |
WO (1) | WO2012139668A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101704482B1 (en) * | 2012-03-29 | 2017-02-09 | Telefonaktiebolaget LM Ericsson (Publ) | Bandwidth extension of harmonic audio signal |
RU2658128C2 (en) | 2013-06-21 | 2018-06-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating an adaptive spectral shape of comfort noise |
EP2980792A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating an enhanced signal using independent noise-filling |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000045379A2 (en) * | 1999-01-27 | 2000-08-03 | Coding Technologies Sweden Ab | Enhancing perceptual performance of sbr and related hfr coding methods by adaptive noise-floor addition and noise substitution limiting |
WO2003107328A1 (en) * | 2002-06-17 | 2003-12-24 | Dolby Laboratories Licensing Corporation | Audio coding system using spectral hole filling |
WO2009029036A1 (en) * | 2007-08-27 | 2009-03-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and device for noise filling |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4617676A (en) * | 1984-09-04 | 1986-10-14 | At&T Bell Laboratories | Predictive communication system filtering arrangement |
KR940001817B1 (en) * | 1991-06-14 | 1994-03-09 | Samsung Electronics Co., Ltd. | Voltage-current transformation circuit for active filter |
JPH08223049A (en) * | 1995-02-14 | 1996-08-30 | Sony Corp | Signal coding method and device, signal decoding method and device, information recording medium and information transmission method |
JPH08328599A (en) * | 1995-06-01 | 1996-12-13 | Mitsubishi Electric Corp | Mpeg audio decoder |
GB9512284D0 (en) * | 1995-06-16 | 1995-08-16 | Nokia Mobile Phones Ltd | Speech Synthesiser |
CN1748443B (en) * | 2003-03-04 | 2010-09-22 | 诺基亚有限公司 | Support of a multichannel audio extension |
WO2008106036A2 (en) * | 2007-02-26 | 2008-09-04 | Dolby Laboratories Licensing Corporation | Speech enhancement in entertainment audio |
US8326617B2 (en) * | 2007-10-24 | 2012-12-04 | Qnx Software Systems Limited | Speech enhancement with minimum gating |
2011
- 2011-12-15 ES ES11801709.4T patent/ES2540051T3/en active Active
- 2011-12-15 CN CN201180070142.XA patent/CN103503065B/en active Active
- 2011-12-15 KR KR1020137029473A patent/KR101520212B1/en active IP Right Grant
- 2011-12-15 EP EP16167229.0A patent/EP3067888B1/en active Active
- 2011-12-15 WO PCT/EP2011/072963 patent/WO2012139668A1/en active Application Filing
- 2011-12-15 DK DK16167229.0T patent/DK3067888T3/en active
- 2011-12-15 EP EP14184428.2A patent/EP2816556B1/en active Active
- 2011-12-15 US US13/379,054 patent/US8706509B2/en active Active
- 2011-12-15 EP EP11801709.4A patent/EP2697796B1/en active Active
- 2011-12-15 ES ES16167229.0T patent/ES2637031T3/en active Active
2013
- 2013-11-20 US US14/085,082 patent/US9349379B2/en active Active
2016
- 2016-04-26 US US15/138,530 patent/US9595268B2/en active Active
- 2016-11-16 US US15/352,729 patent/US9691398B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
WO2012139668A1 (en) | 2012-10-18 |
CN103503065B (en) | 2015-08-05 |
CN103503065A (en) | 2014-01-08 |
KR20140035900A (en) | 2014-03-24 |
US9349379B2 (en) | 2016-05-24 |
EP2697796B1 (en) | 2015-05-06 |
US8706509B2 (en) | 2014-04-22 |
US20120278085A1 (en) | 2012-11-01 |
ES2540051T3 (en) | 2015-07-08 |
US20140081646A1 (en) | 2014-03-20 |
EP3067888A1 (en) | 2016-09-14 |
US9691398B2 (en) | 2017-06-27 |
EP2816556B1 (en) | 2016-05-04 |
ES2637031T3 (en) | 2017-10-10 |
KR101520212B1 (en) | 2015-05-13 |
US20170061977A1 (en) | 2017-03-02 |
US20160240201A1 (en) | 2016-08-18 |
DK3067888T3 (en) | 2017-07-10 |
US9595268B2 (en) | 2017-03-14 |
EP2697796A1 (en) | 2014-02-19 |
EP3067888B1 (en) | 2017-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2346030B1 (en) | Audio encoder, method for encoding an audio signal and computer program | |
US7613603B2 (en) | Audio coding device with fast algorithm for determining quantization step sizes based on psycho-acoustic model | |
US8972270B2 (en) | Method and an apparatus for processing an audio signal | |
CN110223704B (en) | Apparatus for performing noise filling on spectrum of audio signal | |
US9966082B2 (en) | Filling of non-coded sub-vectors in transform coded audio signals | |
EP3014609B1 (en) | Bitstream syntax for spatial voice coding | |
RU2505921C2 (en) | Method and apparatus for encoding and decoding audio signals (versions) | |
JP2004512560A (en) | Perceptually enhanced enhancement of coded audio signals | |
US8589155B2 (en) | Adaptive tuning of the perceptual model | |
US9691398B2 (en) | Method and a decoder for attenuation of signal regions reconstructed with low accuracy | |
US20040002859A1 (en) | Method and architecture of digital conding for transmitting and packing audio signals | |
US8010370B2 (en) | Bitrate control for perceptual coding | |
EP3550563B1 (en) | Encoder, decoder, encoding method, decoding method, and associated programs | |
US10657976B2 (en) | Signal encoding method and apparatus, and signal decoding method and apparatus | |
EP2104095A1 (en) | A method and an apparatus for adjusting quantization quality in encoder and decoder | |
US20130117028A1 (en) | Apparatus and method for coding signal in a communication system | |
KR20130047630A (en) | Apparatus and method for coding signal in a communication system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20140911 |
|
AC | Divisional application: reference to earlier application |
Ref document number: 2697796 Country of ref document: EP Kind code of ref document: P |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
R17P | Request for examination filed (corrected) |
Effective date: 20150522 |
|
RBV | Designated contracting states (corrected) |
Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/02 20130101AFI20151028BHEP Ipc: G10L 19/035 20130101ALI20151028BHEP Ipc: G10L 21/02 20130101ALN20151028BHEP |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/035 20130101ALI20151109BHEP Ipc: G10L 21/02 20130101ALN20151109BHEP Ipc: G10L 19/02 20130101AFI20151109BHEP |
|
INTG | Intention to grant announced |
Effective date: 20151124 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AC | Divisional application: reference to earlier application |
Ref document number: 2697796 Country of ref document: EP Kind code of ref document: P |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: REF Ref document number: 797516 Country of ref document: AT Kind code of ref document: T Effective date: 20160515 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602011026297 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: NV Representative's name: MARKS AND CLERK (LUXEMBOURG) LLP, CH |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: MP Effective date: 20160504 |
|
REG | Reference to a national code |
Ref country code: LT Ref legal event code: MG4D |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160804 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 797516 Country of ref document: AT Kind code of ref document: T Effective date: 20160504 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160905 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160805 Ref country code: RS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 6 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602011026297 Country of ref document: DE |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: SM Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20170207 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: MM4A |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20161215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20161215 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 7 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; INVALID AB INITIO Effective date: 20111215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: MK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20161215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20160504 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: CH Payment date: 20230109 Year of fee payment: 12 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20221228 Year of fee payment: 12 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230523 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20231227 Year of fee payment: 13 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: TR Payment date: 20231130 Year of fee payment: 13 Ref country code: FR Payment date: 20231227 Year of fee payment: 13 |