EP1997103A1 - Procede de codage d'un signal audio source, dispositif de codage, procede et dispositif de decodage, signal, produits programme d'ordinateur correspondants - Google Patents
Procede de codage d'un signal audio source, dispositif de codage, procede et dispositif de decodage, signal, produits programme d'ordinateur correspondantsInfo
- Publication number
- EP1997103A1 EP1997103A1 EP07731731A EP07731731A EP1997103A1 EP 1997103 A1 EP1997103 A1 EP 1997103A1 EP 07731731 A EP07731731 A EP 07731731A EP 07731731 A EP07731731 A EP 07731731A EP 1997103 A1 EP1997103 A1 EP 1997103A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- coding
- quantization
- profile
- representative
- quantization profile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 118
- 230000005236 sound signal Effects 0.000 title claims abstract description 54
- 238000004590 computer program Methods 0.000 title claims description 10
- 238000013139 quantization Methods 0.000 claims abstract description 165
- 238000012545 processing Methods 0.000 claims description 16
- 230000006870 function Effects 0.000 claims description 14
- 230000005540 biological transmission Effects 0.000 claims description 13
- 238000005259 measurement Methods 0.000 claims description 9
- 238000000605 extraction Methods 0.000 claims description 2
- 238000004891 communication Methods 0.000 claims 2
- 230000000873 masking effect Effects 0.000 description 54
- 230000003595 spectral effect Effects 0.000 description 20
- 238000001228 spectrum Methods 0.000 description 15
- 230000006835 compression Effects 0.000 description 8
- 238000007906 compression Methods 0.000 description 8
- 238000011002 quantification Methods 0.000 description 6
- 238000013459 approach Methods 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000007670 refining Methods 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 238000010183 spectrum analysis Methods 0.000 description 2
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000010420 art technique Methods 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000011423 initialization method Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/035—Scalar quantisation
Definitions
- the field of the invention is that of encoding and decoding digital audio signals, such as music or digitized speech signals.
- the invention relates to the quantization of the spectral coefficients of audio signals, by implementing a perceptual coding.
- the invention applies in particular, but not exclusively, to systems implementing a hierarchical encoding of digital audio data, of the type of the "scalable" (or “scalable") coding / decoding system proposed as part of the MPEG standard. Audio (ISO / IEC 14496-3).
- the invention finds applications in the field of sound and music quantization in an efficient manner, for storage, compression and transmission through transmission channels, for example wireless or hard-wired.
- Audio compression is often based on certain hearing abilities of the human ear.
- the coding and quantization of an audio signal often takes this characteristic into account.
- the ear is unable to separate two components of a signal transmitted at close frequencies, as well as in a reduced time interval. This property is called auditory masking.
- the ear has a hearing threshold, in a quiet environment, below which no sound will be perceived. The level of this threshold varies according to the frequency of the sound wave.
- the quantization principles then use a human ear-induced masking threshold and the masking property to determine the amount of maximum quantization noise that is acceptable to inject into the signal without it being perceived by the ear when the audio signal is restored, that is to say without introducing too much distortion.
- This technique uses the frequency masking model of the ear illustrated in Figure 1, which shows an example of a frequency representation of an audio signal and the threshold of masking of the ear.
- the abscissa axis 10 represents the frequencies f, in Hz, the ordinate axis 11 that of the sound intensity I, in dB.
- the ear breaks down the spectrum of a signal x ⁇ t) into critical bands 120, 121, 122, 123 in the frequency domain according to the Barks scale.
- the critical band 120 of index n of the signal x (t) and energy E n then generates a mask 13 inside the band of index n and in the neighboring critical bands 122 and 123.
- the masking threshold 13 is proportional to the energy E n of the component 120 "masking" and decreasing for the critical bands of indices lower and greater than n.
- the components 122 and 123 are hidden in the example of Figure 1.
- the component 121 is also hidden since it is below Absolute hearing threshold 14.
- An overall masking curve is thus obtained, by combining the absolute hearing threshold 14 and the masking thresholds associated with each of the components of the audio signal x (t) analyzed in critical bands.
- This masking curve represents the spectral density of maximum quantization noise that can be superimposed on the signal, during its coding, without it being perceptible by the human ear.
- a quantization step profile also called, by abuse of language, injected noise profile, is then shaped during the quantization of the spectral coefficients resulting from the frequency transform of the source audio signal.
- Fig. 2 is a flowchart illustrating the principle of a conventional perceptual encoder.
- a time source audio signal x (t) is transformed in the frequency domain by a time-frequency transform block.
- a spectrum of the source signal, composed of spectral coefficients X n is then obtained, which is analyzed by a psychoacoustic model 21, whose role is to determine the overall masking curve C of the signal, as a function of the absolute hearing threshold. as well as masking thresholds of each spectral component of the signal.
- the masking curve obtained makes it possible to know the quantity of quantization noise that can be injected and thus to determine the number of bits to be used for quantizing the spectral coefficients, or samples.
- This step of determining the number of bits is performed by a binary allocation block 22, which delivers a quantization step profile A n for each coefficient X n .
- the bit allocation seeks to reach the target rate by adjusting the quantization steps under the shaping constraint given by the masking curve C.
- the quantization steps A n are coded, in the form of scale factors F in particular, by this block 22 of binary allocation, to be transmitted as additional information in the bit stream T.
- a quantization block 23 receives the spectral coefficients X n as well as the quantization steps A n determined, and then delivers quantized coefficients X n .
- a block 24 for coding and forming a bitstream centralizes the quantized spectral coefficients X n and the scale factors F, to code them and thus to form a bit stream containing the useful data relating to the coded source audio signal as well as the data representative of scale factors.
- Hierarchical coding consists in cascading several stages of coders, the first stage generating the coded version at the lowest rate at which the subsequent stages bring successive improvements for gradually increasing flows.
- the improvement stages are conventionally based on transform perceptual coding as described in the previous section.
- the estimation of the masking curve is based on the quantized values of the coefficients of the time-frequency transform, it can be performed identically at the level of the encoder and the decoder: this has the advantage of avoiding transmit the profile of the quantization step, or quantization noise, to the decoder.
- the masking model implemented simultaneously with the encoder and the decoder is necessarily fixed, and therefore can not be adapted precisely to the nature of the signal.
- a single masking factor is used, regardless of the tonal character or not of the components of the spectrum to be encoded.
- the masking curves are calculated under a stationary hypothesis of the signal, and apply poorly to transient portions and sound attacks.
- the masking curve of the first levels is incomplete, since certain portions of the spectrum are not yet coded. This incomplete curve does not necessarily represent an optimal form of the profile of the quantization step for the hierarchical level considered.
- the invention relates to a method for coding a source audio signal, comprising the following steps: encoding a quantization profile of coefficients representative of at least one transform of the source audio signal, according to at least two distinct coding techniques, delivering at least two sets of data representative of the quantization profile; selecting one of the data sets representative of the quantization profile, based on a selection criterion based on signal distortion measurements reconstructed respectively from said data sets and on the rate required to code said data sets;
- the invention thus relies on a new and inventive approach to coding the coefficients of a source audio signal, making it possible to reduce the bit rate allocated to the transmission of quantization steps while keeping a quantization noise profile injected as close as possible to that given by a masking curve calculated from the complete knowledge of the signal.
- the invention proposes a selection between different possible modes of calculating the quantization step profile. It can thus make a selection between several quantization step profile templates, or injected noise profiles. This choice is indicated by an indicator, for example a signal contained in the bitstream formed by the encoder and transmitted to the system for reproducing the audio signal, the decoder.
- the selection criterion can take into account, in particular, the efficiency of each quantization profile and the bit rate required to code the corresponding data set.
- Quantification is therefore optimized, while minimizing the bit rate required to transmit data representative of the profile of the step of quantization, and not providing direct information on the audio signal itself.
- the choice of a quantization mode is made by comparing a reference masking curve, estimated from the audio signal to be encoded, with the noise profiles associated with each of the modes. of quantification.
- the data set may correspond to a parametric representation of the quantization profile.
- the parametric representation is formed of at least one line segment, characterized by a slope and a value at the origin.
- a second of the coding techniques can deliver a constant quantization profile.
- This encoding mode therefore proposes to code the quantization step profile based on a signal-to-noise ratio (SNR), and not on a signal masking curve.
- SNR signal-to-noise ratio
- the quantization profile corresponds to an absolute hearing threshold.
- the set of data representative of the quantization profile may be empty and no data relating to the quantization profile is transmitted from the encoder to the decoder.
- the threshold of absolute hearing is known to the decoder.
- the set of data representative of the quantization profile may comprise all the quantization steps implemented.
- This fourth coding technique corresponds to the case where the quantization step profile is determined as a function of the signal masking curve, known only to the encoder, and fully transmitted to the decoder.
- the requested bit rate is important, but the signal quality is optimal.
- the coding implements hierarchical processing delivering at least two hierarchical coding levels, comprising a basic level and at least one level of refinement comprising refinement information with respect to the basic level or a previous refinement level.
- a fifth coding technique provides that the set of data representative of the quantization profile is obtained, at a given level of refinement, taking into account data constructed at the previous hierarchical level.
- the invention thus effectively applies to hierarchical coding, and proposes to code the quantization step profile according to a technique of refining it at each hierarchical level.
- the selection step can be implemented at each level of hierarchical coding.
- the selection step can be implemented for each of the frames.
- the signaling can thus be carried out not only for each processing frame, but, in the particular application of a hierarchical coding of the data, for each level of refinement.
- the coding can be implemented on groups of frames, of predefined or variable sizes. It can also be expected that the current profile remains unchanged until a new indicator has been transmitted.
- the invention also relates to a coding device for a source audio signal, comprising means for implementing such a method.
- the invention also relates to a computer program product for implementing the coding method as described above.
- the invention also relates to a coded signal representative of a source audio signal, comprising data representative of a quantization profile.
- a coded signal representative of a source audio signal comprising data representative of a quantization profile.
- Such a signal includes in particular:
- an indicator representative of a coding technique of the quantization profile implemented selected from among at least two available techniques, as a function of a selection criterion based on distortion measurements of reconstructed signals respectively from the profile; quantization coded according to said techniques and the bit rate necessary to code the quantization profile according to said techniques;
- Such a signal may in particular comprise data relating to at least two hierarchical levels obtained by a hierarchical processing, comprising a basic level and at least one level of refinement comprising refinement information relative to the basic level or to a level of refinement. previous, and includes an indicator representative of a coding technique for each level.
- the signal according to the invention When the signal according to the invention is organized in frames of successive coefficients, it may comprise an indicator representative of the coding technique used for each of the frames.
- the invention also relates to a method for decoding such a signal.
- This method notably comprises the following steps: extraction of the coded signal:
- Such a decoding method also comprises a step of constructing a reconstructed audio signal, representative of the source audio signal, taking into account the reconstructed quantization profile.
- the data set may correspond to a parametric representation of the quantization profile
- the reconstruction step delivers a reconstructed quantization profile in the form of at least one line segment.
- the data set may be empty and the reconstruction step delivers a constant quantization profile.
- the data set may be empty, and the reconstruction step delivers a quantization profile corresponding to an absolute hearing threshold.
- the data set may comprise all the quantization steps implemented during the coding method described above, and the construction step delivers a quantization profile in the form of a set of quantization steps implemented during the coding process.
- the decoding method may implement a hierarchical processing delivering two hierarchical decoding levels, comprising a basic level and at least one level of decoding. refinement comprising refinement information relative to the base level or a previous refinement level.
- the reconstruction step delivers a quantization profile obtained, at a given level of refinement, taking into account data constructed at the previous hierarchical level.
- the invention also relates to a device for decoding a coded signal representative of a source audio signal, comprising means for implementing the decoding method described above.
- the invention also relates to a computer program product for implementing the decoding method as described above.
- FIG. 1 illustrates the frequency masking threshold
- FIG. 2 is a simplified flowchart of the perceptual coding by transform according to the state of the art
- FIG. 3 illustrates an exemplary signal according to the invention
- FIG. 4 is a simplified flowchart of the coding method according to the invention.
- FIG. 5 is a simplified flowchart of the decoding method according to the invention.
- FIGS. 6A and 6B schematically illustrate a coding device and a decoding device embodying the invention.
- a source audio signal x (t) is intended to be transformed in the frequency domain, directly or indirectly.
- the signal x (t) can first be coded in a coding step 40.
- a coding step 40 is implemented by a "heart" coder.
- this first coding step corresponds to a first hierarchical level of coding, that is to say the basic level.
- Such a "heart" encoder may implement a coding step 401, and a local decoding step 402. It then delivers a first bitstream 46 representative of the data of the coded audio signal at the lowest level of refinement.
- Different coding techniques can be envisaged to obtain the low bit rate, such as parametric encodings such as the sinusoidal encoding described in B. den Brinker, E.Schuijers and W.Oomen, "Parametric coding for high-quality audio", in Proc. 112nd AES Convention, Kunststoff, Germany, 2002 or coding by CELP (for Code-Excited Linear Prediction) in the document M. Schroeder and B. Atal, Code-excited linear prediction (CELP) : high quality speech at very low bit rates ", in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Tampa, pp. 937-940, 1985.
- CELP Code-excited linear prediction
- a subtraction 403 is performed between the decoded samples by the local decoder 402 and the actual values of x (t), so as to obtain a time domain residue signal r (t).
- Frequency coefficients l ⁇ p are obtained, in the frequency domain, representative of the residues delivered by the "core” coder 40, for each critical band of index k and for the first hierarchical level.
- the next coding level stage 42 contains a residue coding step 421 associated with an implementation of a psychoacoustic model for determining a first masking curve for the first refinement level.
- quantized residual coefficients R ⁇ are obtained, which are subtracted (423) from the original coefficients R ⁇ * resulting from the "core” coding step 40.
- New coefficients R [* are obtained, which are themselves quantified and coded at step
- a psychoacoustic model 432 is implemented and updates the masking threshold as a function of the coefficients R * of residues previously quantified.
- the basic coding step 40 (“core” coder) allows the transmission and decoding, in a terminal, of a low bit rate version of the audio signal.
- the successive stages 42, 43 of quantization of the residues in the transformed domain constitute improvement layers, making it possible to construct a hierarchical bit stream from the low rate level to a desired maximum rate.
- an indicator ⁇ ⁇ , ⁇ ⁇ is associated with each psychoacoustic model 422, 432 of each coding level, for each of the quantization stages.
- the value of this indicator is specific to each stage and controls the calculation mode of the quantization step profile. It is placed in the header 441 and 451 quantized spectral coefficient frames 442, 452 in the bitstreams 44, 45 associated and formed at each coding level 42, 43 improved.
- FIG. 3 An example of a structure of a signal obtained according to this coding technique is illustrated in FIG. 3.
- the signal is organized in blocks or data frames 31 each comprising a header 32 and a data field 33.
- a block corresponds to for example to the data (contained in the field 33) of a hierarchical level for a predetermined time interval.
- the header 32 may include several signaling information, decoding assistance ... It comprises at least, according to the invention, the information ⁇ . 5.2 Structure of the decoder
- the decoding comprises several levels 50, 51, 52 of decoding refinement.
- a first decoding step 501 receives a bit stream 53 containing the data 530 representative of the indicator ⁇ ⁇ of the first level, determined during the first coding step and transmitted to the decoder.
- the bitstream further contains the data 531 representative of the spectral coefficients of the audio signal.
- a psychoacoustic model is implemented in a first step 502, to determine a first estimate of the masking curve, and thus a profile. quantization step that is used to process the residuals of the spectral coefficients available to the decoder at this stage of the decoding process.
- the spectral coefficient residues obtained Tc k 'for each critical band of index k make it possible to update the psychoacoustic model at the following level 51, in a step 512, which then refines the masking curve and therefore the pitch profile. of quantification.
- This refinement therefore takes into account the value of the indicator r r 'for level 2, contained in the header 540 of the bit stream 54 transmitted by the corresponding coder, the quantized residues at the previous level as well as quantized data 541 relating to level 2 residues included in bit stream 54.
- Residues quantized are obtained at the output of the second decoding level 51. They are added (56) to the RJp residues of the previous level, but also injected at the next level 52, which similarly refines the precision on the spectral coefficients as well as on the profile of the quantization steps. from a decoding step 521, and the implementation of a psychoacoustic model in a step 522. This level further receives a bit stream 55 sent by the encoder containing the value of the indicator 55 yr 'and the Quantized spectrum 551.
- the quantized R ⁇ p residues obtained are added to the residues R 1, and so on.
- the psychoacoustic model is updated as the coefficients are decoded by the successive refinement levels. Reading the indicator ⁇ transmitted by the encoder then makes it possible to reconstruct the noise profile (or quantization) by each quantization stage.
- the steps of updating the psychoacoustic model, and quantizing the spectral coefficients common to the coding method and the decoding method, according to a particular embodiment, are described in detail below.
- the step of determining the value of the indicator ⁇ , carried out at the coding, is then detailed, followed by the step of reconstructing the quantization steps at the decoder.
- a psycho-acoustic model takes into account the sub-bands in which the ear breaks down an audio signal and thus determines masking thresholds by using psychoacoustic information. These thresholds are used to determine the quantization step of the spectral coefficients.
- the step (implementation in steps 422, 432 of the coding method and in steps 502, 512, 522 of the decoding method) of updating the masking curve by the psychoacoustic model remains unchanged regardless of the value of the indicator ⁇ on the choice of the profile of the quantization step.
- this masking curve updated by the psycho-acoustic model which is conditioned by the value of the indication ⁇ to define the profile of the quantization step implemented for quantify the spectral coefficients (or residual coefficients determined at a previous refinement level).
- the psychoacoustic model uses at each level of quantification
- This spectrum is initialized at the first level of quantization refinement by the data available at the output of the coding step implemented by the core coder.
- the coding and decoding methods each comprise an initialization step Init of the psycho-acoustic model during its first implementation (step 422 of the coding method and step 502 of the decoding method), from the data transmitted by the heart coder.
- the coefficient g t corresponds to a constant gain making it possible to adjust the level of the quantization noise injected parallel to the profile given by ⁇ ⁇
- this gain g t is determined by an allocation loop in order to reach a target bit rate assigned to each index quantization level. It is then transmitted to the decoder in the bitstream at the output of the quantization stage.
- the gain g t is a function of the single level of index refinement / and this function is known to the decoder.
- the coding and decoding methods according to the invention then propose to determine a profile A ⁇ of quantization step from a choice between several coding techniques, or modes of calculation of this profile.
- the selection is indicated by the value of the indicator ⁇ transmitted in the bit stream.
- the quantization step profile is either fully transmitted, partially, or not at all. In the latter case, the profile of the quantization step is estimated at the decoder.
- the quantization step profile A 2 used by the index quantization stage / is calculated from the masking curve available at this stage and the input indicator ⁇ ⁇ '.
- the indicator ⁇ ⁇ is coded on 3 bits, to indicate five coding techniques different from the profile of the quantization step.
- the quantization step profile is defined only from the absolute hearing threshold according to the equation k ⁇ ffset (n + 1) -l
- the encoder transmits to the decoder no information relating to the quantization step.
- the value of the slope a is chosen by correlation with the reference masking curve, calculated at the encoder from a spectral analysis of the signal to be coded. Its quantized value at is then transmitted to the decoder and used to define the profile of the quantization steps according to the formula: A ⁇ - D n (a).
- the profile of the quantization steps A i determined in the coding step is entirely transmitted to the decoder.
- the invention proposes a particular technique for judiciously choosing the value of the indicator ⁇ , and therefore the quantization step profile to be applied for coding and decoding an audio signal. This choice is made at the coding step, for each level of quantization (in the case of a hierarchical coding) index /.
- a ⁇ ⁇ M ⁇ .
- the ratio of the gains G 1 and G 2 makes it possible to standardize the profiles of the quantization step with respect to each other.
- ⁇ ( ⁇ ) represents the additional cost in bits associated with the transmission of the profile quantization steps. In other words, it represents the number of additional bits (except those encoding the indicator ⁇ ) to be transmitted to the decoder to allow the reconstruction of quantization steps.
- the reconstruction of the quantization step profile at an index quantization stage / is performed according to the data transmitted by the decoder.
- the decoder decodes the value of this indicator present in the bit stream header. received for each frame, then read the value of the adjustment gain g t .
- the decoder reads all the quantization steps
- the method of the invention can be implemented a coding device, the structure of which is presented in relation to FIG. 6A.
- a device comprises a memory M 600, a processing unit 601, equipped for example with a microprocessor, and controlled by the computer program Pg 602.
- the code instructions of the computer program 602 are for example loaded into a RAM memory before being executed by the processor of the processing unit 601.
- the processing unit 601 receives as input a source audio signal to be coded 603.
- the microprocessor ⁇ P of the processing unit 601 implements the coding method described above, according to the instructions of the program Pg 602.
- the processing unit 601 outputs a bitstream 604 including in particular quantized data representative of the coded source audio signal, data representative of a quantization step profile, and finally representative data of the ⁇ indicator.
- the invention also relates to a device for decoding a coded signal representative of a source audio signal according to the invention, whose simplified overall structure is illustrated schematically in FIG. 6B.
- It comprises a memory M 610, a processing unit 611, equipped for example with a microprocessor, and controlled by the computer program Pg 612.
- the code instructions of the computer program 612 are for example loaded into a RAM before being executed by the processor of the processing unit 611.
- the processing unit 611 receives as input a bitstream 613, comprising data representative of a coded source audio signal, representative data a quantization step profile and representative data of the ⁇ indicator.
- the microprocessor ⁇ P of the processing unit 611 implements the decoding method according to the instructions of the program Pg 612, to deliver a reconstructed audio signal 612.
- ANNEX ANNEX
- the psychoacoustic model can be initialized in several ways, depending on the type of "core" coder implemented in the base level coding step.
- a sinusoidal encoder models the audio signal by a sum of sinusoids of varying frequencies and amplitudes over time.
- the quantized values of the frequencies and amplitudes are transmitted to the decoder. From these values, one can build the spectrum X ⁇ of the sinusoidal components of the signal.
- the initial spectrum X 1 can be simply estimated from a short-term spectral analysis of the decoded signal at the output of the core coder.
- the initial spectrum X 1 can be obtained by adding the LPC envelope spectrum defined according to the previous equation, and the short-term spectrum estimated from the coded residue by a CELP coder.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0602179A FR2898443A1 (fr) | 2006-03-13 | 2006-03-13 | Procede de codage d'un signal audio source, dispositif de codage, procede et dispositif de decodage, signal, produits programme d'ordinateur correspondants |
PCT/FR2007/050915 WO2007104889A1 (fr) | 2006-03-13 | 2007-03-12 | Procede de codage d'un signal audio source, dispositif de codage, procede et dispositif de decodage, signal, produits programme d'ordinateur correspondants |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1997103A1 true EP1997103A1 (fr) | 2008-12-03 |
EP1997103B1 EP1997103B1 (fr) | 2011-09-14 |
Family
ID=36996146
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07731731A Active EP1997103B1 (fr) | 2006-03-13 | 2007-03-12 | Procede de codage d'un signal audio source, dispositif de codage, procede et dispositif de decodage, signal, produits programme d'ordinateur correspondants |
Country Status (7)
Country | Link |
---|---|
US (1) | US8224660B2 (fr) |
EP (1) | EP1997103B1 (fr) |
JP (1) | JP5192400B2 (fr) |
CN (1) | CN101432804B (fr) |
AT (1) | ATE524808T1 (fr) |
FR (1) | FR2898443A1 (fr) |
WO (1) | WO2007104889A1 (fr) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2852172A1 (fr) * | 2003-03-04 | 2004-09-10 | France Telecom | Procede et dispositif de reconstruction spectrale d'un signal audio |
CN102081927B (zh) * | 2009-11-27 | 2012-07-18 | 中兴通讯股份有限公司 | 一种可分层音频编码、解码方法及系统 |
CN102652336B (zh) * | 2009-12-28 | 2015-02-18 | 三菱电机株式会社 | 声音信号复原装置以及声音信号复原方法 |
US9450812B2 (en) | 2014-03-14 | 2016-09-20 | Dechnia, LLC | Remote system configuration via modulated audio |
PL3385948T3 (pl) * | 2014-03-24 | 2020-01-31 | Nippon Telegraph And Telephone Corporation | Sposób kodowania, koder, program i nośnik zapisu |
CN106653035B (zh) * | 2016-12-26 | 2019-12-13 | 广州广晟数码技术有限公司 | 数字音频编码中码率分配的方法和装置 |
US10966033B2 (en) | 2018-07-20 | 2021-03-30 | Mimi Hearing Technologies GmbH | Systems and methods for modifying an audio signal using custom psychoacoustic models |
US10455335B1 (en) * | 2018-07-20 | 2019-10-22 | Mimi Hearing Technologies GmbH | Systems and methods for modifying an audio signal using custom psychoacoustic models |
EP3614380B1 (fr) | 2018-08-22 | 2022-04-13 | Mimi Hearing Technologies GmbH | Systèmes et procédés d'amélioration sonore dans des systèmes audio |
CN110265043B (zh) * | 2019-06-03 | 2021-06-01 | 同响科技股份有限公司 | 自适应有损或无损的音频压缩和解压缩演算方法 |
CN113904900B (zh) * | 2021-08-26 | 2024-05-14 | 北京空间飞行器总体设计部 | 一种实时遥测信源分阶相对编码方法 |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR9206143A (pt) * | 1991-06-11 | 1995-01-03 | Qualcomm Inc | Processos de compressão de final vocal e para codificação de taxa variável de quadros de entrada, aparelho para comprimir im sinal acústico em dados de taxa variável, codificador de prognóstico exitado por córdigo de taxa variável (CELP) e descodificador para descodificar quadros codificados |
EP0559348A3 (fr) * | 1992-03-02 | 1993-11-03 | AT&T Corp. | Processeur ayant une boucle de réglage du débit pour un codeur/décodeur perceptuel |
JP3341474B2 (ja) * | 1994-07-28 | 2002-11-05 | ソニー株式会社 | 情報符号化方法及び復号化方法、情報符号化装置及び復号化装置、並びに情報記録媒体 |
JP3304739B2 (ja) * | 1996-02-08 | 2002-07-22 | 松下電器産業株式会社 | ロスレス符号装置とロスレス記録媒体とロスレス復号装置とロスレス符号復号装置 |
KR100261254B1 (ko) * | 1997-04-02 | 2000-07-01 | 윤종용 | 비트율 조절이 가능한 오디오 데이터 부호화/복호화방법 및 장치 |
KR100335609B1 (ko) * | 1997-11-20 | 2002-10-04 | 삼성전자 주식회사 | 비트율조절이가능한오디오부호화/복호화방법및장치 |
US6115689A (en) * | 1998-05-27 | 2000-09-05 | Microsoft Corporation | Scalable audio coder and decoder |
US6499010B1 (en) * | 2000-01-04 | 2002-12-24 | Agere Systems Inc. | Perceptual audio coder bit allocation scheme providing improved perceptual quality consistency |
JP2003195894A (ja) * | 2001-12-27 | 2003-07-09 | Mitsubishi Electric Corp | 符号化装置、復号化装置、符号化方法、及び復号化方法 |
KR100467617B1 (ko) * | 2002-10-30 | 2005-01-24 | 삼성전자주식회사 | 개선된 심리 음향 모델을 이용한 디지털 오디오 부호화방법과그 장치 |
JP4212591B2 (ja) * | 2003-06-30 | 2009-01-21 | 富士通株式会社 | オーディオ符号化装置 |
US7383180B2 (en) * | 2003-07-18 | 2008-06-03 | Microsoft Corporation | Constant bitrate media encoding techniques |
JP4091506B2 (ja) * | 2003-09-02 | 2008-05-28 | 日本電信電話株式会社 | 2段音声画像符号化方法、その装置及びプログラム及びこのプログラムを記録した記録媒体 |
DE102004009955B3 (de) * | 2004-03-01 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Ermitteln einer Quantisierer-Schrittweite |
JP4301092B2 (ja) * | 2004-06-23 | 2009-07-22 | 日本ビクター株式会社 | 音響信号符号化装置 |
CN1731694A (zh) * | 2004-08-04 | 2006-02-08 | 上海乐金广电电子有限公司 | 数字音频编码方法以及装置 |
WO2006054583A1 (fr) * | 2004-11-18 | 2006-05-26 | Canon Kabushiki Kaisha | Appareil et méthode de codage de signal audio |
US7668715B1 (en) * | 2004-11-30 | 2010-02-23 | Cirrus Logic, Inc. | Methods for selecting an initial quantization step size in audio encoders and systems using the same |
KR100851970B1 (ko) * | 2005-07-15 | 2008-08-12 | 삼성전자주식회사 | 오디오 신호의 중요주파수 성분 추출방법 및 장치와 이를이용한 저비트율 오디오 신호 부호화/복호화 방법 및 장치 |
JP2007183528A (ja) * | 2005-12-06 | 2007-07-19 | Fujitsu Ltd | 符号化装置、符号化方法、および符号化プログラム |
US7835904B2 (en) * | 2006-03-03 | 2010-11-16 | Microsoft Corp. | Perceptual, scalable audio compression |
-
2006
- 2006-03-13 FR FR0602179A patent/FR2898443A1/fr not_active Withdrawn
-
2007
- 2007-03-12 AT AT07731731T patent/ATE524808T1/de not_active IP Right Cessation
- 2007-03-12 EP EP07731731A patent/EP1997103B1/fr active Active
- 2007-03-12 JP JP2008558864A patent/JP5192400B2/ja active Active
- 2007-03-12 CN CN200780015598.XA patent/CN101432804B/zh active Active
- 2007-03-12 US US12/282,731 patent/US8224660B2/en active Active
- 2007-03-12 WO PCT/FR2007/050915 patent/WO2007104889A1/fr active Application Filing
Non-Patent Citations (1)
Title |
---|
See references of WO2007104889A1 * |
Also Published As
Publication number | Publication date |
---|---|
JP5192400B2 (ja) | 2013-05-08 |
ATE524808T1 (de) | 2011-09-15 |
WO2007104889A1 (fr) | 2007-09-20 |
FR2898443A1 (fr) | 2007-09-14 |
US8224660B2 (en) | 2012-07-17 |
EP1997103B1 (fr) | 2011-09-14 |
CN101432804A (zh) | 2009-05-13 |
US20090083043A1 (en) | 2009-03-26 |
JP2009530653A (ja) | 2009-08-27 |
CN101432804B (zh) | 2013-01-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1997103B1 (fr) | Procede de codage d'un signal audio source, dispositif de codage, procede et dispositif de decodage, signal, produits programme d'ordinateur correspondants | |
KR101251790B1 (ko) | 노이즈 필러, 노이즈 필링 파라미터 계산기, 오디오 신호의 노이즈-필드된 스펙트럴 표현을 제공하는 방법, 노이즈 필링 파라미터를 제공하는 방법, 저장 매체 | |
JP5356406B2 (ja) | オーディオコーディングシステム、オーディオデコーダ、オーディオコーディング方法及びオーディオデコーディング方法 | |
EP1987513B1 (fr) | Procede et dispositif de codage hierarchique d'un signal audio source, procede et dispositif de decodage, programmes et signal correspondants | |
TWI529700B (zh) | 雜訊塡充技術 | |
FR2947945A1 (fr) | Allocation de bits dans un codage/decodage d'amelioration d'un codage/decodage hierarchique de signaux audionumeriques | |
FR2891098A1 (fr) | Procede et dispositif de mixage de flux audio numerique dans le domaine compresse. | |
WO2011004097A1 (fr) | Codage/décodage perfectionne de signaux audionumériques | |
EP1514263A1 (fr) | Systeme de codage audio utilisant des caracteristiques d'un signal decode pour adapter des composants spectraux synthetises | |
EP2769378A2 (fr) | Codage hierarchique perfectionne | |
US7603271B2 (en) | Speech coding apparatus with perceptual weighting and method therefor | |
WO2010031951A1 (fr) | Attenuation de pre-echos dans un signal audionumerique | |
EP2171713B1 (fr) | Codage de signaux audionumériques | |
WO2011090434A1 (fr) | Procédé et dispositif pour déterminer un nombre de bits pour coder un signal audio | |
EP2589045B1 (fr) | Codage/décodage prédictif linéaire adaptatif | |
WO2007091000A2 (fr) | Procede de codage d'un signal audio source, dispositif de codage, procede de decodage, signal, support de donnees, produits programme d'ordinateur correspondants | |
WO2009136872A1 (fr) | Procédé et dispositif pour coder un signal audio, procédé et dispositif pour générer des données audio codées et procédé et dispositif pour déterminer un débit binaire d'un signal audio codé | |
FR3018942A1 (fr) | Estimation d'un bruit de codage introduit par un codage en compression de type micda | |
FR2737360A1 (fr) | Procedes de codage et de decodage de signaux audiofrequence, codeur et decodeur pour la mise en oeuvre de tels procedes | |
FR2863792A1 (fr) | Procede de compression de signaux audio sans contrainte de delai |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20080930 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR |
|
17Q | First examination report despatched |
Effective date: 20100927 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: FRENCH |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602007017186 Country of ref document: DE Effective date: 20111124 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: VDEP Effective date: 20110914 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
LTIE | Lt: invalidation of european patent or patent extension |
Effective date: 20110914 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111215 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 524808 Country of ref document: AT Kind code of ref document: T Effective date: 20110914 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FD4D |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120114 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120116 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
26N | No opposition filed |
Effective date: 20120615 |
|
BERE | Be: lapsed |
Owner name: FRANCE TELECOM Effective date: 20120331 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602007017186 Country of ref document: DE Effective date: 20120615 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120331 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120331 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120331 Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120331 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111225 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111214 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20110914 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120312 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070312 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 10 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 11 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 12 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20240220 Year of fee payment: 18 Ref country code: GB Payment date: 20240220 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20240221 Year of fee payment: 18 |