WO2000045378A2 - Codage efficient de l'enveloppe spectrale mettant en oeuvre une resolution temps frequence et une commutation temps frequence - Google Patents
Codage efficient de l'enveloppe spectrale mettant en oeuvre une resolution temps frequence et une commutation temps frequence Download PDFInfo
- Publication number
- WO2000045378A2 WO2000045378A2 PCT/SE2000/000158 SE0000158W WO0045378A2 WO 2000045378 A2 WO2000045378 A2 WO 2000045378A2 SE 0000158 W SE0000158 W SE 0000158W WO 0045378 A2 WO0045378 A2 WO 0045378A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- time
- frequency
- resolution
- signal
- spectral envelope
- Prior art date
Links
- 230000003595 spectral effect Effects 0.000 title claims abstract description 36
- 238000000034 method Methods 0.000 claims abstract description 27
- 239000008187 granular material Substances 0.000 claims description 45
- 230000001052 transient effect Effects 0.000 claims description 44
- 238000004458 analytical method Methods 0.000 claims description 10
- 230000015572 biosynthetic process Effects 0.000 claims description 7
- 238000003786 synthesis reaction Methods 0.000 claims description 7
- 230000009286 beneficial effect Effects 0.000 claims description 4
- 230000005540 biological transmission Effects 0.000 claims 4
- 238000007619 statistical method Methods 0.000 claims 2
- 230000003044 adaptive effect Effects 0.000 abstract description 3
- 238000013507 mapping Methods 0.000 abstract description 2
- 230000011664 signaling Effects 0.000 description 13
- 238000005070 sampling Methods 0.000 description 7
- 239000013598 vector Substances 0.000 description 6
- 238000002592 echocardiography Methods 0.000 description 5
- 230000000873 masking effect Effects 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- 230000005236 sound signal Effects 0.000 description 5
- 230000002123 temporal effect Effects 0.000 description 5
- 239000000463 material Substances 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001020 rhythmical effect Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 230000003321 amplification Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 210000002196 fr. b Anatomy 0.000 description 1
- 210000003918 fraction a Anatomy 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000033764 rhythmic process Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Definitions
- the present invention relates to a new method and apparatus for efficient coding of spectral envelopes in audio coding systems
- the method may be used both for natural audio coding and speech coding and is especially suited for coders usmg SBR [WO 98/57436] and other high frequency reconstruction methods
- Audio source codmg techniques can be divided into two classes natural audio codmg and speech codmg Natural audio codmg is commonly used for music or arbitrary signals at medium bitrates, and generally offers wide audio bandwidth Speech coders are basically limited to speech reproduction but can on the other hand be used at very low bitrates, albeit with low audio bandwidth In both classes, the signal is generally separated into two major signal components, the "spectral envelope” and the corresponding "residual" signal Throughout the following description, the term “spectral envelope” refers to the coarse spectral distribution of the signal in a general sense, e g filter coefficients m an linear prediction based coder or a set of time-frequency averages of subband samples in a subband coder The term “residual” refers to the fine spectral distribution in a general sense, e g the LPC error signal or subband samples normalized usmg the above time-frequency averages "Envelope data” refers to the quantized and coded spectral envelope, and "resi
- the spectral envelope is a function of two variables time and frequency
- the encodmg can be done by exploiting redundancy m either direction of the time/frequency plane
- codmg of the spectral envelope is performed in the frequency direction usmg delta coding (DPCM), linear prediction (LPC), or vector quantization (VQ) SUMMARY OF THE INVENTION
- DPCM frequency direction usmg delta coding
- LPC linear prediction
- VQ vector quantization
- the present mvention provides a new method and an apparatus for spectral envelope encoding
- the mvention teaches how to perform and signal compactly a time/frequency mapping of the envelope representation, and further, encode the spectral envelope data efficiently using adaptive time/frequency direction codmg
- a time/frequency grid with low temporal and high frequency resolution is used as default
- the temporal resolution is increased at the expense of frequency resolution
- the mvention describes two schemes for signalling of the time and frequency resolution used One scheme allows arbitrary selection of instantaneous resolution by explicit signalling of time segment borders and frequency resolutions, whereas the other exploits the fact that transients are separated at least by a minimum time, T mm , ⁇ order to reduce the required number of control bits
- a transient detector decides whether the current granule contains a transient, and if so, determines the position of the onset of the transient The position withm the granule is encode
- the present mvention presents a new and efficient method for scalefactor redundancy codmg
- a dirac pulse in the time domain transforms to a constant in the frequency domain, and a drrac in the frequency domain, l e a single smusoid, corresponds to a signal with constant magnitude m the time domain Simplified, on a short term basis, the signal shows less variations in one domam than the other
- usmg prediction or delta codmg, codmg efficiency is mcreased if the spectral envelope is coded in either time- or frequency-direction depending on the signal characte ⁇ stics
- Figs la - lb illustrate uniform respective non-uniform sampling in time of the spectral envelope
- Figs 2a - 2c illustrate transient detector look-ahead and granule mterdependency
- Figs 3a - 3f illustrate segments with different time and frequency resolutions, and the corresponding control signals Fig 4 illustrates time/frequency switched envelope codmg
- Fig 5 is a block diagram of an encoder using the envelope coding according to the mvention
- Fig 6 is a block diagram of a decoder usmg the envelope codmg according to the invention DESCRIPTION OF PREFERRED EMBODIMENTS
- Fig 1 shows the time/frequency representation of a musical signal where sustained chords are combmed with sharp transients with mainly high frequency contents
- the chords have high energy and the transient energy is low, whereas the opposite is true
- m the highband
- the envelope data that is generated durmg time intervals where transients are present is dominated by the high intermittent transient energy
- the decoder the spectral envelope of the transposed signal is estimated using the same mstantaneous t ⁇ me-/frequency resolution as used for the analysis of the original highband
- An equalization of the transposed signal is then performed, based on dissimilarities m the spectral envelopes E g amplification factors in an envelope adjustmg filterbank are calculated as the quotients between o ⁇ gmal signal and transposed signal scalefactors
- the transposed signal has the same chord to transient energy ratio as the lowband
- the gams needed in order to adjust the transposed transients to the correct level thus cause the
- the prmcipal solution is to maintam a low update rate durmg tonal passages, which make up the majority of a typical programme material, and by means of a transient detector localize the transient positions, and update the envelope data close to the leading flanks, see Fig lb
- the update rate is momentarily increased in a time interval after the transient start This eliminates gain induced post-echoes
- the time segmenting during the decay is not as crucial as findmg the start of the transient, as will be explained later
- a lower frequency resolution can be used durmg the transient, keeping the data size within limits
- a non-uniform samplmg in time and frequency as outlined above is applicable both on subband coders and linear prediction based coders
- Typical coders operate on a block basis, where every block represents a fixed time interval Those blocks will be referred to as "granules"
- granules Let a granule have a length of q time quantization steps, hereinafter called “subgranules"
- subgranules a transient detector look-ahead can be employed on the encoder side Having this additional information, envelope data spanning across borders of granules can be comprised This enables a more flexible selection of time/frequency resolutions, and faciliates constant bitrate operation, smce parts of the payload can be moved between consecutive granules Referring to Fig 2, the granules are divided into eight subgranules
- the transient detector operates on granules with the same timespan as the granule that overlap 50% of two consecutive granules, that is, the transient detector look-ahead is half a granule
- the transient detector has detected a transient
- the segment borders and frequency resolutions (number of coefficients or scalefactors) must be signalled If a non-uniform sampling according to Fig 2 is to be employed, the problem of envelope data spanning over the granule borders must be dealt with Furthermore, the signalling must be flexible enough to cover all combinations of interest, without generating a too large amount of control data
- transients can occur withm a granule m C combinations, ranging from no transient at all to q transients, where C is given by
- the first step towards an efficient signalling is to employ two time sampling modes, uniform and non-uniform sampling in time
- the uniform mode is used during quasi-stationary passages, and employs high frequency resolution and relatively long time segments, both of which are predefined Hence this mode does not require any signalling of segment borders or frequency resolutions
- One bit is sufficient to signal the time samplmg mode to the decoder
- the non-uniform mode is used durmg transient passages and requires additional signalling Two such signalling systems are proposed by the present invention
- the first system hereinafter referred to as the "border-signalling system" uses one bit per subgranule to signal whether a segment border is present at the subgranule left border or not Envelope data corresponding to a segment is always sent m the granule in which the segment starts This means that the number of envelopes transmitted m a granule equals the number of left borders in the granule or the bit sum of the q border bits
- the segment frequency resolutions are signalled with dynamically allocated control bits, e g one bit per envelope Again, this number of bits is derived from the q border bits
- Fig 3 Some examples of grouping of subgranules into time segments are given m Fig 3, where the subgranules are numbered from 000 to 111 L denotes low frequency resolution and H denotes high resolution
- L denotes low frequency resolution
- H denotes high resolution
- the number of scalefactors or coefficients m a high resolution segment is assumed to be two times that of a low resolution segment
- Figure 3a shows a reference system, constantly using the highest possible time and frequency resolution
- the relative data matrix size is one by definition, and obviously no control signal bits are needed in this system If no transient is present in or next to a specific granule, the granule is divided mto two segments of equal length and the envelope representations are calculated using high frequency resolution If the two envelope representations do not differ more than a certain amount, only one set of high resolution envelope data is sent Those cases are illustrated by Figs 3b and 3c, where the control signal "Uniform" tells that uniform sampling in time is used, and the signal “Low
- the second system heremafter referred to as the "position-signalling system" is intended for very low bitrate applications and utilizes some musical signal properties m order to reduce the number of control signal bits
- the positions-signalling system are intended for very low bitrate applications and utilizes some musical signal properties m order to reduce the number of control signal bits
- Eq 1 many of the states described by Eq 1 are not very likely, and would also generate too large amounts of envelope data to be practical at a limited bitrate
- the following simplifications can be made with little or no sacrifice of quality for practical signals
- the minimum time-span between consecutive transients in music programme material can be estimated in the following way In musical notation, the rhythmic "pulse" is described by a time signature expressed as a fraction A/B, where A denotes the number of "beats" per bar and MB is the type of note corresponding to one beat, for example a l A note, commonly referred to as a quarter note
- A denotes the number of "beats” per bar
- MB is the type of note corresponding to one beat, for example a l A note, commonly referred to as a quarter note
- BPM Beats Per Mmute
- T q The necessary time resolution T q must also be established In some cases a transient original signal has its mam energy in the highband to be reconstructed This means that the encoded spectral envelope must carry all the "timing" information The desired timing precision thus determines the resolution needed for encodmg of leading flanks T q is much smaller than the minimum note period T nm ⁇ n , smce small time deviations within the period clearly can be heard In most cases however, the transient has significant energy in the lowband
- T m the gain- induced pre-echoes must fall within the so called pre- or backward masking time T m of the human auditory system m order to be inaudible Hence T q must satisfy two conditions
- T m ⁇ T Formula mm (otherwise the notes would be so fast that they could not be resolved) and according to ["Modeling the Additivity of Nonsimultaneous Masking", Hearmg Res , vol 80, pp 105-118 (1994)], T m amounts to 10-20 ms Smce T classroom mm is in the 50ms range, a reasonable selection of T q according to Eq 3 results m that the second condition is also met
- the precision of the transient detection in the encoder and the time resolution of the analysis/synthesis filterbank must also be considered when selecting T q
- the note-off position has little or no effect on the perceived rhythm
- most instruments do not exhibit sharp trailing flanks, but rather a smooth decay curve, l e a well defined note-off time does not exist
- the post- or forward masking time is substantially longer than the pre-masking time
- Time/Frequency Switched Scalefactor Encoding Utilizing a time to frequency transform it can be shown that a pulse in the time domain corresponds to a flat spectrum m the frequency domain, and a "pulse" in the frequency domain, l e a single sinusoidal, corresponds to a quasi-stationary signal in the time domain In other words a signal usually shows more transient properties m one domain than the other In a spectrogram, l e a time/frequency matrix display, this property is evident, and can advantageously be used when codmg spectral envelopes
- a tonal stationary signal can have a very sparse spectrum not suitable for delta codmg m the frequency-direction, but well suited for delta codmg in the time-direction, and vice versa This is displayed in Fig 4 Throughout the following description a vector of scale factors calculated at time n 0 represents the spectral envelope
- Y(k,n 0 ) [a ⁇ , a 2 , a 3 , , a k , ,a N ], (Eq 5) where ⁇ a N are the amplitude values for different frequencies Common practice is to code the difference between adjacent values in the frequency-direction at a given time, which yields
- Start values are transmitted whenever the spectral envelope is coded m the frequency direction but not when coded in the time direction smce they are available at the decoder, through the previous envelope
- the proposed algorithm also require extra information to be transmitted, namely a time/frequency flag indicating in which direction the spectral envelope was coded
- the T/F algorithm can advantageously be used with several different coding schemes of the scalefactor-envelope representation apart from DPCM and Huffman, such as ADPCM, LPC and vector quantisation
- the proposed T/F algorithm gives significant bitrate-reduction for the spectral-envelope data, up to around 20% reduction compared to commonly used delta-coding techniques If the number of scalefactors per octave is constant, it is possible to delta code on an octave basis instead of delta codmg of adjacent scale factors
- the analogue input signal is fed to an A/D- converter 501, forming a digital signal
- the digital audio signal is fed to a perceptual audio encoder 502, where source codmg is performed
- the digital signal is fed to a transient detector 503 and to an analysis filterbank 504, which splits the signal mto its spectral equivalents (subband signals)
- the transient detector could operate on the subband signals from the analysis bank, but for generality purposes it is here assumed to operate on the digital time domam samples directly
- the transient detector divides the signal mto granules and determines, accordmg to the invention, whether subgranules withm the granules is to be flagged as transient
- This information is sent to the envelope groupmg block 505, which specifies the time/frequency grid to be used for the current granule Accordmg to the grid, the block combmes the uniform sampled subband signals, to form the non-uniform
- the decoder side of the invention is shown in Fig 6
- the demultiplexer 601 restores the signals and feeds the approp ⁇ ate part to an audio decoder 602, which produces a low band digital audio signal
- the envelope information is fed from the demultiplexer to the envelope decodmg block 603, which, by use of control data, determines in which direction the current envelope are coded and decodes the data
- the low band signal from the audio decoder is routed to the transposition module 604, which generates a replicated high band signal consisting of one or several harmomcs from the low band signal.
- the high band signal is fed to an analysis filterbank 606, which is of the same type as on the encoder side.
- the subband signals are combined m the scalefactor grouping umt 607.
- the same type of combination and time/frequency distribution of the subband samples is adopted as on the encoder side.
- the envelope information from the demultiplexer and the information from the scalefactor groupmg umt is processed m the gam control module 608.
- the module computes gam factors to be applied to the subband samples before recombmation in the synthesis filterbank block 609.
- the output from the synthesis filterbank is thus an envelope adjusted high band audio signal This signal is added to the output from the delay unit 605, which is fed with the low band audio signal.
- the delay compensates for the processmg time of the high band signal
- the obtamed digital wideband signal is converted to an analogue audio signal m the digital to analogue converter 610
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Priority Applications (19)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU25856/00A AU2585600A (en) | 1999-01-27 | 2000-01-26 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
US09/763,128 US6978236B1 (en) | 1999-10-01 | 2000-01-26 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
DK00968271T DK1216474T3 (da) | 1999-10-01 | 2000-09-29 | Effektiv spektral indhyllingskurvekodning ved anvendelse af variabel tids/frekvensoplösning |
PT00968271T PT1216474E (pt) | 1999-10-01 | 2000-09-29 | Codificacao eficiente de envolvente especial utilizando resolucao tempo/frequencia variavel |
AT00968271T ATE271250T1 (de) | 1999-10-01 | 2000-09-29 | Kodierung der hüllkurve des spektrums mittels variabler zeit/frequenz-auflösung |
BRPI0014642A BRPI0014642B1 (pt) | 1999-10-01 | 2000-09-29 | codificação de envelope espectral usando resolução de tempo-frequência variável e mudança de tempo-frequência |
EP00968271A EP1216474B1 (fr) | 1999-10-01 | 2000-09-29 | Codage efficace de l'enveloppe spectrale par resolution temps/frequence variable |
CNB008136025A CN1172293C (zh) | 1999-10-01 | 2000-09-29 | 有效频谱包络编码方法及其编解码设备 |
DE60012198T DE60012198T2 (de) | 1999-10-01 | 2000-09-29 | Kodierung der hüllkurve des spektrums mittels variabler zeit/frequenz-auflösung |
AU78212/00A AU7821200A (en) | 1999-10-01 | 2000-09-29 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
JP2001528974A JP4035631B2 (ja) | 1999-10-01 | 2000-09-29 | 可変時間/周波数分解能および時間/周波数切り替えを使用する効率的なスペクトルエンベロープ符号化 |
ES00968271T ES2223591T3 (es) | 1999-10-01 | 2000-09-29 | Codificacion eficaz de envolvente especial utilizando una resolucion tiempo/frecuencia variable. |
PCT/SE2000/001887 WO2001026095A1 (fr) | 1999-10-01 | 2000-09-29 | Codage efficace de l'enveloppe spectrale par resolution temps/frequence et commutation temps/frequence variables |
RU2002111665/09A RU2236046C2 (ru) | 1999-10-01 | 2000-09-29 | Эффективное кодирование огибающей спектра с использованием переменного разрешения по времени и по частоте и переключения время/частота |
HK03101398.3A HK1049401B (zh) | 1999-10-01 | 2003-02-24 | 有效頻譜包絡編碼方法及其編解碼設備 |
JP2005292388A JP4334526B2 (ja) | 1999-10-01 | 2005-10-05 | 可変時間/周波数分解能および時間/周波数切り替えを使用する効率的なスペクトルエンベロープ符号化 |
JP2005292384A JP4628921B2 (ja) | 1999-10-01 | 2005-10-05 | 可変時間/周波数分解能および時間/周波数切り替えを使用する効率的なスペクトルエンベロープ符号化 |
US11/246,283 US7181389B2 (en) | 1999-10-01 | 2005-10-11 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
US11/246,284 US7191121B2 (en) | 1999-10-01 | 2005-10-11 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9900256A SE9900256D0 (sv) | 1999-01-27 | 1999-01-27 | Metod och anordning för förbättring av effektivitet och ljudkvalitet hos ljudkodare |
SE9900256-0 | 1999-01-27 | ||
SE9903552-9 | 1999-10-01 | ||
SE9903552A SE9903552D0 (sv) | 1999-01-27 | 1999-10-01 | Efficient spectral envelope coding using dynamic scalefactor grouping and time/frequency switching |
Related Child Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09763128 A-371-Of-International | 2000-01-26 | ||
US11/246,284 Division US7191121B2 (en) | 1999-10-01 | 2005-10-11 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
US11/246,283 Division US7181389B2 (en) | 1999-10-01 | 2005-10-11 | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2000045378A2 true WO2000045378A2 (fr) | 2000-08-03 |
WO2000045378A3 WO2000045378A3 (fr) | 2000-11-16 |
Family
ID=26663488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/SE2000/000158 WO2000045378A2 (fr) | 1999-01-27 | 2000-01-26 | Codage efficient de l'enveloppe spectrale mettant en oeuvre une resolution temps frequence et une commutation temps frequence |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU2585600A (fr) |
SE (1) | SE9903552D0 (fr) |
WO (1) | WO2000045378A2 (fr) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002093560A1 (fr) * | 2001-05-10 | 2002-11-21 | Dolby Laboratories Licensing Corporation | Ameliorations apportees a la performance transitoire de systemes de codage audio a faible debit binaire par reduction du pre-bruit |
US7246065B2 (en) | 2002-01-30 | 2007-07-17 | Matsushita Electric Industrial Co., Ltd. | Band-division encoder utilizing a plurality of encoding units |
US7283954B2 (en) | 2001-04-13 | 2007-10-16 | Dolby Laboratories Licensing Corporation | Comparing audio using characterizations based on auditory events |
EP1603117A3 (fr) * | 2001-07-10 | 2008-02-06 | Coding Technologies Sweden AB | Codage stéréo paramétrique efficace et échelonnable pour applications à faible débit |
US7461002B2 (en) | 2001-04-13 | 2008-12-02 | Dolby Laboratories Licensing Corporation | Method for time aligning audio signals using characterizations based on auditory events |
US7508947B2 (en) | 2004-08-03 | 2009-03-24 | Dolby Laboratories Licensing Corporation | Method for combining audio signals using auditory scene analysis |
US7610205B2 (en) | 2002-02-12 | 2009-10-27 | Dolby Laboratories Licensing Corporation | High quality time-scaling and pitch-scaling of audio signals |
WO2010003543A1 (fr) * | 2008-07-11 | 2010-01-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Appareil et procédé de calcul de données d'extension de bande passante utilisant un découpage en trames contrôlant la balance spectrale |
US7711123B2 (en) | 2001-04-13 | 2010-05-04 | Dolby Laboratories Licensing Corporation | Segmenting audio signals into auditory events |
US8041578B2 (en) | 2006-10-18 | 2011-10-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
US8126721B2 (en) | 2006-10-18 | 2012-02-28 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
US8170882B2 (en) | 2004-03-01 | 2012-05-01 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US8280743B2 (en) | 2005-06-03 | 2012-10-02 | Dolby Laboratories Licensing Corporation | Channel reconfiguration with side information |
US8417532B2 (en) | 2006-10-18 | 2013-04-09 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
US8605911B2 (en) | 2001-07-10 | 2013-12-10 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
WO2014118179A1 (fr) * | 2013-01-29 | 2014-08-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codeurs audio, décodeurs audio, systèmes, procédés et programmes d'ordinateur utilisant une résolution temporelle accrue à proximité temporelle de débuts ou de fins de fricatives ou d'affriquées |
US9043200B2 (en) | 2005-04-13 | 2015-05-26 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Adaptive grouping of parameters for enhanced coding efficiency |
US9761237B2 (en) | 2001-11-29 | 2017-09-12 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
RU2650031C2 (ru) * | 2013-08-29 | 2018-04-06 | Долби Интернэшнл Аб | Проектирование таблицы частотных диапазонов для алгоритмов высокочастотной реконструкции |
US10157623B2 (en) | 2002-09-18 | 2018-12-18 | Dolby International Ab | Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5504832A (en) * | 1991-12-24 | 1996-04-02 | Nec Corporation | Reduction of phase information in coding of speech |
US5581653A (en) * | 1993-08-31 | 1996-12-03 | Dolby Laboratories Licensing Corporation | Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder |
US5737718A (en) * | 1994-06-13 | 1998-04-07 | Sony Corporation | Method, apparatus and recording medium for a coder with a spectral-shape-adaptive subband configuration |
WO1998057436A2 (fr) * | 1997-06-10 | 1998-12-17 | Lars Gustaf Liljeryd | Amelioration de codage de la source par reproduction de la bande spectrale |
US5852806A (en) * | 1996-03-19 | 1998-12-22 | Lucent Technologies Inc. | Switched filterbank for use in audio signal coding |
-
1999
- 1999-10-01 SE SE9903552A patent/SE9903552D0/xx unknown
-
2000
- 2000-01-26 WO PCT/SE2000/000158 patent/WO2000045378A2/fr active Application Filing
- 2000-01-26 AU AU25856/00A patent/AU2585600A/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5504832A (en) * | 1991-12-24 | 1996-04-02 | Nec Corporation | Reduction of phase information in coding of speech |
US5581653A (en) * | 1993-08-31 | 1996-12-03 | Dolby Laboratories Licensing Corporation | Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder |
US5737718A (en) * | 1994-06-13 | 1998-04-07 | Sony Corporation | Method, apparatus and recording medium for a coder with a spectral-shape-adaptive subband configuration |
US5852806A (en) * | 1996-03-19 | 1998-12-22 | Lucent Technologies Inc. | Switched filterbank for use in audio signal coding |
WO1998057436A2 (fr) * | 1997-06-10 | 1998-12-17 | Lars Gustaf Liljeryd | Amelioration de codage de la source par reproduction de la bande spectrale |
Non-Patent Citations (2)
Title |
---|
BOSI M. ET AL.: 'Time versus frequency resolution in a low-rate, high quality audio transform coder' IEEE ASSP WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, FINAL PROGRAM AND PAPER SUMMARIES 1991, pages 0_81 - 0_82 * |
PRINCEN J. ET AL.: 'Audio coding with signal adaptive filterbanks' INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP-95 vol. 5, 1995, pages 3071 - 3074 * |
Cited By (73)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7711123B2 (en) | 2001-04-13 | 2010-05-04 | Dolby Laboratories Licensing Corporation | Segmenting audio signals into auditory events |
US8195472B2 (en) | 2001-04-13 | 2012-06-05 | Dolby Laboratories Licensing Corporation | High quality time-scaling and pitch-scaling of audio signals |
US7283954B2 (en) | 2001-04-13 | 2007-10-16 | Dolby Laboratories Licensing Corporation | Comparing audio using characterizations based on auditory events |
US9165562B1 (en) | 2001-04-13 | 2015-10-20 | Dolby Laboratories Licensing Corporation | Processing audio signals with adaptive time or frequency resolution |
US7461002B2 (en) | 2001-04-13 | 2008-12-02 | Dolby Laboratories Licensing Corporation | Method for time aligning audio signals using characterizations based on auditory events |
US8488800B2 (en) | 2001-04-13 | 2013-07-16 | Dolby Laboratories Licensing Corporation | Segmenting audio signals into auditory events |
CN1312662C (zh) * | 2001-05-10 | 2007-04-25 | 杜比实验室特许公司 | 通过降低前噪声改善音频编码系统的瞬时性能的方法 |
US7313519B2 (en) | 2001-05-10 | 2007-12-25 | Dolby Laboratories Licensing Corporation | Transient performance of low bit rate audio coding systems by reducing pre-noise |
WO2002093560A1 (fr) * | 2001-05-10 | 2002-11-21 | Dolby Laboratories Licensing Corporation | Ameliorations apportees a la performance transitoire de systemes de codage audio a faible debit binaire par reduction du pre-bruit |
CN1758338B (zh) * | 2001-07-10 | 2010-11-17 | 杜比国际公司 | 用于低比特率音频编码应用的高效可标度参数立体声编码 |
US8081763B2 (en) | 2001-07-10 | 2011-12-20 | Coding Technologies Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US10297261B2 (en) | 2001-07-10 | 2019-05-21 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US10540982B2 (en) | 2001-07-10 | 2020-01-21 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
EP2015292A1 (fr) * | 2001-07-10 | 2009-01-14 | Dolby Sweden AB | Codage stéréo paramétrique efficace et échelonnable pour applications à faible débit |
US8014534B2 (en) | 2001-07-10 | 2011-09-06 | Coding Technologies Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US9799340B2 (en) | 2001-07-10 | 2017-10-24 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US8059826B2 (en) | 2001-07-10 | 2011-11-15 | Coding Technologies Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US8073144B2 (en) | 2001-07-10 | 2011-12-06 | Coding Technologies Ab | Stereo balance interpolation |
US9865271B2 (en) | 2001-07-10 | 2018-01-09 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate applications |
US8116460B2 (en) * | 2001-07-10 | 2012-02-14 | Coding Technologies Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US9799341B2 (en) | 2001-07-10 | 2017-10-24 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate applications |
US9792919B2 (en) | 2001-07-10 | 2017-10-17 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate applications |
EP1603117A3 (fr) * | 2001-07-10 | 2008-02-06 | Coding Technologies Sweden AB | Codage stéréo paramétrique efficace et échelonnable pour applications à faible débit |
CN101887724B (zh) * | 2001-07-10 | 2012-05-30 | 杜比国际公司 | 编码功率频谱包络的解码方法 |
US10902859B2 (en) | 2001-07-10 | 2021-01-26 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US8243936B2 (en) | 2001-07-10 | 2012-08-14 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US8605911B2 (en) | 2001-07-10 | 2013-12-10 | Dolby International Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US9818418B2 (en) | 2001-11-29 | 2017-11-14 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US9761237B2 (en) | 2001-11-29 | 2017-09-12 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US11238876B2 (en) | 2001-11-29 | 2022-02-01 | Dolby International Ab | Methods for improving high frequency reconstruction |
US10403295B2 (en) | 2001-11-29 | 2019-09-03 | Dolby International Ab | Methods for improving high frequency reconstruction |
US9812142B2 (en) | 2001-11-29 | 2017-11-07 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US9792923B2 (en) | 2001-11-29 | 2017-10-17 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US9779746B2 (en) | 2001-11-29 | 2017-10-03 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US9761236B2 (en) | 2001-11-29 | 2017-09-12 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US9761234B2 (en) | 2001-11-29 | 2017-09-12 | Dolby International Ab | High frequency regeneration of an audio signal with synthetic sinusoid addition |
US7246065B2 (en) | 2002-01-30 | 2007-07-17 | Matsushita Electric Industrial Co., Ltd. | Band-division encoder utilizing a plurality of encoding units |
US7610205B2 (en) | 2002-02-12 | 2009-10-27 | Dolby Laboratories Licensing Corporation | High quality time-scaling and pitch-scaling of audio signals |
US10157623B2 (en) | 2002-09-18 | 2018-12-18 | Dolby International Ab | Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks |
US9691405B1 (en) | 2004-03-01 | 2017-06-27 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US10460740B2 (en) | 2004-03-01 | 2019-10-29 | Dolby Laboratories Licensing Corporation | Methods and apparatus for adjusting a level of an audio signal |
US9704499B1 (en) | 2004-03-01 | 2017-07-11 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9715882B2 (en) | 2004-03-01 | 2017-07-25 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US11308969B2 (en) | 2004-03-01 | 2022-04-19 | Dolby Laboratories Licensing Corporation | Methods and apparatus for reconstructing audio signals with decorrelation and differentially coded parameters |
US9691404B2 (en) | 2004-03-01 | 2017-06-27 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US9672839B1 (en) | 2004-03-01 | 2017-06-06 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9779745B2 (en) | 2004-03-01 | 2017-10-03 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9640188B2 (en) | 2004-03-01 | 2017-05-02 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US8170882B2 (en) | 2004-03-01 | 2012-05-01 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US10796706B2 (en) | 2004-03-01 | 2020-10-06 | Dolby Laboratories Licensing Corporation | Methods and apparatus for reconstructing audio signals with decorrelation and differentially coded parameters |
US9697842B1 (en) | 2004-03-01 | 2017-07-04 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US10403297B2 (en) | 2004-03-01 | 2019-09-03 | Dolby Laboratories Licensing Corporation | Methods and apparatus for adjusting a level of an audio signal |
US10269364B2 (en) | 2004-03-01 | 2019-04-23 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US8983834B2 (en) | 2004-03-01 | 2015-03-17 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US7508947B2 (en) | 2004-08-03 | 2009-03-24 | Dolby Laboratories Licensing Corporation | Method for combining audio signals using auditory scene analysis |
US9043200B2 (en) | 2005-04-13 | 2015-05-26 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Adaptive grouping of parameters for enhanced coding efficiency |
US8280743B2 (en) | 2005-06-03 | 2012-10-02 | Dolby Laboratories Licensing Corporation | Channel reconfiguration with side information |
US8041578B2 (en) | 2006-10-18 | 2011-10-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
US8417532B2 (en) | 2006-10-18 | 2013-04-09 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
US8126721B2 (en) | 2006-10-18 | 2012-02-28 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
AU2009267529B2 (en) * | 2008-07-11 | 2011-03-03 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for calculating bandwidth extension data using a spectral tilt controlling framing |
TWI457914B (zh) * | 2008-07-11 | 2014-10-21 | Fraunhofer Ges Forschung | 用以使用頻譜傾斜控制訊框化技術來計算帶寬延伸資料之裝置和方法 |
RU2443028C2 (ru) * | 2008-07-11 | 2012-02-20 | Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен | Устройство и способ расчета параметров расширения полосы пропускания посредством управления фреймами наклона спектра |
WO2010003543A1 (fr) * | 2008-07-11 | 2010-01-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Appareil et procédé de calcul de données d'extension de bande passante utilisant un découpage en trames contrôlant la balance spectrale |
US8788276B2 (en) | 2008-07-11 | 2014-07-22 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for calculating bandwidth extension data using a spectral tilt controlled framing |
WO2014118179A1 (fr) * | 2013-01-29 | 2014-08-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codeurs audio, décodeurs audio, systèmes, procédés et programmes d'ordinateur utilisant une résolution temporelle accrue à proximité temporelle de débuts ou de fins de fricatives ou d'affriquées |
US10438596B2 (en) | 2013-01-29 | 2019-10-08 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoders, audio decoders, systems, methods and computer programs using an increased temporal resolution in temporal proximity of onsets or offsets of fricatives or affricates |
EP3680899A1 (fr) * | 2013-01-29 | 2020-07-15 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Codeurs audio, décodeurs audio, systèmes, procédés et programmes informatiques utilisant une résolution temporelle améliorée dans la proximité temporelle de débuts ou de décalages de phonèmes fricatifs ou affriqués |
RU2651425C2 (ru) * | 2013-01-29 | 2018-04-19 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Аудиокодеры, аудиодекодеры, системы, способы и компьютерные программы, использующие увеличенное временное разрешение во временной окрестности появлений или исчезновений фрикативных согласных или аффрикатов |
US11205434B2 (en) | 2013-01-29 | 2021-12-21 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoders, audio decoders, systems, methods and computer programs using an increased temporal resolution in temporal proximity of onsets or offsets of fricatives or affricates |
EP3279894A1 (fr) * | 2013-01-29 | 2018-02-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codeurs audio, décodeurs audio, systèmes, procédés et programmes d'ordinateur utilisant une résolution temporelle accrue à proximité temporelle de débuts ou de fins de fricatives ou d'affriquées |
EP4336501A3 (fr) * | 2013-01-29 | 2024-05-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codeur audio, procédé et programme informatique utilisant une résolution temporelle accrue dans la proximité temporelle d'apparitions ou de décalages de friandises ou d'affricates |
RU2650031C2 (ru) * | 2013-08-29 | 2018-04-06 | Долби Интернэшнл Аб | Проектирование таблицы частотных диапазонов для алгоритмов высокочастотной реконструкции |
Also Published As
Publication number | Publication date |
---|---|
AU2585600A (en) | 2000-08-18 |
SE9903552D0 (sv) | 1999-10-01 |
WO2000045378A3 (fr) | 2000-11-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6978236B1 (en) | Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching | |
EP1730725B1 (fr) | Codage efficace de donnees audio numeriques spectrales utilisant de la similitude spectrale | |
WO2000045378A2 (fr) | Codage efficient de l'enveloppe spectrale mettant en oeuvre une resolution temps frequence et une commutation temps frequence | |
US5819215A (en) | Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data | |
US6502069B1 (en) | Method and a device for coding audio signals and a method and a device for decoding a bit stream | |
US8924201B2 (en) | Audio encoder and decoder | |
EP1904999B1 (fr) | Segmentation de frequence permettant d'obtenir des bandes de codage efficace de donnees multimedia numeriques | |
EP2056294B1 (fr) | Appareil, support et procédé pour coder et décoder un signal haute fréquence | |
US7548853B2 (en) | Scalable compressed audio bit stream and codec using a hierarchical filterbank and multichannel joint coding | |
JP4220461B2 (ja) | 時間的に離散した音声信号のアップサンプリングした信号を発生する方法と装置 | |
JP3071795B2 (ja) | サブバンドコーディング方法および装置 | |
US10255928B2 (en) | Apparatus, medium and method to encode and decode high frequency signal | |
RU2752127C2 (ru) | Усовершенствованный квантователь | |
EP1423847A1 (fr) | Procede permettant d'ameliorer la reconstruction des hautes frequences | |
JPH0629859A (ja) | デジタル入力信号符号化方法 | |
AU6216498A (en) | Audio coding method and apparatus | |
WO2009059632A1 (fr) | Codeur | |
CN114550732A (zh) | 一种高频音频信号的编解码方法和相关装置 | |
AU2011205144B2 (en) | Scalable compressed audio bit stream and codec using a hierarchical filterbank and multichannel joint coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09763128 Country of ref document: US |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase |