WO2013189030A1 - Procédé de codage audio stéréo ou monophonique - Google Patents

Procédé de codage audio stéréo ou monophonique Download PDF

Info

Publication number
WO2013189030A1
WO2013189030A1 PCT/CN2012/077155 CN2012077155W WO2013189030A1 WO 2013189030 A1 WO2013189030 A1 WO 2013189030A1 CN 2012077155 W CN2012077155 W CN 2012077155W WO 2013189030 A1 WO2013189030 A1 WO 2013189030A1
Authority
WO
WIPO (PCT)
Prior art keywords
encoding
layer
stereo
enhancement layer
mono
Prior art date
Application number
PCT/CN2012/077155
Other languages
English (en)
Chinese (zh)
Inventor
王磊
闫建新
Original Assignee
深圳广晟信源技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳广晟信源技术有限公司 filed Critical 深圳广晟信源技术有限公司
Priority to PCT/CN2012/077155 priority Critical patent/WO2013189030A1/fr
Priority to CN201280000961.1A priority patent/CN104170007B/zh
Publication of WO2013189030A1 publication Critical patent/WO2013189030A1/fr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Definitions

  • This invention relates to the field of audio coding processing, and more particularly to a method of encoding mono or stereo.
  • AAC-SSR Advanced Audio Coding-Scalable Sampling Rate
  • MPEG-4 Part 3 and MPEG-2 Part 7 the encoding architecture is similar to its unique ARTAC (Adaptive Transform Acoustic Coding) encoding.
  • the coding scheme first divides the input digital audio signal into four frequency bands through a 4-band polyphase quadrature filter (PQF, PJF, and then performs one 256-point MDCT for each of the four frequency bands.
  • the coding scheme can also reduce the data rate by removing the high PQF band, and achieve bitstream layering by reducing the frequency band, thereby obtaining different bit rates and sampling rates.
  • the advantage of this coding scheme is that independent blocks can be selected independently in each frequency band. Or short block MDCT, so the high-frequency can use short block coding to enhance the time resolution; and the low-frequency use long block coding to obtain high frequency resolution.
  • the coding efficiency of the transform domain coefficients of adjacent parts will decrease. Summary of the invention
  • the present invention provides a method for encoding mono or stereo, comprising: dividing a mono or stereo audio signal into a basic layer and at least one enhancement layer; using mp3, A for the base layer AC, SBR, PS, and/or DRA coding mode coding; encoding at least one enhancement layer using mp3, AAC, SBR, PS, DRA, residual coding, partial parameter coding algorithm, and/or parameter coding algorithm, respectively.
  • the above dividing the mono or stereo audio signal into a base layer and an enhancement layer is: dividing the mono or stereo audio signal into a base layer and an enhancement layer based on the frequency band, and the base layer is mono or The low frequency encoding portion of the stereo; the enhancement layer is a mono or stereo high frequency encoding portion; or the stereo audio signal is divided into a base layer and an enhancement layer based on the channel, and the base layer transmits the left channel or the channel; The layer transmits the right channel or the difference channel; or the stereo audio signal is divided into a base layer and an enhancement layer based on the parametric stereo coding, the base layer transmits a single channel of the left and right channel downmix; the enhancement layer transmits the parameter stereo information; or The mono or stereo audio signal is divided into a base layer and an enhancement layer based on the residual differential layer structure.
  • the foregoing base layer and/or at least one enhancement layer are respectively coded by using a bandwidth extension algorithm.
  • the step of separately encoding the base layer and the enhancement layer obtained by dividing the residual difference layer structure comprises: supplementing the base layer low frequency coding part according to the enhancement layer low frequency residual; and modifying the parameter to the base layer by using the enhancement layer bandwidth extension.
  • the bandwidth extension parameters are adjusted.
  • the base layer includes encoding the downmixed channel low frequency portion for encoding and bandwidth extension and parametric stereo encoding information; and the enhancement layer transmits the residual encoding of the low frequency portion.
  • the base layer transmits the low frequency partial coding information of the downmixed mono signal; the enhancement layer transmits the low frequency partial residual coding information and the bandwidth extension and the parameter stereo coding information.
  • the step of encoding the base layer includes: encoding according to a code rate requirement of the base layer, and putting the obtained encoded data into a base layer transmission; comparing the original audio with the restored audio of the base layer decoding to obtain a residual signal.
  • the step of encoding the enhancement layer is to encode the residual signal as an enhancement layer.
  • the dividing the mono or stereo audio signal into a base layer, the first enhancement layer and the second enhancement layer is: dividing the mono or stereo audio signal into a base layer, a first enhancement layer, and a second enhancement layer, wherein the base layer is a mono or stereo low frequency coding portion; the first enhancement layer is a mono or stereo intermediate frequency coding portion; and the second enhancement layer is a mono or stereo high frequency coding portion.
  • the above-described residual channel layer structure divides the mono or stereo audio signal into a basic layer and at least one enhancement layer; and the step of encoding the base layer includes: encoding according to the code rate requirement of the base layer, The full-band basic quality coded data is placed in the base layer transmission; the original audio is compared with the base layer decoded and recovered audio to obtain a first-stage residual signal; and the first enhancement layer and/or the second enhancement layer are encoded
  • the method includes: encoding the first-level residual signal as the data of the first enhancement layer; removing the signal decoded and restored by the first enhancement layer from the input first-stage residual signal of the first enhancement layer coding, to obtain the second level Residual signal; encoding the second-level residual signal as the data of the second enhancement layer; sequentially obtaining the next-level residual signal according to the residual signal of the previous stage, and encoding the residual signal of the next-level as the next
  • the data of the level enhancement layer is encoded until all enhancement layers are completed.
  • the step of encoding the base layer comprises: performing MDCT transformation on the time domain data x[n] to obtain a spectral coefficient X[k] at the encoding end ; dividing the frequency domain coefficient into a plurality of subbands, belonging to the subband b The spectral coefficient is divided by a quantization step; the quantization step is rounded (nint) to obtain the quantized spectral coefficient Each quantization step size and spectral coefficient X [W is transmitted to the decoder.
  • the step of separately encoding the at least one enhancement layer comprises: performing MDCT transformation on the time domain data x[n] to obtain a spectral coefficient X[k] at the encoding end ; dividing the frequency domain coefficient into a plurality of subbands, belonging to The spectral coefficient of subband b is divided by a quantization step; after the quantization step is rounded (nint) to be quantized ; Each spectral coefficients and quantization step size XW transmitted to the decoding side; restored by the inverse quantization step size and quantization spectrum f W is the number of spectral coefficients f
  • ⁇ k] A b - X[k] .
  • the number is divided into multiple sub-bands, and the spectral coefficient belonging to the sub-band c is divided by a residual spectral coefficient quantization step, and the quantized residual is obtained by nint Transmitting the residual spectral coefficient quantization step size and the quantized residual spectral coefficient to the decoding end.
  • the present invention performs coarse layering on mono or stereo, generally only 2 or 3 layers, and is simple to implement to ensure more efficient compression without the various constraints of fine layering technology.
  • the best integrated sound quality can be obtained by flexibly controlling the quality of each channel; it is easy to meet channel coding requirements.
  • FIG. 1 is a schematic diagram of layering a mono or stereo according to an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a coding process of an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of layering an audio signal based on a layered structure of a frequency band according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of layering an audio signal based on a layered structure of a channel according to an embodiment of the present invention
  • FIG. 5 is a schematic diagram of layering an audio signal based on a layered structure of parametric stereo coding according to an embodiment of the present invention
  • FIG. 6 is a schematic diagram of a layered structure according to an embodiment of the present invention
  • FIG. 7 is a schematic diagram of layering an audio signal based on a hierarchical structure of residuals according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of a two-layer structure based on a residual difference layer when a base layer has a bandwidth extension algorithm according to an embodiment of the present invention
  • FIG. 9 is a schematic diagram of a two-layer structure based on a residual difference layer when the enhancement layer has a bandwidth extension algorithm according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of a two-layer architecture based on a residual differential layer with bandwidth extension and bandwidth extension correction in an enhancement layer according to an embodiment of the present invention.
  • FIG. 11 is a schematic structural diagram of layering a stereo audio signal according to an embodiment of the present invention
  • FIG. 12 is a schematic structural diagram of layering a stereo audio signal according to an embodiment of the present invention
  • FIG. 14 is a schematic diagram of another audio layered multi-layer structure according to an embodiment of the present invention.
  • FIG. 15 is a schematic diagram of an audio layered structure according to an embodiment of the present invention.
  • 16 is a simplified schematic diagram of a dra algorithm according to an embodiment of the present invention.
  • FIG. 17 is a schematic diagram of a DRA kernel residual coding algorithm according to an embodiment of the present invention.
  • FIG. 18 is a schematic diagram of a layered structure of stereo audio according to an embodiment of the present invention. detailed description
  • the method for encoding mono or stereo in this embodiment includes:
  • Step S1 dividing the mono or stereo audio signal into a basic layer and at least one enhancement layer
  • Step S2 encoding the basic layer by using mp3, AAC, SBR, PS, and/or DRA coding modes
  • Step S3 encoding, by using at least one enhancement layer, mp3, AAC, SBR, PS, DRA, residual coding, partial parameter coding algorithm, and/or parameter coding algorithm.
  • the present invention provides a series of different layering schemes.
  • the present invention divides the mono or stereo audio signal into a basic layer and an enhancement layer based on the frequency band, sequentially from low frequency to high frequency.
  • the audio coding information of each frequency band is placed in the base layer and the enhancement layer.
  • the base layer is the low frequency encoding portion of the mono or stereo sound; the enhancement layer is the mono or stereo high frequency encoding portion.
  • the high frequency partial coding can participate in the same algorithm as the low frequency part, or use a parameter method such as a bandwidth extension algorithm.
  • the basic layer generally adopts normal coding algorithms such as mp3, AAC or DRA, etc.
  • the enhancement layer can still use normal coding algorithms, partial parameter coding algorithms such as intensity stereo, parameter coding algorithms such as bandwidth extension.
  • the advantage of the band stratification scheme is to guarantee the quality of the low frequencies. Referring to the channel-based hierarchical structure shown in FIG. 4, the audio signal is layered.
  • the present invention divides the stereo audio signal into a basic layer and an enhancement layer based on the channel, and the base layer transmits the left channel or the harmony.
  • the enhancement layer transmits the right channel or the difference channel.
  • the bandwidth extension algorithm can be selected for any single channel, such as the left channel or the channel, to improve subjective sound quality at low bit rates and to ensure a broadband quality.
  • the present invention divides a stereo audio signal into a basic layer and an enhancement layer based on parametric stereo coding, and the base layer transmits left and right channels.
  • Mixed single channel; enhancement layer transmits parametric stereo information.
  • each layer is coded under the layering scheme, and the low-band portion of the base layer may select a single channel after the left-right channel downmixing using the bandwidth extension algorithm; the enhancement layer transmission
  • the parameter is stereo information, and the high frequency portion of the downmix channel encoded by the transmission bandwidth extension algorithm can also be selected.
  • the layering scheme and the coding scheme can achieve higher quality at a low bit rate.
  • the audio signal is layered.
  • the present invention divides the mono or stereo audio signal into a basic layer and an enhancement layer based on the residual differential layer structure.
  • the steps of encoding the base layer and the enhancement layer include:
  • Step S21 Encoding according to a code rate requirement of the base layer, and putting the obtained coded data into a basic layer for transmission;
  • Step S22 Compare the original audio with the restored audio of the base layer decoding to obtain a residual signal.
  • Step S3 The step of encoding the enhancement layer is to encode the residual signal as an enhancement layer.
  • the normal encoding is first performed according to the code rate requirement of the first layer, and the encoded data is transmitted in the base layer; then the original audio and the base layer are decoded and restored.
  • the audio comparison acquires the residual signal (either in the time domain or in the transform domain) and continues to encode the residual signal as an enhancement layer.
  • the audio signal can be layered in a plurality of hierarchical structures.
  • the base layer shown in FIG. 8 has a two-layer structure diagram based on the residual difference layer when the bandwidth layer has a bandwidth extension algorithm
  • FIG. 9 is a schematic diagram of a two-layer structure based on the residual difference layer when the enhancement layer has a bandwidth extension algorithm
  • the basic layer has a two-layer structure diagram based on the residual differential layer with bandwidth extension and enhancement layer bandwidth extension correction.
  • the base layer low frequency coding portion is supplemented according to the enhancement layer low frequency residual, a more accurate low frequency portion is obtained, and the base layer bandwidth extension parameter is adjusted by the enhancement layer bandwidth extension correction parameter to better Restore the high frequency portion of each channel.
  • the base layer includes the channel low frequency partial coding of the downmix and the bandwidth extension and parametric stereo coding information, and the enhancement layer transmits the residual coding of the low frequency portion.
  • the base layer transmits the low frequency partial coding information of the downmixed mono signal
  • the enhancement layer transmits the low frequency partial residual coding information and the bandwidth extension and the parameter stereo coding. information.
  • the layered structure of the audio signal is simple, and the coding efficiency is improved.
  • the present invention also proposes that, in addition to a two-layer structure of a base layer and a reinforcement layer, the audio signal can be divided into a multilayer structure of a base layer and a plurality of enhancement layers.
  • FIG. 13 a schematic diagram of an audio layered multi-layer structure, which divides a mono or stereo audio signal into a base layer, a first enhancement layer, and a second enhancement layer, wherein the base layer is mono or stereo.
  • the present invention may further divide a mono or stereo audio signal into a base layer and at least one enhancement layer based on the residual differential layer structure.
  • the step S2 of encoding the base layer includes:
  • Step S21 Encoding according to the code rate requirement of the base layer, and putting the obtained full-band basic quality coded data into the base layer transmission;
  • Step S22 Compare the original audio with the audio restored by the base layer decoding to obtain a first-level residual signal.
  • the step S3 of encoding the first enhancement layer and/or the second enhancement layer includes:
  • Step S31 encoding the first-level residual signal as the data of the first enhancement layer
  • Step S32 removing the signal decoded and restored by the first enhancement layer from the input first-stage residual signal of the first enhancement layer coding, Obtaining a second level residual signal
  • Step S33 encoding the second-level residual signal as the data of the second enhancement layer
  • Step S34 sequentially obtaining the next-level residual signal according to the residual signal of the previous stage, and encoding the residual signal of the next stage as The data of the next level of enhancement layer is encoded until all enhancement layers are completed.
  • the present invention can implement Layer 2, Layer 3 or Layer 4 and above layering and encoding for audio signals, generally no more than four layers to simplify the layering and encoding process.
  • a specific example of the present invention is given here. Referring to FIG. 15, a schematic diagram of an audio hierarchy is shown, wherein the DRA core coding module is a standard algorithm for implementing DRA according to the standard GB/T 22726-2008. In the present invention, mono and stereo DRA coding is specifically referred to. The simple diagram of the dra algorithm is shown in Figure 16. Shown. In order to clearly describe this patent, the decoding end is also briefly described, wherein the decoding end module is shown in the dashed block diagram of FIG.
  • Step S211 at the encoding end, performing MDCT transformation on the time domain data x[n] to obtain a spectral coefficient X[k] ; in step S212, dividing the frequency domain coefficient into a plurality of subbands, and dividing the spectral coefficient belonging to the subband b by one Quantization step size;
  • Step S214, each quantization step size and spectral coefficient X [W are transmitted to the decoding end by various means (the steps of decoding the base layer at the decoding end are:
  • Step S4 using the quantization step size and the spectral coefficient W transmitted in step S214 to restore the inverse quantized spectral coefficient f[W
  • the inverse quantized spectral coefficient fc 3 ⁇ 4 IMDCT is obtained by inversely quantized time domain data.
  • the above SBR coding module is in accordance with the standard "ISO/IEC 14496-3:2001/Amd.l:2003,
  • the present invention further provides an example of separately encoding at least one enhancement layer based on the above coding of the base layer.
  • the DRA core residual coding module used in this embodiment is an intermediate module as shown in FIG. 16.
  • the schematic diagram of the DRA kernel residual coding algorithm shown in FIG. 17 shows that the base layer and the coding end of FIG. 18 are completely identical, that is, fully compatible.
  • the implementation of the base layer is as above.
  • the implementation steps of at least one enhancement layer coding in this embodiment as follows:
  • the following steps of adding the following enhancement layer in the base layer step 3 include:
  • step S317 dividing the residual spectral coefficient into a plurality of sub-bands, dividing the spectral coefficient belonging to the sub-band c by a residual spectral coefficient quantization step, and rounding (nint) the quantized residual spectral coefficient Step S318, transmitting the residual spectral coefficient quantization step size ⁇ and the quantized residual spectral coefficient to the solution
  • the process of decoding the at least one enhancement layer at the decoding end is as follows:
  • step S42 using the residual spectral coefficient quantization step size and the quantized residual spectral coefficient passed in step S34 to restore the inverse quantized residual spectral coefficient
  • Step S43 adding the inverse quantized spectral coefficient obtained in step S41 and the inverse quantized residual spectral coefficient obtained in step S42 to obtain an enhanced inverse quantized spectral coefficient [ ⁇ ]
  • X a [k] X[k] - E[k] , step S52, inversely quantized spectral coefficient f for enhancement.
  • W does IMDCT to get inverse quantized time domain data x[n]
  • the present invention further proposes that the total coding rate is 48 kbps, and the audio signal is divided into two layers by a residual differential layer structure, and each layer is 24 kbps as an example to describe the implementation steps of separately coding the base layer and the at least one enhancement layer in this embodiment.
  • Step S201 Encoding the base layer with a coding rate of 24 kbps at a coding bandwidth of 48 kbps, and obtaining a quantization step size of the 24 kbps code rate and a quantized spectral coefficient and an sbr code stream;
  • Step S301 multiplying the quantized spectral coefficients by the quantized step size at the encoding end to obtain an inverse quantized spectral coefficient at a coding rate of 24 kbps.
  • Step S302 subtracting the inverse quantized spectral coefficient f W from the original spectral coefficient x W to obtain a residual signal spectral coefficient E[k].
  • Step S303 using a 24 kbps code rate for the residual signal spectral coefficient £ [W, quantization, quantization method Consistent or similar to the quantization, the quantized step size ⁇ ⁇ quantized residual spectral coefficients of the quantized residual signal are obtained and transmitted to the decoding end.
  • the invention also proposes that if only stereo coding is performed, except for the above implementation, The encoding of the base layer and the at least one enhancement layer can be implemented with the next embodiment. An advantage of this embodiment over the previous embodiment is that higher quality can be obtained when the stereo total code rate is low.
  • a stereo audio layering structure diagram in this embodiment, the two stereo channels are downmixed into one channel and encoded by PS, wherein the PS code is in accordance with the standard ISO/IEC 14496-3:2001/Amd. 2:2004: "Parametric Coding for High Quality Audio" is implemented.
  • the DRA downmix channel coding is the same as the base layer coding principle and the procedure in FIG. 16; and the coding principle of the enhancement layer in this embodiment is the same as the DRA downmix channel residual coding, and therefore will not be described again.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

La présente invention concerne un procédé de codage audio stéréo ou monophonique. Le procédé comprend les étapes consistant à : diviser un signal audio stéréo ou monophonique en une couche de base et au moins une couche améliorée; coder la couche de base à l'aide d'un mode de codage de mp3, AAC, SBR, PS et/ou DRA; et coder la ou les couches améliorées à l'aide d'un mode de codage de mp3, AAC, SBR, PS, DRA, un codage résiduel, un algorithme de codage d'une partie des paramètres et/ou un algorithme de codage des paramètres, respectivement. Dans la présente invention, on réalise une organisation en couche grossière sur un élément audio stéréo ou monophonique, dans seules 2 ou 3 couches étant divisées; de cette manière, une compression dotée d'une efficacité plus élevée peut être assurée d'une manière simple, exempte de tous types de contraintes techniques rencontrées dans une technologie d'organisation en couche fine. La qualité sonore globale optimale peut être obtenue par un contrôle flexible de la qualité de chaque couche de piste sonore, et des exigences de codage de canal peuvent être facilement satisfaites.
PCT/CN2012/077155 2012-06-19 2012-06-19 Procédé de codage audio stéréo ou monophonique WO2013189030A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2012/077155 WO2013189030A1 (fr) 2012-06-19 2012-06-19 Procédé de codage audio stéréo ou monophonique
CN201280000961.1A CN104170007B (zh) 2012-06-19 2012-06-19 对单声道或立体声进行编码的方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/077155 WO2013189030A1 (fr) 2012-06-19 2012-06-19 Procédé de codage audio stéréo ou monophonique

Publications (1)

Publication Number Publication Date
WO2013189030A1 true WO2013189030A1 (fr) 2013-12-27

Family

ID=49768020

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/077155 WO2013189030A1 (fr) 2012-06-19 2012-06-19 Procédé de codage audio stéréo ou monophonique

Country Status (2)

Country Link
CN (1) CN104170007B (fr)
WO (1) WO2013189030A1 (fr)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6576934B2 (ja) * 2014-01-07 2019-09-18 ハーマン インターナショナル インダストリーズ インコーポレイテッド 圧縮済みオーディオ信号の信号品質ベース強調及び補償
CN114708874A (zh) 2018-05-31 2022-07-05 华为技术有限公司 立体声信号的编码方法和装置
CN110556118B (zh) 2018-05-31 2022-05-10 华为技术有限公司 立体声信号的编码方法和装置
CN111768793B (zh) * 2020-07-11 2023-09-01 北京百瑞互联技术有限公司 一种lc3音频编码器编码优化方法、系统、存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1623185A (zh) * 2002-03-12 2005-06-01 诺基亚有限公司 可伸缩音频编码的有效改进
CN1905010A (zh) * 2005-07-29 2007-01-31 索尼株式会社 编码音频数据的设备和方法及解码音频数据的设备和方法
US20070208557A1 (en) * 2006-03-03 2007-09-06 Microsoft Corporation Perceptual, scalable audio compression
CN101167126A (zh) * 2005-04-28 2008-04-23 松下电器产业株式会社 语音编码装置和语音编码方法
CN101206860A (zh) * 2006-12-20 2008-06-25 华为技术有限公司 一种可分层音频编解码方法及装置
CN101800048A (zh) * 2009-02-10 2010-08-11 数维科技(北京)有限公司 基于dra编码器的多声道数字音频编码方法及其编码系统

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101340233B1 (ko) * 2005-08-31 2013-12-10 파나소닉 주식회사 스테레오 부호화 장치, 스테레오 복호 장치 및 스테레오부호화 방법
WO2008062990A1 (fr) * 2006-11-21 2008-05-29 Samsung Electronics Co., Ltd. Procédé, support et système de codage/décodage extensible de signaux audio/vocaux

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1623185A (zh) * 2002-03-12 2005-06-01 诺基亚有限公司 可伸缩音频编码的有效改进
CN101167126A (zh) * 2005-04-28 2008-04-23 松下电器产业株式会社 语音编码装置和语音编码方法
CN1905010A (zh) * 2005-07-29 2007-01-31 索尼株式会社 编码音频数据的设备和方法及解码音频数据的设备和方法
US20070208557A1 (en) * 2006-03-03 2007-09-06 Microsoft Corporation Perceptual, scalable audio compression
CN101206860A (zh) * 2006-12-20 2008-06-25 华为技术有限公司 一种可分层音频编解码方法及装置
CN101800048A (zh) * 2009-02-10 2010-08-11 数维科技(北京)有限公司 基于dra编码器的多声道数字音频编码方法及其编码系统

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHOU, HONG ET AL.: "The Scalability Audio Coding", AUDIO ENGINEERING, September 2000 (2000-09-01), pages 3 - 7 *

Also Published As

Publication number Publication date
CN104170007A (zh) 2014-11-26
CN104170007B (zh) 2017-09-26

Similar Documents

Publication Publication Date Title
KR101428608B1 (ko) 대역폭 확장을 위한 스펙트럼 평탄도 제어
US8862480B2 (en) Audio encoding/decoding with aliasing switch for domain transforming of adjacent sub-blocks before and subsequent to windowing
US11908484B2 (en) Apparatus and method for generating an enhanced signal using independent noise-filling at random values and scaling thereupon
EP2044589B1 (fr) Procédé et appareil de codage sans perte d'un signal source avec utilisation d'un flux de données codées avec pertes et d'un flux de données d'extension sans pertes
JP5418930B2 (ja) 音声復号化方法および音声復号化器
CN113963705A (zh) 频域处理器以及时域处理器的音频编码器和解码器
CN105453176A (zh) 智能间隙填充框架内使用双声道处理的音频编码器、音频解码器及相关方法
MX2007009887A (es) Esquema de codificador/descodificador de multicanal casi transparente o transparente.
EP1749296A1 (fr) Extension audio multicanal
TW202016924A (zh) 使用信號白化或信號後處理之多重信號編碼器、多重信號解碼器及相關方法
US10332526B2 (en) Audio encoding apparatus and method, and audio decoding apparatus and method
CN111210832A (zh) 基于频谱包络模板的带宽扩展音频编解码方法及装置
WO2013189030A1 (fr) Procédé de codage audio stéréo ou monophonique
Gunawan et al. Investigation of lossless audio compression using ieee 1857.2 advanced audio coding
KR20050027179A (ko) 오디오 데이터 복원 방법 및 그 장치
WO2010099752A1 (fr) Procédé de codage stéréo, dispositif et codeur
KR102546098B1 (ko) 블록 기반의 오디오 부호화/복호화 장치 및 그 방법
WO2016023322A1 (fr) Procédé de codage de signal acoustique multicanal, procédé et dispositif de décodage
Ghaderi et al. Wideband speech coding using ADPCM and a new enhanced bandwidth extension method
WO2017148526A1 (fr) Codeur de signal audio, décodeur de signal audio, procédé de codage et procédé de décodage
Hansen et al. Fine-grain scalable audio coding based on envelope restoration and the SPIHT algorithm
De Meuleneire et al. Algebraic quantization of transform coefficients for embedded audio coding
Ghaderi et al. Wideband speech and audio coding using a new spectral replication method based on parametric stereo coding
CN112786063A (zh) 使用频域处理器、时域处理器和用于连续初始化的交叉处理器的音频编码器和解码器

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12879437

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO F1205N DATED 26-05-2015)

122 Ep: pct application non-entry in european phase

Ref document number: 12879437

Country of ref document: EP

Kind code of ref document: A1