WO2014005327A1 - Method for encoding multichannel digital audio - Google Patents

Method for encoding multichannel digital audio Download PDF

Info

Publication number
WO2014005327A1
WO2014005327A1 PCT/CN2012/078306 CN2012078306W WO2014005327A1 WO 2014005327 A1 WO2014005327 A1 WO 2014005327A1 CN 2012078306 W CN2012078306 W CN 2012078306W WO 2014005327 A1 WO2014005327 A1 WO 2014005327A1
Authority
WO
WIPO (PCT)
Prior art keywords
bytes
full
base layer
channel
band
Prior art date
Application number
PCT/CN2012/078306
Other languages
French (fr)
Chinese (zh)
Inventor
闫建新
王磊
Original Assignee
深圳广晟信源技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳广晟信源技术有限公司 filed Critical 深圳广晟信源技术有限公司
Priority to CN201280000959.4A priority Critical patent/CN103650036B/en
Priority to PCT/CN2012/078306 priority patent/WO2014005327A1/en
Publication of WO2014005327A1 publication Critical patent/WO2014005327A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Definitions

  • the present invention relates to the field of audio coding processing, and more particularly to a method of encoding multi-channel digital audio.
  • MPEG-4 BSAC Bit sliced arithmetic coding
  • AVS Anaudio Video coding
  • MPEG-4 SLS Scalable Lossless Coding's lossless enhancement layer approach enables fine layering of audio and encoding each layer separately.
  • the fine layering method has the disadvantages that the layering is too thin, requires a lot of auxiliary information, low coding efficiency, complicated structure, and high processing logic complexity.
  • AAC-SSR Advanced Audio Coding-Scalable Sampling Rate
  • PQF Polyphase Quadrature
  • Filter is divided into four frequency bands, and then the four frequency bands respectively perform one 256-point MDCT (512 sample window length) or eight 32-point (64 sample window length) MDCT.
  • the coding scheme can also reduce the data rate by removing the high PQF band, and achieve bit stream layering by reducing the frequency band, thereby obtaining different bit rates and sampling rates.
  • the advantage of this coding scheme is that the long block or short block MDCT can be independently selected in each frequency band, so that the high frequency can use short block coding to enhance the time resolution; and the low frequency use the long block coding to obtain high frequency resolution.
  • the coding efficiency of the transform domain coefficients of adjacent parts will decrease.
  • the present invention provides a method for encoding a multi-channel digital audio, comprising: dividing a multi-channel audio into a basic layer and at least one enhancement layer; respectively configuring a basic layer and at least one enhancement layer The number of bytes; respectively encoded for a base layer and at least one enhancement layer.
  • the multi-channel audio signal is divided into a base layer and an enhancement layer; wherein the base layer includes at least one full-band channel, the enhancement layer includes at least one full-band channel; and the base layer includes a full-band channel that is not greater than The number of full-band channels included in the enhancement layer.
  • the full-band channel included in the base layer is smaller than the number of full-band channels included in the enhancement layer
  • the situation also includes: the number of bytes configured for the base layer is the total number of bytes of the data frame/2, the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer
  • the number of bytes configured for the enhancement layer is the total number of bytes of the data frame/2; the number of bytes per channel of the enhancement layer is the total number of bytes of the data frame/2* the number of full-band channels included in the enhancement layer.
  • the method further includes: configuring the number of bytes for the base layer to be greater than the total number of bytes of the data frame/2; configuring the enhancement layer The number of bytes is less than the total number of bytes in the data frame/2.
  • the method further comprises: configuring the same number of bytes for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the number of full-band channels included in the enhancement layer) ).
  • the method further comprises: configuring, for each full-band channel in the base layer, the number of bytes as the total number of bytes of the data frame/the number of full-band channels included in the basic layer, and (the total number of bytes of the data frame/2) > (total number of bytes of data frame / number of full-band channels included in the base layer) > (total number of bytes of data frame / (number of full-band channels included in the base layer + number of full-band channels included in the enhancement layer))
  • the number of bytes for a channel configuration of the enhancement layer is greater than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer, and the rest
  • the number of bytes of at least one channel configuration is less than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer)/the number of full-band channels included in the enhancement layer.
  • the number of bytes is separately configured for the base layer and the enhancement layer according to a block size of LDPC coding, a channel coding condition, a characteristic of the base layer, and/or a characteristic of the enhancement layer in each transmission frame.
  • the multi-channel audio signal is divided into a base layer and a plurality of enhancement layers; wherein the base layer comprises at least one full-band channel, the plurality of enhancement layers respectively comprise at least one full-band channel; the base layer comprises a full-band The channel is less than the sum of the full-band channels included in all enhancement layers.
  • the number of bytes configured for the base layer is the total number of bytes of the data frame/2, and the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer;
  • the sum of the number of bytes of the at least one enhancement layer configuration is the total number of bytes of the data frame/2, wherein the number of bytes of each full-band channel of the first enhancement layer is greater than the total number of bytes of the data frame/2 (enhancement layer includes The number of full-band channels + the number of full-band channels included in the base layer),
  • the number of bytes of each full-band channel of the remaining at least one enhancement layer is less than the total number of bytes of the data frame/2 (the number of full-band channels included in the enhancement layer + the number of full-band channels included in the base layer).
  • the same number of bytes are configured for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of full-band channels included in all enhancement layers) ).
  • the number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (data frame) Total number of bytes / number of full-band channels included in the base layer)> (total number of bytes of data frame / (the number of full-band channels included in the base layer + the sum of the number of channels of the full-band included in all enhancement layers));
  • the number of bytes of a channel configuration for the first enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the sum of the total number of channels included in all enhancement layers
  • the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the sum of the number of full-band channels included in all enhancement layers.
  • the number of bytes is separately configured for the base layer and the at least one enhancement layer according to the block size of the LDPC encoding, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer in each transmission frame.
  • the method further includes: encoding, by using a DRA coding algorithm, a basic layer and at least one enhancement layer, respectively.
  • the method further comprises: separately performing bandwidth expansion on the base layer and/or the at least one enhancement layer.
  • the invention also provides a method for encoding multi-channel digital audio, comprising: dividing a multi-channel audio signal into a base layer and an enhancement layer, wherein the base layer comprises at least one full-band channel, and the enhancement layer comprises at least one Full-band channel; the number of full-band channels included in the base layer is not greater than the number of full-band channels included in the enhancement layer; the number of bytes is configured for the base layer and the enhancement layer respectively; wherein, for each full-band channel in the base layer
  • the configured number of bytes is the total number of bytes of the data frame / the number of full-band channels included in the base layer, and (the total number of bytes of the data frame / 2) > (the total number of bytes of the data frame / the full-band sound contained in the base layer) Number of tracks)>(the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the number of full-band channels included in the enhancement layer)); the number of bytes configured for one channel
  • the invention also provides a method for encoding multi-channel digital audio, comprising: dividing a multi-channel audio signal into a basic layer and a plurality of enhancement layers; wherein the base layer comprises at least one full-band channel, and multiple enhancement layers Each of the at least one full-band channel is included; the number of full-band channels included in the base layer is not greater than the sum of the number of full-band channels included in all enhancement layers; and the number of bytes is configured for a basic layer and at least one enhancement layer;
  • the number of bytes of each full-band channel configuration in the base layer is the total number of bytes of the data frame / the number of full-band channels included in the base layer, and (the total number of bytes of the data frame / 2) > (the total number of bytes of the data frame) /
  • the invention not only avoids the degradation of coding efficiency caused by fine layering, but also satisfies some fields of application, such as digital audio broadcasting.
  • the invention is simple to implement, and has the advantages of flexible control of the quality of each channel, obtaining the best integrated sound quality, easily satisfying the channel coding requirements, and eliminating various constraints in fine layering, and ensuring more efficient compression.
  • FIG. 2 is a schematic diagram of a multi-layer digital audio layer structure according to an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a multi-channel digital audio multi-layer structure according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of a two-layer structure of stereo left and right channels according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of a two-layer structure of stereo and differential channels according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a surround sound two-layer structure according to an embodiment of the present invention.
  • FIG. 7 is a schematic diagram of a two-layer structure of surround sound according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of a three-layer structure of surround sound according to an embodiment of the present invention.
  • FIG. 9 is a schematic diagram of a three-layer structure of surround sound according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of a hierarchical structure of a DRA & DRA+ surround sound according to an embodiment of the present invention.
  • the multi-channel digital audio encoding method of the first embodiment of the present invention includes:
  • Step S1 dividing the multi-channel audio into a basic layer and at least one enhancement layer
  • Step S2 configuring a number of bytes for a basic layer and at least one enhancement layer respectively;
  • Step S3 encoding a basic layer and at least one enhancement layer separately.
  • the second embodiment of the present invention provides a two-layer structure for dividing a multi-channel audio signal into a basic layer and a enhancement layer, wherein the basic layer includes at least one.
  • the base layer includes a full-band channel that is no larger than the full-band channel number included in the enhancement layer.
  • the present invention proposes various embodiments under the premise that the total load (i.e., the number of bytes) is constant.
  • the third embodiment emphasizes the configuration scheme for the base layer. Since the base layer is more important, and the enhancement layer contributes less to the overall sound quality, it is necessary to divide the payload into roughly equal parts. In particular, application scenarios in which the enhancement layer is not properly obtained due to channel or the like and the quality of the base layer is emphasized.
  • the number of bytes configured in the base layer in this embodiment is the total number of bytes of the data frame/2, and the byte of each channel of the base layer.
  • the number is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer;
  • the number of bytes configured for the enhancement layer is the total number of bytes of the data frame/2;
  • the number of bytes per channel of the enhancement layer is data Total number of frames in the frame/2* The number of full-band channels included in the enhancement layer.
  • the total number of bytes of a data frame be D.
  • the number of bytes allocated to the base layer and the enhancement layer is D/2
  • the effective number of bytes per channel of the base layer is D/2. *k
  • the number of bytes per channel of the enhancement layer is D/2*m.
  • the number of bytes configured in the base layer in this embodiment is greater than the total number of bytes in the data frame/2; bytes configured on the enhancement layer The number is less than the total number of bytes of the data frame/2.
  • Each channel of such a base layer can be represented by more bytes per channel of the enhancement layer, thereby obtaining better sound quality for each channel of the base layer.
  • the fourth embodiment is a k:m configuration scheme, or a uniform configuration scheme.
  • the above first embodiment highlights the importance of the base layer; and from the overall perspective of multi-channel, it is more reasonable to give equal attention to each full-band channel, so that it can only be correctly decoded due to certain channels and the like.
  • the obtained sound quality is slightly inferior to that of the first embodiment, but when both the base layer and the enhancement layer can be decoded, the overall multi-channel quality is superior to that of the first embodiment.
  • the same number of bytes are configured for each full-band channel, and the total number of bytes of the data frame/(the number of full-band channels included in the basic layer + the number of full-band channels included in the enhancement layer) is set to one.
  • the total number of bytes of the audio frame is D
  • the number of bytes of each full-band channel is D/(k+m), which is encoded by the same number of bytes for each full-band channel, so that each full-band The channels have the same sound quality.
  • the fifth embodiment is a near k:m configuration scheme, and the non-base layer emphasizes the configuration and is not uniformly configured.
  • the first embodiment described above highlights the importance of the base layer, but when k ⁇ m in general, the base layer emphasis configuration may over-emphasize the quality of the base layer.
  • the second embodiment treats the base layer full-band channel as a normal full-band channel; therefore, it should be given the most reasonable configuration, ie, close to the k:m configuration, depending on the multi-channel specific situation.
  • each full-band channel in the base layer is more important than the enhancement-band full-band channel, and should be given more than a uniform configuration and less than the number of bytes of the first configuration; and for m in the enhancement layer
  • the full-band channel also needs to be considered separately.
  • the center channel in the movie audio system is generally set to dubbing, which should be given more attention than the two surround channels. This configuration provides better multi-channel quality than the first two configurations.
  • the number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (data frame) Total number of bytes / number of full-band channels included in the base layer) > (total number of bytes of data frame / (number of full-band channels included in the base layer + number of full-band channels included in the enhancement layer)); for the enhancement layer
  • the number of bytes of a channel configuration is greater than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer, and the remaining at least one channel
  • the configured number of bytes is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer.
  • the total number of bytes of one audio frame is D
  • the number of bytes configured for each full-band channel in the base layer is D/k, and D/2>D/k>D/(k+m)
  • the enhancement layer is also given an appropriate configuration according to the characteristics of each full-band channel in the enhancement layer. For example, in 5.1 surround sound, the center channel should be configured more than D (1-1/k)/m bytes, and each channel in the left and right surround pairs is configured less than D (1-1/k)/m. byte.
  • the sixth embodiment is a limited configuration scheme that relies on the requirements of channel coding conditions. Due to channel coding such as LDPC (Low Density) Parity Check) coding is block coding, and the two layers need to adopt different protection levels, so each layer of layered coding needs to consider the multi-channel base layer and enhancement layer according to the block size of LDPC coding in each transmission frame. The characteristics are given the most reasonable arrangement and configuration. For the limited configuration case, the general base layer and enhancement layer byte number allocation is similar to that of the third embodiment, but considers the total capacity of each layer of LDPC coded blocks in the transmission frame.
  • LDPC Low Density
  • Parity Check Low Density Check
  • the scheme configures the number of bytes for the base layer and the enhancement layer respectively according to the block size of the LDPC code, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer in each transmission frame.
  • the present invention also proposes a multi-layering scheme.
  • the multi-channel audio signal is divided into a base layer and a plurality of enhancement layers; wherein the base layer includes at least one full-band channel, the plurality of enhancement layers respectively comprise at least one full-band channel; and the base layer includes a full-band channel The sum of the number of full-band channels included in all enhancement layers.
  • the present invention is based on a multi-layered scheme, which is a basic layer-emphasized configuration scheme in which the base layer occupies half or more of the payload.
  • the reason and characteristics of the scheme are similar to those of the third embodiment, and therefore will not be described again.
  • the number of bytes configured in the basic layer of the scheme is the total number of bytes of the data frame/2, and the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer;
  • the sum of the number of bytes of an enhancement layer configuration is the total number of bytes of the data frame/2, wherein the number of bytes of each full-band channel of the first enhancement layer is greater than the total number of bytes of the data frame/2 (all enhancement layers include The sum of the full-band channels + the number of full-band channels included in the base layer),
  • the number of bytes of each full-band channel of the remaining at least one enhancement layer is less than the total number of bytes of the data frame/2 (
  • the total number of bytes of an audio frame is D
  • the base layer includes k full-band channels
  • the first enhancement layer is set to include m full-band channels
  • the second enhancement layer is arranged to contain n full-band channels.
  • the number of bytes allocated to the base layer is D/2
  • the number of valid bytes per channel of the base layer is D/2k.
  • the sum of the number of bytes of the two enhancement layers is also D/2, but the number of bytes of each full-band channel of the first enhancement layer is greater than D/2 (m+n), and each full-band of the second enhancement layer
  • the number of bytes of the channel is less than D/2 (m+n), so that each base layer channel can be represented by more bytes than the two enhancement layer channels, thereby obtaining the sound quality of each channel of the base layer.
  • the first enhancement layer will also get a higher quality coding than the second enhancement layer.
  • the enhancement layer is three or more, the number of bytes of each full-band channel of the first enhancement layer is greater than D/2 (m+n), The sum of the number of bytes of each full-band channel of the second enhancement layer, the third enhancement layer to the N-th enhancement layer is less than D/2 (m+n).
  • the eighth embodiment is a k:m:n configuration, or a uniform configuration scheme.
  • the reason and features of the configuration are similar to those of the fourth embodiment, and thus are not described herein.
  • This scheme configures the same number of bytes for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of full-band channels included in all enhancement layers) .
  • the total number of bytes of one audio frame be D
  • the base layer contains k full-band channels
  • the first enhancement layer is set to include m full-band channels
  • the second enhancement layer is set to include n full-band channels.
  • the number of bytes of each full-band channel is D/(k+m+n)
  • each full-band channel is represented (encoded) by the same number of bytes, so each full-band channel has the same The quality of the sound.
  • the ninth embodiment is a near-k:m:n configuration, which is intermediate to the two configuration schemes provided by the seventh embodiment and the eighth embodiment.
  • the reason and features of the configuration are similar to those of the fifth embodiment, and thus are not described herein.
  • the number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (the total number of data frames) Number of bytes/the number of full-band channels included in the base layer)> (the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of channels of the full-band included in all enhancement layers));
  • the number of bytes of a channel configuration of the first enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the sum of the number of channels of the full-band included in all enhancement layers,
  • the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the sum of the number of full-band channels
  • the total number of bytes of an audio frame is D
  • the number of bytes of each full-band channel in the base layer is D/k, and D/2. >D/k>D/(k+m+n);
  • D(1-1/k)/3 bytes should be configured for the center channel of the full band, and less than D(1-1/k)/3 bytes for each channel of the left and right surround pairs
  • D(1-1/k)/3 bytes for each channel of the left and right surround pairs
  • the left and right surround channels of the second enhancement layer give the same assignment (or the same code as one channel pair).
  • the tenth embodiment is a limited configuration, and is dependent on the channel coding conditions.
  • the reason and characteristics of the configuration are similar to those of the sixth embodiment, and therefore will not be described again.
  • the scheme configures the number of bytes for the base layer and the at least one enhancement layer respectively according to the block size of the LDPC code, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer in each transmission frame.
  • each of the foregoing embodiments encodes a base layer and at least one enhancement layer by using a DRA coding algorithm.
  • Bandwidth extension enhancement coding tools may also be used to perform bandwidth expansion on the base layer and/or at least one enhancement layer, respectively.
  • Reference map 4 shows a schematic diagram of the stereo left and right channel two-layer structure.
  • the stereo audio signal has only two independent full-band channels, so the base layer transmits the left channel and the enhancement layer transmits the right channel.
  • the configuration of the two layers should be uniformly configured, that is, the left and right channels are configured with the same sound quality, that is, the same number of bytes are configured.
  • the base layer and the enhancement layer can respectively use the bandwidth extension enhancement coding tool for bandwidth expansion, and the figure is illustrated by a dashed box.
  • the stereo and differential channel two hierarchical structure shown in FIG. 5 there are only two full-band channels in this example, so there is only a two-layer scheme.
  • stereo signals it is usually necessary to perform the difference encoding in order to improve the coding efficiency at the time of encoding. Since there is a certain correlation between the two channels of the stereo signal, the difference signal has a smaller dynamic range than the right channel in probability, so encoding requires less data representation.
  • one channel is the accompanying sound, one channel lyrics (speech), and the channel can represent the two channels by mixing the two channels together.
  • the sum channel (and possible bandwidth extension) should be used as the base layer
  • the difference channel (and possible bandwidth extension) as the enhancement layer
  • the base layer emphasis configuration mode should be adopted.
  • This application example will layer better than the left and right channels when it can only correctly decode the base layer.
  • this embodiment is 5.1 surround sound, wherein 5 full-band channels and 1 sub-subwoofer channel.
  • the stereo left channel (shown as L) and the right channel (shown as R) are transmitted in the base layer; the other channels are transmitted in the enhancement layer, and the order of the channels in the enhanced channel is the center channel (illustration It is C), subwoofer channel (shown as LFE), left surround and right surround channels (illustrated LS and RS respectively).
  • each full-band channel can be selected with a bandwidth extension enhancement tool (shown by dashed lines in the figure) to improve coding efficiency; in addition, for each channel pair (L&R and LS&RS respectively), a parametric stereo coding tool can be further selected. Reduce the information redundancy.
  • the corresponding channels are down-mixed to mono (the pictures are M0 and M1 respectively) for basic encoding. Two layers of basic layer emphasis configuration and near k:m configuration can be used.
  • the audio layering structure is similar to that of the previous embodiment, except that the enhancement layer can adjust the channel arrangement order to the preferred encoding left surround and right surround channels, and then the central sound. Road and subwoofer channels.
  • the 5.1 channel is divided into three layers for encoding, wherein the base layer encodes the left and right channels (L and R), and the bandwidth expansion enhancement tool and the optional bandwidth expansion can be selected.
  • Parametric stereo coding tool for improved coding efficiency; first enhancement layer encoding center channel (C), optional bandwidth extension enhancement tool followed by subwoofer channel (LFE) encoding; second enhancement layer for transmission left surround and right Surround channels (LS and RS), optional bandwidth extension and parametric stereo enhancement tools.
  • the basic encoding of the stereo pair should be modified to the mono encoding of the stereo pair, such as L&R downmixing to M0 and LS&RS downmixing to M1.
  • This application example should adopt a data structure of near k:m:n configuration.
  • the audio layered structure is similar to the previous embodiment, but the first enhancement layer and the second enhancement layer are interchanged.
  • DRA & DRA+ The schematic diagram of the surround sound layering structure adopts a structure of surround sound two layers to form a base layer and an enhancement layer.
  • Adopt DRA Digital Rise
  • SBR Standard Band Replication
  • Parametric Stereo Coded PS Parametric Stereo
  • Low-band partial encoding in the enhancement layer, the first channel C is DRA-encoded, optionally using SBR bandwidth extension technology, then DRA encoding for the subwoofer channel LFE, and finally the left and right surround channels (LS and RS) Stereo-to-DRA encoding, optional bandwidth extension SBR and parametric stereo encoding PS, improve coding efficiency for surround pairs.
  • the data structure that should be used in this example is set to near k:m:n, or a limited setting is applied when applied to digital audio broadcasting.
  • the present invention proposes that each of the foregoing embodiments encodes a base layer and at least one enhancement layer by using a DRA coding algorithm.
  • the present invention can perform four or more layers on an audio signal, but generally adopts a two- to three-layer hierarchical structure, which is easy to implement. Layering based on the channel, the best overall sound quality is achieved by flexibly controlling the quality of each channel. It is easy to meet channel coding requirements: Since LDPC channel coding requires a fixed size for each coding block, channel-based coarse layering can be reasonably arranged to meet channel requirements. Various restrictions when fine layering is not required, such as MPEG In AAC-BSAC audio coding, MDCT coefficients are required to perform arithmetic coding and related auxiliary data every 32 groups, which affects the overall coding efficiency, so coarse layering can ensure more efficient compression.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The present invention provides a method for encoding multichannel digital audio, comprising: dividing multichannel audio into a base layer and at least one enhancement layer; configuring the number of bytes for the base layer and the at least one enhancement layer respectively; and encoding the base layer and the at least one enhancement layer respectively. According to the present invention, decrease of the encoding efficiency that is caused by fine layering is avoided to some extent, and at the same time, applications such as digital audio broadcast in some fields are satisfied. The implementation of the present invention is easy, optimal comprehensive sound quality is obtained by flexibly controlling the quality of a sound channel at each layer, channel encoding requirements are easy to be satisfied, and various limiting conditions during fine layering are not required, thereby ensuring compression at higher efficiency.

Description

对多声道数字音频编码的方法 Method for encoding multi-channel digital audio 技术领域 Technical field
本发明涉及音频编码处理领域,特别是涉及一种对多声道数字音频编码的方法。 The present invention relates to the field of audio coding processing, and more particularly to a method of encoding multi-channel digital audio.
背景技术 Background technique
在对多声道数字音频分层音频编码领域,已经存在通过精细分层方式进行有损数字音频编码方法及无损音频编码技术,如ISO/IEC 14496-3 MPEG-4 BSAC(Bit sliced arithmetic coding)比特片算术编码、在AVS(Audio Video coding Standard Workgroup of China)中采用的类似于MPEG-4 BSAC编码方法以及MPEG-4 SLS(Scalable Lossless Coding)的无损增强层方式都可实现对音频进行精细分层,对每一层分别编码。但精细分层方式存在分层过细,需要许多辅助信息,编码效率低、结构复杂、处理逻辑复杂度高等缺点。In the field of multi-channel digital audio layered audio coding, there are already lossy digital audio coding methods and lossless audio coding techniques through fine layering, such as ISO/IEC 14496-3. MPEG-4 BSAC (Bit sliced arithmetic coding) bit slice arithmetic coding, in AVS (Audio Video coding) Standard Workgroup of China) similar to MPEG-4 BSAC encoding method and MPEG-4 SLS (Scalable Lossless Coding's lossless enhancement layer approach enables fine layering of audio and encoding each layer separately. However, the fine layering method has the disadvantages that the layering is too thin, requires a lot of auxiliary information, low coding efficiency, complicated structure, and high processing logic complexity.
现有技术中还有一种非精细分层的编码方案:在MPEG-4第三部分和MPEG-2第七部分中都提供了可伸缩采样率编码算法 AAC-SSR(Advanced Audio Coding-Scalable Sampling Rate),首先是由Sony提出的,编码架构也类似于其独有的ARTAC(Adaptive Transform Acoustic Coding)编码。该编码方案首先将输入的数字音频信号通过4带的多相正交滤波器组(PQF,Polyphase Quadrature Filter)分割成4个频带,然后这4个频带分别进行1个256点MDCT(512样点窗长)或8个32点(64样点窗长)MDCT。该编码方案还可通过去除高PQF带的方式降低数据率,通过减少频带的方式实现比特流分层,从而获得不同比特率和采样率。这种编码方案的好处是在每个频带内可以独立选择长块或短块MDCT,因此对高频可使用短块编码增强时间分辨率;而对低频使用长块编码获得高频率分辨率。但是由于4个PQF带间存在混迭,因此相邻部分的变换域系数编码效率会下降。There is also a non-fine layered coding scheme in the prior art: a scalable sample rate coding algorithm is provided in both MPEG-4 Part 3 and MPEG-2 Part 7. AAC-SSR (Advanced Audio Coding-Scalable Sampling Rate), first proposed by Sony, the encoding architecture is similar to its unique ARTAC (Adaptive Transform Acoustic) Coding) coding. The encoding scheme first passes the input digital audio signal through a 4-band polyphase quadrature filter bank (PQF, Polyphase Quadrature). Filter) is divided into four frequency bands, and then the four frequency bands respectively perform one 256-point MDCT (512 sample window length) or eight 32-point (64 sample window length) MDCT. The coding scheme can also reduce the data rate by removing the high PQF band, and achieve bit stream layering by reducing the frequency band, thereby obtaining different bit rates and sampling rates. The advantage of this coding scheme is that the long block or short block MDCT can be independently selected in each frequency band, so that the high frequency can use short block coding to enhance the time resolution; and the low frequency use the long block coding to obtain high frequency resolution. However, due to the overlap between the four PQF bands, the coding efficiency of the transform domain coefficients of adjacent parts will decrease.
发明内容 Summary of the invention
为解决上述技术问题,本发明提出一种对多声道数字音频编码的方法,包括:对多声道音频分为一基本层及至少一增强层;对一基本层及至少一增强层分别配置字节数;对一基本层及至少一增强层分别编码。To solve the above technical problem, the present invention provides a method for encoding a multi-channel digital audio, comprising: dividing a multi-channel audio into a basic layer and at least one enhancement layer; respectively configuring a basic layer and at least one enhancement layer The number of bytes; respectively encoded for a base layer and at least one enhancement layer.
优选地,将多声道音频信号分为一基本层和一增强层;其中基本层包含至少一全频带声道,增强层包含至少一全频带声道;基本层包含的全频带声道不大于增强层包含的全频带声道数。Preferably, the multi-channel audio signal is divided into a base layer and an enhancement layer; wherein the base layer includes at least one full-band channel, the enhancement layer includes at least one full-band channel; and the base layer includes a full-band channel that is not greater than The number of full-band channels included in the enhancement layer.
优选地,对于基本层包含的全频带声道小于增强层包含的全频带声道数 情况,还包括:对基本层配置字节数为数据帧总字节数/2,基本层每声道的字节数为数据帧总字节数/2*基本层包含的全频带声道数;对增强层配置字节数为数据帧总字节数/2;增强层每声道的字节数为数据帧总字节数/2*增强层包含的全频带声道数。Preferably, the full-band channel included in the base layer is smaller than the number of full-band channels included in the enhancement layer The situation also includes: the number of bytes configured for the base layer is the total number of bytes of the data frame/2, the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer The number of bytes configured for the enhancement layer is the total number of bytes of the data frame/2; the number of bytes per channel of the enhancement layer is the total number of bytes of the data frame/2* the number of full-band channels included in the enhancement layer.
优选地,对于基本层包含的全频带声道等于增强层包含的全频带声道数情况,还包括:对基本层配置的字节数大于数据帧总字节数/2;对增强层配置的字节数小于数据帧总字节数/2。Preferably, for the case where the baseband included in the base layer is equal to the number of full-band channels included in the enhancement layer, the method further includes: configuring the number of bytes for the base layer to be greater than the total number of bytes of the data frame/2; configuring the enhancement layer The number of bytes is less than the total number of bytes in the data frame/2.
优选地,还包括:对每一全频带声道配置同样多的字节数,均为数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数)。Preferably, the method further comprises: configuring the same number of bytes for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the number of full-band channels included in the enhancement layer) ).
优选地,还包括:对基本层中每个全频带声道配置的字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数));对于增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数。Preferably, the method further comprises: configuring, for each full-band channel in the base layer, the number of bytes as the total number of bytes of the data frame/the number of full-band channels included in the basic layer, and (the total number of bytes of the data frame/2) > (total number of bytes of data frame / number of full-band channels included in the base layer) > (total number of bytes of data frame / (number of full-band channels included in the base layer + number of full-band channels included in the enhancement layer)) The number of bytes for a channel configuration of the enhancement layer is greater than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer, and the rest The number of bytes of at least one channel configuration is less than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer)/the number of full-band channels included in the enhancement layer.
优选地,根据每一传输帧中LDPC编码的块大小、信道编码条件、所述基本层的特性和/或所述增强层的特性,对所述基本层和增强层分别配置字节数。Preferably, the number of bytes is separately configured for the base layer and the enhancement layer according to a block size of LDPC coding, a channel coding condition, a characteristic of the base layer, and/or a characteristic of the enhancement layer in each transmission frame.
优选地,将多声道音频信号分为一基本层和多个增强层;其中基本层包含至少一全频带声道,多个增强层分别包含至少一全频带声道;基本层包含的全频带声道少于所有增强层包含的全频带声道数之和。Preferably, the multi-channel audio signal is divided into a base layer and a plurality of enhancement layers; wherein the base layer comprises at least one full-band channel, the plurality of enhancement layers respectively comprise at least one full-band channel; the base layer comprises a full-band The channel is less than the sum of the full-band channels included in all enhancement layers.
优选地,对基本层配置的字节数为数据帧总字节数/2,基本层每声道的字节数为数据帧总字节数/2*基本层包含的全频带声道数;至少一增强层配置的字节数之和为数据帧总字节数/2,其中第一增强层的每个全频带声道的字节数大于数据帧总字节数/2(增强层包含的全频带声道数+基本层包含的全频带声道数), 其余至少一增强层的每个全频带声道的字节数小于数据帧总字节数/2(增强层包含的全频带声道数+基本层包含的全频带声道数)。Preferably, the number of bytes configured for the base layer is the total number of bytes of the data frame/2, and the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer; The sum of the number of bytes of the at least one enhancement layer configuration is the total number of bytes of the data frame/2, wherein the number of bytes of each full-band channel of the first enhancement layer is greater than the total number of bytes of the data frame/2 (enhancement layer includes The number of full-band channels + the number of full-band channels included in the base layer), The number of bytes of each full-band channel of the remaining at least one enhancement layer is less than the total number of bytes of the data frame/2 (the number of full-band channels included in the enhancement layer + the number of full-band channels included in the base layer).
优选地,对每一全频带声道配置同样多的字节数,均为数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和)。Preferably, the same number of bytes are configured for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of full-band channels included in all enhancement layers) ).
优选地,对基本层中每个全频带声道配置字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和));对于第一增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和。Preferably, the number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (data frame) Total number of bytes / number of full-band channels included in the base layer)> (total number of bytes of data frame / (the number of full-band channels included in the base layer + the sum of the number of channels of the full-band included in all enhancement layers)); The number of bytes of a channel configuration for the first enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the sum of the total number of channels included in all enhancement layers And the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the sum of the number of full-band channels included in all enhancement layers.
优选地,根据每一传输帧中LDPC编码的块大小、信道编码条件、基本层的特性和/或增强层的特性,对基本层和至少一增强层分别配置字节数。Preferably, the number of bytes is separately configured for the base layer and the at least one enhancement layer according to the block size of the LDPC encoding, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer in each transmission frame.
优选地,还包括:对一基本层及至少一增强层分别采用DRA编码算法编码。Preferably, the method further includes: encoding, by using a DRA coding algorithm, a basic layer and at least one enhancement layer, respectively.
优选地,还包括:对基本层和\或至少一增强层分别进行带宽扩展。Preferably, the method further comprises: separately performing bandwidth expansion on the base layer and/or the at least one enhancement layer.
本发明还提出一种对多声道数字音频编码的方法,包括:将多声道音频信号分为一基本层和一增强层,其中基本层包含至少一全频带声道,增强层包含至少一全频带声道;基本层包含的全频带声道数不大于增强层包含的全频带声道数;对基本层及增强层分别配置字节数;其中,对基本层中每个全频带声道配置的字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数));对增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数;对所述基本层及增强层分别采用DRA编码算法编码。The invention also provides a method for encoding multi-channel digital audio, comprising: dividing a multi-channel audio signal into a base layer and an enhancement layer, wherein the base layer comprises at least one full-band channel, and the enhancement layer comprises at least one Full-band channel; the number of full-band channels included in the base layer is not greater than the number of full-band channels included in the enhancement layer; the number of bytes is configured for the base layer and the enhancement layer respectively; wherein, for each full-band channel in the base layer The configured number of bytes is the total number of bytes of the data frame / the number of full-band channels included in the base layer, and (the total number of bytes of the data frame / 2) > (the total number of bytes of the data frame / the full-band sound contained in the base layer) Number of tracks)>(the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the number of full-band channels included in the enhancement layer)); the number of bytes configured for one channel of the enhancement layer is larger than the data frame The total number of bytes * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer, and the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1/the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer; DRA and enhancement layer coding algorithm respectively.
本发明还提出一种对多声道数字音频编码的方法,包括:将多声道音频信号分为一基本层和多个增强层;其中基本层包含至少一全频带声道,多个增强层分别包含至少一全频带声道;基本层包含的全频带声道数不大于所有增强层包含的全频带声道数之和;对一基本层及至少一增强层分别配置字节数;其中对基本层中每个全频带声道配置字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和));对于第一增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和;对一基本层及至少一增强层分别采用DRA编码算法编码。The invention also provides a method for encoding multi-channel digital audio, comprising: dividing a multi-channel audio signal into a basic layer and a plurality of enhancement layers; wherein the base layer comprises at least one full-band channel, and multiple enhancement layers Each of the at least one full-band channel is included; the number of full-band channels included in the base layer is not greater than the sum of the number of full-band channels included in all enhancement layers; and the number of bytes is configured for a basic layer and at least one enhancement layer; The number of bytes of each full-band channel configuration in the base layer is the total number of bytes of the data frame / the number of full-band channels included in the base layer, and (the total number of bytes of the data frame / 2) > (the total number of bytes of the data frame) / The number of full-band channels included in the base layer) > (the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of channels of the full-band included in all enhancement layers)); for the first enhancement The number of bytes in a channel configuration of the layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the sum of the total number of channels included in all enhancement layers, and the rest The number of bytes in at least one channel configuration is less than the total number of bytes in the data frame* (1-1/the full band included in the base layer) Channel number) / reinforcing layer comprises all of the number of full-range channels and; on a base layer and at least one enhancement layer respectively DRA coding algorithm.
本发明在一定程度上既避免了精细分层引起的编码效率下降,同时又满足一些领域的应用,如数字音频广播等。本发明实现简单,通过灵活控制每层声道的质量,获取最佳综合声音质量,易于满足信道编码要求,并且不需要精细分层时的各种限制条件,保证更高效率的压缩。The invention not only avoids the degradation of coding efficiency caused by fine layering, but also satisfies some fields of application, such as digital audio broadcasting. The invention is simple to implement, and has the advantages of flexible control of the quality of each channel, obtaining the best integrated sound quality, easily satisfying the channel coding requirements, and eliminating various constraints in fine layering, and ensuring more efficient compression.
附图说明 DRAWINGS
图1为本发明一实施例的流程示意图; 1 is a schematic flow chart of an embodiment of the present invention;
图2为本发明一实施例的多声道数字音频二层结构示意图; 2 is a schematic diagram of a multi-layer digital audio layer structure according to an embodiment of the present invention;
图3为本发明一实施例多声道数字音频多层结构示意图;3 is a schematic diagram of a multi-channel digital audio multi-layer structure according to an embodiment of the present invention;
图4为本发明一实施例立体声左右声道二分层结构示意图;4 is a schematic diagram of a two-layer structure of stereo left and right channels according to an embodiment of the present invention;
图5为本发明一实施例立体声和差声道二分层结构示意图;FIG. 5 is a schematic diagram of a two-layer structure of stereo and differential channels according to an embodiment of the present invention; FIG.
图6为本发明一实施例环绕声二分层结构示意图;6 is a schematic diagram of a surround sound two-layer structure according to an embodiment of the present invention;
图7为本发明一实施例示出的环绕声二分层结构示意图;FIG. 7 is a schematic diagram of a two-layer structure of surround sound according to an embodiment of the present invention; FIG.
图8为本发明一实施例环绕声三分层结构示意图;FIG. 8 is a schematic diagram of a three-layer structure of surround sound according to an embodiment of the present invention; FIG.
图9为本发明一实施例环绕声三分层结构示意图;FIG. 9 is a schematic diagram of a three-layer structure of surround sound according to an embodiment of the present invention; FIG.
图10为本发明一实施例DRA & DRA+ 环绕声分层结构示意图。FIG. 10 is a schematic diagram of a hierarchical structure of a DRA & DRA+ surround sound according to an embodiment of the present invention.
具体实施方式 Detailed ways
为详细说明本发明的技术内容、构造特征、所达成的目的及效果,下面将结合实施例并配合附图予以详细说明。The details of the technical contents, structural features, objects and effects of the present invention will be described in detail below with reference to the embodiments and the accompanying drawings.
请参阅图1示出的流程示意图,本发明第一实施例对多声道数字音频编码方法包括:Referring to the flow diagram shown in FIG. 1, the multi-channel digital audio encoding method of the first embodiment of the present invention includes:
步骤S1、对多声道音频分为一基本层及至少一增强层;Step S1, dividing the multi-channel audio into a basic layer and at least one enhancement layer;
步骤S2、对一基本层及至少一增强层分别配置字节数;Step S2: configuring a number of bytes for a basic layer and at least one enhancement layer respectively;
步骤S3、对一基本层及至少一增强层分别编码。Step S3, encoding a basic layer and at least one enhancement layer separately.
结合图2示出的多声道数字音频二层结构示意图,本发明第二实施例提出对将多声道音频信号分为一基本层和一增强层的二层结构,其中基本层包含至少一全频带声道,增强层包含至少一全频带声道;基本层包含的全频带声道不大于增强层包含的全频带声道数。2, the second embodiment of the present invention provides a two-layer structure for dividing a multi-channel audio signal into a basic layer and a enhancement layer, wherein the basic layer includes at least one. The full-band channel, the enhancement layer includes at least one full-band channel; the base layer includes a full-band channel that is no larger than the full-band channel number included in the enhancement layer.
设基本层包含k个全频带声道,增强层设置为包含m个全频带声道。配置基本层包含的全频带声道不大于增强层包含的全频带声道数,即k<=m,配置基本层编码相对少的声道,从而保证其质量更高。Let the base layer contain k full-band channels, and the enhancement layer is set to contain m full-band channels. The configuration base layer includes a full-band channel that is not larger than the full-band channel number included in the enhancement layer, that is, k<=m, and configures the base layer to encode a relatively small channel, thereby ensuring higher quality.
关于各分层间有效载荷即字节数的配置方案,在总载荷(即字节数)一定的前提下本发明提出多种实施例。Regarding the arrangement scheme of the inter-layer payload, that is, the number of bytes, the present invention proposes various embodiments under the premise that the total load (i.e., the number of bytes) is constant.
第三实施例为基本层强调配置方案。由于基本层较为重要,而增强层对整体声音质量的贡献相对次之,因此有必要将净载荷分成大致对等的两部分。特别是由于信道等原因需要丢弃或无法正确得到增强层而重视基本层质量的应用场景。The third embodiment emphasizes the configuration scheme for the base layer. Since the base layer is more important, and the enhancement layer contributes less to the overall sound quality, it is necessary to divide the payload into roughly equal parts. In particular, application scenarios in which the enhancement layer is not properly obtained due to channel or the like and the quality of the base layer is emphasized.
对于基本层包含的全频带声道小于增强层包含的全频带声道数情况,本实施例对基本层配置的字节数为数据帧总字节数/2,基本层每声道的字节数为数据帧总字节数/2*基本层包含的全频带声道数;对增强层配置的字节数为数据帧总字节数/2;增强层每声道的字节数为数据帧总字节数/2*增强层包含的全频带声道数。For the case where the baseband included in the base layer is smaller than the number of full-band channels included in the enhancement layer, the number of bytes configured in the base layer in this embodiment is the total number of bytes of the data frame/2, and the byte of each channel of the base layer. The number is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer; the number of bytes configured for the enhancement layer is the total number of bytes of the data frame/2; the number of bytes per channel of the enhancement layer is data Total number of frames in the frame/2* The number of full-band channels included in the enhancement layer.
设一个数据帧总字节数为D,当k<m时,则分给基本层和增强层的字节数都为D/2,基本层每个声道的有效字节数为D/2*k,增强层每个声道的字节数为D/2*m。Let the total number of bytes of a data frame be D. When k<m, the number of bytes allocated to the base layer and the enhancement layer is D/2, and the effective number of bytes per channel of the base layer is D/2. *k, the number of bytes per channel of the enhancement layer is D/2*m.
而对于基本层包含的全频带声道等于增强层包含的全频带声道数情况,本实施例对基本层配置的字节数大于数据帧总字节数/2;对增强层配置的字节数小于数据帧总字节数/2。For the case where the baseband included in the base layer is equal to the number of full-band channels included in the enhancement layer, the number of bytes configured in the base layer in this embodiment is greater than the total number of bytes in the data frame/2; bytes configured on the enhancement layer The number is less than the total number of bytes of the data frame/2.
即当k=m时可配置给基本层大于D/2的字节数,比如3*D/5,增强层则配置2*D/5,还可采用其他比例。That is, when k=m, it can be configured to the number of bytes whose base layer is larger than D/2, such as 3*D/5, and the enhancement layer is configured with 2*D/5, and other ratios can also be used.
这样基本层的每个声道相对增强层的每个声道可用更多字节表示,从而获得保证基本层每个声道的声音质量更好。Each channel of such a base layer can be represented by more bytes per channel of the enhancement layer, thereby obtaining better sound quality for each channel of the base layer.
第四实施例为k:m配置方案,或称均匀配置方案。上述第一实施例突出了基本层的重要性;而从多声道整体方面考虑,对每个全频带声道给予平等的重视就更加合理,这样在由于某种信道等因素造成只能正确解码基本层时,所获得的声音质量较第一实施例的配置方案略差,但当基本层和增强层都能够解码时,总体多声道质量会比第一实施例更优。The fourth embodiment is a k:m configuration scheme, or a uniform configuration scheme. The above first embodiment highlights the importance of the base layer; and from the overall perspective of multi-channel, it is more reasonable to give equal attention to each full-band channel, so that it can only be correctly decoded due to certain channels and the like. At the time of the base layer, the obtained sound quality is slightly inferior to that of the first embodiment, but when both the base layer and the enhancement layer can be decoded, the overall multi-channel quality is superior to that of the first embodiment.
本实施例对每一全频带声道配置同样多的字节数,均为数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数)设一个音频帧的总字节数为D,每个全频带声道的字节数为D/(k+m),对每个全频带声道用同样字节数表示即编码,使得每个全频带声道具有同样的声音质量。In this embodiment, the same number of bytes are configured for each full-band channel, and the total number of bytes of the data frame/(the number of full-band channels included in the basic layer + the number of full-band channels included in the enhancement layer) is set to one. The total number of bytes of the audio frame is D, and the number of bytes of each full-band channel is D/(k+m), which is encoded by the same number of bytes for each full-band channel, so that each full-band The channels have the same sound quality.
第五实施例为近k:m配置方案,非基本层强调配置,亦非均匀配置。上述第一实施例突出了基本层的重要性,但是当一般情况下k<m时,基本层强调配置可能过度强调基本层的质量。第二实施例则又将基本层全频带声道作为一个普通全频带声道一样看待;因此应根据多声道具体情况给予最合理配置即接近k:m配置。本实施例认为基本层中的每个全频带声道要比增强层全频带声道重要,应给予多于均匀配置且少于第一种配置的字节数;而对于增强层中的m个全频带声道,也需要分别考虑,特别对于典型多声道环绕声5.1情况,电影音频系统中的中央声道一般设定为配音,应比两个环绕声道给予更高的重视。这种配置能够提供比前两种配置更好的多声道质量。The fifth embodiment is a near k:m configuration scheme, and the non-base layer emphasizes the configuration and is not uniformly configured. The first embodiment described above highlights the importance of the base layer, but when k < m in general, the base layer emphasis configuration may over-emphasize the quality of the base layer. The second embodiment treats the base layer full-band channel as a normal full-band channel; therefore, it should be given the most reasonable configuration, ie, close to the k:m configuration, depending on the multi-channel specific situation. This embodiment considers that each full-band channel in the base layer is more important than the enhancement-band full-band channel, and should be given more than a uniform configuration and less than the number of bytes of the first configuration; and for m in the enhancement layer The full-band channel also needs to be considered separately. Especially for the typical multi-channel surround sound 5.1 case, the center channel in the movie audio system is generally set to dubbing, which should be given more attention than the two surround channels. This configuration provides better multi-channel quality than the first two configurations.
本方案对基本层中每个全频带声道配置的字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数));对于增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数。The number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (data frame) Total number of bytes / number of full-band channels included in the base layer) > (total number of bytes of data frame / (number of full-band channels included in the base layer + number of full-band channels included in the enhancement layer)); for the enhancement layer The number of bytes of a channel configuration is greater than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer, and the remaining at least one channel The configured number of bytes is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the number of full-band channels included in the enhancement layer.
即设一个音频帧的总字节数为D,对基本层中每个全频带声道配置的字节数为D/k,且D/2>D/k>D/(k+m);对于增强层也根据增强层中各个全频带声道的特性给予适当的配置。例如5.1环绕声时,中央声道应配置多于D(1-1/k)/m字节,而左右环绕声对中的每个声道配置少于D(1-1/k)/m字节。That is, the total number of bytes of one audio frame is D, and the number of bytes configured for each full-band channel in the base layer is D/k, and D/2>D/k>D/(k+m); The enhancement layer is also given an appropriate configuration according to the characteristics of each full-band channel in the enhancement layer. For example, in 5.1 surround sound, the center channel should be configured more than D (1-1/k)/m bytes, and each channel in the left and right surround pairs is configured less than D (1-1/k)/m. byte.
第六实施例为受限配置方案,依赖于信道编码条件的要求。由于信道编码如LDPC(Low Density Parity Check)编码是分组编码,并且两个分层要采用不同保护等级,因此分层编码的每一层需要根据每个传输帧中LDPC编码的块大小、并综合考虑多声道基本层和增强层的特性,给出最合理地安排和配置。对于受限配置情况,一般基本层和增强层的字节数分配与第三实施例类似,但考虑传输帧中每层LDPC编码块总容量。The sixth embodiment is a limited configuration scheme that relies on the requirements of channel coding conditions. Due to channel coding such as LDPC (Low Density) Parity Check) coding is block coding, and the two layers need to adopt different protection levels, so each layer of layered coding needs to consider the multi-channel base layer and enhancement layer according to the block size of LDPC coding in each transmission frame. The characteristics are given the most reasonable arrangement and configuration. For the limited configuration case, the general base layer and enhancement layer byte number allocation is similar to that of the third embodiment, but considers the total capacity of each layer of LDPC coded blocks in the transmission frame.
本方案根据每一传输帧中LDPC编码的块大小、信道编码条件、基本层的特性和/或增强层的特性,对基本层和增强层分别配置字节数。The scheme configures the number of bytes for the base layer and the enhancement layer respectively according to the block size of the LDPC code, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer in each transmission frame.
结合图3示出的多声道数字音频多层结构示意图,本发明还提出多分层方案。将多声道音频信号分为一基本层和多个增强层;其中基本层包含至少一全频带声道,多个增强层分别包含至少一全频带声道;基本层包含的全频带声道少于所有增强层包含的全频带声道数之和。In conjunction with the multi-channel digital audio multi-layer structure diagram shown in FIG. 3, the present invention also proposes a multi-layering scheme. The multi-channel audio signal is divided into a base layer and a plurality of enhancement layers; wherein the base layer includes at least one full-band channel, the plurality of enhancement layers respectively comprise at least one full-band channel; and the base layer includes a full-band channel The sum of the number of full-band channels included in all enhancement layers.
本发明基于多分层方案提出,第七实施例为基本层强调配置方案,基本层占据一半或以上的有效载荷。该方案的理由和特点与第三实施例相似,故不赘述。本方案对基本层配置的字节数为数据帧总字节数/2,基本层每声道的字节数为数据帧总字节数/2*基本层包含的全频带声道数;至少一增强层配置的字节数之和为数据帧总字节数/2,其中第一增强层的每个全频带声道的字节数大于数据帧总字节数/2(所有增强层包含的全频带声道数之和+基本层包含的全频带声道数), 其余至少一增强层的每个全频带声道的字节数小于数据帧总字节数/2(所有增强层包含的全频带声道数之和+基本层包含的全频带声道数)。The present invention is based on a multi-layered scheme, which is a basic layer-emphasized configuration scheme in which the base layer occupies half or more of the payload. The reason and characteristics of the scheme are similar to those of the third embodiment, and therefore will not be described again. The number of bytes configured in the basic layer of the scheme is the total number of bytes of the data frame/2, and the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer; The sum of the number of bytes of an enhancement layer configuration is the total number of bytes of the data frame/2, wherein the number of bytes of each full-band channel of the first enhancement layer is greater than the total number of bytes of the data frame/2 (all enhancement layers include The sum of the full-band channels + the number of full-band channels included in the base layer), The number of bytes of each full-band channel of the remaining at least one enhancement layer is less than the total number of bytes of the data frame/2 (the sum of the number of full-band channels included in all enhancement layers + the number of full-band channels included in the base layer).
以一基本层二增强层的三层结构为例,设一个音频帧总字节数为D,基本层包含k个全频带声道,第一增强层设置为包含m个全频带声道,第二增强层设置为包含n个全频带声道。则配置给基本层的字节数都为D/2,基本层每个声道的有效字节数为D/2k。两个增强层字节数之和也为D/2,但是第一增强层的每个全频带声道的字节数大于D/2(m+n),第二增强层的每个全频带声道的字节数小于于D/2(m+n),这样每个基本层声道相对两个增强层声道可用更多字节表示,从而获得保证基本层每个声道的声音质量更好;同时第一增强层也会比第二增强层得到更高质量的编码。如果增强层为三个或以上,那么第一增强层的每个全频带声道的字节数大于D/2(m+n), 第二增强层、第三增强层至第N增强层的每个全频带声道的字节数之和小于D/2(m+n)。Taking a three-layer structure of a basic layer and two enhancement layers as an example, the total number of bytes of an audio frame is D, the base layer includes k full-band channels, and the first enhancement layer is set to include m full-band channels, The second enhancement layer is arranged to contain n full-band channels. The number of bytes allocated to the base layer is D/2, and the number of valid bytes per channel of the base layer is D/2k. The sum of the number of bytes of the two enhancement layers is also D/2, but the number of bytes of each full-band channel of the first enhancement layer is greater than D/2 (m+n), and each full-band of the second enhancement layer The number of bytes of the channel is less than D/2 (m+n), so that each base layer channel can be represented by more bytes than the two enhancement layer channels, thereby obtaining the sound quality of each channel of the base layer. Better; at the same time the first enhancement layer will also get a higher quality coding than the second enhancement layer. If the enhancement layer is three or more, the number of bytes of each full-band channel of the first enhancement layer is greater than D/2 (m+n), The sum of the number of bytes of each full-band channel of the second enhancement layer, the third enhancement layer to the N-th enhancement layer is less than D/2 (m+n).
第八实施例为k:m:n配置,或称均匀配置方案,配置的理由和特点与第四实施例相似,故不赘述。The eighth embodiment is a k:m:n configuration, or a uniform configuration scheme. The reason and features of the configuration are similar to those of the fourth embodiment, and thus are not described herein.
本方案对每一全频带声道配置同样多的字节数,均为数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和)。设一个音频帧的总字节数为D,基本层包含k个全频带声道,第一增强层设置为包含m个全频带声道,第二增强层设置为包含n个全频带声道。此时每个全频带声道的字节数为D/(k+m+n),这时每个全频带声道用同样字节数表示(编码),因此每个全频带声道具有同样的声音质量。This scheme configures the same number of bytes for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of full-band channels included in all enhancement layers) . Let the total number of bytes of one audio frame be D, the base layer contains k full-band channels, the first enhancement layer is set to include m full-band channels, and the second enhancement layer is set to include n full-band channels. At this time, the number of bytes of each full-band channel is D/(k+m+n), and each full-band channel is represented (encoded) by the same number of bytes, so each full-band channel has the same The quality of the sound.
第九实施例为近k:m:n配置,介于第七实施例和第八实施例提供的两种配置方案的中间状态,配置的理由和特点与第五实施例相似,故不赘述。The ninth embodiment is a near-k:m:n configuration, which is intermediate to the two configuration schemes provided by the seventh embodiment and the eighth embodiment. The reason and features of the configuration are similar to those of the fifth embodiment, and thus are not described herein.
本方案对基本层中每个全频带声道配置字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和));对于第一增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和。 The number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (the total number of data frames) Number of bytes/the number of full-band channels included in the base layer)> (the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of channels of the full-band included in all enhancement layers)); The number of bytes of a channel configuration of the first enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the sum of the number of channels of the full-band included in all enhancement layers, The number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the sum of the number of full-band channels included in all enhancement layers.
以一基本层二增强层的三层结构为例,设一个音频帧的总字节数为D,此时基本层中每个全频带声道的字节数为D/k,且D/2>D/k>D/(k+m+n);对于第一增强层中的全频带声道比第二增强层中的全频带给予更多的分配,例如5.1环绕声时,第一增强层传输中央声道和超重低音声道,第二增强层传输左环绕和右环绕声道,此时m=1,n=2。对全频带的中央声道应配置多于D(1-1/k)/3字节,而左右环绕声对中的每个声道配置少于D(1-1/k)/3字节;而第二增强层的左右环绕声道给予同样的分配(或者作为一个声道对同一编码)。Taking a three-layer structure of a basic layer and two enhancement layers as an example, the total number of bytes of an audio frame is D, and the number of bytes of each full-band channel in the base layer is D/k, and D/2. >D/k>D/(k+m+n); for the full band channel in the first enhancement layer to give more allocation than the full band in the second enhancement layer, such as 5.1 surround sound, the first enhancement The layer transmits the center channel and the subwoofer channel, and the second enhancement layer transmits the left surround and right surround channels, where m=1, n=2. More than D(1-1/k)/3 bytes should be configured for the center channel of the full band, and less than D(1-1/k)/3 bytes for each channel of the left and right surround pairs And the left and right surround channels of the second enhancement layer give the same assignment (or the same code as one channel pair).
第十实施例为受限配置,依赖于信道编码条件所要求,配置的理由和特点与第六实施例相似,故不赘述。本方案根据每一传输帧中LDPC编码的块大小、信道编码条件、基本层的特性和/或增强层的特性,对基本层和至少一增强层分别配置字节数。The tenth embodiment is a limited configuration, and is dependent on the channel coding conditions. The reason and characteristics of the configuration are similar to those of the sixth embodiment, and therefore will not be described again. The scheme configures the number of bytes for the base layer and the at least one enhancement layer respectively according to the block size of the LDPC code, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer in each transmission frame.
本发明提出,上述各实施例对一基本层及至少一增强层分别采用DRA编码算法编码。还可采用带宽扩展增强编码工具对基本层和\或至少一增强层分别进行带宽扩展。The present invention proposes that each of the foregoing embodiments encodes a base layer and at least one enhancement layer by using a DRA coding algorithm. Bandwidth extension enhancement coding tools may also be used to perform bandwidth expansion on the base layer and/or at least one enhancement layer, respectively.
以下是本发明结合不同类型音频信号提出的分层及编码方案应用实例。The following are examples of application of the layering and coding scheme proposed by the present invention in combination with different types of audio signals.
参照图 4示出的立体声左右声道二分层结构示意图,立体声音频信号仅存在两个独立全频带声道,因此基本层传输左声道,增强层传输右声道。对此种情况一般两层的配置应该采用均匀配置方式,即左右声道配置同样的声音质量,即配置同样的字节数。基本层和增强层分别可采用带宽扩展增强编码工具进行带宽扩展,图中以虚线框示例。 Reference map 4 shows a schematic diagram of the stereo left and right channel two-layer structure. The stereo audio signal has only two independent full-band channels, so the base layer transmits the left channel and the enhancement layer transmits the right channel. In this case, the configuration of the two layers should be uniformly configured, that is, the left and right channels are configured with the same sound quality, that is, the same number of bytes are configured. The base layer and the enhancement layer can respectively use the bandwidth extension enhancement coding tool for bandwidth expansion, and the figure is illustrated by a dashed box.
参照图5示出的立体声和差声道二分层结构示意图,本实例中仅有二全频带声道,因此仅存在二分层方案。对于立体声信号,在编码时为了提高编码效率通常都要进行和差编码。由于在立体声信号的两个声道间存在一定的相关性,因此差信号在概率上具有比右声道更小的动态范围,因此编码需要更少的数据表示。此外对于一些应用,如卡拉OK的立体声信号,一个声道是伴音,一个声道歌词(语音),和声道由于将两个声道混合在一起,能够表示出两个声道的信息。根据以上两种分析,应当将和声道(及可能的带宽扩展)作为基本层,差声道(及可能的带宽扩展)作为增强层,并且应当采用采用基本层强调配置模式。此应用实例在仅仅能够正确解码基本层时,将比左右声道分层更好。Referring to the schematic diagram of the stereo and differential channel two hierarchical structure shown in FIG. 5, there are only two full-band channels in this example, so there is only a two-layer scheme. For stereo signals, it is usually necessary to perform the difference encoding in order to improve the coding efficiency at the time of encoding. Since there is a certain correlation between the two channels of the stereo signal, the difference signal has a smaller dynamic range than the right channel in probability, so encoding requires less data representation. In addition, for some applications, such as karaoke stereo signals, one channel is the accompanying sound, one channel lyrics (speech), and the channel can represent the two channels by mixing the two channels together. According to the above two analysis, the sum channel (and possible bandwidth extension) should be used as the base layer, the difference channel (and possible bandwidth extension) as the enhancement layer, and the base layer emphasis configuration mode should be adopted. This application example will layer better than the left and right channels when it can only correctly decode the base layer.
以下给出集中对5.1环绕声情况的实施例。参照图6示出的环绕声二分层结构示意图,本实施例为5.1环绕声,其中5个全频带声道,1个超重低音声道。在基本层传输立体声左声道(图示为L)和右声道(图示为R);在增强层传输其他声道,在增强声道中声道的排列顺序为中央声道(图示为C)、超重低音声道(图示为LFE)、左环绕和右环绕声道(图示分别LS和RS)。当然每个全频带声道可选用带宽扩展增强工具(图中用虚线表示),提高编码效率;此外对每个声道对(图示分别为L&R和LS&RS)还可进一步选择采用参数立体声编码工具降低信息冗余度,此时相应的声道对应缩混为单声道(图示分别为M0和M1)进行基本编码。可采用二分层的基本层强调配置和近k:m配置两种方式。An example of focusing on the 5.1 surround sound situation is given below. Referring to the schematic diagram of the surround sound two-layer structure shown in FIG. 6, this embodiment is 5.1 surround sound, wherein 5 full-band channels and 1 sub-subwoofer channel. The stereo left channel (shown as L) and the right channel (shown as R) are transmitted in the base layer; the other channels are transmitted in the enhancement layer, and the order of the channels in the enhanced channel is the center channel (illustration It is C), subwoofer channel (shown as LFE), left surround and right surround channels (illustrated LS and RS respectively). Of course, each full-band channel can be selected with a bandwidth extension enhancement tool (shown by dashed lines in the figure) to improve coding efficiency; in addition, for each channel pair (L&R and LS&RS respectively), a parametric stereo coding tool can be further selected. Reduce the information redundancy. At this time, the corresponding channels are down-mixed to mono (the pictures are M0 and M1 respectively) for basic encoding. Two layers of basic layer emphasis configuration and near k:m configuration can be used.
参照图7示出的环绕声二分层结构示意图,音频分层结构与上一实施例类似,只是增强层可以将声道排列顺序调整为首选编码左环绕和右环绕声道,然后是中央声道和超重低音声道。Referring to the schematic diagram of the surround sound two-layer structure shown in FIG. 7, the audio layering structure is similar to that of the previous embodiment, except that the enhancement layer can adjust the channel arrangement order to the preferred encoding left surround and right surround channels, and then the central sound. Road and subwoofer channels.
参照图8示出的环绕声三分层结构示意图,在本实例中,5.1声道被分成三层来编码,其中基本层编码左右声道(L和R),可选择使用带宽扩展增强工具和参数立体声编码工具,提高编码效率;第一增强层编码中央声道(C),并可选使用带宽扩展增强工具,然后是超重低音声道(LFE)编码;第二增强层传输左环绕和右环绕声道(LS和RS),可选带宽扩展和参数立体声增强工具。如果选择参数立体声增强工具则立体声对的基本编码应修改成对立体声对缩混后的单声道编码,如L&R缩混为M0,LS&RS缩混为M1。本应用实例宜采用数据结构为近k:m:n配置方式。Referring to the schematic diagram of the surround sound three-layer structure shown in FIG. 8, in this example, the 5.1 channel is divided into three layers for encoding, wherein the base layer encodes the left and right channels (L and R), and the bandwidth expansion enhancement tool and the optional bandwidth expansion can be selected. Parametric stereo coding tool for improved coding efficiency; first enhancement layer encoding center channel (C), optional bandwidth extension enhancement tool followed by subwoofer channel (LFE) encoding; second enhancement layer for transmission left surround and right Surround channels (LS and RS), optional bandwidth extension and parametric stereo enhancement tools. If the parametric stereo enhancement tool is selected, the basic encoding of the stereo pair should be modified to the mono encoding of the stereo pair, such as L&R downmixing to M0 and LS&RS downmixing to M1. This application example should adopt a data structure of near k:m:n configuration.
参照图9示出的环绕声三分层结构示意图,音频分层结构与上一实施例类似,但第一增强层和第二增强层互换。Referring to the schematic diagram of the surround sound three-layer structure shown in FIG. 9, the audio layered structure is similar to the previous embodiment, but the first enhancement layer and the second enhancement layer are interchanged.
参照图10示出的DRA & DRA+ 环绕声分层结构示意图,采用环绕声二分层的结构,形成基本层和增强层。在基本层中采用DRA(Digital Rise Audio)对左声道和右声道组成的立体声对进行立体声编码,并且可选带宽扩展SBR(Spectral Band Replication)技术和参数立体声编码PS(Parametric Stereo)技术。当然如果选择参数立体声编码技术,则DRA编码部分将修改为仅对缩混的单声道进行编码,而如果选择使用SBR技术,则DRA编码部分进一步修改为仅对缩混后的单声道的低频带部分编码;在增强层中,首先对中央声道C进行DRA编码,可选采用SBR带宽扩展技术,然后对超重低音声道LFE采用DRA编码,最后对左右环绕声道(LS和RS)进行立体声对DRA编码,可选带宽扩展SBR和参数立体声编码PS,提高对环绕声对的编码效率。本实例宜采用的数据结构为近k:m:n设置,或者当应用于数字音频广播时采用受限设置。Referring to Figure 10, DRA & DRA+ The schematic diagram of the surround sound layering structure adopts a structure of surround sound two layers to form a base layer and an enhancement layer. Adopt DRA (Digital Rise) in the base layer Audio) stereo encoding of stereo pairs consisting of left and right channels, and optional bandwidth extension SBR (Spectral Band Replication) Technology and Parametric Stereo Coded PS (Parametric Stereo) technology. Of course, if the parametric stereo coding technique is selected, the DRA coding portion will be modified to encode only the downmixed mono, and if the SBR technique is selected, the DRA coding portion is further modified to be only for the downmixed mono. Low-band partial encoding; in the enhancement layer, the first channel C is DRA-encoded, optionally using SBR bandwidth extension technology, then DRA encoding for the subwoofer channel LFE, and finally the left and right surround channels (LS and RS) Stereo-to-DRA encoding, optional bandwidth extension SBR and parametric stereo encoding PS, improve coding efficiency for surround pairs. The data structure that should be used in this example is set to near k:m:n, or a limited setting is applied when applied to digital audio broadcasting.
本发明提出,上述各实施例对一基本层及至少一增强层分别采用DRA编码算法编码。The present invention proposes that each of the foregoing embodiments encodes a base layer and at least one enhancement layer by using a DRA coding algorithm.
本发明可以对音频信号进行四层甚至更多分层,但一般采用二至三层分层结构,易于实现。基于声道进行分层,通过灵活控制每层声道的质量,获取最佳综合声音质量。易于满足信道编码要求:由于LDPC信道编码要求每个编码块有固定大小,通过基于声道的粗分层,可以合理安排满足信道要求。不需要精细分层时的各种限制条件,如MPEG AAC-BSAC音频编码中要求MDCT系数每32个一组进行算术编码及相关辅助数据等,影响整体编码效率,因此粗分层可保证更高效率的压缩。The present invention can perform four or more layers on an audio signal, but generally adopts a two- to three-layer hierarchical structure, which is easy to implement. Layering based on the channel, the best overall sound quality is achieved by flexibly controlling the quality of each channel. It is easy to meet channel coding requirements: Since LDPC channel coding requires a fixed size for each coding block, channel-based coarse layering can be reasonably arranged to meet channel requirements. Various restrictions when fine layering is not required, such as MPEG In AAC-BSAC audio coding, MDCT coefficients are required to perform arithmetic coding and related auxiliary data every 32 groups, which affects the overall coding efficiency, so coarse layering can ensure more efficient compression.
本发明的对多声道数字音频编码的方法由上述揭露的方法,可以达到所述目的和效果,然而以上所揭露仅为本发明的较佳实施例,自不能以此限定本发明的权利范围,至于本发明的其它等效修饰或变化,均应涵盖在本发明的权利要求范围内。The method for multi-channel digital audio encoding of the present invention achieves the above objects and effects by the method disclosed above, but the above disclosure is only a preferred embodiment of the present invention, and the scope of the present invention is not limited thereto. Other equivalent modifications or variations of the invention are intended to be included within the scope of the appended claims.

Claims (15)

1、一种对多声道数字音频编码的方法,其特征在于,包括:A method for encoding multi-channel digital audio, comprising:
将多声道音频分为一基本层及至少一增强层;Multi-channel audio is divided into a basic layer and at least one enhancement layer;
对所述一基本层及至少一增强层分别配置字节数;Configuring a number of bytes for each of the basic layer and the at least one enhancement layer;
对所述一基本层及至少一增强层分别编码。The base layer and the at least one enhancement layer are separately encoded.
2、根据权利要求1所述的对多声道数字音频编码的方法,其特征在于:2. A method of encoding a multi-channel digital audio according to claim 1 wherein:
将多声道音频信号分为一基本层和一增强层;Multi-channel audio signal is divided into a basic layer and an enhancement layer;
其中基本层包含至少一全频带声道,增强层包含至少一全频带声道;Wherein the base layer comprises at least one full-band channel, and the enhancement layer comprises at least one full-band channel;
所述基本层包含的全频带声道不大于所述增强层包含的全频带声道数。The base layer includes a full band channel that is no greater than the full band channel number included in the enhancement layer.
3、根据权利要求2所述的对多声道数字音频编码的方法,其特征在于,对于基本层包含的全频带声道小于增强层包含的全频带声道数情况,还包括:The method for encoding a multi-channel digital audio according to claim 2, wherein, for the case where the baseband channel included in the base layer is smaller than the number of full-band channels included in the enhancement layer, the method further includes:
对所述基本层配置字节数为数据帧总字节数/2,基本层每声道的字节数为数据帧总字节数/2*基本层包含的全频带声道数;The number of bytes configured for the base layer is the total number of bytes of the data frame/2, and the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer;
对所述增强层配置字节数为数据帧总字节数/2;增强层每声道的字节数为数据帧总字节数/2*增强层包含的全频带声道数。The number of bytes configured for the enhancement layer is the total number of bytes of the data frame/2; the number of bytes per channel of the enhancement layer is the total number of bytes of the data frame/2* the number of full-band channels included in the enhancement layer.
4、根据权利要求2所述的对多声道数字音频编码的方法,其特征在于,对于基本层包含的全频带声道等于增强层包含的全频带声道数情况,还包括:The method for encoding a multi-channel digital audio according to claim 2, wherein, for the case where the baseband included in the base layer is equal to the number of full-band channels included in the enhancement layer, the method further includes:
对所述基本层配置的字节数大于数据帧总字节数/2;The number of bytes configured for the base layer is greater than the total number of bytes of the data frame/2;
对所述增强层配置的字节数小于数据帧总字节数/2。The number of bytes configured for the enhancement layer is less than the total number of bytes of the data frame/2.
5、根据权利要求2所述的对多声道数字音频编码的方法,其特征在于,还包括:5. The method of encoding multi-channel digital audio according to claim 2, further comprising:
对每一全频带声道配置同样多的字节数,均为数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数)。The same number of bytes are configured for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the number of full-band channels included in the enhancement layer).
6、根据权利要求2所述的对多声道数字音频编码的方法,其特征在于,还包括:The method for encoding a multi-channel digital audio according to claim 2, further comprising:
对所述基本层中每个全频带声道配置的字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数));The number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (data frame) Total number of bytes / number of full-band channels included in the base layer)> (total number of bytes of data frame / (number of full-band channels included in the base layer + number of full-band channels included in the enhancement layer));
对于增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数。The number of bytes for a channel configuration of the enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer)/the number of full-band channels included in the enhancement layer, and at least the rest The number of bytes in the one-channel configuration is less than the total number of bytes in the data frame* (1-1/the number of full-band channels included in the base layer)/the number of full-band channels included in the enhancement layer.
7、根据权利要求2所述的对多声道数字音频编码的方法,其特征在于:7. A method of encoding multi-channel digital audio according to claim 2, wherein:
根据每一传输帧中LDPC编码的块大小、信道编码条件、所述基本层的特性和/或所述增强层的特性,对所述基本层和增强层分别配置字节数。The number of bytes is separately configured for the base layer and the enhancement layer according to the block size of the LDPC encoding in each transmission frame, the channel coding condition, the characteristics of the base layer, and/or the characteristics of the enhancement layer.
8、根据权利要求1所述的对多声道数字音频编码的方法,其特征在于,将多声道音频信号分为一基本层和多个增强层;8. The method of encoding multi-channel digital audio according to claim 1, wherein the multi-channel audio signal is divided into a base layer and a plurality of enhancement layers;
其中基本层包含至少一全频带声道,多个增强层分别包含至少一全频带声道;Wherein the base layer comprises at least one full-band channel, and the plurality of enhancement layers respectively comprise at least one full-band channel;
所述基本层包含的全频带声道少于所有增强层包含的全频带声道数之和。The base layer contains less than the sum of the full band channels of all enhancement layers.
9、根据权利要求8所述的对多声道数字音频编码的方法,其特征在于:9. A method of encoding multi-channel digital audio according to claim 8 wherein:
对所述基本层配置的字节数为数据帧总字节数/2,基本层每声道的字节数为数据帧总字节数/2*基本层包含的全频带声道数;The number of bytes configured for the base layer is the total number of bytes of the data frame/2, and the number of bytes per channel of the base layer is the total number of bytes of the data frame/2* the number of full-band channels included in the base layer;
所述至少一增强层配置的字节数之和为数据帧总字节数/2,其中第一增强层的每个全频带声道的字节数大于数据帧总字节数/2(增强层包含的全频带声道数+基本层包含的全频带声道数), 其余至少一增强层的每个全频带声道的字节数小于数据帧总字节数/2(增强层包含的全频带声道数+基本层包含的全频带声道数)。The sum of the number of bytes of the at least one enhancement layer configuration is the total number of bytes of the data frame/2, wherein the number of bytes of each full-band channel of the first enhancement layer is greater than the total number of bytes of the data frame/2 (enhanced The number of full-band channels included in the layer + the number of full-band channels included in the base layer), The number of bytes of each full-band channel of the remaining at least one enhancement layer is less than the total number of bytes of the data frame/2 (the number of full-band channels included in the enhancement layer + the number of full-band channels included in the base layer).
10、根据权利要求8所述的对多声道数字音频编码的方法,其特征在于:10. A method of encoding multi-channel digital audio according to claim 8 wherein:
对每一全频带声道配置同样多的字节数,均为数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和)。The same number of bytes are configured for each full-band channel, which is the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of full-band channels included in all enhancement layers).
11、根据权利要求8所述的对多声道数字音频编码的方法,其特征在于:11. A method of encoding multi-channel digital audio according to claim 8 wherein:
对所述基本层中每个全频带声道配置字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和));The number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, and (the total number of bytes of the data frame/2)> (the total number of data frames) The number of bytes/the number of full-band channels included in the base layer)> (the total number of bytes of the data frame / (the number of full-band channels included in the base layer + the sum of the number of full-band channels included in all enhancement layers));
对于第一增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和。The number of bytes of a channel configuration for the first enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the sum of the total number of channels included in all enhancement layers And the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1 / the number of full-band channels included in the base layer) / the sum of the number of full-band channels included in all enhancement layers.
12、根据权利要求8所述的对多声道数字音频编码的方法,其特征在于:根据每一传输帧中LDPC编码的块大小、信道编码条件、所述基本层的特性和/或所述增强层的特性,对所述基本层和至少一增强层分别配置字节数。12. A method of encoding multi-channel digital audio according to claim 8 wherein: LDPC encoded block size, channel coding conditions, characteristics of said base layer and/or said The characteristics of the enhancement layer are respectively configured with the number of bytes for the base layer and the at least one enhancement layer.
13、根据权利要求1至12任意一项所述的对多声道数字音频编码的方法,其特征在于,还包括:对所述一基本层及至少一增强层分别采用DRA编码算法编码。The method for encoding a multi-channel digital audio according to any one of claims 1 to 12, further comprising: encoding the base layer and the at least one enhancement layer by using a DRA coding algorithm, respectively.
14、一种对多声道数字音频编码的方法,其特征在于,包括:14. A method of encoding multi-channel digital audio, comprising:
将多声道音频信号分为一基本层和一增强层;其中基本层包含至少一全频带声道,增强层包含至少一全频带声道;所述基本层包含的全频带声道数不大于所述增强层包含的全频带声道数;The multi-channel audio signal is divided into a base layer and an enhancement layer; wherein the base layer includes at least one full-band channel, and the enhancement layer includes at least one full-band channel; the base layer includes a full-band channel number not greater than The number of full-band channels included in the enhancement layer;
对所述基本层及增强层分别配置字节数;其中对所述基本层中每个全频带声道配置的字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+增强层包含的全频带声道数));对增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/增强层包含的全频带声道数;Configuring a number of bytes for the base layer and the enhancement layer respectively; wherein the number of bytes configured for each full-band channel in the base layer is the total number of bytes of the data frame/the number of full-band channels included in the base layer, And (the total number of bytes of the data frame/2)> (the total number of bytes of the data frame / the number of full-band channels included in the base layer) > (the total number of bytes of the data frame / (the number of channels of the full-band included in the base layer + The number of full-band channels included in the enhancement layer)); the number of bytes configured for one channel of the enhancement layer is greater than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer)/enhancement layer The number of full-band channels included, and the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame* (1-1/the number of full-band channels included in the base layer) / the full-band included in the enhancement layer Number of channels
对所述基本层及增强层分别采用DRA编码算法编码。The base layer and the enhancement layer are respectively encoded by a DRA coding algorithm.
15、一种对多声道数字音频编码的方法,其特征在于,包括:15. A method of encoding multi-channel digital audio, comprising:
将多声道音频信号分为一基本层和多个增强层;其中基本层包含至少一全频带声道,多个增强层分别包含至少一全频带声道;所述基本层包含的全频带声道数不大于所有增强层包含的全频带声道数之和;The multi-channel audio signal is divided into a base layer and a plurality of enhancement layers; wherein the base layer includes at least one full-band channel, and the plurality of enhancement layers respectively comprise at least one full-band channel; the base layer includes a full-band sound The number of tracks is not greater than the sum of the number of full-band channels included in all enhancement layers;
对所述一基本层及至少一增强层分别配置字节数;其中对所述基本层中每个全频带声道配置字节数为数据帧总字节数/基本层包含的全频带声道数,且(数据帧总字节数/2)>(数据帧总字节数/基本层包含的全频带声道数)>(数据帧总字节数/(基本层包含的全频带声道数+所有增强层包含的全频带声道数之和));对于第一增强层的某声道配置的字节数大于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和,而对其余至少一声道配置的字节数小于数据帧总字节数*(1-1/基本层包含的全频带声道数)/所有增强层包含的全频带声道数之和;Configuring a number of bytes for each of the base layer and the at least one enhancement layer; wherein the number of bytes configured for each full-band channel in the base layer is a total number of bytes of the data frame / a full-band channel included in the base layer Number, and (the total number of bytes of data frame/2)> (total number of bytes of data frame / number of full-band channels included in the base layer)> (total number of bytes of data frame / (full-band channel included in the base layer) Number + the sum of the number of full-band channels included in all enhancement layers)); the number of bytes configured for a channel of the first enhancement layer is greater than the total number of bytes of the data frame* (1-1/full band included in the base layer) Number of channels) / the sum of the number of full-band channels included in all enhancement layers, and the number of bytes configured for the remaining at least one channel is less than the total number of bytes of the data frame * (1-1 / the full-band sound contained in the base layer) Number of channels) / the sum of the number of full-band channels included in all enhancement layers;
对所述一基本层及至少一增强层分别采用DRA编码算法编码。The base layer and the at least one enhancement layer are respectively coded by using a DRA coding algorithm.
PCT/CN2012/078306 2012-07-06 2012-07-06 Method for encoding multichannel digital audio WO2014005327A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201280000959.4A CN103650036B (en) 2012-07-06 2012-07-06 Method for coding multi-channel digital audio
PCT/CN2012/078306 WO2014005327A1 (en) 2012-07-06 2012-07-06 Method for encoding multichannel digital audio

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/078306 WO2014005327A1 (en) 2012-07-06 2012-07-06 Method for encoding multichannel digital audio

Publications (1)

Publication Number Publication Date
WO2014005327A1 true WO2014005327A1 (en) 2014-01-09

Family

ID=49881272

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/078306 WO2014005327A1 (en) 2012-07-06 2012-07-06 Method for encoding multichannel digital audio

Country Status (2)

Country Link
CN (1) CN103650036B (en)
WO (1) WO2014005327A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110447072A (en) * 2017-04-05 2019-11-12 高通股份有限公司 Bandwidth expansion between sound channel

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1525436A (en) * 2002-12-16 2004-09-01 ���ǵ�����ʽ���� Method and apparatus for encoding/decoding audio data with scalability
CN1878001A (en) * 2005-04-14 2006-12-13 三星电子株式会社 Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data
CN101206860A (en) * 2006-12-20 2008-06-25 华为技术有限公司 Method and apparatus for encoding and decoding layered audio
US20100161322A1 (en) * 2008-12-19 2010-06-24 Electronics And Telecommunications Research Institute Encoding and decoding apparatuses for improving sound quality of g.711 codec
CN102332267A (en) * 2010-07-01 2012-01-25 宝利通公司 The scalable audio codec of full range band

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100773539B1 (en) * 2004-07-14 2007-11-05 삼성전자주식회사 Multi channel audio data encoding/decoding method and apparatus
US8219408B2 (en) * 2008-12-29 2012-07-10 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
WO2011080916A1 (en) * 2009-12-28 2011-07-07 パナソニック株式会社 Audio encoding device and audio encoding method
CN101908938B (en) * 2010-07-27 2012-04-18 北京海尔集成电路设计有限公司 Vehicle-mounted broadcasting equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1525436A (en) * 2002-12-16 2004-09-01 ���ǵ�����ʽ���� Method and apparatus for encoding/decoding audio data with scalability
CN1878001A (en) * 2005-04-14 2006-12-13 三星电子株式会社 Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data
CN101206860A (en) * 2006-12-20 2008-06-25 华为技术有限公司 Method and apparatus for encoding and decoding layered audio
US20100161322A1 (en) * 2008-12-19 2010-06-24 Electronics And Telecommunications Research Institute Encoding and decoding apparatuses for improving sound quality of g.711 codec
CN102332267A (en) * 2010-07-01 2012-01-25 宝利通公司 The scalable audio codec of full range band

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110447072A (en) * 2017-04-05 2019-11-12 高通股份有限公司 Bandwidth expansion between sound channel
CN110447072B (en) * 2017-04-05 2020-11-06 高通股份有限公司 Inter-channel bandwidth extension

Also Published As

Publication number Publication date
CN103650036B (en) 2016-05-11
CN103650036A (en) 2014-03-19

Similar Documents

Publication Publication Date Title
CN1756086B (en) Multichannel audio data encoding/decoding method and apparatus
CN1973320B (en) Stereo coding and decoding methods and apparatuses thereof
JP4772279B2 (en) Multi-channel / cue encoding / decoding of audio signals
JP7033182B2 (en) Audio decoder for interleaving signals
US6934676B2 (en) Method and system for inter-channel signal redundancy removal in perceptual audio coding
US20050195981A1 (en) Frequency-based coding of channels in parametric multi-channel coding systems
CN102428514A (en) Audio Decoder And Decoding Method Using Efficient Downmixing
BR9806404B1 (en) Scaled stereo audio encoding / decoding process and apparatus.
JPH08505739A (en) Method for transmitting and / or storing multi-channel digital signals
CN101484936A (en) Audio decoding
WO2012138819A2 (en) Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
KR20120032039A (en) Audio decoding
TWI501220B (en) Embedding and extracting ancillary data
WO2014005327A1 (en) Method for encoding multichannel digital audio
CN104170007A (en) Monophonic or stereo audio coding method
CN103165135B (en) Digital audio coarse layering coding method and digital audio coarse layering coding device
WO2009129822A1 (en) Efficient encoding and decoding for multi-channel signals
JP2010156822A (en) Sound compression coding device and decoding device of multi-channel sound signal
JP2002162996A (en) Method and device for encoding audio signals, and method and system for distributing music
WO2022012677A1 (en) Audio encoding method, audio decoding method, related apparatus and computer-readable storage medium
WO2016108655A1 (en) Method for encoding multi-channel audio signal and encoding device for performing encoding method, and method for decoding multi-channel audio signal and decoding device for performing decoding method
WO2022176270A1 (en) Encoding device, decoding device, encoding method, and decoding method
TW202403728A (en) Coding method and coding device for multi-channel signal, and terminal device
WO2015012594A1 (en) Method and decoder for decoding multi-channel audio signal by using reverberation signal
JP2000299669A (en) Device and method for audio encoding

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201280000959.4

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12880367

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO FORM 1205A DATED 04-05-2015)

122 Ep: pct application non-entry in european phase

Ref document number: 12880367

Country of ref document: EP

Kind code of ref document: A1