CN102956233A - Extension structure of additional data for digital audio coding and corresponding extension device - Google Patents

Info

Publication number
CN102956233A
Authority
CN
China
Prior art keywords
additional data
byte
data
length
digital audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012103813584A
Other languages
Chinese (zh)
Other versions
CN102956233B (en)
Inventor
闫建新
王磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Guangsheng Research And Development Institute Co ltd
Original Assignee
Shenzhen Rising Source Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Rising Source Technology Co ltd filed Critical Shenzhen Rising Source Technology Co ltd
Priority to CN201210381358.4A priority Critical patent/CN102956233B/en
Publication of CN102956233A publication Critical patent/CN102956233A/en
Application granted granted Critical
Publication of CN102956233B publication Critical patent/CN102956233B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention relates to an extension structure and a corresponding extension device for the additional data of digital audio coding. The additional data is placed at the end of the corresponding audio frame and comprises: an additional data total length byte for describing the total byte length of the additional data; and an additional data byte comprising at least one additional data unit. The additional data unit includes: a data length byte for describing the byte length of the additional data unit; a data type byte for describing the data type of the additional data unit; and a data content byte for describing the additional data content of the additional data unit. The additional data total length byte is placed before the additional data byte. The invention also relates to an extension device for the additional data of digital audio coding. The extension structure and the corresponding extension device can use multiple types of additional data simultaneously and have high coding efficiency.

Description

Extension structure of additional data for digital audio coding and corresponding extension device
Technical Field
The present invention relates to the field of digital audio decoding, and more particularly to an extension structure and a corresponding extension device for the additional data of DRA digital audio coding.
Background
Existing digital audio coding techniques can insert additional data into an audio frame. The additional data can be used to process the coded audio, and at the same time it gives each audio frame a fixed bit length, which helps control digital audio decoding.
In the ISO/IEC MPEG-1 audio coding standard, the additional data is generally placed at the end of each audio frame, and the user can fill in additional data to maintain a fixed bit rate. The content of this additional data is not specifically defined and may be defined by the user.
In the DAB (Digital Audio Broadcasting) audio coding standard, the additional data is also placed at the end of the audio frame. From back to front it comprises: 2 fixed bytes that carry program-associated data; a CRC check word of 2 or 4 bytes that protects the scale factor information in the coded data; a variable number of bytes for extended program-associated data; and padding bits that ensure the whole DAB audio frame has a fixed length. The content of the extended program-associated data is likewise defined by the user.
In audio coding standards such as ISO/IEC 13818-7 MPEG-2 AAC and ISO/IEC 14496-3 MPEG-4 AAC (hereinafter AAC), the additional data used for filling can comprise multiple fill elements placed in various parts of the audio frame. The additional data carried by a fill element can be extension metadata, dynamic range content, SBR (Spectral Band Replication) content, SBR-CRC (Spectral Band Replication with Cyclic Redundancy Check) content, fixed-content fill data, or any other fill data. However, each fill element can carry only one type of additional data.
In the Dolby AC-3 audio coding standard, the additional data is placed near the end of the audio frame. The last bit of the additional data indicates whether valid additional data is present. If it is, the preceding 14 bits describe the valid additional data; otherwise those 14 bits are absent. Padding bits at the very front of the additional data ensure that the audio frame has a fixed length. The content of the 14 bits is again defined by the user.
In GB/T 22726-2008, "Specification for multichannel digital audio coding technology", that is, the DRA (Digital Rise Audio) audio coding standard, the additional data is placed at the end of the audio frame, and the frame header indicates whether additional data is present at the end of the frame. However, the frame header indication gives only the number of bytes preceding the additional data of the frame, and the content of the additional data is still defined by the user.
In summary, of the five audio coding standards above, only AAC defines the type of the additional data in each fill element. However, the fill elements in AAC are scattered, so the overall length of the additional data cannot be stated explicitly. Each fill element also has its own length, so when several fill elements are used the decoder must parse the length of every fill element before it can jump to the start of the next frame, which makes parsing each frame of the bitstream cumbersome. In addition, each fill element in AAC can carry only one type of additional data, so the additional data is used inefficiently.
It is therefore necessary to provide an extension structure and a corresponding extension device for the additional data of digital audio coding that solve these problems of the prior art.
Summary of the Invention
The technical problem addressed by the present invention is that, in prior-art extension structures and extension devices for the additional data of digital audio coding, the types of additional data are incompatible or make digital audio coding inefficient. The invention provides an extension structure and a corresponding extension device for the additional data of DRA digital audio coding that can use multiple types of additional data simultaneously and have high coding efficiency.
The present invention relates to an extension structure of additional data for digital audio coding, the additional data being placed at the end of the corresponding audio frame and comprising:
an additional data total length byte for describing the total byte length of the additional data; and
an additional data byte comprising at least one additional data unit;
the additional data unit comprising:
a data length byte for describing the byte length of the additional data unit;
a data type byte for describing the data type of the additional data unit; and
a data content byte for describing the additional data content of the additional data unit,
wherein the additional data total length byte is placed before the additional data byte.
In the extension structure of the present invention, the bytes in the additional data unit are, in order, the data type byte, the data length byte and the data content byte.
In the extension structure of the present invention, the bytes in the additional data unit are, in order, the data length byte, the data type byte and the data content byte.
In the extension structure of the present invention, the data length byte describes the byte length of the whole additional data unit.
In the extension structure of the present invention, the data length byte describes the byte length of the data content byte.
In the extension structure of the present invention, the audio frame further comprises coded audio data, and the coded audio data is encoded according to the DRA audio coding standard.
In the extension structure of the present invention, the coded audio data comprises a sync byte, a frame header byte and the compressed bytes of each audio channel.
In the extension structure of the present invention, the audio frame further comprises padding data, and the padding data is placed between the additional data and the coded audio data to ensure the fixed length of the audio frame.
In the extension structure of the present invention, the length of the additional data total length byte is 255 or 65792, the length of the data length byte is 4096, and the length of the data type byte is 16.
The present invention also relates to an extension device for the additional data of digital audio coding, which comprises the above extension structure of the additional data of digital audio coding.
The extension structure and the corresponding extension device of the present invention have the following beneficial effects: multiple types of additional data can be used simultaneously and the coding efficiency is high, which solves the technical problem that, in existing extension structures and extension devices for the additional data of digital audio coding, the types of additional data are incompatible or make digital audio coding inefficient.
Brief Description of the Drawings
The invention is further described below with reference to the drawings and embodiments. In the drawings:
Fig. 1 is a schematic structural diagram of the audio frame containing the first preferred embodiment of the extension structure of the additional data of digital audio coding of the present invention;
Fig. 2 is a schematic structural diagram of the audio frame containing the second preferred embodiment of the extension structure of the additional data of digital audio coding of the present invention.
Detailed Description
To make the purpose, technical solution and advantages of the present invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here only explain the present invention and are not intended to limit it.
Referring to Fig. 1, Fig. 1 is a schematic structural diagram of the audio frame containing the first preferred embodiment of the extension structure of the additional data of digital audio coding of the present invention. The audio frame comprises coded audio data, padding data and additional data. The coded audio data is encoded according to the DRA audio coding standard and comprises a sync byte, a frame header byte and the compressed bytes of each audio channel. The sync byte provides the synchronization information of the audio frame. The frame header byte provides the frame attributes of the audio frame, such as frame length information, sampling rate index information and channel number information, and also provides additional data information indicating whether additional data is present at the end of the audio frame. The compressed bytes of the audio channels provide the compressed information of each audio channel of the audio frame.
The padding data is placed between the additional data and the coded audio data and ensures the fixed length of the audio frame. The padding data is set according to the preset length of the audio frame, the length of the coded audio data and the total length of the additional data, so that the audio frame can maintain a fixed bit rate.
The additional data is placed at the end of the corresponding audio frame and comprises an additional data total length byte and an additional data byte. The additional data total length byte describes the total byte length of the additional data and is placed at the very front of the whole additional data (that is, before the additional data byte). This arrangement lets a decoder quickly skip the additional data region and decode the next audio frame directly, and it also makes the audio frame length easy to calculate. The additional data byte comprises at least one additional data unit, and each additional data unit comprises a data length byte, a data type byte and a data content byte. The data length byte describes the byte length of the additional data unit; the data type byte describes the data type of the additional data unit; the data content byte describes the additional data content of the additional data unit.
In this embodiment, the bytes in the additional data unit are, in order, the data length byte, the data type byte and the data content byte. The data length byte here can describe the byte length of the whole additional data unit, so that the lengths of the data length byte, the data type byte and the data content byte can all be adjusted freely. Alternatively, when the data length byte and the data type byte use preset byte lengths, the data length byte may describe only the byte length of the data content byte, which can improve coding efficiency.
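The layout described above can be sketched at byte granularity as follows. This is a simplified illustration, not the patent's bitstream syntax: it assumes one byte for the total length, one byte each for the length and type fields, a length field that counts only the content bytes, and a total length that counts the bytes following it. The function names are hypothetical.

```python
from typing import List, Tuple

Unit = Tuple[int, bytes]  # (data type, data content)

def pack_additional_data(units: List[Unit]) -> bytes:
    """Serialize units as [total_len][len][type][content]... (assumed layout)."""
    body = bytearray()
    for data_type, content in units:
        if not 0 <= data_type <= 0xFF or len(content) > 0xFF:
            raise ValueError("field does not fit in one byte")
        # Order of the first embodiment: length, then type, then content.
        body += bytes([len(content), data_type]) + content
    if len(body) > 0xFF:
        raise ValueError("total length does not fit in one byte")
    return bytes([len(body)]) + bytes(body)

def unpack_additional_data(buf: bytes) -> List[Unit]:
    """Parse the region produced by pack_additional_data."""
    end = 1 + buf[0]          # the total length tells us where the region ends
    pos = 1
    units: List[Unit] = []
    while pos < end:
        length, data_type = buf[pos], buf[pos + 1]
        pos += 2
        units.append((data_type, buf[pos:pos + length]))
        pos += length
    return units

def skip_additional_data(buf: bytes) -> int:
    """Return the offset just past the region without parsing any unit."""
    return 1 + buf[0]
```

Because the total length sits in front of all units, `skip_additional_data` needs only the first byte, which is the skipping benefit the text describes.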
The data type byte can be given a suitable length according to the total number of additional data types, or the number of commonly used additional data types, so as to keep the coded size of the additional data as small as possible. For example, when there are 9 additional data types in total, the data type byte can be represented with 4 bits.
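The 9-types-to-4-bits example follows from the usual rule that n distinct values need ceil(log2(n)) bits. A minimal sketch, with a hypothetical helper name:

```python
def type_field_bits(num_types: int) -> int:
    """Smallest number of bits that can index num_types distinct types."""
    if num_types < 1:
        raise ValueError("need at least one type")
    # (n - 1).bit_length() equals ceil(log2(n)) for n >= 2; use 1 bit as a floor.
    return max(1, (num_types - 1).bit_length())
```

With 9 types the result is 4 bits, and the same 4 bits cover up to 16 types.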
The data content byte describes the additional data content of the additional data unit, for example metadata, a low-bit-rate surround sound extension, or layered coding data. It is placed at the end of the additional data unit and carries the detailed description of the additional data.
A specific example is given below of the structure of an audio frame that contains the extension structure of the present invention, specifically an audio frame with the extension structure of the additional data of DRA digital audio coding.
The extension syntax of the coded audio data, padding data and additional data of this audio frame is as follows:
[Figures BDA00002237704500061 and BDA00002237704500071: syntax table of the audio frame, provided as images in the original publication]
Here FrameHeader() is the coded audio data (its details are partly omitted), UnpackBitPad() is the padding data, and AuxiliaryData() is the additional data.
The coded audio data comprises the sync byte, the frame header byte and the compressed bytes of each audio channel. The padding data is filled with bits of value "1". For coding in a non-fixed bit rate format, at most 31 bits need to be filled, which ensures that the length of the data before the additional data of the audio frame is a multiple of 32 bits (4 bytes). Note that the frame header byte of the coded audio data can carry a flag bAuxData that indicates whether the audio frame has an additional data region; for example, bAuxData=1 indicates that the additional data region is present.
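The 32-bit alignment rule can be illustrated with a short calculation. The helper names are hypothetical, and the bitstream is modelled as a string of "0"/"1" characters purely for readability:

```python
def padding_bits(bits_before_aux: int) -> int:
    """Number of '1' bits needed to reach the next multiple of 32 (0 to 31)."""
    return (-bits_before_aux) % 32

def pad_with_ones(bits: str) -> str:
    """Append '1' bits so the length becomes a multiple of 32."""
    return bits + "1" * padding_bits(len(bits))
```

For instance, 1000 bits of coded audio data need 24 padding bits, so the additional data starts at bit 1024.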
The extension syntax of the additional data is as follows:
[Figures BDA00002237704500072 and BDA00002237704500091: syntax table of the additional data, provided as images in the original publication]
Here aux_data_len_total and esc_length_total together represent the additional data total length byte. aux_data_len_total is 8 bits and can represent a length of at most 255. esc_length_total is 16 bits and extends the total length when the total byte length of the additional data is greater than or equal to 256. The maximum byte length expressed by the additional data total length byte in an audio frame is therefore given as 256+65536=65792.
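One way such an escape mechanism is commonly read is sketched below. The exact rule is defined by the syntax table, which is not reproduced in this text, so the rule used here (read the 16-bit esc_length_total only when the 8-bit field is all ones, then add it) is an assumption, and its upper bound need not match the figure quoted above. `BitReader` and `read_total_length` are hypothetical names.

```python
class BitReader:
    """Minimal MSB-first bit reader over a bytes object."""

    def __init__(self, data: bytes) -> None:
        self.data = data
        self.pos = 0  # position in bits

    def read(self, n: int) -> int:
        value = 0
        for _ in range(n):
            byte = self.data[self.pos >> 3]
            bit = (byte >> (7 - (self.pos & 7))) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value

def read_total_length(reader: BitReader) -> int:
    """Read aux_data_len_total, plus esc_length_total under the assumed rule."""
    total = reader.read(8)            # aux_data_len_total
    if total == 0xFF:                 # assumed escape condition
        total += reader.read(16)      # esc_length_total
    return total
```

A decoder that is not interested in the additional data could call `read_total_length` and then advance by that many bytes to reach the next frame.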
aux_data_len represents the data length byte; it is 12 bits and can represent 4096 distinct length values. filling_type represents the data type byte; it is 4 bits, can represent 16 distinct values, and can therefore provide 16 different extension types. The correspondence can be as shown in Table 1 below, which lists the extension types of the data type byte.
Table 1
Of course, other extension types of additional data, or a larger or smaller number of extension types, can be set here according to the user's needs. The extension types of the additional data and their number do not limit the protection scope of the present invention.
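Since aux_data_len (12 bits) and filling_type (4 bits) add up to 16 bits, a unit header fits in two bytes. The sketch below packs them in the order of the first embodiment, length first and then type; the big-endian, MSB-first packing and the function names are assumptions made for illustration.

```python
import struct
from typing import Tuple

def pack_unit_header(aux_data_len: int, filling_type: int) -> bytes:
    """Pack a 12-bit length followed by a 4-bit type into two bytes."""
    if not 0 <= aux_data_len < 4096:
        raise ValueError("aux_data_len must fit in 12 bits")
    if not 0 <= filling_type < 16:
        raise ValueError("filling_type must fit in 4 bits")
    return struct.pack(">H", (aux_data_len << 4) | filling_type)

def unpack_unit_header(buf: bytes) -> Tuple[int, int]:
    """Return (aux_data_len, filling_type) from the first two bytes."""
    (word,) = struct.unpack(">H", buf[:2])
    return word >> 4, word & 0xF
```

The range checks make the field limits explicit: lengths 0 to 4095 and types 0 to 15.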
Referring to Fig. 2, Fig. 2 is a schematic structural diagram of the audio frame containing the second preferred embodiment of the extension structure of the additional data of digital audio coding of the present invention. This preferred embodiment differs from the first preferred embodiment in that the bytes in the additional data unit are, in order, the data type byte, the data length byte and the data content byte. The additional data type of each additional data unit can thus be obtained first during coding, which is more convenient for coding the additional data type. The other technical features and beneficial effects of this preferred embodiment are the same as or similar to those of the first preferred embodiment described above; refer to the first preferred embodiment for details.
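A type-first unit lets a parser decide what to do with a unit before looking at anything else. The sketch below assumes a 4-bit type followed by a 12-bit length that counts only the content bytes, packed big-endian into two bytes; these widths mirror the first embodiment's fields but the packing and the function name are hypothetical.

```python
from typing import Iterator, Set, Tuple

def iter_units_type_first(buf: bytes, wanted: Set[int]) -> Iterator[Tuple[int, bytes]]:
    """Yield (type, content) for wanted types; other units are stepped over."""
    pos = 0
    while pos + 2 <= len(buf):
        word = int.from_bytes(buf[pos:pos + 2], "big")
        data_type, length = word >> 12, word & 0x0FFF  # type comes first
        pos += 2
        content = buf[pos:pos + length]
        pos += length
        if data_type in wanted:
            yield data_type, content
```

A decoder that only understands, say, type 3 can pass `{3}` and ignore every other unit without interpreting its content.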
The present invention also relates to an extension device for the additional data of digital audio coding. The extension device uses the extension structure described above, and its specific embodiments are the same as or similar to those of the extension structure; refer to the embodiments of the extension structure above for details.
In the extension structure and the corresponding extension device for the additional data of DRA digital audio coding of the present invention, all additional data is concentrated at the end of the audio frame, which is simple to implement. The additional data total length byte at the front of the additional data gives the length of the whole additional data region, so a decoder can easily skip the region and decode the next audio frame directly. In addition, the invention can use multiple types of additional data simultaneously, which allows complex functions to be implemented and fully meets the requirements of a given application. The extension structure and the corresponding extension device of the present invention therefore achieve higher coding efficiency than existing extension structures for additional data.
The above are only embodiments of the present invention and do not limit the scope of its claims. Any equivalent structural transformation made using the content of the description and drawings of the present invention, or any direct or indirect use in other related technical fields, is likewise included in the scope of patent protection of the present invention.

Claims (10)

1. An extension structure of additional data for digital audio coding, the additional data being placed at the end of the corresponding audio frame, characterized in that it comprises:
an additional data total length byte for describing the total byte length of the additional data; and
an additional data byte comprising at least one additional data unit;
the additional data unit comprising:
a data length byte for describing the byte length of the additional data unit;
a data type byte for describing the data type of the additional data unit; and
a data content byte for describing the additional data content of the additional data unit,
wherein the additional data total length byte is placed before the additional data byte.
2. The extension structure of additional data for digital audio coding according to claim 1, characterized in that the bytes in the additional data unit are, in order, the data type byte, the data length byte and the data content byte.
3. The extension structure of additional data for digital audio coding according to claim 1, characterized in that the bytes in the additional data unit are, in order, the data length byte, the data type byte and the data content byte.
4. The extension structure of additional data for digital audio coding according to claim 3, characterized in that the data length byte describes the byte length of the whole additional data unit.
5. The extension structure of additional data for digital audio coding according to claim 3, characterized in that the data length byte describes the byte length of the data content byte.
6. The extension structure of additional data for digital audio coding according to claim 1, characterized in that the audio frame further comprises coded audio data, and the coded audio data is encoded according to the DRA audio coding standard.
7. The extension structure of additional data for digital audio coding according to claim 6, characterized in that the coded audio data comprises a sync byte, a frame header byte and the compressed bytes of each audio channel.
8. The extension structure of additional data for digital audio coding according to claim 6, characterized in that the audio frame further comprises padding data, and the padding data is placed between the additional data and the coded audio data to ensure the fixed length of the audio frame.
9. The extension structure of additional data for digital audio coding according to claim 1, characterized in that the length of the additional data total length byte is 255 or 65792, the length of the data length byte is 4096, and the length of the data type byte is 16.
10. An extension device for additional data of digital audio coding, characterized in that it comprises the extension structure of additional data for digital audio coding according to any one of claims 1 to 9.
CN201210381358.4A 2012-10-10 2012-10-10 Extension structure of additional data for digital audio coding and corresponding extension device Active CN102956233B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210381358.4A CN102956233B (en) 2012-10-10 2012-10-10 Extension structure of additional data for digital audio coding and corresponding extension device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210381358.4A CN102956233B (en) 2012-10-10 2012-10-10 Extension structure of additional data for digital audio coding and corresponding extension device

Publications (2)

Publication Number Publication Date
CN102956233A true CN102956233A (en) 2013-03-06
CN102956233B CN102956233B (en) 2015-07-08

Family

ID=47764965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210381358.4A Active CN102956233B (en) 2012-10-10 2012-10-10 Extension structure of additional data for digital audio coding and corresponding extension device

Country Status (1)

Country Link
CN (1) CN102956233B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12069118B2 (en) 2022-03-23 2024-08-20 Sercomm Corporation Streaming media processing method, transmitting device and receiving device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1430222A (en) * 2001-12-27 2003-07-16 株式会社东芝 Method and equipment for processing audio information having system header
CN1802834A (en) * 2003-08-18 2006-07-12 阿尔卡特公司 VoIP communication method capable of carrying out transmission on additive data
CN1863302A (en) * 2005-11-03 2006-11-15 华为技术有限公司 Multimedia communication method and terminal thereof
CN102365680A (en) * 2009-02-03 2012-02-29 三星电子株式会社 Audio signal encoding and decoding method, and apparatus for same
US20120233443A1 (en) * 2001-10-29 2012-09-13 Julien Sebot Processor to execute shift right merge instructions

Also Published As

Publication number Publication date
CN102956233B (en) 2015-07-08

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220520

Address after: 510530 No. 10, Nanxiang 2nd Road, Science City, Luogang District, Guangzhou, Guangdong

Patentee after: Guangdong Guangsheng research and Development Institute Co.,Ltd.

Address before: 518057 6th floor, software building, No. 9, Gaoxin Zhongyi Road, high tech Zone, Nanshan District, Shenzhen, Guangdong Province

Patentee before: SHENZHEN RISING SOURCE TECHNOLOGY Co.,Ltd.
