KR20150009476A - Method for encoding and decoding of multi channel audio signal, encoder and decoder - Google Patents
Method for encoding and decoding of multi channel audio signal, encoder and decoder Download PDFInfo
- Publication number
- KR20150009476A KR20150009476A KR20140089722A KR20140089722A KR20150009476A KR 20150009476 A KR20150009476 A KR 20150009476A KR 20140089722 A KR20140089722 A KR 20140089722A KR 20140089722 A KR20140089722 A KR 20140089722A KR 20150009476 A KR20150009476 A KR 20150009476A
- Authority
- KR
- South Korea
- Prior art keywords
- channel
- audio signal
- data
- size
- encoding
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 118
- 238000000034 method Methods 0.000 title claims abstract description 35
- 238000010586 diagram Methods 0.000 description 12
- 239000000284 extract Substances 0.000 description 8
- 238000000605 extraction Methods 0.000 description 3
- 238000004590 computer program Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
The present invention relates to an encoding method and a decoding method for a multi-channel audio signal, and to an encoder and a decoder for performing the method. More particularly, the present invention relates to a method and apparatus for allocating different bit rates according to an audio frame for each channel.
Recently, as the quality of multimedia contents increases, contents including multichannel audio signals such as 7.1 channel, 10.2 channel, 13.2 channel and 22.2 channel more than 5.1 channel are being generated. For example, there is an attempt to use multi-channel audio signals such as 10.2 channels and 22.2 channels in high-quality broadcasting such as 13.2 channel multi-channel audio signals and UHDTV in the movie field.
As described above, since a multi-channel audio signal has a large capacity, it is important to efficiently encode the multi-channel audio signal. In the conventional audio encoding technique, the same bit rate is assigned to each channel or the encoding is performed at a substantially constant bit rate for all the audio signal segments for each channel.
In other audio coding techniques, audio may be encoded with a variable bit rate (VBR). However, in such an encoding technique, the encoding efficiency may be good due to a small signal difference for each channel for an audio signal having a small channel. However, since a multi-channel audio signal, such as 10.2 channel and 22.2 channel, The efficiency may not be good.
Therefore, there is a need for a method for efficiently encoding a multi-channel audio signal.
The present invention provides a method and apparatus for assigning different bit rates to audio frames for respective channels when encoding multi-channel audio signals.
The present invention provides a method and apparatus capable of providing a high-quality multi-channel audio signal even under the same bit rate environment.
According to an embodiment of the present invention, there is provided an encoding method, comprising: extracting characteristics of an audio signal in a multi-channel audio signal; Setting a size of data to be allocated to a channel based on the extracted characteristics of each channel; And encoding the audio signal for each channel based on the size of data to be allocated for each channel.
The step of extracting the characteristic of each of the channels of the audio signal may include extracting energy for each channel for each of a plurality of frames constituting the audio signal and setting the size of the audio signal, The size of the data to be allocated to the channel can be set correspondingly.
The step of setting the size of the data may set a size of data proportional to the size of the extracted energy for each channel.
The step of setting the size of the data may include the steps of allocating the same or different data sizes to the frames in the same order on a channel basis and setting the sizes of the same or different data You can assign a size.
The step of encoding the channel-specific audio signal may encode a mono-type audio signal or a stereo-type audio signal.
And generating a bit stream by multiplexing the encoded audio signal for each channel.
According to an embodiment of the present invention, there is provided a decoding method comprising: extracting an audio signal per channel encoded in a bitstream; And decoding the audio signal for each channel based on the size of data allocated for each channel.
The decoding may restore a mono audio signal or a stereo audio signal from the encoded audio signal for each channel.
The size of the data allocated for each channel may be determined based on the energy per channel for each of a plurality of frames constituting the audio signal for each channel.
An encoder according to an embodiment of the present invention includes a channel characteristic extracting unit for extracting characteristics of each channel of an audio signal in a multi-channel audio signal; A data setting unit for setting a size of data to be allocated to a channel based on the extracted characteristics of each channel; And a plurality of encoding units for encoding an audio signal for each channel based on a size of data to be allocated for each channel.
The channel characteristic extracting unit extracts energy for each channel for each of a plurality of frames constituting the audio signal, and the data setting unit sets a size of data to be allocated to the channel corresponding to the extracted energy for each channel have.
The data setting unit may set a size of data proportional to the extracted energy of each channel.
The data setting unit may allocate the sizes of data that are the same or different from each other for the frames in the same order and allocate the same size or different sizes of data to frames of different orders of the same channel have.
Each of the plurality of encoding units may encode a monaural audio signal or a stereo audio signal.
And a bitstream generation unit for generating a bitstream by multiplexing the encoded audio signals for each channel.
According to an embodiment of the present invention, there is provided a decoder including: a bitstream analyzer for extracting an audio signal for each channel encoded in a bitstream; And a plurality of decoders for decoding the audio signal for each channel based on the size of data allocated for each channel.
Each of the plurality of decoding units may recover a mono audio signal or a stereo audio signal from the encoded audio signal of each channel.
The size of the data allocated for each channel may be determined based on the energy per channel for each of a plurality of frames constituting the audio signal for each channel.
According to an embodiment of the present invention, encoding efficiency of a multi-channel audio signal can be improved by assigning different bit rates to audio frames of each channel.
According to an embodiment of the present invention, a high-quality multi-channel audio signal can be provided even in the same bit rate environment.
According to an embodiment of the present invention, a high-quality audio signal can be reproduced in a reproducing terminal including a decoder by encoding a multi-channel audio signal more efficiently.
1 is a diagram illustrating an encoder and a decoder in accordance with one embodiment.
2 is a diagram showing a detailed configuration of an encoder according to an embodiment.
3 is a diagram illustrating operation of an encoder in accordance with one embodiment.
4 is a diagram illustrating a size of data for each channel according to an embodiment.
5 is a diagram showing a detailed configuration of a decoder according to an embodiment.
6 is a diagram illustrating an operation of a decoder according to an embodiment.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.
1 is a diagram illustrating an encoder and a decoder in accordance with one embodiment.
Referring to Figure 1, an
The
Here, the characteristic of each channel means a characteristic of each frame for each channel, and for example, the characteristic may include a magnitude of energy corresponding to a frame. And, the size of the data may mean a bit necessary for encoding. In other words, the
The
2 is a diagram showing a detailed configuration of an encoder according to an embodiment.
2, the
The channel
Here, the frame may be divided according to the time interval of the audio signal. For example, the characteristic of the audio signal per channel may mean the amount of energy contained in the frame included in the audio signal corresponding to each channel. Specifically, when the audio signal corresponding to the channel 1 and the channel 2 includes N frames, the channel
The
Specifically, the
For example, the
The
The plurality of encoding
The result encoded by the plurality of encoding
3 is a diagram illustrating operation of an encoder in accordance with one embodiment.
In
In
In
In
4 is a diagram illustrating a size of data for each channel according to an embodiment.
In particular, referring to FIG. 4, it can be seen that the multi-channel audio signal is composed of 10 channels from channel 1 to channel 10. It is assumed that a plurality of encoding units encodes two channels and encodes them into a stereo format. It can be seen that the audio signals corresponding to the respective channels are composed of the frames 1 to N. [ At this time, the size of the data for each channel in one frame may be the same or may be different. The size of data per channel included in each frame may be the same as or different from the size of data of the previous frame. This is because they can have the same or different energy depending on the channel for the frame.
For example, the size (bits) of data allocated to channels 1 and 2 in frame 1 and the size (bits) of data allocated to channels 3 and 4 may be different. On the other hand, even if the channels 1 and 2 are the same, the sizes of data allocated to the frames 1 and 2 may be different. Here, the size of data allocated for each channel is related to the energy determined for each channel for a frame classified according to the time interval of the multi-channel audio signal. Specifically, the size of data allocated when performing encoding may be related to the amount of energy determined in a particular frame. In Fig. 4, the size of the data corresponds to the length of each block.
That is, referring to FIG. 4, the size of data for each channel allocated to each frame may be determined according to characteristics of each channel. Then, the size of the data for each channel may be the same or different even if it is the same frame. In addition, the size of data allocated to each frame may be the same or different even if the same channel is used.
In the case of FIG. 4, since it is assumed that a plurality of encoding units couple and encode two channels, it can be seen that the sizes of data allocated to channels encoded for two frames are set to the same size. If a plurality of encoding units encode an audio signal corresponding to one channel in a mono form, the sizes of data allocated to channels 1 and 2 may be different from each other. That is, in the case of one frame, the size of data classified according to 10 channels can be allocated to encode the frame.
5 is a diagram showing a detailed configuration of a decoder according to an embodiment.
Referring to FIG. 5, the
The
Each of the plurality of
6 is a diagram illustrating an operation of a decoder according to an embodiment.
In
In
The apparatus described above may be implemented as a hardware component, a software component, and / or a combination of hardware components and software components. For example, the apparatus and components described in the embodiments may be implemented within a computer system, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA) A programmable logic unit (PLU), a microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to execution of the software. For ease of understanding, the processing apparatus may be described as being used singly, but those skilled in the art will recognize that the processing apparatus may have a plurality of processing elements and / As shown in FIG. For example, the processing unit may comprise a plurality of processors or one processor and one controller. Other processing configurations are also possible, such as a parallel processor.
The software may include a computer program, code, instructions, or a combination of one or more of the foregoing, and may be configured to configure the processing device to operate as desired or to process it collectively or collectively Device can be commanded. The software and / or data may be in the form of any type of machine, component, physical device, virtual equipment, computer storage media, or device , Or may be permanently or temporarily embodied in a transmitted signal wave. The software may be distributed over a networked computer system and stored or executed in a distributed manner. The software and data may be stored on one or more computer readable recording media.
The method according to an embodiment may be implemented in the form of a program command that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions to be recorded on the medium may be those specially designed and configured for the embodiments or may be available to those skilled in the art of computer software. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; Magneto-optical media, and hardware devices specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the embodiments, and vice versa.
While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. For example, it is to be understood that the techniques described may be performed in a different order than the described methods, and / or that components of the described systems, structures, devices, circuits, Lt; / RTI > or equivalents, even if it is replaced or replaced. Therefore, other implementations, other embodiments, and equivalents to the claims are also within the scope of the following claims.
101: Encoder
102: decoder
Claims (15)
Setting a size of data to be allocated to a channel based on the extracted characteristics of each channel; And
Encoding the audio signal for each channel based on the size of data to be allocated for each channel
/ RTI >
The step of extracting the channel-specific characteristics of the audio signal includes:
Extracting energy for each channel for each of the plurality of frames constituting the audio signal,
Wherein the step of setting the size of the data comprises:
And setting a size of data to be allocated to a channel corresponding to the extracted energy for each channel.
Wherein the step of setting the size of the data comprises:
And setting a size of data proportional to the extracted energy of each channel.
Wherein the step of setting the size of the data comprises:
Allocating the same size or different data size to each frame for frames corresponding to the same order,
And allocating sizes of the same or different data to each other for a frame in a different order of the same channel.
Wherein the step of encoding the channel-
An encoding method for encoding a mono-type audio signal or a stereo-type audio signal.
Generating a bit stream by multiplexing the encoded audio signal for each channel;
≪ / RTI >
Decoding the audio signal for each channel based on the size of data allocated for each channel
/ RTI >
Wherein the decoding comprises:
A decoding method for recovering a mono audio signal or a stereo audio signal from an encoded audio signal per channel.
The size of the data allocated for each channel may be,
Wherein the energy of each channel is determined based on energy per channel for each of a plurality of frames constituting the audio signal per channel.
A data setting unit for setting a size of data to be allocated to a channel based on the extracted characteristics of each channel; And
A plurality of encoding units for encoding an audio signal for each channel based on a size of data to be allocated for each channel,
/ RTI >
The channel characteristic extracting unit,
Extracting energy for each channel for each of the plurality of frames constituting the audio signal,
Wherein the data setting unit comprises:
And setting a size of data to be allocated to the channel in correspondence with the extracted energy for each channel.
Wherein the data setting unit comprises:
And setting a size of data proportional to the extracted energy of each channel.
Wherein the data setting unit comprises:
Allocating the same size or different data size to each frame for frames corresponding to the same order,
An encoder that allocates the same or different data sizes for frames of different orders of the same channel.
Wherein each of the plurality of encoding units comprises:
An encoder that encodes a mono audio signal or a stereo audio signal.
A bitstream generation unit for generating a bitstream by multiplexing the encoded audio signals for each channel,
Lt; / RTI >
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/333,092 US20150025894A1 (en) | 2013-07-16 | 2014-07-16 | Method for encoding and decoding of multi channel audio signal, encoder and decoder |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020130083312 | 2013-07-16 | ||
KR20130083312 | 2013-07-16 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20150009476A true KR20150009476A (en) | 2015-01-26 |
Family
ID=52572684
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR20140089722A KR20150009476A (en) | 2013-07-16 | 2014-07-16 | Method for encoding and decoding of multi channel audio signal, encoder and decoder |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20150009476A (en) |
-
2014
- 2014-07-16 KR KR20140089722A patent/KR20150009476A/en not_active Application Discontinuation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11830504B2 (en) | Methods and apparatus for decoding a compressed HOA signal | |
JP5006315B2 (en) | Audio signal encoding and decoding method and apparatus | |
CN106463125B (en) | Audio segmentation based on spatial metadata | |
JP5254808B2 (en) | Audio signal processing method and apparatus | |
TWI648729B (en) | A method for compressing a high-order fidelity stereo signal by compressing a high-order fidelity stereo signal, a device for compressing a high-order fidelity stereo signal, and a device for decompressing a compressed high-order fidelity stereo signal | |
JP7413418B2 (en) | Audio decoder for interleaving signals | |
US11869523B2 (en) | Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations | |
US20150025894A1 (en) | Method for encoding and decoding of multi channel audio signal, encoder and decoder | |
RU2718418C2 (en) | Decoding device, decoding method and program | |
KR20150009476A (en) | Method for encoding and decoding of multi channel audio signal, encoder and decoder | |
WO2014068817A1 (en) | Audio signal coding device and audio signal decoding device | |
JP2015011076A (en) | Acoustic signal encoder, acoustic signal encoding method, and acoustic signal decoder | |
TW202123220A (en) | Multichannel audio encode and decode using directional metadata | |
US20240185872A1 (en) | Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations | |
CN109526234B (en) | Apparatus and method for encoding and decoding multi-channel audio signal | |
TWI412021B (en) | Method and apparatus for encoding and decoding an audio signal | |
KR20160081844A (en) | Encoding method and encoder for multi-channel audio signal, and decoding method and decoder for multi-channel audio signal | |
KR20140128563A (en) | Updating method of the decoded object list |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Withdrawal due to no request for examination |