WO2010041877A2 - Method and apparatus for processing a signal - Google Patents

Method and apparatus for processing a signal

Info

Publication number
WO2010041877A2
Authority
WO
WIPO (PCT)
Prior art keywords
information
signal
downmix
modified
downmix gain
Prior art date
Application number
PCT/KR2009/005748
Other languages
English (en)
Other versions
WO2010041877A3 (fr)
Inventor
Hyen O Oh
Yang Won Jung
Original Assignee
Lg Electronics Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lg Electronics Inc.
Publication of WO2010041877A2
Publication of WO2010041877A3

Links

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/20 Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

Definitions

  • the present invention relates to an apparatus for processing an audio signal and method thereof.
  • although the present invention is suitable for a wide scope of applications, it is particularly suitable for processing an audio signal received via a digital medium, a broadcast signal or the like.
  • parameters are extracted from each object signal. These parameters include object level information indicating a level of each of the object signals and downmix gain information applied to each of the object signals.
  • the downmix gain information transferred to a decoder may correspond to redundant information.
  • the present invention is directed to an apparatus for processing a signal and method thereof that substantially obviate one or more of the problems due to limitations and disadvantages of the related art.
  • An object of the present invention is to provide an apparatus for processing a signal and method thereof, by which an audio signal can be efficiently decoded according to the environment and use purposes. For this, since a modified signal generated from applying downmix gain information to an object signal is used, downmix gain information is sent in a manner of being included in an extension region of a frame of object information instead of the frame of the object information.
  • Another object of the present invention is to provide an apparatus for processing a signal and method thereof, by which object level information can be obtained using downmix gain information and modified object level information, in case that a decoder side attempts to obtain an original signal, in a manner that the downmix gain information is included in an extension region of a frame of object information.
  • the present invention provides the following effects or advantages.
  • the signal is decoded using object information including modified object level information. Therefore, since the signal can be decoded without using downmix gain information, the present invention is able to raise coding efficiency.
  • downmix gain information is included in an extension region of a frame of object information and then transferred.
  • if a decoder side attempts to obtain an original object signal, it is able to calculate the object level information using the downmix gain information and modified object level information. Therefore, a signal can be efficiently decoded according to the environment and use purposes.
  • FIG. 1 is a schematic diagram of a signal encoding apparatus according to one embodiment of the present invention.
  • FIG. 2 is a schematic diagram of an object information generating unit according to one embodiment of the present invention.
  • FIG. 3 is a flowchart for a signal encoding method in the object information generating unit shown in FIG. 2.
  • FIG. 4 is a schematic diagram of an object information generating unit shown in FIG. 2 according to another embodiment of the present invention.
  • FIG. 5 is a flowchart for a signal encoding method in the signal encoding apparatus shown in FIG. 4.
  • FIG. 6 is a schematic diagram of a signal decoding apparatus according to another embodiment of the present invention.
  • FIG. 7 is a schematic diagram of an information processing unit according to another embodiment of the present invention.
  • FIG. 8 and FIG. 9 are diagrams of syntax structures according to various embodiments of the present invention.
  • FIG. 10 is a schematic diagram of a configuration of a product including a modified object level information obtaining unit, a downmix gain information obtaining unit, an object level information calculating unit and an object information processing unit according to another embodiment of the present invention.
  • FIG. 11 is a schematic diagram for relations of products each including a modified object level information obtaining unit, a downmix gain information obtaining unit, an object level information calculating unit and an object information processing unit according to another embodiment of the present invention.
  • FIG. 12 is a schematic block diagram of a broadcast signal decoding apparatus including a modified object level information obtaining unit, downmix gain information obtaining unit, an object level information calculating unit and an object information processing unit according to another embodiment of the present invention.
  • a method of decoding an audio signal comprises receiving (a) a downmix signal being generated by downmixing a plurality of modified object signals, the modified object signals being generated by applying downmix gain information to object signal, (b) object information including the downmix gain information applied to the object signal and modified object level information indicating energy level of the modified object signal, and (c) rendering information to determine output level and position of the object signal which is included in the downmix signal; obtaining the modified object level information from a frame of the object information and downmix gain information from extension region of the object information; generating downmix processing information by using the modified object level information and the rendering information, the downmix processing information in order to allow object panning and object level control of the downmix signal; generating output signal by modifying the downmix signal with the downmix processing information; and generating object level information by applying the downmix gain information to the modified object level information, in order to obtain the object signal.
  • the downmix gain information is included in the extension region of the object information based on downmix gain flag and wherein the downmix gain flag indicates whether the downmix gain information is included in the object information.
  • the downmix gain flag is included in header of the object information.
  • the object level information is not included in the object information.
  • the object information further includes object gain difference information indicating a ratio of contributing left channel and right channel of a stereo signal, when the downmix signal is a stereo signal.
  • the method of decoding an audio signal further comprises generating multichannel information by using the modified object level information and the rendering information, the multi-channel information in order to convert the downmix signal into plural-channel audio signal; and generating multi-channel audio signal by applying the multi-channel information to the output signal.
  • an apparatus for decoding an audio signal comprises a signal receiving unit receiving (a) a downmix signal being generated by downmixing a plurality of modified object signals, the modified object signal being generated by applying downmix gain information to object signal, (b) object information including downmix gain information applied to the object signal and modified object level information indicating energy level of the modified object signal , and (c) rendering information to determine output level and position of the object signal which is included in the downmix signal; a modified object level information obtaining unit obtaining the modified object level information from a frame of the object information; a downmix gain obtaining unit obtaining the downmix gain information from extension region of the object information; an object information processing unit generating downmix processing information and multi-channel information by using the modified object level information and the rendering information, the downmix processing information in order to allow object panning and object level control of the downmix signal, the multi-channel information in order to convert the downmix signal into plural-channel audio signal; an object level information calculating unit
  • the concept 'coding' in the present invention includes both encoding and decoding.
  • 'information' in this disclosure is the terminology that generally includes values, parameters, coefficients, elements and the like and its meaning can be construed as different occasionally, by which the present invention is non-limited.
  • Stereo signal is taken as an example for a signal in this disclosure, by which examples of the present invention are non-limited.
  • a signal in this disclosure may include a plural channel signal having at least three or more channels.
  • FIG. 1 is a schematic diagram of a signal encoding apparatus 100 according to one embodiment of the present invention.
  • a signal encoding apparatus 100 includes an object information generating unit 110 and a downmix signal generating unit 120.
  • the object information generating unit 110 receives inputs of a plurality of object signals S_1 to S_n and downmix gain information (DMG) and then generates object information (OI).
  • an object signal means a source included in an audio signal, such as a guitar, a violin, a vocal and the like. Each source is called one object.
  • the object information means the information indicating attributes of a plurality of the objects.
  • the object information can include object level information indicating a level of an object signal from a plurality of the inputted object signals, and the downmix gain information.
  • the object information generating unit 110 is able to generate a modified object signal using a plurality of object signals and downmix gain information.
  • the object information includes modified object level information on the modified object signal instead of the object level information. Details of this process will be explained with reference to FIGs. 2 to 5 later.
  • the downmix signal generating unit 120 receives inputs of a plurality of the object signals and the downmix gain information and then generates a downmix signal (DMX).
  • the downmix signal can include a mono or stereo signal, by which examples of the present invention are non-limited.
  • FIG. 2 shows an object information generating unit 110 according to one embodiment of the present invention.
  • an object information generating unit 110 includes a modified object generating unit 210 and a modified object level information generating unit 220.
  • the modified object generating unit 210 receives inputs of an object signal S_i and downmix gain information D_i and then generates a modified object signal.
  • the modified object signal generated from S_i can be represented as Formula 1.
  • 'i' is the number that indicates each object signal.
  • a first object signal can be represented as S_1.
  • a second object signal can be represented as S_2.
  • downmix gain information applied to the first object signal can be represented as D_1.
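Formula 1 itself did not survive in this text. A plausible reconstruction, assuming the downmix gain is applied as a plain per-object multiplication (which agrees with the later statement that the downmix signal can be a simple sum of the modified object signals), would be:

```latex
\hat{S}_i = D_i \, S_i, \qquad i = 1, \dots, n
```

Here \hat{S}_i is a notation assumed for the modified object signal; the diacritic used in the original publication is not recoverable from this text.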
  • the modified object level information generating unit 220 estimates and generates modified object level information using a level of the modified object signal.
  • Object level information (OLD) of the related art is estimated from an object signal S_i before downmix gain information is applied.
  • modified object level information (MOLD) is estimated from a modified object signal generated from applying downmix gain information to an object signal.
  • the related art object level information (OLD) estimated from the object signal and the modified object level information (MOLD) estimated from the modified object signal according to one embodiment of the present invention are represented as Formula 2 and Formula 3, respectively, as follows.
  • 'S_i^{n,m}' indicates an object signal on a sub-band defined by a time slot
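Formulas 2 and 3 are missing from this text, so the following is only a minimal sketch of the distinction they draw. It assumes that a "level" is the sum of squared magnitudes over time slots and sub-bands (no normalization), and that the downmix gain is a linear scalar. The function name `level` and the sample values are hypothetical.

```python
import numpy as np

def level(signal):
    """Energy level of a sub-band signal: sum of squared magnitudes over
    all time slots and frequency bands (an assumed definition)."""
    return float(np.sum(np.abs(signal) ** 2))

# Hypothetical object signal on a 4-slot x 3-band grid and its downmix gain.
s_i = np.arange(12, dtype=float).reshape(4, 3)
d_i = 0.5

old_i = level(s_i)         # related-art OLD: level before the gain is applied
mold_i = level(d_i * s_i)  # MOLD: level of the modified object signal

# Under this energy definition the two levels differ by the squared gain.
print(old_i, mold_i)
```

Under these assumptions MOLD already carries the effect of the downmix gain, which is why the gain need not be sent separately for rendering.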
  • object information is the information on a modified object signal having downmix gain information applied thereto. And, it will be more efficient in aspect of coding efficiency if a modified object signal is finally reconstructed in decoding. Therefore, object information according to one embodiment of the present invention includes modified object level information (MOLD) on a modified object signal generated from applying downmix gain information to an object signal. And, object level information (OLD) may not be separately transferred.
  • FIG. 3 is a flowchart for a signal encoding method in the object information generating unit shown in FIG. 2.
  • a modified object signal is then generated by applying the downmix gain information to the object signal according to Formula 1 [S320]. And, it is able to generate modified object level information from the generated modified object signal according to
  • the downmix signal can include a simple sum of the modified object signals and can be represented as Formula 4, by which the present invention is non-limited.
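Formula 4 is not reproduced here, but the "simple sum" case can be sketched as follows. The function name `mono_downmix` and the array shapes are assumptions made for illustration; the sketch covers only the mono case.

```python
import numpy as np

def mono_downmix(objects, gains):
    """Sum of modified object signals, each being gain * object signal."""
    objects = np.asarray(objects, dtype=float)  # shape: (num_objects, num_samples)
    gains = np.asarray(gains, dtype=float)      # shape: (num_objects,)
    modified = gains[:, None] * objects         # modified object signals
    return modified.sum(axis=0)                 # downmix signal (DMX)

# Two hypothetical objects of three samples each.
dmx = mono_downmix([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]], [0.5, 0.25])
print(dmx)
```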
  • the object information(OI) is generated by using the modified object signal.
  • the object information can further include inter-object cross correlation information (IOC) indicating whether object signals on two channels have correlation with each other and downmix gain difference information (DCLD) indicating a ratio that an object signal is included in a stereo downmix signal if a downmix signal is the stereo signal.
  • object information (OI) can further include modified object level information (MOLD) indicating a level of a modified object signal generated from applying downmix gain information to an inputted object signal.
  • object information (OI) includes the modified object level information (MOLD)
  • in an audio signal processing method and apparatus, in case that a decoder side attempts to obtain an original object signal as well as an upmixed multi-channel audio signal, it is able to obtain the object level information (OLD) using downmix gain information (DMG), since the downmix gain information (DMG) is included in an extension region of a frame.
  • FIG. 4 is a schematic diagram of an object information generating unit 110 according to another embodiment of the present invention.
  • an object information generating unit 110 includes a modified object generating unit 410, a modified object level information generating unit 420 and an object gain difference information generating unit 430.
  • the modified object generating unit 410 receives inputs of an object signal S_i and downmix gain information D_i and then generates a modified object signal.
  • the downmix gain information D_i is the value to indicate how much an object signal contributes to a stereo channel and may include a pair of values.
  • it is able to derive amended downmix gain information (D_{1,i}, D_{2,i}) to apply a normalized gain
  • downmix gain information D_{1,i} and D_{2,i} can be the normalized value represented as
  • the modified object generating unit 410 is able to generate a modified object signal using the amended downmix gain information normalized as Formula 6. This can be represented as Formula 7. [Formula 7]
  • modified object signals S_{1,i} and S_{2,i} included in a stereo downmix signal can be derived by Formula 8. [Formula 8]
  • S_{1,i} and S_{2,i} may be the modified object signals included in left
  • the modified object level information generating unit 420 generates modified object level information (MOLD) by estimating the energy level of the modified object signal.
  • the object gain difference information generating unit 430 is further included.
  • the object gain difference information generating unit 430 derives object gain difference information (DCLD) not using the amended downmix gain information D_{1,i} and D_{2,i} used to generate a modified object
  • the derived object gain difference information is represented as Formula 9, by which examples of the present invention are non-limited. Moreover, it is able to derive object gain difference information (DCLD) using the amended downmix gains D_{1,i} and D_{2,i} as well. [Formula 9]
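Formula 9 is missing from this text. As a rough illustration of a per-object gain difference between the two downmix channels, the sketch below assumes a logarithmic ratio in decibels, a common convention for level differences. The source only says that the value indicates a ratio, so both the dB form and the function name `gain_difference_db` are assumptions.

```python
import math

def gain_difference_db(d_left, d_right):
    """Level difference, in dB, between the gains with which one object
    enters the left and right downmix channels (assumed form)."""
    if d_left <= 0 or d_right <= 0:
        raise ValueError("both gains must be positive for a log ratio")
    return 20.0 * math.log10(d_left / d_right)

print(gain_difference_db(1.0, 1.0))   # object equally present in both channels
print(gain_difference_db(10.0, 1.0))  # object dominated by the left channel
```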
  • the modified object level information (MOLD) generated by the modified object level information generating unit 420 and the object gain difference information (DCLD) generated by the object gain difference information generating unit 430 can be transferred in a manner of being included in the object information (OI). Meanwhile, the object information (OI) can carry the downmix gain information
  • the downmix gain information (DMG) can be included in an extension region of a frame of the object information.
  • the downmix gain information (DMG) comprises a downmix gain value applied to the object signal and an inverse downmix gain value, so that either an inverse downmix gain value (inverseDMG) or a downmix gain value is transmitted to a decoder side.
  • FIG. 5 is a flowchart for a signal encoding method in the object information generating unit shown in FIG. 4. Referring to FIG. 5, an object signal and downmix gain information are received [S510]. A modified object signal is then generated by applying the downmix gain information to the object signal according to Formula 7 [S520].
  • modified object level information indicates energy level of the modified object signal generated from applying the downmix gain information to the object signal.
  • the downmix gain difference information indicates a ratio that a modified object signal is included in each channel of a downmix signal.
  • a modified object signal is generated and used, whereby object level information (OLD) is not transferred to a decoder side.
  • FIG. 6 is a schematic diagram of a signal decoding apparatus 600 according to an embodiment of the present invention.
  • a signal decoding apparatus 600 includes a signal receiving unit 610, an information processing unit 620, a downmix signal processing unit 630 and a multi-channel decoding unit 640.
  • the signal receiving unit 610 receives a downmix signal generated from downmixing a plurality of modified object signals, object information (OI) including downmix gain information applied to an object signal and modified object level information indicating an energy level of a modified object signal generated from applying the downmix gain information to the object signal, and rendering information (RI).
  • object information received by the signal receiving unit 610 can be partitioned into a header and a plurality of frames.
  • the header includes configuration information identically applicable to whole frames and each of the frames can include object information related to each frame.
  • the object information is further partitioned into an extension region of the header and an extension region of the frame.
  • the rendering information may include the information for determining an output level and position of a modified object signal included in a downmix signal. Unlike the object information, the rendering information may not be received from an encoder but can be determined using a signal inputted by a user.
  • the information processing unit 620 receives inputs of the object information (OI) and the rendering information (RI) and then generates downmix processing information (DPI) and multi-channel information (MI).
  • the downmix processing information allows object panning and object level control of the downmix signal and the multi-channel information is used to generate multi-channel audio signal from the downmix signal.
  • the information processing unit 620 includes a modified object level information obtaining unit 621, a downmix gain information obtaining unit 622 and an object information processing unit 623.
  • the modified object level information obtaining unit 621 is able to obtain the modified object level information from the object information, preferably, from a frame of the object information.
  • the modified object level information is generated by applying the downmix gain information to the object signal in an encoder side. If the modified object level information is included in the object information, the object information may not include object level information indicating an energy level of the object signal.
  • a downmix gain can be included in an extension region of a frame of the object information [not shown in the drawing]. If the downmix signal is a stereo signal, downmix gain difference information (DCLD) can be further included in the frame of the object information.
  • the downmix gain information obtaining unit 622 is able to obtain the downmix gain information (DMG) from the object information, preferably, from an extension region of a frame of the object information.
  • Whether the object information (OI) includes the downmix gain (DMG) can be determined based on downmix gain flag information (bsTransmitDMG). Details of this process will be explained with reference to FIG. 9 later.
  • the information processing unit 620 calculates object level information (OLD) by using obtained downmix gain information. Details of this process will be explained with reference to FIG. 7 later.
  • the object information processing unit 623 receives the modified object level information and rendering information, and then generates the downmix processing information and the multi-channel information.
  • the downmix processing information controls object level included in the downmix signal. And, if the downmix signal is a stereo signal, the downmix processing information is used to move a signal on one of two channels of the stereo signal to the other channel (this is called 'panning' in the following description).
  • the object information processing unit 623 generates multi-channel information (MI) for upmixing a downmix signal into a multi-channel signal using the object information (OI) further including inter-object cross correlation (IOC) information as well as the modified object level information (MOLD) and the downmix gain difference information (DCLD).
  • the object information processing unit 623 does not use downmix gain information because the modified object level information includes an attribute of the downmix gain information.
  • the multi-channel information can include inter-channel level difference information (CLD) indicating a level difference between two channel signals, inter-channel correlation (ICC) information indicating correlation between two channel signals and channel prediction coefficient (CPC) information used to generate at least three channels from two channel signals.
  • the downmix signal processing unit 630 receives inputs of the downmix signal (DMX) and the downmix processing information (DPI) and is then able to generate an output signal.
  • the downmix processing information (DPI) is applied to the downmix signal to pan an object signal (panning) and to control an object level.
  • the multi-channel decoding unit 640 is able to decode an audio signal having a multi-channel by applying the multi-channel information (MI) to the output signal outputted from the downmix signal processing unit 630.
  • the multi-channel decoding unit 640 generates an audio signal that has at least three channels.
  • FIG. 7 is a schematic diagram of an information processing unit 700 according to another embodiment of the present invention.
  • the information processing unit 700 includes a downmix gain information obtaining unit 710, a modified object level information obtaining unit 720, an object level information calculating unit 730 and an object information processing unit 740.
  • Detailed compositions and functions of the downmix gain information obtaining unit 710, the modified object level information obtaining unit 720 and the object information processing unit 740 are the same as in the former description with reference to FIGs. 2 to 6 and Formulas 1 to 8 and are omitted in the following description.
  • the object level information calculating unit 730 calculates object level information(OLD) indicating energy level of original object signal and then outputs the calculated object level information, when the downmix gain information obtaining unit 710 obtains the downmix gain information(DMG).
  • the object level information (OLD) is not included in the object information, thereby not being transmitted from an encoder side.
  • the object level information is calculated with the downmix gain information and the modified object level information and is used to reconstruct the original object signal which is input signal inputted into an encoder.
  • an audio signal processing method and apparatus is able to calculate object level information that is not received from an encoder, using the obtained modified object level information and the obtained downmix gain information. Since the previously received information (downmix gain information, modified object level information) is used, an audio signal processing method and apparatus according to another embodiment of the present invention is able to reduce a transmission bit size.
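The calculation performed by the object level information calculating unit 730 can be sketched as below. This assumes that levels are energies and that the downmix gain is a linear scalar, so that MOLD = DMG² · OLD; the source does not give the exact formula, and the function name `object_level_from_modified` is hypothetical.

```python
def object_level_from_modified(mold, dmg):
    """Recover OLD from MOLD, assuming MOLD = dmg**2 * OLD
    (energy-domain level, linear downmix gain)."""
    if dmg == 0:
        # A zero gain removes the object from the downmix entirely,
        # so its original level cannot be recovered.
        raise ValueError("downmix gain must be non-zero")
    return mold / (dmg ** 2)

# Hypothetical values: a modified level of 126.5 produced with a gain of 0.5.
old = object_level_from_modified(mold=126.5, dmg=0.5)
print(old)
```

If an inverse downmix gain value were transmitted instead, the same result would follow from multiplying MOLD by the squared inverse gain, under the same assumptions.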
  • FIG. 8 shows a structure of syntax according to one embodiment of the present invention. Referring to FIG. 8, a downmix gain (DMG) is not included in a frame (SAOCFrame()) of the object information but is included in an extension region (SAOCExtensionFrameData(x)) of the frame.
  • FIG. 9 shows a structure of syntax according to another embodiment of the present invention.
  • downmix gain flag information indicating whether a downmix gain is included in object information can be included in a header (SAOCSpecificConfig()) of the object information.
  • the position of the downmix gain flag information is not limited to that shown in FIG. 9; it can be located at any position in the header syntax.
  • the meaning of the downmix gain flag information is shown in Table 1. [Table 1]
  • a downmix gain is included in object information.
  • the downmix gain can be included in each frame or in an extension region of a frame of object information; its position is not limited to that shown in FIG. 9, and it can be located at any position in the frame syntax. In case that the frame of the object information has an extension region, the downmix gain can be included in the extension region (SAOCExtensionFrame()).
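The role of the header flag can be illustrated with a small in-memory model. This is not a bitstream parser: the classes `Header` and `Frame`, their fields, and the function `read_object_parameters` are hypothetical stand-ins for the already-parsed header, frame and frame extension region, with `bs_transmit_dmg` mirroring bsTransmitDMG.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class Header:
    bs_transmit_dmg: bool  # mirrors bsTransmitDMG in the header

@dataclass
class Frame:
    mold: List[float]                           # carried in the frame itself
    extension_dmg: Optional[List[float]] = None  # carried in the extension region

def read_object_parameters(header: Header,
                           frame: Frame) -> Tuple[List[float], Optional[List[float]]]:
    """Return (MOLD, DMG); DMG is None when the flag says it is not sent."""
    if not header.bs_transmit_dmg:
        return frame.mold, None
    if frame.extension_dmg is None:
        raise ValueError("flag announces DMG but the extension region has none")
    return frame.mold, frame.extension_dmg

mold, dmg = read_object_parameters(Header(True), Frame([1.0, 4.0], [0.5, 2.0]))
print(mold, dmg)
```

A decoder that only renders the downmix can ignore the returned DMG; one that wants the original object levels uses it together with MOLD.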
  • a multi-channel audio signal is decoded by means of using modified object level information (MOLD) and an object signal is obtained by means of calculating object level information (OLD) by using the modified object level information (MOLD) and downmix gain information (DMG), according to the downmix gain flag information (bsTransmitDMG). Therefore, the present invention enables a signal to be efficiently processed according to an environment and use purpose.
  • downmix gain information can be included in each frame. If a method of generating a downmix signal is uniform within a predetermined time, a single file, a single song or the like, the downmix gain information can be constant within a same interval. In this case, including downmix gain information in a frame at a timing point of applying modified downmix gain information will be more effective than including downmix gain information in each frame. Therefore, a signal processing method/apparatus according to a further embodiment of the present invention can further include downmix gain update flag information (DMG_update_header_flag) indicating whether downmix gain information is updated per frame.
  • the downmix gain update flag information (DMG_update_header_flag) indicates how the downmix gain information is included in the corresponding frame and can be included in header of object information. Moreover, the meaning of the downmix gain update flag information can be represented as Table 2, by which examples of the present invention are non-limited. [Table 2]
  • If the downmix gain update header flag is set to 0, the downmix gain information has the same value in every frame and is not updated. Hence, the downmix gain information (DMG) can be included in the header rather than in a frame.
  • If the downmix gain update header flag is set to 1, downmix gain frame flag information (DMG_update_frame_flag) indicates whether the downmix gain information is updated per frame. If the downmix gain frame flag information indicates that the corresponding frame includes updated downmix gain information, the object information can include the updated downmix gain information.
  • If the downmix gain update header flag is set to 2, downmix gain information updated at a predetermined frame period set in the header can be included.
  • If the downmix gain update header flag is set to 3, downmix gain information updated in every frame is included and can be used.
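The four flag values of Table 2 can be illustrated with a small sketch that works out which downmix gain value is in effect for each frame. The function name `resolve_dmg`, the dictionary keys `update` and `dmg`, and the `period` parameter are hypothetical names chosen for illustration; the actual bitstream syntax is not given in this passage.

```python
def resolve_dmg(header_flag, header_dmg, frames, period=1):
    """Return the downmix gain in effect for each frame.

    frames is a list of dicts; 'update' stands in for DMG_update_frame_flag
    and 'dmg' for a downmix gain carried in that frame (both hypothetical).
    """
    current = header_dmg
    result = []
    for index, frame in enumerate(frames):
        if header_flag == 0:
            pass  # constant gain carried once in the header
        elif header_flag == 1:
            if frame.get("update"):  # per-frame flag signals an update
                current = frame["dmg"]
        elif header_flag == 2:
            if index % period == 0:  # update at the period set in the header
                current = frame["dmg"]
        elif header_flag == 3:
            current = frame["dmg"]  # every frame carries its own gain
        else:
            raise ValueError("header_flag must be 0, 1, 2 or 3")
        result.append(current)
    return result
```

For example, with `header_flag=1` only the frames whose `update` entry is set need to carry a gain, and every other frame reuses the most recent value.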
  • the term downmix gain information should be understood to cover not only a value of the downmix gain itself but also related information used to code it efficiently. For instance, when a differential coding scheme is used, a differential value, an interpolation value, an indicator for generating a previous value or the like can be included. If the downmix gain information is the same in every frame, it can be included in the header without including the downmix gain update flag information in the header of the bitstream.
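As an illustration of the differential coding case mentioned above, the following sketch rebuilds a sequence of downmix gain values from one initial value and a list of transmitted differences. It assumes the values are quantized integer indices; the function name `decode_differential` is hypothetical, and the source does not specify the actual coding scheme.

```python
from itertools import accumulate


def decode_differential(first_value, deltas):
    """Rebuild absolute values from an initial value and successive differences."""
    # accumulate() keeps a running sum, starting from first_value.
    return list(accumulate(deltas, initial=first_value))
```

When the gain changes slowly, the differences are small and can typically be coded with fewer bits than the absolute values.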
  • FIG. 10 is a diagram for a configuration of a product including a DMG obtaining unit 1041, a MOLD obtaining unit 1042, an OLD calculating unit 1043 and an object information processing unit 1044 according to one embodiment of the present invention.
  • FIG. 11 is a diagram for relations between products including a DMG obtaining unit 1041, a MOLD obtaining unit 1042, an OLD calculating unit 1043 and an object information processing unit 1044 according to one embodiment of the present invention.
  • a wire/wireless communication unit 1010 receives a bitstream by wire/wireless communications.
  • the wire/wireless communication unit 1010 includes at least one of a wire communication unit 1011, an infrared communication unit 1012, a Bluetooth unit 1013 and a wireless LAN communication unit 1014.
  • a user authenticating unit 1020 receives an input of user information and then performs user authentication.
  • the user authenticating unit 1020 can include at least one of a fingerprint recognizing unit 1021, an iris recognizing unit 1022, a face recognizing unit 1023 and a voice recognizing unit 1024.
  • the user authentication can be performed in a manner of receiving an input of fingerprint information, iris information, face contour information or voice information, converting the inputted information to user information, and then determining whether the user information matches registered user data.
  • An input unit 1030 is an input device that enables a user to input various kinds of commands. The input unit 1030 can include at least one of a keypad unit 1031, a touchpad unit 1032 and a remote controller unit 1033, to which the input unit 1030 is not limited.
  • a signal decoding unit 1040 includes a DMG obtaining unit 1041, a MOLD obtaining unit 1042, an OLD calculating unit 1043 and an object information processing unit 1044, which have the same configurations and functions as the units of the same names in FIGs. 6-7, respectively. Details of the signal decoding unit 1040 are therefore omitted in the following description.
  • a control unit 1050 receives input signals from the input devices and controls all processes of the signal decoding unit 1040 and an output unit 1060. As mentioned in the foregoing description, if such a user input as 'on/off' of a phase shift of an output signal, an input/output of metadata, on/off operation of a signal decoding unit and the like is inputted to the control unit 1050 from the input unit 1030, the control unit 1050 decodes a signal using the user input.
  • the output unit 1060 is an element for outputting an output signal and the like generated by the signal decoding unit 1040.
  • the output unit 1060 can include a signal output unit 1061 and a display unit 1062. If an output signal is an audio signal, it is outputted via the signal output unit 1061. If an output signal is a video signal, it is outputted via the display unit 1062. Moreover, if metadata is inputted to the input unit 1030, it is displayed on a screen via the display unit 1062.
  • FIG. 11 shows the relations between terminals, or between a terminal and a server, each of which corresponds to the product shown in FIG. 10.
  • bidirectional communications of data or bitstream can be performed between a first terminal 1110 and a second terminal 1120 via wire/wireless communication units.
  • the data or bitstream exchanged via the wire/wireless communication unit may include the former data including the modified object level information, downmix gain information, downmix gain flag information and the like of the present invention described with reference to FIGs. 1 to 9.
  • wire/wireless communications can be performed between a server 1130 and a first terminal 1140.
  • FIG. 12 is a schematic block diagram of a broadcast signal decoding apparatus 1200 including a DMG obtaining unit 1041, a MOLD obtaining unit 1042, an OLD calculating unit 1043 and an object information processing unit 1044 according to another embodiment of the present invention.
  • a demultiplexer 1220 receives a plurality of data related to a TV broadcast from a tuner 1210. The received data are separated by the demultiplexer 1220 and are then decoded by a data decoder 1230. Meanwhile, the data separated by the demultiplexer 1220 can be stored in a storage medium 1250 such as an HDD.
  • the data separated by the demultiplexer 1220 are inputted to a signal decoding unit 1240.
  • the signal decoding unit 1240 includes a DMG obtaining unit 1041, a MOLD obtaining unit 1042, an OLD calculating unit 1043 and an object information processing unit 1044 according to one embodiment of the present invention. They have the same configurations and functions as the units of the same names shown in FIGs. 6-7, and their details are omitted in the following description.
  • the signal decoding unit 1240 decodes a multi-channel audio signal and object signal using the received downmix gain information and modified object level information. If a video signal is inputted, the signal decoding unit 1240 decodes and outputs the video signal. If metadata is generated, the signal decoding unit 1240 outputs the metadata in a text type.
  • an output unit 1270 displays the outputted metadata.
  • the output unit 1270 includes a speaker unit (not shown in the drawing) and outputs a plural channel signal, which is decoded using the inter-channel phase difference information, via the speaker unit included in the output unit 1270.
  • the data decoded by the signal decoding unit 1240 can be stored in a storage medium 1250 such as an HDD.
  • the signal decoding apparatus 1200 can further include an application manager 1260 capable of controlling a plurality of data received according to an input of information from a user.
  • the application manager 1260 includes a user interface manager 1261 and a service manager 1262.
  • the user interface manager 1261 controls an interface for receiving an input of information from a user. For instance, the user interface manager 1261 is able to control a font type of text displayed on the output unit 1270, a screen brightness, a menu configuration and the like.
  • the service manager 1262 is able to control a received broadcast signal using information inputted by a user.
  • the service manager 1262 is able to provide a broadcast channel setting, an alarm function setting, an adult authentication function, etc.
  • the data outputted from the application manager 1260 are usable by being transferred to the output unit 1270 as well as the signal decoding unit 1240.
  • the present invention uses modified object level information on which a downmix gain is reflected and transmits the downmix gain by including it in an extension region of a frame of object information. The present invention is thereby able to process a signal efficiently according to the capability and purpose of use of the audio signal processing apparatus.
  • the decoding/encoding method to which the present invention is applied can be implemented as computer-readable code on a program-recorded medium.
  • multimedia data having the data structure of the present invention can be stored in the computer-readable recording medium.
  • the computer-readable recording media include all kinds of storage devices in which data readable by a computer system are stored.
  • the computer-readable media include ROM, RAM, CD-ROM, magnetic tapes, floppy discs, optical data storage devices, and the like for example and also include carrier-wave type implementations (e.g., transmission via Internet).
  • a bitstream generated by the encoding method is stored in a computer-readable recording medium or can be transmitted via wire/wireless communication network.
  • the present invention is applicable to encoding and decoding of an audio signal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)

Abstract

A method of decoding an audio signal comprises receiving (a) a downmix signal generated by downmixing a plurality of modified object signals, the modified object signals being generated by applying downmix gain information to an object signal, (b) object information including the downmix gain information applied to the object signal and modified object level information indicating an energy level of the modified object signal, and (c) rendering information; obtaining the modified object level information from a frame of the object information and the downmix gain information from an extension region of the object information; generating downmix processing information using the modified object level information and the rendering information, the downmix processing information enabling object panning and object level control to be applied to the downmix signal; generating an output signal by modifying the downmix signal with the downmix processing information; and generating object level information by applying the downmix gain information to the modified object level information, in order to obtain the object signal.
PCT/KR2009/005748 2008-10-08 2009-10-08 Method and apparatus for processing a signal WO2010041877A2 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US10391908P 2008-10-08 2008-10-08
US61/103,919 2008-10-08
US10501608P 2008-10-13 2008-10-13
US61/105,016 2008-10-13

Publications (2)

Publication Number Publication Date
WO2010041877A2 (fr) 2010-04-15
WO2010041877A3 (fr) 2010-07-22

Family

ID=42101090

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2009/005748 WO2010041877A2 (fr) 2008-10-08 2009-10-08 Method and apparatus for processing a signal

Country Status (1)

Country Link
WO (1) WO2010041877A2 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080205670A1 (en) * 2006-12-07 2008-08-28 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal
KR20080089308A (ko) * 2007-03-30 2008-10-06 한국전자통신연구원 Apparatus and method for encoding and decoding a multi-object audio signal composed of multiple channels

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BREEBAART ET AL.: 'MPEG Spatial Audio Coding / MPEG Surround: Overview and Current Status' PROC. 119TH AES CONVENTION. October 2005, NEW YORK, *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
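CN107731239A (zh) * 2013-04-03 2018-02-23 杜比实验室特许公司 Methods and systems for generating and interactively rendering object-based audio
CN109410964A (zh) * 2013-05-24 2019-03-01 杜比国际公司 Efficient coding of audio scenes comprising audio objects
CN109410964B (zh) * 2013-05-24 2023-04-14 杜比国际公司 Efficient coding of audio scenes comprising audio objects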
US11705139B2 (en) 2013-05-24 2023-07-18 Dolby International Ab Efficient coding of audio scenes comprising audio objects

Also Published As

Publication number Publication date
WO2010041877A3 (fr) 2010-07-22

Similar Documents

Publication Publication Date Title
US8060042B2 (en) Method and an apparatus for processing an audio signal
RU2643644C2 (ru) Encoding and decoding of audio signals
US8615316B2 (en) Method and an apparatus for processing an audio signal
US8258849B2 (en) Method and an apparatus for processing a signal
EP2083584B1 (fr) Method and apparatus for processing an audio signal
EP2169666B1 (fr) Method and apparatus for processing a signal
JP5319704B2 (ja) Method and apparatus for processing an audio signal
KR101108060B1 (ko) Signal processing method and apparatus thereof
CN110024421B (zh) Method and apparatus for adaptive control of decorrelation filters
WO2010005224A2 (fr) Method and apparatus for processing an audio signal
US8346380B2 (en) Method and an apparatus for processing a signal
WO2010041877A2 (fr) Method and apparatus for processing a signal
EP2227804B1 (fr) Method and apparatus for processing a signal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09819384

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09819384

Country of ref document: EP

Kind code of ref document: A2