JP2007310087A

JP2007310087A - Voice encoding apparatus and voice decoding apparatus

Info

Publication number: JP2007310087A
Application number: JP2006137997A
Authority: JP
Inventors: Masatoshi Tsukiyama; 雅敏槻山; Takashi Furuhata; 貴司古畑
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2006-05-17
Filing date: 2006-05-17
Publication date: 2007-11-29

Abstract

<P>PROBLEM TO BE SOLVED: To provide a voice encoding apparatus enabling a voice decoding apparatus to output a voice signal according to an order desired by the voice encoding apparatus side, without needing notifying and changing operation to the voice decoding apparatus. <P>SOLUTION: An encoding parameter conversion section 102 arbitrarily determines correspondence relation of input digital voice signals 107 to 109, and a first input terminal 104 to a third input terminal 106. On the basis of input channel information 111, encoding parameters 112 to 115 are output. A voice encoding section 101 encodes digital voice signals 107 to 109 which are input to the first input terminal 104 to the third input terminal 106, while attaching the encoding parameters 112 to 115, and outputs them as encoded transmission data 110. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

この発明は、放送局間や、中継車と放送局との間などで音声を伝送する場合に用いられ、音声符号化方式としてＭＰＥＧ−２ＡＡＣ（Advanced Audio Coding）(ISO/IEC 13818-7)を使用して伝送する音声符号化装置及び音声復号装置に関するものである。 The present invention is used when audio is transmitted between broadcasting stations or between a relay vehicle and a broadcasting station, and MPEG-2 AAC (Advanced Audio Coding) (ISO / IEC 13818-7) is used as an audio encoding method. The present invention relates to a speech encoding device and a speech decoding device that transmit data using.

音声符号化方式としてＭＰＥＧ−２ＡＡＣを使用して伝送する場合において、音声符号化装置に音声信号を構成するＬ(left)信号、Ｒ(Right)信号、Ｃ（Center）信号、ＬＦＥ（Low Frequency Effect）信号、ＬＳ（Left Surround）信号、ＲＳ(Right Surround)信号が入力される順序、及び音声復号装置からＬ信号、Ｒ信号、Ｃ信号、ＬＦＥ信号、ＬＳ信号、ＲＳ信号が出力される順序については、ＭＰＥＧ−２ＡＡＣを定めた国際規格（ISO/IEC13818-7:1997）に明確な規定がなく（例えば、非特許文献１参照）、音声符号化装置及び音声復号装置製造者もしくは使用者の慣習として順序を決めていた。 When transmission is performed using MPEG-2 AAC as the audio encoding method, the L (left) signal, R (Right) signal, C (Center) signal, and LFE (Low Frequency) constituting the audio signal are transmitted to the audio encoding device. Effect) signal, LS (Left Surround) signal, RS (Right Surround) signal input order, and the order in which the L signal, R signal, C signal, LFE signal, LS signal, and RS signal are output from the speech decoding apparatus. Is not clearly defined in the international standard (ISO / IEC13818-7: 1997) that defines MPEG-2 AAC (see, for example, Non-Patent Document 1), and the manufacturer or user of a speech encoding device and speech decoding device The order was decided as a custom.

ＩＳＯ／ＩＥＣ１３８１８−７−１９９７ISO / IEC 13818-7-1997

従来のＭＰＥＧ−２ＡＡＣ方式においては、符号化する際の情報として符号化したデータ（エレメント）の種別を識別するための識別子、例えばＳＣＥ(single_channel_element)、ＣＰＥ(channel_pair_element)、ＬＣＥ(lfe_channel_element)が用意されており、例えばＬ信号、Ｒ信号をchannel_pair_elementとして符号化した場合は識別子ＣＰＥを付加、ＬＳ信号、ＲＳ信号をchannel_pair_elementとして符号化した場合は識別子ＣＰＥを付加、Ｃ信号をsingle_channel_elementとして符号化した場合はＳＣＥを付加、ＬＦＥ信号をlfe_channel_elementとして符号化した場合は識別子ＬＦＥを付加することができる。 In the conventional MPEG-2 AAC system, identifiers for identifying the type of encoded data (element), such as SCE (single_channel_element), CPE (channel_pair_element), and LCE (lfe_channel_element), are prepared as information for encoding. For example, when the L signal and the R signal are encoded as channel_pair_element, the identifier CPE is added. When the LS signal and the RS signal are encoded as the channel_pair_element, the identifier CPE is added, and when the C signal is encoded as the single_channel_element. Can add an identifier LFE when an SCE is added and an LFE signal is encoded as an lfe_channel_element.

また、ＭＰＥＧ−２ＡＡＣ方式で符号化を行う音声符号化装置にはデジタル音声信号を２チャンネルずつ入力するための入力端子が具備され、従来では、どの入力端子にどのデジタル音声信号を入力するかは音声符号化装置製造者が慣習を元に予め決定し、音声符号化装置使用者に周知して使用していた。そして、音声符号化装置使用者は音声符号化装置製造者より予め決められた入力端子に決められた音声信号を入力し、音声符号化装置内にて符号化を行って音声復号装置に伝送していた。 In addition, an audio encoding device that performs encoding by the MPEG-2 AAC system has an input terminal for inputting digital audio signals by two channels at a time. Conventionally, which digital audio signal is input to which input terminal. Has been determined in advance by a speech encoding device manufacturer based on customs, and is well known to the speech encoding device user. Then, the speech encoding device user inputs the speech signal determined by the speech encoding device manufacturer to a predetermined input terminal, encodes the speech encoding device, and transmits it to the speech decoding device. It was.

このような音声符号化装置より伝送された音声符号化信号は、従来のＭＰＥＧ−２ＡＡＣ方式の音声復号装置にて上述した識別手段を用いて、Ｌ信号、Ｒ信号、ＬＳ信号、ＲＳ信号、Ｃ信号、ＬＦＥ信号として復号され、音声復号装置製造者が予め定めた、音声復号装置に具備された復号した音声信号を出力するための出力端子からデジタル音声信号を出力していた。 A speech encoded signal transmitted from such a speech encoding device is used to identify an L signal, an R signal, an LS signal, an RS signal, and the like using the above-described identification unit in the conventional MPEG-2 AAC speech decoding device. A digital audio signal is output from an output terminal for outputting a decoded audio signal provided in the audio decoding device, which is decoded as a C signal and an LFE signal and is predetermined by the audio decoding device manufacturer.

従来のＭＰＥＧ−２ＡＡＣ音声符号化装置及び音声復号装置は以上のように構成されているため、音声符号化装置使用者が音声符号化装置に具備する音声入力端子に入力するデジタル音声信号を、例えば、２番目の入力端子に入力していたＣ信号、ＬＦＥ信号を３番目の入力端子に変更し、３番目の入力端子に入力していたＬＳ信号、ＲＳ信号を２番目の入力端子に変更した場合、変更した内容を符号化音声信号の伝送とは別の手段にて音声復号装置使用者に通知し、これにより音声復号装置使用者が出力端子を変更する必要があった。 Since the conventional MPEG-2 AAC audio encoding device and audio decoding device are configured as described above, a digital audio signal input to an audio input terminal provided to the audio encoding device by a user of the audio encoding device, For example, the C signal and LFE signal that were input to the second input terminal are changed to the third input terminal, and the LS signal and RS signal that are input to the third input terminal are changed to the second input terminal. In this case, the changed content is notified to the speech decoding apparatus user by means different from the transmission of the encoded speech signal, and the speech decoding apparatus user needs to change the output terminal.

仮に変更した内容を通知しなかった場合は、音声復号装置製造者が予め定めた通りに、例えば２番目の出力端子からＣ信号、ＬＦＥ信号が出力され、３番目の出力端子からＬＳ信号、ＲＳ信号が出力され、音声符号化装置への入力の順序と音声復号装置からの出力の順序が異なってしまうという事態が発生する。 If the changed content is not notified, for example, the C signal and the LFE signal are output from the second output terminal, and the LS signal and RS are output from the third output terminal as predetermined by the speech decoding device manufacturer. A signal is output, and the order of input to the speech coding apparatus and the order of output from the speech decoding apparatus are different.

従来の音声復号装置は音声符号化装置から伝送される符号化信号に基づいて音声復号装置使用者の操作を必要とせず復号を行うことを特徴としているが、上記例のように、音声符号化装置で信号と入力端子との関係を変更した場合は、音声復号装置に対する操作が必要となり、通知や変更作業等が必要となる、といった問題点があった。 A conventional speech decoding apparatus is characterized in that decoding is performed without requiring the user's operation based on a coded signal transmitted from the speech encoding apparatus. However, as in the above example, speech encoding is performed. When the relationship between the signal and the input terminal is changed by the apparatus, there is a problem that an operation on the speech decoding apparatus is required, and notification, change work, and the like are required.

この発明は上記のような課題を解決するためになされたもので、音声復号装置への通知や変更作業を必要とせず、音声符号化装置側で所望する順序に従って音声復号装置で音声信号を出力することのできる音声符号化装置及び音声復号装置を得ることを目的とする。 The present invention has been made to solve the above-described problems, and does not require notification or change work to the speech decoding device, and outputs speech signals in the speech decoding device according to a desired order on the speech coding device side. An object of the present invention is to obtain a speech encoding device and speech decoding device capable of performing the above.

この発明に係る音声符号化装置は、音声信号と入力端子との対応関係を任意に決定する入力チャンネル情報に基づいて音声信号に対応した符号化パラメータを決定する符号化パラメータ変換部と、符号化パラメータ変換部から出力された符号化パラメータを音声符号化信号に付与して符号化伝送データとして出力する音声符号化部とを備えたものである。 A speech coding apparatus according to the present invention includes a coding parameter conversion unit that determines a coding parameter corresponding to a speech signal based on input channel information that arbitrarily determines a correspondence relationship between the speech signal and an input terminal, and a coding A speech encoding unit that adds the encoding parameter output from the parameter converting unit to the speech encoded signal and outputs the encoded transmission data.

この発明の音声符号化装置は、音声信号と入力端子との対応関係を示す符号化パラメータを音声符号化信号に付与するようにしたので、音声復号装置への通知や変更作業を必要とせず、音声符号化装置側で所望する順序に従って音声復号装置で音声信号を出力することができる。 Since the speech encoding device of the present invention is configured to add the encoding parameter indicating the correspondence between the speech signal and the input terminal to the speech encoded signal, it does not require notification or change work to the speech decoding device, The speech signal can be output by the speech decoding device according to a desired order on the speech encoding device side.

実施の形態１．
図１は、この発明の実施の形態１による音声符号化装置及び音声復号装置を示す構成図である。
図において、音声符号化装置１００は、音声符号化部１０１、符号化パラメータ変換部１０２、入力チャンネル情報入力部１０３、第１番入力端子１０４、第２番入力端子１０５、第３番入力端子１０６を備えている。また、音声復号装置２００は、音声復号部２０１、出力チャンネル判定部２０２、出力チャンネル選択部２０３、第１番出力端子２０４、第２番出力端子２０５、第３番出力端子２０６を備えている。更に、音声符号化装置１００と音声復号装置２００とは伝送路３００を介して接続されている。 Embodiment 1 FIG.
1 is a block diagram showing a speech encoding apparatus and speech decoding apparatus according to Embodiment 1 of the present invention.
In the figure, a speech coding apparatus 100 includes a speech coding unit 101, a coding parameter conversion unit 102, an input channel information input unit 103, a first input terminal 104, a second input terminal 105, and a third input terminal 106. It has. The speech decoding apparatus 200 also includes a speech decoding unit 201, an output channel determination unit 202, an output channel selection unit 203, a first output terminal 204, a second output terminal 205, and a third output terminal 206. Furthermore, speech encoding apparatus 100 and speech decoding apparatus 200 are connected via transmission line 300.

音声符号化部１０１は、第１番入力端子１０４から入力されたデジタル音声信号１０７と、第２番入力端子１０５から入力されたデジタル音声信号１０８と、第３番入力端子１０６から入力されたデジタル音声信号１０９を、符号化パラメータ変換部１０２からの符号化パラメータに基づき、ＭＰＥＧ−２ＡＡＣ符号化方式に則って符号化を行い、符号化伝送データ１１０として出力する機能を有している。
符号化パラメータ変換部１０２は、入力チャンネル情報入力部１０３から出力される入力チャンネル情報１１１に基づいて符号化パラメータ１１２〜１１５を生成し、これら符号化パラメータ１１２〜１１５を音声符号化部１０１に出力する機能部である。 The speech encoding unit 101 includes a digital speech signal 107 input from the first input terminal 104, a digital speech signal 108 input from the second input terminal 105, and a digital input from the third input terminal 106. The audio signal 109 is encoded based on the encoding parameter from the encoding parameter conversion unit 102 in accordance with the MPEG-2 AAC encoding method and output as encoded transmission data 110.
The encoding parameter conversion unit 102 generates encoding parameters 112 to 115 based on the input channel information 111 output from the input channel information input unit 103, and outputs these encoding parameters 112 to 115 to the speech encoding unit 101. It is a functional part to do.

ここで、符号化パラメータ１１２は、音声符号化部１０１で使用する第１のペアチャンネルエレメントに付加される符号化パラメータで、第１番入力端子１０４の入力端子情報を表している。また、符号化パラメータ１１３は、音声符号化部１０１で使用する単一チャンネルエレメントに付加される符号化パラメータで、第２番入力端子１０５の入力端子情報を表している。更に、符号化パラメータ１１４は、音声符号化部１０１で使用する第２のペアチャンネルエレメントに付加される符号化パラメータで、第３番入力端子１０６の入力端子情報を表している。また、符号化パラメータ１１５は、音声符号化部１０１で使用するＬＦＥチャンネルエレメントに付加される符号化パラメータであり、第２番入力端子１０５の入力端子情報を表している。 Here, the encoding parameter 112 is an encoding parameter added to the first pair channel element used in the speech encoding unit 101 and represents the input terminal information of the first input terminal 104. An encoding parameter 113 is an encoding parameter added to a single channel element used in the speech encoding unit 101 and represents input terminal information of the second input terminal 105. Furthermore, the encoding parameter 114 is an encoding parameter added to the second pair channel element used in the speech encoding unit 101 and represents input terminal information of the third input terminal 106. An encoding parameter 115 is an encoding parameter added to the LFE channel element used in the speech encoding unit 101 and represents input terminal information of the second input terminal 105.

入力チャンネル情報入力部１０３は、デジタル音声信号１０７〜デジタル音声信号１０９が、第１番入力端子１０４〜第３番入力端子１０６のいずれかに入力したかという情報を、音声符号化装置１００の使用者が入力するための機能部であり、これらの入力情報が入力チャンネル情報１１１として符号化パラメータ変換部１０２に出力するよう構成されている。 The input channel information input unit 103 uses information indicating whether the digital audio signal 107 to the digital audio signal 109 are input to any of the first input terminal 104 to the third input terminal 106 to use the audio encoding device 100. The input unit is configured to output the input information as input channel information 111 to the encoding parameter conversion unit 102.

また、実施の形態１では、デジタル音声信号１０７は、ＭＰＥＧ−２ＡＡＣによる音声符号化を行う対象となるデジタル音声信号のＬ(left)信号及びＲ(Right)信号、デジタル音声信号１０８は、デジタル音声信号のＣ（Center）信号、及びＬＦＥ（Low Frequency Effect）信号、デジタル音声信号１０９は、デジタル音声信号のＬＳ（Left Surround）信号、ＲＳ(Right Surround)信号となるよう設定されている。 In the first embodiment, the digital audio signal 107 is an L (left) signal and an R (Right) signal of a digital audio signal to be subjected to audio encoding by MPEG-2 AAC, and the digital audio signal 108 is digital. The C (Center) signal of the audio signal, the LFE (Low Frequency Effect) signal, and the digital audio signal 109 are set to be an LS (Left Surround) signal and an RS (Right Surround) signal of the digital audio signal.

次に、音声復号装置２００の構成について説明する。
音声復号装置２００における音声復号部２０１は、伝送路３００を介して入力された符号化伝送データ１１０を、ＭＰＥＧ−２ＡＡＣ方式に則って復号し、デジタル音声信号２０７，２０８，２０９として出力する機能を有し、また、音声符号化データに含まれる符号化パラメータを分離する機能を有している。図中、音声復号部２０１から出力される符号化パラメータ１１２〜１１５は、これらの符号化パラメータである。
出力チャンネル判定部２０２は、音声符号化装置１００の符号化パラメータ変換部１０２における符号化パラメータ１１２〜１１５と入力端子との対応関係を示す情報を予め有し、音声復号部２０１から送出された符号化パラメータ１１２〜１１５に基づいて、デジタル音声信号２０７〜２０９を、第１番出力端子２０４及び第２番出力端子２０５及び第３番出力端子２０６のいずれかから出力させるかを決定し、これを出力チャンネル情報２１０として出力する機能を有している。 Next, the configuration of speech decoding apparatus 200 will be described.
The audio decoding unit 201 in the audio decoding device 200 has a function of decoding the encoded transmission data 110 input via the transmission path 300 in accordance with the MPEG-2 AAC system and outputting the decoded data as digital audio signals 207, 208, and 209. And has a function of separating the encoding parameters included in the speech encoded data. In the figure, encoding parameters 112 to 115 output from the speech decoding unit 201 are these encoding parameters.
The output channel determination unit 202 has information indicating the correspondence relationship between the encoding parameters 112 to 115 and the input terminals in the encoding parameter conversion unit 102 of the audio encoding device 100 in advance, and the code sent from the audio decoding unit 201 The digital audio signals 207 to 209 are determined to be output from the first output terminal 204, the second output terminal 205, or the third output terminal 206 based on the control parameters 112 to 115, It has a function of outputting as output channel information 210.

出力チャンネル選択部２０３は、出力チャンネル判定部２０２から通知される出力チャンネル情報２１０を元に、デジタル音声信号２０７であるＬ信号及びＲ信号及びデジタル音声信号２０８であるＣ信号及びＬＦＥ信号、デジタル音声信号２０９であるＬＳ信号及びＲＳ信号を、第１番出力端子２０４及び第２番出力端子２０５及び第３番出力端子２０６の何れに出力するかを選択する機能を有している。また、それぞれの第１番出力端子２０４〜第３番出力端子２０６からは、デジタル音声信号２１１〜２１３が出力されるよう構成されている。 The output channel selection unit 203 is based on the output channel information 210 notified from the output channel determination unit 202, and the L signal and R signal that are the digital audio signal 207, the C signal and LFE signal that are the digital audio signal 208, and the digital audio signal. The LS signal and the RS signal, which are the signals 209, have a function of selecting which of the first output terminal 204, the second output terminal 205, and the third output terminal 206 is to be output. Also, digital audio signals 211 to 213 are output from the first output terminal 204 to the third output terminal 206, respectively.

図２はＭＰＥＧ−２ＡＡＣ音声符号化方式を定めたISO/IEC13818-7:1997に記載されている音声伝送フレームＡＤＴＳ：Audio Data Transport Stream frameの構成図である。
図示のように、ＭＰＥＧ−２ＡＡＣ音声符号化方式により符号化された符号化データを伝送するための音声伝送フレーム２１は、符号化データブロック２２で構成され、符号化データブロック２２は、データブロック識別子２３、データブロック情報２４、符号化エレメント２５より構成されている。 FIG. 2 is a configuration diagram of an audio transmission frame ADTS (Audio Data Transport Stream frame) described in ISO / IEC 13818-7: 1997 that defines the MPEG-2 AAC audio encoding system.
As shown in the figure, an audio transmission frame 21 for transmitting encoded data encoded by the MPEG-2 AAC audio encoding method is composed of an encoded data block 22, and the encoded data block 22 is a data block. It comprises an identifier 23, data block information 24, and an encoding element 25.

図３はＭＰＥＧ−２ＡＡＣ音声符号化方式を定めたISO/IEC13818-7:1997に記載されているデータブロック識別子２３の説明図である。
ここで、例えば、ＳＣＥ識別子３１は符号化エレメント２５が単一チャンネルの符号化データであることを示す識別子であり、３２は単一チャンネル符号化データを示している。また、ＣＰＥ識別子３３は、符号化エレメント２５がペアチャンネルの符号化データであることを示す識別子であり、３４はペアチャンネル符号化データである。ＬＦＥ識別子３５は、符号化エレメント２５がＬＦＥチャンネルの符号化データであることを示すＬＦＥ識別子であり、３６はＬＦＥ符号化データを示している。尚、３７はデータストリームエレメント識別子、３８はデータストリームエレメントを示しているが、これらデータストリームエレメント識別子３７及びデータストリームエレメント３８については実施の形態２で詳細に説明する。 FIG. 3 is an explanatory diagram of the data block identifier 23 described in ISO / IEC13818-7: 1997 that defines the MPEG-2 AAC audio coding system.
Here, for example, the SCE identifier 31 is an identifier indicating that the encoding element 25 is single channel encoded data, and 32 indicates single channel encoded data. The CPE identifier 33 is an identifier indicating that the encoding element 25 is pair channel encoded data, and 34 is pair channel encoded data. The LFE identifier 35 is an LFE identifier indicating that the encoding element 25 is encoded data of the LFE channel, and 36 indicates the LFE encoded data. Reference numeral 37 denotes a data stream element identifier and 38 denotes a data stream element. The data stream element identifier 37 and the data stream element 38 will be described in detail in the second embodiment.

次に、実施の形態１における動作として、例えばＭＰＥＧ−２ＡＡＣ方式の５．１ｃｈサラウンド方式で符号化する場合の動作について説明する。
５．１ｃｈサラウンド方式は、Ｌ信号、Ｒ信号、Ｃ信号、ＬＦＥ信号、ＬＳ信号、ＲＳ信号で構成され、かつ、デジタル音声信号は物理的な信号線１本につき２つの信号を伝送できる。このため、図１に示すように、第１番入力端子１０４に、Ｌ信号とＲ信号、第２番入力端子１０５にＣ信号とＬＦＥ信号、第３番入力端子１０６にＬＳ信号とＲＳ信号を入力する場合、入力チャンネル情報入力部１０３では、このような入力端子とチャンネルを示す入力チャンネル情報１１１を出力する。 Next, as an operation in the first embodiment, for example, an operation in the case of encoding in the 5.1ch surround scheme of the MPEG-2 AAC scheme will be described.
The 5.1 channel surround system is composed of an L signal, an R signal, a C signal, an LFE signal, an LS signal, and an RS signal, and a digital audio signal can transmit two signals per physical signal line. For this reason, as shown in FIG. 1, the L and R signals are input to the first input terminal 104, the C and LFE signals are input to the second input terminal 105, and the LS and RS signals are input to the third input terminal 106. When inputting, the input channel information input unit 103 outputs the input channel information 111 indicating such input terminals and channels.

即ち、入力チャンネル情報入力部１０３は、第１番入力端子１０４の第１チャンネルがＬ信号、第１番入力端子１０４の第２チャンネルがＲ信号、第２番入力端子１０５の第１チャンネルがＣ信号、第２番入力端子１０５の第２チャンネルがＬＦＥ信号、第３番入力端子１０６の第１チャンネルがＬＳ信号、第３番入力端子１０６の第２チャンネルがＲＳ信号であることを示す入力チャンネル情報１１１を符号化パラメータ変換部１０２に通知する。 That is, in the input channel information input unit 103, the first channel of the first input terminal 104 is an L signal, the second channel of the first input terminal 104 is an R signal, and the first channel of the second input terminal 105 is a C signal. An input channel indicating that the second channel of the second input terminal 105 is an LFE signal, the first channel of the third input terminal 106 is an LS signal, and the second channel of the third input terminal 106 is an RS signal. Information 111 is notified to the encoding parameter conversion unit 102.

符号化パラメータ変換部１０２では、入力チャンネル情報入力部１０３から通知された入力チャンネル情報１１１を基に、第１番入力端子１０４から入力されるデジタル音声信号１０７の符号化パラメータ１１２、第２番入力端子１０５から入力されるデジタル音声信号１０８の符号化パラメータ１１３及び符号化パラメータ１１５、第３番入力端子１０６から入力されるデジタル音声信号１０９の符号化パラメータ１１４を生成し、これら符号化パラメータ１１２〜１１５を音声符号化部１０１に出力する。尚、符号化パラメータの具体的な生成方法については後述する。 In the encoding parameter conversion unit 102, the encoding parameter 112 and the second input of the digital audio signal 107 input from the first input terminal 104 based on the input channel information 111 notified from the input channel information input unit 103. An encoding parameter 113 and an encoding parameter 115 of the digital audio signal 108 input from the terminal 105, and an encoding parameter 114 of the digital audio signal 109 input from the third input terminal 106 are generated. 115 is output to the speech encoding unit 101. A specific method for generating encoding parameters will be described later.

音声符号化部１０１では、入力されたＬ信号、Ｒ信号を第1のペアチャンネルエレメント（ＣＰＥ）として符号化を行い、符号化パラメータ１１２を付加する。また、ＬＳ信号、ＲＳ信号は、第２のペアチャンネルエレメント（ＣＰＥ）として符号化を行い、符号化パラメータ１１４を付加する。更に、Ｃ信号は単一チャンネルエレメント（ＳＣＥ）として符号化を行い、符号化パラメータ１１３を付加し、ＬＦＥ信号はＬＦＥエレメントとして符号化を行い、符号化パラメータ１１５を付加する。そして、音声符号化部１０１ではこれらの符号化を行ったデータを符号化伝送データ１１０として出力する。 The speech encoding unit 101 encodes the input L signal and R signal as the first pair channel element (CPE), and adds the encoding parameter 112. Further, the LS signal and the RS signal are encoded as a second pair channel element (CPE), and an encoding parameter 114 is added. Further, the C signal is encoded as a single channel element (SCE) and an encoding parameter 113 is added, and the LFE signal is encoded as an LFE element and an encoding parameter 115 is added. The voice encoding unit 101 outputs the encoded data as encoded transmission data 110.

音声符号化装置１００から出力された符号化伝送データ１１０は伝送路３００を経由して音声復号装置２００で受信される。音声復号装置２００では、音声復号部２０１においてＭＰＥＧ−２ＡＡＣ方式に則ってデジタル音声信号に復号すると同時に、第１のペアチャンネルエレメントに付加されていた符号化パラメータ１１２、第２のペアチャンネルエレメントに付加されていた符号化パラメータ１１４、単一チャンネルエレメントに付加されていた符号化パラメータ１１３、ＬＦＥチャンネルエレメントに付加されていた符号化パラメータ１１５を分離し、出力チャンネル判定部２０２に通知する。 The encoded transmission data 110 output from the speech encoding apparatus 100 is received by the speech decoding apparatus 200 via the transmission path 300. In the audio decoding apparatus 200, the audio decoding unit 201 decodes the digital audio signal in accordance with the MPEG-2 AAC system, and simultaneously converts the encoding parameter 112 added to the first pair channel element and the second pair channel element. The encoding parameter 114 added, the encoding parameter 113 added to the single channel element, and the encoding parameter 115 added to the LFE channel element are separated and notified to the output channel determination unit 202.

出力チャンネル判定部２０２では、通知された符号化パラメータ１１２より第１のペアチャンネルエレメントが音声符号化装置１００の第1番入力端子から入力されたことを認知し、符号化パラメータ１１４より第２のペアチャンネルエレメントが音声符号化装置１００の第３番入力端子から入力されたことを認知し、符号化パラメータ１１３より単一チャンネルエレメントが音声符号化装置１００の第２番入力端子の第１チャンネルから入力されたことを認知し、符号化パラメータ１１５よりＬＦＥチャンネルエレメントが音声符号化装置１００の第２番入力端子の第２チャンネルから入力されたことを認知する。これにより、第１番出力端子２０４からＬ信号、Ｒ信号、第２番出力端子２０５からＣ信号、ＬＦＥ信号、第３番出力端子からＬＳ信号、ＲＳ信号を出力することを意味する出力チャンネル情報２１０を生成し、出力チャンネル選択部２０３に通知する。 The output channel determination unit 202 recognizes that the first pair channel element is input from the first input terminal of the speech encoding apparatus 100 based on the notified encoding parameter 112, and the second is determined based on the encoding parameter 114. Recognizing that the pair channel element is input from the third input terminal of the speech encoding apparatus 100, the single channel element is detected from the first channel of the second input terminal of the speech encoding apparatus 100 based on the encoding parameter 113. It is recognized that the LFE channel element has been input from the second channel of the second input terminal of the speech encoding apparatus 100 from the encoding parameter 115. Thus, the output channel information means that the L signal and the R signal are output from the first output terminal 204, the C signal and the LFE signal are output from the second output terminal 205, and the LS signal and the RS signal are output from the third output terminal. 210 is generated and notified to the output channel selection unit 203.

出力チャンネル選択部２０３では、出力チャンネル情報２１０を基に音声復号部２０１で復号し出力したデジタル音声信号２０７（Ｌ信号及びＲ信号）を、第１番出力端子２０４に接続し、また、音声復号部２０１が復号し出力したデジタル音声信号２０８（Ｃ信号及びＬＦＥ信号）を、第２番出力端子２０５に接続し、更に、音声復号部２０１が復号し出力したデジタル音声信号２０９（ＬＳ信号及びＲＳ信号）を、第３番出力端子２０６に接続する。 In the output channel selection unit 203, the digital audio signal 207 (L signal and R signal) decoded and output by the audio decoding unit 201 based on the output channel information 210 is connected to the first output terminal 204, and the audio decoding is performed. The digital audio signal 208 (C signal and LFE signal) decoded and output by the unit 201 is connected to the second output terminal 205, and further, the digital audio signal 209 (LS signal and RS signal) decoded and output by the audio decoding unit 201 is connected. Signal) is connected to the third output terminal 206.

その結果、第１番出力端子２０４から出力されるデジタル音声信号２１１は音声符号化装置１００の第１番入力端子１０４から入力したデジタル音声信号１０７と同じＬ信号、Ｒ信号となり、第２番出力端子２０５から出力されるデジタル音声信号２１２は、音声符号化装置１００の第２番入力端子１０５から入力したデジタル音声信号１０８と同じＣ信号、ＬＦＥ信号となり、第３番出力端子２０６から出力されるデジタル音声信号２１３は、音声符号化装置１００の第３番入力端子１０６から入力したデジタル音声信号１０９と同じＬＳ信号及びＲＳ信号となる。 As a result, the digital audio signal 211 output from the first output terminal 204 becomes the same L signal and R signal as the digital audio signal 107 input from the first input terminal 104 of the audio encoding device 100, and the second output. The digital audio signal 212 output from the terminal 205 becomes the same C signal and LFE signal as the digital audio signal 108 input from the second input terminal 105 of the audio encoding device 100, and is output from the third output terminal 206. The digital audio signal 213 is the same LS signal and RS signal as the digital audio signal 109 input from the third input terminal 106 of the audio encoding device 100.

このように、音声符号化装置１００から音声復号装置２００に送信する符号化伝送データ１１０中に、音声復号装置２００側でデジタル音声信号を出力させたい出力端子とチャンネルの情報を含ませることができるため、例えば、音声符号化装置１００の使用者が、音声符号化装置１００への入力順序と音声復号装置２００の出力順序を意識的に変更したい場合でも入力チャンネル情報入力部１０３への操作によって容易に実現することができる。即ち、入力チャンネル情報入力部１０３から出力される入力チャンネル情報１１１を、音声復号装置２００で出力させたい出力端子に対応した値とすることにより、この入力チャンネル情報１１１に対応した符号化パラメータ１１２〜１１５が生成され、入力順序と出力順序の変更も容易に実現することができる。 As described above, the encoded transmission data 110 transmitted from the speech encoding apparatus 100 to the speech decoding apparatus 200 can include information on an output terminal and a channel on which the speech decoding apparatus 200 wants to output a digital speech signal. Therefore, for example, even when the user of the speech encoding apparatus 100 wants to consciously change the input order to the speech encoding apparatus 100 and the output order of the speech decoding apparatus 200, it is easy to operate by operating the input channel information input unit 103. Can be realized. That is, by setting the input channel information 111 output from the input channel information input unit 103 to a value corresponding to the output terminal to be output by the speech decoding apparatus 200, the encoding parameters 112 to 112 corresponding to the input channel information 111 are set. 115 is generated, and the input order and the output order can be easily changed.

次に図２と図３を用いて符号化パラメータ変換部１０２が、入力チャンネル情報１１１を用いて符号化パラメータを生成する動作について説明する。
図２の符号化データブロック２２に含まれるデータブロック識別子２３は、図３に示すように符号化エレメント２５が単一チャンネルエレメントの場合はＩＤ＿ＳＣＥ、符号化エレメント２５がペアチャンネルエレメントの場合はＩＤ＿ＣＰＥ、符号化エレメント２５がＬＦＥチャンネルエレメントの場合はＩＤ＿ＬＦＥとなる。 Next, an operation in which the encoding parameter conversion unit 102 generates an encoding parameter using the input channel information 111 will be described with reference to FIGS.
As shown in FIG. 3, the data block identifier 23 included in the encoded data block 22 of FIG. 2 includes ID_SCE when the encoding element 25 is a single channel element, ID_CPE when the encoding element 25 is a pair channel element, When the encoding element 25 is an LFE channel element, ID_LFE is set.

一方、図２に示すデータブロック情報２４はISO/IEC13818-7:1997ではelement_instance_tag（エレメントインスタンスタグ）と称され、同じ符号化エレメントが存在した場合に区別するためにユニークな数字を付与すると規定されている。このため、例えば図１の例では、Ｌ信号及びＲ信号を符号化したペアチャンネルエレメントに付加するデータブロック情報（element_instance_tag）を“１”、ＬＳ信号及びＲＳ信号を符号化したペアチャンネルエレメントに付加するデータブロック情報（element_instance_tag）を“５”、Ｃ信号を符号化した単一チャンネルエレメントに付加するデータブロック情報（element_instance_tag）を“３”、ＬＦＥ信号を符号化したＬＦＥチャンネルエレメントに付加するデータブロック情報（element_instance_tag）を“４”として２個存在するペアチャンネルエレメントを区別する規格本来の使用目的を損なうことなく、入力チャンネル情報としても使用することが可能となる。即ち、この場合は、符号化パラメータ１１２が“１”、符号化パラメータ１１３が“３”、符号化パラメータ１１４が“５”、符号化パラメータ１１５が“４”となる。 On the other hand, the data block information 24 shown in FIG. 2 is referred to as element_instance_tag (element instance tag) in ISO / IEC13818-7: 1997, and is specified to give a unique number to distinguish when the same encoded element exists. ing. Therefore, for example, in the example of FIG. 1, the data block information (element_instance_tag) added to the pair channel element that encodes the L signal and the R signal is “1”, and is added to the pair channel element that encodes the LS signal and the RS signal. Data block information (element_instance_tag) to be added is "5", data block information (element_instance_tag) to be added to a single channel element encoded with a C signal is "3", and a data block to be added to an LFE channel element encoded with an LFE signal The information (element_instance_tag) is “4” and can be used as input channel information without impairing the original intended purpose of distinguishing two existing pair channel elements. That is, in this case, the encoding parameter 112 is “1”, the encoding parameter 113 is “3”, the encoding parameter 114 is “5”, and the encoding parameter 115 is “4”.

以上のように、実施の形態１の音声符号化装置によれば、ＭＰＥＧ−２ＡＡＣ音声符号化を行う音声符号化装置であって、音声信号と入力端子との対応関係を任意に決定する入力チャンネル情報に基づいて音声信号に対応した符号化パラメータを決定する符号化パラメータ変換部と、音声信号を符号化すると共に、音声符号化信号に、符号化パラメータ変換部から出力された符号化パラメータを付与し、符号化伝送データとして出力する音声符号化部とを備えたので、音声復号装置への通知や変更作業を必要とせず、音声符号化装置側で所望する順序に従って音声復号装置で音声信号を出力することができる。 As described above, according to the speech coding apparatus of Embodiment 1, it is a speech coding apparatus that performs MPEG-2 AAC speech coding, and an input that arbitrarily determines a correspondence relationship between a speech signal and an input terminal. An encoding parameter conversion unit that determines an encoding parameter corresponding to the audio signal based on the channel information, and encodes the audio signal, and the encoding parameter output from the encoding parameter conversion unit is added to the audio encoded signal. And a speech encoding unit that outputs the encoded transmission data as an encoded transmission data, so that there is no need to notify the speech decoding device or change work, and the speech signal is transmitted by the speech decoding device according to a desired order on the speech encoding device side. Can be output.

また、実施の形態１の音声符号化装置によれば、符号化パラメータは、ＭＰＥＧ−２ＡＡＣで定めるエレメントインスタンスタグを用いて付与するようにしたので、ペアチャンネルエレメントを区別する規格本来の使用目的を損なうことなく、入力チャンネル情報としても使用することが可能であり、容易に符号化パラメータの付与を行うことができる。 Also, according to the audio encoding device of the first embodiment, the encoding parameter is assigned using the element instance tag defined by MPEG-2 AAC, so that the standard intended purpose of distinguishing pair channel elements is used. Can be used as input channel information without impairing the coding parameters, and encoding parameters can be easily assigned.

また、実施の形態１の音声復号装置によれば、ＭＰＥＧ−２ＡＡＣ音声復号を行う音声復号装置であって、符号化伝送データから音声信号を復号すると共に、符号化伝送データを送出した音声符号化装置における音声信号と入力端子との対応関係を任意に決定する入力チャンネル情報に基づいて生成された符号化パラメータを分離する音声復号部と、音声復号部で分離された符号化パラメータに基づいて、復号された音声信号の出力端子を判定する出力チャンネル判定部とを備えたので、音声符号化装置からの通知や変更作業を必要とせず、音声符号化装置側で所望する順序に従って音声信号を出力することができる。 Moreover, according to the audio decoding device of the first embodiment, the audio decoding device performs MPEG-2 AAC audio decoding, and decodes the audio signal from the encoded transmission data and transmits the encoded transmission data. A speech decoding unit that separates encoding parameters generated based on input channel information that arbitrarily determines a correspondence between a speech signal and an input terminal in the encoding device, and a coding parameter separated by the speech decoding unit And an output channel determination unit that determines an output terminal of the decoded speech signal, so that notification or change work from the speech encoding device is not required, and the speech signal is output according to a desired order on the speech encoding device side. Can be output.

また、実施の形態１の音声復号装置によれば、符号化パラメータは、ＭＰＥＧ−２ＡＡＣで定めるエレメントインスタンスタグから分離されるようにしたので、容易に符号化伝送データから符号化パラメータの分離を行うことができる。 Further, according to the speech decoding apparatus of the first embodiment, since the encoding parameter is separated from the element instance tag defined in MPEG-2 AAC, the encoding parameter can be easily separated from the encoded transmission data. It can be carried out.

実施の形態２．
実施の形態２では、符号化パラメータとして、データストリームエレメント（data_stream_element）を用いるようにしたものであり、図面上の構成は実施の形態１と同様であるため、図１を援用して説明する。実施の形態２の音声符号化部１０１では、符号化パラメータ１１２〜１１５を、対応したデジタル音声信号１０７〜１０９に付与する場合に、データストリームエレメントを用いて行うよう構成されている。また、音声復号部２０１では、受信した符号化伝送データ１１０から、デジタル音声信号を復号すると共に、データストリームエレメントから符号化パラメータ１１２〜１１５を分離するよう構成されている。これ以外の構成は実施の形態１と同様である。 Embodiment 2. FIG.
In the second embodiment, a data stream element (data_stream_element) is used as an encoding parameter, and since the configuration on the drawing is the same as that of the first embodiment, the description will be given with reference to FIG. The audio encoding unit 101 according to Embodiment 2 is configured to use data stream elements when encoding parameters 112 to 115 are assigned to corresponding digital audio signals 107 to 109. The audio decoding unit 201 is configured to decode a digital audio signal from the received encoded transmission data 110 and to separate the encoding parameters 112 to 115 from the data stream element. Other configurations are the same as those in the first embodiment.

次に実施の形態２による符号化パラメータについて、図３と図４を用いて説明する。
図４は、データストリームエレメントの構成図である。
図４において、符号化データブロック２２は、データストリームエレメントを示す識別子４１、データストリームエレメント内に記載されたＬ信号、Ｒ信号のチャンネル情報４２、Ｃ信号のチャンネル情報４３、ＬＦＥ信号のチャンネル情報４４、ＬＳ信号、ＲＳ信号のチャンネル情報４５から構成されている。 Next, encoding parameters according to the second embodiment will be described with reference to FIGS.
FIG. 4 is a configuration diagram of the data stream element.
In FIG. 4, an encoded data block 22 includes an identifier 41 indicating a data stream element, an L signal described in the data stream element, an R signal channel information 42, a C signal channel information 43, and an LFE signal channel information 44. , LS signal and RS signal channel information 45.

実施の形態１によるチャンネル追従方法においては、音声符号化装置１００に入力するデジタル音声信号の入力情報をデータブロック情報２４を用いて音声復号装置２００に伝送したが、実施の形態２では、データブロック情報２４の代わりにデータストリームエレメント３８及びデータストリームエレメント識別子３７（図３参照）を用いて伝送を行う。 In the channel tracking method according to the first embodiment, the input information of the digital audio signal input to the audio encoding device 100 is transmitted to the audio decoding device 200 using the data block information 24. However, in the second embodiment, the data block Transmission is performed using the data stream element 38 and the data stream element identifier 37 (see FIG. 3) instead of the information 24.

データストリームエレメント３８については規格上使用者が任意に使用目的を決定することができるため、例えば、図４に示すようにデータストリームエレメント内にＬ信号、Ｒ信号のチャンネル情報４２、Ｃ信号のチャンネル情報４３、ＬＦＥ信号のチャンネル情報４４、ＬＳ信号、ＲＳ信号のチャンネル情報４５を記載し、伝送する。即ち、図２に示した実施の形態１と同様に、Ｌ信号及びＲ信号を符号化したペアチャンネルエレメントに付加するデータストリームエレメントを“１”、ＬＳ信号及びＲＳ信号を符号化したペアチャンネルエレメントに付加するデータストリームエレメントを“５”、Ｃ信号を符号化した単一チャンネルエレメントに付加するデータストリームエレメントを“３”、ＬＦＥ信号を符号化したＬＦＥチャンネルエレメントに付加するデータストリームエレメントを“４”とする。 Since the user can arbitrarily determine the purpose of use of the data stream element 38 according to the standard, for example, as shown in FIG. 4, the channel information 42 of the L signal, the R signal, and the channel of the C signal are included in the data stream element. Information 43, channel information 44 of the LFE signal, channel information 45 of the LS signal and RS signal are described and transmitted. That is, as in the first embodiment shown in FIG. 2, the data stream element to be added to the pair channel element in which the L signal and the R signal are encoded is “1”, and the pair channel element in which the LS signal and the RS signal are encoded. "5" for the data stream element to be added to "3", "3" for the data stream element to be added to the single channel element encoded with the C signal, and "4" for the data stream element to be added to the LFE channel element to which the LFE signal is encoded. ".

これにより、音声復号装置２００では、このような符号化伝送データ１１０を受信すると、音声復号部２０１において分離し、出力チャンネル判定部２０２に通知することが可能で、実施の形態１と同じ効果を得ることができる。 As a result, when receiving such encoded transmission data 110, speech decoding apparatus 200 can separate it in speech decoding section 201 and notify it to output channel determination section 202, which has the same effect as in the first embodiment. Obtainable.

以上のように、実施の形態２の音声符号化装置によれば、符号化パラメータを、ＭＰＥＧ−２ＡＡＣで定めるデータストリームエレメントを用いて付与するようにしたので、容易に符号化パラメータの付与を行うことができる。 As described above, according to the audio encoding device of the second embodiment, the encoding parameter is assigned using the data stream element defined by MPEG-2 AAC. It can be carried out.

また、実施の形態２の音声復号装置によれば、符号化パラメータを、ＭＰＥＧ−２ＡＡＣで定めるデータストリームエレメントから分離するようにしたので、容易に符号化伝送データから符号化パラメータの分離を行うことができる。 Also, according to the speech decoding apparatus of the second embodiment, since the encoding parameter is separated from the data stream element defined by MPEG-2 AAC, the encoding parameter is easily separated from the encoded transmission data. be able to.

この発明の実施の形態１による音声符号化装置及び音声復号装置を示す構成図である。It is a block diagram which shows the audio | voice encoding apparatus and audio | voice decoding apparatus by Embodiment 1 of this invention. この発明に係る音声伝送フレームＡＤＴＳの構成図である。It is a block diagram of the audio | voice transmission frame ADTS which concerns on this invention. この発明に係るデータブロック識別子の説明図である。It is explanatory drawing of the data block identifier which concerns on this invention. この発明に係るデータストリームエレメントの構成図である。It is a block diagram of the data stream element based on this invention.

Explanation of symbols

１００音声符号化装置、１０１音声符号化部、１０２符号化パラメータ変換部、１０３入力チャンネル情報入力部、１０４第１番入力端子、１０５第２番入力端子、１０６第３番入力端子、１０７，１０８，１０９，２１１，２１２，２１３デジタル音声信号、１１０符号化伝送データ、１１１入力チャンネル情報、１１２，１１３，１１４，１１５符号化パラメータ、２００音声復号装置、２０１音声復号部、２０２出力チャンネル判定部、２０３出力チャンネル選択部、２０４第１番出力端子、２０５第２番出力端子、２０６第３番出力端子、２１０出力チャンネル情報。 DESCRIPTION OF SYMBOLS 100 Speech coding apparatus, 101 Speech coding part, 102 Encoding parameter conversion part, 103 Input channel information input part, 104 1st input terminal, 105 2nd input terminal, 106 3rd input terminal, 107,108 , 109, 211, 212, 213 Digital audio signal, 110 encoded transmission data, 111 input channel information, 112, 113, 114, 115 encoding parameter, 200 audio decoding device, 201 audio decoding unit, 202 output channel determination unit, 203 output channel selection unit, 204 first output terminal, 205 second output terminal, 206 third output terminal, 210 output channel information.

Claims

An audio encoding device that performs MPEG-2 AAC audio encoding,
An encoding parameter conversion unit that determines an encoding parameter corresponding to the audio signal based on input channel information that arbitrarily determines a correspondence relationship between the audio signal and the input terminal;
A speech code comprising: a speech encoding unit that encodes the speech signal, adds the encoding parameter output from the encoding parameter conversion unit to the speech encoded signal, and outputs the encoded parameter as encoded transmission data Device.

The speech encoding apparatus according to claim 1, wherein the encoding parameter is assigned by using an element instance tag defined by MPEG-2 AAC.

The audio encoding apparatus according to claim 1, wherein the encoding parameter is assigned using a data stream element defined by MPEG-2 AAC.

An audio decoding device that performs MPEG-2 AAC audio decoding,
Encoding generated based on input channel information for arbitrarily determining a correspondence between an audio signal and an input terminal in an audio encoding device that has decoded the audio signal from the encoded transmission data and sent out the encoded transmission data A speech decoder for separating parameters;
An audio decoding apparatus comprising: an output channel determination unit that determines an output terminal of the decoded audio signal based on the encoding parameter separated by the audio decoding unit.

5. The speech decoding apparatus according to claim 4, wherein the encoding parameter is separated from an element instance tag defined by MPEG-2 AAC.

5. The audio decoding apparatus according to claim 4, wherein the encoding parameter is separated from a data stream element defined by MPEG-2 AAC.