JPWO2012160782A1

JPWO2012160782A1 - Bitstream transmission apparatus, bitstream transmission / reception system, bitstream reception apparatus, bitstream transmission method, bitstream reception method, and bitstream

Info

Publication number: JPWO2012160782A1
Application number: JP2013516194A
Authority: JP
Inventors: 孝祐西尾; 宮阪　修二; 修二宮阪
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 2011-05-20
Filing date: 2012-05-16
Publication date: 2014-07-31
Anticipated expiration: 2032-05-16
Also published as: EP2711924A1; CN103620678A; CN103620678B; EP2711924A4; WO2012160782A1; EP2711924B1; JP5843856B2

Abstract

本発明に係るビットストリーム送信装置（２００）は、オーディオ信号を符号化して得られたエレメンタリオーディオストリームを取得する入力部（１００）と、エレメンタリオーディオストリーム（３００）のサンプリング周波数と符号化形式とを解析する解析部（１０１）と、解析されたサンプリング周波数と符号化形式とが規格に適合しない場合、エレメンタリオーディオストリーム（３００）の符号化形式を示す情報として規格に適合する符号化形式を記載したプリアンブル信号を生成するプリアンブル生成部（１０２）と、生成されたプリアンブル信号をエレメンタリオーディオストリーム（３００）に付加することで、規格のオーディオビットストリーム（３０１）を成形する成形部（１０３）とを備える。The bitstream transmission device (200) according to the present invention includes an input unit (100) that acquires an elementary audio stream obtained by encoding an audio signal, a sampling frequency of the elementary audio stream (300), and an encoding format. When the analyzed sampling frequency and the encoded format do not conform to the standard, the encoding unit conforms to the standard as information indicating the encoding format of the elementary audio stream (300). Are added to the elementary audio stream (300), and a shaping unit (103) for shaping the standard audio bitstream (301) is added to the elementary audio stream (300). ).

Description

本発明は、圧縮符号化されたオーディオ信号を別の装置に向けて送信するビットストリーム送信装置、及び、当該ビットストリーム送信装置から送信されたオーディオ信号を受信してオーディオを出力する受信装置を備えたビットストリーム送受信システムに関するものである。 The present invention includes a bitstream transmission apparatus that transmits a compression-coded audio signal to another apparatus, and a reception apparatus that receives the audio signal transmitted from the bitstream transmission apparatus and outputs audio. The present invention relates to a bitstream transmission / reception system.

従来、圧縮符号化されて記録されたビデオ信号及びオーディオ信号をＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、半導体メモリ、及びハードディスク等の記録媒体から読み出し、読み出したビデオ信号及びオーディオ信号をデコード（伸張）して出力するデータ再生装置が知られている。そして、データ再生装置は、伸張されたデータを当該データ再生装置に接続された映像モニタ、及び音響スピーカ等に送る。これにより、ビデオ及びオーディオの再生が実現される。また、デジタル放送を再生する場合、データ再生装置は、圧縮符号化されたビデオ信号及びオーディオ信号を受信し、記録媒体からの再生と同様に、圧縮符号化されたビデオ信号及びオーディオ信号をデコードして出力する。これにより、ビデオ及びオーディオの再生が実現される。 Conventionally, video signals and audio signals recorded after compression encoding are read from a recording medium such as a CD-ROM, DVD-ROM, semiconductor memory, and hard disk, and the read video signals and audio signals are decoded (expanded). A data reproducing apparatus for outputting is known. Then, the data reproduction device sends the decompressed data to a video monitor, an audio speaker, and the like connected to the data reproduction device. This realizes video and audio playback. Also, when playing back digital broadcast, the data playback device receives the compressed and encoded video signal and audio signal, and decodes the compressed and encoded video signal and audio signal in the same way as playback from the recording medium. Output. This realizes video and audio playback.

ビデオ信号及びオーディオ信号の圧縮符号化に関しては、多くの規格が規定されている。 Many standards have been defined for compression encoding of video signals and audio signals.

例えば、画像データ符号化規格の主な例としてはＭＰＥＧ−２やＭＰＥＧ−４などが挙げられる。 For example, MPEG-2, MPEG-4, etc. are mentioned as main examples of image data coding standards.

また、オーディオデータ符号化規格の主な例としてはＤｏｌｂｙＤｉｇｉｔａｌ、ＤＴＳ（ＤｉｇｉｔａｌＴｈｅａｔｅｒＳｙｓｔｅｍ）、ＭＰＥＧＡｕｄｉｏ、及びＭＰＥＧ−４ＡＡＣ（ＡｄａｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）などが挙げられる。また、オーディオデータ符号化規格として、情報量を削減しないＰＣＭ（ＰｕｌｓｅＣｏｄｅＭｏｄｕｌａｔｉｏｎ）方式なども挙げられる。 Also, main examples of the audio data coding standard include Dolby Digital, DTS (Digital Theater System), MPEG Audio, and MPEG-4 AAC (Advanced Audio Coding). In addition, as an audio data encoding standard, a PCM (Pulse Code Modulation) system that does not reduce the amount of information can be cited.

ＤｏｌｂｙＤｉｇｉｔａｌは米Ｄｏｌｂｙ社の登録商標である。 Dolby Digital is a registered trademark of Dolby Corporation.

ＤＴＳは米ＤｉｇｌｔａｌＴｈｅａｔｅｒＳｙｓｔｅｍ社の登録商標である。 DTS is a registered trademark of Digital Theater System.

これらのオーディオデータ符号化方式で符号化された信号を以降「エレメンタリオーディオストリーム」と呼ぶ。 Signals encoded by these audio data encoding schemes are hereinafter referred to as “elementary audio streams”.

また、エレメンタリオーディオストリームをデータ再生装置でデコードすると同時に、外部機器へ当該エレメンタリオーディオストリームを出力することも可能である。 It is also possible to output the elementary audio stream to an external device at the same time as the elementary audio stream is decoded by the data reproducing apparatus.

例えば、デジタルテレビとＡＶレシーバとを、Ｓ／ＰＤＩＦ（ＳｏｎｙＰｈｉｌｉｐｓＤｉｇｉｔａｌＩｎｔｅｒＦａｃｅ）又はＨＤＭＩ（Ｈｉｇｈ−ＤｅｆｉｎｉｔｉｏｎＭｕｌｔｉｍｅｄｉａＩｎｔｅｒｆａｃｅ）を用いて接続する。これにより、ＩＥＣ６０９５８／ＩＥＣ６１９３７に従って、デジタルテレビからＡＶレシーバへエレメンタリオーディオストリームを伝送することができる。 For example, a digital television and an AV receiver are connected using S / PDIF (Sony Philips Digital Interface) or HDMI (High-Definition Multimedia Interface). Thereby, according to IEC60958 / IEC61937, an elementary audio stream can be transmitted from a digital television to an AV receiver.

ＡＶレシーバは、受信したエレメンタリオーディオストリームを、デコードして出力することでオーディオ信号の再生を実現する。 The AV receiver realizes reproduction of an audio signal by decoding and outputting the received elementary audio stream.

図７は、ＩＥＣ６０９５８／ＩＥＣ６１９３７における出力ビットストリームのデータ構成の一例を示す図である。図７（ａ）は、ＩＥＣ６０９５８／ＩＥＣ６１９３７で定められたエレメンタリオーディオストリームのデータ構成の一例を示している。図７（ｂ）は、エレメンタリオーディオストリームを伝送するためのオーディオビットストリームのデータ構成の一例を示している。 FIG. 7 is a diagram illustrating an example of a data configuration of an output bitstream in IEC60958 / IEC61937. FIG. 7A shows an example of the data structure of an elementary audio stream defined by IEC60958 / IEC61937. FIG. 7B shows an example of the data structure of an audio bit stream for transmitting an elementary audio stream.

図７（ａ）に示すように、エレメンタリオーディオストリーム３００は、所定数のサンプル（例えば、１０２４サンプル）を含む単位であるフレームを複数含む。各フレームは、ヘッダ部（Ｈｅａｄｅｒ）と、エンコードデータ部（ＥｎｃｏｄｅｄＤａｔａ）とを含む。エンコードデータ部には実際の音の符号化信号が記録され、ヘッダ部には、エンコードデータ部に記録された符号化信号の属性、例えば、サンプリングレート及びチャネル数などが記録される。 As shown in FIG. 7A, the elementary audio stream 300 includes a plurality of frames that are units including a predetermined number of samples (for example, 1024 samples). Each frame includes a header part (Header) and an encoded data part (EncodedData). The encoded data portion of the actual sound is recorded in the encoded data portion, and the attributes of the encoded signal recorded in the encoded data portion, such as the sampling rate and the number of channels, are recorded in the header portion.

ヘッダ部に符号化信号の属性を記録する形式には、ＡＤＴＳ（ＡｕｄｉｏＤａｔａＴｒａｎｓｐｏｒｔＳｔｒｅａｍ）形式と呼ばれるもの、及びＬＯＡＳ（ＬｏｗＯｖｅｒｈｅａｄＡｕｄｉｏＳｔｒｅａｍ）形式と呼ばれるものがある。図８Ａ及び図８Ｂは、ＩＥＣ６０９５８／ＩＥＣ６１９３７におけるビットストリームのヘッダ部のデータ形式の一例を示す図である。図８Ａは、ＡＤＴＳ形式の固定ヘッダにおける記述の一例を示す。図８Ａのテーブルの上から８行目には、符号化信号のサンプリング周波数をＡＤＴＳで記述するためのフォーマットが示されている。ここでは、符号化信号のサンプリング周波数を０ｘ３〜０ｘ８から選択することが規定されている。図８Ｂは、ＬＯＡＳ形式のヘッダにおける記述の一例を示す。図８Ｂに示すように、ＬＯＡＳ形式においても、ヘッダに符号化信号のサンプリング周波数を記述する形式が規定されている。 As formats for recording the attribute of the encoded signal in the header portion, there are a format called ADTS (Audio Data Transport Stream) format and a format called LOAS (Low Overhead Audio Stream) format. 8A and 8B are diagrams illustrating an example of the data format of the header part of the bit stream in IEC60958 / IEC61937. FIG. 8A shows an example of a description in a fixed header in the ADTS format. The eighth line from the top of the table of FIG. 8A shows a format for describing the sampling frequency of the encoded signal in ADTS. Here, it is specified that the sampling frequency of the encoded signal is selected from 0x3 to 0x8. FIG. 8B shows an example of the description in the LOAS format header. As shown in FIG. 8B, the format for describing the sampling frequency of the encoded signal in the header is also defined in the LOAS format.

図７（ａ）のエレメンタリオーディオストリーム３００のエンコードデータ部に記録される符号化信号の符号化方式としては、国際標準規格のＭＰＥＧオーディオＡＡＣ方式、及びＨＥＡＡＣ（Ｈｉｇｈ−ＥｆｆｉｃｉｅｎｃｙＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）方式がよく知られている。ＨＥＡＡＣ方式は、ＡＡＣ方式に対してＳＢＲ（ＳｐｅｃｔｒａｌＢａｎｄＲｅｐｌｉｃａｔｉｏｎ）と呼ばれる周波数帯域拡大技術を適用した方式である。また、ＨＥＡＡＣ方式は、ＳＢＲ情報をあえて処理しないようにすれば、又はそもそもＳＢＲ情報を処理できないデコーダであれば、ＡＡＣと互換性のある再生処理が行える。 As an encoding method of the encoded signal recorded in the encoded data part of the elementary audio stream 300 of FIG. 7A, the international standard MPEG audio AAC method and the HEAAC (High-Efficiency Advanced Audio Coding) method are used. well known. The HEAAC scheme is a scheme in which a frequency band expansion technique called SBR (Spectral Band Replication) is applied to the AAC scheme. Further, the HEAAC method can perform reproduction processing compatible with AAC if the SBR information is not processed intentionally or if it is a decoder that cannot process SBR information in the first place.

上記エンコードデータ部に記述される符号化信号がＳＢＲ技術を適用して符号化されたものであるか否かは、エンコードデータ部内の情報を解析することで判断できる。すなわち、図７（ａ）のエレメンタリオーディオストリーム３００内の破線枠のビット構造を調べることで上記を解析できる。破線枠はＡＡＣ規格におけるＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄと呼ばれる情報格納構造である。当該Ｅｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄ内にＳＢＲ情報が格納されているかどうかに応じて、符号化信号にＳＢＲ技術が適用されているかどうかを判断することができる。エレメンタリオーディオストリーム３００では、Ｅｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄは各フレームの最後に配置されているが、必ずしも各フレームにＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄが存在する必要はない。 Whether or not the encoded signal described in the encoded data portion is encoded by applying the SBR technique can be determined by analyzing information in the encoded data portion. That is, the above can be analyzed by examining the bit structure of the dashed frame in the elementary audio stream 300 of FIG. A broken line frame is an information storage structure called Extension_payload in the AAC standard. Whether or not the SBR technique is applied to the encoded signal can be determined according to whether or not SBR information is stored in the Extension_payload. In the elementary audio stream 300, the Extension_payload is arranged at the end of each frame, but the Extension_payload does not necessarily exist in each frame.

図７（ｂ）に示すように、オーディオビットストリーム３０１は、ＩＥＣ６１９３７形式オーディオビットストリームである。ＩＥＣ６１９３７形式オーディオビットストリームは、エレメンタリオーディオストリームの前にＰａ、Ｐｂ、Ｐｃ及びＰｄと名付けられたプリアンブル信号が付加されたものである。さらに、エレメンタリオーディオストリームの後には、Ｓｔｕｆｆｉｎｇと呼ばれる、信号長を調節するための無効信号が挿入される場合もある。 As shown in FIG. 7B, the audio bit stream 301 is an IEC 61937 format audio bit stream. The IEC61937 format audio bitstream is obtained by adding preamble signals named Pa, Pb, Pc, and Pd to the elementary audio stream. Furthermore, an invalid signal called “Stuffing” for adjusting the signal length may be inserted after the elementary audio stream.

図９は、図７（ｂ）に示したプリアンブル信号Ｐｃのデータ形式を示す図である。図９に破線で囲んで示す部分は、プリアンブル信号Ｐｃ内に、ＬＯＡＳ形式で、ＳＢＲ情報の有無（ＡＡＣかＨＥＡＡＣかを示す信号）を記載するためのデータの一例を示す部分である。例えば、プリアンブル信号Ｐｃの０ビット目から４ビット目までの５ビットが示す値が２３の場合、当該エレメンタリオーディオストリーム３００はＬＯＡＳ形式である。さらに、プリアンブル信号Ｐｃの５ビット目から６ビット目までの２ビットが示す値が１の場合、当該エレメンタリオーディオストリームにはＳＢＲ情報が含まれていないことが示される。すなわち、エンコードデータ部に記録されている符号化信号が、ＡＡＣＬＣ方式又はＡＡＣ方式で符号化されていることが示される。また、プリアンブル信号Ｐｃの０ビット目から４ビット目までの５ビットが示す値が２３で、かつ、プリアンブル信号Ｐｃの５ビット目から６ビット目までの２ビットが示す値が２の場合、エレメンタリオーディオストリーム３００にはＳＢＲ情報が含まれていることが示される。すなわち、エンコードデータ部に記録されている符号化信号が、ＨＥＡＡＣ方式で符号化されていることが示される。 FIG. 9 is a diagram showing a data format of the preamble signal Pc shown in FIG. A portion surrounded by a broken line in FIG. 9 is a portion showing an example of data for describing the presence / absence of SBR information (a signal indicating AAC or HEAAC) in the LOAS format in the preamble signal Pc. For example, when the value indicated by 5 bits from the 0th bit to the 4th bit of the preamble signal Pc is 23, the elementary audio stream 300 is in the LOAS format. Furthermore, when the value indicated by 2 bits from the 5th bit to the 6th bit of the preamble signal Pc is 1, it indicates that the elementary audio stream does not include SBR information. That is, it is indicated that the encoded signal recorded in the encoded data portion is encoded by the AACLC method or the AAC method. Also, when the value indicated by the 5th bit from the 0th bit to the 4th bit of the preamble signal Pc is 23 and the value indicated by the 2nd bit from the 5th bit to the 6th bit of the preamble signal Pc is 2, the element It is indicated that the mental audio stream 300 includes SBR information. That is, it is indicated that the encoded signal recorded in the encoded data portion is encoded by the HEAAC method.

さて、上記技術を用いるビットストリーム送信装置は、エレメンタリオーディオストリーム３００のようなエレメンタリオーディオストリームを受信した場合、当該エレメンタリオーディオストリームのＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄからＳＢＲ情報の有無を検出する。そして、ビットストリーム送信装置は、ＳＢＲ情報がない場合は、上記プリアンブル信号Ｐｃに図９に示したビット構造に従って、ＡＡＣを指定する。一方、Ｅｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄにＳＢＲ情報がある場合は、ビットストリーム送信装置はＨＥＡＡＣを指定する。他のプリアンブル信号Ｐａ、Ｐｂ及びＰｄもそれぞれ、そのビット構造と信号の意味の規定とにそって設定される。そして、必要に応じてＳｔｕｆｆｉｎｇが挿入されることで、ＩＥＣ６１９３７形式のオーディオビットストリームが生成される。 Now, when receiving an elementary audio stream such as the elementary audio stream 300, the bit stream transmission device using the above technique detects the presence or absence of SBR information from the Extension_payload of the elementary audio stream. Then, when there is no SBR information, the bit stream transmission apparatus designates AAC according to the bit structure shown in FIG. 9 in the preamble signal Pc. On the other hand, when there is SBR information in Extension_payload, the bitstream transmission apparatus specifies HEAAC. Other preamble signals Pa, Pb, and Pd are also set in accordance with the bit structure and the definition of the meaning of the signal. Then, stuffing is inserted as necessary to generate an audio bit stream in the IEC 61937 format.

ＩＥＣ６１９３７−１１Ｄｉｇｉｔａｌａｕｄｉｏ − Ｉｎｔｅｒｆａｃｅｆｏｒｎｏｎ−ｌｉｎｅａｒＰＣＭｅｎｃｏｄｅｄａｕｄｉｏｂｉｔｓｔｒｅａｍｓａｐｐｌｙｉｎｇＩＥＣ６０９５８ − Ｐａｒｔ１１：ＭＰＥＧ−４ＡＡＣａｎｄｉｔｓｅｘｔｅｎｓｉｏｎｓｉｎＬＡＴＭ／ＬＯＡＳIEC 61937-11 Digital audio-Interface for non-linear PCM encoded audio bitstreams applying IEC 60958-Part 11: MPEG-4 AAC and it's LA Extensions ＩＳＯ／ＩＥＣ１４４９６−３Ｉｎｆｏｒｍａｔｉｏｎｔｅｃｈｎｏｌｏｇｙ − Ｃｏｄｉｎｇｏｆａｕｄｉｏ−ｖｉｓｕａｌｏｂｊｅｃｔｓ −Ｐａｒｔ３：ＡｕｄｉｏISO / IEC 14496-3 Information technology-Coding of audio-visual objects-Part 3: Audio ＥＴＳＩＴＳ１０１１５４ＴｅｃｈｎｉｃａｌＳｐｅｃｉｆｉｃａｔｉｏｎＤｉｇｉｔａｌＶｉｄｅｏＢｒｏａｄｃａｓｔｉｎｇ（ＤＶＢ）；ＳｐｅｃｉｆｉｃａｔｉｏｎｆｏｒｔｈｅｕｓｅｏｆＶｉｄｅｏａｎｄＡｕｄｉｏＣｏｄｉｎｇｉｎＢｒｏａｄｃａｓｔｉｎｇＡｐｐｌｉｃａｔｉｏｎｓｂａｓｅｄｏｎｔｈｅＭＰＥＧ−２ＴｒａｎｓｐｏｒｔＳｔｒｅａｍETSI TS 101 154 Technical Specification Digital Video Broadcasting (DVB); Specialization for the use of Video and Audio Coding in Broadcasting

しかしながら、上記従来技術には２点の課題がある。 However, there are two problems with the conventional technology.

課題の１点目として、ＭＥＰＧ−４ＡＡＣで定義されているパラメータの中で、ビットストリーム規格が定義されていないものがある、という点である。図１０は、エレメンタリオーディオストリームのサンプリングレートと当該サンプリングレートに対して選択可能なオーディオデータのオーディオ符号化データ形式との関係を示す図である。 The first problem is that some of the parameters defined in MPEG-4 AAC do not define the bitstream standard. FIG. 10 is a diagram illustrating the relationship between the sampling rate of the elementary audio stream and the audio encoded data format of audio data that can be selected for the sampling rate.

図１０に示すように、ＡＡＣサンプリングレートが、１６ｋＨｚ、２２．０５ｋＨｚ、２４ｋＨｚで、かつオーディオ符号化データ形式がＡＡＣの場合、ビットストリーム規格が定義されていない（図１０の上から１〜３行目）。また、ＡＡＣサンプリングレートが、３２ｋＨｚ、４４．１ｋＨｚで、かつオーディオ符号化データ形式がＨＥＡＡＣ（ＡＡＣ符号化信号にＳＢＲ情報が付加されている）場合、ビットストリーム規格が定義されていない（図１０の４〜５行目）。特に後者、すなわち、ＡＡＣサンプリングレートが、３２ｋＨｚ、４４．１ｋＨｚで、オーディオ符号化データ形式がＨＥＡＡＣである場合は、デジタル放送等で運用が規定されているにもかかわらずビットストリーム規格が定義されていない。この結果、ＨＥＡＡＣを用いて符号化された高音質のエレメンタリオーディオストリームを受信したとしても、このエレメンタリオーディオストリームを外部のマルチチャネルスピーカシステムなどで再生して楽しむことができないという課題がある。 As shown in FIG. 10, when the AAC sampling rate is 16 kHz, 22.05 kHz, 24 kHz and the audio encoded data format is AAC, the bitstream standard is not defined (from the top to the bottom of lines 1 to 3 in FIG. 10). Eye). In addition, when the AAC sampling rate is 32 kHz, 44.1 kHz, and the audio encoded data format is HEAAC (SBR information is added to the AAC encoded signal), the bitstream standard is not defined (FIG. 10). Lines 4-5). In particular, in the latter case, that is, when the AAC sampling rate is 32 kHz, 44.1 kHz and the audio encoded data format is HEAAC, the bitstream standard is defined even though the operation is regulated by digital broadcasting or the like. Absent. As a result, there is a problem that even if a high-quality elementary audio stream encoded using HEAAC is received, the elementary audio stream cannot be reproduced and enjoyed by an external multi-channel speaker system or the like.

課題の２点目として、例えば、オーディオ符号化データ形式が異なる符号化データが同一コンテンツに含まれるような場合には、出力ビットストリームにおける出力サンプリング周波数の切り替えが増加することである。 As a second problem, for example, when encoded data having different audio encoded data formats are included in the same content, switching of the output sampling frequency in the output bit stream is increased.

例えば、デジタル放送において、番組コンテンツ部分はオーディオ符号化データ形式がＨＥＡＡＣで、かつ、ＡＡＣサンプリングレートが４８ＫＨｚで放送されており、その一方で、コマーシャル部分はオーディオ符号化データ形式がＡＡＣＬＣかつＡＡＣサンプリングレートが４８ＫＨｚで放送されている場合、番組コンテンツ部分とコマーシャル部分との切変わり目で出力サンプリング周波数の切り替えが発生する。 For example, in digital broadcasting, the program content portion is broadcast with an audio encoded data format of HEAAC and an AAC sampling rate of 48 KHz, while the commercial portion has an audio encoded data format of AACLC and AAC sampling rate. Is broadcast at 48 KHz, the output sampling frequency is switched at the transition between the program content portion and the commercial portion.

図１１Ａ及び図１１Ｂは、ＨＥＡＡＣデコーダが復号し再生する周波数帯域と、ＡＡＣデコーダが復号し再生する周波数帯域との違いを示す図である。図１１Ａは、ＨＥＡＡＣでかつＡＡＣサンプリングレートが４８ＫＨｚの符号化データにおける再生可能な周波数帯域を示している。ＨＥＡＡＣは、ＡＡＣが再生する周波数帯域の信号に対し、ＳＢＲ技術（ＳｐｅｃｔｒａｌＢａｎｄＲｅｐｌｉｃａｔｉｏｎ）を用いることでＡＡＣの２倍の周波数帯域の信号を再生する。すなわち図１１Ａの実線部分は４８ｋＨｚのＡＡＣによって再生される０〜２４ｋＨｚの信号を示しており、破線部分の２４〜４８ｋＨｚの帯域がＳＢＲ技術によって拡大される帯域を示している。それによって、９６ｋＨｚの信号が再生される。一方図１１ＢはＡＡＣでかつＡＡＣサンプリングレートが４８ＫＨｚの符号化データが再生する周波数帯域を示している。図１１Ｂに示すように、ＨＥＡＡＣにおいてＡＡＣ部が再生する周波数帯域と同じ周波数帯域の信号が生成される。 11A and 11B are diagrams showing the difference between the frequency band decoded and reproduced by the HEAAC decoder and the frequency band decoded and reproduced by the AAC decoder. FIG. 11A shows a reproducible frequency band in encoded data with HEAAC and an AAC sampling rate of 48 KHz. HEAAC reproduces a signal in a frequency band twice that of AAC by using SBR technology (Spectral Band Replication) for a signal in a frequency band reproduced by AAC. That is, the solid line part of FIG. 11A shows a signal of 0 to 24 kHz reproduced by AAC of 48 kHz, and the band of 24 to 48 kHz of the broken line part shows a band expanded by the SBR technique. As a result, a 96 kHz signal is reproduced. On the other hand, FIG. 11B shows a frequency band in which encoded data having AAC and an AAC sampling rate of 48 KHz is reproduced. As shown in FIG. 11B, a signal in the same frequency band as the frequency band reproduced by the AAC unit in HEAAC is generated.

図１１Ａ及び図１１Ｂから分かるように、番組コンテンツ部分ではＩＥＣ６０９５８のフレームレートが９６ＫＨｚの信号が出力され、コマーシャル部分ではＩＥＣ６０９５８のフレームレートが４８ＫＨｚの信号が出力されることになる。このようなビットストリームを受信した場合、ＡＶレシーバにおいて受信したビットストリームのＩＥＣ６０９５８のフレームレートの変化に応答して再度同期をとるための処理が必要となる。そして、この処理に時間がかかるため、番組部分とコマーシャル部分との切り替わりで音切れを生じるという問題がある。 As can be seen from FIGS. 11A and 11B, a signal with an IEC 60958 frame rate of 96 KHz is output in the program content portion, and a signal with an IEC 60958 frame rate of 48 KHz is output in the commercial portion. When such a bit stream is received, a process for re-synchronizing in response to a change in the frame rate of IEC 60958 of the bit stream received by the AV receiver is required. And since this process takes time, there is a problem that the sound is cut off by switching between the program part and the commercial part.

そこで、本発明は、サンプリング周波数に対応する実際の符号化形式が所定の形式に適合しない場合であっても、適合するオーディオビットストリームとして送信することができるビットストリーム送信装置を提供することを目的とする。 In view of the above, an object of the present invention is to provide a bitstream transmission device that can transmit an audio bitstream that conforms to a case where the actual encoding format corresponding to the sampling frequency does not conform to a predetermined format. And

上記の課題を解決するために、本発明の一形態に係るビットストリーム送信装置は、所定の形式のオーディオビットストリームを送信するビットストリーム送信装置であって、オーディオ信号を符号化して得られたエレメンタリオーディオストリームを取得する入力部と、前記エレメンタリオーディオストリームのサンプリング周波数と符号化形式とを解析する解析部と、前記エレメンタリオーディオストリームの符号化形式を示す情報を含むプリアンブル信号を、前記解析部の解析結果に応じて生成するプリアンブル生成部と、前記プリアンブル生成部が生成した前記プリアンブル信号を前記エレメンタリオーディオストリームに付加することで、前記所定の形式のオーディオビットストリームを成形する成形部と、前記成形部で成形された前記オーディオビットストリームを出力する出力部とを備え、前記プリアンブル生成部は、前記解析部の解析結果から得られる前記サンプリング周波数と前記符号化形式とに応じて、前記オーディオビットストリームの前記所定の形式に適合する符号化形式で、かつ、前記エレメンタリオーディオストリームの解析結果から得られた符号化形式とは異なる符号化形式を、前記プリアンブル信号内の前記符号化形式を示す情報として記載する。 In order to solve the above problems, a bitstream transmission device according to an aspect of the present invention is a bitstream transmission device that transmits an audio bitstream of a predetermined format, and is an element obtained by encoding an audio signal. An input unit for acquiring a primary audio stream, an analysis unit for analyzing a sampling frequency and a coding format of the elementary audio stream, and a preamble signal including information indicating a coding format of the elementary audio stream are analyzed. A preamble generation unit that generates according to the analysis result of the unit, and a shaping unit that shapes the audio bitstream of the predetermined format by adding the preamble signal generated by the preamble generation unit to the elementary audio stream; The molded part is molded An output unit that outputs the audio bitstream, wherein the preamble generation unit, according to the sampling frequency obtained from the analysis result of the analysis unit and the encoding format, the predetermined format of the audio bitstream And an encoding format different from the encoding format obtained from the analysis result of the elementary audio stream is described as information indicating the encoding format in the preamble signal.

このように、プリアンブル生成部は、前記解析部の解析結果から得られる前記サンプリング周波数と前記符号化形式とに応じて、前記オーディオビットストリームの前記所定の形式に適合する符号化形式で、かつ、前記エレメンタリオーディオストリームの解析結果から得られた符号化形式とは異なる符号化形式を、前記プリアンブル信号内の前記符号化形式を示す情報として記載する。これにより、本発明の一形態に係るビットストリーム送信装置は、サンプリング周波数に対応する実際の符号化形式が所定の形式に適合しない場合であっても、適合するオーディオビットストリームとして送信することができる。 In this way, the preamble generation unit has an encoding format that conforms to the predetermined format of the audio bitstream according to the sampling frequency and the encoding format obtained from the analysis result of the analysis unit, and An encoding format different from the encoding format obtained from the analysis result of the elementary audio stream is described as information indicating the encoding format in the preamble signal. As a result, the bitstream transmission apparatus according to an aspect of the present invention can transmit an audio bitstream that conforms to a case where the actual encoding format corresponding to the sampling frequency does not conform to the predetermined format. .

また、前記プリアンブル生成部は、前記出力部から出力するデータの出力伝送速度が予め定められた範囲となるように、前記プリアンブル信号内の前記符号化形式を示す情報を記載するとしてもよい。 The preamble generation unit may describe information indicating the encoding format in the preamble signal so that an output transmission rate of data output from the output unit falls within a predetermined range.

ここで、「予め定められた範囲」とは、例えば、１５３６ｋｂｐｓと１４１１．２ｋｂｐｓと１０２４ｋｂｐｓとであり、規格上伝送可能なビットレートである。これにより、本発明の一形態に係るビットストリーム送信装置は、エレメンタリオーディオストリームのサンプリング周波数と符号化形式とに応じて定まるオーディオビットストリームの出力伝送速度を、伝送可能な伝送速度にすることができる。 Here, the “predetermined range” is, for example, 1536 kbps, 1411.2 kbps, and 1024 kbps, which are bit rates that can be transmitted according to the standard. As a result, the bitstream transmission device according to an aspect of the present invention can set the output transmission rate of the audio bitstream determined according to the sampling frequency and the encoding format of the elementary audio stream to a transmittable transmission rate. it can.

また、前記所定の形式のオーディオビットストリームはＩＥＣ６１９３７規格で規定されたオーディオビットストリームであり、前記解析部の解析結果から得られた前記サンプリング周波数が、予め定められた第１の周波数である場合に、前記第１の周波数に対応する前記符号化形式が前記ＩＥＣ６１９３７規格の対応範囲外であれば、前記プリアンブル生成部は、前記符号化形式を示す情報として、前記エレメンタリオーディオストリームにおけるＳＢＲ（ＳｐｅｃｔｒａｌＢａｎｄＲｅｐｌｉｃａｔｉｏｎ）情報の有無の点で前記解析された符号化形式と異なる符号化形式を記載することで、前記オーディオビットストリームがＩＥＣ６１９３７規格の対応範囲内となるようなプリアンブル信号を生成するとしてもよい。 Further, when the audio bit stream of the predetermined format is an audio bit stream defined by the IEC 61937 standard, and the sampling frequency obtained from the analysis result of the analysis unit is a first predetermined frequency. If the encoding format corresponding to the first frequency is outside the corresponding range of the IEC 61937 standard, the preamble generation unit uses the SBR (Spectral Band) in the elementary audio stream as information indicating the encoding format. It is also possible to generate a preamble signal such that the audio bit stream falls within the corresponding range of the IEC 61937 standard by describing a coding format different from the analyzed coding format in terms of presence or absence of (Replication) information.

これにより、本発明の一形態に係るビットストリーム送信装置は、前記解析部の解析結果から得られる前記サンプリング周波数と前記符号化形式とが前記オーディオビットストリームの前記所定の形式に適合しない場合、プリアンブル信号のＳＢＲ情報の有無だけを変更することにより、所定の形式に適合するオーディオビットストリームを成形することができる。よって、当該ビットストリーム送信装置は、少ない処理負荷で所定の形式に適合したオーディオビットストリームを生成して送信することができる。 Accordingly, the bitstream transmission device according to an aspect of the present invention provides a preamble when the sampling frequency and the encoding format obtained from the analysis result of the analysis unit do not conform to the predetermined format of the audio bitstream. By changing only the presence / absence of the SBR information of the signal, an audio bit stream that conforms to a predetermined format can be formed. Therefore, the bit stream transmission apparatus can generate and transmit an audio bit stream that conforms to a predetermined format with a small processing load.

また、前記プリアンブル生成部は、さらに、前記解析部の解析結果から得られた前記サンプリング周波数が、予め定められた第２の周波数である場合に、前記エレメンタリオーディオストリームが前記ＳＢＲ情報の含まれない符号化形式で符号化されていることを、前記プリアンブル信号内の前記符号化形式を示す情報として記載するとしてもよい。 The preamble generation unit may further include the elementary audio stream in the SBR information when the sampling frequency obtained from the analysis result of the analysis unit is a predetermined second frequency. It may be described as information indicating the encoding format in the preamble signal that the encoding is not performed.

これにより、本発明の一形態に係るビットストリーム送信装置は、エレメンタリオーディオストリームのサンプリング周波数が第２の周波数である場合に、前記エレメンタリオーディオストリームが前記ＳＢＲ情報の含まれない符号化形式で符号化されていることを、前記プリアンブル信号内の前記符号化形式を示す情報として記載することができる。よって、エレメンタリオーディオストリームのサンプリング周波数が第２の周波数のときには、当該エレメンタリオーディオストリームを常にＳＢＲ情報なしの符号化形式で復号することができるので、サンプリング周波数の変動を防止することができる。 Accordingly, the bitstream transmission device according to an aspect of the present invention can encode the elementary audio stream in an encoding format that does not include the SBR information when the sampling frequency of the elementary audio stream is the second frequency. Encoding can be described as information indicating the encoding format in the preamble signal. Therefore, when the sampling frequency of the elementary audio stream is the second frequency, the elementary audio stream can always be decoded in the encoding format without the SBR information, so that the sampling frequency can be prevented from changing.

また、前記プリアンブル生成部は、前記解析部の解析結果から得られた前記サンプリング周波数が、予め定められた第３の周波数である場合に、前記エレメンタリオーディオストリームが前記ＳＢＲ情報の含まれる前記符号化形式で符号化されていなくても、前記ＳＢＲ情報の含まれる符号化形式で符号化されていることを示すプリアンブル信号を生成するとしてもよい。 The preamble generation unit may include the code in which the elementary audio stream is included in the SBR information when the sampling frequency obtained from the analysis result of the analysis unit is a predetermined third frequency. Even if it is not encoded in the encoded format, a preamble signal indicating that it is encoded in the encoded format including the SBR information may be generated.

これにより、本発明の一形態に係るビットストリーム送信装置は、エレメンタリオーディオストリームのサンプリング周波数が第３の周波数であるときには強制的にＳＢＲ処理を行うように制御することができる。これにより、当該ビットストリーム送信装置は、第３の周波数が低域の場合には、オーディオビットストリームを所定の形式に適合させることができる。 Accordingly, the bit stream transmission device according to an aspect of the present invention can be controlled to forcibly perform the SBR process when the sampling frequency of the elementary audio stream is the third frequency. As a result, the bit stream transmission device can adapt the audio bit stream to a predetermined format when the third frequency is in a low frequency range.

また、前記プリアンブル生成部は、前記解析部の解析結果から得られた前記サンプリング周波数が、予め定められた第４の周波数である場合に、前記エレメンタリオーディオストリームが前記ＳＢＲ情報の含まれる符号化形式で符号化されていても、前記ＳＢＲ情報の含まれない符号化形式で符号化されていることを示すプリアンブル信号を生成するとしてもよい。 The preamble generation unit may encode the elementary audio stream including the SBR information when the sampling frequency obtained from the analysis result of the analysis unit is a predetermined fourth frequency. Even if it is encoded in a format, a preamble signal indicating that it is encoded in an encoding format not including the SBR information may be generated.

これにより、本発明の一形態に係るビットストリーム送信装置は、エレメンタリオーディオストリームのサンプリング周波数が第４の周波数であるときにはＳＢＲなしのＡＡＣで復号を行うように制御することができる。よって、当該ビットストリーム送信装置は、第４の周波数が高域の場合には、オーディオビットストリームを所定の形式に適合させることができる。 Accordingly, the bit stream transmission device according to an aspect of the present invention can be controlled to perform decoding with AAC without SBR when the sampling frequency of the elementary audio stream is the fourth frequency. Therefore, the bit stream transmitting apparatus can adapt the audio bit stream to a predetermined format when the fourth frequency is a high frequency band.

また、本発明の一形態に係るビットストリーム送受信システムは、前記ビットストリーム送信装置と、当該ビットストリーム送信装置から送信されたオーディオビットストリームを受信し、受信した前記オーディオビットストリームを復号するビットストリーム受信装置とを備えたビットストリーム送受信システムであって、前記ビットストリーム受信装置は、受信した前記オーディオビットストリームに含まれる前記プリアンブル信号を除去することによって前記エレメンタリオーディオストリームを抽出するプリアンブル除去部と、前記プリアンブル信号に記載されている前記エレメンタリオーディオストリームの符号化形式を示す情報と、前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合である場合、前記プリアンブル信号に記載されている符号化形式を示す情報によって示される符号化形式で前記エレメンタリオーディオストリームを復号するエレメンタリデコード部とを備えてもよい。 In addition, a bitstream transmission / reception system according to an aspect of the present invention includes the bitstream transmission device and a bitstream reception that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream. A bitstream transmission / reception system comprising: a preamble removal unit that extracts the elementary audio stream by removing the preamble signal included in the received audio bitstream; Information indicating the encoding format of the elementary audio stream described in the preamble signal, and a code of the elementary audio stream included in the elementary audio stream An elementary decoding unit for decoding the elementary audio stream in an encoding format indicated by the information indicating the encoding format described in the preamble signal when the information indicating the encoding format is inconsistent Also good.

また、本発明の一形態に係るビットストリーム送受信システムは、前記ビットストリーム送信装置と、当該ビットストリーム送信装置から送信されたオーディオビットストリームを受信し、受信した前記オーディオビットストリームを復号するビットストリーム受信装置とを備えたビットストリーム送受信システムであって、前記ビットストリーム受信装置は、受信した前記オーディオビットストリームに含まれる前記プリアンブル信号を除去することによって前記エレメンタリオーディオストリームを抽出するプリアンブル除去部と、前記プリアンブル信号に記載されている前記エレメンタリオーディオストリームの符号化形式を示す情報と前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合である場合、前記プリアンブル信号に記載されている前記符号化形式を示す情報に従って、高域に拡大される信号を０として前記ＳＢＲ情報を用いた復号処理を行うエレメンタリデコード部とを備えてもよい。 In addition, a bitstream transmission / reception system according to an aspect of the present invention includes the bitstream transmission device and a bitstream reception that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream. A bitstream transmission / reception system comprising: a preamble removal unit that extracts the elementary audio stream by removing the preamble signal included in the received audio bitstream; Information indicating the encoding format of the elementary audio stream described in the preamble signal and the code of the elementary audio stream included in the elementary audio stream If the information indicating the format is inconsistent, an element for performing the decoding process using the SBR information with the signal expanded to a high frequency set to 0 in accordance with the information indicating the encoding format described in the preamble signal. A mental decoding unit may be provided.

また、本発明の一形態に係るビットストリーム送受信システムは、前記ビットストリーム送信装置と、当該ビットストリーム送信装置から送信されたオーディオビットストリームを受信し、受信した前記オーディオビットストリームを復号するビットストリーム受信装置とを備えたビットストリーム送受信システムであって、前記ビットストリーム受信装置は、受信した前記オーディオビットストリームから前記プリアンブル信号を除去することによって前記エレメンタリオーディオストリームを抽出するプリアンブル除去部と、前記プリアンブル信号に記載されている前記エレメンタリオーディオストリームの符号化形式を示す情報と前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合の場合、前記プリアンブル信号に記載されている前記符号化形式を示す情報に従って、前記ＳＢＲ情報を用いた復号処理を行った場合の１／２のサンプリング周波数の信号を生成するエレメンタリデコード部とを備えてもよい。 In addition, a bitstream transmission / reception system according to an aspect of the present invention includes the bitstream transmission device and a bitstream reception that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream. A bit stream transmission / reception system comprising: a preamble removal unit that extracts the elementary audio stream by removing the preamble signal from the received audio bit stream; and the preamble Information indicating the encoding format of the elementary audio stream described in the signal and the encoding format of the elementary audio stream included in the elementary audio stream If the information shown is inconsistent, a signal having a sampling frequency of ½ when the decoding process using the SBR information is performed is generated according to the information indicating the encoding format described in the preamble signal. An elementary decoding unit may be provided.

また、前記ビットストリーム受信装置は、当該ビットストリーム受信装置の能力を示す情報を前記ビットストリーム送信装置に通知する能力通知部を有し、前記解析部は、前記エレメンタリオーディオストリームのサンプリングレートを示すＦＳ情報を解析し、前記プリアンブル生成部は、当該ＦＳ情報が予め定められた値の場合、前記通知手段からの情報に応じ、ＳＢＲ情報が含まれていることを示すプリアンブル信号を生成するか否かを切り替えてもよい。 In addition, the bitstream reception device includes a capability notification unit that notifies the bitstream transmission device of information indicating the capability of the bitstream reception device, and the analysis unit indicates a sampling rate of the elementary audio stream. Analyzing the FS information, if the preamble generation unit has a predetermined value, whether to generate a preamble signal indicating that the SBR information is included according to the information from the notification unit May be switched.

また、本発明の一形態に係るビットストリーム受信装置は、オーディオ信号を符号化して得られたエレメンタリオーディオストリームに、前記エレメンタリオーディオストリームの符号化形式を示す情報を含むプリアンブル信号を付加することで成形された、前記所定の形式のオーディオビットストリームであって、前記エレメンタリオーディオストリームのサンプリング周波数と前記符号化形式とが前記オーディオビットストリームの前記所定の形式に適合しない場合、前記オーディオビットストリームの前記所定の形式に適合する符号化形式が、前記プリアンブル信号内の前記符号化形式を示す情報として記載されたオーディオビットストリームをビットストリーム送信装置から受信する受信部と、受信した前記オーディオビットストリームに含まれる前記プリアンブル信号を除去することによって前記エレメンタリオーディオストリームを抽出するプリアンブル除去部と、前記プリアンブル信号に記載されている前記エレメンタリオーディオストリームの符号化形式を示す情報と前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合である場合、前記プリアンブル信号に記載されている符号化形式を示す情報によって示される符号化形式で前記エレメンタリオーディオストリームを復号するエレメンタリデコード部とを備えるとしてもよい。 The bit stream receiving apparatus according to an aspect of the present invention adds a preamble signal including information indicating an encoding format of the elementary audio stream to the elementary audio stream obtained by encoding the audio signal. The audio bitstream of the predetermined format, which is formed in the case where the sampling frequency of the elementary audio stream and the encoding format do not conform to the predetermined format of the audio bitstream A receiving unit that receives an audio bitstream described as information indicating the encoding format in the preamble signal from a bitstream transmitting device, and an encoding format that conforms to the predetermined format; and the received audio bitstream A preamble removing unit that extracts the elementary audio stream by removing the preamble signal included in the message, information indicating an encoding format of the elementary audio stream described in the preamble signal, and the elementary audio When the information indicating the encoding format of the elementary audio stream included in the stream is inconsistent, the elementary audio stream is encoded in the encoding format indicated by the information indicating the encoding format described in the preamble signal. And an elementary decoding unit for decoding.

これにより、本ビットストリーム送受信システムのビットストリーム受信装置は、前記プリアンブル信号に記載されている符号化形式を示す情報と前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合である場合、前記プリアンブル信号に記載されている符号化形式を示す情報によって示される符号化形式で前記エレメンタリオーディオストリームを復号することができる。よって、当該ビットストリーム受信装置は、エレメンタリオーディオストリーム内で符号化形式を示す情報とプリアンブル信号内で符号化形式を示す情報とが不整合である場合であっても、オーディオビットストリームを矛盾無く復号することができる。 As a result, the bit stream receiving device of the present bit stream transmission / reception system provides information indicating the encoding format described in the preamble signal and information indicating the encoding format of the elementary audio stream included in the elementary audio stream. Are inconsistent, the elementary audio stream can be decoded in the encoding format indicated by the information indicating the encoding format described in the preamble signal. Therefore, the bit stream receiving apparatus can consistently process the audio bit stream even when the information indicating the encoding format in the elementary audio stream and the information indicating the encoding format in the preamble signal are inconsistent. Can be decrypted.

また、前記ビットストリーム受信装置は、前記エレメンタリオーディオストリームの前記サンプリング周波数が、予め定められた第３の周波数である場合に、前記エレメンタリオーディオストリームが前記ＳＢＲ情報の含まれる前記符号化形式で符号化されていなくても、前記ＳＢＲ情報の含まれる符号化形式で符号化されていることを示すプリアンブル信号を含んだオーディオビットストリームを受信し、前記エレメンタリデコード部は、前記プリアンブル信号に記載されている前記エレメンタリオーディオストリームの符号化形式を示す情報と、前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合である場合、前記プリアンブル信号に記載されている前記符号化形式を示す情報に従って、高域に拡大される信号を０として前記ＳＢＲ情報を用いた復号処理を行ってもよい。 In addition, the bitstream reception device may be configured such that when the sampling frequency of the elementary audio stream is a predetermined third frequency, the elementary audio stream is in the encoding format in which the SBR information is included. Even if it is not encoded, an audio bitstream including a preamble signal indicating that it is encoded in the encoding format including the SBR information is received, and the elementary decoding unit describes the preamble signal. If the information indicating the encoding format of the elementary audio stream is mismatched with the information indicating the encoding format of the elementary audio stream included in the elementary audio stream, the information is described in the preamble signal. Said encoding being According to the information indicating the expression, the signal is expanded in the high band may be subjected to decoding processing employing the SBR information as 0.

また、前記ビットストリーム受信装置は、前記エレメンタリオーディオストリームの前記サンプリング周波数が、予め定められた第４の周波数である場合に、前記エレメンタリオーディオストリームが前記ＳＢＲ情報の含まれる符号化形式で符号化されていても、前記ＳＢＲ情報の含まれない符号化形式で符号化されていることを示すプリアンブル信号を含んだオーディオビットストリームを受信し、前記エレメンタリデコード部は、前記プリアンブル信号に記載されている前記エレメンタリオーディオストリームの符号化形式を示す情報と、前記エレメンタリオーディオストリームに含まれる前記エレメンタリオーディオストリームの符号化形式を示す情報とが不整合の場合、前記プリアンブル信号に記載されている前記符号化形式を示す情報に従って、前記ＳＢＲ情報を用いた復号処理を行った場合の１／２のサンプリング周波数の信号を生成してもよい。 In addition, the bitstream reception device may encode the elementary audio stream in an encoding format including the SBR information when the sampling frequency of the elementary audio stream is a predetermined fourth frequency. An audio bitstream including a preamble signal indicating that it is encoded in an encoding format that does not include the SBR information, and the elementary decoding unit is described in the preamble signal If the information indicating the encoding format of the elementary audio stream and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, the information is described in the preamble signal. Indicates the encoding format Accordance broadcast, the decoding processing employing the SBR information may generate a signal of half the sampling frequency when performing.

なお、これらの全般的または具体的な態様は、システム、方法、集積回路、コンピュータプログラムまたは記録媒体で実現されてもよく、システム、方法、集積回路、コンピュータプログラムおよび記録媒体の任意な組み合わせで実現されてもよい。また、本発明は、上記ビットストリーム送信装置により生成されるオーディオビットストリームとして実現されてもよい。 These general or specific aspects may be realized by a system, a method, an integrated circuit, a computer program, or a recording medium, and are realized by any combination of the system, method, integrated circuit, computer program, and recording medium. May be. Further, the present invention may be realized as an audio bit stream generated by the bit stream transmission device.

以上のように、本発明は、サンプリング周波数に対応する実際の符号化形式が所定の形式に適合しない場合であっても、適合するオーディオビットストリームとして送信することができるビットストリーム送信装置を提供できる。 As described above, the present invention can provide a bit stream transmission device that can transmit an audio bit stream that conforms to a case where the actual encoding format corresponding to the sampling frequency does not conform to a predetermined format. .

また、本発明の一形態によれば、入力されたエレメンタリオーディオストリームのサンプリングレートとそのサンプリングレートに対応する符号化形式とをＩＥＣ６１９３７規格で送信する場合の不都合を解消することができる。 Further, according to one aspect of the present invention, it is possible to eliminate the inconvenience when the sampling rate of the input elementary audio stream and the encoding format corresponding to the sampling rate are transmitted according to the IEC 61937 standard.

また、本発明の他の形態によれば、入力されたエレメンタリオーディオストリームがＩＥＣ６１９３７規格範囲外のものであってもＩＥＣ６１９３７規格範囲内になるようにしてビットストリーム送信できる。 According to another aspect of the present invention, even if the input elementary audio stream is outside the IEC 61937 standard range, the bit stream can be transmitted so as to be within the IEC 61937 standard range.

さらに、本発明のさらに他の形態によれば、連続して入力されるエレメンタリオーディオストリームをデコードした場合、デコード信号のサンプリングレートの変化が生じないようにすることができる。 Furthermore, according to still another aspect of the present invention, when an elementary audio stream that is continuously input is decoded, it is possible to prevent the sampling rate of the decoded signal from changing.

また、さらに本発明の他の形態によれば、受信したビットストリームに付加されたプリアンブル信号に記載されているパラメータＰと該ビットストリームを構成しているエレメンタリオーディオストリームに含まれる前記パラメータＰに対応するパラメータＥとが不整合の場合でも正常にデコード信号を生成することができる。 Furthermore, according to another aspect of the present invention, the parameter P described in the preamble signal added to the received bit stream and the parameter P included in the elementary audio stream constituting the bit stream are included in the parameter P. Even when the corresponding parameter E is inconsistent, the decode signal can be generated normally.

また、本発明のさらに他の形態によれば、低ＦＳのＡＡＣの場合、ＩＥＣ６１９３７規格範囲内になるようにしてビットストリーム送信できる。 According to still another aspect of the present invention, in the case of AFS with low FS, bitstream transmission can be performed so as to be within the IEC 61937 standard range.

本発明のさらに他の形態によれば、低ＦＳのＡＡＣの場合、予め定められた中ＦＳのサンプリングレートの再生音を生成できる。 According to still another aspect of the present invention, in the case of AAC with low FS, it is possible to generate a reproduction sound with a predetermined sampling rate of medium FS.

本発明のさらに他の形態によれば、高ＦＳのＨＥＡＡＣの場合、ＩＥＣ６１９３７規格範囲内になるようにしてビットストリーム送信できる。 According to still another aspect of the present invention, in the case of a high FS HEAAC, a bit stream can be transmitted within the IEC 61937 standard range.

本発明のさらに他の形態によれば、高ＦＳのＡＡＣの場合、予め定められた中ＦＳのサンプリングレートの再生音を生成できる。 According to still another aspect of the present invention, in the case of a high FS AAC, it is possible to generate a reproduction sound having a predetermined medium FS sampling rate.

図１は、本発明の実施の形態１におけるビットストリーム送信装置の構成の一例を示す図である。FIG. 1 is a diagram showing an example of a configuration of a bit stream transmission apparatus according to Embodiment 1 of the present invention. 図２は、本発明の実施の形態１におけるビットストリーム受信装置の構成の一例を示す図である。FIG. 2 is a diagram showing an example of the configuration of the bit stream receiving apparatus according to Embodiment 1 of the present invention. 図３は、本発明の実施の形態１におけるビットストリーム送受信システムのオーディオシステムへの適用例を示す図である。FIG. 3 is a diagram illustrating an application example of the bit stream transmission / reception system according to Embodiment 1 of the present invention to an audio system. 図４Ａは、本発明の実施の形態１に係る、ＡＡＣ方式とＨＥＡＡＣ方式の再生帯域の違いを説明するための図である。FIG. 4A is a diagram for explaining a difference between reproduction bands of the AAC method and the HEAAC method according to Embodiment 1 of the present invention. 図４Ｂは、本発明の実施の形態１に係る、ＡＡＣ方式とＨＥＡＡＣ方式の再生帯域の違いを説明するための図である。FIG. 4B is a diagram for explaining a difference between reproduction bands of the AAC method and the HEAAC method according to Embodiment 1 of the present invention. 図５Ａは、本発明の実施の形態１に係る、図１０に示したストリーム出力規定の意味を説明するための図である。FIG. 5A is a diagram for explaining the meaning of the stream output regulation shown in FIG. 10 according to Embodiment 1 of the present invention. 図５Ｂは、本発明の実施の形態１に係る、図１０に示したストリーム出力規定の意味を説明するための図である。FIG. 5B is a diagram for explaining the meaning of the stream output regulation shown in FIG. 10 according to Embodiment 1 of the present invention. 図５Ｃは、本発明の実施の形態１に係る、図１０に示したストリーム出力規定の意味を説明するための図である。FIG. 5C is a diagram for explaining the meaning of the stream output regulation shown in FIG. 10 according to Embodiment 1 of the present invention. 図６は、本発明の実施の形態２におけるビットストリーム送信受信システムの構成の一例を示す図である。FIG. 6 is a diagram showing an example of the configuration of the bitstream transmission / reception system according to Embodiment 2 of the present invention. 図７は、ＩＥＣ６０９５８／ＩＥＣ６１９３７における出力ビットストリームのデータ構成の一例を説明する図である。FIG. 7 is a diagram for explaining an example of the data configuration of the output bitstream in IEC60958 / IEC61937. 図８Ａは、ＡＤＴＳ形式の固定ヘッダの記述の一例を示す図である。FIG. 8A is a diagram illustrating an example of a description of a fixed header in the ADTS format. 図８Ｂは、ＬＯＡＳ形式のヘッダの記述の一例を示す図である。FIG. 8B is a diagram illustrating an example of a description of a LOAS format header. 図９は、図７（ｂ）に示したプリアンブル信号Ｐｃのデータ形式を示す図である。FIG. 9 is a diagram showing a data format of the preamble signal Pc shown in FIG. 図１０は、エレメンタリオーディオストリームのサンプリングレートとオーディオ符号化データ形式との組み合わせに対するＩＥＣ６１９３７規格におけるストリーム出力規定の有無を示す図である。FIG. 10 is a diagram showing the presence / absence of a stream output regulation in the IEC 61937 standard for a combination of a sampling rate of an elementary audio stream and an audio encoded data format. 図１１Ａは、ＨＥＡＡＣで、かつ、ＡＡＣサンプリングレートが４８ＫＨｚの符号化データが再生可能な周波数帯域を示す図である。FIG. 11A is a diagram illustrating a frequency band in which encoded data with HEAAC and an AAC sampling rate of 48 KHz can be reproduced. 図１１Ｂは、ＡＡＣで、かつ、ＡＡＣサンプリングレートが４８ＫＨｚの符号化データが再生可能な周波数帯域を示す図である。FIG. 11B is a diagram showing frequency bands in which encoded data with AAC and an AAC sampling rate of 48 KHz can be reproduced.

以下、本発明の実施の形態について、図面を用いて詳細に説明する。なお、以下で説明する実施の形態は、いずれも本発明の一具体例を示すものである。以下の実施の形態で示される数値、形状、材料、構成要素、構成要素の配置位置及び接続形態、ステップ、ステップの順序などは、一例であり、本発明を限定する主旨ではない。本発明は、請求の範囲によって特定される。よって、以下の実施の形態における構成要素のうち、本発明の最上位概念を示す独立請求項に記載されていない構成要素については、本発明の課題を達成するのに必ずしも必要ではないが、より好ましい形態を構成するものとして説明される。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. Note that each of the embodiments described below shows a specific example of the present invention. The numerical values, shapes, materials, constituent elements, arrangement positions and connecting forms of the constituent elements, steps, order of steps, and the like shown in the following embodiments are merely examples, and are not intended to limit the present invention. The invention is specified by the claims. Therefore, among the constituent elements in the following embodiments, constituent elements that are not described in the independent claims indicating the highest concept of the present invention are not necessarily required to achieve the object of the present invention. It will be described as constituting a preferred form.

（実施の形態１）
以下本発明の実施の形態１におけるビットストリーム送信装置について図面を参照しながら説明する。(Embodiment 1)
Hereinafter, a bitstream transmission apparatus according to Embodiment 1 of the present invention will be described with reference to the drawings.

図１は本実施の形態１におけるビットストリーム送信装置２００の構成を示す図である。図１に示すように、ビットストリーム送信装置２００は、入力部１００、解析部１０１、プリアンブル生成部１０２、成形部１０３及び出力部１０４を備える。 FIG. 1 is a diagram illustrating a configuration of a bitstream transmission apparatus 200 according to the first embodiment. As illustrated in FIG. 1, the bitstream transmission device 200 includes an input unit 100, an analysis unit 101, a preamble generation unit 102, a shaping unit 103, and an output unit 104.

入力部１００は、オーディオ信号を符号化して得られたエレメンタリオーディオストリーム３００を取得する。 The input unit 100 acquires an elementary audio stream 300 obtained by encoding an audio signal.

解析部１０１は、取得されたエレメンタリオーディオストリーム３００のサンプリングレートとオーディオ符号化データ形式とを解析する。 The analysis unit 101 analyzes the sampling rate and audio encoded data format of the acquired elementary audio stream 300.

プリアンブル生成部１０２は、解析部１０１の解析結果に応じて、ＩＥＣ６１９３７で規定された形式のオーディオビットストリームを構成するプリアンブル信号を生成する。これにおいて、プリアンブル生成部１０２は、解析部１０１の解析結果から得られるサンプリング周波数が予め定められた値である場合、エレメンタリオーディオストリーム３００のオーディオ符号化データ形式を示すパラメータを、ＩＥＣ６１９３７の規定に従う値に変更することでプリアンブル信号を生成する。 The preamble generation unit 102 generates a preamble signal that constitutes an audio bitstream in a format defined by IEC 61937 according to the analysis result of the analysis unit 101. In this case, when the sampling frequency obtained from the analysis result of the analysis unit 101 is a predetermined value, the preamble generation unit 102 sets a parameter indicating the audio encoded data format of the elementary audio stream 300 in accordance with IEC 61937. A preamble signal is generated by changing the value.

成形部１０３は、プリアンブル生成部１０２が生成したプリアンブル信号をエレメンタリオーディオストリーム３００に付加することで、ＩＥＣ６１９３７形式のオーディオビットストリームを成形する。 The shaping unit 103 shapes the audio bit stream in the IEC 61937 format by adding the preamble signal generated by the preamble generation unit 102 to the elementary audio stream 300.

出力部１０４は、成形部１０３で成形されたオーディオビットストリーム３０１を出力する。 The output unit 104 outputs the audio bitstream 301 formed by the forming unit 103.

図２は、実施の形態１におけるビットストリーム受信装置の構成の一例を示す図である。図２に示すように、ビットストリーム受信装置２０１は、図１に示したビットストリーム送信装置２００から送信されたオーディオビットストリーム３０１を受信し、受信したオーディオビットストリーム３０１をデコードする。このビットストリーム受信装置２０１は、プリアンブル除去部２０２及びエレメンタリデコード部２０３を備える。 FIG. 2 is a diagram illustrating an example of a configuration of the bit stream receiving apparatus according to the first embodiment. As illustrated in FIG. 2, the bitstream receiving apparatus 201 receives the audio bitstream 301 transmitted from the bitstream transmitting apparatus 200 illustrated in FIG. 1, and decodes the received audio bitstream 301. The bit stream receiving apparatus 201 includes a preamble removing unit 202 and an elementary decoding unit 203.

プリアンブル除去部２０２は、ビットストリーム受信装置２０１に内蔵され、ビットストリーム送信装置２００から送信されるＩＥＣ６１９３７形式オーディオビットストリーム３０１からプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄを除去することでエレメンタリオーディオストリーム３００を抜き出す。 The preamble removing unit 202 is incorporated in the bitstream receiving apparatus 201 and removes the preamble signals Pa, Pb, Pc, and Pd from the IEC61937 format audio bitstream 301 transmitted from the bitstream transmitting apparatus 200, thereby the elementary audio stream 300. Extract.

エレメンタリデコード部２０３は、ビットストリーム受信装置２０１に内蔵され、プリアンブル除去部２０２で抜き出されたエレメンタリオーディオストリーム３００をデコードすることでオーディオ信号（マルチチャネルデコード信号）を生成し、生成したオーディオ信号を出力する。エレメンタリデコード部２０３は、エレメンタリオーディオストリーム３００をデコードする際において、プリアンブル信号Ｐｃで示されるオーディオ符号化データ形式と、Ｅｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄで示されるオーディオ符号化データ形式とが矛盾する場合には、プリアンブル信号Ｐｃで示されるオーディオ符号化データ形式でエレメンタリオーディオストリーム３００をデコードする。 The elementary decoding unit 203 is built in the bitstream receiving apparatus 201, generates an audio signal (multi-channel decoding signal) by decoding the elementary audio stream 300 extracted by the preamble removing unit 202, and generates the generated audio Output a signal. The elementary decoding unit 203, when decoding the elementary audio stream 300, if the audio encoded data format indicated by the preamble signal Pc and the audio encoded data format indicated by the Extension_payload contradict each other, The elementary audio stream 300 is decoded in the audio encoded data format indicated by Pc.

図３は、図１に示したビットストリーム送信装置２００と図２に示したビットストリーム受信装置２０１とを備えた送受信システムを、オーディオ機器へ実装した例を示す図である。 FIG. 3 is a diagram illustrating an example in which a transmission / reception system including the bit stream transmission device 200 illustrated in FIG. 1 and the bit stream reception device 201 illustrated in FIG. 2 is installed in an audio device.

図３に示すように、ビットストリーム送信装置２００とビットストリーム受信装置２０１とを備えるビットストリーム送受信システム２１０は、ＩＥＣ６１９３７形式のオーディオビットストリーム３０１を伝送する接続線を介して接続される。この接続線は、例えば、ＨＤＭＩなどである。ビットストリーム送信装置２００は、例えば、テレビ２２０に内蔵され、テレビ２２０内部のデコーダ及び２チャネルダウンミックス部を介して、例えば、２チャネルのステレオスピーカに接続される。テレビ２２０は、図示しないアンテナ、チューナ、又はＣＤドライバなどを介して放送波又はＣＤなどからエレメンタリオーディオストリーム３００を取得し、取得したエレメンタリオーディオストリーム３００をビットストリーム送信装置２００の入力部１００に入力する。デコーダ２２１は、入力部１００から成形部１０３に入力されるエレメンタリオーディオストリーム３００を取り込んで、一旦、マルチチャネルオーディオ信号に復号する。２チャネルダウンミックス部２２２（２ｃｈダウンミックス部）は、エレメンタリオーディオストリーム３００をデコーダで復号して得られたマルチチャネルオーディオ信号を、２チャネルのステレオオーディオ信号にダウンミックスし、当該ステレオオーディオ信号をステレオスピーカ２２３に出力する。 As shown in FIG. 3, a bit stream transmission / reception system 210 including a bit stream transmission device 200 and a bit stream reception device 201 is connected via a connection line that transmits an audio bit stream 301 in the IEC 61937 format. This connection line is, for example, HDMI. The bit stream transmission device 200 is built in, for example, the television 220, and is connected to, for example, a 2-channel stereo speaker via a decoder and a 2-channel downmix unit inside the television 220. The television 220 acquires the elementary audio stream 300 from a broadcast wave or CD via an antenna, tuner, CD driver, or the like (not shown), and the acquired elementary audio stream 300 is input to the input unit 100 of the bitstream transmission device 200. input. The decoder 221 takes in the elementary audio stream 300 input from the input unit 100 to the shaping unit 103 and temporarily decodes it into a multi-channel audio signal. The 2-channel downmix unit 222 (2ch downmix unit) downmixes the multichannel audio signal obtained by decoding the elementary audio stream 300 with a decoder into a 2-channel stereo audio signal, and converts the stereo audio signal into Output to the stereo speaker 223.

ビットストリーム受信装置２０１は、例えば、５．１チャネルなどのマルチチャネルスピーカ２３１を備えたシアターシステム２３０に内蔵される。このビットストリーム受信装置２０１は、ＨＤＭＩなどを介してビットストリーム送信装置２００から入力された、ＩＥＣ６１９３７の規定に従うオーディオビットストリーム３０１を復号し、復号によって得られたマルチチャネルオーディオ信号を、マルチチャネルスピーカ２３１に出力する。 The bit stream receiving apparatus 201 is built in a theater system 230 including a multi-channel speaker 231 such as 5.1 channel, for example. The bitstream receiving apparatus 201 decodes an audio bitstream 301 that is input from the bitstream transmitting apparatus 200 via HDMI or the like according to IEC61937, and a multichannel audio signal obtained by the decoding is converted into a multichannel speaker 231. Output to.

以下では、以上のように構成されたビットストリーム送信装置２００のより詳細な動作について説明する。 Hereinafter, a more detailed operation of the bitstream transmission device 200 configured as described above will be described.

まず、入力部１００は、エレメンタリオーディオストリーム３００を取得する。エレメンタリオーディオストリーム３００は、図７で示したようなデータ形式のストリームであり、本実施の形態１では、ヘッダ形式はＬＯＡＳ形式であり、エンコードデータは、ＳＢＲ情報を含まないＡＡＣ方式のエンコードデータであるとする。エレメンタリオーディオストリーム３００内のエンコードデータ部にはＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄ３１０は含まれておらず、ヘッダ内に記載されているサンプリングレート（サンプリング周波数）は２４ｋＨｚとする。 First, the input unit 100 acquires an elementary audio stream 300. The elementary audio stream 300 is a stream in the data format as shown in FIG. 7, and in the first embodiment, the header format is the LOAS format, and the encoded data is AAC encoded data that does not include SBR information. Suppose that The encoding data portion in the elementary audio stream 300 does not include the Extension_payload 310, and the sampling rate (sampling frequency) described in the header is 24 kHz.

次に解析部１０１で、エレメンタリオーディオストリーム３００を解析する。本実施の形態１では、入力されたエレメンタリオーディオストリーム３００は、上で述べたように、サンプリングレートが２４ｋＨｚで、ＳＢＲ情報を含んでいないものであるので、そのことが解析部１０１によって解析される。サンプリングレートは図８Ｂに示したＬＯＡＳヘッダ形式の規定に沿ってビットストリームを解析することで得られる。また、ＳＢＲ情報の有無は、図７に示したＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄの規定に沿ってビットストリームを解析することで得られる。 Next, the analysis unit 101 analyzes the elementary audio stream 300. In the first embodiment, as described above, the input elementary audio stream 300 has a sampling rate of 24 kHz and does not include SBR information. Therefore, this is analyzed by the analysis unit 101. The The sampling rate can be obtained by analyzing the bit stream in accordance with the LOAS header format specification shown in FIG. 8B. Also, the presence / absence of SBR information can be obtained by analyzing the bit stream in accordance with the definition of Extension_payload shown in FIG.

次にプリアンブル生成部１０２で、解析部１０１の解析結果に応じてＩＥＣ６１９３７形式のプリアンブル信号を生成する。エレメンタリオーディオストリーム３００が、サンプリングレートが２４ｋＨｚで、ＳＢＲ情報を含んでいない場合、図１０に示したように、ＩＥＣ６１９３７形式のオーディオビットストリームが定義されていない。よって、プリアンブル生成部１０２は、エレメンタリオーディオストリーム３００がＳＢＲ情報を含んでいることを示すプリアンブル信号を生成する。具体的には、プリアンブル生成部１０２は、プリアンブル信号Ｐｃの０から４ビット目の５ビットが示す値を２３に設定する。これにより、プリアンブル信号Ｐｃは、ヘッダ形式がＬＯＡＳ形式であることを示す。さらに、プリアンブル生成部１０２は、プリアンブル信号Ｐｃの５から６ビット目の２ビットを２に設定する。これにより、プリアンブル信号Ｐｃは、エレメンタリオーディオストリーム３００にＳＢＲ情報が含まれていることを示す。ここでＳＢＲ情報が含まれているとは、すなわち、プリアンブル信号Ｐｃは、エレメンタリオーディオストリーム３００がＨＥＡＡＣ方式で符号化されていることを示す。本実施の形態１では、サンプリングレートは２４ｋＨｚとしたが、２２．０５ｋＨｚや１６ｋＨｚであっても同様であることはいうまでもない。 Next, the preamble generation unit 102 generates a preamble signal in IEC 61937 format according to the analysis result of the analysis unit 101. When the elementary audio stream 300 has a sampling rate of 24 kHz and does not include SBR information, an audio bit stream in the IEC 61937 format is not defined as shown in FIG. Therefore, the preamble generation unit 102 generates a preamble signal indicating that the elementary audio stream 300 includes SBR information. Specifically, the preamble generation unit 102 sets the value indicated by 5 bits from 0 to the 4th bit of the preamble signal Pc to 23. Thereby, the preamble signal Pc indicates that the header format is the LOAS format. Further, the preamble generation unit 102 sets 2 bits of the 5th to 6th bits of the preamble signal Pc to 2. Accordingly, the preamble signal Pc indicates that the elementary audio stream 300 includes SBR information. Here, SBR information is included, that is, the preamble signal Pc indicates that the elementary audio stream 300 is encoded by the HEAAC method. In the first embodiment, the sampling rate is set to 24 kHz, but it goes without saying that the same applies to 22.05 kHz and 16 kHz.

次に、成形部１０３は、プリアンブル生成部１０２から出力されたプリアンブル信号をエレメンタリオーディオストリーム３００に付加する。すなわち、成形部１０３は、図７のオーディオビットストリーム３０１に示したようなＩＥＣ６１９３７形式のオーディオビットストリームを成形する。 Next, the shaping unit 103 adds the preamble signal output from the preamble generation unit 102 to the elementary audio stream 300. That is, the forming unit 103 forms an audio bit stream in the IEC 61937 format as shown in the audio bit stream 301 of FIG.

最後に、出力部１０４は、成形部１０３からの信号であるオーディオビットストリーム３０１を外部機器に出力する。 Finally, the output unit 104 outputs an audio bitstream 301 that is a signal from the shaping unit 103 to an external device.

また、図２のように構成されたビットストリーム受信装置２０１の動作について、以下説明する。 Further, the operation of the bitstream receiving apparatus 201 configured as shown in FIG. 2 will be described below.

まず、ビットストリーム送信装置２００は、入力されたエレメンタリオーディオストリーム３００をＩＥＣ６１９３７形式のオーディオビットストリーム３０１に成形し、当該オーディオビットストリーム３０１を出力する。 First, the bit stream transmission apparatus 200 forms the input elementary audio stream 300 into an audio bit stream 301 in the IEC 61937 format, and outputs the audio bit stream 301.

例えば、入力されたエレメンタリオーディオストリーム３００のサンプリングレート（サンプリング周波数）が２４ｋＨｚで、ＳＢＲ情報がない（ＡＡＣ方式）符号化が行われている場合、プリアンブル信号の中のＰｃには、ＳＢＲ情報がある（ＨＥＡＡＣ方式）を示すコードが記載されている。また、例えば、入力されたエレメンタリオーディオストリーム３００のサンプリングレート（サンプリング周波数）が４８ｋＨｚで、ＳＢＲ情報がある（ＨＥＡＡＣ方式）の場合、プリアンブル信号の中のＰｃには、ＳＢＲ情報がない（ＡＡＣ方式）を示すコードが記載されている。 For example, when the sampling rate (sampling frequency) of the input elementary audio stream 300 is 24 kHz and encoding is performed without SBR information (AAC method), SBR information is included in Pc in the preamble signal. A code indicating a certain (HEAAC system) is described. For example, when the sampling rate (sampling frequency) of the input elementary audio stream 300 is 48 kHz and there is SBR information (HEAAC system), Pc in the preamble signal has no SBR information (AAC system) ) Is indicated.

プリアンブル除去部２０２は、ビットストリーム送信装置１０００から送信されるＩＥＣ６１９３７形式のオーディオビットストリーム３０１からプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄを除去することでエレメンタリオーディオストリーム３００を抜き出す。もし、ＩＥＣ６１９３７形式のオーディオビットストリーム３０１にＳｔｕｆｆｉｎｇ信号が入っている場合は、プリアンブル除去部２０２は、当該Ｓｔｕｆｆｉｎｇ信号も取り除き、エレメンタリオーディオストリーム３００を生成する。このようにして生成されたエレメンタリオーディオストリーム３００は、ＡＡＣ方式、又はＨＥＡＡＣ方式のエレメンタリオーディオストリームであり、ＭＰＥＧ規格で規定されたデコード手順にそってデコードできるものである。また、当該プリアンブル除去部２０２では、ＳＢＲ情報の有無を示すＳＢＲ有無情報をプリアンブル信号Ｐｃから検出し、当該ＳＢＲ有無情報を後続のエレメンタリデコード部２０３に送出する。 The preamble removal unit 202 extracts the elementary audio stream 300 by removing the preamble signals Pa, Pb, Pc, and Pd from the IEC 61937-format audio bit stream 301 transmitted from the bit stream transmission apparatus 1000. If a stuffing signal is included in the audio bit stream 301 in the IEC 61937 format, the preamble removal unit 202 also removes the stuffing signal and generates an elementary audio stream 300. The elementary audio stream 300 generated in this way is an AAC or HEAAC elementary audio stream, and can be decoded according to a decoding procedure defined by the MPEG standard. Further, the preamble removing unit 202 detects SBR presence / absence information indicating the presence / absence of SBR information from the preamble signal Pc, and sends the SBR presence / absence information to the subsequent elementary decoding unit 203.

エレメンタリデコード部２０３は、抜き出されたエレメンタリオーディオストリーム３００をデコードすることでオーディオ信号を生成する。また、エレメンタリデコード部２０３は、プリアンブル除去部２０２から送出されるＳＢＲ有無情報を参照し、エレメンタリオーディオストリーム３００に含まれているＳＢＲ情報の有無を示す情報と、プリアンブル除去部２０２からのＳＢＲ有無情報とが矛盾するかを判定し、矛盾する場合には、プリアンブル除去部２０２からのＳＢＲ有無情報の示す内容に従う。 The elementary decoding unit 203 generates an audio signal by decoding the extracted elementary audio stream 300. Further, the elementary decoding unit 203 refers to the SBR presence / absence information sent from the preamble removal unit 202, information indicating the presence / absence of SBR information included in the elementary audio stream 300, and the SBR from the preamble removal unit 202. It is determined whether the presence / absence information contradicts, and if there is a conflict, the content indicated by the SBR presence / absence information from the preamble removal unit 202 is followed.

例えば、エレメンタリオーディオストリーム３００が、サンプリングレートが２４ｋＨｚで、「ＳＢＲなし」であっても、ＳＢＲ有無情報が「ＳＢＲ情報あり」を示している場合は、エレメンタリデコード部２０３は、帯域拡大処理を実施する。もちろん、実際にはエレメンタリオーディオストリーム３００のＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄ内にＳＢＲ情報がないので、エレメンタリデコード部２０３は、完全な帯域拡大処理は実施できないが、例えば、デコード信号のサンプリングレートを２倍にアップサンプリングする処理を実行するなどの処理を行う。すなわち、エレメンタリデコード部２０３は、ＳＢＲによって高域に複製される成分が０であるような帯域拡大処理を実施することと等価な処理を行う。 For example, even if the elementary audio stream 300 has a sampling rate of 24 kHz and “no SBR”, if the SBR presence / absence information indicates “with SBR information”, the elementary decoding unit 203 performs the band expansion process. To implement. Of course, since there is actually no SBR information in the Extension_payload of the elementary audio stream 300, the elementary decoding unit 203 cannot perform complete band expansion processing. For example, the sampling rate of the decoded signal is upsampled to twice. The process of performing the process to perform is performed. That is, the elementary decoding unit 203 performs a process equivalent to performing a band expansion process such that the component replicated in the high band by the SBR is zero.

また、例えば、エレメンタリオーディオストリーム３００が、サンプリングレートが４８ｋＨｚで、「ＳＢＲ情報あり」であっても、ＳＢＲ有無情報が「ＳＢＲなし」を示している場合は、エレメンタリデコード部２０３は、帯域拡大処理を実施しない。例えば、エレメンタリデコード部２０３は、実際には存在しているＳＢＲ情報を読み飛ばして無視する。または、エレメンタリデコード部２０３は、実際に存在しているＳＢＲ情報を適用はするが、出力のサンプリングレートを１／２にダウンサンプリングして結果として４８ｋＨｚの信号を生成する、などの処理を行う。 In addition, for example, even if the elementary audio stream 300 has a sampling rate of 48 kHz and “SBR information is present”, if the SBR presence / absence information indicates “SBR not present”, the elementary decoding unit 203 Do not perform the enlargement process. For example, the elementary decoding unit 203 skips and ignores SBR information that actually exists. Alternatively, the elementary decoding unit 203 performs processing such as applying the SBR information that actually exists, but downsampling the output sampling rate to ½, and generating a 48 kHz signal as a result. .

なお、エレメンタリデコード部２０３は、エレメンタリオーディオストリーム３００に含まれているＳＢＲ情報の有無を示す情報と、プリアンブル除去部２０２からのＳＢＲ有無情報とが矛盾する場合に、エレメンタリオーディオストリーム３００に含まれているＳＢＲ情報の有無を示す情報に従ってもよい。 The elementary decoding unit 203 adds to the elementary audio stream 300 when the information indicating the presence / absence of the SBR information included in the elementary audio stream 300 and the SBR presence / absence information from the preamble removing unit 202 contradict each other. Information indicating the presence or absence of the included SBR information may be followed.

このようにして生成されたデコード信号は、エレメンタリデコード部２０３から外部に送信され、スピーカ又はヘッドホンなで可聴信号に変換される。 The decoded signal generated in this way is transmitted to the outside from the elementary decoding unit 203 and converted into an audible signal using a speaker or headphones.

上記のように本実施の形態１に係るビットストリーム送信装置２００は、オーディオ信号を符号化して得られたエレメンタリオーディオストリームを取得する入力部１００と、エレメンタリオーディオストリームのサンプリング周波数とオーディオ符号化データ形式とを解析する解析部１０１と、解析部１０１の解析結果に応じてＩＥＣ６１９３７形式のオーディオビットストリームを構成するプリアンブル信号を生成するプリアンブル生成部１０２と、プリアンブル生成部１０２が生成したプリアンブル信号をエレメンタリオーディオストリームに付加することで、ＩＥＣ６１９３７形式のオーディオビットストリームを成形する成形部１０３と、成形部１０３で成形されたオーディオビットストリームを出力する出力部１０４とを備え、プリアンブル生成部１０２は、解析部１０１の解析結果から得られるサンプリング周波数が予め定められた値である場合、エレメンタリオーディオストリームのオーディオ符号化データ形式を示すパラメータを、オーディオビットストリームのＩＥＣ６１９３７に適合する形式に変更してプリアンブル信号を生成する。 As described above, the bitstream transmission apparatus 200 according to Embodiment 1 includes the input unit 100 that acquires an elementary audio stream obtained by encoding an audio signal, the sampling frequency of the elementary audio stream, and the audio encoding. An analysis unit 101 that analyzes the data format, a preamble generation unit 102 that generates a preamble signal that constitutes an audio bitstream in IEC 61937 format according to the analysis result of the analysis unit 101, and a preamble signal generated by the preamble generation unit 102 By adding to the elementary audio stream, a forming unit 103 that forms an audio bit stream in the IEC 61937 format, and an output unit 104 that outputs the audio bit stream formed by the forming unit 103 are provided. When the sampling frequency obtained from the analysis result of the analysis unit 101 is a predetermined value, the rumble generation unit 102 conforms the parameter indicating the audio encoded data format of the elementary audio stream to the IEC 61937 of the audio bit stream. Change the format to generate a preamble signal.

これにより、当該ビットストリーム送信装置２００は、ＩＥＣ６１９３７に適合しないためにビットストリーム受信装置２０１に送信できなかったオーディオビットストリーム３０１を問題なく送信することができる。この結果、当該ビットストリーム送信装置２００は、エレメンタリオーディオストリーム３００のサンプリングレートとオーディオ符号化データ形式との組み合わせが、ＩＥＣ６１９３７規格の範囲外であってもＩＥＣ６１９３７規格の範囲内のオーディオビットストリーム３０１として出力できる。特に、本実施の形態１によれば、プリアンブル信号に記述されるＳＢＲ情報の有無のみを変更することでそれが実施できるので、処理が非常に簡単である。 Accordingly, the bit stream transmission device 200 can transmit the audio bit stream 301 that could not be transmitted to the bit stream reception device 201 because it does not conform to IEC 61937 without any problem. As a result, the bitstream transmission apparatus 200 can obtain the audio bitstream 301 within the range of the IEC61937 standard even if the combination of the sampling rate of the elementary audio stream 300 and the audio encoded data format is outside the range of the IEC61937 standard. Can output. In particular, according to the first embodiment, it can be implemented by changing only the presence / absence of SBR information described in the preamble signal, so the processing is very simple.

なお、本実施の形態１では、ヘッダに記載されたサンプリングレートが２４ｋＨｚ（又は２２．０５ｋＨｚ、１６ｋＨｚ）で「ＳＢＲ情報なし」という例で説明したが、ヘッダに記載されたサンプリングレートが４４．１ｋＨｚ（又は３２ｋＨｚ）で「ＳＢＲ情報あり」という場合も、同様に処理できる。その場合の動作は以下のようになる。 In the first embodiment, the example in which the sampling rate described in the header is 24 kHz (or 22.05 kHz, 16 kHz) and “no SBR information” has been described, but the sampling rate described in the header is 44.1 kHz. The same processing can be performed when “SBR information exists” at (or 32 kHz). The operation in that case is as follows.

解析部１０１は、エレメンタリオーディオストリーム３００のサンプリングレートとオーディオ符号化データ形式とを解析する。入力されたエレメンタリオーディオストリーム３００が、上で述べたように、サンプリングレートが４４．１ｋＨｚで、ＳＢＲ情報を含んでいることが、解析部１０１によって解析される。 The analysis unit 101 analyzes the sampling rate of the elementary audio stream 300 and the audio encoded data format. As described above, the analysis unit 101 analyzes that the input elementary audio stream 300 has a sampling rate of 44.1 kHz and includes SBR information.

次にプリアンブル生成部１０２は、解析部１０１の解析結果に応じてＩＥＣ６１９３７形式のオーディオビットストリーム３０１に含まれるプリアンブル信号を生成する。このとき、エレメンタリオーディオストリーム３００のサンプリングレートが４４．１ｋＨｚで、ＳＢＲ情報を含んでいる場合、図１０に示したように、ＩＥＣ６１９３７形式のオーディオビットストリームが定義されていないので、プリアンブル生成部１０２は、ＳＢＲ情報を含んでいないことを示すプリアンブル信号を生成する。具体的には、プリアンブル生成部１０２は、プリアンブル信号Ｐｃの０から４ビット目の５ビットが示す値を２３に設定する。これにより、プリアンブル信号Ｐｃは、ヘッダ形式がＬＯＡＳ形式であることを示す。さらに、プリアンブル生成部１０２は、プリアンブル信号Ｐｃの５から６ビット目の２ビットを１に設定する。これにより、プリアンブル信号ＰｃはＳＢＲ情報が含まれていないことを示す。すなわち、プリアンブル信号Ｐｃは、エレメンタリオーディオストリーム３００のオーディオ符号化データ形式がＡＡＣ方式であることを示す。 Next, the preamble generation unit 102 generates a preamble signal included in the audio bitstream 301 in the IEC 61937 format according to the analysis result of the analysis unit 101. At this time, when the sampling rate of the elementary audio stream 300 is 44.1 kHz and SBR information is included, an audio bit stream in the IEC61937 format is not defined as shown in FIG. Generates a preamble signal indicating that SBR information is not included. Specifically, the preamble generation unit 102 sets the value indicated by 5 bits from 0 to the 4th bit of the preamble signal Pc to 23. Thereby, the preamble signal Pc indicates that the header format is the LOAS format. Further, the preamble generation unit 102 sets 2 bits of the 5th to 6th bits of the preamble signal Pc to 1. As a result, the preamble signal Pc indicates that no SBR information is included. That is, the preamble signal Pc indicates that the audio encoded data format of the elementary audio stream 300 is the AAC format.

次に、成形部１０３で、プリアンブル生成部１０２の出力のプリアンブル信号をエレメンタリオーディオストリーム３００に付加する。すなわち、成形部１０３は、図７に示したようなＩＥＣ６１９３７形式のオーディオビットストリーム３０１を成形する。 Next, the shaping unit 103 adds the preamble signal output from the preamble generation unit 102 to the elementary audio stream 300. That is, the forming unit 103 forms the audio bit stream 301 in the IEC61937 format as shown in FIG.

最後に、出力部１０４が、成形部１０３で成形されたオーディオビットストリーム３０１を外部機器に出力する。 Finally, the output unit 104 outputs the audio bitstream 301 formed by the forming unit 103 to an external device.

このようにすることで、ビットストリーム送信装置２００は、入力されたエレメンタリオーディオストリーム３００が、ＩＥＣ６１９３７規格の範囲外であってもＩＥＣ６１９３７規格の範囲内のオーディオビットストリーム３０１として出力できる。 In this way, the bitstream transmission device 200 can output the input elementary audio stream 300 as an audio bitstream 301 within the range of the IEC61937 standard even if the input elementary audio stream 300 is outside the range of the IEC61937 standard.

なお、ＳＢＲ情報を上記のように強制的に変更すれば、プリアンブル信号に含まれているＳＢＲの有無を示す情報と、エンコードデータに含まれるＳＢＲの有無を示す情報とが不整合となるが、不整合となった場合にはプリアンブル信号に含まれているＳＢＲの有無を示す情報に従って処理が行われるように予め定めておいてもよい。前述したように、ＡＡＣのエレメンタリオーディオストリームとＨＥＡＡＣのエレメンタリオーディオストリームとは再生互換があるので、当該不整合なＩＥＣ６１９３７形式のオーディオビットストリームであっても外部受信機では当該ストリームを再生可能である。 If the SBR information is forcibly changed as described above, the information indicating the presence or absence of SBR included in the preamble signal and the information indicating the presence or absence of SBR included in the encoded data become inconsistent. In the case of inconsistency, it may be determined in advance so that processing is performed according to information indicating the presence or absence of SBR included in the preamble signal. As described above, since the AAC elementary audio stream and the HEAAC elementary audio stream are compatible with each other, even if the inconsistent IEC 61937 format audio bit stream is used, the external receiver can reproduce the stream. is there.

その考え方を図４Ａ及び図４Ｂに示す。図４Ａ（ａ）〜（ｂ２）及び図４Ｂ（ａ）〜（ｂ２）において、横軸は周波数を表している。 The concept is shown in FIGS. 4A and 4B. 4A (a) to (b2) and FIGS. 4B (a) to (b2), the horizontal axis represents frequency.

図４Ａ（ａ）は、ＨＥＡＡＣで符号化されたエレメンタリオーディオストリーム３００に含まれる周波数成分を、周波数スペクトルで模式的に示している。図４Ａの中においてサンプリング周波数の１／４よりも低域の実線で示す信号は、ＡＡＣ方式で符号化されており、サンプリング周波数の１／４よりも高域の破線で示す信号は、ＳＢＲ方式で符号化されている。 FIG. 4A (a) schematically shows frequency components included in the elementary audio stream 300 encoded by HEAAC in the frequency spectrum. In FIG. 4A, a signal indicated by a solid line lower than ¼ of the sampling frequency is encoded by the AAC method, and a signal indicated by a broken line higher than ¼ of the sampling frequency is indicated by the SBR method. It is encoded with.

図４Ａ（ｂ１）は、図４Ａ（ａ）に示したＨＥＡＡＣの信号をＨＥＡＡＣデコーダで処理した場合の周波数成分を示している。この場合、低域及び高域の両方を含む全帯域の信号が再生される。 FIG. 4A (b1) shows frequency components when the HEAAC signal shown in FIG. 4A (a) is processed by the HEAAC decoder. In this case, the entire band signal including both the low frequency band and the high frequency band is reproduced.

図４Ａ（ｂ２）は、図４Ａ（ａ）に示したＨＥＡＡＣの信号をＡＡＣデコーダで処理した場合の周波数成分を示している。この場合、ＳＢＲ方式で符号化されている高域は復号されず、ＡＡＣ方式で符号化されている低域の信号のみがＡＡＣデコーダで復号されて再生される。 FIG. 4A (b2) shows frequency components when the HEAAC signal shown in FIG. 4A (a) is processed by the AAC decoder. In this case, the high frequency band encoded by the SBR method is not decoded, and only the low frequency band signal encoded by the AAC method is decoded and reproduced by the AAC decoder.

図４Ｂ（ａ）は、ＡＡＣで符号化されたエレメンタリオーディオストリーム３００に含まれる周波数成分を、周波数スペクトルで模式的に示している。この場合、全帯域がＡＡＣ方式で符号化されている。 FIG. 4B (a) schematically shows frequency components included in the elementary audio stream 300 encoded by AAC using a frequency spectrum. In this case, the entire band is encoded by the AAC method.

図４Ｂ（ｂ１）は、図４Ｂ（ａ）に示したＡＡＣの信号をＨＥＡＡＣデコーダで処理した場合の周波数成分を示している。この場合、全帯域の信号がＨＥＡＡＣデコーダで復号されて再生される。 FIG. 4B (b1) shows frequency components when the AAC signal shown in FIG. 4B (a) is processed by the HEAAC decoder. In this case, the entire band signal is decoded by the HEAAC decoder and reproduced.

図４Ｂ（ｂ２）は、図４Ｂ（ａ）に示したＡＡＣの信号をＡＡＣデコーダで処理した場合の周波数成分を示している。この場合も図４Ｂ（ｂ１）に示した場合と同様に、全帯域の信号がＡＡＣデコーダで復号されて再生される。 FIG. 4B (b2) shows frequency components when the AAC signal shown in FIG. 4B (a) is processed by the AAC decoder. Also in this case, as in the case shown in FIG. 4B (b1), the signal of the entire band is decoded by the AAC decoder and reproduced.

つまり、実態として信号がＨＥＡＡＣである場合、すなわち、エレメンタリオーディオストリーム３００がＨＥＡＡＣオーディオ符号化データ形式で符号化されたエレメンタリオーディオストリームである場合において、プリアンブル信号により、エレメンタリオーディオストリーム３００がＡＡＣで符号化されたことを示す場合には、エレメンタリオーディオストリーム３００はＡＡＣデコーダで処理される。このような場合であっても、本来のＨＥＡＡＣでデコードされた場合と同様の音質は得られないまでもＡＡＣでデコードされた可聴オーディオが再生される（図４Ａ（ｂ２））。また、仮に、プリアンブル信号がＡＡＣを示し、実態としても信号がＡＡＣであったとしても、このＡＡＣの信号をＨＥＡＡＣデコードで処理して差し障りはない。つまり、このＡＡＣの信号をＨＥＡＡＣデコードで処理しておけば、本来のＡＡＣでデコードされた場合と同様の音質が得られる（図４Ｂ（ｂ２））。 That is, when the signal is actually HEAAC, that is, when the elementary audio stream 300 is an elementary audio stream encoded in the HEAAC audio encoded data format, the preamble audio signal causes the elementary audio stream 300 to be AAC. In the case of indicating that the data is encoded by the AAC decoder, the elementary audio stream 300 is processed by the AAC decoder. Even in such a case, the audible audio decoded by the AAC is reproduced until the same sound quality as that obtained by the original HEAAC is not obtained (FIG. 4A (b2)). Also, even if the preamble signal indicates AAC and the signal is actually AAC, this AAC signal is processed by HEAAC decoding without any problem. That is, if this AAC signal is processed by HEAAC decoding, the same sound quality as that obtained by decoding by the original AAC can be obtained (FIG. 4B (b2)).

（実施の形態１の応用例）
なお、本実施の形態１では、「ヘッダに記載されたサンプリングレートが２４ｋＨｚ（又は２２．０５ｋＨｚ、１６ｋＨｚ）でＳＢＲ情報なし」及び「ヘッダに記載されたサンプリングレートが４４．１ｋＨｚ（又は３２ｋＨｚ）でＳＢＲ情報あり」という例を用いて第１の課題を解決する構成について説明したが、「ヘッダに記載されたサンプリングレートが４８ｋＨｚでＳＢＲ情報あり」という場合についても上記と同様の処理を行うことにより、第２の課題を解決することができる。以下では、「ヘッダに記載されたサンプリングレートが４８ｋＨｚでＳＢＲ情報あり」という場合の処理について詳細に説明する。(Application example of Embodiment 1)
In the first embodiment, “the sampling rate described in the header is 24 kHz (or 22.05 kHz, 16 kHz) and there is no SBR information” and “the sampling rate described in the header is 44.1 kHz (or 32 kHz). The configuration that solves the first problem has been described using the example of “with SBR information”, but the same processing as described above is performed when “the sampling rate described in the header is 48 kHz and there is SBR information”. The second problem can be solved. In the following, the processing in the case of “the sampling rate described in the header is 48 kHz and there is SBR information” will be described in detail.

まず、解析部１０１は、エレメンタリオーディオストリーム３００のサンプリングレートとオーディオ符号化データ形式とを解析する。ここで、入力されたエレメンタリオーディオストリーム３００は、上で述べたように、サンプリングレートが４８ｋＨｚで、ＳＢＲ情報を含んでいることが、解析部１０１によって解析される。 First, the analysis unit 101 analyzes the sampling rate and the audio encoded data format of the elementary audio stream 300. Here, as described above, the analysis unit 101 analyzes that the input elementary audio stream 300 has a sampling rate of 48 kHz and includes SBR information.

次に、プリアンブル生成部１０２は、解析部１０１の解析結果に応じてＩＥＣ６１９３７形式のプリアンブル信号を生成する。サンプリングレートが４８ｋＨｚの場合、図１０に示したように、ＳＢＲ情報を含んでいる場合も含んでいない場合も、いずれの場合であっても、ＩＥＣ６１９３７形式のオーディオビットストリームが定義されている。しかし、本実施の形態のプリアンブル生成部１０２は、ＳＢＲ情報を含んでいる場合であっても常に、ＳＢＲ情報を含んでいないことを示すプリアンブル信号を生成する。具体的にはプリアンブル生成部１０２は、プリアンブル信号Ｐｃの０から４ビット目の５ビットが示す値を２３に設定する。これにより、プリアンブル信号Ｐｃは、ヘッダ形式がＬＯＡＳ形式であることを示す。さらに、プリアンブル生成部１０２は、プリアンブル信号Ｐｃの５から６ビット目の２ビットを１に設定する。これにより、プリアンブル信号Ｐｃは、エレメンタリオーディオストリーム３００にＳＢＲ情報が含まれていないことを示す。すなわち、プリアンブル信号Ｐｃは、サンプリングレートが４８ｋＨｚのときは常に、エレメンタリオーディオストリーム３００がＡＡＣ方式で符号化されていることを示すプリアンブル信号を生成する。 Next, the preamble generation unit 102 generates an IEC 61937 format preamble signal according to the analysis result of the analysis unit 101. When the sampling rate is 48 kHz, as shown in FIG. 10, an audio bit stream in the IEC 61937 format is defined regardless of whether the SBR information is included or not. However, preamble generation section 102 of the present embodiment always generates a preamble signal indicating that it does not include SBR information, even when it includes SBR information. Specifically, the preamble generation unit 102 sets the value indicated by 5 bits from 0 to 4 of the preamble signal Pc to 23. Thereby, the preamble signal Pc indicates that the header format is the LOAS format. Further, the preamble generation unit 102 sets 2 bits of the 5th to 6th bits of the preamble signal Pc to 1. Thereby, the preamble signal Pc indicates that the elementary audio stream 300 does not include SBR information. That is, the preamble signal Pc generates a preamble signal indicating that the elementary audio stream 300 is encoded by the AAC method whenever the sampling rate is 48 kHz.

次に、成形部１０３は、プリアンブル生成部１０２で生成されたプリアンブル信号をエレメンタリオーディオストリーム３００に付加する。すなわち、成形部１０３は、図７に示したようなＩＥＣ６１９３７形式のオーディオビットストリーム３０１を成形する。 Next, the shaping unit 103 adds the preamble signal generated by the preamble generation unit 102 to the elementary audio stream 300. That is, the forming unit 103 forms the audio bit stream 301 in the IEC61937 format as shown in FIG.

最後に、出力部１０４では、成形部１０３で生成されたオーディオビットストリーム３０１を外部機器に出力する。 Finally, the output unit 104 outputs the audio bitstream 301 generated by the shaping unit 103 to an external device.

このようにすることで、実際のＤＴＶ放送などで、出力サンプリング周波数の切り替えが増加することを抑制することができる。例えば、デジタル放送において、番組コンテンツ部分はオーディオ符号化データ形式がＨＥＡＡＣで、かつ、ＡＡＣサンプリングレートが４８ＫＨｚで放送されていて、一方で、コマーシャル部分はオーディオ符号化データ形式がＡＡＣＬＣで、かつ、ＡＡＣサンプリングレートが４８ＫＨｚで放送されている場合、ビットストリーム送信装置は、番組コンテンツ部分ではＩＥＣ６０９５８フレームレート９６ＫＨｚで信号を出力し、コマーシャル部分ではＩＥＣ６０９５８のフレームレート４８ＫＨｚで信号を出力することになる。しかしながら上記のようにプリアンブル生成部１０２でＳＢＲ情報の有無を変更することにより、番組コンテンツ部分とコマーシャル部分との切り替わりで再生フレームレートが変化することを防止することができる。この結果、帯域拡張による番組コンテンツ部分での高音質なオーディオ再生は望めないが、ビットストリーム受信装置２０１においてＩＥＣ６０９５８フレームレートの変化に追従するために時間がかかることに起因する音飛びなどの不具合を回避することができる。 By doing so, it is possible to suppress an increase in switching of the output sampling frequency in an actual DTV broadcast or the like. For example, in digital broadcasting, the program content part is broadcast with an audio encoded data format of HEAAC and an AAC sampling rate of 48 KHz, while the commercial part is an audio encoded data format of AACLC and AACLC When the sampling rate is broadcast at 48 KHz, the bit stream transmission apparatus outputs a signal at an IEC60958 frame rate of 96 KHz in the program content part, and outputs a signal at a frame rate of 48 KHz of IEC60958 in the commercial part. However, by changing the presence / absence of the SBR information in the preamble generation unit 102 as described above, it is possible to prevent the playback frame rate from changing due to switching between the program content part and the commercial part. As a result, high-quality audio reproduction in the program content portion due to band expansion cannot be expected, but the bitstream receiving apparatus 201 has problems such as sound skipping due to the time required to follow the change in the IEC60958 frame rate. It can be avoided.

図５Ａ〜図５Ｃは、図１０に示したストリーム出力規定に基づくストリームの伝送例を説明する図である。図５Ａ〜図５Ｃにおいて、ａ、ｂ、ｃ及びｄの文字が記された○はプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄを示し、ハッチングされた○はエレメンタリオーディオストリームを示し、○はデータ長を調節するための無効なデータであるパディングデータを示している。また、ここで伝送されるストリームのヘッダ形式はＬＯＡＳ形式であるとする。 5A to 5C are diagrams for explaining an example of stream transmission based on the stream output rule shown in FIG. In FIGS. 5A to 5C, ◯ with characters a, b, c, and d indicate preamble signals Pa, Pb, Pc, and Pd, hatched ◯ indicates an elementary audio stream, and ◯ indicates a data length. The padding data, which is invalid data for adjusting the value, is shown. The header format of the stream transmitted here is assumed to be the LOAS format.

図５Ａは、例えば、サンプリングレートが４８ｋＨｚでＡＡＣ符号化（ＳＢＲ情報なし）されたストリームを伝送できる理由を示す図である。ここでは、１サンプルを３２ビットで表す。ビットストリーム送信装置は、サンプリング周波数が４８ｋＨｚのエレメンタリオーディオストリームなので４８ｋＨｚ×３２ｂｉｔｓ＝１５３６ｋｂｐｓの伝送速度でデータを伝送する。具体的には、２１．３３３ｍｓｅｃ（４８ｋＨｚで１０２４サンプル分の時間）ごとにプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄをつけて、３２７６８ビットの１フレームの信号を順次伝送することが定められている。 FIG. 5A is a diagram illustrating the reason why a stream that is AAC encoded (no SBR information) at a sampling rate of 48 kHz can be transmitted, for example. Here, one sample is represented by 32 bits. Since the bit stream transmission device is an elementary audio stream with a sampling frequency of 48 kHz, it transmits data at a transmission rate of 48 kHz × 32 bits = 1536 kbps. Specifically, it is defined that one frame signal of 32768 bits is sequentially transmitted with the preamble signals Pa, Pb, Pc and Pd added every 21.333 msec (time of 1024 samples at 48 kHz).

これに対し、デコーダは、４８ｋＨｚのＡＡＣの場合、２１．３３３ｍｓｅｃごとにＰＣＭデータを生成する。よって、信号の供給レートとデコーダの処理レートとが一致することで、デコーダは信号を矛盾なく処理をできる。 On the other hand, in the case of 48 kHz AAC, the decoder generates PCM data every 21.333 msec. Therefore, when the signal supply rate matches the processing rate of the decoder, the decoder can process the signal without contradiction.

図５Ｂは、例えば、サンプリングレートが２４ｋＨｚでＡＡＣ符号化（ＳＢＲ情報なし）されたストリームを伝送できない理由を示す図である。サンプリングレートが２４ｋＨｚの信号の場合、ビットストリーム送信装置は、２４ｋＨｚ×３２ｂｉｔｓ＝７６８ｋｂｐｓの伝送速度でデータを伝送することが好ましいしが、７６８ｋｂｐｓというモードがない。仮にビットストリーム送信装置が１５３６ｋｂｐｓの伝送速度を用いるとすると、図５Ｂに示すようにデータが伝送される。その場合、デコーダは、２１．３３３ｍｓｅｃごとに供給される信号に対して、４２．６６６７ｍｓｅｃ分のＰＣＭデータを生成するので（２４ｋＨｚで１０２４サンプルは４２．６６６７ｍｓｅｃだから）、デコーダの処理速度に対してビットストリーム送信装置からの信号の供給が過剰となる。これにより、ストリームがオーバーフローする。 FIG. 5B is a diagram illustrating a reason why a stream that is AAC encoded (no SBR information) at a sampling rate of 24 kHz cannot be transmitted, for example. In the case of a signal with a sampling rate of 24 kHz, the bit stream transmission device preferably transmits data at a transmission rate of 24 kHz × 32 bits = 768 kbps, but does not have a mode of 768 kbps. If the bitstream transmission device uses a transmission rate of 1536 kbps, data is transmitted as shown in FIG. 5B. In that case, since the decoder generates PCM data for 42.6667 msec for the signal supplied every 21.333 msec (since 1024 samples are 42.6667 msec at 24 kHz), the bit rate with respect to the processing speed of the decoder Supply of signals from the stream transmission device becomes excessive. As a result, the stream overflows.

図５Ｃは、例えば、サンプリングレートが２４ｋＨｚでＨＥＡＡＣ符号化（ＳＢＲ有り）されたストリームを伝送できる理由を示す図である。ＨＥＡＡＣで符号化されている場合、図１１Ａに示したように、ＡＡＣ部のサンプリングレートが２４ｋＨｚであっても、ビットストリーム送信装置は、ストリーム全体をサンプリングレートが４８ｋＨｚの信号と見なすので、４８ｋＨｚ×３２ｂｉｔｓ＝１５３６ｋｂｐｓの伝送速度でデータを伝送する。具体的には、４２．６６６７ｍｓｅｃ（４８ｋＨｚで２０４８サンプル分の時間）ごとにプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄをつけて１フレームごとに信号を伝送することが定められている。 FIG. 5C is a diagram illustrating a reason why a stream that is HEAAC encoded (with SBR) at a sampling rate of 24 kHz can be transmitted, for example. When encoded by HEAAC, as shown in FIG. 11A, even if the sampling rate of the AAC unit is 24 kHz, the bitstream transmission apparatus regards the entire stream as a signal having a sampling rate of 48 kHz. Data is transmitted at a transmission rate of 32 bits = 1536 kbps. Specifically, it is determined that a signal is transmitted frame by frame with a preamble signal Pa, Pb, Pc, and Pd added every 42.6667 msec (time of 2048 samples at 48 kHz).

これに対し、デコーダは、２４ｋＨｚのＨＥＡＡＣの場合、４２．６６６７ｍｓｅｃごとに、ＰＣＭデータを生成するので、信号の供給レートとデコーダの処理レートとが一致する。よって、デコーダは信号を矛盾なく処理できる。 On the other hand, in the case of a 24 kHz HEAAC, the decoder generates PCM data every 42.6667 msec, so that the signal supply rate matches the decoder processing rate. Therefore, the decoder can process the signal without contradiction.

上記と同様にして、サンプリングレートが４４．１ｋＨｚでＡＡＣ符号化されたストリームと、サンプリングレートが２２．０５ｋＨｚでＨＥＡＡＣ符号化されたストリームとは伝送可能であるのに対し、サンプリングレートが２２．０５ｋＨｚでＡＡＣ符号化されたストリームを伝送できない理由を説明することができる。また、サンプリングレートが３２ｋＨｚでＡＡＣ符号化されたストリームと、サンプリングレートが１６ｋＨｚでＨＥＡＡＣ符号化されたストリームとは伝送可能であるのに対し、サンプリングレートが１６ｋＨｚでＡＡＣ符号化されたストリームを伝送できない理由を説明することができる。 In the same manner as described above, an AAC encoded stream with a sampling rate of 44.1 kHz and a HEAAC encoded stream with a sampling rate of 22.05 kHz can be transmitted, whereas the sampling rate is 22.05 kHz. The reason why an AAC-encoded stream cannot be transmitted is described in FIG. A stream encoded with AAC at a sampling rate of 32 kHz and a stream encoded with HEAAC at a sampling rate of 16 kHz can be transmitted, but an AAC encoded stream at a sampling rate of 16 kHz cannot be transmitted. Explain why.

なぜならば、出力の伝送レート１４１１．２ｋｂｐｓ（＝４４．１ｋＨｚ×３２ｂｉｔｓ）と、１０２４ｋｂｐｓ（＝３２ｋＨｚ×３２ｂｉｔｓ）とは、それぞれ、ＩＥＣ６０９５８で規格化されているので、ビットストリーム送信装置は、それを用いることができるが、７０５．６ｋｂｐｓ（＝２２．０５ｋＨｚ×３２ｂｉｔｓ）と、５１２ｋｂｐｓ（＝１６ｋＨｚ×３２ｂｉｔｓ）は、規格化されていないからである。 This is because the output transmission rate of 1411.2 kbps (= 44.1 kHz × 32 bits) and 1024 kbps (= 32 kHz × 32 bits) are standardized by IEC60958, respectively. This is because 705.6 kbps (= 22.05 kHz × 32 bits) and 512 kbps (= 16 kHz × 32 bits) are not standardized.

上記例とは異なり、サンプリングレート４８ｋＨｚではＡＡＣ符号化されたストリームも、ＨＥＡＡＣ符号化されたストリームも、いずれも問題なく伝送することができる。 Unlike the above example, at an sampling rate of 48 kHz, both AAC encoded streams and HEAAC encoded streams can be transmitted without problems.

その理由は、サンプリングレートが４８ｋＨｚでＡＡＣ符号化されたストリームでは、サンプリングレートが４８ｋＨｚなので、ビットストリーム送信装置は、４８ｋＨｚ×３２ｂｉｔｓ＝１５３６ｋｂｐｓの伝送速度でデータを伝送する。また、２１．３３３ｍｓｅｃ（４８ｋＨｚで１０２４サンプル分の時間）ごとにプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄをつけて１フレーム（３２７６８ビット）ごとに信号を伝送することが定められている。 The reason is that in a stream that is AAC encoded at a sampling rate of 48 kHz, the sampling rate is 48 kHz. Therefore, the bit stream transmission device transmits data at a transmission rate of 48 kHz × 32 bits = 1536 kbps. Further, it is defined that a signal is transmitted every frame (32768 bits) with preamble signals Pa, Pb, Pc and Pd added every 21.333 msec (time of 1024 samples at 48 kHz).

これに対し、デコーダは、４８ｋＨｚのＡＡＣの場合、２１．３３３ｍｓｅｃごとにＰＣＭデータを生成するので、信号の供給レートとデコーダの処理レートとが一致する。よって、デコーダは信号を矛盾なく処理できる。 On the other hand, in the case of 48 kHz AAC, the decoder generates PCM data every 21.333 msec, so that the signal supply rate matches the processing rate of the decoder. Therefore, the decoder can process the signal without contradiction.

また、サンプリングレートが４８ｋＨｚでＨＥＡＡＣ符号化されたストリームでは、ビットストリーム送信装置は、サンプリングレートを９６ｋＨｚとみなすので、９６ｋＨｚ×３２ｂｉｔｓ＝３０７２ｋｂｐｓの伝送速度でデータを伝送する。また、２１．３３３ｍｓｅｃ（９６ｋＨｚで２０４８サンプル分の時間）ごとにプリアンブル信号Ｐａ、Ｐｂ、Ｐｃ及びＰｄをつけて１フレーム（３２７６８ビット）ごとに信号を伝送することが定められている。 In addition, in a stream that is HEAAC encoded at a sampling rate of 48 kHz, the bit stream transmission apparatus regards the sampling rate as 96 kHz, and therefore transmits data at a transmission rate of 96 kHz × 32 bits = 3072 kbps. Further, it is determined that a signal is transmitted every frame (32768 bits) with preamble signals Pa, Pb, Pc and Pd added every 21.333 msec (time of 2048 samples at 96 kHz).

これに対し、デコーダでは、４８ｋＨｚのＨＥＡＡＣの場合、２１．３３３ｍｓｅｃごとにＰＣＭデータを生成するので、信号の供給レートとデコーダの処理レートとが一致する。よって、デコーダは信号を矛盾なく処理できる。 On the other hand, in the case of the 48 kHz HEAAC, since the PCM data is generated every 21.333 msec, the signal supply rate matches the processing rate of the decoder. Therefore, the decoder can process the signal without contradiction.

以上のように、エレメンタリオーディオストリームがサンプリングレート４８ｋＨｚで符号化されている場合には、ＡＡＣでもＨＥＡＡＣでも問題なく伝送することが可能である。一方で、サンプリングレート４８ｋＨｚでＡＡＣ符号化されたストリームと、サンプリングレート４８ｋＨｚでＨＥＡＡＣ符号化されたストリームとが切り替わる場合には、伝送レートが１５３６ｋｂｐｓのストリームと、伝送レートが３０７２ｋｂｐｓのストリームとが切り替わることになる。このとき、ビットストリーム送受信システム全体において、例えば、Ａ／Ｄ変換器の動作周波数を切り替えるというような同期を取り直す処理が必要となる。このようなビットストリーム送受信システム全体の同期を取り直す処理には、時間を要するため、サンプリングレートの切り替わり時にタイムラグが生じる。例えば、番組本編とＣＭとのつなぎ目で音声が途切れるというような不具合を生じることがある。 As described above, when the elementary audio stream is encoded at the sampling rate of 48 kHz, it is possible to transmit the AAC or HEAAC without any problem. On the other hand, when a stream that is AAC encoded at a sampling rate of 48 kHz and a stream that is HEAAC encoded at a sampling rate of 48 kHz are switched, a stream having a transmission rate of 1536 kbps and a stream having a transmission rate of 3072 kbps are switched. become. At this time, in the entire bitstream transmission / reception system, for example, a process of re-synchronizing such as switching the operating frequency of the A / D converter is required. Since it takes time to re-synchronize the entire bit stream transmission / reception system, a time lag occurs when the sampling rate is switched. For example, there may be a problem that the sound is interrupted at the joint between the main program and the CM.

これに対し、サンプリングレートが４８ｋＨｚでＨＥＡＡＣ符号化（ＳＢＲ有り）されたストリームを、ＳＢＲ無しとして伝送することで、伝送レートを一定のサンプリングレートにすることが可能となる。これにより、音飛びなどの不具合を防止することが可能となる。 On the other hand, by transmitting a stream that is HEAAC encoded (with SBR) at a sampling rate of 48 kHz without SBR, the transmission rate can be set to a constant sampling rate. This makes it possible to prevent problems such as sound skipping.

なお、上記実施の形態では、ビットストリーム送受信システムは、サンプリングレートの切り替わり時にタイムラグが生じるために、エレメンタリオーディオストリームのサンプリングレートが４８ｋＨｚである場合には、エレメンタリオーディオストリームがＨＥＡＡＣで符号化されている場合であっても、ＡＡＣで符号化されている場合であっても、常にＡＡＣで符号化されているとして処理することとしたが、本発明はこれに限定されない。例えば、サンプリングレートの切り替わり時に生じるタイムラグを気にしないのであれば、ビットストリーム送受信システムは、エレメンタリオーディオストリームが常にＨＥＡＡＣの信号であるとして処理を行うとしてもよい。 In the above embodiment, the bit stream transmission / reception system generates a time lag when the sampling rate is switched. Therefore, when the sampling rate of the elementary audio stream is 48 kHz, the elementary audio stream is encoded with HEAAC. However, the present invention is not limited to this. However, the present invention is not limited to this. For example, if the time lag that occurs when the sampling rate is switched is not concerned, the bitstream transmission / reception system may perform processing assuming that the elementary audio stream is always a HEAAC signal.

（実施の形態２）
以下、本発明の実施の形態２におけるビットストリーム送受信システム２１０について図面を参照しながら説明する。(Embodiment 2)
Hereinafter, the bitstream transmission / reception system 210 according to the second embodiment of the present invention will be described with reference to the drawings.

図６は本実施の形態２におけるビットストリーム送受信システムの構成を示す図である。図６において、実施の形態２のビットストリーム送受信システム２１０は、ビットストリーム送信装置１０００とビットストリーム受信装置１００１とを備える。 FIG. 6 is a diagram showing a configuration of a bit stream transmission / reception system according to the second embodiment. In FIG. 6, the bitstream transmission / reception system 210 according to the second embodiment includes a bitstream transmission apparatus 1000 and a bitstream reception apparatus 1001.

ビットストリーム送信装置１０００は、入力部１００、解析部１０１、成形部１０３、出力部１０４及びプリアンブル生成部１０１０を備える。ビットストリーム送信装置１０００において、ビットストリーム送信装置２００の各処理部と同じ参照符号を付した処理部は実施の形態１における図１に記載されたものと同じであるので説明を省略する。以下では主に、プリアンブル生成部１０１０について説明する。 The bitstream transmission apparatus 1000 includes an input unit 100, an analysis unit 101, a shaping unit 103, an output unit 104, and a preamble generation unit 1010. In the bitstream transmission apparatus 1000, the processing units denoted by the same reference numerals as the processing units of the bitstream transmission apparatus 200 are the same as those described in FIG. Hereinafter, the preamble generation unit 1010 will be mainly described.

プリアンブル生成部１０１０は、ビットストリーム受信装置１００１から、当該ビットストリーム受信装置１００１の能力情報を受信し、受信した能力情報に従って、プリアンブル信号Ｐｃに含まれる、エレメンタリオーディオストリームのオーディオ符号化データ形式を示す情報を変更する。具体的には、プリアンブル生成部１０１０は、ビットストリーム受信装置１００１から受信した能力情報により、ビットストリーム受信装置１００１が、サンプリングレートが４８ｋＨｚでＨＥＡＡＣにより符号化されたエレメンタリオーディオストリーム３００を復号する能力があるか否かを判断する。ビットストリーム受信装置１００１が、サンプリングレートが４８ｋＨｚでＨＥＡＡＣにより符号化されたエレメンタリオーディオストリーム３００を復号する能力があると判断された場合、プリアンブル生成部１０１０は、上記実施の形態１とは異なり、実際のエレメンタリオーディオストリーム３００がサンプリングレート４８ｋＨｚでＨＥＡＡＣにより符号化されている場合には、ＨＥＡＡＣにより符号化されていることを示すプリアンブル信号Ｐｃを生成する。具体的には、プリアンブル生成部１０１０は、プリアンブル信号Ｐｃの０から４ビット目の５ビットが示す値を２３に設定する。これにより、プリアンブル信号Ｐｃは、ヘッダ形式がＬＯＡＳ形式であることを示す。さらに、プリアンブル生成部１０１０は、プリアンブル信号Ｐｃの５から６ビット目の２ビットを２に設定する。これにより、プリアンブル信号Ｐｃは、エレメンタリオーディオストリーム３００にＳＢＲ情報が含まれていることを示す。すなわち、プリアンブル信号Ｐｃは、エレメンタリオーディオストリーム３００がＨＥＡＡＣ方式で符号化されていることを示す。 The preamble generation unit 1010 receives the capability information of the bit stream reception device 1001 from the bit stream reception device 1001, and in accordance with the received capability information, converts the audio encoded data format of the elementary audio stream included in the preamble signal Pc. Change the information shown. Specifically, the preamble generation unit 1010 is capable of decoding the elementary audio stream 300 encoded by HEAAC at a sampling rate of 48 kHz by the bit stream reception device 1001 based on the capability information received from the bit stream reception device 1001. Judge whether there is. When it is determined that the bitstream receiving apparatus 1001 has the capability of decoding the elementary audio stream 300 encoded by HEAAC at a sampling rate of 48 kHz, the preamble generation unit 1010 is different from the first embodiment, When the actual elementary audio stream 300 is encoded by HEAAC at a sampling rate of 48 kHz, a preamble signal Pc indicating that it is encoded by HEAAC is generated. Specifically, the preamble generation unit 1010 sets the value indicated by the 5th bit from 0 to the 4th bit of the preamble signal Pc to 23. Thereby, the preamble signal Pc indicates that the header format is the LOAS format. Further, the preamble generation unit 1010 sets 2 bits of the 5th to 6th bits of the preamble signal Pc to 2. Accordingly, the preamble signal Pc indicates that the elementary audio stream 300 includes SBR information. That is, the preamble signal Pc indicates that the elementary audio stream 300 is encoded by the HEAAC method.

上記のビットストリーム送信装置１０００に対し、実施の形態２のビットストリーム受信装置１００１は、ビットストリーム送信装置１０００から送信されたオーディオビットストリーム３０１を受信し、受信したオーディオビットストリーム３０１をデコードする。ビットストリーム受信装置１００１は、プリアンブル除去部２０２、エレメンタリデコード部２０３及び能力情報通知部１０１１を備える。図６において、ビットストリーム受信装置２０１の各処理部と同じ参照符号を付した処理部は実施の形態１における図２に記載されたものと同じであり、すでに説明しているので説明を省略する。以下では主に、能力情報通知部１０１１について説明する。 In contrast to the above bitstream transmission apparatus 1000, the bitstream reception apparatus 1001 according to Embodiment 2 receives the audio bitstream 301 transmitted from the bitstream transmission apparatus 1000 and decodes the received audio bitstream 301. The bitstream receiving apparatus 1001 includes a preamble removing unit 202, an elementary decoding unit 203, and a capability information notification unit 1011. In FIG. 6, the processing units denoted by the same reference numerals as the processing units of the bitstream receiving apparatus 201 are the same as those described in FIG. . Hereinafter, the capability information notification unit 1011 will be mainly described.

能力情報通知部１０１１は、ビットストリーム受信装置１００１内のエレメンタリデコード部２０３が、サンプリングレートが４８ｋＨｚでＨＥＡＡＣ符号化されたエレメンタリオーディオストリーム３００を復号できるか否かを示す能力情報を内部のメモリに記憶している。能力情報通知部１０１１は、ビットストリーム送信装置１０００から能力情報の送信要求があったとき、メモリに記憶している能力情報をビットストリーム送信装置１０００のプリアンブル生成部１０１０に通知する。 The capability information notification unit 1011 stores capability information indicating whether or not the elementary decoding unit 203 in the bitstream receiving apparatus 1001 can decode the HEAAC-encoded elementary audio stream 300 at a sampling rate of 48 kHz. I remember it. When there is a capability information transmission request from the bitstream transmission apparatus 1000, the capability information notification unit 1011 notifies the preamble generation unit 1010 of the bitstream transmission apparatus 1000 of the capability information stored in the memory.

このように構成することによって、ビットストリーム送信装置１０００は、ビットストリーム受信装置１００１のエレメンタリデコード部２０３が、サンプリングレートが４８ｋＨｚでＨＥＡＡＣ符号化されたエレメンタリオーディオストリームを復号する能力を備えるときには、エレメンタリオーディオストリーム３００のサンプリングレートが４８ｋＨｚの場合であってもＨＥＡＡＣで符号化されているときには、プリアンブル信号ＰｃにＳＢＲ情報ありを示すパラメータを記述することができる。従って、ＡＡＣで符号化されたコンテンツと、ＨＥＡＡＣで符号化されたコンテンツとのつなぎ目で音が途切れる不具合が問題にならないような場合には、ビットストリーム受信装置１００１は、ビットストリーム送信装置１０００から伝送されたオーディオビットストリーム３０１に含まれるプリアンブル信号に従ってエレメンタリオーディオストリーム３００を復号することができる。これにより、ユーザが、ＨＥＡＡＣで符号化されたコンテンツを、高域まで帯域拡張された高音質で楽しむことができるという効果がある。 By configuring in this way, the bitstream transmission apparatus 1000 has the ability to decode the HEAAC-encoded elementary audio stream at the sampling rate of 48 kHz when the elementary decoding unit 203 of the bitstream reception apparatus 1001 has the capability of decoding. Even when the sampling rate of the elementary audio stream 300 is 48 kHz, a parameter indicating the presence of SBR information can be described in the preamble signal Pc when it is encoded by HEAAC. Therefore, when the problem that the sound is interrupted at the joint between the content encoded with AAC and the content encoded with HEAAC does not become a problem, the bit stream reception device 1001 transmits the bit stream from the bit stream transmission device 1000. The elementary audio stream 300 can be decoded in accordance with the preamble signal included in the audio bit stream 301 thus obtained. Thereby, there is an effect that the user can enjoy the content encoded by HEAAC with high sound quality that is band-extended to a high frequency range.

また、ビットストリーム受信装置１００１のエレメンタリデコード部２０３が、サンプリングレートが４８ｋＨｚのＨＥＡＡＣで符号化されたストリームを復号する能力を備える場合でも、ビットストリーム受信装置１００１は、あえてビットストリーム送信装置１０００に対して能力情報を通知しなくてもよい。この場合、ビットストリーム受信装置１００１は、例えば、実施の形態１のビットストリーム送信装置２００から受信したオーディオビットストリーム３０１から抽出されたエレメンタリオーディオストリーム３００のヘッダ及びＥｘｔｅｎｓｉｏｎ＿ｐａｙｌｏａｄを解釈し、エレメンタリオーディオストリーム３００にＳＢＲ情報が含まれている場合にはそのＳＢＲ情報に従ってエレメンタリオーディオストリーム３００を復号するとしてもよい。さらにまた、オーディオビットストリーム３０１から抽出されたエレメンタリオーディオストリーム３００にＳＢＲ情報が含まれない場合であっても、ビットストリーム受信装置１００１は、高域まで帯域拡張を行うＨＥＡＡＣデコードを行うとしてもよい。 Even when the elementary decoding unit 203 of the bitstream receiving apparatus 1001 has a capability of decoding a stream encoded by HEAAC with a sampling rate of 48 kHz, the bitstream receiving apparatus 1001 dares to send the bitstream transmitting apparatus 1000 to the bitstream transmitting apparatus 1000. However, it is not necessary to notify the ability information. In this case, for example, the bitstream receiving apparatus 1001 interprets the header and the Extension_payload of the elementary audio stream 300 extracted from the audio bitstream 301 received from the bitstream transmitting apparatus 200 of Embodiment 1, and the elementary audio stream When SBR information is included in 300, the elementary audio stream 300 may be decoded according to the SBR information. Furthermore, even when the SBR information is not included in the elementary audio stream 300 extracted from the audio bitstream 301, the bitstream reception device 1001 may perform HEAAC decoding that extends the bandwidth to the high frequency range. .

プリアンブル生成部１０１０は、解析部１０１の解析結果から得られたサンプリング周波数が予め定められた第２の周波数である場合に、プリアンブル信号内にあって前記符号化形式を示す情報として、前記エレメンタリオーディオストリームにＳＢＲ情報が含まれない符号化形式を記載する。これにより、例えば、実際のＤＴＶ放送などで、番組コンテンツ部分はオーディオ符号化データ形式がＨＥＡＡＣかつＡＡＣサンプリングレートが４８ＫＨｚで放送されていて、一方で、コマーシャル部分はオーディオ符号化データ形式がＡＡＣＬＣかつＡＡＣサンプリングレートが４８ＫＨｚで放送されている場合でも、最終的なデコード信号のサンプリングレートが変動しないですむ。よって、番組コンテンツとＣＭ部分の切り替え部分が滑らかに遷移することとなる。このようにすることによって、エレメンタリオーディオストリーム３００がサンプリング周波数４８ｋＨｚでＨＥＡＡＣにより符号化されており、ビットストリーム受信装置１００１がこれを復号する能力を備える場合には、当該ビットストリーム受信装置１００１は、ＨＥＡＡＣでデコードされた高音質のオーディオ信号を再生することができる。 When the sampling frequency obtained from the analysis result of the analysis unit 101 is a predetermined second frequency, the preamble generation unit 1010 uses the elementary signal as information indicating the encoding format in the preamble signal. An encoding format in which the SBR information is not included in the audio stream is described. Thus, for example, in actual DTV broadcasting, the program content part is broadcast with an audio encoded data format of HEAAC and an AAC sampling rate of 48 KHz, while the commercial part has an audio encoded data format of AACLC and AAC. Even when the sampling rate is broadcast at 48 KHz, the sampling rate of the final decoded signal does not need to fluctuate. Therefore, the switching portion between the program content and the CM portion smoothly transitions. By doing in this way, when the elementary audio stream 300 is encoded by HEAAC at a sampling frequency of 48 kHz, and the bit stream receiving device 1001 has the ability to decode it, the bit stream receiving device 1001 A high-quality audio signal decoded by HEAAC can be reproduced.

また、実際のＤＴＶ放送として、サンプリングレートが２４ｋＨｚ、「ＳＢＲ情報あり」で放送されているような場合において、放送波の受信状態の不良などで信号に不具合が生じＳＢＲ情報が欠落したような場合であっても、最終的な出力信号のサンプリングレートが一定に保たれる。このように、当該ビットストリーム送受信システムは、受信状態の不良の影響を少なくとどめることができる。 Also, when the actual DTV broadcast is broadcast at a sampling rate of 24 kHz and “with SBR information”, when a signal malfunctions due to a bad reception state of the broadcast wave, and the SBR information is missing Even so, the final sampling rate of the output signal is kept constant. In this way, the bit stream transmission / reception system can reduce the influence of a bad reception state.

なお、本実施の形態では、プリアンブル除去部２０２がＳＢＲ有無情報を抽出し、当該情報をエレメンタリデコード部２０３に送出していたが、そのようにせず、エレメンタリオーディオストリーム３００が示すサンプリングレートが１６ｋＨｚから２４ｋＨｚの場合は、ビットストリーム受信装置は、常に「ＳＢＲ情報あり」の前提でエレメンタリオーディオストリーム３００をデコードしてもよい。また、逆に、エレメンタリオーディオストリーム３００が示すサンプリングレートが３２ｋＨｚから４８ｋＨｚの場合は、ビットストリーム受信装置は、常に「ＳＢＲ情報なし」の前提でエレメンタリオーディオストリーム３００をデコードしてもよい。 In the present embodiment, preamble removing section 202 extracts SBR presence / absence information and sends the information to elementary decoding section 203. However, the sampling rate indicated by elementary audio stream 300 is not changed. In the case of 16 kHz to 24 kHz, the bit stream receiving apparatus may always decode the elementary audio stream 300 on the premise that “SBR information is present”. Conversely, when the sampling rate indicated by the elementary audio stream 300 is 32 kHz to 48 kHz, the bit stream receiving device may always decode the elementary audio stream 300 on the premise of “no SBR information”.

以上、本発明の実施の形態に係るビットストリーム送信装置及びビットストリーム送受信システムについて説明したが、本発明は、この実施の形態に限定されるものではない。 The bitstream transmission apparatus and the bitstream transmission / reception system according to the embodiment of the present invention have been described above, but the present invention is not limited to this embodiment.

また、上記実施の形態に係るビットストリーム送信装置又はビットストリーム送受信システムに含まれる各処理部は典型的には集積回路であるＬＳＩとして実現される。これらは個別に１チップ化されてもよいし、一部又は全てを含むように１チップ化されてもよい。 In addition, each processing unit included in the bit stream transmission device or the bit stream transmission / reception system according to the above embodiment is typically realized as an LSI which is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them.

また、本発明の実施の形態に係る、ビットストリーム送信装置又はビットストリーム送受信システムの機能の一部又は全てを、ＣＰＵ等のプロセッサがプログラムを実行することにより実現してもよい。 Moreover, you may implement | achieve part or all of the function of the bit stream transmission apparatus or bit stream transmission / reception system based on embodiment of this invention, when processors, such as CPU, run a program.

さらに、本発明は上記プログラムであってもよいし、上記プログラムが記録された非一時的なコンピュータ読み取り可能な記録媒体であってもよい。また、上記プログラムは、インターネット等の伝送媒体を介して流通させることができるのは言うまでもない。 Furthermore, the present invention may be the above program or a non-transitory computer-readable recording medium on which the above program is recorded. Needless to say, the program can be distributed via a transmission medium such as the Internet.

また、上記実施の形態１及び２に係る、ビットストリーム送信装置及びビットストリーム送受信システム、並びにそれらの変形例の機能のうち少なくとも一部を組み合わせてもよい。 Moreover, you may combine at least one part among the functions of the bit stream transmission apparatus which concerns on the said Embodiment 1 and 2, a bit stream transmission / reception system, and those modifications.

また、ブロック図における機能ブロックの分割は一例であり、複数の機能ブロックを一つの機能ブロックとして実現したり、一つの機能ブロックを複数に分割したり、一部の機能を他の機能ブロックに移してもよい。また、類似する機能を有する複数の機能ブロックの機能を単一のハードウェア又はソフトウェアが並列又は時分割に処理してもよい。 In addition, division of functional blocks in the block diagram is an example, and a plurality of functional blocks can be realized as one functional block, a single functional block can be divided into a plurality of functions, or some functions can be transferred to other functional blocks. May be. In addition, functions of a plurality of functional blocks having similar functions may be processed in parallel or time-division by a single hardware or software.

更に、本発明の主旨を逸脱しない限り、本実施の形態に対して当業者が思いつく範囲内の変更を施した各種変形例も本発明に含まれる。 Further, various modifications in which the present embodiment is modified within the scope conceivable by those skilled in the art are also included in the present invention without departing from the gist of the present invention.

本発明にかかるビットストリーム送信装置は、ＳＢＲ情報の有無に関する情報を変更する手段を備えているので、ＩＥＣ６１９３７規格で定義されていないビットストリームが入力された場合であってもＩＥＣ６１９３７規格を用いてビットストリームを送信することができる。よって、本発明は、幅広くＤＴＶ放送の受信機、及び、オーディオ符号化信号が格納された記録媒体から信号を読み出して再生する再生機器に適用できる。また、本発明は、オーディオ符号化データ形式が異なるコンテンツの切り替え時におけるサンプリングレートの変化を抑制できるので、途切れのない良好な再生音を享受することができる。 Since the bitstream transmitting apparatus according to the present invention includes means for changing information regarding the presence or absence of SBR information, even if a bitstream not defined in the IEC61937 standard is input, the bitstream using the IEC61937 standard is used. Stream can be sent. Therefore, the present invention can be applied to a wide range of DTV broadcast receivers and playback devices that read out and play back signals from recording media storing audio encoded signals. In addition, the present invention can suppress a change in sampling rate at the time of switching contents with different audio encoded data formats, so that it is possible to enjoy good reproduced sound without interruption.

１００入力部
１０１解析部
１０２、１０１０プリアンブル生成部
１０３成形部
１０４出力部
２００、１０００ビットストリーム送信装置
２０１、１００１ビットストリーム受信装置
２０２プリアンブル除去部
２０３エレメンタリデコード部
２１０ビットストリーム送受信システム
２２０テレビ
２２１デコーダ
２２２２チャネルダウンミックス部
２２３スピーカ
２３０シアターシステム
２３１マルチチャネルスピーカ
３００エレメンタリオーディオストリーム
３０１オーディオビットストリーム
１０１１能力情報通知部DESCRIPTION OF SYMBOLS 100 Input part 101 Analysis part 102,1010 Preamble generation part 103 Forming part 104 Output part 200, 1000 Bit stream transmission apparatus 201, 1001 Bit stream reception apparatus 202 Preamble removal part 203 Elementary decoding part 210 Bit stream transmission / reception system 220 Television 221 Decoder 222 2-channel downmix unit 223 speaker 230 theater system 231 multi-channel speaker 300 elementary audio stream 301 audio bit stream 1011 capability information notification unit

Claims

A bitstream transmission device for transmitting an audio bitstream of a predetermined format,
An input unit for obtaining an elementary audio stream obtained by encoding an audio signal;
An analysis unit for analyzing the sampling frequency and encoding format of the elementary audio stream;
A preamble generation unit that generates a preamble signal including information indicating an encoding format of the elementary audio stream according to an analysis result of the analysis unit;
A shaping unit for shaping the audio bitstream of the predetermined format by adding the preamble signal generated by the preamble generation unit to the elementary audio stream;
An output unit that outputs the audio bitstream formed by the forming unit;
The preamble generation unit has an encoding format suitable for the predetermined format of the audio bitstream according to the sampling frequency and the encoding format obtained from the analysis result of the analysis unit, and the elementary A bit stream transmission device that describes an encoding format different from an encoding format obtained from an analysis result of an audio stream as information indicating the encoding format in the preamble signal.

The bit stream according to claim 1, wherein the preamble generation unit describes information indicating the encoding format in the preamble signal so that an output transmission rate of data output from the output unit falls within a predetermined range. Transmitter device.

The audio bit stream of the predetermined format is an audio bit stream defined by the IEC 61937 standard,
When the sampling frequency obtained from the analysis result of the analysis unit is a predetermined first frequency, the encoding format corresponding to the first frequency is outside the corresponding range of the IEC 61937 standard. For example, the preamble generation unit describes an encoding format different from the analyzed encoding format in terms of presence / absence of SBR (Spectral Band Replication) information in the elementary audio stream as information indicating the encoding format. The bit stream transmission apparatus according to claim 1, wherein a preamble signal is generated such that the audio bit stream falls within a corresponding range of the IEC 61937 standard.

The preamble generation unit further includes a code in which the elementary audio stream is not included in the SBR information when the sampling frequency obtained from the analysis result of the analysis unit is a predetermined second frequency. The bit stream transmission device according to claim 3, wherein the information indicating that the data is encoded in an encoded format is described as information indicating the encoding format in the preamble signal.

The preamble generation unit includes the encoding format in which the elementary audio stream is included in the SBR information when the sampling frequency obtained from the analysis result of the analysis unit is a predetermined third frequency. The bit stream transmission device according to claim 3, wherein a preamble signal indicating that the encoding is performed in an encoding format including the SBR information is generated even if the encoding is not performed by the encoding.

The preamble generation unit, when the sampling frequency obtained from the analysis result of the analysis unit is a predetermined fourth frequency, the elementary audio stream is encoded in a format including the SBR information. The bit stream transmission apparatus according to claim 3, wherein a preamble signal indicating that the encoding is performed in an encoding format that does not include the SBR information even if encoded is generated.

A bitstream transmission device according to any one of claims 1 to 6, and a bitstream reception device that receives an audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream. A bitstream transmission / reception system comprising:
The bit stream receiver
A preamble removing unit that extracts the elementary audio stream by removing the preamble signal included in the received audio bitstream;
When the information indicating the encoding format of the elementary audio stream described in the preamble signal and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, A bitstream transmission / reception system comprising: an elementary decoding unit that decodes the elementary audio stream in an encoding format indicated by information indicating an encoding format described in the preamble signal.

A bitstream transmission / reception system comprising: the bitstream transmission device according to claim 5; and a bitstream reception device that receives an audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream. There,
The bit stream receiver
A preamble removing unit that extracts the elementary audio stream by removing the preamble signal included in the received audio bitstream;
When the information indicating the encoding format of the elementary audio stream described in the preamble signal and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, A bitstream transmission / reception system comprising: an elementary decoding unit that performs a decoding process using the SBR information with a signal expanded to a high frequency set to 0 in accordance with information indicating the encoding format described in a preamble signal.

A bitstream transmission / reception system comprising: the bitstream transmission device according to claim 6; and a bitstream reception device that receives an audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream. There,
The bit stream receiver
A preamble removing unit that extracts the elementary audio stream by removing the preamble signal from the received audio bitstream;
When the information indicating the encoding format of the elementary audio stream described in the preamble signal and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, the preamble A bitstream transmission / reception system comprising: an elementary decoding unit that generates a signal having a sampling frequency of ½ when the decoding process using the SBR information is performed according to information indicating the encoding format described in the signal .

The bitstream reception device has a capability notification unit that notifies the bitstream transmission device of information indicating the capability of the bitstream reception device,
The analysis unit analyzes FS information indicating a sampling rate of the elementary audio stream,
The preamble generation unit switches whether or not to generate a preamble signal indicating that SBR information is included according to information from the capability notification unit when the FS information is a predetermined value. The bit stream transmission / reception system of any one of 7-9.

An audio bit stream of a predetermined format formed by adding a preamble signal including information indicating the encoding format of the elementary audio stream to the elementary audio stream obtained by encoding the audio signal. When the sampling frequency of the elementary audio stream and the encoding format do not conform to the predetermined format of the audio bitstream, the encoding format conforming to the predetermined format of the audio bitstream is the preamble signal. A reception unit that receives an audio bitstream described as information indicating the encoding format in a bitstream transmission device;
A preamble removing unit that extracts the elementary audio stream by removing the preamble signal included in the received audio bitstream;
When the information indicating the encoding format of the elementary audio stream described in the preamble signal and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, A bitstream receiving apparatus comprising: an elementary decoding unit that decodes the elementary audio stream in an encoding format indicated by information indicating an encoding format described in the preamble signal.

When the sampling frequency of the elementary audio stream is a predetermined third frequency, the bit stream reception device encodes the elementary audio stream in the encoding format including the SBR information. Even if not, an audio bitstream including a preamble signal indicating that it is encoded in an encoding format including the SBR information is received,
The elementary decoding unit includes information indicating an encoding format of the elementary audio stream described in the preamble signal, and information indicating an encoding format of the elementary audio stream included in the elementary audio stream; The bit according to claim 11, wherein when the signal is inconsistent, a signal that is expanded to a high frequency is set to 0 according to information indicating the encoding format described in the preamble signal, and the decoding process using the SBR information is performed. Stream receiver.

When the sampling frequency of the elementary audio stream is a predetermined fourth frequency, the bit stream receiving device encodes the elementary audio stream in an encoding format including the SBR information. Even if the audio bit stream including the preamble signal indicating that it is encoded in the encoding format not including the SBR information,
The elementary decoding unit includes information indicating an encoding format of the elementary audio stream described in the preamble signal, and information indicating an encoding format of the elementary audio stream included in the elementary audio stream; If the signal is mismatched, a signal having a sampling frequency of ½ when the decoding process using the SBR information is performed is generated according to the information indicating the coding format described in the preamble signal. The bitstream receiving device described.

A bitstream transmission method for transmitting an audio bitstream in a predetermined format,
Acquire an elementary audio stream obtained by encoding an audio signal,
Analyzing the sampling frequency and encoding format of the elementary audio stream;
A preamble signal including information indicating an encoding format of the elementary audio stream is generated according to the analysis result,
By adding the generated preamble signal to the elementary audio stream, the audio bit stream of the predetermined format is formed,
Outputting the shaped audio bitstream;
In the generation of the preamble signal, the elementary audio stream is in an encoding format suitable for the predetermined format of the audio bitstream according to the sampling frequency obtained from the analysis result and the encoding format. A bitstream transmission method in which an encoding format different from the encoding format obtained from the analysis result is described as information indicating the encoding format in the preamble signal.

The audio bit stream of the predetermined format formed by adding a preamble signal including information indicating the encoding format of the elementary audio stream to the elementary audio stream obtained by encoding the audio signal. In accordance with the sampling frequency of the elementary audio stream and the encoding format, the encoding format conforms to the predetermined format of the audio bitstream and is obtained from the analysis result of the elementary audio stream. An encoding format different from the encoding format received from the bitstream transmitting device is an audio bitstream described as information indicating the encoding format in the preamble signal;
Extracting the elementary audio stream by removing the preamble signal contained in the received audio bitstream;
When the information indicating the encoding format of the elementary audio stream described in the preamble signal and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, A bitstream receiving method for decoding the elementary audio stream in an encoding format indicated by information indicating an encoding format described in the preamble signal.

An input unit for obtaining an elementary audio stream obtained by encoding an audio signal;
An analysis unit for analyzing the sampling frequency and encoding format of the elementary audio stream;
A preamble generation unit that generates a preamble signal including information indicating an encoding format of the elementary audio stream according to an analysis result of the analysis unit;
A shaping unit for shaping the audio bitstream of the predetermined format by adding the preamble signal generated by the preamble generation unit to the elementary audio stream;
An output unit that outputs the audio bitstream formed by the forming unit;
The preamble generation unit has an encoding format suitable for the predetermined format of the audio bitstream according to the sampling frequency and the encoding format obtained from the analysis result of the analysis unit, and the elementary An integrated circuit that describes an encoding format different from an encoding format obtained from an analysis result of an audio stream as information indicating the encoding format in the preamble signal.

The audio bit stream of the predetermined format formed by adding a preamble signal including information indicating the encoding format of the elementary audio stream to the elementary audio stream obtained by encoding the audio signal. In accordance with the sampling frequency of the elementary audio stream and the encoding format, the encoding format conforms to the predetermined format of the audio bitstream and is obtained from the analysis result of the elementary audio stream. A receiving unit that receives an audio bitstream described as information indicating the encoding format in the preamble signal from a bitstream transmitting device, wherein the encoding format is different from the encoding format described above.
A preamble removing unit that extracts the elementary audio stream by removing the preamble signal included in the received audio bitstream;
When the information indicating the encoding format of the elementary audio stream described in the preamble signal and the information indicating the encoding format of the elementary audio stream included in the elementary audio stream are inconsistent, An integrated circuit comprising: an elementary decoding unit that decodes the elementary audio stream in an encoding format indicated by information indicating the encoding format described in the preamble signal.

An audio bitstream described in a predetermined format,
The audio bit stream is formed by adding a preamble signal to an elementary audio stream obtained by encoding an audio signal,
The audio bitstream as information indicating the encoding format in the preamble signal in accordance with the sampling frequency and encoding format of the elementary audio stream described in the elementary audio stream. An audio bitstream in which an encoding format conforming to the predetermined format is described and an encoding format different from the encoding format described in the elementary audio stream is described.

As the information indicating the encoding format of the elementary audio stream, the encoding format described inside the elementary audio stream in terms of the presence or absence of SBR (Special Band Replication) information in the elementary audio stream; The audio bitstream according to claim 18, wherein different encoding formats are described.