EP2711924A1 - Bit stream transmission device, bit stream reception/transmission system, bit stream reception device, bit stream transmission method, bit stream reception method, and bit stream - Google Patents

Bit stream transmission device, bit stream reception/transmission system, bit stream reception device, bit stream transmission method, bit stream reception method, and bit stream Download PDF

Info

Publication number
EP2711924A1
EP2711924A1 EP12788834.5A EP12788834A EP2711924A1 EP 2711924 A1 EP2711924 A1 EP 2711924A1 EP 12788834 A EP12788834 A EP 12788834A EP 2711924 A1 EP2711924 A1 EP 2711924A1
Authority
EP
European Patent Office
Prior art keywords
bitstream
audio stream
elementary
coding format
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP12788834.5A
Other languages
German (de)
French (fr)
Other versions
EP2711924B1 (en
EP2711924A4 (en
Inventor
Kosuke Nishio
Shuji Miyasaka
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Socionext Inc
Original Assignee
Panasonic Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Panasonic Corp filed Critical Panasonic Corp
Publication of EP2711924A1 publication Critical patent/EP2711924A1/en
Publication of EP2711924A4 publication Critical patent/EP2711924A4/en
Application granted granted Critical
Publication of EP2711924B1 publication Critical patent/EP2711924B1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes

Definitions

  • the present invention relates to bitstream reception and transmission systems that include a bitstream transmission device that transmits compressed and coded audio signals to other devices and a reception device that receives audio signals transmitted from the bitstream transmission device and outputs audio.
  • Conventional data reproduction devices which read compressed and coded video and audio signals recorded on storage media such as CD-ROMs, DVD-ROMs, semiconductor memory, and hard disks, then decode (decompress) and output the read video and audio signals.
  • the data reproduction device sends the decompressed data to a video monitor and acoustic speakers, for example, which are connected to the data reproduction device.
  • Video and audio reproduction is thusly achieved.
  • the data reproduction device receives compressed and coded video and audio signals, and similar to reproduction with storage media, decodes and outputs the compressed and coded video and audio signals. Video and audio reproduction is thusly achieved.
  • MPEG-2 and MPEG-4 are main examples of image data coding standards.
  • Dolby Digital, Digital Theater System (DTS), MPEG Audio, and MPEG-4 Advanced Audio Coding (AAC) are main examples of audio data coding standards.
  • DTS Digital, Digital Theater System
  • AAC MPEG-4 Advanced Audio Coding
  • PCM Pulse Code Modulation
  • Dolby Digital is a registered trademark of Dolby Laboratories, USA.
  • DTS Digital Theater Systems, inc., USA.
  • Signals coded with these audio data coding methods are hereinafter referred to as elementary audio streams.
  • the data reproduction device is capable of outputting the elementary audio stream to an external device while decoding the elementary audio stream.
  • a digital television and audio-video receiver can be connected over Sony Philips Digital InterFace (S/PDIF) or High-Definition Multimedia Interface (HDMI).
  • S/PDIF Sony Philips Digital InterFace
  • HDMI High-Definition Multimedia Interface
  • transmission of the elementary audio stream from the digital television to the audio-video receiver in accordance with IEC 60958/IEC 61937 standards is possible.
  • Reproduction of the audio signal is achieved by the audio-video receiver decoding and outputting the received elementary audio stream.
  • FIG. 7 shows an example of the data construction of an output bitstream output in IEC 60958/IEC 61937.
  • (a) in FIG. 7 shows an example of the data construction of an elementary audio stream defined in IEC 60958/IEC 61937.
  • (b) in FIG. 7 is an example of the data construction of an audio bitstream for transmission of the elementary audio stream.
  • the elementary audio stream 300 includes a plurality of frames, which are units including a given number of samples (for example, 1024 samples).
  • Each frame includes a header portion (Header) and a coded data portion (CodedData).
  • a coded signal of actual sound is recorded in the CodedData, and attributes of the coded signal recorded in the CodedData, such as the sampling rate and number of channels, are recorded in the Header.
  • FIG. 8A and FIG. 8B show examples of the data format of the Header of the bitstream in IEC 60958/IEC 61937.
  • FIG. 8A is an example of descriptions found in a fixed Header in the ADTS format.
  • the 8th line from the top of the table shown in FIG. 8A shows a format for describing the sampling frequency of the coded signal in the ADTS format.
  • the sampling frequency of the coded signal be selected from 0x3 to 0x8.
  • FIG. 8B is an example of descriptions found in the Header in the LOAS format. As FIG. 8B shows, a description stipulating the sampling frequency of the coded signal is found in the Header in the LOAS format as well.
  • HEAAC High-Efficiency Advanced Audio Coding
  • SBR Spectral Band Replication
  • the audio bitstream 301 is an IEC 61937 format audio bitstream.
  • the IEC 61937 format audio bitstream has preamble signals named Pa, Pb, Pc, and Pd located in front of the elementary audio stream. Furthermore, there are instances where an invalid signal, called Stuffing, for adjusting signal length is inserted after the elementary audio stream.
  • FIG. 9 shows the data format of the preamble signal Pc shown in (b) in FIG. 7 .
  • the portion in FIG. 9 enclosed by the dashed line is an example of data for specifying the presence or absence of SBR information (signal indicating AAC or HEAAC) in LOAS format, in the preamble signal Pc.
  • SBR information signal indicating AAC or HEAAC
  • LOAS format LOAS format
  • bitstream transmission device When a bitstream transmission device using the above technology receives an elementary audio stream such as the elementary audio stream 300, the bitstream transmission device detects the presence or absence of SBR information using the Extension_payload in the elementary audio stream. Then, when SBR information is not present, the bitstream transmission device specifies AAC in accordance with the bit structure shown in FIG. 9 in the preamble signal Pc. On the other hand, when SBR information is present in Extension_payload, the bitstream transmission device specifies HEAAC.
  • the other preamble signals Pa, Pb, and Pd are also set in accordance with the bit structure thereof and the regulation of the meaning of the signal. Then, when necessary, an IEC 61937 format audio bitstream is generated by inserting stuffing.
  • FIG. 10 shows the relationship between the sampling rate of the elementary audio stream and the audio data coding format of the audio data selectable with respect to the sampling rate.
  • FIG. 10 shows, when the AAC sampling rate is either 16 kHz, 22.05 kHz, or 24 kHz and the audio data coding format is AAC, the bitstream standard is not defined (first through third rows from the top in FIG. 10 ). Moreover, when the AAC sampling rate is 32 kHz or 44.1 kHz and the audio data coding format is HEAAC (AAC coded signal with SBR information included therein), the bitstream standard is not defined (fourth through fifth rows from the top in FIG. 10 ).
  • the second problem is that when, for example, coded data of different audio data coding formats are included in the same content, instances of switching between output sampling frequency in the output bitstream increases.
  • the audio data coding format of the program content is HEAAC and the AAC sampling rate is 48 kHz
  • the audio data coding format of the commercial content is AACLC and the AAC sampling rate is 48 kHz
  • switching of the output sampling frequency occurs at switches between program content and commercial content.
  • FIG. 11A and FIG. 11B show the difference between the frequency band at which an HEAAC decoder decodes and performs reproduction and the frequency band at which an AAC decoder decodes and performs reproduction.
  • FIG. 11A shows the frequency band in which reproduction in HEAAC of coded data having an AAC sampling rate of 48 KHz is possible. Compared to the reproduction frequency band of a signal in AAC, HEAAC reproduces a signal having a bandwidth twice that of AAC by using SBR (Spectral Band Replication) technology.
  • SBR Spectrum Band Replication
  • FIG. 11B shows the AAC reproduction frequency band of coded data having an AAC sampling rate of 48 KHz.
  • a signal is generated having the same frequency band as the frequency band reproduced by the AAC portion in HEAAC.
  • FIG. 11A and FIG. 11B demonstrate, a signal having an IEC 60958 frame rate of 96 kHz is output during the program content portion, and a signal having an IEC 60958 frame rate of 48 kHz is output during the commercial content portion.
  • processing is required for resynchronization in response to the change in IEC 60958 frame rate of the bitstream received in the audio-video receiver. Since this processing takes time, a problem arises in that a break in sound occurs when a switch is made between program content and commercial content.
  • the present invention aims to provide a bitstream transmission device capable of transmission of a compatible audio bitstream even when the actual coding format corresponding to the sampling frequency does not conform to a given format.
  • a bitstream transmission device that transmits an audio bitstream in a given format
  • the bitstream transmission device including: an input unit configured to acquire an elementary audio stream obtained by coding an audio signal; an analysis unit configured to analyze a sampling frequency and a coding format of the elementary audio stream; a preamble generation unit configured to generate, based on a result of the analysis by the analysis unit, a preamble signal including information indicating a coding format of the elementary audio stream; a forming unit configured to form the audio bitstream in the given format by incorporating in the elementary audio stream the preamble signal generated by the preamble generation unit; and an output unit configured to output the audio bitstream formed by the forming unit, wherein based on the sampling frequency and the coding format resulting from the analysis by the analysis unit, the preamble generation unit is configured to specify, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the
  • the preamble generation unit is configured to specify, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from the coding format of the elementary audio stream resulting from the analysis by the analysis unit.
  • the preamble generation unit may be configured to specify the information included in the preamble signal and indicating the coding format to limit an output transmission rate of data output from the output unit to a predetermined range.
  • the "predetermined range” is, for example, 1536 kbps, 1411.2 kbps, and 1024 kbps, and is a bit rate transmittable in accordance with a standard. This enables the bitstream transmission device according to an aspect of the present invention to make the output transmission rate of an audio bitstream defined in accordance with the sampling frequency and the coding format of the elementary audio stream a transmittable transmission rate.
  • the audio bitstream in the given format may be an audio bitstream in a format defined by the IEC 61937 standard, and when the sampling frequency resulting from the analysis by the analysis unit is a predetermined first frequency and the coding format corresponding to the first frequency does not comply with the IEC 61937 standard, to generate the preamble signal that conforms the audio bitstream to the IEC 61937 standard, the preamble generation unit may be configured to specify, as the information indicating the coding format, a coding format different from the analyzed coding format in terms of presence or absence of spectral band replication (SBR) information in the elementary audio stream.
  • SBR spectral band replication
  • bitstream transmission device can generate and transmit an audio bitstream compatible with the given format with little processing load.
  • the preamble generation unit may be further configured to specify, as the information included in the preamble signal and indicating the coding format, that the elementary audio stream is coded in a coding format not including the SBR information.
  • the bitstream transmission device to, when the sampling frequency of the elementary audio stream is a second frequency, specify, as the information included in the preamble signal and indicating the coding format, that the elementary audio stream is coded in a coding format not including the SBR information. Consequently, when the sampling frequency of the elementary audio stream is the second frequency, since the elementary audio stream can be regularly decoded in a coding format without SBR information, it is possible to prevent fluctuation in the sampling frequency.
  • the preamble generation unit may be configured to generate the preamble signal indicating that the elementary audio stream is coded in a coding format including the SBR information, even when the elementary audio stream is not coded in a coding format including the SBR information.
  • bitstream transmission device This enables the bitstream transmission device according to an aspect of the present invention to perform control so that SBR processing is compulsory when the sampling frequency of the elementary audio stream is the third frequency.
  • the bitstream transmission device is capable of adapting the audio bitstream to a given format when the third frequency is in the low band.
  • the preamble generation unit may be configured to generate the preamble signal indicating that the elementary audio stream is coded in a coding format not including the SBR information, even when the elementary audio stream is coded in a coding format including the SBR information.
  • bitstream transmission device This enables the bitstream transmission device according to an aspect of the present invention to perform control so that decoding in AAC without SBR is performed when the sampling frequency of the elementary audio stream is the fourth frequency.
  • the bitstream transmission device is capable of adapting the audio bitstream to a given format when the fourth frequency is in the high band.
  • a bitstream reception and transmission system may be a bitstream reception and transmission system including the bitstream transmission device and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream
  • the bitstream reception device includes: a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  • a bitstream reception and transmission system may be a bitstream reception and transmission system including the bitstream transmission device and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream
  • the bitstream reception device includes: a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, perform decoding where a signal with an expanded high band is 0 and which uses the SBR information, in accordance with the information specified in the preamble signal and indicating the coding format.
  • a bitstream reception and transmission system may be a bitstream reception and transmission system including the bitstream transmission device and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream
  • the bitstream reception device includes: a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal from the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, generate a signal having a sampling frequency that is half a sampling frequency when decoding processing using the SBR information is performed, in accordance with the information specified in the preamble signal and indicating the coding format.
  • the bitstream reception device may include a capability notification unit configured to notify the bitstream transmission device with information indicating a capability of the bitstream reception device, the analysis unit may be configured to analyze Fs information indicating a sampling rate of the elementary audio stream, and when the Fs information indicates a predetermined value, the preamble generation unit may be configured to switch between generating or not generating a preamble signal indicating inclusion of SBR information, based on the information from the capability notification unit.
  • a bitstream reception device may include: a reception unit configured to receive, from a bitstream transmission device, an audio bitstream in a given format that (i) is formed by incorporating, into an elementary audio stream which is a coded audio signal, a preamble signal including information indicating a coding format of the elementary audio stream, and (ii) specifies, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream when a sampling frequency and the coding format of the elementary audio stream do not comply with the given format of the audio bitstream; a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to
  • bitstream reception device in the bitstream reception and transmission system to, when information specified in the preamble signal and indicating a coding format and information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream in the coding format indicated by the information that is specified in the preamble signal.
  • the bitstream reception device is capable of decoding audio bitstreams without discrepancy, even if information indicating a coding format included in the elementary audio stream and information indicating a coding format included in the preamble signal are inconsistent.
  • the bitstream reception device may receive the audio bitstream including the preamble signal indicating that the elementary audio stream is coded in a coding format including SBR information when the sampling frequency of the elementary audio stream is a predetermined third frequency, even when the elementary audio stream is not coded in a coding format including the SBR information
  • the elementary decoding unit may be configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, perform decoding where a signal with an expanded high band is 0 and which uses the SBR information, in accordance with the information specified in the preamble signal and indicating the coding format.
  • the bitstream reception device may receive the audio bitstream including the preamble signal indicating that the elementary audio stream is coded in a coding format not including SBR information when the sampling frequency of the elementary audio stream is a predetermined fourth frequency, even when the elementary audio stream is coded in a coding format including the SBR information
  • the elementary decoding unit may be configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, generate a signal having a sampling frequency that is half a sampling frequency when decoding processing using the SBR information is performed, in accordance with the information specified in the preamble signal and indicating the coding format.
  • the present invention provides a bitstream transmission device capable of transmission of a compatible audio bitstream even when the actual coding format corresponding to the sampling frequency does not conform to a given format.
  • bitstream transmission in IEC 61937 is possible even if the input elementary audio stream does not comply with the IEC 61937 standard.
  • bitstream transmission in IEC 61937 is possible even in the case of AAC at a low Fs.
  • bitstream transmission in IEC 61937 is possible even in the case of HEAAC at a high Fs.
  • FIG. 1 shows the configuration of the bitstream transmission device 200 according to Embodiment 1.
  • the bitstream transmission device 200 includes an input unit 100, an analysis unit 101, a preamble generation unit 102, a forming unit 103, and an output unit 104.
  • the input unit 100 acquires an elementary audio stream 300 obtained by coding an audio signal.
  • the analysis unit 101 analyzes the sampling rate and the audio data coding format of the acquired elementary audio stream 300.
  • the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, a preamble signal forming a portion of the audio bitstream which is in a format defined by IEC 61937.
  • the preamble generation unit 102 when the sampling frequency resulting from the analysis by the analysis unit 101 is a predetermined value, the preamble generation unit 102 generates the preamble signal by changing the parameter indicating the audio data coding format of the elementary audio stream 300 to a value in accordance with IEC 61937.
  • the forming unit 103 forms an IEC 61937 format audio bitstream by incorporating in the elementary audio stream 300 the preamble signal generated by the preamble generation unit 102.
  • the output unit 104 outputs the audio bitstream 301 formed by the forming unit 103.
  • FIG. 2 shows an example of the configuration of the bitstream reception device according to Embodiment 1.
  • the bitstream reception device 201 receives the audio bitstream 301 transmitted from the bitstream transmission device 200 shown in FIG. 1 , and decodes the received audio bitstream 301.
  • This bitstream reception device 201 includes a preamble removal unit 202 and an elementary decoding unit 203.
  • the preamble removal unit 202 is installed in the bitstream reception device 201 and extracts the elementary audio stream 300 by removing the preamble signals Pa, Pb, Pc, and Pd from the IEC 61937 format audio bitstream 301 transmitted from the bitstream transmission device 200.
  • the elementary decoding unit 203 is installed in the bitstream reception device 201 and generates an audio signal (multi-channel decoded signal) by decoding the elementary audio stream 300 extracted by the preamble removal unit 202, and outputs the generated audio signal.
  • the elementary decoding unit 203 decodes the elementary audio stream 300 in the audio data coding format indicated in the preamble signal Pc.
  • FIG. 3 shows an example of implementation of a transmission and reception system including the bitstream transmission device 200 shown in FIG. 1 and the bitstream reception device 201 shown in FIG. 2 with audio equipment.
  • the bitstream transmission device 200 and the bitstream reception device 201 included in the bitstream reception and transmission system 210 are connected via a cable for transmitting the IEC 61937 format audio bitstream 301.
  • This cable is, for example, an HDMI cable.
  • the bitstream transmission device 200 is installed in, for example, a television 220, and is connected to, for example, two-channel stereo speakers via a decoder inside the television 220 and a two-channel downmix unit.
  • the television 220 obtains the elementary audio stream 300 from broadcast waves via an antenna or tuner (not shown in the Drawings) or from a CD via a CD driver (not shown in the Drawings), and inputs the elementary audio stream 300 to the input unit 100 of the bitstream transmission device 200.
  • the decoder 221 takes the elementary audio stream 300 inputted from the input unit 100 to the forming unit 103, and temporarily decodes it into a multi-channel audio signal.
  • the two-channel downmix unit 222 (2ch downmix unit) downmixes, into a two-channel stereo audio signal, the multi-channel audio signal obtained as a result of the elementary audio stream 300 being decoded by the decoder, and outputs the stereo audio signal to the stereo speakers 223.
  • the bitstream reception device 201 is installed in, for example, a theater system 230 including multi-channel speakers 231, such as 5.1 channel speakers.
  • This bitstream reception device 201 decodes the audio bitstream 301 that is in accordance with IEC 61937 and is inputted from the bitstream transmission device 200 over HDMI, for example, and outputs the multi-channel audio signal obtained as a result of the decoding to the multi-channel speakers 231.
  • bitstream transmission device 200 having the above-described configuration will be described in further detail.
  • the input unit 100 obtains the elementary audio stream 300.
  • the elementary audio stream 300 is a stream having the data format like that shown in FIG. 7 , and in Embodiment 1, the format of the header is LOAS, and the coded data is AAC coded data which does not include SBR information.
  • the Extension_payload 310 is not included in the CodedData in the elementary audio stream 300, and the sampling rate (sampling frequency) specified in the header is 24 kHz.
  • the analysis unit 101 analyzes the elementary audio stream 300.
  • the inputted elementary audio stream 300 has a sampling rate of 24 kHz and does not include SBR information, as described above, and this information is analyzed by the analysis unit 101.
  • the sampling rate is obtained by analyzing the bitstream in accordance with the stipulations in the LOAS header format shown in FIG. 8B .
  • the presence or absence of SBR information is obtained by analyzing the bitstream in accordance with the stipulations in the Extension_payload shown in FIG. 7 .
  • the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, an IEC 61937 format preamble signal.
  • the preamble generation unit 102 generates a preamble signal which indicates that the elementary audio stream 300 includes SBR information. More specifically, the preamble generation unit 102 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS.
  • the preamble generation unit 102 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 2. This causes the preamble signal Pc to indicate that the elementary audio stream 300 includes SBR information.
  • inclusion of SBR information means that the preamble signal Pc indicates that the elementary audio stream 300 is coded in HEAAC.
  • the sampling rate is 24 kHz, but it goes without saying that the above holds true even if the sampling rate is 22.05 kHz or 16 kHz.
  • the forming unit 103 incorporates the preamble signal outputted by the preamble generation unit 102 in the elementary audio stream 300.
  • the forming unit 103 forms an IEC 61937 format audio bitstream such as the audio bitstream 301 shown in FIG. 7 .
  • the output unit 104 outputs the audio bitstream 301, which is a signal from the forming unit 103, to an external device.
  • bitstream reception device 201 having the configuration shown in FIG. 2 will be described.
  • the bitstream transmission device 200 forms the inputted elementary audio stream 300 into the IEC 61937 format audio bitstream 301, and outputs the audio bitstream 301.
  • the sampling rate (sampling frequency) of the inputted elementary audio stream 300 is 24 kHz and coding which does not include SBR information is performed (AAC)
  • AAC coding which does not include SBR information
  • a code is specified in the preamble signal Pc among the preamble signals that indicates that SBR information is present (HEAAC).
  • HEAAC sampling rate (sampling frequency) of the inputted elementary audio stream 300
  • HEAAC a code is specified in the preamble signal Pc among the preamble signals that indicates that SBR information is not present
  • the preamble removal unit 202 extracts the elementary audio stream 300 by removing the preamble signals Pa, Pb, Pc, and Pd from the IEC 61937 format audio bitstream 301 transmitted from the bitstream transmission device 1000. If a stuffing signal is included in the IEC 61937 audio bitstream 301, the preamble removal unit 202 extracts the stuffing signal as well, and generates the elementary audio stream 300.
  • the elementary audio stream 300 generated in this manner is an AAC or HEAAC elementary audio stream, and as such, can be decoded in accordance with the decoding processes defined by MPEG standards.
  • the preamble removal unit 202 detects SBR presence/absence information, which indicates the presence or absence of SBR information, in the preamble signal Pc, and sends this SBR presence/absence information to the elementary decoding unit 203 down line.
  • the elementary decoding unit 203 generates an audio signal by decoding the extracted elementary audio stream 300. Moreover, the elementary decoding unit 203 refers to the SBR presence/absence information sent from the preamble removal unit 202, and determines whether the information indicating the presence or absence of the SBR information is included in the elementary audio stream 300 and the SBR presence/absence information from the preamble removal unit 202 are inconsistent or not. When inconsistent, the information shown in the SBR presence/absence information from the preamble removal unit 202 is followed.
  • the elementary decoding unit 203 performs bandwidth expansion processing.
  • the elementary decoding unit 203 cannot perform complete bandwidth expansion processing, but, for example, can perform processing which up-samples the sampling rate of the decoded signal by two times. In other words, the elementary decoding unit 203 performs processing equivalent to bandwidth expansion processing which makes a component duplicated in the high band by SBR 0.
  • the elementary decoding unit 203 when the SBR presence/absence information indicates "no SBR" even if the elementary audio stream 300 has a sampling rate of 48 kHz and indicates "(SBR information present", the elementary decoding unit 203 does not perform bandwidth expansion processing.
  • the elementary decoding unit 203 ignores and skips over the SBR information which is actually present.
  • the elementary decoding unit 203 does apply the SBR information which is actually present, but performs processing which generates a signal at 48 kHz as a result of down-sampling the outputted sampling rate in half.
  • the elementary decoding unit 203 may follow the information indicating the presence or absence of the SBR information included in the elementary audio stream 300.
  • the decoded signal generated in this manner is transmitted from the elementary decoding unit 203 to an external device, and converted into a signal listenable with speakers or headphones, for example.
  • the bitstream transmission device 200 includes an input unit 100 which acquires an elementary audio stream obtained by coding an audio signal; an analysis unit 101 which analyzes a sampling frequency and a coding format of the elementary audio stream; a preamble generation unit 102 which generates, based on a result of the analysis by the analysis unit 101, a preamble signal included in an IEC 61937 format audio bitstream; a forming unit 103 which forms the IEC 61937 format audio bitstream by incorporating in the elementary audio stream the preamble signal generated by the preamble generation unit 102; and an output unit 104 which outputs the audio bitstream formed by the forming unit 103, wherein when the sampling frequency resulting from the analysis by the analysis unit 101 is a predetermined value, the preamble generation unit 102 changes a parameter indicating the audio data coding format of the elementary audio stream to a format that complies with the IEC 61937 format of the audio bitstream.
  • bitstream transmission device 200 transmits the audio bitstream 301 to the bitstream reception device 201 without a problem, which was previously not transmittable due to incompatibility with IEC 61937.
  • the bitstream transmission device 200 is capable of outputting an audio bitstream 301 that complies with the IEC 61937 standard, even if the combination of the sampling rate and the audio data coding format of the elementary audio stream 300 does not comply with the IEC 61937 standard.
  • processing is substantially simplified.
  • Embodiment 1 an example is given that a sampling rate of 24 kHz (or 22.05 kHz or 16 kHz) and "no SBR information" are specified in the header, but the same processing can be performed when a sampling rate of 44.1 kHz (or 32 kHz) and "SBR information present" are specified in the header. The operations performed when this is the case are described below.
  • the analysis unit 101 analyzes the sampling rate and the audio data coding format of the elementary audio stream 300.
  • the analysis unit 101 analyzes that the inputted elementary audio stream 300 has a sampling rate of 44.1 kHz and includes SBR information.
  • the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, a preamble signal to be included in an IEC 61937 format audio bitstream 301.
  • the preamble generation unit 102 when the sampling rate of the elementary audio stream 300 is 44.1 kHz and SBR information is included, the audio bitstream does not comply with IEC 61937 standards, as FIG. 10 shows, and as such, the preamble generation unit 102 generates a preamble signal that indicates that SBR information is not included. More specifically, the preamble generation unit 102 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS.
  • the preamble generation unit 102 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 1. With this, the preamble signal Pc indicates that SBR information is not included. In other words, the preamble signal Pc indicates that the audio data coding format of the elementary audio stream 300 is AAC.
  • the forming unit 103 incorporates the preamble signal outputted by the preamble generation unit 102 in the elementary audio stream 300.
  • the forming unit 103 forms an IEC 61937 format audio bitstream 301 such as is shown in FIG. 7 .
  • the output unit 104 outputs, to an external device, the audio bitstream 301 formed by the forming unit 103.
  • the bitstream transmission device 200 is capable of outputting an audio bitstream 301 that complies with the IEC 61937 standard even if the inputted elementary audio stream 300 does not comply with the IEC 61937 standard.
  • the information indicating the presence or absence of SBR included in the preamble signal and the information indicating the presence or absence of SBR included in the coded data will be inconsistent, but in this case, processing may be preset to follow the information indicating the presence or absence of SBR included in the preamble signal.
  • processing may be preset to follow the information indicating the presence or absence of SBR included in the preamble signal.
  • an AAC elementary audio stream and an HEAAC elementary audio stream have reproduction compatibility, even if the audio bitstream is an inconsistent IEC 61937 audio bitstream, the stream is capable of being reproduced by an external reception device.
  • FIG. 4A and FIG. 4B This concept is depicted in FIG. 4A and FIG. 4B .
  • the horizontal axes in (a) through (b2) in FIG. 4A and (a) through (b2) in FIG. 4B represent frequency.
  • the signal is actually an HEAAC signal-that is to say, when the elementary audio stream 300 is a elementary audio stream coded in the HEAAC audio data coding format-and the preamble signal shows that the elementary audio stream 300 is coded in AAC, the elementary audio stream 300 is processed with an AAC decoder. Even in this case, while the same sound quality as when the audio is decoded in conventional HEAAC cannot be obtained, audible audio decoded in AAC is reproduced ((b2) in FIG. 4A ). Moreover, if the preamble signal were to indicate AAC when the signal actually is AAC, there would be no harm in processing this AAC signal with an HEAAC decoder. In other words, when this AAC signal is decoded with an HEAAC decoder, it is possible to obtain the same sound quality as when decoded with a conventional AAC decoder ((b2) in FIG. 4B ).
  • the analysis unit 101 analyzes the sampling rate and the audio data coding format of the elementary audio stream 300.
  • the analysis unit 101 analyzes that the inputted elementary audio stream 300 has a sampling rate of 48 kHz and includes SBR information.
  • the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, an IEC 61937 format preamble signal.
  • the sampling rate is 48 kHz, regardless of inclusion or exclusion of SBR information, in either case, the audio bitstream meets IEC 61937 standards, as FIG. 10 shows.
  • the preamble generation unit 102 according to this embodiment always generates the preamble signal to indicate that SBR information is not included, even if SBR information is included. More specifically, the preamble generation unit 102 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS.
  • the preamble generation unit 102 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 1. This causes the preamble signal Pc to indicate that the elementary audio stream 300 does not include SBR information. In other words, when the sampling rate is 48 kHz, the preamble signal Pc is always generated to indicate that the elementary audio stream 300 is coded in AAC.
  • the forming unit 103 incorporates the preamble signal outputted by the preamble generation unit 102 in the elementary audio stream 300.
  • the forming unit 103 forms an IEC 61937 format audio bitstream 301 such as is shown in FIG. 7 .
  • the output unit 104 outputs, to an external device, the audio bitstream 301 generated by the forming unit 103.
  • the bitstream transmission device outputs a signal having an IEC 60958 frame rate of 96 KHz for the program content, and outputs a signal having a 48 KHz IEC 60958 frame rate for the commercial content.
  • FIG. 5A through FIG. 5C illustrate transmission examples of streams based on the stream output stipulations shown in FIG. 10 .
  • the circles containing the letters a, b, c, and d represent the preamble signals Pa, Pb, Pc, and Pd
  • the shaded circles represent the elementary audio stream
  • the blank circles represent padding data, which is invalid data for adjusting the data length.
  • the header format of the transmitted stream is LOAS.
  • FIG. 5A illustrates why it is possible to transmit a stream coded in AAC (no SBR information) at a sampling rate of 48 kHz, for example.
  • AAC no SBR information
  • one sample indicates 32 bits.
  • the sampling frequency of the elementary audio stream is 48 kHz
  • the bitstream transmission device transmits data at a transmission rate of 1536 kbps (48 kHz x 32 bits). More specifically, the preamble signals Pa, Pb, Pc, and Pd are added every 21.333 msec (time for 1024 samples at 48 kHz), and 32768-bit, one frame signals are continuously transmitted.
  • the decoder when AAC at 48 kHz, the decoder generates PCM data every 21.333 msec. As such, since the signal supply rate and the decoder processing rate match, the decoder can processes the signals without discrepancy.
  • FIG. 5B illustrates why it is possible to transmit a stream coded in AAC (no SBR information) at a sampling rate of 24 kHz, for example.
  • the bitstream transmission device transmit data at a transmission rate of 768 kbps (24 kHz x 32 bits), but the bitstream transmission device does not have a 768 kbps mode. If the bitstream transmission device were to use a transmission rate of 1536 kbps, the data would be transmitted as is shown in FIG. 5B .
  • FIG. 5C illustrates why it is possible to transmit a stream coded in HEAAC (SBR information present) at a sampling rate of 24 kHz, for example.
  • the bitstream transmission device considers the entire stream to be a signal having a sampling rate of 48 kHz, and as such, transmits the data at a transmission rate of 1536 kbps (48 kHz x 32 bits). More specifically, the preamble signals Pa, Pb, Pc, and Pd are added every 42.6667 msec (time for 2048 samples at 48 kHz), and signals are continuously transmitted per frame.
  • the decoder since the decoder generates PCM data every 42.6667 msec when HEAAC at 24 kHz, the supply rate of the signals and the processing rate of the decoder match. As such, the decoder can process the signals without discrepancy.
  • the bitstream transmission device can use these transmission rates, but 705.6 kbps (22.05 kHz x 32 bits) and 512 kbps (16 kHz x 32 bits) are not standardized.
  • streams coded in AAC at a sampling rate of 48 kHz and streams coded in HEAAC at a sampling rate of 48 kHz can be transmitted with no problem.
  • the bitstream transmission device transmits data at a transmission rate of 1536 kbps (48 kHz x32 bits). More specifically, the preamble signals Pa, Pb, Pc, and Pd are added every 21.333 msec (time for 1024 samples at 48 kHz), and signals are continuously transmitted per frame (32768 bits).
  • the decoder since the decoder generates PCM data every 21.333 msec when AAC at 48 kHz, the supply rate of the signals and the processing rate of the decoder match. As such, the decoder can process the signals without discrepancy.
  • the bitstream transmission device regards the sampling rate to be 96 kHz, and as such, transmits data at a transmission rate of 3072 kbps (96 kHz x 32 bits).
  • the preamble signals Pa, Pb, Pc, and Pd are added every 21.333 msec (time for 2048 samples at 96 kHz), and signals are transmitted per frame (32768 bits).
  • the decoder since the decoder generates PCM data every 21.333 msec when HEAAC at 48 kHz, the supply rate of the signals and the processing rate of the decoder match. As such, the decoder can process the signals without discrepancy.
  • the transmission rate a constant sampling rate by transmitting, as "no SBR", the stream coded in HEAAC (SBR present) at a sampling rate of 48 kHz. This makes it possible to prevent defects such as jumps in audio.
  • the bitstream reception and transmission system since a time lag occurs at the switching of the sampling rate, when the sampling rate of the elementary audio stream is 48 kHz, the bitstream reception and transmission system is described as performing processing as if the elementary audio stream were always coded in AAC, regardless of whether it is coded in HEAAC or AAC, but the present invention is not limited to this example. For example, if time lag occurring at the switching of the sampling rate is not a concern, the bitstream reception and transmission system may perform processing as if the elementary audio stream were always an HEAAC signal.
  • bitstream transmission and reception system 210 according to Embodiment 2 of the present invention will hereinafter be described with reference to the Drawings.
  • FIG. 6 shows the configuration of the bitstream transmission and reception system according to Embodiment 2.
  • the bitstream transmission and reception system 210 according to Embodiment 2 includes the bitstream transmission device 1000 and a bitstream reception device 1001.
  • the bitstream transmission device 1000 includes the input unit 100, the analysis unit 101, the forming unit 103, the output unit 104, and a preamble generation unit 1010.
  • all processing units having the same reference numerals as those in the bitstream transmission device 200 are the same as those disclosed in FIG. 1 according to Embodiment 1, and as such, descriptions thereof will be omitted.
  • the preamble generation unit 1010 receives capability information on the bitstream reception device 1001 from the bitstream reception device 1001, and based on the capability information, changes information included in the preamble signal Pc that indicates the audio data coding format of the elementary audio stream. More specifically, the preamble generation unit 1010 determines, based on the capability information received from the bitstream reception device 1001, whether or not the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 coded in HEAAC at a sampling rate of 48 kHz.
  • the preamble generation unit 1010 determines that the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 coded in HEAAC at a sampling rate of 48 kHz, and the elementary audio stream 300 is coded in HEAAC at a sampling rate of 48 kHz, the preamble generation unit 1010 generates the preamble signal Pc showing that the elementary audio stream 300 is coded in HEAAC, different than in Embodiment 1. More specifically, the preamble generation unit 1010 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS.
  • the preamble generation unit 1010 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 2. This causes the preamble signal Pc to indicate that the elementary audio stream 300 includes SBR information. In other words, the preamble signal Pc indicates that the elementary audio stream 300 is coded in HEAAC.
  • the bitstream reception device 1001 receives the audio bitstream 301 transmitted from the bitstream transmission device 1000, and decodes the received audio bitstream 301.
  • the bitstream reception device 1001 includes the preamble removal unit 202, the elementary decoding unit 203, and a capability information notification unit 1011.
  • all processing units having the same reference numerals as those in the bitstream reception device 201 are the same as those disclosed in FIG. 2 according to Embodiment 1, and as such, descriptions thereof will be omitted. The following focuses on the description of the capability information notification unit 1011.
  • the capability information notification unit 1011 stores, in internal memory, capability information indicating whether or not the elementary decoding unit 203 in the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 coded in HEAAC at a sampling rate of 48 kHz.
  • the capability information notification unit 1011 notifies the preamble generation unit 1010 of the bitstream transmission device 1000 with the capability information stored in memory.
  • the bitstream transmission device 1000 is capable of describing a parameter indicating that SBR information is present in the preamble signal Pc.
  • the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 in accordance with the preamble signal included in the audio bitstream 301 transmitted from the bitstream transmission device 1000. This has the advantageous effect that the user can enjoy content coded in HEAAC in high audio quality and having a bandwidth that has been expanded into the high band.
  • the bitstream reception device 1001 may, for example, interpret the header and Extension_payload of the elementary audio stream 300 extracted from the audio bitstream 301 received from the bitstream transmission device 200 according to Embodiment 1, and decode the elementary audio stream 300 according to the SBR information when SBR information is included in the elementary audio stream 300. Furthermore, even when SBR information is not included in the elementary audio stream 300 extracted from the audio bitstream 301, the bitstream reception device 1001 may perform HEAAC decoding in which the bandwidth is expanded into the high band.
  • the preamble generation unit 1010 specifies, as information included in the preamble signal and indicating the coding format, a coding format in which SBR information is not included in the elementary audio stream.
  • a coding format in which SBR information is not included in the elementary audio stream For example, in an actual DTV broadcast, even when the programming content is broadcast having an audio data coding format of HEAAC and an AAC sampling rate of 48 kHz, while on the other hand the commercial content is broadcast having an audio data coding format of AACLC and an AAC sampling rate of 48 kHz, processing can be done without changing a final sampling rate of the decoded signal. As such, the transition between program content and commercial content is smooth.
  • the bitstream reception device 1001 is capable of reproducing the high audio quality audio signal decoded in HEAAC.
  • the bitstream reception and transmission system when the broadcast has a sampling rate of 24 kHz and indicates "(SBR information present), even if SBR information is lost as a result of a defect in the signal from poor reception of the broadcast waves, the final sampling rate of the output signal is held constant. In this way, it is possible for the bitstream reception and transmission system to keep the influences caused by poor reception to a minimum.
  • the preamble removal unit 202 extracts SBR presence/absence information and transmits it to the elementary decoding unit 203, but when the sampling rate that the elementary audio stream 300 indicates is from 16 kHz to 24 kHz, the bitstream reception device may, without doing the above, decode the elementary audio stream 300 under the pretense that "(SBR information present" is always indicated. Moreover, in contrast, when the sampling rate that the elementary audio stream 300 indicates is from 32 kHz to 48 kHz, the bitstream reception device may decode the elementary audio stream 300 under the pretense that "no SBR information" is always indicated
  • bitstream transmission device and the bitstream transmission and reception system according to the embodiments of the present invention has been described, but the present invention is not limited to these embodiments.
  • each processing unit included in the bitstream transmission device or the bitstream transmission and reception system according to the above embodiments are typical realized as an LSI, which is an integrated circuit.
  • Each of these processing units may be individually realized as a single chip, or a portion or all of the processing units may be realized as a single chip.
  • bitstream transmission device or the bitstream reception and transmission system according to the embodiments of the present invention may be realized by a processor such as a CPU executing a computer program.
  • the present invention may be the above-described computer program, and may be a non-transitory computer-readable recording medium having the computer program recorded thereon. Moreover, it goes without saying that the above-described computer program is capable of being circulated over a transmission medium such as the Internet.
  • bitstream transmission device and the bitstream reception and transmission system according to the above-described Embodiment 1 and Embodiment 2, as well the variations thereof, may be, at least in part, combined.
  • a plurality of function blocks may be realized as a single function block, a single function block may be divided into a plurality of function blocks, or a portion of functions of one block may be may be transferred to a different function block.
  • the functions of a plurality of function blocks having similar functions may be processed by a single piece of hardware or software in parallel or by time-division.
  • the bitstream transmission device is capable of changing information relating to the presence or absence of SBR information, and thus is capable of transmitting bitstreams using the IEC 61937 standard even if the bitstream inputted is not defined by the IEC 61937 standard.
  • the present invention is widely applicable to DTV broadcast reception devices and reproduction devices which read and reproduce a signal from a recording medium storing an audio coded signal.
  • the present invention is capable of suppressing occurrences of changes in sampling rate at the time of switching between content of different audio data coding formats, the present invention can reproduce quality sound without interruptions.

Abstract

A bitstream transmission device (200) according to the present invention includes: an input unit (100) which acquires an elementary audio stream obtained by coding an audio signal; an analysis unit (101) which analyzes a sampling frequency and a coding format of the elementary audio stream (300); a preamble generation unit (102) which, when the analyzed sampling frequency and coding format do not comply with a standard, generates a preamble signal specifying a coding format that complies with the standard as information indicating a coding format of the elementary audio stream (300); and a forming unit (103) which forms an audio bitstream (301) in accordance with the standard by incorporating the generated preamble signal in the elementary audio stream (300).

Description

    [Technical Field]
  • The present invention relates to bitstream reception and transmission systems that include a bitstream transmission device that transmits compressed and coded audio signals to other devices and a reception device that receives audio signals transmitted from the bitstream transmission device and outputs audio.
  • [Background Art]
  • Conventional data reproduction devices are known which read compressed and coded video and audio signals recorded on storage media such as CD-ROMs, DVD-ROMs, semiconductor memory, and hard disks, then decode (decompress) and output the read video and audio signals. The data reproduction device sends the decompressed data to a video monitor and acoustic speakers, for example, which are connected to the data reproduction device. Video and audio reproduction is thusly achieved. Moreover, in the case of digital broadcast reproduction, the data reproduction device receives compressed and coded video and audio signals, and similar to reproduction with storage media, decodes and outputs the compressed and coded video and audio signals. Video and audio reproduction is thusly achieved.
  • There are many standards for the compression and coding of video and audio signals.
  • For example, MPEG-2 and MPEG-4 are main examples of image data coding standards.
  • Moreover, Dolby Digital, Digital Theater System (DTS), MPEG Audio, and MPEG-4 Advanced Audio Coding (AAC) are main examples of audio data coding standards. Moreover, the Pulse Code Modulation (PCM) method is an example of an audio data coding standard which does not reduce the amount of data.
  • Dolby Digital is a registered trademark of Dolby Laboratories, USA.
  • DTS is a registered trademark of Digital Theater Systems, inc., USA.
  • Signals coded with these audio data coding methods are hereinafter referred to as elementary audio streams.
  • Moreover, the data reproduction device is capable of outputting the elementary audio stream to an external device while decoding the elementary audio stream.
  • For example, a digital television and audio-video receiver can be connected over Sony Philips Digital InterFace (S/PDIF) or High-Definition Multimedia Interface (HDMI). In this case, transmission of the elementary audio stream from the digital television to the audio-video receiver in accordance with IEC 60958/IEC 61937 standards is possible.
  • Reproduction of the audio signal is achieved by the audio-video receiver decoding and outputting the received elementary audio stream.
  • FIG. 7 shows an example of the data construction of an output bitstream output in IEC 60958/IEC 61937. (a) in FIG. 7 shows an example of the data construction of an elementary audio stream defined in IEC 60958/IEC 61937. (b) in FIG. 7 is an example of the data construction of an audio bitstream for transmission of the elementary audio stream.
  • As (a) in FIG. 7 shows, the elementary audio stream 300 includes a plurality of frames, which are units including a given number of samples (for example, 1024 samples). Each frame includes a header portion (Header) and a coded data portion (CodedData). A coded signal of actual sound is recorded in the CodedData, and attributes of the coded signal recorded in the CodedData, such as the sampling rate and number of channels, are recorded in the Header.
  • Formats for recording the attributes of the coded signal in the Header include Audio Data Transport Stream (ADTS) and Low Overhead Audio Stream (LOAS). FIG. 8A and FIG. 8B show examples of the data format of the Header of the bitstream in IEC 60958/IEC 61937. FIG. 8A is an example of descriptions found in a fixed Header in the ADTS format. The 8th line from the top of the table shown in FIG. 8A shows a format for describing the sampling frequency of the coded signal in the ADTS format. Here, it is stipulated that the sampling frequency of the coded signal be selected from 0x3 to 0x8. FIG. 8B is an example of descriptions found in the Header in the LOAS format. As FIG. 8B shows, a description stipulating the sampling frequency of the coded signal is found in the Header in the LOAS format as well.
  • International standard MPEG audio AAC and High-Efficiency Advanced Audio Coding (HEAAC) are well known as coding schemes for the coded signal recorded in the CodedData of the elementary audio stream 300 shown in (a) in FIG. 7. HEAAC applies a bandwidth expansion technique called Spectral Band Replication (SBR) to AAC. Moreover, HEAAC is capable of reproduction processing compatible with AAC if the SBR information is not purposely processed, or if the decoder is not capable of processing the SBR information.
  • It is possible to determine whether or not the coded signal described in the CodedData is coded using SBR technology or not by analyzing information in the CodedData. In other words, it is possible to analyze information in the CodedData by looking through the bit construction of the dashed line box in elementary audio stream 300 shown in (a) in FIG. 7. The dashed line box has an information storage structure in the AAC standard called Extension_payload. It is possible to determine whether SBR technology has been applied the coded signal based on whether SBR information is stored in the Extension_payload or not. In the elementary audio stream 300, Extension_payload is located at the end of each frame, but Extension_payload is not necessarily required to be present in every frame.
  • As (b) in FIG. 7 shows, the audio bitstream 301 is an IEC 61937 format audio bitstream. The IEC 61937 format audio bitstream has preamble signals named Pa, Pb, Pc, and Pd located in front of the elementary audio stream. Furthermore, there are instances where an invalid signal, called Stuffing, for adjusting signal length is inserted after the elementary audio stream.
  • FIG. 9 shows the data format of the preamble signal Pc shown in (b) in FIG. 7. The portion in FIG. 9 enclosed by the dashed line is an example of data for specifying the presence or absence of SBR information (signal indicating AAC or HEAAC) in LOAS format, in the preamble signal Pc. For example, when the value indicated by the 5 bits from bit 0 to bit 4 in the preamble signal Pc is 23, the elementary audio stream 300 is in LOAS format. Furthermore, when the value indicated by the 2 bits from bit 5 to bit 6 in the preamble signal Pc is 1, this indicates that SBR information is not included in the elementary audio stream. In other words, this indicates whether the coded signal recorded in the CodedData is coded in AACLC or AAC. Moreover, when the value indicated by the 5 bits from bit 0 to bit 4 in the preamble signal Pc is 23, and the value indicated by the 2 bits from bit 5 to bit 6 in the preamble signal Pc is 2, this indicates that SBR information is included in the elementary audio stream 300. In other words, this indicates that the coded signal recorded in the CodedData is coded in HEAAC.
  • When a bitstream transmission device using the above technology receives an elementary audio stream such as the elementary audio stream 300, the bitstream transmission device detects the presence or absence of SBR information using the Extension_payload in the elementary audio stream. Then, when SBR information is not present, the bitstream transmission device specifies AAC in accordance with the bit structure shown in FIG. 9 in the preamble signal Pc. On the other hand, when SBR information is present in Extension_payload, the bitstream transmission device specifies HEAAC. The other preamble signals Pa, Pb, and Pd are also set in accordance with the bit structure thereof and the regulation of the meaning of the signal. Then, when necessary, an IEC 61937 format audio bitstream is generated by inserting stuffing.
  • [Citation List] [Non Patent Literature]
    • [NPL 1] IEC 61937-11 Digital audio - Interface for non-linear PCM encoded audio bitstreams applying IEC 60958 - Part 11: MPEG-4 AAC and its extensions in LATM/LOAS
    • [NPL 2] ISO/IEC 14496-3 Information technology - Coding of audio-visual objects - Part 3: Audio
    • [NPL 3] ETSI TS 101 154 Technical Specification Digital Video Broadcasting (DVB); Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream
    [Summary of Invention] [Technical Problem]
  • However, there are two problems with the above described conventional techniques.
  • The first problem is that among the parameters defined by MPEG-4 AAC, there are instances where the bitstream standard is not defined. FIG. 10 shows the relationship between the sampling rate of the elementary audio stream and the audio data coding format of the audio data selectable with respect to the sampling rate.
  • As FIG. 10 shows, when the AAC sampling rate is either 16 kHz, 22.05 kHz, or 24 kHz and the audio data coding format is AAC, the bitstream standard is not defined (first through third rows from the top in FIG. 10). Moreover, when the AAC sampling rate is 32 kHz or 44.1 kHz and the audio data coding format is HEAAC (AAC coded signal with SBR information included therein), the bitstream standard is not defined (fourth through fifth rows from the top in FIG. 10). Especially in the latter case-that is to say, when the AAC sampling rate is 32 kHz or 44.1 kHz and the audio data coding format is HEAAC-even though operation is regulated in, for example, digital broadcast, the bitstream standard is not defined. As a result, there is a problem that even if a high sound quality elementary audio stream coded in HEAAC is received, this elementary audio stream cannot be reproduced and listened to on, for example, an external multi-channel speaker system.
  • The second problem is that when, for example, coded data of different audio data coding formats are included in the same content, instances of switching between output sampling frequency in the output bitstream increases.
  • For example, in digital broadcasting, when the audio data coding format of the program content is HEAAC and the AAC sampling rate is 48 kHz, while on the other hand, the audio data coding format of the commercial content is AACLC and the AAC sampling rate is 48 kHz, switching of the output sampling frequency occurs at switches between program content and commercial content.
  • FIG. 11A and FIG. 11B show the difference between the frequency band at which an HEAAC decoder decodes and performs reproduction and the frequency band at which an AAC decoder decodes and performs reproduction. FIG. 11A shows the frequency band in which reproduction in HEAAC of coded data having an AAC sampling rate of 48 KHz is possible. Compared to the reproduction frequency band of a signal in AAC, HEAAC reproduces a signal having a bandwidth twice that of AAC by using SBR (Spectral Band Replication) technology. In other words, the solid line in FIG. 11A shows a 0-24 kHz signal reproduced in 48 kHz AAC, and the 24-48 kHz band of the dashed line is the band expanded by SBR. With this, a 96 kHz signal is reproduced. On the other hand, FIG. 11B shows the AAC reproduction frequency band of coded data having an AAC sampling rate of 48 KHz. As FIG. 11B shows, a signal is generated having the same frequency band as the frequency band reproduced by the AAC portion in HEAAC.
  • As FIG. 11A and FIG. 11B demonstrate, a signal having an IEC 60958 frame rate of 96 kHz is output during the program content portion, and a signal having an IEC 60958 frame rate of 48 kHz is output during the commercial content portion. When such a bitstream is received, processing is required for resynchronization in response to the change in IEC 60958 frame rate of the bitstream received in the audio-video receiver. Since this processing takes time, a problem arises in that a break in sound occurs when a switch is made between program content and commercial content.
  • In light of the above, the present invention aims to provide a bitstream transmission device capable of transmission of a compatible audio bitstream even when the actual coding format corresponding to the sampling frequency does not conform to a given format.
  • [Solution to Problem]
  • In order to solve the above described problems, a bitstream transmission device according to an aspect of the present invention is a bitstream transmission device that transmits an audio bitstream in a given format, the bitstream transmission device including: an input unit configured to acquire an elementary audio stream obtained by coding an audio signal; an analysis unit configured to analyze a sampling frequency and a coding format of the elementary audio stream; a preamble generation unit configured to generate, based on a result of the analysis by the analysis unit, a preamble signal including information indicating a coding format of the elementary audio stream; a forming unit configured to form the audio bitstream in the given format by incorporating in the elementary audio stream the preamble signal generated by the preamble generation unit; and an output unit configured to output the audio bitstream formed by the forming unit, wherein based on the sampling frequency and the coding format resulting from the analysis by the analysis unit, the preamble generation unit is configured to specify, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from the coding format of the elementary audio stream resulting from the analysis by the analysis unit.
  • In this way, based on the sampling frequency and the coding format resulting from the analysis by the analysis unit, the preamble generation unit is configured to specify, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from the coding format of the elementary audio stream resulting from the analysis by the analysis unit. This enables the bitstream transmission device according to an aspect of the present invention to transmit a compatible audio bitstream even when the actual coding format corresponding to the sampling frequency does not conform to a given format.
  • Moreover, the preamble generation unit may be configured to specify the information included in the preamble signal and indicating the coding format to limit an output transmission rate of data output from the output unit to a predetermined range.
  • Here, the "predetermined range" is, for example, 1536 kbps, 1411.2 kbps, and 1024 kbps, and is a bit rate transmittable in accordance with a standard. This enables the bitstream transmission device according to an aspect of the present invention to make the output transmission rate of an audio bitstream defined in accordance with the sampling frequency and the coding format of the elementary audio stream a transmittable transmission rate.
  • Moreover, the audio bitstream in the given format may be an audio bitstream in a format defined by the IEC 61937 standard, and when the sampling frequency resulting from the analysis by the analysis unit is a predetermined first frequency and the coding format corresponding to the first frequency does not comply with the IEC 61937 standard, to generate the preamble signal that conforms the audio bitstream to the IEC 61937 standard, the preamble generation unit may be configured to specify, as the information indicating the coding format, a coding format different from the analyzed coding format in terms of presence or absence of spectral band replication (SBR) information in the elementary audio stream.
  • This enables the bitstream transmission device according to an aspect of the present invention to form an audio bitstream which complies with a given format when the sampling frequency and the coding format resulting from the analysis by the analysis unit do not conform to the given format of the audio bitstream, by simply changing the presence or absence of SBR information in the preamble signal. As such, the bitstream transmission device can generate and transmit an audio bitstream compatible with the given format with little processing load.
  • Moreover, when the sampling frequency resulting from the analysis by the analysis unit is a predetermined second frequency, the preamble generation unit may be further configured to specify, as the information included in the preamble signal and indicating the coding format, that the elementary audio stream is coded in a coding format not including the SBR information.
  • This enables the bitstream transmission device according to an aspect of the present invention to, when the sampling frequency of the elementary audio stream is a second frequency, specify, as the information included in the preamble signal and indicating the coding format, that the elementary audio stream is coded in a coding format not including the SBR information. Consequently, when the sampling frequency of the elementary audio stream is the second frequency, since the elementary audio stream can be regularly decoded in a coding format without SBR information, it is possible to prevent fluctuation in the sampling frequency.
  • Moreover, when the sampling frequency resulting from the analysis by the analysis unit is a predetermined third frequency, the preamble generation unit may be configured to generate the preamble signal indicating that the elementary audio stream is coded in a coding format including the SBR information, even when the elementary audio stream is not coded in a coding format including the SBR information.
  • This enables the bitstream transmission device according to an aspect of the present invention to perform control so that SBR processing is compulsory when the sampling frequency of the elementary audio stream is the third frequency. With this, the bitstream transmission device is capable of adapting the audio bitstream to a given format when the third frequency is in the low band.
  • Moreover, when the sampling frequency resulting from the analysis by the analysis unit is a predetermined fourth frequency, the preamble generation unit may be configured to generate the preamble signal indicating that the elementary audio stream is coded in a coding format not including the SBR information, even when the elementary audio stream is coded in a coding format including the SBR information.
  • This enables the bitstream transmission device according to an aspect of the present invention to perform control so that decoding in AAC without SBR is performed when the sampling frequency of the elementary audio stream is the fourth frequency. With this, the bitstream transmission device is capable of adapting the audio bitstream to a given format when the fourth frequency is in the high band.
  • Moreover, a bitstream reception and transmission system according to an aspect of the present invention may be a bitstream reception and transmission system including the bitstream transmission device and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream, wherein the bitstream reception device includes: a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  • Moreover, a bitstream reception and transmission system according to an aspect of the present invention may be a bitstream reception and transmission system including the bitstream transmission device and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream, wherein the bitstream reception device includes: a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, perform decoding where a signal with an expanded high band is 0 and which uses the SBR information, in accordance with the information specified in the preamble signal and indicating the coding format.
  • Moreover, a bitstream reception and transmission system according to an aspect of the present invention may be a bitstream reception and transmission system including the bitstream transmission device and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream, wherein the bitstream reception device includes: a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal from the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, generate a signal having a sampling frequency that is half a sampling frequency when decoding processing using the SBR information is performed, in accordance with the information specified in the preamble signal and indicating the coding format.
  • Moreover, the bitstream reception device may include a capability notification unit configured to notify the bitstream transmission device with information indicating a capability of the bitstream reception device, the analysis unit may be configured to analyze Fs information indicating a sampling rate of the elementary audio stream, and when the Fs information indicates a predetermined value, the preamble generation unit may be configured to switch between generating or not generating a preamble signal indicating inclusion of SBR information, based on the information from the capability notification unit.
  • Moreover, a bitstream reception device according to an aspect of the present invention may include: a reception unit configured to receive, from a bitstream transmission device, an audio bitstream in a given format that (i) is formed by incorporating, into an elementary audio stream which is a coded audio signal, a preamble signal including information indicating a coding format of the elementary audio stream, and (ii) specifies, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream when a sampling frequency and the coding format of the elementary audio stream do not comply with the given format of the audio bitstream; a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  • This enables the bitstream reception device in the bitstream reception and transmission system to, when information specified in the preamble signal and indicating a coding format and information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream in the coding format indicated by the information that is specified in the preamble signal. As such, the bitstream reception device is capable of decoding audio bitstreams without discrepancy, even if information indicating a coding format included in the elementary audio stream and information indicating a coding format included in the preamble signal are inconsistent.
  • Moreover, the bitstream reception device may receive the audio bitstream including the preamble signal indicating that the elementary audio stream is coded in a coding format including SBR information when the sampling frequency of the elementary audio stream is a predetermined third frequency, even when the elementary audio stream is not coded in a coding format including the SBR information, and the elementary decoding unit may be configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, perform decoding where a signal with an expanded high band is 0 and which uses the SBR information, in accordance with the information specified in the preamble signal and indicating the coding format.
  • Moreover, the bitstream reception device may receive the audio bitstream including the preamble signal indicating that the elementary audio stream is coded in a coding format not including SBR information when the sampling frequency of the elementary audio stream is a predetermined fourth frequency, even when the elementary audio stream is coded in a coding format including the SBR information, and the elementary decoding unit may be configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, generate a signal having a sampling frequency that is half a sampling frequency when decoding processing using the SBR information is performed, in accordance with the information specified in the preamble signal and indicating the coding format.
  • It should be noted that general or specific embodiments may be realized as a system, method, integrated circuit, computer program, storage media, or any elective combination thereof. Moreover, the present invention may be realized as an audio bitstream generated by the above-described bitstream transmission device.
  • [Advantageous Effects of Invention]
  • In this way, the present invention provides a bitstream transmission device capable of transmission of a compatible audio bitstream even when the actual coding format corresponding to the sampling frequency does not conform to a given format.
  • Moreover, according to an aspect of the present invention, it is possible to reduce disadvantages when the sampling rate of an input elementary audio stream and the coding format corresponding to that sampling rate are transmitted in accordance with the IEC 61937 standard.
  • Moreover, according to another aspect of the present invention, bitstream transmission in IEC 61937 is possible even if the input elementary audio stream does not comply with the IEC 61937 standard.
  • Furthermore, according to yet another aspect of the present invention, it is possible to keep the sampling rate of the decoded signal from varying when input elementary audio streams are continuously decoded.
  • Moreover, according to yet another aspect of the present invention, it is possible to normally generate a decoded signal even when a parameter P specified in the preamble signal added to the received bitstream and a parameter E corresponding to the parameter P and included in the elementary audio stream making up a part of the bitstream are inconsistent.
  • Moreover, according to yet another aspect of the present invention, bitstream transmission in IEC 61937 is possible even in the case of AAC at a low Fs.
  • According to yet another aspect of the present invention, it is possible to generate reproduced sound at the sampling rate according to the predetermined mid Fs, even in the case of AAC at a low Fs.
  • According to yet another aspect of the present invention, bitstream transmission in IEC 61937 is possible even in the case of HEAAC at a high Fs.
  • According to yet another aspect of the present invention, it is possible to generate reproduced sound at the sampling rate according to the predetermined mid Fs, even in the case of AAC at a high Fs.
  • [Brief Description of Drawings]
    • [FIG. 1] FIG. 1 shows an example of the configuration of the bitstream transmission device according to Embodiment 1 of the present invention.
    • [FIG. 2] FIG. 2 shows an example of the configuration of the bitstream reception device according to Embodiment 1 of the present invention.
    • [FIG. 3] FIG. 3 shows an example of usage of the bitstream reception and transmission system according to Embodiment 1 of the present invention with an audio system.
    • [FIG. 4A] FIG. 4A illustrates the difference in the reproduction band between AAC and HEAAC according to Embodiment 1 of the present invention.
    • [FIG. 4B] FIG. 4B illustrates the difference in the reproduction band between AAC and HEAAC according to Embodiment 1 of the present invention.
    • [FIG. 5A] FIG. 5A illustrates the meaning of the stream output regulations shown in FIG. 10, according to Embodiment 1 of the present invention.
    • [FIG. 5B] FIG. 5B illustrates the meaning of the stream output regulations shown in FIG. 10, according to Embodiment 1 of the present invention.
    • [FIG. 5C] FIG. 5C illustrates the meaning of the stream output regulations shown in FIG. 10, according to Embodiment 1 of the present invention.
    • [FIG. 6] FIG. 6 shows an example of the configuration of the bitstream transmission device according to Embodiment 2 of the present invention.
    • [FIG. 7] FIG. 7 shows an example of the data construction of an output bitstream in IEC 60958/IEC 61937.
    • [FIG. 8A] FIG. 8A is an example of descriptions found in a fixed Header in the ADTS format.
    • [FIG. 8B] FIG. 8B is an example of descriptions found in the Header in the LOAS format.
    • [FIG. 9] FIG. 9 shows the data format of the preamble signal Pc shown in (b) in FIG. 7.
    • [FIG. 10] FIG. 10 shows the presence or absence of a stream output regulation under IEC 61937 with respect to the combination of the sampling rate and the audio data coding format of the elementary audio stream.
    • [FIG. 11A] FIG. 11A shows the frequency band in which reproduction in HEAAC of coded data having an AAC sampling rate of 48 KHz is possible.
    • [FIG. 11B] FIG. 11B shows the frequency band in which reproduction in AAC of coded data having an AAC sampling rate of 48 KHz is possible.
    [Description of Embodiments]
  • Hereinafter, certain exemplary embodiments are described in greater detail with reference to the accompanying Drawings. It is to be noted that the embodiment described below shows a specific example of the present disclosure. The numerical values, shapes, materials, structural elements, the arrangement and connection of the structural elements, steps, the processing order of the steps etc. shown in the following exemplary embodiment are mere examples, and therefore do not limit the present disclosure. The present invention is defined by the scope of the claims. Therefore, among the structural elements in the following exemplary embodiment, structural elements not recited in any one of the independent claims defining the most generic part of the inventive concept are described as preferred structural elements, and are not absolutely necessary to overcome the problem according to the present invention.
  • (Embodiment 1)
  • The bitstream transmission device according to Embodiment 1 of the present invention will hereinafter be described with reference to the Drawings.
  • FIG. 1 shows the configuration of the bitstream transmission device 200 according to Embodiment 1. As FIG. 1 shows, the bitstream transmission device 200 includes an input unit 100, an analysis unit 101, a preamble generation unit 102, a forming unit 103, and an output unit 104.
  • The input unit 100 acquires an elementary audio stream 300 obtained by coding an audio signal.
  • The analysis unit 101 analyzes the sampling rate and the audio data coding format of the acquired elementary audio stream 300.
  • The preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, a preamble signal forming a portion of the audio bitstream which is in a format defined by IEC 61937. Here, when the sampling frequency resulting from the analysis by the analysis unit 101 is a predetermined value, the preamble generation unit 102 generates the preamble signal by changing the parameter indicating the audio data coding format of the elementary audio stream 300 to a value in accordance with IEC 61937.
  • The forming unit 103 forms an IEC 61937 format audio bitstream by incorporating in the elementary audio stream 300 the preamble signal generated by the preamble generation unit 102.
  • The output unit 104 outputs the audio bitstream 301 formed by the forming unit 103.
  • FIG. 2 shows an example of the configuration of the bitstream reception device according to Embodiment 1. As FIG. 2 shows, the bitstream reception device 201 receives the audio bitstream 301 transmitted from the bitstream transmission device 200 shown in FIG. 1, and decodes the received audio bitstream 301. This bitstream reception device 201 includes a preamble removal unit 202 and an elementary decoding unit 203.
  • The preamble removal unit 202 is installed in the bitstream reception device 201 and extracts the elementary audio stream 300 by removing the preamble signals Pa, Pb, Pc, and Pd from the IEC 61937 format audio bitstream 301 transmitted from the bitstream transmission device 200.
  • The elementary decoding unit 203 is installed in the bitstream reception device 201 and generates an audio signal (multi-channel decoded signal) by decoding the elementary audio stream 300 extracted by the preamble removal unit 202, and outputs the generated audio signal. Upon decoding the elementary audio stream 300, when the audio data coding format indicated in the preamble signal Pc and the audio data coding format indicated in Extension_payload are inconsistent, the elementary decoding unit 203 decodes the elementary audio stream 300 in the audio data coding format indicated in the preamble signal Pc.
  • FIG. 3 shows an example of implementation of a transmission and reception system including the bitstream transmission device 200 shown in FIG. 1 and the bitstream reception device 201 shown in FIG. 2 with audio equipment.
  • As FIG. 3 shows, the bitstream transmission device 200 and the bitstream reception device 201 included in the bitstream reception and transmission system 210 are connected via a cable for transmitting the IEC 61937 format audio bitstream 301. This cable is, for example, an HDMI cable. The bitstream transmission device 200 is installed in, for example, a television 220, and is connected to, for example, two-channel stereo speakers via a decoder inside the television 220 and a two-channel downmix unit. The television 220 obtains the elementary audio stream 300 from broadcast waves via an antenna or tuner (not shown in the Drawings) or from a CD via a CD driver (not shown in the Drawings), and inputs the elementary audio stream 300 to the input unit 100 of the bitstream transmission device 200. The decoder 221 takes the elementary audio stream 300 inputted from the input unit 100 to the forming unit 103, and temporarily decodes it into a multi-channel audio signal. The two-channel downmix unit 222 (2ch downmix unit) downmixes, into a two-channel stereo audio signal, the multi-channel audio signal obtained as a result of the elementary audio stream 300 being decoded by the decoder, and outputs the stereo audio signal to the stereo speakers 223.
  • The bitstream reception device 201 is installed in, for example, a theater system 230 including multi-channel speakers 231, such as 5.1 channel speakers. This bitstream reception device 201 decodes the audio bitstream 301 that is in accordance with IEC 61937 and is inputted from the bitstream transmission device 200 over HDMI, for example, and outputs the multi-channel audio signal obtained as a result of the decoding to the multi-channel speakers 231.
  • Hereinafter, operations performed by the bitstream transmission device 200 having the above-described configuration will be described in further detail.
  • First, the input unit 100 obtains the elementary audio stream 300. The elementary audio stream 300 is a stream having the data format like that shown in FIG. 7, and in Embodiment 1, the format of the header is LOAS, and the coded data is AAC coded data which does not include SBR information. The Extension_payload 310 is not included in the CodedData in the elementary audio stream 300, and the sampling rate (sampling frequency) specified in the header is 24 kHz.
  • Next, the analysis unit 101 analyzes the elementary audio stream 300. In Embodiment 1, the inputted elementary audio stream 300 has a sampling rate of 24 kHz and does not include SBR information, as described above, and this information is analyzed by the analysis unit 101. The sampling rate is obtained by analyzing the bitstream in accordance with the stipulations in the LOAS header format shown in FIG. 8B. Moreover, the presence or absence of SBR information is obtained by analyzing the bitstream in accordance with the stipulations in the Extension_payload shown in FIG. 7.
  • Next, the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, an IEC 61937 format preamble signal. When the elementary audio stream 300 has a sampling rate of 24 kHz and does not include SBR information, requirements to be an IEC 61937 format audio bitstream are not met, as FIG. 10 shows. As such, the preamble generation unit 102 generates a preamble signal which indicates that the elementary audio stream 300 includes SBR information. More specifically, the preamble generation unit 102 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS. Furthermore, the preamble generation unit 102 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 2. This causes the preamble signal Pc to indicate that the elementary audio stream 300 includes SBR information. Here, inclusion of SBR information means that the preamble signal Pc indicates that the elementary audio stream 300 is coded in HEAAC. In Embodiment 1, the sampling rate is 24 kHz, but it goes without saying that the above holds true even if the sampling rate is 22.05 kHz or 16 kHz.
  • Next, the forming unit 103 incorporates the preamble signal outputted by the preamble generation unit 102 in the elementary audio stream 300. In other words, the forming unit 103 forms an IEC 61937 format audio bitstream such as the audio bitstream 301 shown in FIG. 7.
  • Lastly, the output unit 104 outputs the audio bitstream 301, which is a signal from the forming unit 103, to an external device.
  • Hereinafter, operations performed by the bitstream reception device 201 having the configuration shown in FIG. 2 will be described.
  • First, the bitstream transmission device 200 forms the inputted elementary audio stream 300 into the IEC 61937 format audio bitstream 301, and outputs the audio bitstream 301.
  • For example, when the sampling rate (sampling frequency) of the inputted elementary audio stream 300 is 24 kHz and coding which does not include SBR information is performed (AAC), a code is specified in the preamble signal Pc among the preamble signals that indicates that SBR information is present (HEAAC). Moreover, for example, when the sampling rate (sampling frequency) of the inputted elementary audio stream 300 is 48 kHz and SBR information is present (HEAAC), a code is specified in the preamble signal Pc among the preamble signals that indicates that SBR information is not present (AAC).
  • The preamble removal unit 202 extracts the elementary audio stream 300 by removing the preamble signals Pa, Pb, Pc, and Pd from the IEC 61937 format audio bitstream 301 transmitted from the bitstream transmission device 1000. If a stuffing signal is included in the IEC 61937 audio bitstream 301, the preamble removal unit 202 extracts the stuffing signal as well, and generates the elementary audio stream 300. The elementary audio stream 300 generated in this manner is an AAC or HEAAC elementary audio stream, and as such, can be decoded in accordance with the decoding processes defined by MPEG standards. Moreover, the preamble removal unit 202 detects SBR presence/absence information, which indicates the presence or absence of SBR information, in the preamble signal Pc, and sends this SBR presence/absence information to the elementary decoding unit 203 down line.
  • The elementary decoding unit 203 generates an audio signal by decoding the extracted elementary audio stream 300. Moreover, the elementary decoding unit 203 refers to the SBR presence/absence information sent from the preamble removal unit 202, and determines whether the information indicating the presence or absence of the SBR information is included in the elementary audio stream 300 and the SBR presence/absence information from the preamble removal unit 202 are inconsistent or not. When inconsistent, the information shown in the SBR presence/absence information from the preamble removal unit 202 is followed.
  • For example, when the SBR presence/absence information indicates "SBR information present" even if the elementary audio stream 300 has a sampling rate of 24 kHz and indicates "no SBR", the elementary decoding unit 203 performs bandwidth expansion processing. Of course, since SBR information is not actually present in the Extension_payload of the elementary audio stream 300, the elementary decoding unit 203 cannot perform complete bandwidth expansion processing, but, for example, can perform processing which up-samples the sampling rate of the decoded signal by two times. In other words, the elementary decoding unit 203 performs processing equivalent to bandwidth expansion processing which makes a component duplicated in the high band by SBR 0.
  • Moreover, for example, when the SBR presence/absence information indicates "no SBR" even if the elementary audio stream 300 has a sampling rate of 48 kHz and indicates "(SBR information present", the elementary decoding unit 203 does not perform bandwidth expansion processing. The elementary decoding unit 203, for example, ignores and skips over the SBR information which is actually present. Moreover, the elementary decoding unit 203 does apply the SBR information which is actually present, but performs processing which generates a signal at 48 kHz as a result of down-sampling the outputted sampling rate in half.
  • It should be noted that when the information indicating the presence or absence of the SBR information included in the elementary audio stream 300 and the SBR presence/absence information from the preamble removal unit 202 are inconsistent, the elementary decoding unit 203 may follow the information indicating the presence or absence of the SBR information included in the elementary audio stream 300.
  • The decoded signal generated in this manner is transmitted from the elementary decoding unit 203 to an external device, and converted into a signal listenable with speakers or headphones, for example.
  • As described above, the bitstream transmission device 200 according to Embodiment 1 includes an input unit 100 which acquires an elementary audio stream obtained by coding an audio signal; an analysis unit 101 which analyzes a sampling frequency and a coding format of the elementary audio stream; a preamble generation unit 102 which generates, based on a result of the analysis by the analysis unit 101, a preamble signal included in an IEC 61937 format audio bitstream; a forming unit 103 which forms the IEC 61937 format audio bitstream by incorporating in the elementary audio stream the preamble signal generated by the preamble generation unit 102; and an output unit 104 which outputs the audio bitstream formed by the forming unit 103, wherein when the sampling frequency resulting from the analysis by the analysis unit 101 is a predetermined value, the preamble generation unit 102 changes a parameter indicating the audio data coding format of the elementary audio stream to a format that complies with the IEC 61937 format of the audio bitstream.
  • This enables the bitstream transmission device 200 transmit the audio bitstream 301 to the bitstream reception device 201 without a problem, which was previously not transmittable due to incompatibility with IEC 61937. As a result, the bitstream transmission device 200 is capable of outputting an audio bitstream 301 that complies with the IEC 61937 standard, even if the combination of the sampling rate and the audio data coding format of the elementary audio stream 300 does not comply with the IEC 61937 standard. In particular, in accordance with Embodiment 1, since it is possible to do this merely by changing the presence or absence of SBR information described in the preamble signal, processing is substantially simplified.
  • It should be noted that in Embodiment 1, an example is given that a sampling rate of 24 kHz (or 22.05 kHz or 16 kHz) and "no SBR information" are specified in the header, but the same processing can be performed when a sampling rate of 44.1 kHz (or 32 kHz) and "SBR information present" are specified in the header. The operations performed when this is the case are described below.
  • The analysis unit 101 analyzes the sampling rate and the audio data coding format of the elementary audio stream 300. The analysis unit 101 analyzes that the inputted elementary audio stream 300 has a sampling rate of 44.1 kHz and includes SBR information.
  • Next, the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, a preamble signal to be included in an IEC 61937 format audio bitstream 301. Here, when the sampling rate of the elementary audio stream 300 is 44.1 kHz and SBR information is included, the audio bitstream does not comply with IEC 61937 standards, as FIG. 10 shows, and as such, the preamble generation unit 102 generates a preamble signal that indicates that SBR information is not included. More specifically, the preamble generation unit 102 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS. Furthermore, the preamble generation unit 102 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 1. With this, the preamble signal Pc indicates that SBR information is not included. In other words, the preamble signal Pc indicates that the audio data coding format of the elementary audio stream 300 is AAC.
  • Next, the forming unit 103 incorporates the preamble signal outputted by the preamble generation unit 102 in the elementary audio stream 300. In other words, the forming unit 103 forms an IEC 61937 format audio bitstream 301 such as is shown in FIG. 7.
  • Lastly, the output unit 104 outputs, to an external device, the audio bitstream 301 formed by the forming unit 103.
  • With this, the bitstream transmission device 200 is capable of outputting an audio bitstream 301 that complies with the IEC 61937 standard even if the inputted elementary audio stream 300 does not comply with the IEC 61937 standard.
  • It should be noted that by forcefully changing the SBR information in the manner described above, the information indicating the presence or absence of SBR included in the preamble signal and the information indicating the presence or absence of SBR included in the coded data will be inconsistent, but in this case, processing may be preset to follow the information indicating the presence or absence of SBR included in the preamble signal. As previously described, since an AAC elementary audio stream and an HEAAC elementary audio stream have reproduction compatibility, even if the audio bitstream is an inconsistent IEC 61937 audio bitstream, the stream is capable of being reproduced by an external reception device.
  • This concept is depicted in FIG. 4A and FIG. 4B. The horizontal axes in (a) through (b2) in FIG. 4A and (a) through (b2) in FIG. 4B represent frequency.
    • (a) in FIG. 4A schematically shows a frequency spectrum of the frequency component included in the elementary audio stream 300 coded in HEAAC. The signal depicted by the solid line of the low band lower than 1/4 of the sampling frequency in FIG. 4A is coded in AAC, and signal depicted by the dashed line of the high band higher than 1/4 of the sampling frequency is coded in SBR.
    • (b1) in FIG. 4A shows the frequency component when the HEAAC signal shown in (a) in FIG. 4A is processed with an HEAAC decoder. In this case, the signal of the whole band including both the low band and the high band is reproduced.
    • (b2) in FIG. 4A shows the frequency component when the HEAAC signal shown in (a) in FIG. 4A is processed with an AAC decoder. In this case, the high band coded in SBR is not decoded, and only the signal of the low band coded in AAC is decoded and reproduced by the AAC decoder.
    • (a) in FIG. 4B schematically shows a frequency spectrum of the frequency component included in the elementary audio stream 300 coded in AAC. In this case, the whole band is coded in AAC.
    • (b1) in FIG. 4B shows the frequency component when the AAC signal shown in (a) in FIG. 4B is processed with an HEAAC decoder. In this case, the signal of the whole band is decoded and reproduced by the HEAAC decoder.
    • (b2) in FIG. 4B shows the frequency component when the AAC signal shown in (a) in FIG. 4B is processed with an AAC decoder. In this case as well, just like the case shown in (b1) in FIG. 4B, the signal of the whole band is decoded and reproduced by the AAC decoder.
  • In other words, when the signal is actually an HEAAC signal-that is to say, when the elementary audio stream 300 is a elementary audio stream coded in the HEAAC audio data coding format-and the preamble signal shows that the elementary audio stream 300 is coded in AAC, the elementary audio stream 300 is processed with an AAC decoder. Even in this case, while the same sound quality as when the audio is decoded in conventional HEAAC cannot be obtained, audible audio decoded in AAC is reproduced ((b2) in FIG. 4A). Moreover, if the preamble signal were to indicate AAC when the signal actually is AAC, there would be no harm in processing this AAC signal with an HEAAC decoder. In other words, when this AAC signal is decoded with an HEAAC decoder, it is possible to obtain the same sound quality as when decoded with a conventional AAC decoder ((b2) in FIG. 4B).
  • (Application Example of Embodiment 1)
  • It should be noted that in Embodiment 1, "a sampling rate of 24 kHz (or 22.05 kHz or 16 kHz) and 'no SBR information' are specified in the header" and "a sampling rate of 44.1 kHz (or 32 kHz) and 'SBR information present' are specified in the header" are used as examples of solutions to the first problem. However, it is also possible to solve the second problem when "a sampling rate of 48 kHz and 'SBR information present' are specified in the header" by performing the same processes as above. Hereinafter, processing performed when "a sampling rate of 48 kHz and 'SBR information present' are specified in the header" will be described in detail.
  • First, the analysis unit 101 analyzes the sampling rate and the audio data coding format of the elementary audio stream 300. Here, as previously described, the analysis unit 101 analyzes that the inputted elementary audio stream 300 has a sampling rate of 48 kHz and includes SBR information.
  • Next, the preamble generation unit 102 generates, based on the result of the analysis by the analysis unit 101, an IEC 61937 format preamble signal. When the sampling rate is 48 kHz, regardless of inclusion or exclusion of SBR information, in either case, the audio bitstream meets IEC 61937 standards, as FIG. 10 shows. However, the preamble generation unit 102 according to this embodiment always generates the preamble signal to indicate that SBR information is not included, even if SBR information is included. More specifically, the preamble generation unit 102 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS. Furthermore, the preamble generation unit 102 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 1. This causes the preamble signal Pc to indicate that the elementary audio stream 300 does not include SBR information. In other words, when the sampling rate is 48 kHz, the preamble signal Pc is always generated to indicate that the elementary audio stream 300 is coded in AAC.
  • Next, the forming unit 103 incorporates the preamble signal outputted by the preamble generation unit 102 in the elementary audio stream 300. In other words, the forming unit 103 forms an IEC 61937 format audio bitstream 301 such as is shown in FIG. 7.
  • Lastly, the output unit 104 outputs, to an external device, the audio bitstream 301 generated by the forming unit 103.
  • With this, it is possible to restrict an increase in instances of switches between output sampling frequencies in actual DTV broadcasts, for example. For example, in digital broadcasting, when the audio data coding format of the program content is HEAAC and the AAC sampling rate is 48 kHz, while on the other hand, the audio data coding format of the commercial content is AACLC and the AAC sampling rate is 48 kHz, the bitstream transmission device outputs a signal having an IEC 60958 frame rate of 96 KHz for the program content, and outputs a signal having a 48 KHz IEC 60958 frame rate for the commercial content. However, by changing the presence or absence of SBR information with the preamble generation unit 102 as described above, it is possible to prevent variation in reproduction frame rate upon switching between program content and commercial content. As a result, high quality audio reproduction cannot be expected in the program content due to bandwidth expansion, but it is possible to avoid defects such as jumps in audio resulting from the time it takes to keep up with the changes in IEC 60958 frame rate in the bitstream reception device 201.
  • FIG. 5A through FIG. 5C illustrate transmission examples of streams based on the stream output stipulations shown in FIG. 10. In FIG. 5A through FIG. 5C, the circles containing the letters a, b, c, and d represent the preamble signals Pa, Pb, Pc, and Pd, the shaded circles represent the elementary audio stream, and the blank circles represent padding data, which is invalid data for adjusting the data length. Moreover, here it is assumed that the header format of the transmitted stream is LOAS.
  • FIG. 5A illustrates why it is possible to transmit a stream coded in AAC (no SBR information) at a sampling rate of 48 kHz, for example. Here, one sample indicates 32 bits. Since the sampling frequency of the elementary audio stream is 48 kHz, the bitstream transmission device transmits data at a transmission rate of 1536 kbps (48 kHz x 32 bits). More specifically, the preamble signals Pa, Pb, Pc, and Pd are added every 21.333 msec (time for 1024 samples at 48 kHz), and 32768-bit, one frame signals are continuously transmitted.
  • Consequently, when AAC at 48 kHz, the decoder generates PCM data every 21.333 msec. As such, since the signal supply rate and the decoder processing rate match, the decoder can processes the signals without discrepancy.
  • FIG. 5B illustrates why it is possible to transmit a stream coded in AAC (no SBR information) at a sampling rate of 24 kHz, for example. When the signal has a sampling rate of 24 kHz, it is preferable that the bitstream transmission device transmit data at a transmission rate of 768 kbps (24 kHz x 32 bits), but the bitstream transmission device does not have a 768 kbps mode. If the bitstream transmission device were to use a transmission rate of 1536 kbps, the data would be transmitted as is shown in FIG. 5B. In this case, since the decoder generates 42.6667 msec PCM data per signal supplied every 21.333 msec (because 1024 samples at 24 kHz equates to 42.6667 msec), the supply of signals from the bitstream transmission device becomes excessive with respect to the processing rate of the decoder. This causes the stream to overflow.
  • FIG. 5C illustrates why it is possible to transmit a stream coded in HEAAC (SBR information present) at a sampling rate of 24 kHz, for example. When the stream is coded in HEAAC, even if the sampling rate of the AAC portion is 24 kHz, as FIG. 11A shows, the bitstream transmission device considers the entire stream to be a signal having a sampling rate of 48 kHz, and as such, transmits the data at a transmission rate of 1536 kbps (48 kHz x 32 bits). More specifically, the preamble signals Pa, Pb, Pc, and Pd are added every 42.6667 msec (time for 2048 samples at 48 kHz), and signals are continuously transmitted per frame.
  • Consequently, since the decoder generates PCM data every 42.6667 msec when HEAAC at 24 kHz, the supply rate of the signals and the processing rate of the decoder match. As such, the decoder can process the signals without discrepancy.
  • Similarly, it is possible to explain the reason why a stream coded in AAC at a sampling rate of 22.05 kHz cannot be transmitted, even though a stream coded in AAC at a sampling rate of 44.1 kHz and a stream coded in HEAAC at a sampling rate of 22.05 kHz can be transmitted. Moreover, it is possible to explain the reason why a stream coded in AAC at a sampling rate of 16 kHz cannot be transmitted, even though a stream coded in AAC at a sampling rate of 32 kHz and a stream coded in HEAAC at a sampling rate of 16 kHz can be transmitted.
  • This is because the output transmission rates 1411.2 kbps (44.1 kHz x 32 bits) and 1024 kbps (32 kHz x 32 bits) are each standardized by IEC 60958, and as such, the bitstream transmission device can use these transmission rates, but 705.6 kbps (22.05 kHz x 32 bits) and 512 kbps (16 kHz x 32 bits) are not standardized.
  • Different from the above example, streams coded in AAC at a sampling rate of 48 kHz and streams coded in HEAAC at a sampling rate of 48 kHz can be transmitted with no problem.
  • This is because in a stream coded in AAC at a sampling rate 48 kHz, the sampling rate is 48 kHz, and as such, the bitstream transmission device transmits data at a transmission rate of 1536 kbps (48 kHz x32 bits). More specifically, the preamble signals Pa, Pb, Pc, and Pd are added every 21.333 msec (time for 1024 samples at 48 kHz), and signals are continuously transmitted per frame (32768 bits).
  • Consequently, since the decoder generates PCM data every 21.333 msec when AAC at 48 kHz, the supply rate of the signals and the processing rate of the decoder match. As such, the decoder can process the signals without discrepancy.
  • Moreover, with a stream coded in HEAAC at a sampling rate of 48 kHz, the bitstream transmission device regards the sampling rate to be 96 kHz, and as such, transmits data at a transmission rate of 3072 kbps (96 kHz x 32 bits). Moreover, the preamble signals Pa, Pb, Pc, and Pd are added every 21.333 msec (time for 2048 samples at 96 kHz), and signals are transmitted per frame (32768 bits).
  • Consequently, since the decoder generates PCM data every 21.333 msec when HEAAC at 48 kHz, the supply rate of the signals and the processing rate of the decoder match. As such, the decoder can process the signals without discrepancy.
  • With the above, when the elementary audio stream is coded at a sampling rate of 48 kHz, it is possible to perform transmission without problem when either AAC or HEAAC. On the other hand, when a stream coded in AAC at a sampling rate of 48 kHz and a stream coded in HEAAC at a sampling rate of 48 kHz are switched between, switching between a stream having a transmission rate of 1536 kbps and a stream having a transmission rate of 3072 kbps occurs. At this time, in the whole bitstream reception and transmission system, it is necessary to perform processing which regains synchronization such as the switching of operating frequency of the A/D converter, for example. Since time is required for the processing which regains synchronization of the entire bitstream transmission and reception system, a time lag occurs upon switching of the sampling rate. For example, a defect like a break in audio at the transition from program to commercial occurs.
  • In response to this, it is possible to make the transmission rate a constant sampling rate by transmitting, as "no SBR", the stream coded in HEAAC (SBR present) at a sampling rate of 48 kHz. This makes it possible to prevent defects such as jumps in audio.
  • It should be noted that in the above embodiment, since a time lag occurs at the switching of the sampling rate, when the sampling rate of the elementary audio stream is 48 kHz, the bitstream reception and transmission system is described as performing processing as if the elementary audio stream were always coded in AAC, regardless of whether it is coded in HEAAC or AAC, but the present invention is not limited to this example. For example, if time lag occurring at the switching of the sampling rate is not a concern, the bitstream reception and transmission system may perform processing as if the elementary audio stream were always an HEAAC signal.
  • (Embodiment 2)
  • Hereinafter, a bitstream transmission and reception system 210 according to Embodiment 2 of the present invention will hereinafter be described with reference to the Drawings.
  • FIG. 6 shows the configuration of the bitstream transmission and reception system according to Embodiment 2. In FIG. 6, the bitstream transmission and reception system 210 according to Embodiment 2 includes the bitstream transmission device 1000 and a bitstream reception device 1001.
  • The bitstream transmission device 1000 includes the input unit 100, the analysis unit 101, the forming unit 103, the output unit 104, and a preamble generation unit 1010. In the bitstream transmission device 1000, all processing units having the same reference numerals as those in the bitstream transmission device 200 are the same as those disclosed in FIG. 1 according to Embodiment 1, and as such, descriptions thereof will be omitted. The following focuses on the description of the preamble generation unit 1010.
  • The preamble generation unit 1010 receives capability information on the bitstream reception device 1001 from the bitstream reception device 1001, and based on the capability information, changes information included in the preamble signal Pc that indicates the audio data coding format of the elementary audio stream. More specifically, the preamble generation unit 1010 determines, based on the capability information received from the bitstream reception device 1001, whether or not the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 coded in HEAAC at a sampling rate of 48 kHz. If the preamble generation unit 1010 determines that the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 coded in HEAAC at a sampling rate of 48 kHz, and the elementary audio stream 300 is coded in HEAAC at a sampling rate of 48 kHz, the preamble generation unit 1010 generates the preamble signal Pc showing that the elementary audio stream 300 is coded in HEAAC, different than in Embodiment 1. More specifically, the preamble generation unit 1010 sets the value indicated by the five bits from bit 0 to bit 4 of the preamble signal Pc to 23. This causes the preamble signal Pc to indicate that the header format is LOAS. Furthermore, the preamble generation unit 1010 sets the value indicated by the two bits from bit 5 to bit 6 of the preamble signal Pc to 2. This causes the preamble signal Pc to indicate that the elementary audio stream 300 includes SBR information. In other words, the preamble signal Pc indicates that the elementary audio stream 300 is coded in HEAAC.
  • Regarding the previously described bitstream transmission device 1000, the bitstream reception device 1001 according to Embodiment 2 receives the audio bitstream 301 transmitted from the bitstream transmission device 1000, and decodes the received audio bitstream 301. The bitstream reception device 1001 includes the preamble removal unit 202, the elementary decoding unit 203, and a capability information notification unit 1011. In FIG. 6, all processing units having the same reference numerals as those in the bitstream reception device 201 are the same as those disclosed in FIG. 2 according to Embodiment 1, and as such, descriptions thereof will be omitted. The following focuses on the description of the capability information notification unit 1011.
  • The capability information notification unit 1011 stores, in internal memory, capability information indicating whether or not the elementary decoding unit 203 in the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 coded in HEAAC at a sampling rate of 48 kHz. When a transmission request for capability information is made from the bitstream transmission device 1000, the capability information notification unit 1011 notifies the preamble generation unit 1010 of the bitstream transmission device 1000 with the capability information stored in memory.
  • With this configuration, when the elementary decoding unit 203 of the bitstream reception device 1001 is capable of decoding the elementary audio stream coded in HEAAC at a sampling rate of 48 kHz and the elementary audio stream 300 is coded w in HEAAC and even if it has a sampling rate of 48 kHz, the bitstream transmission device 1000 is capable of describing a parameter indicating that SBR information is present in the preamble signal Pc. As such, when a defect such as a break in sound at the switching point between content coded in AAC and content coded in HEAAC is not problematic, the bitstream reception device 1001 is capable of decoding the elementary audio stream 300 in accordance with the preamble signal included in the audio bitstream 301 transmitted from the bitstream transmission device 1000. This has the advantageous effect that the user can enjoy content coded in HEAAC in high audio quality and having a bandwidth that has been expanded into the high band.
  • Moreover, even when the elementary decoding unit 203 of the bitstream reception device 1001 is capable of decoding a stream coded in HEAAC at a sampling rate of 48 kHz, the bitstream reception device 1001 is not required to notify the bitstream transmission device 1000 of capability information. In this case, the bitstream reception device 1001 may, for example, interpret the header and Extension_payload of the elementary audio stream 300 extracted from the audio bitstream 301 received from the bitstream transmission device 200 according to Embodiment 1, and decode the elementary audio stream 300 according to the SBR information when SBR information is included in the elementary audio stream 300. Furthermore, even when SBR information is not included in the elementary audio stream 300 extracted from the audio bitstream 301, the bitstream reception device 1001 may perform HEAAC decoding in which the bandwidth is expanded into the high band.
  • When the sampling frequency resulting from the analysis by the analysis unit 101 is a predetermined second frequency, the preamble generation unit 1010 specifies, as information included in the preamble signal and indicating the coding format, a coding format in which SBR information is not included in the elementary audio stream. With this, for example, in an actual DTV broadcast, even when the programming content is broadcast having an audio data coding format of HEAAC and an AAC sampling rate of 48 kHz, while on the other hand the commercial content is broadcast having an audio data coding format of AACLC and an AAC sampling rate of 48 kHz, processing can be done without changing a final sampling rate of the decoded signal. As such, the transition between program content and commercial content is smooth. By doing this, when the elementary audio stream 300 is coded in HEAAC and has a sampling frequency of 48 kHz and the bitstream reception device 1001 is capable of decoding such an elementary audio stream 300, the bitstream reception device 1001 is capable of reproducing the high audio quality audio signal decoded in HEAAC.
  • Moreover, in an actual DTV broadcast, when the broadcast has a sampling rate of 24 kHz and indicates "(SBR information present", even if SBR information is lost as a result of a defect in the signal from poor reception of the broadcast waves, the final sampling rate of the output signal is held constant. In this way, it is possible for the bitstream reception and transmission system to keep the influences caused by poor reception to a minimum.
  • It should be noted that in Embodiment 2, the preamble removal unit 202 extracts SBR presence/absence information and transmits it to the elementary decoding unit 203, but when the sampling rate that the elementary audio stream 300 indicates is from 16 kHz to 24 kHz, the bitstream reception device may, without doing the above, decode the elementary audio stream 300 under the pretense that "(SBR information present" is always indicated. Moreover, in contrast, when the sampling rate that the elementary audio stream 300 indicates is from 32 kHz to 48 kHz, the bitstream reception device may decode the elementary audio stream 300 under the pretense that "no SBR information" is always indicated
  • Hereinbefore, the bitstream transmission device and the bitstream transmission and reception system according to the embodiments of the present invention has been described, but the present invention is not limited to these embodiments.
  • Moreover, each processing unit included in the bitstream transmission device or the bitstream transmission and reception system according to the above embodiments are typical realized as an LSI, which is an integrated circuit. Each of these processing units may be individually realized as a single chip, or a portion or all of the processing units may be realized as a single chip.
  • Moreover, a portion or all of the functions of the bitstream transmission device or the bitstream reception and transmission system according to the embodiments of the present invention may be realized by a processor such as a CPU executing a computer program.
  • Moreover, the present invention may be the above-described computer program, and may be a non-transitory computer-readable recording medium having the computer program recorded thereon. Moreover, it goes without saying that the above-described computer program is capable of being circulated over a transmission medium such as the Internet.
  • Moreover, the functions of the bitstream transmission device and the bitstream reception and transmission system according to the above-described Embodiment 1 and Embodiment 2, as well the variations thereof, may be, at least in part, combined.
  • Moreover, the division of the function blocks in the block diagrams is just one example. A plurality of function blocks may be realized as a single function block, a single function block may be divided into a plurality of function blocks, or a portion of functions of one block may be may be transferred to a different function block. Moreover, the functions of a plurality of function blocks having similar functions may be processed by a single piece of hardware or software in parallel or by time-division.
  • Furthermore, as long as they do not depart from the essence of the present invention, various modifications to the present embodiment which may be conceived by those skilled in the art are intended to be included within the scope of this invention.
  • [Industrial Applicability]
  • The bitstream transmission device according to the present invention is capable of changing information relating to the presence or absence of SBR information, and thus is capable of transmitting bitstreams using the IEC 61937 standard even if the bitstream inputted is not defined by the IEC 61937 standard. As such, the present invention is widely applicable to DTV broadcast reception devices and reproduction devices which read and reproduce a signal from a recording medium storing an audio coded signal. Moreover, since the present invention is capable of suppressing occurrences of changes in sampling rate at the time of switching between content of different audio data coding formats, the present invention can reproduce quality sound without interruptions.
  • [Reference Signs List]
    • 100 input unit
    • 101 analysis unit
    • 102, 1010 preamble generation unit
    • 103 forming unit
    • 104 output unit
    • 200, 1000 bitstream transmission device
    • 201, 1001 bitstream reception device
    • 202 preamble removal unit
    • 203 elementary decoding unit
    • 210 bitstream reception and transmission system
    • 220 television
    • 221 decoder
    • 222 two-channel downmix unit
    • 223 speakers
    • 230 theater system
    • 231 multi-channel speakers
    • 300 elementary audio stream
    • 301 audio bitstream
    • 1011 capability information notification unit

Claims (19)

  1. A bitstream transmission device that transmits an audio bitstream in a given format, the bitstream transmission device comprising:
    an input unit configured to acquire an elementary audio stream obtained by coding an audio signal;
    an analysis unit configured to analyze a sampling frequency and a coding format of the elementary audio stream;
    a preamble generation unit configured to generate, based on a result of the analysis by the analysis unit, a preamble signal including information indicating a coding format of the elementary audio stream;
    a forming unit configured to form the audio bitstream in the given format by incorporating in the elementary audio stream the preamble signal generated by the preamble generation unit; and
    an output unit configured to output the audio bitstream formed by the forming unit,
    wherein based on the sampling frequency and the coding format resulting from the analysis by the analysis unit, the preamble generation unit is configured to specify, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from the coding format of the elementary audio stream resulting from the analysis by the analysis unit.
  2. The bitstream transmission device according to Claim 1,
    wherein the preamble generation unit is configured to specify the information included in the preamble signal and indicating the coding format to limit an output transmission rate of data output from the output unit to a predetermined range.
  3. The bitstream transmission device according to Claim 1,
    wherein the audio bitstream in the given format is an audio bitstream in a format defined by the IEC 61937 standard, and
    when the sampling frequency resulting from the analysis by the analysis unit is a predetermined first frequency and the coding format corresponding to the first frequency does not comply with the IEC 61937 standard, to generate the preamble signal that conforms the audio bitstream to the IEC 61937 standard, the preamble generation unit is configured to specify, as the information indicating the coding format, a coding format different from the analyzed coding format in terms of presence or absence of spectral band replication (SBR) information in the elementary audio stream.
  4. The bitstream transmission device according to Claim 3,
    wherein when the sampling frequency resulting from the analysis by the analysis unit is a predetermined second frequency, the preamble generation unit is further configured to specify, as the information included in the preamble signal and indicating the coding format, that the elementary audio stream is coded in a coding format not including the SBR information.
  5. The bitstream transmission device according to Claim 3,
    wherein when the sampling frequency resulting from the analysis by the analysis unit is a predetermined third frequency, the preamble generation unit is configured to generate the preamble signal indicating that the elementary audio stream is coded in a coding format including the SBR information, even when the elementary audio stream is not coded in a coding format including the SBR information.
  6. The bitstream transmission device according to Claim 3,
    wherein when the sampling frequency resulting from the analysis by the analysis unit is a predetermined fourth frequency, the preamble generation unit is configured to generate the preamble signal indicating that the elementary audio stream is coded in a coding format not including the SBR information, even when the elementary audio stream is coded in a coding format including the SBR information.
  7. A bitstream reception and transmission system comprising the bitstream transmission device according to any one of Claims 1 to 6 and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream,
    wherein the bitstream reception device includes:
    a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and
    an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  8. A bitstream reception and transmission system comprising the bitstream transmission device according to Claim 5 and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream,
    wherein the bitstream reception device includes:
    a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and
    an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, perform decoding where a signal with an expanded high band is 0 and which uses the SBR information, in accordance with the information specified in the preamble signal and indicating the coding format.
  9. A bitstream reception and transmission system comprising the bitstream transmission device according to Claim 6 and a bitstream reception device that receives the audio bitstream transmitted from the bitstream transmission device and decodes the received audio bitstream,
    wherein the bitstream reception device includes:
    a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal from the received audio bitstream; and
    an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, generate a signal having a sampling frequency that is half a sampling frequency when decoding processing using the SBR information is performed, in accordance with the information specified in the preamble signal and indicating the coding format.
  10. The bitstream transmission and reception system according to any one of Claims 7 to 9,
    wherein the bitstream reception device includes a capability notification unit configured to notify the bitstream transmission device with information indicating a capability of the bitstream reception device,
    the analysis unit is configured to analyze Fs information indicating a sampling rate of the elementary audio stream, and
    when the Fs information indicates a predetermined value, the preamble generation unit is configured to switch between generating or not generating a preamble signal indicating inclusion of SBR information, based on the information from the capability notification unit.
  11. A bitstream reception device comprising:
    a reception unit configured to receive, from a bitstream transmission device, an audio bitstream in a given format that (i) is formed by incorporating, into an elementary audio stream which is a coded audio signal, a preamble signal including information indicating a coding format of the elementary audio stream, and (ii) specifies, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream when a sampling frequency and the coding format of the elementary audio stream do not comply with the given format of the audio bitstream;
    a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and
    an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  12. The bitstream reception device according to Claim 11,
    wherein the bitstream reception device receives the audio bitstream including the preamble signal indicating that the elementary audio stream is coded in a coding format including SBR information when the sampling frequency of the elementary audio stream is a predetermined third frequency, even when the elementary audio stream is not coded in a coding format including the SBR information, and
    the elementary decoding unit is configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, perform decoding where a signal with an expanded high band is 0 and which uses the SBR information, in accordance with the information specified in the preamble signal and indicating the coding format.
  13. The bitstream reception device according to Claim 11,
    wherein the bitstream reception device receives the audio bitstream including the preamble signal indicating that the elementary audio stream is coded in a coding format not including SBR information when the sampling frequency of the elementary audio stream is a predetermined fourth frequency, even when the elementary audio stream is coded in a coding format including the SBR information, and
    the elementary decoding unit is configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, generate a signal having a sampling frequency that is half a sampling frequency when decoding processing using the SBR information is performed, in accordance with the information specified in the preamble signal and indicating the coding format.
  14. A bitstream transmission method of transmitting an audio bitstream in a given format, the bitstream transmission method comprising:
    acquiring an elementary audio stream obtained by coding an audio signal;
    analyzing a sampling frequency and a coding format of the elementary audio stream;
    generating, based on a result of the analyzing, a preamble signal including information indicating a coding format of the elementary audio stream;
    forming the audio bitstream in the given format by incorporating in the elementary audio stream the generates preamble signal; and
    outputting the formed audio bitstream,
    wherein based on the sampling frequency and the coding format resulting from the analyzing, in the generating of a preamble signal, a coding format that complies with the given format of the audio bitstream and is different from the coding format of the elementary audio stream resulting from the analyzing is specified as the information included in the preamble signal and indicating the coding format.
  15. A bitstream reception method comprising:
    receiving, from a bitstream transmission device, an audio bitstream in a given format that (i) is formed by incorporating, into an elementary audio stream which is a coded audio signal, a preamble signal including information indicating a coding format of the elementary audio stream, and based on a sampling rate and the coding format of the elementary audio stream, (ii) specifies, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from a coding format resulting from an analysis of the elementary bit stream;
    extracting the elementary audio stream by removing the preamble signal included in the received audio bitstream; and
    when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decoding the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  16. An integrated circuit comprising:
    an input unit configured to acquire an elementary audio stream obtained by coding an audio signal;
    an analysis unit configured to analyze a sampling frequency and a coding format of the elementary audio stream;
    a preamble generation unit configured to generate, based on a result of the analysis by the analysis unit, a preamble signal including information indicating a coding format of the elementary audio stream;
    a forming unit configured to form the audio bitstream in the given format by incorporating in the elementary audio stream the preamble signal generated by the preamble generation unit; and
    an output unit configured to output the audio bitstream formed by the forming unit,
    wherein based on the sampling frequency and the coding format resulting from the analysis by the analysis unit, the preamble generation unit is configured to specify, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from the coding format of the elementary audio stream resulting from the analysis by the analysis unit.
  17. An integrated circuit comprising:
    a reception unit configured to receive, from a bitstream transmission device, an audio bitstream in a given format that (i) is formed by incorporating, into an elementary audio stream which is a coded audio signal, a preamble signal including information indicating a coding format of the elementary audio stream, and based on a sampling rate and the coding format of the elementary audio stream, (ii) specifies, as the information included in the preamble signal and indicating the coding format, a coding format that complies with the given format of the audio bitstream and is different from a coding format resulting from an analysis of the elementary bit stream,
    a preamble removal unit configured to extract the elementary audio stream by removing the preamble signal included in the received audio bitstream; and
    an elementary decoding unit configured to, when the information specified in the preamble signal and indicating the coding format of the elementary audio stream and the information included in the elementary audio stream and indicating the coding format of the elementary audio stream are inconsistent, decode the elementary audio stream according to the coding format indicated by the information specified in the preamble signal.
  18. An audio bitstream described having a given format,
    wherein the audio bitstream is formed by incorporating a preamble signal in an elementary audio stream obtained by coding an audio signal, and
    based on a sampling frequency and a coding format of the elementary audio stream that are specified in the elementary audio stream, a coding format that complies with the given format of the audio bitstream and is different from the coding format specified in the elementary audio stream is specified in the audio bitstream as information included in the preamble signal and indicating the coding format.
  19. The audio bitstream according to Claim 18,
    wherein the information indicating the coding format of the elementary audio stream specifies a coding format different from the coding format specified in the elementary audio stream in terms of presence or absence of spectral band replication (SBR) information in the elementary audio stream.
EP12788834.5A 2011-05-20 2012-05-16 Bit stream transmission device, bit stream reception/transmission system, bit stream reception device, bit stream transmission method, bit stream reception method, and bit stream Active EP2711924B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2011114178 2011-05-20
PCT/JP2012/003205 WO2012160782A1 (en) 2011-05-20 2012-05-16 Bit stream transmission device, bit stream reception/transmission system, bit stream reception device, bit stream transmission method, bit stream reception method, and bit stream

Publications (3)

Publication Number Publication Date
EP2711924A1 true EP2711924A1 (en) 2014-03-26
EP2711924A4 EP2711924A4 (en) 2015-04-01
EP2711924B1 EP2711924B1 (en) 2018-11-14

Family

ID=47216875

Family Applications (1)

Application Number Title Priority Date Filing Date
EP12788834.5A Active EP2711924B1 (en) 2011-05-20 2012-05-16 Bit stream transmission device, bit stream reception/transmission system, bit stream reception device, bit stream transmission method, bit stream reception method, and bit stream

Country Status (4)

Country Link
EP (1) EP2711924B1 (en)
JP (1) JP5843856B2 (en)
CN (1) CN103620678B (en)
WO (1) WO2012160782A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10134413B2 (en) 2015-03-13 2018-11-20 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101427756B1 (en) * 2013-04-26 2014-08-08 주식회사 코아로직 A method and an apparatus for transferring multi-channel audio signal
US10878828B2 (en) * 2014-09-12 2020-12-29 Sony Corporation Transmission device, transmission method, reception device, and reception method
CN104392724A (en) * 2014-12-11 2015-03-04 福建联迪商用设备有限公司 Audio communication method and device
WO2019051427A1 (en) * 2017-09-11 2019-03-14 Cohere Technologies, Inc. Wireless local area networks using orthogonal time frequency space modulation

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060265227A1 (en) * 2005-05-11 2006-11-23 Noriaki Sadamura Audio decoding device
US20060288851A1 (en) * 2003-06-17 2006-12-28 Akihisa Kawamura Receiving apparatus, sending apparatus and transmission system

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100265112B1 (en) * 1997-03-31 2000-10-02 윤종용 Dvd dics and method and apparatus for dvd disc
JP2004007359A (en) * 2002-04-08 2004-01-08 Matsushita Electric Ind Co Ltd Multimedia data decoding device
JP2006050387A (en) * 2004-08-06 2006-02-16 Matsushita Electric Ind Co Ltd Data reproducing method, and data reproducing apparatus
JP4863990B2 (en) * 2004-10-07 2012-01-25 パナソニック株式会社 Information recording medium, TS packet determining device, and data reproducing device
JP2006202353A (en) * 2005-01-18 2006-08-03 Matsushita Electric Ind Co Ltd Reproduction device having corrective function of management information of mpeg2 transport stream
JP5107574B2 (en) * 2005-02-24 2012-12-26 パナソニック株式会社 Data reproduction apparatus, data reproduction method, program, and integrated circuit
JP4613674B2 (en) * 2005-04-11 2011-01-19 パナソニック株式会社 Audio playback device
JP2007017628A (en) * 2005-07-06 2007-01-25 Matsushita Electric Ind Co Ltd Decoder
CN101740032A (en) * 2008-11-19 2010-06-16 北京中星微电子有限公司 Audio coding and decoding methods, and audio coding and decoding devices
CN101409093B (en) * 2008-11-28 2011-06-15 炬力集成电路设计有限公司 Method and apparatus for determining audio data sampling point position

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060288851A1 (en) * 2003-06-17 2006-12-28 Akihisa Kawamura Receiving apparatus, sending apparatus and transmission system
US20060265227A1 (en) * 2005-05-11 2006-11-23 Noriaki Sadamura Audio decoding device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"IEC CDV 61937-6 (SC 29 N 4069)", 56. MPEG MEETING; 05-03-2001 - 09-03-2001; SINGAPORE; (MOTION PICTUREEXPERT GROUP OR ISO/IEC JTC1/SC29/WG11),, no. M6962h, 19 February 2001 (2001-02-19), XP030036092, ISSN: 0000-0285 *
See also references of WO2012160782A1 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10134413B2 (en) 2015-03-13 2018-11-20 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10262669B1 (en) 2015-03-13 2019-04-16 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10262668B2 (en) 2015-03-13 2019-04-16 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10453468B2 (en) 2015-03-13 2019-10-22 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10553232B2 (en) 2015-03-13 2020-02-04 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10734010B2 (en) 2015-03-13 2020-08-04 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10943595B2 (en) 2015-03-13 2021-03-09 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US11367455B2 (en) 2015-03-13 2022-06-21 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US11417350B2 (en) 2015-03-13 2022-08-16 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US11664038B2 (en) 2015-03-13 2023-05-30 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US11842743B2 (en) 2015-03-13 2023-12-12 Dolby International Ab Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element

Also Published As

Publication number Publication date
EP2711924B1 (en) 2018-11-14
JPWO2012160782A1 (en) 2014-07-31
EP2711924A4 (en) 2015-04-01
JP5843856B2 (en) 2016-01-13
CN103620678B (en) 2015-08-19
WO2012160782A1 (en) 2012-11-29
CN103620678A (en) 2014-03-05

Similar Documents

Publication Publication Date Title
EP1590800B1 (en) Continuous backup audio
US9378743B2 (en) Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
US20100324915A1 (en) Encoding and decoding apparatuses for high quality multi-channel audio codec
EP2711924B1 (en) Bit stream transmission device, bit stream reception/transmission system, bit stream reception device, bit stream transmission method, bit stream reception method, and bit stream
IL295809B1 (en) Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
JP4613674B2 (en) Audio playback device
JP6728154B2 (en) Audio signal encoding and decoding
US20140310010A1 (en) Apparatus for encoding and apparatus for decoding supporting scalable multichannel audio signal, and method for apparatuses performing same
KR20060027344A (en) Receiving apparatus, sending apparatus and transmission system
US11830508B2 (en) Method and apparatus for processing of auxiliary media streams embedded in a MPEGH 3D audio stream
KR101038442B1 (en) Embedded data signaling
US11025406B2 (en) Audio return channel clock switching
US20210098011A1 (en) Audio Return Channel Data Loopback
US20130191133A1 (en) Apparatus for audio data processing and method therefor
RU2780733C2 (en) Method and device for processing auxiliary media streams embedded in the mpeg-h 3d audio stream
WO2024076828A1 (en) Method, apparatus, and medium for encoding and decoding of audio bitstreams with parametric flexible rendering configuration data
WO2024074284A1 (en) Method, apparatus, and medium for efficient encoding and decoding of audio bitstreams
WO2016038876A1 (en) Encoding device, decoding device, and speech signal processing device

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20131116

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAX Request for extension of the european patent (deleted)
RA4 Supplementary search report drawn up and despatched (corrected)

Effective date: 20150303

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 19/00 20130101AFI20150225BHEP

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: SOCIONEXT INC.

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20170316

REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Ref document number: 602012053538

Country of ref document: DE

Free format text: PREVIOUS MAIN CLASS: G10L0019000000

Ipc: G10L0019160000

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 19/16 20130101AFI20180613BHEP

INTG Intention to grant announced

Effective date: 20180716

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAJ Information related to disapproval of communication of intention to grant by the applicant or resumption of examination proceedings by the epo deleted

Free format text: ORIGINAL CODE: EPIDOSDIGR1

GRAL Information related to payment of fee for publishing/printing deleted

Free format text: ORIGINAL CODE: EPIDOSDIGR3

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

GRAR Information related to intention to grant a patent recorded

Free format text: ORIGINAL CODE: EPIDOSNIGR71

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE PATENT HAS BEEN GRANTED

INTC Intention to grant announced (deleted)
INTG Intention to grant announced

Effective date: 20181004

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

Ref country code: AT

Ref legal event code: REF

Ref document number: 1065791

Country of ref document: AT

Kind code of ref document: T

Effective date: 20181115

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602012053538

Country of ref document: DE

REG Reference to a national code

Ref country code: NL

Ref legal event code: MP

Effective date: 20181114

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG4D

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 1065791

Country of ref document: AT

Kind code of ref document: T

Effective date: 20181114

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190214

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190214

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190314

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: RS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: AL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190215

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190314

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602012053538

Country of ref document: DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: SM

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20190815

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190531

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190531

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20190531

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190516

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190516

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190531

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; INVALID AB INITIO

Effective date: 20120516

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20181114

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20230526

Year of fee payment: 12

Ref country code: DE

Payment date: 20230519

Year of fee payment: 12

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20230524

Year of fee payment: 12