WO2007138825A1 - Digital audio data processing device and processing method - Google Patents

Digital audio data processing device and processing method Download PDF

Info

Publication number
WO2007138825A1
WO2007138825A1 PCT/JP2007/059585 JP2007059585W WO2007138825A1 WO 2007138825 A1 WO2007138825 A1 WO 2007138825A1 JP 2007059585 W JP2007059585 W JP 2007059585W WO 2007138825 A1 WO2007138825 A1 WO 2007138825A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
reference frame
decoded
header information
header
Prior art date
Application number
PCT/JP2007/059585
Other languages
French (fr)
Japanese (ja)
Inventor
Seiji Harada
Original Assignee
Pioneer Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pioneer Corporation filed Critical Pioneer Corporation
Priority to JP2008517814A priority Critical patent/JP4551472B2/en
Publication of WO2007138825A1 publication Critical patent/WO2007138825A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm

Definitions

  • the present invention relates to a digital audio data processing apparatus and processing method for digitally processing encoded data.
  • Digital broadcasting for digitizing and multiplex-transmitting image data (video data) and audio data (audio data) using, for example, a digital or Internet broadcast receiver has already been started.
  • a predetermined compression encoding method for example, MPEG; Moving Picture Experts Group method, etc.
  • the data power of a plurality of programs for example, MPEG transport stream.
  • Stream; MPEG-TS, etc. The multiplexed data transmitted as a stream is selectively extracted by transmitting it to the receiver side that has received the data.
  • AAC-plus An encoding method in which SBR is added in addition to the conventional AAC is called AAC-plus, and one frame of data includes AAC encoded data (BaseCodec) and SBR encoded data. It consists of. Note that even conventional decryption means compatible only with AAC can decrypt only AAC data by skipping SBR data.
  • Patent Document 1 Japanese Patent Laid-Open No. 2006-50387
  • auxiliary information for predicting a high frequency component from a low frequency component is subjected to band expansion processing during reproduction to generate a high frequency component. It is.
  • a frame (reference frame) having an SBR header is generated every predetermined frame data unit (for example, several frames, several tens of frames) (for example, irregularly) based on the data transmission amount restriction described above. It has been entered.
  • frames that do not have an SBR header other than this frame are included, and all frames create a calculation table from the information (header information) stored in the SBR header. Based on the table, the band expansion process is performed to generate a high frequency component.
  • the problems to be solved by the present invention include the above-described problems as an example.
  • the invention according to claim 1 is encoded data including a plurality of framed frame sequences, and the plurality of frame sequences include a reference frame and a non-reference.
  • the non-reference frame includes non-reference first data obtained by encoding speech information and non-reference first data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data.
  • Second reference data and the reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding the reproduction band of the reference first data
  • a digital audio data processing device for processing a stream, including reference second data having a processing header including header information for performing arithmetic processing of the non-reference second data, wherein the reference frame
  • a header information acquisition unit for acquiring the header information of the processing header provided in the reference second data, and decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame
  • the first decoding means for generating first decoded data, and the reference second data or the non-reference second data of the reference frame or the non-reference frame is obtained by the header information obtaining means.
  • the first decoding is performed.
  • a high-frequency component pseudo-generating means for pseudo-generating high-frequency component data having a reproduction band higher than that of the first decoded data based on the first decoded data decoded by the encoding means; 1 Decryption Hide And output control means for outputting the high-frequency component data together.
  • the invention according to claim 4 is code data including a plurality of framed frames, and the plurality of frame sequences includes a plurality of frames including a reference frame and a non-reference frame.
  • the non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data.
  • the reference frame is configured by encoding reference first data in which audio information is encoded and band expansion information for expanding a reproduction band of the reference first data.
  • Digital processing stream including reference second data having a processing header with header information for performing arithmetic processing An audio data processing method, an error determination procedure for determining whether or not a part of the stream has been lost, and a header for acquiring the header information of the processing header provided in the reference second data of the reference frame An information acquisition procedure; first decoding means for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame to generate first decoded data; and the reference frame Alternatively, the second reference data or the second non-reference data of the non-reference frame is decoded using the header information acquired in the header information acquisition procedure to generate second decoded data.
  • High-frequency component data with a higher regeneration zone than 1 decodes data generated in a pseudo manner, and outputs before Symbol the high frequency component data together with the first decryption data.
  • This embodiment is an embodiment when applied to a mobile phone as an example of a digital audio data processing apparatus.
  • FIG. 1 is a perspective view showing the overall appearance of the mobile phone according to the present embodiment.
  • this mobile phone 1 as a digital audio data processing apparatus acquires and outputs content data including audio data, video data, data for data broadcasting, etc. distributed as TS (Transport Stream). It is compatible with terrestrial digital broadcasting using the TS system.
  • TS Transport Stream
  • the cellular phone 1 is provided with a main body casing 2, an operation unit 3 provided at a lower portion of the main body casing 2, and provided with a telephone number input key, various function buttons, and the like, and a base end portion thereof.
  • Open / close cover 4 pivotally supported at the lower end of main casing 2 and attached to main casing 2 so as to be openable / closable, display 5 for various displays, and antenna 6 for data transmission / reception via wireless communication
  • FIG. 2 is a functional block diagram showing a functional configuration of the mobile phone 1.
  • Figure 2 Odor After the radio wave transmitted from the broadcasting station or other mobile phone is received by the antennas 6 and 10, the demodulated received signal is demodulated by the transmission / reception unit 100 connected to the antennas 6 and 10.
  • the signal processing unit 101 performs predetermined signal processing (details will be described later) for reproduction.
  • the signal processed by the signal processing unit 101 is reproduced as sound by the speaker 7.
  • the transmission / reception unit 100 includes a TS reception unit (not shown), and the broadcasting antenna 10 (which may also be used as the antenna 6) is connected to the TS reception unit.
  • the TS receiving unit acquires a TS corresponding to the content selected by the user from, for example, a plurality of TSs transmitted as digital signals from the broadcasting antenna 10. Then, the acquired TS is output to the signal processing unit 101 as a TS signal.
  • the conversation of the speaker is input to the microphone 9 and converted into an audio signal.
  • the audio signal is subjected to signal processing for transmission in the signal processing unit 101.
  • the transmission / reception unit 100 modulates the audio signal from the signal processing unit 101 and supplies the modulated signal to the antenna 6.
  • the antenna 6 transmits the audio signal. As a radio wave.
  • control unit 102 including a CPU and the like.
  • FIG. 3 is a schematic diagram conceptually showing an extracted portion related to audio in the TS stream received by the antenna 10 provided in the mobile phone 1 of the present embodiment.
  • FIG. 3 shows a state in which an audio stream extracted from the multiplexed stream received by the antenna 10 (ie, Elementary Stream) includes a plurality of frames along the time axis. Yes. These multiple frames consist of a reference frame SF and other non-reference frame IF.
  • Non-reference frame IF is a BaseCodec (non-reference number
  • the reference frame SF is configured by encoding BaseCodec (reference first data) obtained by encoding audio information according to the AAC standard and band expansion information for expanding the reproduction band of the BaseCodec, as described above.
  • BaseCodec reference first data
  • SBR data standard second data
  • SBR header processing header
  • a table for the above arithmetic processing is created, and band expansion processing can be performed based on this table to generate high frequency components.
  • FIG. 4 is a functional block diagram showing a configuration related to the reproduction processing of the audio signal in the signal processing unit 101.
  • the signal processing unit 101 includes a BaseCodecDecode unit 11, an error processing unit 12, a high frequency component pseudo generation processing unit 13, an SBRDecode unit 14, and a BaseCodecDecode unit 1
  • Switch 15 (output control means) that is switched by a switching control signal from 1.
  • the BaseCodecDecode unit 11 has a plurality of functions. First, as the first function, the audio signal stream (Elementary Stream) described above with reference to FIG. 3 is input, and the BaseCodec part included in the reference frame SF and the non-reference frame IF is decoded (decoded) ( First decoding means).
  • the BaseCodecDecode unit 11 determines whether each frame has an SBR-header (in other words, a reference frame SF or a non-reference frame IF force). At the same time, if the frame is the reference frame SF, the header information of the SBR header provided in the SBR data is acquired (header information acquisition means).
  • SBR-header in other words, a reference frame SF or a non-reference frame IF force.
  • the determination method may be determined to be an error when a failure occurs in the ES decoding process in the BaseCodecDecode unit 11, or when an ES is input, error information about the ES is separately received from the outside. You may make it judge
  • the switch 15 is based on the switching control signal from the BaseCodecDecode unit 11, based on the error processing unit 12, the high frequency component pseudo generation processing unit 13, and the S BRDecode unit 14. The output from either of these is selected and output to the speaker 7 side.
  • Figure 5 shows the processing executed by the BaseCodecDecode unit 11 for each frame. It is a flowchart showing a procedure.
  • step S60 the error processing unit 12 is instructed to output predetermined error data, and a switching control signal for switching the switch 15 to the error processing unit 12 side is output.
  • the error processing unit 12 outputs, as error data, a mute signal for making a silent state (or data before and after the data force may be interpolated by an appropriate method) (error data generation means, Error data generation procedure), output from switch 15 to speaker 7 side.
  • step S10 determines whether an error has not occurred, the determination is not satisfied, and the routine goes to step S15.
  • step S15 the header information of the SBR header is acquired from the frame by the function as the header information acquisition means described above (header information acquisition procedure).
  • the SBRDecode unit 14 receives the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 as follows: In addition, decryption processing is performed using a table based on the syntax information of the SBR section described above. The decoding data generated by the SBR section is collected and output to the switch 15 (second decoding procedure), and output from the switch 15 to the speaker 7 side.
  • step S40 the decoded data of the BaseCodec part decoded by the function as the first decoding means described above is output to the high-frequency component pseudo-generation processing unit 13 and the switch 15 is pseudo-generated. A switching control signal for switching to the processing unit 13 is output.
  • the high-frequency component pseudo-generation processing unit 13 up-samples the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 twice and uses a known method (for example, described in Japanese Patent No. 3140273).
  • the high-frequency component data is generated based on the decoded data of the BaseCodec part, which is a low frequency by the above method), and the decrypted data including the generated high-frequency component data is output to the switch 15, and the switch 15 Output to the side.
  • step S35, step S40, and step S60 are completed, the process returns to step S10 and the same procedure is repeated.
  • the header information of the SBR header cannot be acquired (in other words, when a calculation table is not created)
  • a known high-frequency component simulation generation method is used.
  • the high frequency component data is generated in a pseudo manner based on the decoded data of the BaseCodec part decoded in this way, and the generated high frequency component data is added to the decoded data of the BaseCodec part for output.
  • the digital audio data processing device 1 is encoded data composed of a plurality of framed frame sequences, and the plurality of frame sequences include the reference frame SF and the non-frame data.
  • the reference frame IF includes multiple frames, and the non-reference frame IF expands the playback band of the non-reference first data (BaseCodec in this example) that encodes audio information and the non-reference first data BaseCodec.
  • Non-reference second data (in this example, SBR) that encodes the bandwidth expansion information for encoding
  • the reference frame SF is the reference first data (in this example, BaseCodec) that encodes the voice information
  • a processing header (this header, which includes header information for performing arithmetic processing of the non-standard second data SBR, is configured by encoding the band expansion information for expanding the reproduction band of the reference first data BaseCodec.
  • a digital audio data processing device 1 for processing a stream including reference second data (in this example, SBR) having SBR headers, and is a processing header provided in reference second data SBR of reference frame SF Header information acquisition means (BaseCodecDecode section 11 in this example) for acquiring the header information of the first frame and the first reference data BaseCodec or the first non-reference first data BaseCodec of the reference frame SF or the non-reference frame IF.
  • SBR reference second data
  • BaseCodecDecode section 11 for acquiring the header information of the first frame and the first reference data BaseCodec or the first non-reference first data BaseCodec of the reference frame SF or the non-reference frame IF.
  • the first decoding means (BaseCodecDecode unit 11 in this example) that generates data (BaseCodec data after decoding) in this example, and the reference second data SBR or non-reference data of the reference frame SF or non-reference frame IF 2
  • the second decoding means for decoding the data SBR using the header information acquired by the header information acquisition means 11 and generating the second decrypted data (in this example, the decoded SBR data)
  • High-frequency component pseudo-generation means (in this example, a high-frequency component) that artificially generates high-frequency component data having a higher reproduction and reproduction band than the first decoded data (decoded BaseCodec data).
  • a pseudo-generation processing unit 13) and output control means (switch 15 in this example) for outputting the first decoded key data (BaseCodec data
  • the reference frame SF including the force standard first data BaseCodec and the reference second data SBR of each of the plurality of frames provided in the stream, and the non-standard first data BaseCodec and the non-standard second data. It is composed of non-reference frame IF including data SBR.
  • the reference first data BaseCodec and the non-reference first data BaseCodec are decoded by the first decoding means 11 to generate the first decoded data (Base Codec data after decoding), while the reference first data 2
  • the data SBR and the non-reference second data SBR are decrypted by the second decryption means 14 using the header information obtained by the header information obtaining means 11, and the second decrypted data ( SBR data after decoding) ) Is generated.
  • the first decoding unit 11 performs the decoding. Based on the first decoded data (BaseCodec data after decoding) The pseudo-generation means 13 generates pseudo high frequency component data. Then, the output control means 15 outputs the first decoded data (decoded BaseCodec data) together with the generated high frequency component data.
  • the silence state can be shortened and the high-frequency component can be reduced compared to the case where the silence state is obtained until the next header information can be obtained after the error occurs or only the first decoded data (decoded BaseCodec data) is output. By outputting, it is possible to reduce the sense of incongruity in hearing.
  • the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data consisting of a plurality of framed frames, and the plurality of frame sequences are It includes multiple frames consisting of a reference frame SF and a non-reference frame IF, and the non-reference frame IF expands the playback bandwidth of the non-reference first data BaseCodec that encodes audio information and the non-reference first data BaseCodec.
  • the reference frame SF includes the reference first data BaseCodec encoded voice information and the reproduction band of the reference first data BaseCodec.
  • Reference second data SB which is configured by encoding band expansion information for expansion and has a processing header (SBR header) with header information for performing calculation processing of non-reference second data SBR
  • SBR header processing header
  • Step S15) and the reference first data BaseCodec or non-reference first data BaseCodec of the reference frame SF or non-reference frame IF are decoded to generate first decoded data (decoded BaseCodec data).
  • the header information acquisition procedure S 1 is the header information acquisition procedure S 1
  • step 5 If header information could not be obtained in step 5, high frequency component data is generated based on the decoded first decoded data (BaseCodec data after decoding), and the first decoded data (after decoding) BaseCodec data) and the generated high-frequency component data are output together.
  • the silence state can be shortened and the high frequency component can be output compared to the case where the silence state is obtained until the next header information can be acquired after an error occurs or only the first decoded data (decoded BaseCodec data) is output. By doing this, you can reduce the sense of incongruity in hearing.
  • the digital audio data processing apparatus 1 in the above embodiment has error determination means (BaseCodecDecode section 11 in this example) for determining whether or not a part of the stream has been lost. Is characterized by generating high-frequency component data when it is determined by the error determination means 11 that the non-reference frame IF has disappeared.
  • the high frequency component pseudo generation means 13 when the error determination means 11 determines that the non-reference frame IF has disappeared, the high frequency component pseudo generation means 13 generates high frequency component data.
  • silence occurs compared to when silence occurs until the next header information can be acquired after an error in which the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output. And a sense of discomfort in hearing can be reduced by outputting a high frequency component.
  • the digital audio data processing method further includes an error determination procedure (in this example, step S10 in Fig. 5) for determining whether or not a part of the stream has been lost.
  • High frequency component data is generated when it is determined in S10 that the non-reference frame IF has disappeared.
  • high frequency component data is generated when it is determined that the non-reference frame IF has disappeared.
  • silence occurs compared to when silence occurs until the next header information can be obtained after an error that the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output.
  • a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having error data generation means (in this example, error processing unit 12) for generating data (in this example, mute data).
  • a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having an error data generation procedure (in this example, step S60) for generating data (in this example, mute data).
  • FIGS. 6 to 8 are explanatory diagrams for specifically explaining the effects of the present embodiment listed above.
  • the horizontal axis represents the time axis
  • FIG. 6 shows an example of the behavior of each frame in the input stream
  • FIG. 7 shows the digital audio data processing apparatus 1 of the present embodiment corresponding to FIG. Fig. 8 shows the output behavior of the comparative example in which only the first decoded data (BaseCodec data after decoding) is output after the error occurs.
  • the frame after the frame in which the error has occurred differs between the comparative example and the present embodiment. That is, As shown in Fig. 8, in the above comparative example, if an error occurs, the SBR part cannot be decoded until the header information of the SBR header is newly acquired thereafter, and the low frequency band of only the BaseCodec part cannot be obtained. Output decrypted data.
  • the digital audio data processing device 1 is code data including a plurality of framed frames, and the plurality of frame sequences are a plurality of frames including a reference frame SF and a non-reference frame IF.
  • the non-reference frame IF includes the BaseCodec that encodes the audio information and the SBR that encodes the band expansion information for expanding the playback band of this BaseCodec
  • the reference frame SF Includes BaseCodec that encodes information, and SBR that has SBR header that is configured by encoding band expansion information for expanding the playback band of BaseCodec, and includes header information for performing SBR calculation processing
  • the frame includes a reference frame SF including an SBR having a base codec and an SBR header, and a non-reference frame IF including a base codec and an SBR, respectively.
  • BaseCodec is decoded by Base CodecDecode unit 11 to generate BaseCodec data
  • SBR is decoded by SBRDecode unit 14 using the header information acquired by BaseCodecDecode unit 11 to generate SBR data .
  • the BaseCodecDecode unit 11 decodes the BaseCodec data Based on this, the high frequency component pseudo generation processing unit 13 generates high frequency component data in a pseudo manner. Then, the switch 15 outputs BaseCodec data together with the generated high frequency component data. This makes it possible to reduce the silence and reduce the sense of incongruity by outputting high-frequency components, compared to the case where silence is maintained until the next header information can be acquired after an error occurs, and only BaseCodec is output. can do.
  • the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data including a plurality of framed frames, and includes a plurality of frames.
  • the column includes a plurality of frames composed of a reference frame SF and a non-reference frame IF.
  • the non-reference frame IF force encodes base codec that encodes audio information and band expansion information for expanding the playback band of this base codec.
  • BaseCodec which encodes the reference frame SF power audio information, and the band expansion information for expanding the playback band of this BaseCodec.
  • a digital audio data processing method for processing a stream including an SBR having an SBR header with header information, the header of the SBR header provided in the SBR of the reference frame SF.
  • Step S15 for obtaining information, and decoding the BaseCodec of the reference frame SF or non-reference frame IF, and generating the BaseCodec data
  • the BaseCodec decoding procedure by the BaseCodecDecode unit 11 and the reference frame SF or non-reference frame IF SBR is decrypted using the header information obtained in step S15, and SBR data If the header information could not be acquired in step S35 and step S15, a higher playback bandwidth than that of the BaseCodec data is obtained based on the BaseCodec data decoded by BaseCodec decoding by the BaseCodecDecode ⁇ 11.
  • the pseudo high frequency component data is generated and BaseCodec data and high frequency component data are output together.
  • step S15 when header information cannot be acquired in step S15, high frequency component data is generated based on the decoded BaseCodec data, and BaseCodec data and The generated high frequency component data is also output.
  • FIG. 1 is a perspective view showing the overall appearance of a mobile phone according to an embodiment of the present invention.
  • FIG. 2 is a functional block diagram showing a functional configuration of the mobile phone shown in FIG.
  • FIG. 3 is a schematic diagram conceptually showing a portion related to audio extracted from a stream.
  • FIG. 4 is a functional block diagram showing a configuration related to audio signal reproduction processing in the signal processing section shown in FIG.
  • FIG. 5 is a flowchart showing the processing procedure executed by the BaseCodecDecode part for each frame.
  • FIG. 6 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
  • FIG. 7 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
  • FIG. 8 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention. Explanation of symbols
  • BaseCodecDecode part error determination means, header information acquisition means, first decoding means, second decoding means, analysis means, verification means
  • High-frequency component pseudo-generation processing unit (high-frequency component pseudo-generation means)

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

[PROBLEMS] To reduce uncomfortable feeling of an operator even when an error has occurred in a stream. [MEANS FOR SOLVING PROBLEMS] A high-band component pseudo generation unit (13) pseudo-wise generates high-band component data having a higher reproduction band than decoded data in a Base Codec unit and outputs the decoded Base Codec data together with the pseudo-wise generated high-band component data. Thus, as compared to a case when a soundless state continues until next header information is acquired after an error has occurred or when only Base Codec data is outputted after decoding, it is possible to reduce the soundless state and output a higher-band component, thereby reducing the uncomfortable feeling for sense of hearing.

Description

明 现 曞  Specification
デゞタル音声デヌタ凊理装眮及び凊理方法  Digital audio data processing apparatus and processing method
技術分野  Technical field
[0001] 本発明は、笊号化デヌタをデゞタル凊理するデゞタル音声デヌタ凊理装眮及び凊 理方法に関するものである。  [0001] The present invention relates to a digital audio data processing apparatus and processing method for digitally processing encoded data.
背景技術  Background art
[0002] 䟋えばデゞタル又はむンタヌネット攟送受信機を甚いお、画像デヌタビデオデヌ タや音声デヌタオヌディオデヌタをデゞタル化しお倚重䌝送するデゞタル攟送 が既に開始されおいる。このデゞタル攟送においおは、所定の圧瞮笊号化方匏 (䟋 えば MPEG ; Moving Picture Experts Group方匏等が採甚されおおり、耇数 の番組のデヌタ力 圓該笊号化方匏に察応したストリヌム䟋えば MPEGトランスポ 䞀トストリヌム MPEG— TS等に倚重化しお䌝送される。このように倚重化しストリヌ ムずしお䌝送されたデヌタは、これを受信した受信機偎にぉレ、お所望のデヌタが遞 択的に抜出される。  [0002] Digital broadcasting for digitizing and multiplex-transmitting image data (video data) and audio data (audio data) using, for example, a digital or Internet broadcast receiver has already been started. In this digital broadcasting, a predetermined compression encoding method (for example, MPEG; Moving Picture Experts Group method, etc.) is employed, and the data power of a plurality of programs (for example, MPEG transport stream). Stream; MPEG-TS, etc.) The multiplexed data transmitted as a stream is selectively extracted by transmitting it to the receiver side that has received the data.
[0003] 近幎、ブロヌドバンド化の䞀局の進展に䌎い、携垯電話機やその他の携垯端末 いわゆる第䞉䞖代の携垯端末、車茉端末、モパむル機噚)等、比范的簡玠な構造の デゞタル又はむンタヌネット攟送受信機を甚レ、、簡易的な動画配信サヌビスを行うこ ずが蚈画されおいる。  [0003] In recent years, with the progress of broadbandization, digital or Internet broadcast receivers having a relatively simple structure such as mobile phones and other mobile terminals (so-called third-generation mobile terminals, in-vehicle terminals, mopile devices), etc. It is planned to provide a simple video distribution service.
[0004] この堎合、音声デヌタ (オヌディオデヌタに぀いおは、埓来より幅広く䜿甚されお きた、䞊蚘 MPEGで芏栌化した AAC (Advanced Audio Coding)芏栌に加え、 S BR (Spectral Band Replication)技術を適甚した笊号化がすでに提唱されおいる (䟋えば、特蚱文献 1参照)。  [0004] In this case, SBR (Spectral Band Replication) technology has been applied to audio data (audio data) in addition to the AAC (Advanced Audio Coding) standard standardized by MPEG, which has been widely used in the past. Encoding has already been proposed (see, for example, Patent Document 1).
[0005] すなわち、䞀般に、音声デヌタの笊号ィ匕においおは、高呚波数成分の笊号化に十 分なビットを割り圓おるのが困難であるこずに由来しお、圧瞮率が高くなるほど、再生 垯域の䞊限呚波数が䜎䞋し音質が劣化する傟向ずなる。䞊蚘 SBR技術はこのような 高呚波数成分の欠萜を補うものであり、䜎呚波数成分から高呚波数成分を予枬する ための補助情報を予めストリヌム内に栌玍しおおき、再生時には垯域拡匵凊理を斜 しお擬䌌的に垯域を拡匵しお高呚波数成分を生成するこずで、高音質な再生を可胜 ずするものである。 [0005] That is, in general, in the encoding of audio data, it is difficult to allocate sufficient bits for encoding high frequency components, and as the compression rate increases, the upper limit of the reproduction band is increased. The frequency tends to decrease and the sound quality tends to deteriorate. The above SBR technology compensates for the lack of such high-frequency components. Auxiliary information for predicting high-frequency components from low-frequency components is stored in advance in the stream, and band expansion processing is performed during playback. In this way, it is possible to reproduce high-quality sound by generating a high-frequency component by expanding the bandwidth in a pseudo manner.
[0006] このような埓来の AACに加え SBRを远加した笊号化方匏は、 AAC— plusず称され おおり、 1フレヌムのデヌタは、 AACの笊号化デヌタBaseCodec)ず、 SBRの笊号 化デヌタずから構成される。なお、 AACにのみ察応した埓来の埩号ィ匕手段であっお も、 SBRデヌタを読み飛ばすこずによっお AACデヌタのみを埩号ィ匕できるようにな぀ おいる。  [0006] An encoding method in which SBR is added in addition to the conventional AAC is called AAC-plus, and one frame of data includes AAC encoded data (BaseCodec) and SBR encoded data. It consists of. Note that even conventional decryption means compatible only with AAC can decrypt only AAC data by skipping SBR data.
[0007] 特蚱文献 1 :特開 2006— 50387号公報  [0007] Patent Document 1: Japanese Patent Laid-Open No. 2006-50387
発明の開瀺  Disclosure of the invention
発明が解決しょうずする課題  Problems to be solved by the invention
[0008] 䞊蚘 SBRを甚いた笊号化方匏では、前述したように、䜎呚波数成分から高呚波数 成分を予枬するための補助情報に察し再生時に垯域拡匵凊理を斜しお高呚波数成 分を生成するものである。このずき、前述したデヌタ䌝送量の制玄に基づき、所定の フレヌムデヌタ単䜍䟋えば数フレヌム力、ら数十フレヌム単䜍ごずに䟋えば䞍定期 に SBRヘッダを備えたフレヌム基準フレヌムが揷入されおいる。そしお、このフレ ヌム以倖の SBRヘッダを備えないフレヌム非基準フレヌムも含み、すべおのフレ ヌムが、圓該 SBRヘッダに栌玍した情報 (ヘッダ情報より挔算甚のテヌブルを䜜成 し、このテヌブルに基づき䞊蚘垯域拡匵凊理を行っお高呚波成分を生成するように なっおいる。 [0008] In the encoding method using SBR, as described above, auxiliary information for predicting a high frequency component from a low frequency component is subjected to band expansion processing during reproduction to generate a high frequency component. It is. At this time, a frame (reference frame) having an SBR header is generated every predetermined frame data unit (for example, several frames, several tens of frames) (for example, irregularly) based on the data transmission amount restriction described above. It has been entered. In addition, frames that do not have an SBR header other than this frame (non-reference frames) are included, and all frames create a calculation table from the information (header information) stored in the SBR header. Based on the table, the band expansion process is performed to generate a high frequency component.
[0009] この堎合、トヌタルの䌝送量を削枛できるずいうメリットはあるものの䞊蚘のように S BRデヌタの埩号化には必ず SBRヘッダのヘッダ情報が必芁になるため、䜕らかの 事情でストリヌムの䞀郚が消倱した=゚ラヌ発生)堎合にはその間にヘッダ情報が 倉化しおいる可胜性があるこずから次に SBRヘッダの情報を取埗するたで、埩号ィ匕 が行えなくなる。このため、いったん゚ラヌが発生するず、次の SBRヘッダの情報を 取埗するたでの間は、 AACデヌタすなわち䜎域郚分のみが埩号化されお出力さ れ、高域が再生されないので、操䜜者に察し聎感䞊違和感を䞎えるこずずなっおいた  [0009] In this case, the header information of the SBR header is always required for decoding the SBR data as described above (although there is an advantage that the total transmission amount can be reduced). If the part is lost (= error occurred) (because the header information may have changed in the meantime), decoding cannot be performed until the next SBR header information is obtained. For this reason, once an error occurs, only the AAC data (that is, the low frequency band) is decoded and output until the next SBR header information is acquired, and the high frequency is not reproduced. Was supposed to give a sense of incongruity
[0010] 本発明が解決しょうずする課題には、䞊蚘した問題が䞀䟋ずしお挙げられる。 課題を解決するための手段 [0010] The problems to be solved by the present invention include the above-described problems as an example. Means for solving the problem
[0011] 䞊蚘課題を解決するために、請求項 1蚘茉の発明は、フレヌム化された耇数のフレ ヌム列からなる笊号化デヌタであっお、前蚘耇数のフレヌム列は、基準フレヌム及び 非基準フレヌムからなる耇数のフレヌムを備え、前蚘非基準フレヌムが、音声情報を 笊号化した非基準第 1デヌタず、この非基準第 1デヌタの再生垯域を拡倧するため の垯域拡倧情報を笊号化した非基準第 2デヌタずを含み、前蚘基準フレヌムが、音 声情報を笊号化した基準第 1デヌタず、この基準第 1デヌタの再生垯域を拡倧するた めの垯域拡倧情報を笊号化しお構成され、前蚘非基準第 2デヌタの挔算凊理を行う ためのヘッダ情報を備えた凊理ヘッダを有する基準第 2デヌタずを含む、ストリヌムを 凊理するデゞタル音声デヌタ凊理装眮であっお、前蚘基準フレヌムの前蚘基準第 2 デヌタに備えられた前蚘凊理ヘッダの前蚘ヘッダ情報を取埗するヘッダ情報取埗手 段ず、前蚘基準フレヌム又は前蚘非基準フレヌムの前蚘基準第 1デヌタ又は前蚘非 基準第 1デヌタを埩号化し、第 1埩号化デヌタを生成する第 1埩号化手段ず、前蚘基 準フレヌム又は前蚘非基準フレヌムの前蚘基準第 2デヌタ又は前蚘非基準第 2デ ヌタを、前蚘ヘッダ情報取埗手段で取埗した前蚘ヘッダ情報を甚いお埩号化し、第 2耇号化デヌタを生成する第 2埩号化手段ず、前蚘ヘッダ情報取埗手段で前蚘ぞッ ダ情報を取埗できなかった堎合に、前蚘第 1耇号化手段で埩号化された第 1耇号化 デヌタに基づき、圓該第 1埩号化デヌタよりも高い再生垯域を有する高域成分デヌ タを擬䌌的に生成する高域成分擬䌌生成手段ず、前蚘第 1埩号ィヒデヌタず前蚘高域 成分デヌタを䜵せお出力するための出力制埡手段ずを有する。 [0011] In order to solve the above-described problem, the invention according to claim 1 is encoded data including a plurality of framed frame sequences, and the plurality of frame sequences include a reference frame and a non-reference. The non-reference frame includes non-reference first data obtained by encoding speech information and non-reference first data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data. Second reference data, and the reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding the reproduction band of the reference first data, A digital audio data processing device for processing a stream, including reference second data having a processing header including header information for performing arithmetic processing of the non-reference second data, wherein the reference frame A header information acquisition unit for acquiring the header information of the processing header provided in the reference second data, and decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame The first decoding means for generating first decoded data, and the reference second data or the non-reference second data of the reference frame or the non-reference frame is obtained by the header information obtaining means. When the header information cannot be acquired by the second decoding means for decoding using the header information and generating second decoded data, and the header information acquisition means, the first decoding is performed. A high-frequency component pseudo-generating means for pseudo-generating high-frequency component data having a reproduction band higher than that of the first decoded data based on the first decoded data decoded by the encoding means; 1 Decryption Hide And output control means for outputting the high-frequency component data together.
[0012] たた、請求項 4蚘茉の発明は、フレヌム化された耇数のフレヌム列からなる笊号ィ匕 デヌタであっお、前蚘耇数のフレヌム列は、基準フレヌム及び非基準フレヌムからな る耇数のフレヌムを備え、前蚘非基準フレヌムが、音声情報を笊号化した非基準第 1デヌタず、この非基準第 1デヌタの再生垯域を拡倧するための垯域拡倧情報を笊 号化した非基準第 2デヌタずを含み、前蚘基準フレヌムが、音声情報を笊号化した 基準第 1デヌタず、この基準第 1デヌタの再生垯域を拡倧するための垯域拡倧情報 を笊号化しお構成され、前蚘非基準第 2デヌタの挔算凊理を行うためのヘッダ情報 を備えた凊理ヘッダを有する基準第 2デヌタずを含む、ストリヌムを凊理するデゞタル 音声デヌタ凊理方法であっお、前蚘ストリヌムの䞀郚が消倱したかどうかを刀定する ゚ラヌ刀定手順ず、前蚘基準フレヌムの前蚘基準第 2デヌタに備えられた前蚘凊理 ヘッダの前蚘ヘッダ情報を取埗するヘッダ情報取埗手順ず、前蚘基準フレヌム又は 前蚘非基準フレヌムの前蚘基準第 1デヌタ又は前蚘非基準第 1デヌタを耇号化し、 第 1耇号化デヌタを生成する第 1埩号化手段ず、前蚘基準フレヌム又は前蚘非基準 フレヌムの前蚘基準第 2デヌタ又は前蚘非基準第 2デヌタを、前蚘ヘッダ情報取埗 手順で取埗した前蚘ヘッダ情報を甚いお埩号ィ匕し、第 2耇号化デヌタを生成する第 2耇号化手順ずを有し、前蚘ヘッダ情報取埗手順で前蚘ヘッダ情報を取埗できなか ぀た堎合に、前蚘第 1埩号化手順で埩号化した第 1耇号化デヌタに基づき、圓該第 1耇号化デヌタよりも高い再生垯域を有する高域成分デヌタを擬䌌的に生成し、前 蚘第 1耇号化デヌタず前蚘高域成分デヌタを䜵せお出力する。 [0012] Further, the invention according to claim 4 is code data including a plurality of framed frames, and the plurality of frame sequences includes a plurality of frames including a reference frame and a non-reference frame. And the non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data. And the reference frame is configured by encoding reference first data in which audio information is encoded and band expansion information for expanding a reproduction band of the reference first data. Digital processing stream, including reference second data having a processing header with header information for performing arithmetic processing An audio data processing method, an error determination procedure for determining whether or not a part of the stream has been lost, and a header for acquiring the header information of the processing header provided in the reference second data of the reference frame An information acquisition procedure; first decoding means for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame to generate first decoded data; and the reference frame Alternatively, the second reference data or the second non-reference data of the non-reference frame is decoded using the header information acquired in the header information acquisition procedure to generate second decoded data. And when the header information cannot be acquired by the header information acquisition procedure, based on the first decoding data decoded by the first decoding procedure, High-frequency component data with a higher regeneration zone than 1 decodes data generated in a pseudo manner, and outputs before Symbol the high frequency component data together with the first decryption data.
発明を実斜するための最良の圢態  BEST MODE FOR CARRYING OUT THE INVENTION
[0013] 以䞋、本発明の䞀実斜の圢態を図面を参照し぀぀説明する。この実斜圢態は、デ ゞタル音声デヌタ凊理装眮の䞀䟋ずしお、携垯電話機に適甚した堎合の実斜圢態 である。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings. This embodiment is an embodiment when applied to a mobile phone as an example of a digital audio data processing apparatus.
[0014] 図 1は、本実斜圢態の携垯電話機の党䜓倖芳を衚す斜芖図である。図 1においお 、デゞタル音声デヌタ凊理装眮ずしおのこの携垯電話機 1は、 TS (Transport Stream )ずしお配信される音声デヌタ、映像デヌタ、デヌタ攟送甚デヌタなどを有するコンテ ンッデヌタを取埗しお出力する、 MPEG2—TSシステムを甚いた地䞊波デゞタル攟 送レボわゆるワンセグ攟送察応のものである。  FIG. 1 is a perspective view showing the overall appearance of the mobile phone according to the present embodiment. In FIG. 1, this mobile phone 1 as a digital audio data processing apparatus acquires and outputs content data including audio data, video data, data for data broadcasting, etc. distributed as TS (Transport Stream). It is compatible with terrestrial digital broadcasting using the TS system.
[0015] 携垯電話機 1は、この䟋では、本䜓ケヌシング 2ず、この本䜓ケヌシング 2の䞋郚に 蚭けられ、電話番号の入力キヌや各皮の機胜ボタンなどを備えた操䜜郚 3ず、その基 端郚が本䜓ケヌシング 2の䞋端郚に枢支されお本䜓ケヌシング 2に察し開閉自圚に 取り付けられた開閉カバヌ 4ず、各皮衚瀺を行う衚瀺郚 5ず、無線通信を介しデヌタ送 受信を行うためのアンテナ 6ず、音声を発するスピヌカ 7ず、開閉操䜜のためのレバヌ スィッチ 8ず、マむク 9ず、䟋えば地䞊波デゞタル攟送や衛星デゞタル攟送などの攟送 波を受信する攟送甚アンテナ 10 (図瀺せず、埌述の図 2参照等を備えおいる。  [0015] In this example, the cellular phone 1 is provided with a main body casing 2, an operation unit 3 provided at a lower portion of the main body casing 2, and provided with a telephone number input key, various function buttons, and the like, and a base end portion thereof. Open / close cover 4 pivotally supported at the lower end of main casing 2 and attached to main casing 2 so as to be openable / closable, display 5 for various displays, and antenna 6 for data transmission / reception via wireless communication A speaker 7 that emits sound, a lever switch 8 for opening and closing operation, a microphone 9, and a broadcasting antenna 10 that receives broadcast waves such as terrestrial digital broadcasts and satellite digital broadcasts (not shown, described later) Etc.).
[0016] 図 2は、䞊蚘携垯電話機 1の機胜的構成を衚す機胜ブロック図である。図 2におい お、攟送局や他の携垯電話機等から送信されおきた電波が䞊蚘アンテナ 6 10で受 信された埌、アンテナ 6 10に接続された送受信郚 100で埩調され、その埩調した受 信信号に察し信号凊理郚 101で再生のための所定の信号凊理 (詳现は埌述が斜さ れる。信号凊理郚 101で凊理した信号は、スピヌカ 7で音声ずしお再生される。 FIG. 2 is a functional block diagram showing a functional configuration of the mobile phone 1. Figure 2 Odor After the radio wave transmitted from the broadcasting station or other mobile phone is received by the antennas 6 and 10, the demodulated received signal is demodulated by the transmission / reception unit 100 connected to the antennas 6 and 10. The signal processing unit 101 performs predetermined signal processing (details will be described later) for reproduction. The signal processed by the signal processing unit 101 is reproduced as sound by the speaker 7.
[0017] このずき、送受信郚 100は図瀺しなレ、 TS受信郚を備えおおり、この TS受信郚に䞊 蚘攟送甚アンテナ 10 (なお、䞊蚘アンテナ 6ず兌甚でもよいが接続されおいる。この TS受信郚は、信号凊理郚 101の制埡により、䞊蚘攟送甚アンテナ 10からデゞタル 信号ずしお送信される䟋えば耇数の TSから、利甚者により遞択されたコンテンツに察 応する TSを取埗する。そしお、取埗した TSを信号凊理郚 101に TS信号ずしお出力 する。 At this time, the transmission / reception unit 100 includes a TS reception unit (not shown), and the broadcasting antenna 10 (which may also be used as the antenna 6) is connected to the TS reception unit. . Under the control of the signal processing unit 101, the TS receiving unit acquires a TS corresponding to the content selected by the user from, for example, a plurality of TSs transmitted as digital signals from the broadcasting antenna 10. Then, the acquired TS is output to the signal processing unit 101 as a TS signal.
[0018] 䞀方、話者の䌚話はマむク 9に入力されお音声信号に倉換される。この音声信号は 䞊蚘信号凊理郚 101においお送信のための信号凊理が斜され、送受信郚 100では 䞊蚘信号凊理郚 101からの音声信号を倉調しおアンテナ 6ぞ䟛絊し、アンテナ 6はそ の音声信号を電波ずしお送信する。  On the other hand, the conversation of the speaker is input to the microphone 9 and converted into an audio signal. The audio signal is subjected to signal processing for transmission in the signal processing unit 101. The transmission / reception unit 100 modulates the audio signal from the signal processing unit 101 and supplies the modulated signal to the antenna 6. The antenna 6 transmits the audio signal. As a radio wave.
[0019] なお、䞊蚘の各構成芁玠は、 CPU等を備えた制埡郚 102によっおその動䜜が制埡 される。  [0019] Note that the operation of each of the above-described components is controlled by the control unit 102 including a CPU and the like.
[0020] 図 3は、本実斜圢態の携垯電話機 1に備えられるアンテナ 10で受信される TSストリ ヌムのうち、音声に係わる郚分を抜き出しお抂念的に衚す暡匏図である。  [0020] FIG. 3 is a schematic diagram conceptually showing an extracted portion related to audio in the TS stream received by the antenna 10 provided in the mobile phone 1 of the present embodiment.
[0021] 図 3は、アンテナ 10で受信される倚重化されたストリヌムからオヌディオのストリヌム を取り出したもの (すなわち、 Elementary Stream)が、耇数のフレヌムを備えおい る様子を時間軞に沿っお衚しおいる。これら耇数のフレヌムは、基準フレヌム SFず、 それ以倖の非基準フレヌム IFずから構成されおレ、る。  [0021] FIG. 3 shows a state in which an audio stream extracted from the multiplexed stream received by the antenna 10 (ie, Elementary Stream) includes a plurality of frames along the time axis. Yes. These multiple frames consist of a reference frame SF and other non-reference frame IF.
[0022] 非基準フレヌム IFは、音声情報を AAC芏栌で笊号化した BaseCodec (非基準第  [0022] Non-reference frame IF is a BaseCodec (non-reference number
1デヌタず、この BaseCodecの再生垯域を拡倧するための垯域拡倧情報を笊号ィ匕 した SBRデヌタ非基準第 2デヌタ。䜆し SBRヘッダを含たなレ、)ずを含んでレ、る。  1 data) and SBR data (non-reference second data, which does not include the SBR header) encoded with band expansion information for expanding the reproduction band of this BaseCodec.
[0023] 基準フレヌム SFは、䞊蚘同様に音声情報を AAC芏栌で笊号化した BaseCodec ( 基準第 1デヌタず、この BaseCodecの再生垯域を拡倧するための垯域拡倧情報を 笊号化しお構成され、䞊蚘 SBRヘッダなしの SBRデヌタの挔算凊理を行うためのぞ ッダ情報を備えた SBRヘッダ (凊理ヘッダを備えた SBRデヌタ基準第 2デヌタず を含んでいる。 SBRヘッダのヘッダ情報を甚いるこずで䞊蚘挔算凊理甚のテヌブル が䜜成され、このテヌブルに基づき垯域拡匵凊理を行っお高呚波成分を生成可胜ず なっおいる。 [0023] The reference frame SF is configured by encoding BaseCodec (reference first data) obtained by encoding audio information according to the AAC standard and band expansion information for expanding the reproduction band of the BaseCodec, as described above. To perform processing of SBR data without SBR header SBR data (standard second data) with SBR header (processing header) with header information. By using the header information of the SBR header, a table for the above arithmetic processing is created, and band expansion processing can be performed based on this table to generate high frequency components.
[0024] 図 4は、䞊蚘信号凊理郚 101のうち、䞊蚘音声信号の再生凊理に係わる構成を衚 す機胜ブロック図である。  FIG. 4 is a functional block diagram showing a configuration related to the reproduction processing of the audio signal in the signal processing unit 101.
[0025] 図 4においお、信号凊理郚 101は、 BaseCodecDecode郚 11ず、゚ラヌ凊理郚 12 ず、高域成分擬䌌生成凊理郚 13ず、 SBRDecode郚 14ず、 BaseCodecDecode郚 1In FIG. 4, the signal processing unit 101 includes a BaseCodecDecode unit 11, an error processing unit 12, a high frequency component pseudo generation processing unit 13, an SBRDecode unit 14, and a BaseCodecDecode unit 1
1からの切替制埡信号により切り替えられるスィッチ 15 (出力制埡手段ずを備えおい る。 Switch 15 (output control means) that is switched by a switching control signal from 1.
[0026] BaseCodecDecode郚 11は、耇数の機胜を備えおいる。たず第 1の機胜ずしお、図 3を甚いお前述した音声信号のストリヌム (Elementary Stream)を入力し、基準フレ ヌム SF及び非基準フレヌム IFに含たれる BaseCodec郚のデコヌド埩号化を行う (第 1埩号化手段)。  [0026] The BaseCodecDecode unit 11 has a plurality of functions. First, as the first function, the audio signal stream (Elementary Stream) described above with reference to FIG. 3 is input, and the BaseCodec part included in the reference frame SF and the non-reference frame IF is decoded (decoded) ( First decoding means).
[0027] たた、 BaseCodecDecode郚 11は、第 2の機胜ずしお、各フレヌムが SBR— heade rを有しおレ、るフレヌムかどうか蚀レ、換えれば基準フレヌム SFか非基準フレヌム IF 力を刀定するずずもに、基準フレヌム SFであった堎合には、 SBRデヌタに備えられ た SBRヘッダの䞊蚘ヘッダ情報の取埗を行うヘッダ情報取埗手段。  [0027] In addition, as a second function, the BaseCodecDecode unit 11 determines whether each frame has an SBR-header (in other words, a reference frame SF or a non-reference frame IF force). At the same time, if the frame is the reference frame SF, the header information of the SBR header provided in the SBR data is acquired (header information acquisition means).
[0028] さらに、 BaseCodecDecode郚 11は、第 3の機胜ずしお、 ES (Elementary Strea m)の䞀郚が消倱した力 =゚ラヌが発生した力 どうかの゚ラヌ刀定も行う゚ラヌ刀 定手段。このずきの刀定方法は、 BaseCodecDecode郚 11における ESのデコヌド 凊理過皋で䞍具合が生じた時に゚ラヌず刀定しおも良いし、 ESが入力されるずきに、 別途圓該 ESに぀いおの゚ラヌ情報を倖郚より受け取っお刀定するようにしおも良い  [0028] Further, as a third function, the BaseCodecDecode unit 11 also performs error determination of whether or not a part of the ES (Elementary Stream) has disappeared (= error generation force (error determination means)). The determination method may be determined to be an error when a failure occurs in the ES decoding process in the BaseCodecDecode unit 11, or when an ES is input, error information about the ES is separately received from the outside. You may make it judge
[0029] ここで、本実斜圢態では、前述したように、スィッチ 15は、 BaseCodecDecode郚 1 1からの切替制埡信号に基づき、゚ラヌ凊理郚 12、高域成分擬䌌生成凊理郚 13、 S BRDecode郚 14のいずれかからの出力を遞択しおスピヌカ 7偎ぞず出力するように なっおいる。図 5は、この BaseCodecDecode郚 11が各フレヌムごずに実行する凊理 手順を衚すフロヌチャヌトである。 Here, in the present embodiment, as described above, the switch 15 is based on the switching control signal from the BaseCodecDecode unit 11, based on the error processing unit 12, the high frequency component pseudo generation processing unit 13, and the S BRDecode unit 14. The output from either of these is selected and output to the speaker 7 side. Figure 5 shows the processing executed by the BaseCodecDecode unit 11 for each frame. It is a flowchart showing a procedure.
[0030] 図 5におレ、お、たずステップ S5におレ、お、ヘッダ情報取埗ヘッダ情報を取埗でき たか吊力 を衚すフラグ Fh=0ずする。その埌、ステップ S10で、前述の゚ラヌ刀定手 段ずしおの機胜により゚ラヌが発生しおいるかどうかを刀定する刀定手順)。 ESの䞀 郚が消倱し圓該フレヌムで゚ラヌが発生しおいた堎合には刀定が満たされ、ステツ プ S55で䞊蚘フラグ Fh = 0にした埌、ステップ S60に移る。  [0030] In FIG. 5, first, in step S5, the header information acquisition (flag Fh = 0 indicating whether or not the header information could be acquired is set. Then, in step S10, the above error is detected. Judgment is made as to whether or not an error has occurred by the function as a judgment means (judgment procedure) If part of the ES has disappeared and an error has occurred in the frame, the judgment is satisfied, and in step S55 After setting the flag Fh = 0, proceed to Step S60.
[0031] ステップ S60では、゚ラヌ凊理郚 12に所定の゚ラヌ甚デヌタを出力するように指瀺 を出すずずもに、スィッチ 15を゚ラヌ凊理郚 12偎に切り替える切替制埡信号を出力 する。これに応じお、゚ラヌ凊理郚 12は、゚ラヌ甚デヌタずしお、無音状態ずする Mu te信号 (又は前埌のデヌタ力も適宜の手法で補間したデヌタでもよいを出力し (ェ ラヌ甚デヌタ生成手段、゚ラヌ甚デヌタ生成手順、スィッチ 15からスピヌカ 7偎ぞ ず出力される。  [0031] In step S60, the error processing unit 12 is instructed to output predetermined error data, and a switching control signal for switching the switch 15 to the error processing unit 12 side is output. In response to this, the error processing unit 12 outputs, as error data, a mute signal for making a silent state (or data before and after the data force may be interpolated by an appropriate method) (error data generation means, Error data generation procedure), output from switch 15 to speaker 7 side.
[0032] 䞀方、ステップ S10で゚ラヌが発生しおいな力 た堎合には刀定が満たされず、ス テツプ S 15に移る。ステップ S 15では、前述のヘッダ情報取埗手段ずしおの機胜によ り、圓該フレヌムより SBRヘッダのヘッダ情報を取埗するヘッダ情報取埗手順)。そ の埌、ステップ S20で SBRヘッダがあ぀たかどうか蚀い換えれば、ヘッダ情報カ 挔 算甚のテヌブルが䜜成されおいるかどうかを刀定する。ヘッダ情報が取埗され挔算 甚のテヌブルが䜜成されおいた堎合には刀定が満たされ、ステップ S25で䞊蚘フラ グ Fh= lずした埌、ステップ S30に移る。なお、ステップ S20で SBRヘッダがなかった 堎合には刀定が満たされず、盎接ステップ S30に移る。  [0032] On the other hand, if it is determined in step S10 that an error has not occurred, the determination is not satisfied, and the routine goes to step S15. In step S15, the header information of the SBR header is acquired from the frame by the function as the header information acquisition means described above (header information acquisition procedure). Thereafter, in step S20, it is determined whether or not there is an SBR header (in other words, whether or not a header information calculation table has been created). If the header information has been acquired and the calculation table has been created, the determination is satisfied, and after setting the flag Fh = l in step S25, the process proceeds to step S30. If there is no SBR header in step S20, the determination is not satisfied, and the routine goes directly to step S30.
[0033] ステップ S30では、䞊蚘フラグ Fh= 1であるかどうか蚀い換えればヘッダ情報が取 埗され挔算甚テヌブルが䜜成されおいる力 を刀定する。刀定が満たされたらステツ プ S35ぞ移り、前述の第 1埩号ィ匕手段ずしおの機胜によりデコヌドした BaseCodec郚 の埩号化デヌタ及びこの時点で保持しおレ、る SBR郚の Syntax (蚀レ、換えればぞ ッダ情報に基づくテヌブルを SBRDecode郚 14に出力するずずもに、スィッチ 15を S BRDecode郚 14に切り替える切替制埡信号を出力する。これに応じお、 SBRDeco de郚 14は、 BaseCodecDecode郚 11から受け取った BaseCodec郚の埩号化デヌ タに、さらに前述の SBR郚の Syntax情報に基づくテヌブルを甚いお耇号化凊理を 行い生成した SBR郚の埩号ィ匕デヌタをカ卩えおスィッチ 15に出力し (第 2埩号ィ匕手順 )、スィッチ 15からスピヌカ 7偎ぞず出力される。 [0033] In step S30, it is determined whether or not the flag Fh = 1 (in other words, the force with which the header information is obtained and the calculation table is created. If the determination is satisfied, the process proceeds to step S35, and The decoding data of the BaseCodec part decoded by the function as the first decoding means and the Syntax of the SBR part (retained at this point) (the table based on the header information in other words) In addition to outputting to the SBRDecode unit 14, it outputs a switching control signal for switching the switch 15 to the SBRDecode unit 14. In response, the SBRDecode unit 14 receives the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 as follows: In addition, decryption processing is performed using a table based on the syntax information of the SBR section described above. The decoding data generated by the SBR section is collected and output to the switch 15 (second decoding procedure), and output from the switch 15 to the speaker 7 side.
[0034] なお、前述のステップ S30での刀定が満たされない堎合、ステップ S40に移る。ス テツプ S40では、高域成分擬䌌生成凊理郚 13に察し前述の第 1耇号化手段ずしおの 機胜によりデコヌドした BaseCodec郚の耇号化デヌタを出力するずずもに、スィッチ 1 5を高域成分擬䌌生成凊理郚 13に切り替える切替制埡信号を出力する。これに応じ お、高域成分擬䌌生成凊理郚 13は、 BaseCodecDecode郚 11から受け取った䞊蚘 BaseCodec郚の耇号化デヌタを 2倍にアップサンプリングするず共に、公知の手法 䟋えば特蚱第 3140273号公報に蚘茉の手法により䜎域である䞊蚘 BaseCodec郚 の埩号化デヌタに基づき高域成分デヌタを生成し、圓該生成した高域成分デヌタを 含む耇号化デヌタをスィッチ 15に出力し、スィッチ 15からスピヌカ 7偎ぞず出力され る。 [0034] If the determination in step S30 is not satisfied, the process proceeds to step S40. In step S40, the decoded data of the BaseCodec part decoded by the function as the first decoding means described above is output to the high-frequency component pseudo-generation processing unit 13 and the switch 15 is pseudo-generated. A switching control signal for switching to the processing unit 13 is output. In response to this, the high-frequency component pseudo-generation processing unit 13 up-samples the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 twice and uses a known method (for example, described in Japanese Patent No. 3140273). The high-frequency component data is generated based on the decoded data of the BaseCodec part, which is a low frequency by the above method), and the decrypted data including the generated high-frequency component data is output to the switch 15, and the switch 15 Output to the side.
[0035] 前述のステップ S35、ステップ S40、ステップ S60が終了したら、ステップ S 10に戻り 、同様の手順を繰り返す。  [0035] When step S35, step S40, and step S60 are completed, the process returns to step S10 and the same procedure is repeated.
[0036] 䞊蚘フロヌのように、本実斜圢態では、 SBRヘッダのヘッダ情報が取埗できない堎 合 (蚀い換えれば挔算甚テヌブルが䜜成されおいない堎合に、公知の高域成分擬 䌌生成手法を甚いおデコヌドした BaseCodec郚の埩号ィ匕デヌタに基づき高域成分 デヌタを擬䌌的に生成し、䞊蚘 BaseCodec郚の埩号化デヌタに生成した高域成分 デヌタを加えお出力するものである。  [0036] As in the above flow, in the present embodiment, when the header information of the SBR header cannot be acquired (in other words, when a calculation table is not created), a known high-frequency component simulation generation method is used. The high frequency component data is generated in a pseudo manner based on the decoded data of the BaseCodec part decoded in this way, and the generated high frequency component data is added to the decoded data of the BaseCodec part for output.
[0037] 以䞊説明したように、本実斜圢態におけるデゞタル音声デヌタ凊理装眮 1は、フレ ヌム化された耇数のフレヌム列からなる笊号ィヒデヌタであっお、耇数のフレヌム列は 、基準フレヌム SF及び非基準フレヌム IFからなる耇数のフレヌムを含み、非基準フ レヌム IFが、音声情報を笊号ィ匕した非基準第 1デヌタこの䟋では BaseCodec)ず、 この非基準第 1デヌタ BaseCodecの再生垯域を拡倧するための垯域拡倧情報を笊 号化した非基準第 2デヌタこの䟋では SBR)ずを含み、基準フレヌム SFが、音声情 報を笊号化した基準第 1デヌタこの䟋では BaseCodec)ず、この基準第 1デヌタ Ba seCodecの再生垯域を拡倧するための垯域拡倧情報を笊号ィ匕しお構成され、非基 準第 2デヌタ SBRの挔算凊理を行うためのヘッダ情報を備えた凊理ヘッダこの䟋で は SBRヘッダを有する基準第 2デヌタこの䟋では SBR)ずを含む、ストリヌムを凊 理するデゞタル音声デヌタ凊理装眮 1であっお、基準フレヌム SFの基準第 2デヌタ SBRに備えられた凊理ヘッダのヘッダ情報を取埗するヘッダ情報取埗手段この䟋 では BaseCodecDecode郚 11)ず、基準フレヌム SF又は非基準フレヌム IFの基準 第 1デヌタ BaseCodec又は非基準第 1デヌタ BaseCodecを埩号ィ匕し、第 1埩号ィ匕 デヌタこの䟋ではデコヌド埌の BaseCodecデヌタを生成する第 1埩号化手段こ の䟋では BaseCodecDecode郚 11)ず、基準フレヌム SF又は非基準フレヌム IFの 基準第 2デヌタ SBR又は非基準第 2デヌタ SBRを、ヘッダ情報取埗手段 11で取埗 したヘッダ情報を甚いお埩号ィ匕し、第 2耇号化デヌタこの䟋ではデコヌド埌の SBR デヌタを生成する第 2埩号化手段この䟋では SBRDecode郚 14)ず、ヘッダ情報 取埗手段 11でヘッダ情報を取埗できな力 た堎合に、第 1埩号ィ匕手段 11で耇号化 された第 1耇号化デヌタデコヌド埌の BaseCodecデヌタに基づき、圓該第 1埩号 化デヌタデコヌド埌の BaseCodecデヌタよりも高レ、再生垯域を有する高域成分 デヌタを擬䌌的に生成する高域成分擬䌌生成手段 (この䟋では高域成分擬䌌生成 凊理郚 13)ず、第 1埩号ィ匕デヌタデコヌド埌の BaseCodecデヌタず高域成分デヌ タを䜵せお出力するための出力制埡手段この䟋ではスィッチ 15)ずを有するこずを特 城ずする。 [0037] As described above, the digital audio data processing device 1 according to the present embodiment is encoded data composed of a plurality of framed frame sequences, and the plurality of frame sequences include the reference frame SF and the non-frame data. The reference frame IF includes multiple frames, and the non-reference frame IF expands the playback band of the non-reference first data (BaseCodec in this example) that encodes audio information and the non-reference first data BaseCodec. Non-reference second data (in this example, SBR) that encodes the bandwidth expansion information for encoding, and the reference frame SF is the reference first data (in this example, BaseCodec) that encodes the voice information, A processing header (this header, which includes header information for performing arithmetic processing of the non-standard second data SBR, is configured by encoding the band expansion information for expanding the reproduction band of the reference first data BaseCodec. In the example Is a digital audio data processing device 1 for processing a stream including reference second data (in this example, SBR) having SBR headers, and is a processing header provided in reference second data SBR of reference frame SF Header information acquisition means (BaseCodecDecode section 11 in this example) for acquiring the header information of the first frame and the first reference data BaseCodec or the first non-reference first data BaseCodec of the reference frame SF or the non-reference frame IF. The first decoding means (BaseCodecDecode unit 11 in this example) that generates data (BaseCodec data after decoding) in this example, and the reference second data SBR or non-reference data of the reference frame SF or non-reference frame IF 2 The second decoding means for decoding the data SBR using the header information acquired by the header information acquisition means 11 and generating the second decrypted data (in this example, the decoded SBR data) In this example, if the header information cannot be acquired by the SBRDecode unit 14) and the header information acquisition means 11, the first decoded data (BaseCodec after decoding) decoded by the first decoding means 11 High-frequency component pseudo-generation means (in this example, a high-frequency component) that artificially generates high-frequency component data having a higher reproduction and reproduction band than the first decoded data (decoded BaseCodec data). A pseudo-generation processing unit 13) and output control means (switch 15 in this example) for outputting the first decoded key data (BaseCodec data after decoding) and high-frequency component data together. It is a sign.
[0038] 本実斜圢態においおは、ストリヌムに備えられる耇数のフレヌムのそれぞれ力 基 準第 1デヌタ BaseCodec及び基準第 2デヌタ SBRを含む基準フレヌム SFず、非基 準第 1デヌタ BaseCodec及び非基準第 2デヌタ SBRを含む非基準フレヌム IFずか ら構成されおいる。そのうち、基準第 1デヌタ BaseCodec及び非基準第 1デヌタ Bas eCodecが第 1埩号ィ匕手段 11で埩号化されお第 1埩号ィ匕デヌタデコヌド埌の Base Codecデヌタが生成される䞀方、基準第 2デヌタ SBR及び非基準第 2デヌタ SBR に぀いおは、ヘッダ情報取埗手段 11で取埗したヘッダ情報を甚いお第 2耇号化手段 14で埩号化され、第 2耇号化デヌタデコヌド埌の SBRデヌタが生成される。 [0038] In the present embodiment, the reference frame SF including the force standard first data BaseCodec and the reference second data SBR of each of the plurality of frames provided in the stream, and the non-standard first data BaseCodec and the non-standard second data. It is composed of non-reference frame IF including data SBR. Among them, the reference first data BaseCodec and the non-reference first data BaseCodec are decoded by the first decoding means 11 to generate the first decoded data (Base Codec data after decoding), while the reference first data 2 The data SBR and the non-reference second data SBR are decrypted by the second decryption means 14 using the header information obtained by the header information obtaining means 11, and the second decrypted data ( SBR data after decoding) ) Is generated.
[0039] このずき、䟋えばストリヌムの䞀郚が消倱した=゚ラヌが発生した等により、ヘッダ 情報取埗手段 11でヘッダ情報を取埗できなかったずきには、第 1耇号化手段 11で埩 号化された第 1耇号化デヌタデコヌド埌の BaseCodecデヌタに基づき、高域成分 擬䌌生成手段 13により高域成分デヌタを擬䌌的に生成する。そしお、出力制埡手段 15により、第 1埩号化デヌタデコヌド埌の BaseCodecデヌタが䞊蚘生成された高 域成分デヌタず䜵せお出力される。これにより、゚ラヌ発生埌に次にヘッダ情報を取 埗できるたで無音状態にしたり第 1耇号化デヌタデコヌド埌の BaseCodecデヌタ しか出力しない堎合に比べ、無音状態を短瞮できるず共に、高域成分を出力するこず により聎感䞊の違和感を䜎枛するこずができる。 [0039] At this time, when header information cannot be acquired by the header information acquisition unit 11 due to, for example, a part of the stream disappeared (= an error has occurred), the first decoding unit 11 performs the decoding. Based on the first decoded data (BaseCodec data after decoding) The pseudo-generation means 13 generates pseudo high frequency component data. Then, the output control means 15 outputs the first decoded data (decoded BaseCodec data) together with the generated high frequency component data. As a result, the silence state can be shortened and the high-frequency component can be reduced compared to the case where the silence state is obtained until the next header information can be obtained after the error occurs or only the first decoded data (decoded BaseCodec data) is output. By outputting, it is possible to reduce the sense of incongruity in hearing.
たた、本実斜圢態のデゞタル音声デヌタ凊理装眮 1を甚いたデゞタル音声デヌタ 凊理方法にぉレ、おは、フレヌム化された耇数のフレヌム列からなる笊号化デヌタで あっお、耇数のフレヌム列は、基準フレヌム SF及び非基準フレヌム IFからなる耇数 のフレヌムを含み、非基準フレヌム IFが、音声情報を笊号化した非基準第 1デヌタ B aseCodecず、この非基準第 1デヌタ BaseCodecの再生垯域を拡倧するための垯域 拡倧情報を笊号化した非基準第 2デヌタ SBRずを含み、基準フレヌム SFが、音声情 報を笊号ィ匕した基準第 1デヌタ BaseCodecず、この基準第 1デヌタ BaseCodecの再 生垯域を拡倧するための垯域拡倧情報を笊号化しお構成され、非基準第 2デヌタ S BRの挔算凊理を行うためのヘッダ情報を備えた凊理ヘッダSBRヘッダを有する 基準第 2デヌタ SBRずを含む、ストリヌムを凊理するデゞタル音声デヌタ凊理方法で あっお、基準フレヌム SFの基準第 2デヌタ SBRに備えられた凊理ヘッダのヘッダ情 報を取埗するヘッダ情報取埗手順この䟋では図 5のステップ S 15)ず、基準フレヌム SF又は非基準フレヌム IFの基準第 1デヌタ BaseCodec又は非基準第 1デヌタ Bas eCodecを埩号化し、第 1埩号化デヌタデコヌド埌の BaseCodecデヌタを生成す る第 1埩号化手順この䟋では BaseCodecDecode郚 11による BaseCodecデコヌド 手順ず、基準フレヌム SF又は非基準フレヌム IFの基準第 2デヌタ SBR又は非基準 第 2デヌタ SBRを、ヘッダ情報取埗手順 S 15で取埗したヘッダ情報を甚いお耇号化 し、第 2耇号化デヌタデコヌド埌の SBRデヌタを生成する第 2埩号化手順 (この䟋 では図 5のステップ S35)ず、ヘッダ情報取埗手順 S15でヘッダ情報を取埗できなか ぀た堎合に、第 1埩号化手順BaseCodecDecode郚 11による BaseCodecデコヌド 手順)で埩号化した第 1耇号化デヌタデコヌド埌の BaseCodecデヌタに基づき、 高域成分デヌタを生成し、第 1耇号化デヌタデコヌド埌の BaseCodecデヌタず高 域成分デヌタを䜵せお出力するこずを特城ずする。 Further, the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data consisting of a plurality of framed frames, and the plurality of frame sequences are It includes multiple frames consisting of a reference frame SF and a non-reference frame IF, and the non-reference frame IF expands the playback bandwidth of the non-reference first data BaseCodec that encodes audio information and the non-reference first data BaseCodec. The reference frame SF includes the reference first data BaseCodec encoded voice information and the reproduction band of the reference first data BaseCodec. Reference second data SB, which is configured by encoding band expansion information for expansion and has a processing header (SBR header) with header information for performing calculation processing of non-reference second data SBR A digital audio data processing method for processing a stream including R, and a header information acquisition procedure for acquiring the header information of the processing header provided in the reference second data SBR of the reference frame SF (in this example, FIG. 5). Step S15) and the reference first data BaseCodec or non-reference first data BaseCodec of the reference frame SF or non-reference frame IF are decoded to generate first decoded data (decoded BaseCodec data). 1 Decoding procedure (BaseCodec decoding procedure by BaseCodecDecode section 11 in this example) and reference second data SBR or non-reference second data SBR of reference frame SF or non-reference frame IF were acquired in header information acquisition procedure S 15 The second decoding procedure (in this example, step S35 in FIG. 5) for decoding the header information to generate the second decoded data (decoded SBR data), and the header information acquisition procedure If header information cannot be obtained in order S15, the high-frequency component is based on the first decoded data (BaseCodec data after decoding) decoded in the first decoding procedure (BaseCodec decoding procedure by BaseCodecDecode section 11). Data is generated and the first decrypted data (BaseCodec data after decoding) and high The band component data is also output together.
[0041] 本実斜圢態のデゞタル音声デヌタ凊理方法にぉレ、おは、ヘッダ情報取埗手順 S 1  [0041] In the digital audio data processing method of the present embodiment, the header information acquisition procedure S 1
5でヘッダ情報を取埗できなかった堎合に、埩号化された第 1埩号化デヌタデコ䞀 ド埌の BaseCodecデヌタに基づき、高域成分デヌタを生成し、第 1耇号化デヌタ デコヌド埌の BaseCodecデヌタず生成された高域成分デヌタずが䜵せお出力され る。これにより、゚ラヌ発生埌に次にヘッダ情報を取埗できるたで無音状態にしたり第 1耇号化デヌタデコヌド埌の BaseCodecデヌタしか出力しない堎合に比べ、無音 状態を短瞮できるず共に、高域成分を出力するこずにより聎感䞊の違和感を䜎枛する こず力 Sできる。  If header information could not be obtained in step 5, high frequency component data is generated based on the decoded first decoded data (BaseCodec data after decoding), and the first decoded data (after decoding) BaseCodec data) and the generated high-frequency component data are output together. As a result, the silence state can be shortened and the high frequency component can be output compared to the case where the silence state is obtained until the next header information can be acquired after an error occurs or only the first decoded data (decoded BaseCodec data) is output. By doing this, you can reduce the sense of incongruity in hearing.
[0042] 䞊蚘実斜圢態におけるデゞタル音声デヌタ凊理装眮 1においおは、ストリヌムの䞀 郚が消倱したかどうかを刀定する゚ラヌ刀定手段この䟋では BaseCodecDecode 郚 11)を有し、高域成分擬䌌生成手段 13は、゚ラヌ刀定手段 11で非基準フレヌム I Fが消倱したず刀定された堎合に、高域成分デヌタを生成するこずを特城ずする。  [0042] The digital audio data processing apparatus 1 in the above embodiment has error determination means (BaseCodecDecode section 11 in this example) for determining whether or not a part of the stream has been lost. Is characterized by generating high-frequency component data when it is determined by the error determination means 11 that the non-reference frame IF has disappeared.
[0043] 本実斜圢態においおは、゚ラヌ刀定手段 11で非基準フレヌム IFが消倱したず刀定 した堎合に、高域成分擬䌌生成手段 13により高域成分デヌタを生成する。これによ り、非基準フレヌム IFが消倱する゚ラヌが発生した埌に次にヘッダ情報を取埗できる たで無音状態にしたり第 1埩号化デヌタデコヌド埌の BaseCodecデヌタしか出力 しない堎合に比べ、無音状態を短瞮できるず共に、高域成分を出力するこずにより聎 感䞊の違和感を䜎枛するこずができる。  In this embodiment, when the error determination means 11 determines that the non-reference frame IF has disappeared, the high frequency component pseudo generation means 13 generates high frequency component data. As a result, silence occurs compared to when silence occurs until the next header information can be acquired after an error in which the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output. And a sense of discomfort in hearing can be reduced by outputting a high frequency component.
[0044] たた本実斜圢態におけるデゞタル音声デヌタ凊理方法においおは、ストリヌムの䞀 郚が消倱したかどうかを刀定する゚ラヌ刀定手順 (この䟋では図 5のステップ S 10)を 有し、この゚ラヌ刀定手順 S 10で非基準フレヌム IFが消倱したず刀定された堎合に、 高域成分デヌタを生成するこずを特城ずする。  [0044] The digital audio data processing method according to the present embodiment further includes an error determination procedure (in this example, step S10 in Fig. 5) for determining whether or not a part of the stream has been lost. High frequency component data is generated when it is determined in S10 that the non-reference frame IF has disappeared.
[0045] 本実斜圢態においおは、非基準フレヌム IFが消倱したず刀定した堎合に、高域成 分デヌタを生成する。これにより、非基準フレヌム IFが消倱する゚ラヌが発生した埌 に次にヘッダ情報を取埗できるたで無音状態にしたり第 1耇号化デヌタ (デコヌド埌 の BaseCodecデヌタしか出力しない堎合に比べ、無音状態を短瞮できるず共に、 高域成分を出力するこずにより聎感䞊の違和感を䜎枛するこずができる。 [0046] 䞊蚘実斜圢態におけるデゞタル音声デヌタ凊理装眮 1においおは、゚ラヌ刀定手 段 11で非基準フレヌム IFが消倱したず刀定された堎合、圓該消倱した非基準フレヌ ム IFに察応した所定の゚ラヌ甚デヌタこの䟋では muteデヌタを生成する゚ラヌ 甚デヌタ生成手段この䟋でぱラヌ凊理郚 12)を有するこずを特城ずする。 In the present embodiment, high frequency component data is generated when it is determined that the non-reference frame IF has disappeared. As a result, silence occurs compared to when silence occurs until the next header information can be obtained after an error that the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output. Can be shortened, and discomfort in hearing can be reduced by outputting a high frequency component. In the digital audio data processing device 1 in the above embodiment, when it is determined by the error determination unit 11 that the non-reference frame IF has been lost, a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having error data generation means (in this example, error processing unit 12) for generating data (in this example, mute data).
[0047] これにより、゚ラヌ発生時においお察応する所定の゚ラヌ甚デヌタを出力するこず ができる。  [0047] Thereby, it is possible to output predetermined error data corresponding to when an error occurs.
[0048] たた、䞊蚘実斜圢態におけるデゞタル音声デヌタ凊理方法においおは、゚ラヌ刀 定手順 S10で非基準フレヌム IFが消倱したず刀定された堎合、圓該消倱した非基準 フレヌム IFに察応した所定の゚ラヌ甚デヌタこの䟋では muteデヌタを生成する ゚ラヌ甚デヌタ生成手順 (この䟋ではステップ S60)を有するこずを特城ずする。  [0048] Also, in the digital audio data processing method in the above embodiment, when it is determined that the non-reference frame IF has been lost in the error determination procedure S10, a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having an error data generation procedure (in this example, step S60) for generating data (in this example, mute data).
[0049] これにより、゚ラヌ発生時においお察応する所定の゚ラヌ甚デヌタを出力するこず ができる。  [0049] Thereby, it is possible to output predetermined error data corresponding to when an error occurs.
[0050] 図 6〜図 8は、それぞれ、以䞊列挙した本実斜圢態の各効果を、具䜓的に説明する ための説明図である。図䞭、暪軞には時間軞をずり、図 6は、入力されるストリヌムに おける各フレヌムの挙動の䟋を衚し、図 7は、図 6に察応した本実斜圢態のデゞタノレ 音声デヌタ凊理装眮 1からの出力挙動を衚し、図 8は、゚ラヌ発生埌に第 1埩号化デ ヌタデコヌド埌の BaseCodecデヌタしか出力しない比范䟋による出力挙動を衚し おいる。  FIGS. 6 to 8 are explanatory diagrams for specifically explaining the effects of the present embodiment listed above. In the figure, the horizontal axis represents the time axis, FIG. 6 shows an example of the behavior of each frame in the input stream, and FIG. 7 shows the digital audio data processing apparatus 1 of the present embodiment corresponding to FIG. Fig. 8 shows the output behavior of the comparative example in which only the first decoded data (BaseCodec data after decoding) is output after the error occurs.
[0051] 図 7及び図 8においお、 SBRヘッダのある基準フレヌム SFを゚ラヌ発生なく受信 できおいる堎合、及び、その埌の SBRヘッダのない非基準フレヌム IFをその埌゚ラ 䞀発生なく受信できおいる堎合は、本実斜圢態及び比范䟋ずも同様であり、 BaseCo dec郚ず SBR郚ずの䞡方をデコヌドでき、それら第 1埩号化デコヌドデヌタ及び第 2 耇号化デヌタ (「SBRデコヌド」ず衚す)を䜵せお出力するこずができる。  [0051] In FIG. 7 and FIG. 8, when the reference frame SF with the SBR header can be received (without error), and the subsequent non-reference frame IF without the SBR header can be received without error. If this is the case, this is the same as in this embodiment and the comparative example, and both the BaseCo dec part and the SBR part can be decoded, and the first decoded decoded data and the second decoded data (referred to as “SBR decode”). Can be output together.
[0052] たた、あるフレヌムで゚ラヌが発生したフレヌムの少なくずも䞀郚が消倱した堎合 に぀いおも、本実斜圢態ず比范䟋ずで差異はなぐ所定の゚ラヌ甚デヌタこの䟋で は無音状態である mute)を出力するこずができる。  [0052] In addition, even when an error occurs in a certain frame (at least a part of the frame is lost), there is predetermined error data (in this example, in a silent state) that does not differ between this embodiment and the comparative example. A certain mute) can be output.
[0053] 䞀方、䞊蚘゚ラヌが発生したフレヌムの埌次に SBRヘッダを取埗できるフレヌム の前たでのフレヌムに぀いおは、䞊蚘比范䟋ず本実斜圢態ずで異なる。すなわち、 図 8に瀺すように、䞊蚘比范䟋では、゚ラヌが発生したら、その埌新たに SBRヘッダ のヘッダ情報を取埗するたで、 SBR郚のデコヌド凊理を行うこずができず、 BaseCod ec郚のみの䜎域の埩号化デヌタを出力する。 On the other hand, the frame after the frame in which the error has occurred (until the frame before the next SBR header can be acquired) differs between the comparative example and the present embodiment. That is, As shown in Fig. 8, in the above comparative example, if an error occurs, the SBR part cannot be decoded until the header information of the SBR header is newly acquired thereafter, and the low frequency band of only the BaseCodec part cannot be obtained. Output decrypted data.
[0054] 䞀方、図 7に瀺すように、本実斜圢態では、 BaseCodec郚の埩号ィ匕デヌタに基づ き、圓該 BaseCodecデヌタよりも高い再生垯域を有する高域成分デヌタを擬䌌的に 生成し、埩号化された BaseCodecデヌタず擬䌌的に生成した高域成分デヌタずを䜵 せお出力するので、聎感䞊の違和感を䜎枛するこずができる。  On the other hand, as shown in FIG. 7, in the present embodiment, on the basis of the decoded code data of the BaseCodec part, high frequency component data having a reproduction band higher than the BaseCodec data is generated in a pseudo manner, Since the decoded BaseCodec data and the pseudo high frequency component data are output together, it is possible to reduce the sense of discomfort in the sense of hearing.
[0055] なお、以䞊は、地䞊波デゞタル攟送 (䟋えばいわゆる lSegmentを利甚した攟送)に 察応した携垯電話機に適甚した堎合を䟋にずっお説明したが、これに限られず、他の 携垯端末、モパむル機噚等に適甚が可胜である。特に、゚ラヌが倚発する車茉端末 に適甚した堎合に効果的である。  [0055] Note that the above has been described by taking as an example a case where the present invention is applied to a mobile phone that supports terrestrial digital broadcasting (for example, broadcasting using so-called lSegment), but is not limited thereto, and other mobile terminals, mopile devices, etc. It can be applied to. This is particularly effective when applied to in-vehicle terminals where errors frequently occur.
[0056] 䞊蚘実斜圢態におけるデゞタル音声デヌタ凊理装眮 1は、フレヌム化された耇数 のフレヌム列からなる笊号ィ匕デヌタであっお、耇数のフレヌム列は、基準フレヌム SF 及び非基準フレヌム IFからなる耇数のフレヌムを含み、非基準フレヌム IFが、音声 情報を笊号化した BaseCodecず、この BaseCodecの再生垯域を拡倧するための垯 域拡倧情報を笊号ィ匕した SBRずを含み、基準フレヌム SFが、音声情報を笊号化した BaseCodecず、この BaseCodecの再生垯域を拡倧するための垯域拡倧情報を笊号 化しお構成され、 SBRの挔算凊理を行うためのヘッダ情報を備えた SBRヘッダを有 する SBRずを含む、ストリヌムを凊理するデゞタル音声デヌタ凊理装眮 1であっお、 基準フレヌム SFの SBRに備えられた凊理ヘッダのヘッダ情報を取埗する BaseCod ecDecode郚 11ず、基準フレヌム SF又は非基準フレヌム IFの BaseCodecを埩号化 し、デコヌド埌の BaseCodecデヌタを生成する BaseCodecDecode郚 11ず、基準フ レヌム SF又は非基準フレヌム IFの SBRを、 BaseCodecDecode郚 11で取埗したぞ ッダ情報を甚いお埩号ィ匕し、デコヌド埌の SBRデヌタを生成する SBRDecode郚 14 ず、 BaseCodecDecode郚 11でヘッダ情報を取埗できなかった堎合に、 BaseCode cDecode郚 11で埩号化された BaseCodecに基づき、圓該 BaseCodecデヌタよりも 高い再生垯域を有する高域成分デヌタを擬䌌的に生成する高域成分擬䌌生成凊 理郹 13ず、 BaseCodecず高域成分デヌタを䜵せお出力するためのスィッチ 15ずを有 する。 [0056] The digital audio data processing device 1 according to the above embodiment is code data including a plurality of framed frames, and the plurality of frame sequences are a plurality of frames including a reference frame SF and a non-reference frame IF. The non-reference frame IF includes the BaseCodec that encodes the audio information and the SBR that encodes the band expansion information for expanding the playback band of this BaseCodec, and the reference frame SF Includes BaseCodec that encodes information, and SBR that has SBR header that is configured by encoding band expansion information for expanding the playback band of BaseCodec, and includes header information for performing SBR calculation processing A digital audio data processing device 1 for processing a stream, and a BaseCodeDecode unit 11 for obtaining header information of a processing header provided in an SBR of a reference frame SF; BaseCodecDecode unit 11 that decodes BaseCodec of quasi-frame SF or non-reference frame IF and generates decoded BaseCodec data, and header obtained by BaseCodecDecode unit 11 for SBR of base frame SF or non-reference frame IF When the header information cannot be obtained by the SBRDecode unit 14 that generates the SBR data after decoding using the information, and the BaseCodecDecode unit 11, the BaseCodecDecode unit 11 A high-frequency component pseudo-generation processing unit 13 that artificially generates high-frequency component data having a reproduction band higher than that of BaseCodec data, and a switch 15 for outputting BaseCodec and high-frequency component data together are provided. To do.
[0057] 本実斜圢態においおは、ストリヌムに備えられる耇数のフレヌムのそれぞれ力 Ba seCodec及び SBRヘッダを有する SBRを含む基準フレヌム SFず、 BaseCodec及び SBRを含む非基準フレヌム IFずから構成されおいる。そのうち、 BaseCodecが Base CodecDecode郚 11で埩号化されお BaseCodecデヌタが生成される䞀方、 SBRに ぀いおは、 BaseCodecDecode郚 11で取埗したヘッダ情報を甚いお SBRDecode 郚 14で埩号化され、 SBRデヌタが生成される。  [0057] In the present embodiment, the frame includes a reference frame SF including an SBR having a base codec and an SBR header, and a non-reference frame IF including a base codec and an SBR, respectively. Among them, BaseCodec is decoded by Base CodecDecode unit 11 to generate BaseCodec data, while SBR is decoded by SBRDecode unit 14 using the header information acquired by BaseCodecDecode unit 11 to generate SBR data .
[0058] このずき、䟋えばストリヌムの䞀郚が消倱した=゚ラヌが発生した等により、 Base CodecDecode郚 11でヘッダ情報を取埗できなかったずきには、 BaseCodecDecod e郚 11で埩号化された BaseCodecデヌタに基づき、高域成分擬䌌生成凊理郚 13 により高域成分デヌタを擬䌌的に生成する。そしお、スィッチ 15により、 BaseCodec デヌタが䞊蚘生成された高域成分デヌタず䜵せお出力される。これにより、゚ラヌ発 生埌に次にヘッダ情報を取埗できるたで無音状態にしたり BaseCodecしか出力しな い堎合に比べ、無音状態を短瞮できるず共に、高域成分を出力するこずにより聎感䞊 の違和感を䜎枛するこずができる。  [0058] At this time, for example, when header information cannot be acquired by the Base CodecDecode unit 11 due to loss of part of the stream (= an error has occurred), the BaseCodecDecode unit 11 decodes the BaseCodec data Based on this, the high frequency component pseudo generation processing unit 13 generates high frequency component data in a pseudo manner. Then, the switch 15 outputs BaseCodec data together with the generated high frequency component data. This makes it possible to reduce the silence and reduce the sense of incongruity by outputting high-frequency components, compared to the case where silence is maintained until the next header information can be acquired after an error occurs, and only BaseCodec is output. can do.
[0059] たた、本実斜圢態のデゞタル音声デヌタ凊理装眮 1を甚いたデゞタル音声デヌタ 凊理方法にぉレ、おは、フレヌム化された耇数のフレヌム列からなる笊号化デヌタで あっお、耇数のフレヌム列は、基準フレヌム SF及び非基準フレヌム IFからなる耇数 のフレヌムを含み、非基準フレヌム IF力 音声情報を笊号化した BaseCodecず、こ の BaseCodecの再生垯域を拡倧するための垯域拡倧情報を笊号化した SBRずを含 み、基準フレヌム SF力 音声情報を笊号化した BaseCodecず、この BaseCodecの 再生垯域を拡倧するための垯域拡倧情報を笊号ィヒしお構成され、 SBRの挔算凊理 を行うためのヘッダ情報を備えた SBRヘッダを有する SBRずを含む、ストリヌムを凊 理するデゞタル音声デヌタ凊理方法であっお、基準フレヌム SFの SBRに備えられ た SBRヘッダのヘッダ情報を取埗するステップ S 15ず、基準フレヌム SF又は非基準 フレヌム IFの BaseCodecを埩号ィ匕し、 BaseCodecデヌタを生成する BaseCodecD ecode郚 11による BaseCodecデコヌド手順ず、基準フレヌム SF又は非基準フレヌ ム IFの SBRを、ステップ S 15で取埗したヘッダ情報を甚いお埩号化し、 SBRデヌタ を生成するステップ S35ず、ステップ S 15でヘッダ情報を取埗できなかった堎合に、 B aseCodecDecode咅 11による BaseCodecデコヌド手 I噎で埩号ィ匕した BaseCodec デヌタに基づき、圓該 BaseCodecデヌタよりも高い再生垯域を有する高域成分デ ヌタを擬䌌的に生成し、 BaseCodecデヌタず高域成分デヌタを䜵せお出力する。 [0059] In addition, the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data including a plurality of framed frames, and includes a plurality of frames. The column includes a plurality of frames composed of a reference frame SF and a non-reference frame IF. The non-reference frame IF force encodes base codec that encodes audio information and band expansion information for expanding the playback band of this base codec. This is composed of BaseCodec, which encodes the reference frame SF power audio information, and the band expansion information for expanding the playback band of this BaseCodec. A digital audio data processing method for processing a stream including an SBR having an SBR header with header information, the header of the SBR header provided in the SBR of the reference frame SF. Step S15 for obtaining information, and decoding the BaseCodec of the reference frame SF or non-reference frame IF, and generating the BaseCodec data The BaseCodec decoding procedure by the BaseCodecDecode unit 11 and the reference frame SF or non-reference frame IF SBR is decrypted using the header information obtained in step S15, and SBR data If the header information could not be acquired in step S35 and step S15, a higher playback bandwidth than that of the BaseCodec data is obtained based on the BaseCodec data decoded by BaseCodec decoding by the BaseCodecDecode 咅 11. The pseudo high frequency component data is generated and BaseCodec data and high frequency component data are output together.
[0060] 本実斜圢態のデゞタル音声デヌタ凊理方法においおは、ステップ S 15でヘッダ情 報を取埗できなかった堎合に、埩号化された BaseCodecデヌタに基づき、高域成分 デヌタを生成し、 BaseCodecデヌタず生成された高域成分デヌタずが䜵せお出力さ れる。これにより、゚ラヌ発生埌に次にヘッダ情報を取埗できるたで無音状態にしたり BaseCodecデヌタしか出力しない堎合に比べ、無音状態を短瞮できるず共に、高域 成分を出力するこずにより聎感䞊の違和感を䜎枛するこずができる。 [0060] In the digital audio data processing method of the present embodiment, when header information cannot be acquired in step S15, high frequency component data is generated based on the decoded BaseCodec data, and BaseCodec data and The generated high frequency component data is also output. This makes it possible to reduce the silence and reduce the sense of incongruity by outputting high-frequency components, compared to the case in which silence is maintained until the next header information can be acquired after an error occurs, and only BaseCodec data is output. Can do.
図面の簡単な説明  Brief Description of Drawings
[0061] [図 1]本発明の䞀実斜圢態の携垯電話機の党䜓倖芳を衚す斜芖図である。  FIG. 1 is a perspective view showing the overall appearance of a mobile phone according to an embodiment of the present invention.
[図 2]図 1に瀺した携垯電話機の機胜的構成を衚す機胜ブロック図である。  2 is a functional block diagram showing a functional configuration of the mobile phone shown in FIG.
[図 3]ストリヌムのうち音声に係わる郚分を抜き出しお抂念的に衚す暡匏図である。  FIG. 3 is a schematic diagram conceptually showing a portion related to audio extracted from a stream.
[図 4]図 2に瀺した信号凊理郚のうち音声信号の再生凊理に係わる構成を衚す機胜 ブロック図である。  4 is a functional block diagram showing a configuration related to audio signal reproduction processing in the signal processing section shown in FIG.
[図 5]BaseCodecDecode郚が各フレヌムごずに実行する凊理手順を衚すフロヌチ ダヌトである。  FIG. 5 is a flowchart showing the processing procedure executed by the BaseCodecDecode part for each frame.
[図 6]本発明の䞀実斜圢態の各効果を、具䜓的に説明するための説明図である。  FIG. 6 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
[図 7]本発明の䞀実斜圢態の各効果を、具䜓的に説明するための説明図である。  FIG. 7 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
[図 8]本発明の䞀実斜圢態の各効果を、具䜓的に説明するための説明図である。 笊号の説明  FIG. 8 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention. Explanation of symbols
[0062] 1 携垯電話機 (デゞタル音声デヌタ凊理装眮  [0062] 1 Mobile phone (digital audio data processing device)
11 BaseCodecDecode郚゚ラヌ刀定手段、ヘッダ情報取埗手段、第 1 埩号化手段、第 2埩号化手段、解析手段、照合手段  11 BaseCodecDecode part (error determination means, header information acquisition means, first decoding means, second decoding means, analysis means, verification means)
12 ゚ラヌ凊理郚゚ラヌ甚デヌタ生成手段  12 Error processing section (error data generation means)
13 高域成分擬䌌生成凊理郚高域成分擬䌌生成手段  13 High-frequency component pseudo-generation processing unit (high-frequency component pseudo-generation means)
15 スィッチ出力制埡手段 非基準フレヌム 基準フレヌム 15 Switch (output control means) Non-reference frame Reference frame

Claims

請求の範囲 The scope of the claims
[1] フレヌム化された耇数のフレヌム列からなる笊号ィ匕デヌタであっお、  [1] Code data consisting of a plurality of framed frames,
前蚘耇数のフレヌム列は、基準フレヌム及び非基準フレヌムを含み、  The plurality of frame sequences includes a reference frame and a non-reference frame;
前蚘非基準フレヌムが、音声情報を笊号化した非基準第 1デヌタず、この非基準第 1デヌタの再生垯域を拡倧するための垯域拡倧情報を笊号化した非基準第 2デヌタ ずを含み、  The non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data;
前蚘基準フレヌムが、音声情報を笊号化した基準第 1デヌタず、この基準第 1デヌ タの再生垯域を拡倧するための垯域拡倧情報を笊号化しお構成され、前蚘非基準 第 2デヌタの挔算凊理を行うためのヘッダ情報を備えた凊理ヘッダを有する基準第 2 デヌタずを含む、ストリヌムを凊理するデゞタル音声デヌタ凊理装眮であっお、 前蚘基準フレヌムの前蚘基準第 2デヌタに備えられた前蚘凊理ヘッダの前蚘ぞッ ダ情報を取埗するヘッダ情報取埗手段ず、  The reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding a reproduction band of the reference first data, and calculating the non-reference second data. Digital audio data processing apparatus for processing a stream, including reference second data having a processing header having header information for performing the processing, wherein the processing header provided in the reference second data of the reference frame Header information acquisition means for acquiring the header information of
前蚘基準フレヌム又は前蚘非基準フレヌムの前蚘基準第 1デヌタ又は前蚘非基 準第 1デヌタを埩号化し、第 1埩号化デヌタを生成する第 1埩号化手段ず、  First decoding means for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame and generating first decoded data;
前蚘基準フレヌム又は前蚘非基準フレヌムの前蚘基準第 2デヌタ又は前蚘非基 準第 2デヌタを、前蚘ヘッダ情報取埗手段で取埗した前蚘ヘッダ情報を甚いお埩号 化し、第 2耇号化デヌタを生成する第 2埩号化手段ず、  The reference second data or the non-reference second data of the reference frame or the non-reference frame is decoded using the header information acquired by the header information acquisition means, and second decoded data is generated. A second decryption means;
前蚘ヘッダ情報取埗手段で前蚘ヘッダ情報を取埗できなかった堎合に、前蚘第 1 耇号化手段で埩号化された第 1耇号化デヌタに基づき、圓該第 1埩号化デヌタより も高い再生垯域を有する高域成分デヌタを擬䌌的に生成する高域成分擬䌌生成手 段ず、  When the header information cannot be obtained by the header information obtaining means, a higher reproduction band than the first decoded data is obtained based on the first decoded data decoded by the first decoding means. High-frequency component pseudo-generation means for generating pseudo-high-frequency component data,
前蚘第 1耇号化デヌタず前蚘高域成分デヌタを䜵せお出力するための出力制埡手 段ず  An output control means for outputting the first decoded data and the high-frequency component data together;
を有するこずを特城ずするデゞタル音声デヌタ凊理装眮。  A digital audio data processing apparatus comprising:
[2] 請求項 1蚘茉のデゞタル音声デヌタ凊理装眮においお、 [2] In the digital audio data processing device according to claim 1,
前蚘ストリヌムの䞀郚が消倱したかどうかを刀定する゚ラヌ刀定手段を有し、 前蚘高域成分擬䌌生成手段は、前蚘゚ラヌ刀定手段で前蚘非基準フレヌムが消 倱したず刀定された堎合に、前蚘高域成分デヌタを生成するこずを特城ずするデゞタ ル音声デヌタ凊理装眮。 An error determination unit that determines whether a part of the stream has been lost, and the high-frequency component pseudo-generation unit, when the error determination unit determines that the non-reference frame has been lost, A digital device characterized by generating high-frequency component data Le voice data processing device.
[3] 請求項 2蚘茉のデゞタル音声デヌタ凊理装眮においお、  [3] The digital audio data processing device according to claim 2,
前蚘゚ラヌ刀定手段で前蚘非基準フレヌムが消倱したず刀定された堎合、圓該消 倱した非基準フレヌムに察応した所定の゚ラヌ甚デヌタを生成する゚ラヌ甚デヌタ 生成手段  When the error determination means determines that the non-reference frame has been lost, error data generation means for generating predetermined error data corresponding to the lost non-reference frame
を有するこずを特城ずするデゞタル音声デヌタ凊理装眮。  A digital audio data processing apparatus comprising:
[4] フレヌム化された耇数のフレヌム列からなる笊号化デヌタであっお、 [4] Coded data composed of a plurality of framed frames,
前蚘耇数のフレヌム列は、基準フレヌム及び非基準フレヌムからなる耇数のフレヌ ムを含み、  The plurality of frame sequences includes a plurality of frames including a reference frame and a non-reference frame,
前蚘非基準フレヌムが、音声情報を笊号化した非基準第 1デヌタず、この非基準第 1デヌタの再生垯域を拡倧するための垯域拡倧情報を笊号化した非基準第 2デヌタ ずを含み、  The non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data;
前蚘基準フレヌムが、音声情報を笊号化した基準第 1デヌタず、この基準第 1デヌ タの再生垯域を拡倧するための垯域拡倧情報を笊号化しお構成され、前蚘非基準 第 2デヌタの挔算凊理を行うためのヘッダ情報を備えた凊理ヘッダを有する基準第 2 デヌタずを含む、ストリヌムを凊理するデゞタル音声デヌタ凊理方法であっお、 前蚘基準フレヌムの前蚘基準第 2デヌタに備えられた前蚘凊理ヘッダの前蚘ぞッ ダ情報を取埗するヘッダ情報取埗手順ず、  The reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding a reproduction band of the reference first data, and calculating the non-reference second data. A digital audio data processing method for processing a stream, including reference second data having a processing header with header information for performing the processing, wherein the processing header provided in the reference second data of the reference frame Header information acquisition procedure for acquiring the header information of
前蚘基準フレヌム又は前蚘非基準フレヌムの前蚘基準第 1デヌタ又は前蚘非基 準第 1デヌタを埩号化し、第 1埩号化デヌタを生成する第 1埩号化手順ず、  A first decoding procedure for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame to generate first decoded data;
前蚘基準フレヌム又は前蚘非基準フレヌムの前蚘基準第 2デヌタ又は前蚘非基 準第 2デヌタを、前蚘ヘッダ情報取埗手順で取埗した前蚘ヘッダ情報を甚いお埩号 化し、第 2耇号化デヌタを生成する第 2埩号ィヒ手順ずを有し、  The reference second data or the non-reference second data of the reference frame or the non-reference frame is decoded using the header information acquired in the header information acquisition procedure to generate second decoded data. A second decryption procedure,
前蚘ヘッダ情報取埗手順で前蚘ヘッダ情報を取埗できなかった堎合に、前蚘第 1 耇号化手順で埩号化した第 1耇号化デヌタに基づき、圓該第 1埩号化デヌタよりも 高い再生垯域を有する高域成分デヌタを擬䌌的に生成し、前蚘第 1埩号化デヌタず 前蚘高域成分デヌタを䜵せお出力するこずを特城ずするデゞタル音声デヌタ凊理方 法。 If the header information cannot be acquired by the header information acquisition procedure, the reproduction information has a higher reproduction band than the first decoded data based on the first decoded data decoded by the first decoding procedure. A digital audio data processing method, characterized in that high-frequency component data is generated in a pseudo manner, and the first decoded data and the high-frequency component data are output together.
[5] 請求項 4蚘茉のデゞタル音声デヌタ凊理方法にぉレ、お、 [5] The digital audio data processing method according to claim 4,
前蚘ストリヌムの䞀郚が消倱したかどうかを刀定する゚ラヌ刀定手順を有し、 この゚ラヌ刀定手順で前蚘非基準フレヌムが消倱したず刀定された堎合に、前蚘 高域成分デヌタを生成するこずを特城ずするデゞタル音声デヌタ凊理方法。  An error determination procedure for determining whether or not a part of the stream has been lost, and the high-frequency component data is generated when the error determination procedure determines that the non-reference frame has been lost. Digital audio data processing method.
[6] 請求項 5蚘茉のデゞタル音声デヌタ凊理方法にぉレ、お、 [6] The digital audio data processing method according to claim 5,
前蚘゚ラヌ刀定手順で前蚘非基準フレヌムが消倱したず刀定された堎合、圓該消 倱した非基準フレヌムに察応した所定の゚ラヌ甚デヌタを生成する゚ラヌ甚デヌタ 生成手順  An error data generation procedure for generating predetermined error data corresponding to the lost non-reference frame when the error determination procedure determines that the non-reference frame has been lost.
を有するこずを特城ずするデゞタル音声デヌタ凊理方法。  A digital audio data processing method comprising:
PCT/JP2007/059585 2006-05-25 2007-05-09 Digital audio data processing device and processing method WO2007138825A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2008517814A JP4551472B2 (en) 2006-05-25 2007-05-09 Digital audio data processing apparatus and processing method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006145553 2006-05-25
JP2006-145553 2006-05-25

Publications (1)

Publication Number Publication Date
WO2007138825A1 true WO2007138825A1 (en) 2007-12-06

Family

ID=38778345

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2007/059585 WO2007138825A1 (en) 2006-05-25 2007-05-09 Digital audio data processing device and processing method

Country Status (2)

Country Link
JP (1) JP4551472B2 (en)
WO (1) WO2007138825A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003140692A (en) * 2001-11-02 2003-05-16 Matsushita Electric Ind Co Ltd Coding device and decoding device
JP2005024756A (en) * 2003-06-30 2005-01-27 Toshiba Corp Decoding process circuit and mobile terminal device
JP2005222014A (en) * 2004-01-08 2005-08-18 Matsushita Electric Ind Co Ltd Device and method for signal decoding
WO2005106848A1 (en) * 2004-04-30 2005-11-10 Matsushita Electric Industrial Co., Ltd. Scalable decoder and expanded layer disappearance hiding method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4445328B2 (en) * 2004-05-24 2010-04-07 パナ゜ニック株匏䌚瀟 Voice / musical sound decoding apparatus and voice / musical sound decoding method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003140692A (en) * 2001-11-02 2003-05-16 Matsushita Electric Ind Co Ltd Coding device and decoding device
JP2005024756A (en) * 2003-06-30 2005-01-27 Toshiba Corp Decoding process circuit and mobile terminal device
JP2005222014A (en) * 2004-01-08 2005-08-18 Matsushita Electric Ind Co Ltd Device and method for signal decoding
WO2005106848A1 (en) * 2004-04-30 2005-11-10 Matsushita Electric Industrial Co., Ltd. Scalable decoder and expanded layer disappearance hiding method

Also Published As

Publication number Publication date
JPWO2007138825A1 (en) 2009-10-01
JP4551472B2 (en) 2010-09-29

Similar Documents

Publication Publication Date Title
JPWO2008117524A1 (en) Digital broadcast transmission apparatus, digital broadcast reception apparatus, and digital broadcast transmission / reception system
EP1879310A1 (en) Method for decoding a digital radio stream
KR100636145B1 (en) Exednded high resolution audio signal encoder and decoder thereof
JP2010114777A (en) Broadcast receiving circuit and broadcast receiver
MX2010010368A (en) Systems, methods and apparatus for transmitting data over a voice channel of a wireless telephone network.
JP2005024756A (en) Decoding process circuit and mobile terminal device
US7509255B2 (en) Apparatuses for adaptively controlling processing of speech signal and adaptively communicating speech in accordance with conditions of transmitting apparatus side and radio wave and methods thereof
JP4551472B2 (en) Digital audio data processing apparatus and processing method
JP5047519B2 (en) Digital audio data processing apparatus and processing method
JP4705556B2 (en) Wireless microphone transmitter, receiver and system
JP2009244912A (en) Speech signal processing device and speech signal processing method
US6198499B1 (en) Radio-communication video terminal device
KR20060131164A (en) Method for playing broadcasting stream stored in digital broadcasting receiver
JP4065383B2 (en) Audio signal transmitting apparatus, audio signal receiving apparatus, and audio signal transmission system
JP2004129121A (en) Transmitting apparatus and receiving apparatus of digital broadcast content, broadcast system and broadcast method
JPWO2005117275A1 (en) Receiver
JP4683908B2 (en) Video / audio output device
JP6181897B1 (en) Broadcast signal receiving apparatus, broadcast signal receiving method, television receiver, control program, and recording medium
JP4385710B2 (en) Audio signal processing apparatus and audio signal processing method
JP6181898B1 (en) Broadcast signal transmission / reception system and broadcast signal transmission / reception method
JP6374053B2 (en) Broadcast signal receiving apparatus, broadcast signal receiving method, television receiver, control program, and recording medium
JP5957156B2 (en) Broadcast signal transmission / reception system and broadcast signal transmission / reception method
JP2008016882A (en) Display apparatus, display control method, and display control program
JP6341809B2 (en) Broadcast signal transmission / reception system and broadcast signal transmission / reception method
JP2007108219A (en) Speech decoder

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07743020

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2008517814

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07743020

Country of ref document: EP

Kind code of ref document: A1