WO2007138825A1 - Digital audio data processing device and processing method - Google Patents
Digital audio data processing device and processing method Download PDFInfo
- Publication number
- WO2007138825A1 WO2007138825A1 PCT/JP2007/059585 JP2007059585W WO2007138825A1 WO 2007138825 A1 WO2007138825 A1 WO 2007138825A1 JP 2007059585 W JP2007059585 W JP 2007059585W WO 2007138825 A1 WO2007138825 A1 WO 2007138825A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- reference frame
- decoded
- header information
- header
- Prior art date
Links
- 238000003672 processing method Methods 0.000 title claims description 18
- 238000000034 method Methods 0.000 claims description 50
- 238000010586 diagram Methods 0.000 description 10
- 230000006870 function Effects 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 7
- 238000004364 calculation method Methods 0.000 description 7
- 230000005236 sound signal Effects 0.000 description 7
- 230000000052 comparative effect Effects 0.000 description 5
- 230000000694 effects Effects 0.000 description 4
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
Definitions
- the present invention relates to a digital audio data processing apparatus and processing method for digitally processing encoded data.
- Digital broadcasting for digitizing and multiplex-transmitting image data (video data) and audio data (audio data) using, for example, a digital or Internet broadcast receiver has already been started.
- a predetermined compression encoding method for example, MPEG; Moving Picture Experts Group method, etc.
- the data power of a plurality of programs for example, MPEG transport stream.
- Stream; MPEG-TS, etc. The multiplexed data transmitted as a stream is selectively extracted by transmitting it to the receiver side that has received the data.
- AAC-plus An encoding method in which SBR is added in addition to the conventional AAC is called AAC-plus, and one frame of data includes AAC encoded data (BaseCodec) and SBR encoded data. It consists of. Note that even conventional decryption means compatible only with AAC can decrypt only AAC data by skipping SBR data.
- Patent Document 1 Japanese Patent Laid-Open No. 2006-50387
- auxiliary information for predicting a high frequency component from a low frequency component is subjected to band expansion processing during reproduction to generate a high frequency component. It is.
- a frame (reference frame) having an SBR header is generated every predetermined frame data unit (for example, several frames, several tens of frames) (for example, irregularly) based on the data transmission amount restriction described above. It has been entered.
- frames that do not have an SBR header other than this frame are included, and all frames create a calculation table from the information (header information) stored in the SBR header. Based on the table, the band expansion process is performed to generate a high frequency component.
- the problems to be solved by the present invention include the above-described problems as an example.
- the invention according to claim 1 is encoded data including a plurality of framed frame sequences, and the plurality of frame sequences include a reference frame and a non-reference.
- the non-reference frame includes non-reference first data obtained by encoding speech information and non-reference first data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data.
- Second reference data and the reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding the reproduction band of the reference first data
- a digital audio data processing device for processing a stream, including reference second data having a processing header including header information for performing arithmetic processing of the non-reference second data, wherein the reference frame
- a header information acquisition unit for acquiring the header information of the processing header provided in the reference second data, and decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame
- the first decoding means for generating first decoded data, and the reference second data or the non-reference second data of the reference frame or the non-reference frame is obtained by the header information obtaining means.
- the first decoding is performed.
- a high-frequency component pseudo-generating means for pseudo-generating high-frequency component data having a reproduction band higher than that of the first decoded data based on the first decoded data decoded by the encoding means; 1 Decryption Hide And output control means for outputting the high-frequency component data together.
- the invention according to claim 4 is code data including a plurality of framed frames, and the plurality of frame sequences includes a plurality of frames including a reference frame and a non-reference frame.
- the non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data.
- the reference frame is configured by encoding reference first data in which audio information is encoded and band expansion information for expanding a reproduction band of the reference first data.
- Digital processing stream including reference second data having a processing header with header information for performing arithmetic processing An audio data processing method, an error determination procedure for determining whether or not a part of the stream has been lost, and a header for acquiring the header information of the processing header provided in the reference second data of the reference frame An information acquisition procedure; first decoding means for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame to generate first decoded data; and the reference frame Alternatively, the second reference data or the second non-reference data of the non-reference frame is decoded using the header information acquired in the header information acquisition procedure to generate second decoded data.
- High-frequency component data with a higher regeneration zone than 1 decodes data generated in a pseudo manner, and outputs before Symbol the high frequency component data together with the first decryption data.
- This embodiment is an embodiment when applied to a mobile phone as an example of a digital audio data processing apparatus.
- FIG. 1 is a perspective view showing the overall appearance of the mobile phone according to the present embodiment.
- this mobile phone 1 as a digital audio data processing apparatus acquires and outputs content data including audio data, video data, data for data broadcasting, etc. distributed as TS (Transport Stream). It is compatible with terrestrial digital broadcasting using the TS system.
- TS Transport Stream
- the cellular phone 1 is provided with a main body casing 2, an operation unit 3 provided at a lower portion of the main body casing 2, and provided with a telephone number input key, various function buttons, and the like, and a base end portion thereof.
- Open / close cover 4 pivotally supported at the lower end of main casing 2 and attached to main casing 2 so as to be openable / closable, display 5 for various displays, and antenna 6 for data transmission / reception via wireless communication
- FIG. 2 is a functional block diagram showing a functional configuration of the mobile phone 1.
- Figure 2 Odor After the radio wave transmitted from the broadcasting station or other mobile phone is received by the antennas 6 and 10, the demodulated received signal is demodulated by the transmission / reception unit 100 connected to the antennas 6 and 10.
- the signal processing unit 101 performs predetermined signal processing (details will be described later) for reproduction.
- the signal processed by the signal processing unit 101 is reproduced as sound by the speaker 7.
- the transmission / reception unit 100 includes a TS reception unit (not shown), and the broadcasting antenna 10 (which may also be used as the antenna 6) is connected to the TS reception unit.
- the TS receiving unit acquires a TS corresponding to the content selected by the user from, for example, a plurality of TSs transmitted as digital signals from the broadcasting antenna 10. Then, the acquired TS is output to the signal processing unit 101 as a TS signal.
- the conversation of the speaker is input to the microphone 9 and converted into an audio signal.
- the audio signal is subjected to signal processing for transmission in the signal processing unit 101.
- the transmission / reception unit 100 modulates the audio signal from the signal processing unit 101 and supplies the modulated signal to the antenna 6.
- the antenna 6 transmits the audio signal. As a radio wave.
- control unit 102 including a CPU and the like.
- FIG. 3 is a schematic diagram conceptually showing an extracted portion related to audio in the TS stream received by the antenna 10 provided in the mobile phone 1 of the present embodiment.
- FIG. 3 shows a state in which an audio stream extracted from the multiplexed stream received by the antenna 10 (ie, Elementary Stream) includes a plurality of frames along the time axis. Yes. These multiple frames consist of a reference frame SF and other non-reference frame IF.
- Non-reference frame IF is a BaseCodec (non-reference number
- the reference frame SF is configured by encoding BaseCodec (reference first data) obtained by encoding audio information according to the AAC standard and band expansion information for expanding the reproduction band of the BaseCodec, as described above.
- BaseCodec reference first data
- SBR data standard second data
- SBR header processing header
- a table for the above arithmetic processing is created, and band expansion processing can be performed based on this table to generate high frequency components.
- FIG. 4 is a functional block diagram showing a configuration related to the reproduction processing of the audio signal in the signal processing unit 101.
- the signal processing unit 101 includes a BaseCodecDecode unit 11, an error processing unit 12, a high frequency component pseudo generation processing unit 13, an SBRDecode unit 14, and a BaseCodecDecode unit 1
- Switch 15 (output control means) that is switched by a switching control signal from 1.
- the BaseCodecDecode unit 11 has a plurality of functions. First, as the first function, the audio signal stream (Elementary Stream) described above with reference to FIG. 3 is input, and the BaseCodec part included in the reference frame SF and the non-reference frame IF is decoded (decoded) ( First decoding means).
- the BaseCodecDecode unit 11 determines whether each frame has an SBR-header (in other words, a reference frame SF or a non-reference frame IF force). At the same time, if the frame is the reference frame SF, the header information of the SBR header provided in the SBR data is acquired (header information acquisition means).
- SBR-header in other words, a reference frame SF or a non-reference frame IF force.
- the determination method may be determined to be an error when a failure occurs in the ES decoding process in the BaseCodecDecode unit 11, or when an ES is input, error information about the ES is separately received from the outside. You may make it judge
- the switch 15 is based on the switching control signal from the BaseCodecDecode unit 11, based on the error processing unit 12, the high frequency component pseudo generation processing unit 13, and the S BRDecode unit 14. The output from either of these is selected and output to the speaker 7 side.
- Figure 5 shows the processing executed by the BaseCodecDecode unit 11 for each frame. It is a flowchart showing a procedure.
- step S60 the error processing unit 12 is instructed to output predetermined error data, and a switching control signal for switching the switch 15 to the error processing unit 12 side is output.
- the error processing unit 12 outputs, as error data, a mute signal for making a silent state (or data before and after the data force may be interpolated by an appropriate method) (error data generation means, Error data generation procedure), output from switch 15 to speaker 7 side.
- step S10 determines whether an error has not occurred, the determination is not satisfied, and the routine goes to step S15.
- step S15 the header information of the SBR header is acquired from the frame by the function as the header information acquisition means described above (header information acquisition procedure).
- the SBRDecode unit 14 receives the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 as follows: In addition, decryption processing is performed using a table based on the syntax information of the SBR section described above. The decoding data generated by the SBR section is collected and output to the switch 15 (second decoding procedure), and output from the switch 15 to the speaker 7 side.
- step S40 the decoded data of the BaseCodec part decoded by the function as the first decoding means described above is output to the high-frequency component pseudo-generation processing unit 13 and the switch 15 is pseudo-generated. A switching control signal for switching to the processing unit 13 is output.
- the high-frequency component pseudo-generation processing unit 13 up-samples the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 twice and uses a known method (for example, described in Japanese Patent No. 3140273).
- the high-frequency component data is generated based on the decoded data of the BaseCodec part, which is a low frequency by the above method), and the decrypted data including the generated high-frequency component data is output to the switch 15, and the switch 15 Output to the side.
- step S35, step S40, and step S60 are completed, the process returns to step S10 and the same procedure is repeated.
- the header information of the SBR header cannot be acquired (in other words, when a calculation table is not created)
- a known high-frequency component simulation generation method is used.
- the high frequency component data is generated in a pseudo manner based on the decoded data of the BaseCodec part decoded in this way, and the generated high frequency component data is added to the decoded data of the BaseCodec part for output.
- the digital audio data processing device 1 is encoded data composed of a plurality of framed frame sequences, and the plurality of frame sequences include the reference frame SF and the non-frame data.
- the reference frame IF includes multiple frames, and the non-reference frame IF expands the playback band of the non-reference first data (BaseCodec in this example) that encodes audio information and the non-reference first data BaseCodec.
- Non-reference second data (in this example, SBR) that encodes the bandwidth expansion information for encoding
- the reference frame SF is the reference first data (in this example, BaseCodec) that encodes the voice information
- a processing header (this header, which includes header information for performing arithmetic processing of the non-standard second data SBR, is configured by encoding the band expansion information for expanding the reproduction band of the reference first data BaseCodec.
- a digital audio data processing device 1 for processing a stream including reference second data (in this example, SBR) having SBR headers, and is a processing header provided in reference second data SBR of reference frame SF Header information acquisition means (BaseCodecDecode section 11 in this example) for acquiring the header information of the first frame and the first reference data BaseCodec or the first non-reference first data BaseCodec of the reference frame SF or the non-reference frame IF.
- SBR reference second data
- BaseCodecDecode section 11 for acquiring the header information of the first frame and the first reference data BaseCodec or the first non-reference first data BaseCodec of the reference frame SF or the non-reference frame IF.
- the first decoding means (BaseCodecDecode unit 11 in this example) that generates data (BaseCodec data after decoding) in this example, and the reference second data SBR or non-reference data of the reference frame SF or non-reference frame IF 2
- the second decoding means for decoding the data SBR using the header information acquired by the header information acquisition means 11 and generating the second decrypted data (in this example, the decoded SBR data)
- High-frequency component pseudo-generation means (in this example, a high-frequency component) that artificially generates high-frequency component data having a higher reproduction and reproduction band than the first decoded data (decoded BaseCodec data).
- a pseudo-generation processing unit 13) and output control means (switch 15 in this example) for outputting the first decoded key data (BaseCodec data
- the reference frame SF including the force standard first data BaseCodec and the reference second data SBR of each of the plurality of frames provided in the stream, and the non-standard first data BaseCodec and the non-standard second data. It is composed of non-reference frame IF including data SBR.
- the reference first data BaseCodec and the non-reference first data BaseCodec are decoded by the first decoding means 11 to generate the first decoded data (Base Codec data after decoding), while the reference first data 2
- the data SBR and the non-reference second data SBR are decrypted by the second decryption means 14 using the header information obtained by the header information obtaining means 11, and the second decrypted data ( SBR data after decoding) ) Is generated.
- the first decoding unit 11 performs the decoding. Based on the first decoded data (BaseCodec data after decoding) The pseudo-generation means 13 generates pseudo high frequency component data. Then, the output control means 15 outputs the first decoded data (decoded BaseCodec data) together with the generated high frequency component data.
- the silence state can be shortened and the high-frequency component can be reduced compared to the case where the silence state is obtained until the next header information can be obtained after the error occurs or only the first decoded data (decoded BaseCodec data) is output. By outputting, it is possible to reduce the sense of incongruity in hearing.
- the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data consisting of a plurality of framed frames, and the plurality of frame sequences are It includes multiple frames consisting of a reference frame SF and a non-reference frame IF, and the non-reference frame IF expands the playback bandwidth of the non-reference first data BaseCodec that encodes audio information and the non-reference first data BaseCodec.
- the reference frame SF includes the reference first data BaseCodec encoded voice information and the reproduction band of the reference first data BaseCodec.
- Reference second data SB which is configured by encoding band expansion information for expansion and has a processing header (SBR header) with header information for performing calculation processing of non-reference second data SBR
- SBR header processing header
- Step S15) and the reference first data BaseCodec or non-reference first data BaseCodec of the reference frame SF or non-reference frame IF are decoded to generate first decoded data (decoded BaseCodec data).
- the header information acquisition procedure S 1 is the header information acquisition procedure S 1
- step 5 If header information could not be obtained in step 5, high frequency component data is generated based on the decoded first decoded data (BaseCodec data after decoding), and the first decoded data (after decoding) BaseCodec data) and the generated high-frequency component data are output together.
- the silence state can be shortened and the high frequency component can be output compared to the case where the silence state is obtained until the next header information can be acquired after an error occurs or only the first decoded data (decoded BaseCodec data) is output. By doing this, you can reduce the sense of incongruity in hearing.
- the digital audio data processing apparatus 1 in the above embodiment has error determination means (BaseCodecDecode section 11 in this example) for determining whether or not a part of the stream has been lost. Is characterized by generating high-frequency component data when it is determined by the error determination means 11 that the non-reference frame IF has disappeared.
- the high frequency component pseudo generation means 13 when the error determination means 11 determines that the non-reference frame IF has disappeared, the high frequency component pseudo generation means 13 generates high frequency component data.
- silence occurs compared to when silence occurs until the next header information can be acquired after an error in which the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output. And a sense of discomfort in hearing can be reduced by outputting a high frequency component.
- the digital audio data processing method further includes an error determination procedure (in this example, step S10 in Fig. 5) for determining whether or not a part of the stream has been lost.
- High frequency component data is generated when it is determined in S10 that the non-reference frame IF has disappeared.
- high frequency component data is generated when it is determined that the non-reference frame IF has disappeared.
- silence occurs compared to when silence occurs until the next header information can be obtained after an error that the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output.
- a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having error data generation means (in this example, error processing unit 12) for generating data (in this example, mute data).
- a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having an error data generation procedure (in this example, step S60) for generating data (in this example, mute data).
- FIGS. 6 to 8 are explanatory diagrams for specifically explaining the effects of the present embodiment listed above.
- the horizontal axis represents the time axis
- FIG. 6 shows an example of the behavior of each frame in the input stream
- FIG. 7 shows the digital audio data processing apparatus 1 of the present embodiment corresponding to FIG. Fig. 8 shows the output behavior of the comparative example in which only the first decoded data (BaseCodec data after decoding) is output after the error occurs.
- the frame after the frame in which the error has occurred differs between the comparative example and the present embodiment. That is, As shown in Fig. 8, in the above comparative example, if an error occurs, the SBR part cannot be decoded until the header information of the SBR header is newly acquired thereafter, and the low frequency band of only the BaseCodec part cannot be obtained. Output decrypted data.
- the digital audio data processing device 1 is code data including a plurality of framed frames, and the plurality of frame sequences are a plurality of frames including a reference frame SF and a non-reference frame IF.
- the non-reference frame IF includes the BaseCodec that encodes the audio information and the SBR that encodes the band expansion information for expanding the playback band of this BaseCodec
- the reference frame SF Includes BaseCodec that encodes information, and SBR that has SBR header that is configured by encoding band expansion information for expanding the playback band of BaseCodec, and includes header information for performing SBR calculation processing
- the frame includes a reference frame SF including an SBR having a base codec and an SBR header, and a non-reference frame IF including a base codec and an SBR, respectively.
- BaseCodec is decoded by Base CodecDecode unit 11 to generate BaseCodec data
- SBR is decoded by SBRDecode unit 14 using the header information acquired by BaseCodecDecode unit 11 to generate SBR data .
- the BaseCodecDecode unit 11 decodes the BaseCodec data Based on this, the high frequency component pseudo generation processing unit 13 generates high frequency component data in a pseudo manner. Then, the switch 15 outputs BaseCodec data together with the generated high frequency component data. This makes it possible to reduce the silence and reduce the sense of incongruity by outputting high-frequency components, compared to the case where silence is maintained until the next header information can be acquired after an error occurs, and only BaseCodec is output. can do.
- the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data including a plurality of framed frames, and includes a plurality of frames.
- the column includes a plurality of frames composed of a reference frame SF and a non-reference frame IF.
- the non-reference frame IF force encodes base codec that encodes audio information and band expansion information for expanding the playback band of this base codec.
- BaseCodec which encodes the reference frame SF power audio information, and the band expansion information for expanding the playback band of this BaseCodec.
- a digital audio data processing method for processing a stream including an SBR having an SBR header with header information, the header of the SBR header provided in the SBR of the reference frame SF.
- Step S15 for obtaining information, and decoding the BaseCodec of the reference frame SF or non-reference frame IF, and generating the BaseCodec data
- the BaseCodec decoding procedure by the BaseCodecDecode unit 11 and the reference frame SF or non-reference frame IF SBR is decrypted using the header information obtained in step S15, and SBR data If the header information could not be acquired in step S35 and step S15, a higher playback bandwidth than that of the BaseCodec data is obtained based on the BaseCodec data decoded by BaseCodec decoding by the BaseCodecDecode â 11.
- the pseudo high frequency component data is generated and BaseCodec data and high frequency component data are output together.
- step S15 when header information cannot be acquired in step S15, high frequency component data is generated based on the decoded BaseCodec data, and BaseCodec data and The generated high frequency component data is also output.
- FIG. 1 is a perspective view showing the overall appearance of a mobile phone according to an embodiment of the present invention.
- FIG. 2 is a functional block diagram showing a functional configuration of the mobile phone shown in FIG.
- FIG. 3 is a schematic diagram conceptually showing a portion related to audio extracted from a stream.
- FIG. 4 is a functional block diagram showing a configuration related to audio signal reproduction processing in the signal processing section shown in FIG.
- FIG. 5 is a flowchart showing the processing procedure executed by the BaseCodecDecode part for each frame.
- FIG. 6 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
- FIG. 7 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
- FIG. 8 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention. Explanation of symbols
- BaseCodecDecode part error determination means, header information acquisition means, first decoding means, second decoding means, analysis means, verification means
- High-frequency component pseudo-generation processing unit (high-frequency component pseudo-generation means)
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
[PROBLEMS] To reduce uncomfortable feeling of an operator even when an error has occurred in a stream. [MEANS FOR SOLVING PROBLEMS] A high-band component pseudo generation unit (13) pseudo-wise generates high-band component data having a higher reproduction band than decoded data in a Base Codec unit and outputs the decoded Base Codec data together with the pseudo-wise generated high-band component data. Thus, as compared to a case when a soundless state continues until next header information is acquired after an error has occurred or when only Base Codec data is outputted after decoding, it is possible to reduce the soundless state and output a higher-band component, thereby reducing the uncomfortable feeling for sense of hearing.
Description
æ 现 æž Â Specification
ããžã¿ã«é³å£°ããŒã¿åŠçè£
眮åã³åŠçæ¹æ³  Digital audio data processing apparatus and processing method
æè¡åé Â Technical field
[0001] æ¬çºæã¯ã笊å·åããŒã¿ãããžã¿ã«åŠçããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮åã³åŠ çæ¹æ³ã«é¢ãããã®ã§ããã  [0001] The present invention relates to a digital audio data processing apparatus and processing method for digitally processing encoded data.
èæ¯æè¡ Â Background art
[0002] äŸãã°ããžã¿ã«åã¯ã€ã³ã¿ãŒãããæŸéåä¿¡æ©ãçšããŠãç»åããŒã¿ïŒãããªã㌠ã¿ïŒãé³å£°ããŒã¿ïŒãªãŒãã£ãªããŒã¿ïŒãããžã¿ã«åããŠå€éäŒéããããžã¿ã«æŸé ãæ¢ã«éå§ãããŠããããã®ããžã¿ã«æŸéã«ãããŠã¯ãæå®ã®å§çž®ç¬Šå·åæ¹åŒ (äŸ ãã° MPEG ; Moving Picture Experts Groupæ¹åŒçïŒãæ¡çšãããŠãããè€æ° ã®çªçµã®ããŒã¿å åœè©²ç¬Šå·åæ¹åŒã«å¯Ÿå¿ããã¹ããªãŒã ïŒäŸãã° MPEGãã©ã³ã¹ã äžãã¹ããªãŒã ïŒ MPEGâ TSçïŒã«å€éåããŠäŒéãããããã®ããã«å€éåãã¹ããªãŒ ã ãšããŠäŒéãããããŒã¿ã¯ããããåä¿¡ããåä¿¡æ©åŽã«ãã¬ããŠææã®ããŒã¿ãéž æçã«æœåºãããã  [0002] Digital broadcasting for digitizing and multiplex-transmitting image data (video data) and audio data (audio data) using, for example, a digital or Internet broadcast receiver has already been started. In this digital broadcasting, a predetermined compression encoding method (for example, MPEG; Moving Picture Experts Group method, etc.) is employed, and the data power of a plurality of programs (for example, MPEG transport stream). Stream; MPEG-TS, etc.) The multiplexed data transmitted as a stream is selectively extracted by transmitting it to the receiver side that has received the data.
[0003] è¿å¹ŽããããŒããã³ãåã®äžå±€ã®é²å±ã«äŒŽããæºåž¯é»è©±æ©ããã®ä»ã®æºåž¯ç«¯æ«ïŒ ãããã第äžäžä»£ã®æºåž¯ç«¯æ«ãè»èŒç«¯æ«ãã¢ãã€ã«æ©åš)çãæ¯èŒçç°¡çŽ ãªæ§é ã® ããžã¿ã«åã¯ã€ã³ã¿ãŒãããæŸéåä¿¡æ©ãçšã¬ããç°¡æçãªåç»é
ä¿¡ãµãŒãã¹ãè¡ãã ãšãèšç»ãããŠããã  [0003] In recent years, with the progress of broadbandization, digital or Internet broadcast receivers having a relatively simple structure such as mobile phones and other mobile terminals (so-called third-generation mobile terminals, in-vehicle terminals, mopile devices), etc. It is planned to provide a simple video distribution service.
[0004] ãã®å Žåãé³å£°ããŒã¿ (ãªãŒãã£ãªããŒã¿ïŒã«ã€ããŠã¯ãåŸæ¥ããå¹
åºã䜿çšãã㊠ãããäžèš MPEGã§èŠæ Œåãã AAC (Advanced Audio Coding)èŠæ Œã«å ãã S BR (Spectral Band Replication)æè¡ãé©çšãã笊å·åããã§ã«æå±ãããŠãã (äŸãã°ãç¹èš±æç® 1åç
§)ã Â [0004] In this case, SBR (Spectral Band Replication) technology has been applied to audio data (audio data) in addition to the AAC (Advanced Audio Coding) standard standardized by MPEG, which has been widely used in the past. Encoding has already been proposed (see, for example, Patent Document 1).
[0005] ããªãã¡ãäžè¬ã«ãé³å£°ããŒã¿ã®ç¬Šå·ã£åã«ãããŠã¯ãé«åšæ³¢æ°æåã®ç¬Šå·åã«å åãªããããå²ãåœãŠãã®ãå°é£ã§ããããšã«ç±æ¥ããŠãå§çž®çãé«ããªãã»ã©ãåç 垯åã®äžéåšæ³¢æ°ãäœäžãé³è³ªãå£åããåŸåãšãªããäžèš SBRæè¡ã¯ãã®ãã㪠é«åšæ³¢æ°æåã®æ¬ èœãè£ããã®ã§ãããäœåšæ³¢æ°æåããé«åšæ³¢æ°æåãäºæž¬ãã ããã®è£å©æ
å ±ãäºãã¹ããªãŒã å
ã«æ ŒçŽããŠãããåçæã«ã¯åž¯åæ¡åŒµåŠçãæœ
ããŠæ¬äŒŒçã«åž¯åãæ¡åŒµããŠé«åšæ³¢æ°æåãçæããããšã§ãé«é³è³ªãªåçãå¯èœ ãšãããã®ã§ããã [0005] That is, in general, in the encoding of audio data, it is difficult to allocate sufficient bits for encoding high frequency components, and as the compression rate increases, the upper limit of the reproduction band is increased. The frequency tends to decrease and the sound quality tends to deteriorate. The above SBR technology compensates for the lack of such high-frequency components. Auxiliary information for predicting high-frequency components from low-frequency components is stored in advance in the stream, and band expansion processing is performed during playback. In this way, it is possible to reproduce high-quality sound by generating a high-frequency component by expanding the bandwidth in a pseudo manner.
[0006] ãã®ãããªåŸæ¥ã® AACã«å ã SBRãè¿œå ãã笊å·åæ¹åŒã¯ã AACâ plusãšç§°ãã ãŠããã 1ãã¬ãŒã ã®ããŒã¿ã¯ã AACã®ç¬Šå·åããŒã¿ïŒBaseCodec)ãšã SBRã®ç¬Šå· åããŒã¿ãšããæ§æãããããªãã AACã«ã®ã¿å¯Ÿå¿ããåŸæ¥ã®åŸ©å·ã£åæ段ã§ãã£ãŠ ãã SBRããŒã¿ãèªã¿é£ã°ãããšã«ãã£ãŠ AACããŒã¿ã®ã¿ã埩å·ã£åã§ããããã«ãªã€ ãŠããã  [0006] An encoding method in which SBR is added in addition to the conventional AAC is called AAC-plus, and one frame of data includes AAC encoded data (BaseCodec) and SBR encoded data. It consists of. Note that even conventional decryption means compatible only with AAC can decrypt only AAC data by skipping SBR data.
[0007] ç¹èš±æç® 1 :ç¹é 2006â 50387å·å
¬å ±  [0007] Patent Document 1: Japanese Patent Laid-Open No. 2006-50387
çºæã®é瀺  Disclosure of the invention
çºæã解決ããããšããèª²é¡ Â Problems to be solved by the invention
[0008] äžèš SBRãçšãã笊å·åæ¹åŒã§ã¯ãåè¿°ããããã«ãäœåšæ³¢æ°æåããé«åšæ³¢æ° æåãäºæž¬ããããã®è£å©æ
å ±ã«å¯Ÿãåçæã«åž¯åæ¡åŒµåŠçãæœããŠé«åšæ³¢æ°æ åãçæãããã®ã§ããããã®ãšããåè¿°ããããŒã¿äŒééã®å¶çŽã«åºã¥ããæå®ã® ãã¬ãŒã ããŒã¿åäœïŒäŸãã°æ°ãã¬ãŒã åããæ°åãã¬ãŒã åäœïŒããšã«ïŒäŸãã°äžå®æ ã«ïŒ SBRããããåãããã¬ãŒã ïŒåºæºãã¬ãŒã ïŒãæ·å
¥ãããŠããããããŠããã®ã㬠ãŒã 以å€ã® SBRããããåããªããã¬ãŒã ïŒéåºæºãã¬ãŒã ïŒãå«ã¿ããã¹ãŠã®ã㬠ãŒã ããåœè©² SBRãããã«æ ŒçŽããæ
å ± (ãããæ
å ±ïŒããæŒç®çšã®ããŒãã«ãäœæ ãããã®ããŒãã«ã«åºã¥ãäžèšåž¯åæ¡åŒµåŠçãè¡ã£ãŠé«åšæ³¢æåãçæããããã« ãªã£ãŠããã [0008] In the encoding method using SBR, as described above, auxiliary information for predicting a high frequency component from a low frequency component is subjected to band expansion processing during reproduction to generate a high frequency component. It is. At this time, a frame (reference frame) having an SBR header is generated every predetermined frame data unit (for example, several frames, several tens of frames) (for example, irregularly) based on the data transmission amount restriction described above. It has been entered. In addition, frames that do not have an SBR header other than this frame (non-reference frames) are included, and all frames create a calculation table from the information (header information) stored in the SBR header. Based on the table, the band expansion process is performed to generate a high frequency component.
[0009] ãã®å ŽåãïŒããŒã¿ã«ã®äŒééãåæžã§ãããšããã¡ãªããã¯ãããã®ã®ïŒäžèšã®ããã« S BRããŒã¿ã®åŸ©å·åã«ã¯å¿
ã SBRãããã®ãããæ
å ±ãå¿
èŠã«ãªããããäœããã® äºæ
ã§ã¹ããªãŒã ã®äžéšãæ¶å€±ããïŒ=ãšã©ãŒçºç)å Žåã«ã¯ïŒãã®éã«ãããæ
å ±ã å€åããŠããå¯èœæ§ãããããšããïŒæ¬¡ã« SBRãããã®æ
å ±ãååŸãããŸã§ã埩å·ã£å ãè¡ããªããªãããã®ããããã£ãããšã©ãŒãçºçãããšã次㮠SBRãããã®æ
å ±ã ååŸãããŸã§ã®éã¯ã AACããŒã¿ïŒããªãã¡äœåéšåïŒã®ã¿ã埩å·åãããŠåºåã ããé«åãåçãããªãã®ã§ãæäœè
ã«å¯ŸãèŽæäžéåæãäžããããšãšãªã£ãŠãã  [0009] In this case, the header information of the SBR header is always required for decoding the SBR data as described above (although there is an advantage that the total transmission amount can be reduced). If the part is lost (= error occurred) (because the header information may have changed in the meantime), decoding cannot be performed until the next SBR header information is obtained. For this reason, once an error occurs, only the AAC data (that is, the low frequency band) is decoded and output until the next SBR header information is acquired, and the high frequency is not reproduced. Was supposed to give a sense of incongruity
[0010] æ¬çºæã解決ããããšãã課é¡ã«ã¯ãäžèšããåé¡ãäžäŸãšããŠæããããã
課é¡ã解決ããããã®æ段 [0010] The problems to be solved by the present invention include the above-described problems as an example. Means for solving the problem
[0011] äžèšèª²é¡ã解決ããããã«ãè«æ±é
1èšèŒã®çºæã¯ããã¬ãŒã åãããè€æ°ã®ã㬠ãŒã åãããªã笊å·åããŒã¿ã§ãã£ãŠãåèšè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã åã³ éåºæºãã¬ãŒã ãããªãè€æ°ã®ãã¬ãŒã ãåããåèšéåºæºãã¬ãŒã ããé³å£°æ
å ±ã 笊å·åããéåºæºç¬¬ 1ããŒã¿ãšããã®éåºæºç¬¬ 1ããŒã¿ã®åç垯åãæ¡å€§ãããã ã®åž¯åæ¡å€§æ
å ±ã笊å·åããéåºæºç¬¬ 2ããŒã¿ãšãå«ã¿ãåèšåºæºãã¬ãŒã ããé³ å£°æ
å ±ã笊å·åããåºæºç¬¬ 1ããŒã¿ãšããã®åºæºç¬¬ 1ããŒã¿ã®åç垯åãæ¡å€§ããã ãã®åž¯åæ¡å€§æ
å ±ã笊å·åããŠæ§æãããåèšéåºæºç¬¬ 2ããŒã¿ã®æŒç®åŠçãè¡ã ããã®ãããæ
å ±ãåããåŠçããããæããåºæºç¬¬ 2ããŒã¿ãšãå«ããã¹ããªãŒã ã åŠçããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã§ãã£ãŠãåèšåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2 ããŒã¿ã«åããããåèšåŠçãããã®åèšãããæ
å ±ãååŸãããããæ
å ±ååŸæ 段ãšãåèšåºæºãã¬ãŒã åã¯åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 1ããŒã¿åã¯åèšé åºæºç¬¬ 1ããŒã¿ã埩å·åãã第 1埩å·åããŒã¿ãçæãã第 1埩å·åæ段ãšãåèšåº æºãã¬ãŒã åã¯åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿åã¯åèšéåºæºç¬¬ 2ã ãŒã¿ããåèšãããæ
å ±ååŸæ段ã§ååŸããåèšãããæ
å ±ãçšããŠåŸ©å·åãã第 2è€å·åããŒã¿ãçæãã第 2埩å·åæ段ãšãåèšãããæ
å ±ååŸæ段ã§åèšãžã ãæ
å ±ãååŸã§ããªãã£ãå Žåã«ãåèšç¬¬ 1è€å·åæ段ã§åŸ©å·åããã第 1è€å·å ããŒã¿ã«åºã¥ããåœè©²ç¬¬ 1埩å·åããŒã¿ãããé«ãåç垯åãæããé«åæåã㌠ã¿ãæ¬äŒŒçã«çæããé«åæåæ¬äŒŒçææ段ãšãåèšç¬¬ 1埩å·ã£ãããŒã¿ãšåèšé«å æåããŒã¿ã䜵ããŠåºåããããã®åºåå¶åŸ¡æ段ãšãæããã [0011] In order to solve the above-described problem, the invention according to claim 1 is encoded data including a plurality of framed frame sequences, and the plurality of frame sequences include a reference frame and a non-reference. The non-reference frame includes non-reference first data obtained by encoding speech information and non-reference first data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data. Second reference data, and the reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding the reproduction band of the reference first data, A digital audio data processing device for processing a stream, including reference second data having a processing header including header information for performing arithmetic processing of the non-reference second data, wherein the reference frame A header information acquisition unit for acquiring the header information of the processing header provided in the reference second data, and decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame The first decoding means for generating first decoded data, and the reference second data or the non-reference second data of the reference frame or the non-reference frame is obtained by the header information obtaining means. When the header information cannot be acquired by the second decoding means for decoding using the header information and generating second decoded data, and the header information acquisition means, the first decoding is performed. A high-frequency component pseudo-generating means for pseudo-generating high-frequency component data having a reproduction band higher than that of the first decoded data based on the first decoded data decoded by the encoding means; 1 Decryption Hide And output control means for outputting the high-frequency component data together.
[0012] ãŸããè«æ±é
4èšèŒã®çºæã¯ããã¬ãŒã åãããè€æ°ã®ãã¬ãŒã åãããªã笊å·ã£å ããŒã¿ã§ãã£ãŠãåèšè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã åã³éåºæºãã¬ãŒã ãã㪠ãè€æ°ã®ãã¬ãŒã ãåããåèšéåºæºãã¬ãŒã ããé³å£°æ
å ±ã笊å·åããéåºæºç¬¬ 1ããŒã¿ãšããã®éåºæºç¬¬ 1ããŒã¿ã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊 å·åããéåºæºç¬¬ 2ããŒã¿ãšãå«ã¿ãåèšåºæºãã¬ãŒã ããé³å£°æ
å ±ã笊å·åãã åºæºç¬¬ 1ããŒã¿ãšããã®åºæºç¬¬ 1ããŒã¿ã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ± ã笊å·åããŠæ§æãããåèšéåºæºç¬¬ 2ããŒã¿ã®æŒç®åŠçãè¡ãããã®ãããæ
å ± ãåããåŠçããããæããåºæºç¬¬ 2ããŒã¿ãšãå«ããã¹ããªãŒã ãåŠçããããžã¿ã«
é³å£°ããŒã¿åŠçæ¹æ³ã§ãã£ãŠãåèšã¹ããªãŒã ã®äžéšãæ¶å€±ãããã©ãããå€å®ãã ãšã©ãŒå€å®æé ãšãåèšåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿ã«åããããåèšåŠç ãããã®åèšãããæ
å ±ãååŸãããããæ
å ±ååŸæé ãšãåèšåºæºãã¬ãŒã å㯠åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 1ããŒã¿åã¯åèšéåºæºç¬¬ 1ããŒã¿ãè€å·åãã 第 1è€å·åããŒã¿ãçæãã第 1埩å·åæ段ãšãåèšåºæºãã¬ãŒã åã¯åèšéåºæº ãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿åã¯åèšéåºæºç¬¬ 2ããŒã¿ããåèšãããæ
å ±ååŸ æé ã§ååŸããåèšãããæ
å ±ãçšããŠåŸ©å·ã£åãã第 2è€å·åããŒã¿ãçæãã第 2è€å·åæé ãšãæããåèšãããæ
å ±ååŸæé ã§åèšãããæ
å ±ãååŸã§ããªã ã€ãå Žåã«ãåèšç¬¬ 1埩å·åæé ã§åŸ©å·åãã第 1è€å·åããŒã¿ã«åºã¥ããåœè©²ç¬¬ 1è€å·åããŒã¿ãããé«ãåç垯åãæããé«åæåããŒã¿ãæ¬äŒŒçã«çæããå èšç¬¬ 1è€å·åããŒã¿ãšåèšé«åæåããŒã¿ã䜵ããŠåºåããã [0012] Further, the invention according to claim 4 is code data including a plurality of framed frames, and the plurality of frame sequences includes a plurality of frames including a reference frame and a non-reference frame. And the non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data. And the reference frame is configured by encoding reference first data in which audio information is encoded and band expansion information for expanding a reproduction band of the reference first data. Digital processing stream, including reference second data having a processing header with header information for performing arithmetic processing An audio data processing method, an error determination procedure for determining whether or not a part of the stream has been lost, and a header for acquiring the header information of the processing header provided in the reference second data of the reference frame An information acquisition procedure; first decoding means for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame to generate first decoded data; and the reference frame Alternatively, the second reference data or the second non-reference data of the non-reference frame is decoded using the header information acquired in the header information acquisition procedure to generate second decoded data. And when the header information cannot be acquired by the header information acquisition procedure, based on the first decoding data decoded by the first decoding procedure, High-frequency component data with a higher regeneration zone than 1 decodes data generated in a pseudo manner, and outputs before Symbol the high frequency component data together with the first decryption data.
çºæãå®æœããããã®æè¯ã®åœ¢æ
 BEST MODE FOR CARRYING OUT THE INVENTION
[0013] 以äžãæ¬çºæã®äžå®æœã®åœ¢æ
ãå³é¢ãåç
§ãã€ã€èª¬æããããã®å®æœåœ¢æ
ã¯ãã ãžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã®äžäŸãšããŠãæºåž¯é»è©±æ©ã«é©çšããå Žåã®å®æœåœ¢æ
ã§ããã Hereinafter, an embodiment of the present invention will be described with reference to the drawings. This embodiment is an embodiment when applied to a mobile phone as an example of a digital audio data processing apparatus.
[0014] å³ 1ã¯ãæ¬å®æœåœ¢æ
ã®æºåž¯é»è©±æ©ã®å
šäœå€èŠ³ãè¡šãæèŠå³ã§ãããå³ 1ã«ãã㊠ãããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ãšããŠã®ãã®æºåž¯é»è©±æ© 1ã¯ã TS (Transport Stream )ãšããŠé
ä¿¡ãããé³å£°ããŒã¿ãæ åããŒã¿ãããŒã¿æŸéçšããŒã¿ãªã©ãæããã³ã³ã ã³ãããŒã¿ãååŸããŠåºåããã MPEG2âTSã·ã¹ãã ãçšããå°äžæ³¢ããžã¿ã«æŸ éïŒã¬ãœãããã¯ã³ã»ã°æŸéïŒå¯Ÿå¿ã®ãã®ã§ããã  FIG. 1 is a perspective view showing the overall appearance of the mobile phone according to the present embodiment. In FIG. 1, this mobile phone 1 as a digital audio data processing apparatus acquires and outputs content data including audio data, video data, data for data broadcasting, etc. distributed as TS (Transport Stream). It is compatible with terrestrial digital broadcasting using the TS system.
[0015] æºåž¯é»è©±æ© 1ã¯ããã®äŸã§ã¯ãæ¬äœã±ãŒã·ã³ã° 2ãšããã®æ¬äœã±ãŒã·ã³ã° 2ã®äžéšã« èšããããé»è©±çªå·ã®å
¥åããŒãåçš®ã®æ©èœãã¿ã³ãªã©ãåããæäœéš 3ãšããã®åº 端éšãæ¬äœã±ãŒã·ã³ã° 2ã®äžç«¯éšã«æ¢æ¯ãããŠæ¬äœã±ãŒã·ã³ã° 2ã«å¯Ÿãééèªåšã« åãä»ããããééã«ã㌠4ãšãå皮衚瀺ãè¡ãè¡šç€ºéš 5ãšãç¡ç·éä¿¡ãä»ãããŒã¿é åä¿¡ãè¡ãããã®ã¢ã³ãã 6ãšãé³å£°ãçºããã¹ããŒã« 7ãšãééæäœã®ããã®ã¬ã㌠ã¹ã£ãã 8ãšããã€ã¯ 9ãšãäŸãã°å°äžæ³¢ããžã¿ã«æŸéãè¡æããžã¿ã«æŸéãªã©ã®æŸé æ³¢ãåä¿¡ããæŸéçšã¢ã³ãã 10 (å³ç€ºãããåŸè¿°ã®å³ 2åç
§ïŒçãåããŠããã  [0015] In this example, the cellular phone 1 is provided with a main body casing 2, an operation unit 3 provided at a lower portion of the main body casing 2, and provided with a telephone number input key, various function buttons, and the like, and a base end portion thereof. Open / close cover 4 pivotally supported at the lower end of main casing 2 and attached to main casing 2 so as to be openable / closable, display 5 for various displays, and antenna 6 for data transmission / reception via wireless communication A speaker 7 that emits sound, a lever switch 8 for opening and closing operation, a microphone 9, and a broadcasting antenna 10 that receives broadcast waves such as terrestrial digital broadcasts and satellite digital broadcasts (not shown, described later) Etc.).
[0016] å³ 2ã¯ãäžèšæºåž¯é»è©±æ© 1ã®æ©èœçæ§æãè¡šãæ©èœãããã¯å³ã§ãããå³ 2ã«ãã
ãŠãæŸéå±ãä»ã®æºåž¯é»è©±æ©çããéä¿¡ãããŠããé»æ³¢ãäžèšã¢ã³ãã 6ïŒ 10ã§å ä¿¡ãããåŸãã¢ã³ãã 6ïŒ 10ã«æ¥ç¶ãããéåä¿¡éš 100ã§åŸ©èª¿ããããã®åŸ©èª¿ããå ä¿¡ä¿¡å·ã«å¯Ÿãä¿¡å·åŠçéš 101ã§åçã®ããã®æå®ã®ä¿¡å·åŠç (詳现ã¯åŸè¿°ïŒãæœã ãããä¿¡å·åŠçéš 101ã§åŠçããä¿¡å·ã¯ãã¹ããŒã« 7ã§é³å£°ãšããŠåçãããã FIG. 2 is a functional block diagram showing a functional configuration of the mobile phone 1. Figure 2 Odor After the radio wave transmitted from the broadcasting station or other mobile phone is received by the antennas 6 and 10, the demodulated received signal is demodulated by the transmission / reception unit 100 connected to the antennas 6 and 10. The signal processing unit 101 performs predetermined signal processing (details will be described later) for reproduction. The signal processed by the signal processing unit 101 is reproduced as sound by the speaker 7.
[0017] ãã®ãšããéåä¿¡éš 100ã¯å³ç€ºããªã¬ã TSåä¿¡éšãåããŠããããã® TSåä¿¡éšã«äž èšæŸéçšã¢ã³ãã 10 (ãªããäžèšã¢ã³ãã 6ãšå
Œçšã§ãããïŒãæ¥ç¶ãããŠããããã® TSåä¿¡éšã¯ãä¿¡å·åŠçéš 101ã®å¶åŸ¡ã«ãããäžèšæŸéçšã¢ã³ãã 10ããããžã¿ã« ä¿¡å·ãšããŠéä¿¡ãããäŸãã°è€æ°ã® TSãããå©çšè
ã«ããéžæãããã³ã³ãã³ãã«å¯Ÿ å¿ãã TSãååŸããããããŠãååŸãã TSãä¿¡å·åŠçéš 101ã« TSä¿¡å·ãšããŠåºå ããã At this time, the transmission / reception unit 100 includes a TS reception unit (not shown), and the broadcasting antenna 10 (which may also be used as the antenna 6) is connected to the TS reception unit. . Under the control of the signal processing unit 101, the TS receiving unit acquires a TS corresponding to the content selected by the user from, for example, a plurality of TSs transmitted as digital signals from the broadcasting antenna 10. Then, the acquired TS is output to the signal processing unit 101 as a TS signal.
[0018] äžæ¹ã話è
ã®äŒè©±ã¯ãã€ã¯ 9ã«å
¥åãããŠé³å£°ä¿¡å·ã«å€æãããããã®é³å£°ä¿¡å·ã¯ äžèšä¿¡å·åŠçéš 101ã«ãããŠéä¿¡ã®ããã®ä¿¡å·åŠçãæœãããéåä¿¡éš 100ã§ã¯ äžèšä¿¡å·åŠçéš 101ããã®é³å£°ä¿¡å·ãå€èª¿ããŠã¢ã³ãã 6ãžäŸçµŠããã¢ã³ãã 6ã¯ã ã®é³å£°ä¿¡å·ãé»æ³¢ãšããŠéä¿¡ããã  On the other hand, the conversation of the speaker is input to the microphone 9 and converted into an audio signal. The audio signal is subjected to signal processing for transmission in the signal processing unit 101. The transmission / reception unit 100 modulates the audio signal from the signal processing unit 101 and supplies the modulated signal to the antenna 6. The antenna 6 transmits the audio signal. As a radio wave.
[0019] ãªããäžèšã®åæ§æèŠçŽ ã¯ã CPUçãåããå¶åŸ¡éš 102ã«ãã£ãŠãã®åäœãå¶åŸ¡ ãããã  [0019] Note that the operation of each of the above-described components is controlled by the control unit 102 including a CPU and the like.
[0020] å³ 3ã¯ãæ¬å®æœåœ¢æ
ã®æºåž¯é»è©±æ© 1ã«åããããã¢ã³ãã 10ã§åä¿¡ããã TSã¹ã㪠ãŒã ã®ãã¡ãé³å£°ã«ä¿ããéšåãæãåºããŠæŠå¿µçã«è¡šãæš¡åŒå³ã§ããã  [0020] FIG. 3 is a schematic diagram conceptually showing an extracted portion related to audio in the TS stream received by the antenna 10 provided in the mobile phone 1 of the present embodiment.
[0021] å³ 3ã¯ãã¢ã³ãã 10ã§åä¿¡ãããå€éåãããã¹ããªãŒã ãããªãŒãã£ãªã®ã¹ããªãŒã ãåãåºãããã® (ããªãã¡ã Elementary Stream)ããè€æ°ã®ãã¬ãŒã ãåããŠã ãæ§åãæé軞ã«æ²¿ã£ãŠè¡šããŠããããããè€æ°ã®ãã¬ãŒã ã¯ãåºæºãã¬ãŒã SFãšã ãã以å€ã®éåºæºãã¬ãŒã IFãšããæ§æãããŠã¬ããã  [0021] FIG. 3 shows a state in which an audio stream extracted from the multiplexed stream received by the antenna 10 (ie, Elementary Stream) includes a plurality of frames along the time axis. Yes. These multiple frames consist of a reference frame SF and other non-reference frame IF.
[0022] éåºæºãã¬ãŒã IFã¯ãé³å£°æ
å ±ã AACèŠæ Œã§ç¬Šå·åãã BaseCodec (éåºæºç¬¬  [0022] Non-reference frame IF is a BaseCodec (non-reference number
1ããŒã¿ïŒãšããã® BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·ã£å ãã SBRããŒã¿ïŒéåºæºç¬¬ 2ããŒã¿ãäœã SBRããããå«ãŸãªã¬ã)ãšãå«ãã§ã¬ããã  1 data) and SBR data (non-reference second data, which does not include the SBR header) encoded with band expansion information for expanding the reproduction band of this BaseCodec.
[0023] åºæºãã¬ãŒã SFã¯ãäžèšåæ§ã«é³å£°æ
å ±ã AACèŠæ Œã§ç¬Šå·åãã BaseCodec ( åºæºç¬¬ 1ããŒã¿ïŒãšããã® BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã 笊å·åããŠæ§æãããäžèš SBRããããªãã® SBRããŒã¿ã®æŒç®åŠçãè¡ãããã®ãž
ããæ
å ±ãåãã SBRããã (åŠçãããïŒãåãã SBRããŒã¿ïŒåºæºç¬¬ 2ããŒã¿ïŒãš ãå«ãã§ããã SBRãããã®ãããæ
å ±ãçšããããšã§äžèšæŒç®åŠççšã®ããŒãã« ãäœæããããã®ããŒãã«ã«åºã¥ã垯åæ¡åŒµåŠçãè¡ã£ãŠé«åšæ³¢æåãçæå¯èœãš ãªã£ãŠããã [0023] The reference frame SF is configured by encoding BaseCodec (reference first data) obtained by encoding audio information according to the AAC standard and band expansion information for expanding the reproduction band of the BaseCodec, as described above. To perform processing of SBR data without SBR header SBR data (standard second data) with SBR header (processing header) with header information. By using the header information of the SBR header, a table for the above arithmetic processing is created, and band expansion processing can be performed based on this table to generate high frequency components.
[0024] å³ 4ã¯ãäžèšä¿¡å·åŠçéš 101ã®ãã¡ãäžèšé³å£°ä¿¡å·ã®åçåŠçã«ä¿ããæ§æãè¡š ãæ©èœãããã¯å³ã§ããã  FIG. 4 is a functional block diagram showing a configuration related to the reproduction processing of the audio signal in the signal processing unit 101.
[0025] å³ 4ã«ãããŠãä¿¡å·åŠçéš 101ã¯ã BaseCodecDecodeéš 11ãšããšã©ãŒåŠçéš 12 ãšãé«åæåæ¬äŒŒçæåŠçéš 13ãšã SBRDecodeéš 14ãšã BaseCodecDecodeéš 1In FIG. 4, the signal processing unit 101 includes a BaseCodecDecode unit 11, an error processing unit 12, a high frequency component pseudo generation processing unit 13, an SBRDecode unit 14, and a BaseCodecDecode unit 1
1ããã®åæ¿å¶åŸ¡ä¿¡å·ã«ããåãæ¿ããããã¹ã£ãã 15 (åºåå¶åŸ¡æ段ïŒãšãåããŠã ãã Switch 15 (output control means) that is switched by a switching control signal from 1.
[0026] BaseCodecDecodeéš 11ã¯ãè€æ°ã®æ©èœãåããŠããããŸã第 1ã®æ©èœãšããŠãå³ 3ãçšããŠåè¿°ããé³å£°ä¿¡å·ã®ã¹ããªãŒã (Elementary Stream)ãå
¥åããåºæºã㬠ãŒã SFåã³éåºæºãã¬ãŒã IFã«å«ãŸãã BaseCodecéšã®ãã³ãŒãïŒåŸ©å·åïŒãè¡ã (第 1埩å·åæ段)ã  [0026] The BaseCodecDecode unit 11 has a plurality of functions. First, as the first function, the audio signal stream (Elementary Stream) described above with reference to FIG. 3 is input, and the BaseCodec part included in the reference frame SF and the non-reference frame IF is decoded (decoded) ( First decoding means).
[0027] ãŸãã BaseCodecDecodeéš 11ã¯ã第 2ã®æ©èœãšããŠãåãã¬ãŒã ã SBRâ heade rãæããŠã¬ãããã¬ãŒã ãã©ããïŒèšã¬ãæããã°åºæºãã¬ãŒã SFãéåºæºãã¬ãŒã IF åïŒãå€å®ãããšãšãã«ãåºæºãã¬ãŒã SFã§ãã£ãå Žåã«ã¯ã SBRããŒã¿ã«åããã ã SBRãããã®äžèšãããæ
å ±ã®ååŸãè¡ãïŒãããæ
å ±ååŸæ段ïŒã  [0027] In addition, as a second function, the BaseCodecDecode unit 11 determines whether each frame has an SBR-header (in other words, a reference frame SF or a non-reference frame IF force). At the same time, if the frame is the reference frame SF, the header information of the SBR header provided in the SBR data is acquired (header information acquisition means).
[0028] ããã«ã BaseCodecDecodeéš 11ã¯ã第 3ã®æ©èœãšããŠã ES (Elementary Strea m)ã®äžéšãæ¶å€±ããå ïŒ=ãšã©ãŒãçºçããå ã©ããã®ãšã©ãŒå€å®ãè¡ãïŒãšã©ãŒå€ å®æ段ïŒããã®ãšãã®å€å®æ¹æ³ã¯ã BaseCodecDecodeéš 11ã«ããã ESã®ãã³ãŒã åŠçéçšã§äžå
·åãçããæã«ãšã©ãŒãšå€å®ããŠãè¯ããã ESãå
¥åããããšãã«ã å¥éåœè©² ESã«ã€ããŠã®ãšã©ãŒæ
å ±ãå€éšããåãåã£ãŠå€å®ããããã«ããŠãè¯ã  [0028] Further, as a third function, the BaseCodecDecode unit 11 also performs error determination of whether or not a part of the ES (Elementary Stream) has disappeared (= error generation force (error determination means)). The determination method may be determined to be an error when a failure occurs in the ES decoding process in the BaseCodecDecode unit 11, or when an ES is input, error information about the ES is separately received from the outside. You may make it judge
[0029] ããã§ãæ¬å®æœåœ¢æ
ã§ã¯ãåè¿°ããããã«ãã¹ã£ãã 15ã¯ã BaseCodecDecodeéš 1 1ããã®åæ¿å¶åŸ¡ä¿¡å·ã«åºã¥ãããšã©ãŒåŠçéš 12ãé«åæåæ¬äŒŒçæåŠçéš 13ã S BRDecodeéš 14ã®ããããããã®åºåãéžæããŠã¹ããŒã« 7åŽãžãšåºåããããã« ãªã£ãŠãããå³ 5ã¯ããã® BaseCodecDecodeéš 11ãåãã¬ãŒã ããšã«å®è¡ããåŠç
æé ãè¡šããããŒãã£ãŒãã§ããã Here, in the present embodiment, as described above, the switch 15 is based on the switching control signal from the BaseCodecDecode unit 11, based on the error processing unit 12, the high frequency component pseudo generation processing unit 13, and the S BRDecode unit 14. The output from either of these is selected and output to the speaker 7 side. Figure 5 shows the processing executed by the BaseCodecDecode unit 11 for each frame. It is a flowchart showing a procedure.
[0030] å³ 5ã«ãã¬ããŠããŸãã¹ããã S5ã«ãã¬ããŠããããæ
å ±ååŸïŒãããæ
å ±ãååŸã§ã ããåŠå ãè¡šããã©ã° Fh=0ãšããããã®åŸãã¹ããã S10ã§ãåè¿°ã®ãšã©ãŒå€å®æ 段ãšããŠã®æ©èœã«ãããšã©ãŒãçºçããŠãããã©ãããå€å®ããïŒå€å®æé )ã ESã®äž éšãæ¶å€±ãåœè©²ãã¬ãŒã ã§ãšã©ãŒãçºçããŠããå Žåã«ã¯å€å®ãæºããããã¹ãã ã S55ã§äžèšãã©ã° Fh = 0ã«ããåŸãã¹ããã S60ã«ç§»ãã  [0030] In FIG. 5, first, in step S5, the header information acquisition (flag Fh = 0 indicating whether or not the header information could be acquired is set. Then, in step S10, the above error is detected. Judgment is made as to whether or not an error has occurred by the function as a judgment means (judgment procedure) If part of the ES has disappeared and an error has occurred in the frame, the judgment is satisfied, and in step S55 After setting the flag Fh = 0, proceed to Step S60.
[0031] ã¹ããã S60ã§ã¯ããšã©ãŒåŠçéš 12ã«æå®ã®ãšã©ãŒçšããŒã¿ãåºåããããã«æ瀺 ãåºããšãšãã«ãã¹ã£ãã 15ããšã©ãŒåŠçéš 12åŽã«åãæ¿ããåæ¿å¶åŸ¡ä¿¡å·ãåºå ãããããã«å¿ããŠããšã©ãŒåŠçéš 12ã¯ããšã©ãŒçšããŒã¿ãšããŠãç¡é³ç¶æ
ãšãã Mu teä¿¡å· (åã¯ååŸã®ããŒã¿åãé©å®ã®ææ³ã§è£éããããŒã¿ã§ãããïŒãåºåã (㧠ã©ãŒçšããŒã¿çææ段ããšã©ãŒçšããŒã¿çææé ïŒãã¹ã£ãã 15ããã¹ããŒã« 7åŽãž ãšåºåãããã  [0031] In step S60, the error processing unit 12 is instructed to output predetermined error data, and a switching control signal for switching the switch 15 to the error processing unit 12 side is output. In response to this, the error processing unit 12 outputs, as error data, a mute signal for making a silent state (or data before and after the data force may be interpolated by an appropriate method) (error data generation means, Error data generation procedure), output from switch 15 to speaker 7 side.
[0032] äžæ¹ãã¹ããã S10ã§ãšã©ãŒãçºçããŠããªå ãå Žåã«ã¯å€å®ãæºãããããã¹ ããã S 15ã«ç§»ããã¹ããã S 15ã§ã¯ãåè¿°ã®ãããæ
å ±ååŸæ段ãšããŠã®æ©èœã«ã ããåœè©²ãã¬ãŒã ãã SBRãããã®ãããæ
å ±ãååŸããïŒãããæ
å ±ååŸæé )ãã ã®åŸãã¹ããã S20㧠SBRãããããã€ããã©ããïŒèšãæããã°ããããæ
å ±ã« æŒ ç®çšã®ããŒãã«ãäœæãããŠãããã©ããïŒãå€å®ããããããæ
å ±ãååŸããæŒç® çšã®ããŒãã«ãäœæãããŠããå Žåã«ã¯å€å®ãæºããããã¹ããã S25ã§äžèšãã© ã° Fh= lãšããåŸãã¹ããã S30ã«ç§»ãããªããã¹ããã S20㧠SBRãããããªãã£ã å Žåã«ã¯å€å®ãæºãããããçŽæ¥ã¹ããã S30ã«ç§»ãã  [0032] On the other hand, if it is determined in step S10 that an error has not occurred, the determination is not satisfied, and the routine goes to step S15. In step S15, the header information of the SBR header is acquired from the frame by the function as the header information acquisition means described above (header information acquisition procedure). Thereafter, in step S20, it is determined whether or not there is an SBR header (in other words, whether or not a header information calculation table has been created). If the header information has been acquired and the calculation table has been created, the determination is satisfied, and after setting the flag Fh = l in step S25, the process proceeds to step S30. If there is no SBR header in step S20, the determination is not satisfied, and the routine goes directly to step S30.
[0033] ã¹ããã S30ã§ã¯ãäžèšãã©ã° Fh= 1ã§ãããã©ããïŒèšãæããã°ãããæ
å ±ãå åŸããæŒç®çšããŒãã«ãäœæãããŠããå ãå€å®ãããå€å®ãæºãããããã¹ãã ã S35ãžç§»ããåè¿°ã®ç¬¬ 1埩å·ã£åæ段ãšããŠã®æ©èœã«ãããã³ãŒããã BaseCodecéš ã®åŸ©å·åããŒã¿åã³ïŒãã®æç¹ã§ä¿æããŠã¬ããïŒ SBRéšã® Syntax (èšã¬ãæããã°ãž ããæ
å ±ã«åºã¥ãããŒãã«ïŒã SBRDecodeéš 14ã«åºåãããšãšãã«ãã¹ã£ãã 15ã S BRDecodeéš 14ã«åãæ¿ããåæ¿å¶åŸ¡ä¿¡å·ãåºåãããããã«å¿ããŠã SBRDeco deéš 14ã¯ã BaseCodecDecodeéš 11ããåãåã£ã BaseCodecéšã®åŸ©å·åã㌠ã¿ã«ãããã«åè¿°ã® SBRéšã® Syntaxæ
å ±ã«åºã¥ãããŒãã«ãçšããŠè€å·ååŠçã
è¡ãçæãã SBRéšã®åŸ©å·ã£åããŒã¿ãã«å©ããŠã¹ã£ãã 15ã«åºåã (第 2埩å·ã£åæé )ãã¹ã£ãã 15ããã¹ããŒã« 7åŽãžãšåºåãããã [0033] In step S30, it is determined whether or not the flag Fh = 1 (in other words, the force with which the header information is obtained and the calculation table is created. If the determination is satisfied, the process proceeds to step S35, and The decoding data of the BaseCodec part decoded by the function as the first decoding means and the Syntax of the SBR part (retained at this point) (the table based on the header information in other words) In addition to outputting to the SBRDecode unit 14, it outputs a switching control signal for switching the switch 15 to the SBRDecode unit 14. In response, the SBRDecode unit 14 receives the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 as follows: In addition, decryption processing is performed using a table based on the syntax information of the SBR section described above. The decoding data generated by the SBR section is collected and output to the switch 15 (second decoding procedure), and output from the switch 15 to the speaker 7 side.
[0034] ãªããåè¿°ã®ã¹ããã S30ã§ã®å€å®ãæºããããªãå Žåãã¹ããã S40ã«ç§»ããã¹ ããã S40ã§ã¯ãé«åæåæ¬äŒŒçæåŠçéš 13ã«å¯Ÿãåè¿°ã®ç¬¬ 1è€å·åæ段ãšããŠã® æ©èœã«ãããã³ãŒããã BaseCodecéšã®è€å·åããŒã¿ãåºåãããšãšãã«ãã¹ã£ãã 1 5ãé«åæåæ¬äŒŒçæåŠçéš 13ã«åãæ¿ããåæ¿å¶åŸ¡ä¿¡å·ãåºåãããããã«å¿ã ãŠãé«åæåæ¬äŒŒçæåŠçéš 13ã¯ã BaseCodecDecodeéš 11ããåãåã£ãäžèš BaseCodecéšã®è€å·åããŒã¿ã 2åã«ã¢ãããµã³ããªã³ã°ãããšå
±ã«ãå
¬ç¥ã®ææ³ïŒ äŸãã°ç¹èš±ç¬¬ 3140273å·å
¬å ±ã«èšèŒã®ææ³ïŒã«ããäœåã§ããäžèš BaseCodecéš ã®åŸ©å·åããŒã¿ã«åºã¥ãé«åæåããŒã¿ãçæããåœè©²çæããé«åæåããŒã¿ã å«ãè€å·åããŒã¿ãã¹ã£ãã 15ã«åºåããã¹ã£ãã 15ããã¹ããŒã« 7åŽãžãšåºåãã ãã [0034] If the determination in step S30 is not satisfied, the process proceeds to step S40. In step S40, the decoded data of the BaseCodec part decoded by the function as the first decoding means described above is output to the high-frequency component pseudo-generation processing unit 13 and the switch 15 is pseudo-generated. A switching control signal for switching to the processing unit 13 is output. In response to this, the high-frequency component pseudo-generation processing unit 13 up-samples the decoded data of the BaseCodec unit received from the BaseCodecDecode unit 11 twice and uses a known method (for example, described in Japanese Patent No. 3140273). The high-frequency component data is generated based on the decoded data of the BaseCodec part, which is a low frequency by the above method), and the decrypted data including the generated high-frequency component data is output to the switch 15, and the switch 15 Output to the side.
[0035] åè¿°ã®ã¹ããã S35ãã¹ããã S40ãã¹ããã S60ãçµäºããããã¹ããã S 10ã«æ»ã ãåæ§ã®æé ãç¹°ãè¿ãã  [0035] When step S35, step S40, and step S60 are completed, the process returns to step S10 and the same procedure is repeated.
[0036] äžèšãããŒã®ããã«ãæ¬å®æœåœ¢æ
ã§ã¯ã SBRãããã®ãããæ
å ±ãååŸã§ããªãå Ž å (èšãæããã°æŒç®çšããŒãã«ãäœæãããŠããªãå ŽåïŒã«ãå
¬ç¥ã®é«åæåæ¬ äŒŒçæææ³ãçšããŠãã³ãŒããã BaseCodecéšã®åŸ©å·ã£åããŒã¿ã«åºã¥ãé«åæå ããŒã¿ãæ¬äŒŒçã«çæããäžèš BaseCodecéšã®åŸ©å·åããŒã¿ã«çæããé«åæå ããŒã¿ãå ããŠåºåãããã®ã§ããã  [0036] As in the above flow, in the present embodiment, when the header information of the SBR header cannot be acquired (in other words, when a calculation table is not created), a known high-frequency component simulation generation method is used. The high frequency component data is generated in a pseudo manner based on the decoded data of the BaseCodec part decoded in this way, and the generated high frequency component data is added to the decoded data of the BaseCodec part for output.
[0037] 以äžèª¬æããããã«ãæ¬å®æœåœ¢æ
ã«ãããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ã¯ãã㬠ãŒã åãããè€æ°ã®ãã¬ãŒã åãããªã笊å·ã£ãããŒã¿ã§ãã£ãŠãè€æ°ã®ãã¬ãŒã å㯠ãåºæºãã¬ãŒã SFåã³éåºæºãã¬ãŒã IFãããªãè€æ°ã®ãã¬ãŒã ãå«ã¿ãéåºæºã ã¬ãŒã IFããé³å£°æ
å ±ã笊å·ã£åããéåºæºç¬¬ 1ããŒã¿ïŒãã®äŸã§ã¯ BaseCodec)ãšã ãã®éåºæºç¬¬ 1ããŒã¿ BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊 å·åããéåºæºç¬¬ 2ããŒã¿ïŒãã®äŸã§ã¯ SBR)ãšãå«ã¿ãåºæºãã¬ãŒã SFããé³å£°æ
å ±ã笊å·åããåºæºç¬¬ 1ããŒã¿ïŒãã®äŸã§ã¯ BaseCodec)ãšããã®åºæºç¬¬ 1ããŒã¿ Ba seCodecã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·ã£åããŠæ§æãããéåº æºç¬¬ 2ããŒã¿ SBRã®æŒç®åŠçãè¡ãããã®ãããæ
å ±ãåããåŠçãããïŒãã®äŸã§
㯠SBRãããïŒãæããåºæºç¬¬ 2ããŒã¿ïŒãã®äŸã§ã¯ SBR)ãšãå«ããã¹ããªãŒã ãåŠ çããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ã§ãã£ãŠãåºæºãã¬ãŒã SFã®åºæºç¬¬ 2ããŒã¿ SBRã«åããããåŠçãããã®ãããæ
å ±ãååŸãããããæ
å ±ååŸæ段ïŒãã®äŸ ã§ã¯ BaseCodecDecodeéš 11)ãšãåºæºãã¬ãŒã SFåã¯éåºæºãã¬ãŒã IFã®åºæº 第 1ããŒã¿ BaseCodecåã¯éåºæºç¬¬ 1ããŒã¿ BaseCodecã埩å·ã£åãã第 1埩å·ã£å ããŒã¿ïŒãã®äŸã§ã¯ãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãçæãã第 1埩å·åæ段ïŒã ã®äŸã§ã¯ BaseCodecDecodeéš 11)ãšãåºæºãã¬ãŒã SFåã¯éåºæºãã¬ãŒã IFã® åºæºç¬¬ 2ããŒã¿ SBRåã¯éåºæºç¬¬ 2ããŒã¿ SBRãããããæ
å ±ååŸæ段 11ã§ååŸ ãããããæ
å ±ãçšããŠåŸ©å·ã£åãã第 2è€å·åããŒã¿ïŒãã®äŸã§ã¯ãã³ãŒãåŸã® SBR ããŒã¿ïŒãçæãã第 2埩å·åæ段ïŒãã®äŸã§ã¯ SBRDecodeéš 14)ãšããããæ
å ± ååŸæ段 11ã§ãããæ
å ±ãååŸã§ããªå ãå Žåã«ã第 1埩å·ã£åæ段 11ã§è€å·å ããã第 1è€å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒã«åºã¥ããåœè©²ç¬¬ 1åŸ©å· åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãããé«ã¬ãåç垯åãæããé«åæå ããŒã¿ãæ¬äŒŒçã«çæããé«åæåæ¬äŒŒçææ段 (ãã®äŸã§ã¯é«åæåæ¬äŒŒçæ åŠçéš 13)ãšã第 1埩å·ã£åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãšé«åæåã㌠ã¿ã䜵ããŠåºåããããã®åºåå¶åŸ¡æ段ïŒãã®äŸã§ã¯ã¹ã£ãã 15)ãšãæããããšãç¹ åŸŽãšããã [0037] As described above, the digital audio data processing device 1 according to the present embodiment is encoded data composed of a plurality of framed frame sequences, and the plurality of frame sequences include the reference frame SF and the non-frame data. The reference frame IF includes multiple frames, and the non-reference frame IF expands the playback band of the non-reference first data (BaseCodec in this example) that encodes audio information and the non-reference first data BaseCodec. Non-reference second data (in this example, SBR) that encodes the bandwidth expansion information for encoding, and the reference frame SF is the reference first data (in this example, BaseCodec) that encodes the voice information, A processing header (this header, which includes header information for performing arithmetic processing of the non-standard second data SBR, is configured by encoding the band expansion information for expanding the reproduction band of the reference first data BaseCodec. In the example Is a digital audio data processing device 1 for processing a stream including reference second data (in this example, SBR) having SBR headers, and is a processing header provided in reference second data SBR of reference frame SF Header information acquisition means (BaseCodecDecode section 11 in this example) for acquiring the header information of the first frame and the first reference data BaseCodec or the first non-reference first data BaseCodec of the reference frame SF or the non-reference frame IF. The first decoding means (BaseCodecDecode unit 11 in this example) that generates data (BaseCodec data after decoding) in this example, and the reference second data SBR or non-reference data of the reference frame SF or non-reference frame IF 2 The second decoding means for decoding the data SBR using the header information acquired by the header information acquisition means 11 and generating the second decrypted data (in this example, the decoded SBR data) In this example, if the header information cannot be acquired by the SBRDecode unit 14) and the header information acquisition means 11, the first decoded data (BaseCodec after decoding) decoded by the first decoding means 11 High-frequency component pseudo-generation means (in this example, a high-frequency component) that artificially generates high-frequency component data having a higher reproduction and reproduction band than the first decoded data (decoded BaseCodec data). A pseudo-generation processing unit 13) and output control means (switch 15 in this example) for outputting the first decoded key data (BaseCodec data after decoding) and high-frequency component data together. It is a sign.
[0038] æ¬å®æœåœ¢æ
ã«ãããŠã¯ãã¹ããªãŒã ã«åããããè€æ°ã®ãã¬ãŒã ã®ããããå åº æºç¬¬ 1ããŒã¿ BaseCodecåã³åºæºç¬¬ 2ããŒã¿ SBRãå«ãåºæºãã¬ãŒã SFãšãéåº æºç¬¬ 1ããŒã¿ BaseCodecåã³éåºæºç¬¬ 2ããŒã¿ SBRãå«ãéåºæºãã¬ãŒã IFãšã ãæ§æãããŠããããã®ãã¡ãåºæºç¬¬ 1ããŒã¿ BaseCodecåã³éåºæºç¬¬ 1ããŒã¿ Bas eCodecã第 1埩å·ã£åæ段 11ã§åŸ©å·åãããŠç¬¬ 1埩å·ã£åããŒã¿ïŒãã³ãŒãåŸã® Base CodecããŒã¿ïŒãçæãããäžæ¹ãåºæºç¬¬ 2ããŒã¿ SBRåã³éåºæºç¬¬ 2ããŒã¿ SBR ã«ã€ããŠã¯ããããæ
å ±ååŸæ段 11ã§ååŸãããããæ
å ±ãçšããŠç¬¬ 2è€å·åæ段 14ã§åŸ©å·åããã第 2è€å·åããŒã¿ïŒãã³ãŒãåŸã® SBRããŒã¿ïŒãçæãããã [0038] In the present embodiment, the reference frame SF including the force standard first data BaseCodec and the reference second data SBR of each of the plurality of frames provided in the stream, and the non-standard first data BaseCodec and the non-standard second data. It is composed of non-reference frame IF including data SBR. Among them, the reference first data BaseCodec and the non-reference first data BaseCodec are decoded by the first decoding means 11 to generate the first decoded data (Base Codec data after decoding), while the reference first data 2 The data SBR and the non-reference second data SBR are decrypted by the second decryption means 14 using the header information obtained by the header information obtaining means 11, and the second decrypted data ( SBR data after decoding) ) Is generated.
[0039] ãã®ãšããäŸãã°ã¹ããªãŒã ã®äžéšãæ¶å€±ããïŒ=ãšã©ãŒãçºçããïŒçã«ãããããã æ
å ±ååŸæ段 11ã§ãããæ
å ±ãååŸã§ããªãã£ããšãã«ã¯ã第 1è€å·åæ段 11ã§åŸ© å·åããã第 1è€å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒã«åºã¥ããé«åæå
æ¬äŒŒçææ段 13ã«ããé«åæåããŒã¿ãæ¬äŒŒçã«çæããããããŠãåºåå¶åŸ¡æ段 15ã«ããã第 1埩å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãäžèšçæãããé« åæåããŒã¿ãšäœµããŠåºåããããããã«ããããšã©ãŒçºçåŸã«æ¬¡ã«ãããæ
å ±ãå åŸã§ãããŸã§ç¡é³ç¶æ
ã«ããã第 1è€å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒ ããåºåããªãå Žåã«æ¯ã¹ãç¡é³ç¶æ
ãççž®ã§ãããšå
±ã«ãé«åæåãåºåããããš ã«ããèŽæäžã®éåæãäœæžããããšãã§ããã [0039] At this time, when header information cannot be acquired by the header information acquisition unit 11 due to, for example, a part of the stream disappeared (= an error has occurred), the first decoding unit 11 performs the decoding. Based on the first decoded data (BaseCodec data after decoding) The pseudo-generation means 13 generates pseudo high frequency component data. Then, the output control means 15 outputs the first decoded data (decoded BaseCodec data) together with the generated high frequency component data. As a result, the silence state can be shortened and the high-frequency component can be reduced compared to the case where the silence state is obtained until the next header information can be obtained after the error occurs or only the first decoded data (decoded BaseCodec data) is output. By outputting, it is possible to reduce the sense of incongruity in hearing.
ãŸããæ¬å®æœåœ¢æ
ã®ããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ãçšããããžã¿ã«é³å£°ããŒã¿ åŠçæ¹æ³ã«ãã¬ããŠã¯ããã¬ãŒã åãããè€æ°ã®ãã¬ãŒã åãããªã笊å·åããŒã¿ã§ ãã£ãŠãè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã SFåã³éåºæºãã¬ãŒã IFãããªãè€æ° ã®ãã¬ãŒã ãå«ã¿ãéåºæºãã¬ãŒã IFããé³å£°æ
å ±ã笊å·åããéåºæºç¬¬ 1ããŒã¿ B aseCodecãšããã®éåºæºç¬¬ 1ããŒã¿ BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯å æ¡å€§æ
å ±ã笊å·åããéåºæºç¬¬ 2ããŒã¿ SBRãšãå«ã¿ãåºæºãã¬ãŒã SFããé³å£°æ
å ±ã笊å·ã£åããåºæºç¬¬ 1ããŒã¿ BaseCodecãšããã®åºæºç¬¬ 1ããŒã¿ BaseCodecã®å ç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·åããŠæ§æãããéåºæºç¬¬ 2ããŒã¿ S BRã®æŒç®åŠçãè¡ãããã®ãããæ
å ±ãåããåŠçãããïŒSBRãããïŒãæãã åºæºç¬¬ 2ããŒã¿ SBRãšãå«ããã¹ããªãŒã ãåŠçããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã§ ãã£ãŠãåºæºãã¬ãŒã SFã®åºæºç¬¬ 2ããŒã¿ SBRã«åããããåŠçãããã®ãããæ
å ±ãååŸãããããæ
å ±ååŸæé ïŒãã®äŸã§ã¯å³ 5ã®ã¹ããã S 15)ãšãåºæºãã¬ãŒã SFåã¯éåºæºãã¬ãŒã IFã®åºæºç¬¬ 1ããŒã¿ BaseCodecåã¯éåºæºç¬¬ 1ããŒã¿ Bas eCodecã埩å·åãã第 1埩å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãçæã ã第 1埩å·åæé ïŒãã®äŸã§ã¯ BaseCodecDecodeéš 11ã«ãã BaseCodecãã³ãŒã æé ïŒãšãåºæºãã¬ãŒã SFåã¯éåºæºãã¬ãŒã IFã®åºæºç¬¬ 2ããŒã¿ SBRåã¯éåºæº 第 2ããŒã¿ SBRãããããæ
å ±ååŸæé S 15ã§ååŸãããããæ
å ±ãçšããŠè€å·å ãã第 2è€å·åããŒã¿ïŒãã³ãŒãåŸã® SBRããŒã¿ïŒãçæãã第 2埩å·åæé (ãã®äŸ ã§ã¯å³ 5ã®ã¹ããã S35)ãšããããæ
å ±ååŸæé S15ã§ãããæ
å ±ãååŸã§ããªã ã€ãå Žåã«ã第 1埩å·åæé ïŒBaseCodecDecodeéš 11ã«ãã BaseCodecãã³ãŒã æé )ã§åŸ©å·åãã第 1è€å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒã«åºã¥ãã é«åæåããŒã¿ãçæãã第 1è€å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãšé«
åæåããŒã¿ã䜵ããŠåºåããããšãç¹åŸŽãšããã Further, the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data consisting of a plurality of framed frames, and the plurality of frame sequences are It includes multiple frames consisting of a reference frame SF and a non-reference frame IF, and the non-reference frame IF expands the playback bandwidth of the non-reference first data BaseCodec that encodes audio information and the non-reference first data BaseCodec. The reference frame SF includes the reference first data BaseCodec encoded voice information and the reproduction band of the reference first data BaseCodec. Reference second data SB, which is configured by encoding band expansion information for expansion and has a processing header (SBR header) with header information for performing calculation processing of non-reference second data SBR A digital audio data processing method for processing a stream including R, and a header information acquisition procedure for acquiring the header information of the processing header provided in the reference second data SBR of the reference frame SF (in this example, FIG. 5). Step S15) and the reference first data BaseCodec or non-reference first data BaseCodec of the reference frame SF or non-reference frame IF are decoded to generate first decoded data (decoded BaseCodec data). 1 Decoding procedure (BaseCodec decoding procedure by BaseCodecDecode section 11 in this example) and reference second data SBR or non-reference second data SBR of reference frame SF or non-reference frame IF were acquired in header information acquisition procedure S 15 The second decoding procedure (in this example, step S35 in FIG. 5) for decoding the header information to generate the second decoded data (decoded SBR data), and the header information acquisition procedure If header information cannot be obtained in order S15, the high-frequency component is based on the first decoded data (BaseCodec data after decoding) decoded in the first decoding procedure (BaseCodec decoding procedure by BaseCodecDecode section 11). Data is generated and the first decrypted data (BaseCodec data after decoding) and high The band component data is also output together.
[0041] æ¬å®æœåœ¢æ
ã®ããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã«ãã¬ããŠã¯ããããæ
å ±ååŸæé S 1  [0041] In the digital audio data processing method of the present embodiment, the header information acquisition procedure S 1
5ã§ãããæ
å ±ãååŸã§ããªãã£ãå Žåã«ã埩å·åããã第 1埩å·åããŒã¿ïŒãã³äž ãåŸã® BaseCodecããŒã¿ïŒã«åºã¥ããé«åæåããŒã¿ãçæãã第 1è€å·åããŒã¿ïŒ ãã³ãŒãåŸã® BaseCodecããŒã¿ïŒãšçæãããé«åæåããŒã¿ãšã䜵ããŠåºåãã ããããã«ããããšã©ãŒçºçåŸã«æ¬¡ã«ãããæ
å ±ãååŸã§ãããŸã§ç¡é³ç¶æ
ã«ããã第 1è€å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒããåºåããªãå Žåã«æ¯ã¹ãç¡é³ ç¶æ
ãççž®ã§ãããšå
±ã«ãé«åæåãåºåããããšã«ããèŽæäžã®éåæãäœæžãã ããšå Sã§ããã  If header information could not be obtained in step 5, high frequency component data is generated based on the decoded first decoded data (BaseCodec data after decoding), and the first decoded data (after decoding) BaseCodec data) and the generated high-frequency component data are output together. As a result, the silence state can be shortened and the high frequency component can be output compared to the case where the silence state is obtained until the next header information can be acquired after an error occurs or only the first decoded data (decoded BaseCodec data) is output. By doing this, you can reduce the sense of incongruity in hearing.
[0042] äžèšå®æœåœ¢æ
ã«ãããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ã«ãããŠã¯ãã¹ããªãŒã ã®äž éšãæ¶å€±ãããã©ãããå€å®ãããšã©ãŒå€å®æ段ïŒãã®äŸã§ã¯ BaseCodecDecode éš 11)ãæããé«åæåæ¬äŒŒçææ段 13ã¯ããšã©ãŒå€å®æ段 11ã§éåºæºãã¬ãŒã I Fãæ¶å€±ãããšå€å®ãããå Žåã«ãé«åæåããŒã¿ãçæããããšãç¹åŸŽãšããã  [0042] The digital audio data processing apparatus 1 in the above embodiment has error determination means (BaseCodecDecode section 11 in this example) for determining whether or not a part of the stream has been lost. Is characterized by generating high-frequency component data when it is determined by the error determination means 11 that the non-reference frame IF has disappeared.
[0043] æ¬å®æœåœ¢æ
ã«ãããŠã¯ããšã©ãŒå€å®æ段 11ã§éåºæºãã¬ãŒã IFãæ¶å€±ãããšå€å® ããå Žåã«ãé«åæåæ¬äŒŒçææ段 13ã«ããé«åæåããŒã¿ãçæãããããã«ã ããéåºæºãã¬ãŒã IFãæ¶å€±ãããšã©ãŒãçºçããåŸã«æ¬¡ã«ãããæ
å ±ãååŸã§ãã ãŸã§ç¡é³ç¶æ
ã«ããã第 1埩å·åããŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒããåºå ããªãå Žåã«æ¯ã¹ãç¡é³ç¶æ
ãççž®ã§ãããšå
±ã«ãé«åæåãåºåããããšã«ããèŽ æäžã®éåæãäœæžããããšãã§ããã  In this embodiment, when the error determination means 11 determines that the non-reference frame IF has disappeared, the high frequency component pseudo generation means 13 generates high frequency component data. As a result, silence occurs compared to when silence occurs until the next header information can be acquired after an error in which the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output. And a sense of discomfort in hearing can be reduced by outputting a high frequency component.
[0044] ãŸãæ¬å®æœåœ¢æ
ã«ãããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã«ãããŠã¯ãã¹ããªãŒã ã®äž éšãæ¶å€±ãããã©ãããå€å®ãããšã©ãŒå€å®æé (ãã®äŸã§ã¯å³ 5ã®ã¹ããã S 10)ã æãããã®ãšã©ãŒå€å®æé S 10ã§éåºæºãã¬ãŒã IFãæ¶å€±ãããšå€å®ãããå Žåã«ã é«åæåããŒã¿ãçæããããšãç¹åŸŽãšããã  [0044] The digital audio data processing method according to the present embodiment further includes an error determination procedure (in this example, step S10 in Fig. 5) for determining whether or not a part of the stream has been lost. High frequency component data is generated when it is determined in S10 that the non-reference frame IF has disappeared.
[0045] æ¬å®æœåœ¢æ
ã«ãããŠã¯ãéåºæºãã¬ãŒã IFãæ¶å€±ãããšå€å®ããå Žåã«ãé«åæ åããŒã¿ãçæãããããã«ãããéåºæºãã¬ãŒã IFãæ¶å€±ãããšã©ãŒãçºçããåŸ ã«æ¬¡ã«ãããæ
å ±ãååŸã§ãããŸã§ç¡é³ç¶æ
ã«ããã第 1è€å·åããŒã¿ (ãã³ãŒãåŸ ã® BaseCodecããŒã¿ïŒããåºåããªãå Žåã«æ¯ã¹ãç¡é³ç¶æ
ãççž®ã§ãããšå
±ã«ã é«åæåãåºåããããšã«ããèŽæäžã®éåæãäœæžããããšãã§ããã
[0046] äžèšå®æœåœ¢æ
ã«ãããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ã«ãããŠã¯ããšã©ãŒå€å®æ 段 11ã§éåºæºãã¬ãŒã IFãæ¶å€±ãããšå€å®ãããå Žåãåœè©²æ¶å€±ããéåºæºãã¬ãŒ ã IFã«å¯Ÿå¿ããæå®ã®ãšã©ãŒçšããŒã¿ïŒãã®äŸã§ã¯ muteããŒã¿ïŒãçæãããšã©ãŒ çšããŒã¿çææ段ïŒãã®äŸã§ã¯ãšã©ãŒåŠçéš 12)ãæããããšãç¹åŸŽãšããã In the present embodiment, high frequency component data is generated when it is determined that the non-reference frame IF has disappeared. As a result, silence occurs compared to when silence occurs until the next header information can be obtained after an error that the non-reference frame IF is lost, or when only the first decoded data (decoded BaseCodec data) is output. Can be shortened, and discomfort in hearing can be reduced by outputting a high frequency component. In the digital audio data processing device 1 in the above embodiment, when it is determined by the error determination unit 11 that the non-reference frame IF has been lost, a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having error data generation means (in this example, error processing unit 12) for generating data (in this example, mute data).
[0047] ããã«ããããšã©ãŒçºçæã«ãããŠå¯Ÿå¿ããæå®ã®ãšã©ãŒçšããŒã¿ãåºåããããš ãã§ããã  [0047] Thereby, it is possible to output predetermined error data corresponding to when an error occurs.
[0048] ãŸããäžèšå®æœåœ¢æ
ã«ãããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã«ãããŠã¯ããšã©ãŒå€ å®æé S10ã§éåºæºãã¬ãŒã IFãæ¶å€±ãããšå€å®ãããå Žåãåœè©²æ¶å€±ããéåºæº ãã¬ãŒã IFã«å¯Ÿå¿ããæå®ã®ãšã©ãŒçšããŒã¿ïŒãã®äŸã§ã¯ muteããŒã¿ïŒãçæãã ãšã©ãŒçšããŒã¿çææé (ãã®äŸã§ã¯ã¹ããã S60)ãæããããšãç¹åŸŽãšããã  [0048] Also, in the digital audio data processing method in the above embodiment, when it is determined that the non-reference frame IF has been lost in the error determination procedure S10, a predetermined error corresponding to the lost non-reference frame IF is used. It is characterized by having an error data generation procedure (in this example, step S60) for generating data (in this example, mute data).
[0049] ããã«ããããšã©ãŒçºçæã«ãããŠå¯Ÿå¿ããæå®ã®ãšã©ãŒçšããŒã¿ãåºåããããš ãã§ããã  [0049] Thereby, it is possible to output predetermined error data corresponding to when an error occurs.
[0050] å³ 6ãå³ 8ã¯ãããããã以äžåæããæ¬å®æœåœ¢æ
ã®åå¹æããå
·äœçã«èª¬æãã ããã®èª¬æå³ã§ãããå³äžã暪軞ã«ã¯æé軞ããšããå³ 6ã¯ãå
¥åãããã¹ããªãŒã ã« ãããåãã¬ãŒã ã®æåã®äŸãè¡šããå³ 7ã¯ãå³ 6ã«å¯Ÿå¿ããæ¬å®æœåœ¢æ
ã®ããžã¿ã㬠é³å£°ããŒã¿åŠçè£
眮 1ããã®åºåæåãè¡šããå³ 8ã¯ããšã©ãŒçºçåŸã«ç¬¬ 1埩å·åã ãŒã¿ïŒãã³ãŒãåŸã® BaseCodecããŒã¿ïŒããåºåããªãæ¯èŒäŸã«ããåºåæåãè¡šã ãŠããã  FIGS. 6 to 8 are explanatory diagrams for specifically explaining the effects of the present embodiment listed above. In the figure, the horizontal axis represents the time axis, FIG. 6 shows an example of the behavior of each frame in the input stream, and FIG. 7 shows the digital audio data processing apparatus 1 of the present embodiment corresponding to FIG. Fig. 8 shows the output behavior of the comparative example in which only the first decoded data (BaseCodec data after decoding) is output after the error occurs.
[0051] å³ 7åã³å³ 8ã«ãããŠã SBRãããã®ããåºæºãã¬ãŒã SFãïŒãšã©ãŒçºçãªãïŒåä¿¡ ã§ããŠããå Žåãåã³ããã®åŸã® SBRãããã®ãªãéåºæºãã¬ãŒã IFããã®åŸãšã© äžçºçãªãåä¿¡ã§ããŠããå Žåã¯ãæ¬å®æœåœ¢æ
åã³æ¯èŒäŸãšãåæ§ã§ããã BaseCo decéšãš SBRéšãšã®äž¡æ¹ããã³ãŒãã§ããããã第 1埩å·åãã³ãŒãããŒã¿åã³ç¬¬ 2 è€å·åããŒã¿ (ãSBRãã³ãŒãããšè¡šã)ã䜵ããŠåºåããããšãã§ããã  [0051] In FIG. 7 and FIG. 8, when the reference frame SF with the SBR header can be received (without error), and the subsequent non-reference frame IF without the SBR header can be received without error. If this is the case, this is the same as in this embodiment and the comparative example, and both the BaseCo dec part and the SBR part can be decoded, and the first decoded decoded data and the second decoded data (referred to as âSBR decodeâ). Can be output together.
[0052] ãŸãããããã¬ãŒã ã§ãšã©ãŒãçºçããïŒãã¬ãŒã ã®å°ãªããšãäžéšãæ¶å€±ããïŒå Žå ã«ã€ããŠããæ¬å®æœåœ¢æ
ãšæ¯èŒäŸãšã§å·®ç°ã¯ãªãæå®ã®ãšã©ãŒçšããŒã¿ïŒãã®äŸã§ ã¯ç¡é³ç¶æ
ã§ãã mute)ãåºåããããšãã§ããã  [0052] In addition, even when an error occurs in a certain frame (at least a part of the frame is lost), there is predetermined error data (in this example, in a silent state) that does not differ between this embodiment and the comparative example. A certain mute) can be output.
[0053] äžæ¹ãäžèšãšã©ãŒãçºçãããã¬ãŒã ã®åŸïŒæ¬¡ã« SBRããããååŸã§ãããã¬ãŒã ã®åãŸã§ïŒã®ãã¬ãŒã ã«ã€ããŠã¯ãäžèšæ¯èŒäŸãšæ¬å®æœåœ¢æ
ãšã§ç°ãªããããªãã¡ã
å³ 8ã«ç€ºãããã«ãäžèšæ¯èŒäŸã§ã¯ããšã©ãŒãçºçãããããã®åŸæ°ãã« SBRããã ã®ãããæ
å ±ãååŸãããŸã§ã SBRéšã®ãã³ãŒãåŠçãè¡ãããšãã§ããã BaseCod ecéšã®ã¿ã®äœåã®åŸ©å·åããŒã¿ãåºåããã On the other hand, the frame after the frame in which the error has occurred (until the frame before the next SBR header can be acquired) differs between the comparative example and the present embodiment. That is, As shown in Fig. 8, in the above comparative example, if an error occurs, the SBR part cannot be decoded until the header information of the SBR header is newly acquired thereafter, and the low frequency band of only the BaseCodec part cannot be obtained. Output decrypted data.
[0054] äžæ¹ãå³ 7ã«ç€ºãããã«ãæ¬å®æœåœ¢æ
ã§ã¯ã BaseCodecéšã®åŸ©å·ã£åããŒã¿ã«åºã¥ ããåœè©² BaseCodecããŒã¿ãããé«ãåç垯åãæããé«åæåããŒã¿ãæ¬äŒŒçã« çæãã埩å·åããã BaseCodecããŒã¿ãšæ¬äŒŒçã«çæããé«åæåããŒã¿ãšã䜵 ããŠåºåããã®ã§ãèŽæäžã®éåæãäœæžããããšãã§ããã  On the other hand, as shown in FIG. 7, in the present embodiment, on the basis of the decoded code data of the BaseCodec part, high frequency component data having a reproduction band higher than the BaseCodec data is generated in a pseudo manner, Since the decoded BaseCodec data and the pseudo high frequency component data are output together, it is possible to reduce the sense of discomfort in the sense of hearing.
[0055] ãªãã以äžã¯ãå°äžæ³¢ããžã¿ã«æŸé (äŸãã°ãããã lSegmentãå©çšããæŸé)㫠察å¿ããæºåž¯é»è©±æ©ã«é©çšããå ŽåãäŸã«ãšã£ãŠèª¬æããããããã«éããããä»ã® æºåž¯ç«¯æ«ãã¢ãã€ã«æ©åšçã«é©çšãå¯èœã§ãããç¹ã«ããšã©ãŒãå€çºããè»èŒç«¯æ« ã«é©çšããå Žåã«å¹æçã§ããã  [0055] Note that the above has been described by taking as an example a case where the present invention is applied to a mobile phone that supports terrestrial digital broadcasting (for example, broadcasting using so-called lSegment), but is not limited thereto, and other mobile terminals, mopile devices, etc. It can be applied to. This is particularly effective when applied to in-vehicle terminals where errors frequently occur.
[0056] äžèšå®æœåœ¢æ
ã«ãããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ã¯ããã¬ãŒã åãããè€æ° ã®ãã¬ãŒã åãããªã笊å·ã£åããŒã¿ã§ãã£ãŠãè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã SF åã³éåºæºãã¬ãŒã IFãããªãè€æ°ã®ãã¬ãŒã ãå«ã¿ãéåºæºãã¬ãŒã IFããé³å£° æ
å ±ã笊å·åãã BaseCodecãšããã® BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯ åæ¡å€§æ
å ±ã笊å·ã£åãã SBRãšãå«ã¿ãåºæºãã¬ãŒã SFããé³å£°æ
å ±ã笊å·åãã BaseCodecãšããã® BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ãç¬Šå· åããŠæ§æããã SBRã®æŒç®åŠçãè¡ãããã®ãããæ
å ±ãåãã SBRããããæ ãã SBRãšãå«ããã¹ããªãŒã ãåŠçããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ã§ãã£ãŠã åºæºãã¬ãŒã SFã® SBRã«åããããåŠçãããã®ãããæ
å ±ãååŸãã BaseCod ecDecodeéš 11ãšãåºæºãã¬ãŒã SFåã¯éåºæºãã¬ãŒã IFã® BaseCodecã埩å·å ãããã³ãŒãåŸã® BaseCodecããŒã¿ãçæãã BaseCodecDecodeéš 11ãšãåºæºã ã¬ãŒã SFåã¯éåºæºãã¬ãŒã IFã® SBRãã BaseCodecDecodeéš 11ã§ååŸãããž ããæ
å ±ãçšããŠåŸ©å·ã£åãããã³ãŒãåŸã® SBRããŒã¿ãçæãã SBRDecodeéš 14 ãšã BaseCodecDecodeéš 11ã§ãããæ
å ±ãååŸã§ããªãã£ãå Žåã«ã BaseCode cDecodeéš 11ã§åŸ©å·åããã BaseCodecã«åºã¥ããåœè©² BaseCodecããŒã¿ããã é«ãåç垯åãæããé«åæåããŒã¿ãæ¬äŒŒçã«çæããé«åæåæ¬äŒŒçæåŠ çéš 13ãšã BaseCodecãšé«åæåããŒã¿ã䜵ããŠåºåããããã®ã¹ã£ãã 15ãšãæ
ããã [0056] The digital audio data processing device 1 according to the above embodiment is code data including a plurality of framed frames, and the plurality of frame sequences are a plurality of frames including a reference frame SF and a non-reference frame IF. The non-reference frame IF includes the BaseCodec that encodes the audio information and the SBR that encodes the band expansion information for expanding the playback band of this BaseCodec, and the reference frame SF Includes BaseCodec that encodes information, and SBR that has SBR header that is configured by encoding band expansion information for expanding the playback band of BaseCodec, and includes header information for performing SBR calculation processing A digital audio data processing device 1 for processing a stream, and a BaseCodeDecode unit 11 for obtaining header information of a processing header provided in an SBR of a reference frame SF; BaseCodecDecode unit 11 that decodes BaseCodec of quasi-frame SF or non-reference frame IF and generates decoded BaseCodec data, and header obtained by BaseCodecDecode unit 11 for SBR of base frame SF or non-reference frame IF When the header information cannot be obtained by the SBRDecode unit 14 that generates the SBR data after decoding using the information, and the BaseCodecDecode unit 11, the BaseCodecDecode unit 11 A high-frequency component pseudo-generation processing unit 13 that artificially generates high-frequency component data having a reproduction band higher than that of BaseCodec data, and a switch 15 for outputting BaseCodec and high-frequency component data together are provided. To do.
[0057] æ¬å®æœåœ¢æ
ã«ãããŠã¯ãã¹ããªãŒã ã«åããããè€æ°ã®ãã¬ãŒã ã®ããããå Ba seCodecåã³ SBRããããæãã SBRãå«ãåºæºãã¬ãŒã SFãšã BaseCodecåã³ SBRãå«ãéåºæºãã¬ãŒã IFãšããæ§æãããŠããããã®ãã¡ã BaseCodecã Base CodecDecodeéš 11ã§åŸ©å·åãã㊠BaseCodecããŒã¿ãçæãããäžæ¹ã SBRã« ã€ããŠã¯ã BaseCodecDecodeéš 11ã§ååŸãããããæ
å ±ãçšã㊠SBRDecode éš 14ã§åŸ©å·åããã SBRããŒã¿ãçæãããã  [0057] In the present embodiment, the frame includes a reference frame SF including an SBR having a base codec and an SBR header, and a non-reference frame IF including a base codec and an SBR, respectively. Among them, BaseCodec is decoded by Base CodecDecode unit 11 to generate BaseCodec data, while SBR is decoded by SBRDecode unit 14 using the header information acquired by BaseCodecDecode unit 11 to generate SBR data .
[0058] ãã®ãšããäŸãã°ã¹ããªãŒã ã®äžéšãæ¶å€±ããïŒ=ãšã©ãŒãçºçããïŒçã«ããã Base CodecDecodeéš 11ã§ãããæ
å ±ãååŸã§ããªãã£ããšãã«ã¯ã BaseCodecDecod eéš 11ã§åŸ©å·åããã BaseCodecããŒã¿ã«åºã¥ããé«åæåæ¬äŒŒçæåŠçéš 13 ã«ããé«åæåããŒã¿ãæ¬äŒŒçã«çæããããããŠãã¹ã£ãã 15ã«ããã BaseCodec ããŒã¿ãäžèšçæãããé«åæåããŒã¿ãšäœµããŠåºåããããããã«ããããšã©ãŒçº çåŸã«æ¬¡ã«ãããæ
å ±ãååŸã§ãããŸã§ç¡é³ç¶æ
ã«ããã BaseCodecããåºåã㪠ãå Žåã«æ¯ã¹ãç¡é³ç¶æ
ãççž®ã§ãããšå
±ã«ãé«åæåãåºåããããšã«ããèŽæäž ã®éåæãäœæžããããšãã§ããã  [0058] At this time, for example, when header information cannot be acquired by the Base CodecDecode unit 11 due to loss of part of the stream (= an error has occurred), the BaseCodecDecode unit 11 decodes the BaseCodec data Based on this, the high frequency component pseudo generation processing unit 13 generates high frequency component data in a pseudo manner. Then, the switch 15 outputs BaseCodec data together with the generated high frequency component data. This makes it possible to reduce the silence and reduce the sense of incongruity by outputting high-frequency components, compared to the case where silence is maintained until the next header information can be acquired after an error occurs, and only BaseCodec is output. can do.
[0059] ãŸããæ¬å®æœåœ¢æ
ã®ããžã¿ã«é³å£°ããŒã¿åŠçè£
眮 1ãçšããããžã¿ã«é³å£°ããŒã¿ åŠçæ¹æ³ã«ãã¬ããŠã¯ããã¬ãŒã åãããè€æ°ã®ãã¬ãŒã åãããªã笊å·åããŒã¿ã§ ãã£ãŠãè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã SFåã³éåºæºãã¬ãŒã IFãããªãè€æ° ã®ãã¬ãŒã ãå«ã¿ãéåºæºãã¬ãŒã IFå é³å£°æ
å ±ã笊å·åãã BaseCodecãšãã ã® BaseCodecã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·åãã SBRãšãå« ã¿ãåºæºãã¬ãŒã SFå é³å£°æ
å ±ã笊å·åãã BaseCodecãšããã® BaseCodecã® åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·ã£ãããŠæ§æããã SBRã®æŒç®åŠç ãè¡ãããã®ãããæ
å ±ãåãã SBRããããæãã SBRãšãå«ããã¹ããªãŒã ãåŠ çããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã§ãã£ãŠãåºæºãã¬ãŒã SFã® SBRã«åããã ã SBRãããã®ãããæ
å ±ãååŸããã¹ããã S 15ãšãåºæºãã¬ãŒã SFåã¯éåºæº ãã¬ãŒã IFã® BaseCodecã埩å·ã£åãã BaseCodecããŒã¿ãçæãã BaseCodecD ecodeéš 11ã«ãã BaseCodecãã³ãŒãæé ãšãåºæºãã¬ãŒã SFåã¯éåºæºãã¬ãŒ ã IFã® SBRããã¹ããã S 15ã§ååŸãããããæ
å ±ãçšããŠåŸ©å·åãã SBRããŒã¿
ãçæããã¹ããã S35ãšãã¹ããã S 15ã§ãããæ
å ±ãååŸã§ããªãã£ãå Žåã«ã B aseCodecDecodeå
11ã«ãã BaseCodecãã³ãŒãæ IåŽã§åŸ©å·ã£åãã BaseCodec ããŒã¿ã«åºã¥ããåœè©² BaseCodecããŒã¿ãããé«ãåç垯åãæããé«åæåã ãŒã¿ãæ¬äŒŒçã«çæãã BaseCodecããŒã¿ãšé«åæåããŒã¿ã䜵ããŠåºåããã [0059] In addition, the digital audio data processing method using the digital audio data processing device 1 of the present embodiment is encoded data including a plurality of framed frames, and includes a plurality of frames. The column includes a plurality of frames composed of a reference frame SF and a non-reference frame IF. The non-reference frame IF force encodes base codec that encodes audio information and band expansion information for expanding the playback band of this base codec. This is composed of BaseCodec, which encodes the reference frame SF power audio information, and the band expansion information for expanding the playback band of this BaseCodec. A digital audio data processing method for processing a stream including an SBR having an SBR header with header information, the header of the SBR header provided in the SBR of the reference frame SF. Step S15 for obtaining information, and decoding the BaseCodec of the reference frame SF or non-reference frame IF, and generating the BaseCodec data The BaseCodec decoding procedure by the BaseCodecDecode unit 11 and the reference frame SF or non-reference frame IF SBR is decrypted using the header information obtained in step S15, and SBR data If the header information could not be acquired in step S35 and step S15, a higher playback bandwidth than that of the BaseCodec data is obtained based on the BaseCodec data decoded by BaseCodec decoding by the BaseCodecDecode å
11. The pseudo high frequency component data is generated and BaseCodec data and high frequency component data are output together.
[0060] æ¬å®æœåœ¢æ
ã®ããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã«ãããŠã¯ãã¹ããã S 15ã§ãããæ
å ±ãååŸã§ããªãã£ãå Žåã«ã埩å·åããã BaseCodecããŒã¿ã«åºã¥ããé«åæå ããŒã¿ãçæãã BaseCodecããŒã¿ãšçæãããé«åæåããŒã¿ãšã䜵ããŠåºåã ãããããã«ããããšã©ãŒçºçåŸã«æ¬¡ã«ãããæ
å ±ãååŸã§ãããŸã§ç¡é³ç¶æ
ã«ããã BaseCodecããŒã¿ããåºåããªãå Žåã«æ¯ã¹ãç¡é³ç¶æ
ãççž®ã§ãããšå
±ã«ãé«å æåãåºåããããšã«ããèŽæäžã®éåæãäœæžããããšãã§ããã [0060] In the digital audio data processing method of the present embodiment, when header information cannot be acquired in step S15, high frequency component data is generated based on the decoded BaseCodec data, and BaseCodec data and The generated high frequency component data is also output. This makes it possible to reduce the silence and reduce the sense of incongruity by outputting high-frequency components, compared to the case in which silence is maintained until the next header information can be acquired after an error occurs, and only BaseCodec data is output. Can do.
å³é¢ã®ç°¡åãªèª¬æ  Brief Description of Drawings
[0061] [å³ 1]æ¬çºæã®äžå®æœåœ¢æ
ã®æºåž¯é»è©±æ©ã®å
šäœå€èŠ³ãè¡šãæèŠå³ã§ããã  FIG. 1 is a perspective view showing the overall appearance of a mobile phone according to an embodiment of the present invention.
[å³ 2]å³ 1ã«ç€ºããæºåž¯é»è©±æ©ã®æ©èœçæ§æãè¡šãæ©èœãããã¯å³ã§ããã  2 is a functional block diagram showing a functional configuration of the mobile phone shown in FIG.
[å³ 3]ã¹ããªãŒã ã®ãã¡é³å£°ã«ä¿ããéšåãæãåºããŠæŠå¿µçã«è¡šãæš¡åŒå³ã§ããã  FIG. 3 is a schematic diagram conceptually showing a portion related to audio extracted from a stream.
[å³ 4]å³ 2ã«ç€ºããä¿¡å·åŠçéšã®ãã¡é³å£°ä¿¡å·ã®åçåŠçã«ä¿ããæ§æãè¡šãæ©èœ ãããã¯å³ã§ããã  4 is a functional block diagram showing a configuration related to audio signal reproduction processing in the signal processing section shown in FIG.
[å³ 5]BaseCodecDecodeéšãåãã¬ãŒã ããšã«å®è¡ããåŠçæé ãè¡šããããŒã ã€ãŒãã§ããã  FIG. 5 is a flowchart showing the processing procedure executed by the BaseCodecDecode part for each frame.
[å³ 6]æ¬çºæã®äžå®æœåœ¢æ
ã®åå¹æããå
·äœçã«èª¬æããããã®èª¬æå³ã§ããã  FIG. 6 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
[å³ 7]æ¬çºæã®äžå®æœåœ¢æ
ã®åå¹æããå
·äœçã«èª¬æããããã®èª¬æå³ã§ããã  FIG. 7 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention.
[å³ 8]æ¬çºæã®äžå®æœåœ¢æ
ã®åå¹æããå
·äœçã«èª¬æããããã®èª¬æå³ã§ããã 笊å·ã®èª¬æ  FIG. 8 is an explanatory diagram for specifically explaining each effect of the embodiment of the present invention. Explanation of symbols
[0062] 1 æºåž¯é»è©±æ© (ããžã¿ã«é³å£°ããŒã¿åŠçè£
çœ®ïŒ Â [0062] 1 Mobile phone (digital audio data processing device)
11 BaseCodecDecodeéšïŒãšã©ãŒå€å®æ段ããããæ
å ±ååŸæ段ã第 1 埩å·åæ段ã第 2埩å·åæ段ã解ææ段ãç
§åææ®µïŒ Â 11 BaseCodecDecode part (error determination means, header information acquisition means, first decoding means, second decoding means, analysis means, verification means)
12 ãšã©ãŒåŠçéšïŒãšã©ãŒçšããŒã¿çæææ®µïŒ Â 12 Error processing section (error data generation means)
13 é«åæåæ¬äŒŒçæåŠçéšïŒé«åæåæ¬äŒŒçæææ®µïŒ Â 13 High-frequency component pseudo-generation processing unit (high-frequency component pseudo-generation means)
15 ã¹ã£ããïŒåºåå¶åŸ¡æ段ïŒ
éåºæºãã¬ãŒã åºæºãã¬ãŒã
15 Switch (output control means) Non-reference frame Reference frame
Claims
[1] ãã¬ãŒã åãããè€æ°ã®ãã¬ãŒã åãããªã笊å·ã£åããŒã¿ã§ãã£ãŠã  [1] Code data consisting of a plurality of framed frames,
åèšè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã åã³éåºæºãã¬ãŒã ãå«ã¿ã  The plurality of frame sequences includes a reference frame and a non-reference frame;
åèšéåºæºãã¬ãŒã ããé³å£°æ
å ±ã笊å·åããéåºæºç¬¬ 1ããŒã¿ãšããã®éåºæºç¬¬ 1ããŒã¿ã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·åããéåºæºç¬¬ 2ããŒã¿ ãšãå«ã¿ã  The non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data;
åèšåºæºãã¬ãŒã ããé³å£°æ
å ±ã笊å·åããåºæºç¬¬ 1ããŒã¿ãšããã®åºæºç¬¬ 1ã㌠ã¿ã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·åããŠæ§æãããåèšéåºæº 第 2ããŒã¿ã®æŒç®åŠçãè¡ãããã®ãããæ
å ±ãåããåŠçããããæããåºæºç¬¬ 2 ããŒã¿ãšãå«ããã¹ããªãŒã ãåŠçããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã§ãã£ãŠã åèšåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿ã«åããããåèšåŠçãããã®åèšãžã ãæ
å ±ãååŸãããããæ
å ±ååŸæ段ãšã  The reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding a reproduction band of the reference first data, and calculating the non-reference second data. Digital audio data processing apparatus for processing a stream, including reference second data having a processing header having header information for performing the processing, wherein the processing header provided in the reference second data of the reference frame Header information acquisition means for acquiring the header information of
åèšåºæºãã¬ãŒã åã¯åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 1ããŒã¿åã¯åèšéåº æºç¬¬ 1ããŒã¿ã埩å·åãã第 1埩å·åããŒã¿ãçæãã第 1埩å·åæ段ãšã  First decoding means for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame and generating first decoded data;
åèšåºæºãã¬ãŒã åã¯åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿åã¯åèšéåº æºç¬¬ 2ããŒã¿ããåèšãããæ
å ±ååŸæ段ã§ååŸããåèšãããæ
å ±ãçšããŠåŸ©å· åãã第 2è€å·åããŒã¿ãçæãã第 2埩å·åæ段ãšã  The reference second data or the non-reference second data of the reference frame or the non-reference frame is decoded using the header information acquired by the header information acquisition means, and second decoded data is generated. A second decryption means;
åèšãããæ
å ±ååŸæ段ã§åèšãããæ
å ±ãååŸã§ããªãã£ãå Žåã«ãåèšç¬¬ 1 è€å·åæ段ã§åŸ©å·åããã第 1è€å·åããŒã¿ã«åºã¥ããåœè©²ç¬¬ 1埩å·åããŒã¿ãã ãé«ãåç垯åãæããé«åæåããŒã¿ãæ¬äŒŒçã«çæããé«åæåæ¬äŒŒçææ 段ãšã  When the header information cannot be obtained by the header information obtaining means, a higher reproduction band than the first decoded data is obtained based on the first decoded data decoded by the first decoding means. High-frequency component pseudo-generation means for generating pseudo-high-frequency component data,
åèšç¬¬ 1è€å·åããŒã¿ãšåèšé«åæåããŒã¿ã䜵ããŠåºåããããã®åºåå¶åŸ¡æ 段㚠 An output control means for outputting the first decoded data and the high-frequency component data together;
ãæããããšãç¹åŸŽãšããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã  A digital audio data processing apparatus comprising:
[2] è«æ±é
1èšèŒã®ããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã«ãããŠã [2] In the digital audio data processing device according to claim 1,
åèšã¹ããªãŒã ã®äžéšãæ¶å€±ãããã©ãããå€å®ãããšã©ãŒå€å®æ段ãæãã åèšé«åæåæ¬äŒŒçææ段ã¯ãåèšãšã©ãŒå€å®æ段ã§åèšéåºæºãã¬ãŒã ãæ¶ å€±ãããšå€å®ãããå Žåã«ãåèšé«åæåããŒã¿ãçæããããšãç¹åŸŽãšããããžã¿
ã«é³å£°ããŒã¿åŠçè£
眮ã An error determination unit that determines whether a part of the stream has been lost, and the high-frequency component pseudo-generation unit, when the error determination unit determines that the non-reference frame has been lost, A digital device characterized by generating high-frequency component data Le voice data processing device.
[3] è«æ±é
2èšèŒã®ããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã«ãããŠã  [3] The digital audio data processing device according to claim 2,
åèšãšã©ãŒå€å®æ段ã§åèšéåºæºãã¬ãŒã ãæ¶å€±ãããšå€å®ãããå Žåãåœè©²æ¶ 倱ããéåºæºãã¬ãŒã ã«å¯Ÿå¿ããæå®ã®ãšã©ãŒçšããŒã¿ãçæãããšã©ãŒçšããŒã¿ çææ段  When the error determination means determines that the non-reference frame has been lost, error data generation means for generating predetermined error data corresponding to the lost non-reference frame
ãæããããšãç¹åŸŽãšããããžã¿ã«é³å£°ããŒã¿åŠçè£
眮ã  A digital audio data processing apparatus comprising:
[4] ãã¬ãŒã åãããè€æ°ã®ãã¬ãŒã åãããªã笊å·åããŒã¿ã§ãã£ãŠã [4] Coded data composed of a plurality of framed frames,
åèšè€æ°ã®ãã¬ãŒã åã¯ãåºæºãã¬ãŒã åã³éåºæºãã¬ãŒã ãããªãè€æ°ã®ãã¬ãŒ ã ãå«ã¿ã  The plurality of frame sequences includes a plurality of frames including a reference frame and a non-reference frame,
åèšéåºæºãã¬ãŒã ããé³å£°æ
å ±ã笊å·åããéåºæºç¬¬ 1ããŒã¿ãšããã®éåºæºç¬¬ 1ããŒã¿ã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·åããéåºæºç¬¬ 2ããŒã¿ ãšãå«ã¿ã  The non-reference frame includes non-reference first data obtained by encoding audio information, and non-reference second data obtained by encoding band expansion information for expanding the reproduction band of the non-reference first data;
åèšåºæºãã¬ãŒã ããé³å£°æ
å ±ã笊å·åããåºæºç¬¬ 1ããŒã¿ãšããã®åºæºç¬¬ 1ã㌠ã¿ã®åç垯åãæ¡å€§ããããã®åž¯åæ¡å€§æ
å ±ã笊å·åããŠæ§æãããåèšéåºæº 第 2ããŒã¿ã®æŒç®åŠçãè¡ãããã®ãããæ
å ±ãåããåŠçããããæããåºæºç¬¬ 2 ããŒã¿ãšãå«ããã¹ããªãŒã ãåŠçããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã§ãã£ãŠã åèšåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿ã«åããããåèšåŠçãããã®åèšãžã ãæ
å ±ãååŸãããããæ
å ±ååŸæé ãšã  The reference frame is configured by encoding reference first data obtained by encoding audio information and band expansion information for expanding a reproduction band of the reference first data, and calculating the non-reference second data. A digital audio data processing method for processing a stream, including reference second data having a processing header with header information for performing the processing, wherein the processing header provided in the reference second data of the reference frame Header information acquisition procedure for acquiring the header information of
åèšåºæºãã¬ãŒã åã¯åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 1ããŒã¿åã¯åèšéåº æºç¬¬ 1ããŒã¿ã埩å·åãã第 1埩å·åããŒã¿ãçæãã第 1埩å·åæé ãšã  A first decoding procedure for decoding the reference first data or the non-reference first data of the reference frame or the non-reference frame to generate first decoded data;
åèšåºæºãã¬ãŒã åã¯åèšéåºæºãã¬ãŒã ã®åèšåºæºç¬¬ 2ããŒã¿åã¯åèšéåº æºç¬¬ 2ããŒã¿ããåèšãããæ
å ±ååŸæé ã§ååŸããåèšãããæ
å ±ãçšããŠåŸ©å· åãã第 2è€å·åããŒã¿ãçæãã第 2埩å·ã£ãæé ãšãæãã  The reference second data or the non-reference second data of the reference frame or the non-reference frame is decoded using the header information acquired in the header information acquisition procedure to generate second decoded data. A second decryption procedure,
åèšãããæ
å ±ååŸæé ã§åèšãããæ
å ±ãååŸã§ããªãã£ãå Žåã«ãåèšç¬¬ 1 è€å·åæé ã§åŸ©å·åãã第 1è€å·åããŒã¿ã«åºã¥ããåœè©²ç¬¬ 1埩å·åããŒã¿ããã é«ãåç垯åãæããé«åæåããŒã¿ãæ¬äŒŒçã«çæããåèšç¬¬ 1埩å·åããŒã¿ãš åèšé«åæåããŒã¿ã䜵ããŠåºåããããšãç¹åŸŽãšããããžã¿ã«é³å£°ããŒã¿åŠçæ¹ æ³ã
If the header information cannot be acquired by the header information acquisition procedure, the reproduction information has a higher reproduction band than the first decoded data based on the first decoded data decoded by the first decoding procedure. A digital audio data processing method, characterized in that high-frequency component data is generated in a pseudo manner, and the first decoded data and the high-frequency component data are output together.
[5] è«æ±é
4èšèŒã®ããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã«ãã¬ããŠã [5] The digital audio data processing method according to claim 4,
åèšã¹ããªãŒã ã®äžéšãæ¶å€±ãããã©ãããå€å®ãããšã©ãŒå€å®æé ãæãã ãã®ãšã©ãŒå€å®æé ã§åèšéåºæºãã¬ãŒã ãæ¶å€±ãããšå€å®ãããå Žåã«ãåèš é«åæåããŒã¿ãçæããããšãç¹åŸŽãšããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã  An error determination procedure for determining whether or not a part of the stream has been lost, and the high-frequency component data is generated when the error determination procedure determines that the non-reference frame has been lost. Digital audio data processing method.
[6] è«æ±é
5èšèŒã®ããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã«ãã¬ããŠã [6] The digital audio data processing method according to claim 5,
åèšãšã©ãŒå€å®æé ã§åèšéåºæºãã¬ãŒã ãæ¶å€±ãããšå€å®ãããå Žåãåœè©²æ¶ 倱ããéåºæºãã¬ãŒã ã«å¯Ÿå¿ããæå®ã®ãšã©ãŒçšããŒã¿ãçæãããšã©ãŒçšããŒã¿ çææé  An error data generation procedure for generating predetermined error data corresponding to the lost non-reference frame when the error determination procedure determines that the non-reference frame has been lost.
ãæããããšãç¹åŸŽãšããããžã¿ã«é³å£°ããŒã¿åŠçæ¹æ³ã
 A digital audio data processing method comprising:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2008517814A JP4551472B2 (en) | 2006-05-25 | 2007-05-09 | Digital audio data processing apparatus and processing method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006145553 | 2006-05-25 | ||
JP2006-145553 | 2006-05-25 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007138825A1 true WO2007138825A1 (en) | 2007-12-06 |
Family
ID=38778345
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2007/059585 WO2007138825A1 (en) | 2006-05-25 | 2007-05-09 | Digital audio data processing device and processing method |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP4551472B2 (en) |
WO (1) | WO2007138825A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003140692A (en) * | 2001-11-02 | 2003-05-16 | Matsushita Electric Ind Co Ltd | Coding device and decoding device |
JP2005024756A (en) * | 2003-06-30 | 2005-01-27 | Toshiba Corp | Decoding process circuit and mobile terminal device |
JP2005222014A (en) * | 2004-01-08 | 2005-08-18 | Matsushita Electric Ind Co Ltd | Device and method for signal decoding |
WO2005106848A1 (en) * | 2004-04-30 | 2005-11-10 | Matsushita Electric Industrial Co., Ltd. | Scalable decoder and expanded layer disappearance hiding method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4445328B2 (en) * | 2004-05-24 | 2010-04-07 | ãããœããã¯æ ªåŒäŒç€Ÿ | Voice / musical sound decoding apparatus and voice / musical sound decoding method |
-
2007
- 2007-05-09 JP JP2008517814A patent/JP4551472B2/en not_active Expired - Fee Related
- 2007-05-09 WO PCT/JP2007/059585 patent/WO2007138825A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003140692A (en) * | 2001-11-02 | 2003-05-16 | Matsushita Electric Ind Co Ltd | Coding device and decoding device |
JP2005024756A (en) * | 2003-06-30 | 2005-01-27 | Toshiba Corp | Decoding process circuit and mobile terminal device |
JP2005222014A (en) * | 2004-01-08 | 2005-08-18 | Matsushita Electric Ind Co Ltd | Device and method for signal decoding |
WO2005106848A1 (en) * | 2004-04-30 | 2005-11-10 | Matsushita Electric Industrial Co., Ltd. | Scalable decoder and expanded layer disappearance hiding method |
Also Published As
Publication number | Publication date |
---|---|
JPWO2007138825A1 (en) | 2009-10-01 |
JP4551472B2 (en) | 2010-09-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JPWO2008117524A1 (en) | Digital broadcast transmission apparatus, digital broadcast reception apparatus, and digital broadcast transmission / reception system | |
EP1879310A1 (en) | Method for decoding a digital radio stream | |
KR100636145B1 (en) | Exednded high resolution audio signal encoder and decoder thereof | |
JP2010114777A (en) | Broadcast receiving circuit and broadcast receiver | |
MX2010010368A (en) | Systems, methods and apparatus for transmitting data over a voice channel of a wireless telephone network. | |
JP2005024756A (en) | Decoding process circuit and mobile terminal device | |
US7509255B2 (en) | Apparatuses for adaptively controlling processing of speech signal and adaptively communicating speech in accordance with conditions of transmitting apparatus side and radio wave and methods thereof | |
JP4551472B2 (en) | Digital audio data processing apparatus and processing method | |
JP5047519B2 (en) | Digital audio data processing apparatus and processing method | |
JP4705556B2 (en) | Wireless microphone transmitter, receiver and system | |
JP2009244912A (en) | Speech signal processing device and speech signal processing method | |
US6198499B1 (en) | Radio-communication video terminal device | |
KR20060131164A (en) | Method for playing broadcasting stream stored in digital broadcasting receiver | |
JP4065383B2 (en) | Audio signal transmitting apparatus, audio signal receiving apparatus, and audio signal transmission system | |
JP2004129121A (en) | Transmitting apparatus and receiving apparatus of digital broadcast content, broadcast system and broadcast method | |
JPWO2005117275A1 (en) | Receiver | |
JP4683908B2 (en) | Video / audio output device | |
JP6181897B1 (en) | Broadcast signal receiving apparatus, broadcast signal receiving method, television receiver, control program, and recording medium | |
JP4385710B2 (en) | Audio signal processing apparatus and audio signal processing method | |
JP6181898B1 (en) | Broadcast signal transmission / reception system and broadcast signal transmission / reception method | |
JP6374053B2 (en) | Broadcast signal receiving apparatus, broadcast signal receiving method, television receiver, control program, and recording medium | |
JP5957156B2 (en) | Broadcast signal transmission / reception system and broadcast signal transmission / reception method | |
JP2008016882A (en) | Display apparatus, display control method, and display control program | |
JP6341809B2 (en) | Broadcast signal transmission / reception system and broadcast signal transmission / reception method | |
JP2007108219A (en) | Speech decoder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07743020 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008517814 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07743020 Country of ref document: EP Kind code of ref document: A1 |