WO1998046045A1 - Procede et dispositif de codage, procede et dispositif de decodage et support d'enregistrement - Google Patents
Procede et dispositif de codage, procede et dispositif de decodage et support d'enregistrement Download PDFInfo
- Publication number
- WO1998046045A1 WO1998046045A1 PCT/JP1998/001672 JP9801672W WO9846045A1 WO 1998046045 A1 WO1998046045 A1 WO 1998046045A1 JP 9801672 W JP9801672 W JP 9801672W WO 9846045 A1 WO9846045 A1 WO 9846045A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- signals
- encoding
- channels
- decoding
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 224
- 238000006243 chemical reaction Methods 0.000 claims description 80
- 238000001228 spectrum Methods 0.000 claims description 53
- 230000005236 sound signal Effects 0.000 claims description 48
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 claims description 9
- 230000011664 signaling Effects 0.000 claims description 2
- 230000009466 transformation Effects 0.000 claims 2
- 238000013139 quantization Methods 0.000 abstract description 58
- 230000006866 deterioration Effects 0.000 abstract description 13
- 238000012545 processing Methods 0.000 description 38
- 230000015654 memory Effects 0.000 description 35
- 238000010586 diagram Methods 0.000 description 26
- 230000003287 optical effect Effects 0.000 description 19
- 238000012546 transfer Methods 0.000 description 18
- 238000010606 normalization Methods 0.000 description 15
- 238000007726 management method Methods 0.000 description 12
- 230000003595 spectral effect Effects 0.000 description 12
- 238000000926 separation method Methods 0.000 description 11
- 238000007906 compression Methods 0.000 description 9
- 230000006835 compression Effects 0.000 description 9
- 238000013144 data compression Methods 0.000 description 9
- 238000005516 engineering process Methods 0.000 description 8
- 230000003044 adaptive effect Effects 0.000 description 6
- 238000000354 decomposition reaction Methods 0.000 description 6
- 230000007423 decrease Effects 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000008447 perception Effects 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 230000001256 tonic effect Effects 0.000 description 2
- 238000003775 Density Functional Theory Methods 0.000 description 1
- 201000009310 astigmatism Diseases 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000002542 deteriorative effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000001678 irradiating effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H20/00—Arrangements for broadcast or for distribution combined with broadcast
- H04H20/86—Arrangements characterised by the broadcast information itself
- H04H20/88—Stereophonic broadcast systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H40/00—Arrangements specially adapted for receiving broadcast information
- H04H40/18—Arrangements characterised by circuits or components specially adapted for receiving
- H04H40/27—Arrangements characterised by circuits or components specially adapted for receiving specially adapted for broadcast systems covered by groups H04H20/53 - H04H20/95
- H04H40/36—Arrangements characterised by circuits or components specially adapted for receiving specially adapted for broadcast systems covered by groups H04H20/53 - H04H20/95 specially adapted for stereophonic broadcast receiving
- H04H40/45—Arrangements characterised by circuits or components specially adapted for receiving specially adapted for broadcast systems covered by groups H04H20/53 - H04H20/95 specially adapted for stereophonic broadcast receiving for FM stereophonic broadcast systems receiving
- H04H40/72—Arrangements characterised by circuits or components specially adapted for receiving specially adapted for broadcast systems covered by groups H04H20/53 - H04H20/95 specially adapted for stereophonic broadcast receiving for FM stereophonic broadcast systems receiving for noise suppression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/007—Two-channel systems in which the audio signals are in digital form
Definitions
- the present invention is suitable for extending the format of an encoded signal.
- Encoding method and apparatus, and decoding corresponding thereto TECHNICAL FIELD The present invention relates to an encoding method and apparatus, and a recording medium on which an encoded signal is recorded.
- a signal recording medium such as a magneto-optical disk has been proposed as a device capable of recording a signal such as encoded audio information or audio information (hereinafter referred to as an audio signal).
- an audio signal has been proposed as a device capable of recording a signal such as encoded audio information or audio information (hereinafter referred to as an audio signal).
- One example of the high-efficiency encoding method is to block an audio signal on the time axis in a predetermined time unit, and to set the time axis for each block. This signal is converted into a signal on the frequency axis (spectrum conversion), divided into a plurality of frequency bands, and is encoded by each band.
- a non-blocking frequency band division method in which the audio signal on the time axis is divided into a plurality of frequency bands without being blocked, so-called band division coding (sub 'band' coding: SBC) And the like.
- band division coding sub 'band' coding: SBC
- SBC band division coding
- the signal of each band is converted into a signal on the frequency axis by a spectrum, and each of the spectrum-converted signals is converted. Encoding is performed for each band.
- a filter for band division used in the above-described band division coding there is a filter such as a so-called QMF (Quadrature Mirror filter). This is described in “Digital coding of speech in subbands” (RE Crochiere, Bell System. Tech. J., Vol. 55, No. 8 1976). This filter of QMF divides the band into two equal bandwidths. In this filter, so-called aliasing does not occur when combining the divided bands later. Has become. Also, literature
- Polyphase Quadrature filters-A new subband coding technique Joseph H. Rothweiler, ICASSP 83, BOSTON, etc. A bandwidth fill scheme is described.
- the characteristic of the polyphase aid quadrature filter is that the signal can be divided at once when it is divided into multiple bands of equal bandwidth.
- spectral transform for example, an input audio signal is divided into blocks in a predetermined unit time (frame), and a discrete Fourier transform (DFT), a discrete cosine transform (Discrete Cosine Transform: DCT), Modified Discrete Cosine Transform (Modified Discrete Cosine Transform: MDCT)
- DFT discrete Fourier transform
- DCT discrete cosine transform
- MDCT Modified Discrete Cosine Transform
- MDCT Modified Discrete Cosine Transform
- this block is referred to as a conversion block.
- M 1 sample data is overlapped on each of the transform blocks on both sides, so that these DFTs and DCTs are averaged.
- M pieces of real number data can be obtained for (M ⁇ M 1) samples of data, so that these M pieces of real number data are subsequently quantized and coded.
- the conversion block for the spectrum conversion when the conversion block for the spectrum conversion is lengthened, the frequency resolution is increased, and energy is concentrated on a specific spectrum signal component. Therefore, the spectrum conversion is performed with a long conversion block length in which the sample data is overlapped by half each between the conversion blocks on both sides, and the number of obtained spectrum signal components is reduced to the original time. If to use the MDCT that does not increase with respect to the number of sample data of the axis, DFT also or DCT becomes possible to perform efficient coding than the case of using c, conversion proc adjacent to By providing a sufficiently long overlap, the connection distortion between the conversion blocks of the waveform signal can be reduced.
- a frequency division width when quantizing each signal component obtained by dividing an audio signal into frequency bands, it is preferable to use a bandwidth in consideration of, for example, human auditory characteristics. That is, it is preferable to divide the audio signal into a plurality of (for example, 25 bands) bands, which are generally called critical bands in which the higher the band, the wider the bandwidth.
- encoding data for each band encoding is performed by a predetermined bit allocation for each band or by adaptive bit allocation (bit allocation) for each band.
- bit allocation adaptive bit allocation
- bits are allocated based on the signal magnitude of each band.
- the quantization noise spectrum is flattened and the noise energy is minimized, but the actual noise sensation is not optimal because the masking effect is not used in terms of hearing.
- a real bit allocation reference value that achieves the signal-to-noise characteristic obtained by calculation as closely as possible is determined, and an integer value that approximates it is used as the number of allocated bits.
- an integer value that approximates it is used as the number of allocated bits.
- quantization accuracy information and normalization coefficient information are encoded with a predetermined number of bits for each band in which normalization and quantization are performed. What is necessary is just to encode the quantized spectrum signal component.
- the IS0 standard (IS0 / IEC 1 1 1 7 2-3: 1 993 (E), 1 993) specifies that the number of bits representing quantization accuracy information differs depending on the band.
- a high-efficiency coding method is described here, and is standardized so that the number of bits representing quantization accuracy information decreases as the frequency band increases.
- the decoding device determines the quantization accuracy information from the normalization coefficient information, for example. Since the relationship between the information and the quantization accuracy information is determined, it will not be possible to introduce quantization accuracy control based on a more advanced auditory model in the future. If there is a range of compression rates to be realized, it is necessary to determine the relationship between the normalization coefficient information and the quantization accuracy information for each compression rate.
- PCT International Publication W094 / 28633 include, in the specification and the drawings, a component that is particularly important in the sense of hearing from the spectral signal component and separate the other spectral signal.
- a method of encoding separately from the components has been proposed, which makes it possible to efficiently encode audio signals and the like at a high compression ratio with almost no deterioration in auditory perception. Has become.
- each of the above-described encoding methods can be applied to each channel of an audio signal including a plurality of channels. For example, it may be applied to each of the L channel corresponding to the left speaker and the R channel corresponding to the right speaker. Further, it is also possible to apply to (L + R) / 2 signals obtained by adding the signals of the L channel and the R channel. It is also possible to perform efficient coding on the (L + R) / 2 signal and the (L ⁇ R) / 2 signal using the above-described methods.
- the amount of data required to encode a one-channel signal is half that required to encode a two-channel signal independently, so when recording a signal on a recording medium, A method is often adopted in which both a mode for recording with a monaural signal and a mode for recording with a two-channel stereo signal are provided, and when long-term recording is required, the standard is set so that the signal can be recorded as a monaural signal. ing.
- flag information and the like relating to the standard can be recorded in advance on the signal recording medium in consideration of the case where the standard is changed or extended in the future.
- a common practice is to leave room. That is, for example, when performing standardization for the first time, “0” should be recorded as 1-bit flag information, and When the status is changed, “1” is recorded in the flag information.
- the playback device corresponding to the changed standard checks whether the flag information is “0” or “1”. If the flag information is “1”, the signal from the signal recording medium is output based on the changed standard. Is read and reproduced. If the flag information is “0”, if the playback device also complies with the first established standard, read the signal from the signal recording medium based on that standard and reproduce it. If it does not, do not reproduce the signal.
- a playback device that can only reproduce signals that have been recorded in accordance with a once established standard (hereinafter referred to as “old standard” or “first encoding method”) (hereinafter referred to as “old standard compatible playback”) (hereinafter referred to as “old standard compatible playback”)
- old standard compatible playback With the spread of “devices”, this older standard-compatible playback device uses higher-efficiency encoding methods (hereinafter referred to as “new standards” or “second encoding methods”). Cannot reproduce the recording medium recorded by using this function, which confuses the user of the device.
- the playback device at the time when the old standard was determined ignores the flag information recorded on the recording medium, and all signals recorded on the recording medium conform to the old standard. Some of them are reproduced as encoded. In other words, even if the recording medium is recorded based on the new standard, not all old standard-compatible playback devices can identify that. For this reason, in the case where the reproduction device compatible with the old standard reproduces a recording medium on which a signal based on the new standard is recorded as being interpreted as a recording medium on which a signal based on the old standard is recorded, for example. May not work properly or generate severe noise. In order to solve this problem, the applicant of the present application
- the message signal can be reproduced when played back on an old standard-compatible playback device, so that it can be easily recorded on a low-cost new standard-compatible recording device.
- the message signal is played back in accordance with the part recorded in the new standard, so that which songs are actually played back in the old standard.
- the applicant of the present application has proposed an encoding method that encodes a multi-channel signal for each frame whose size cannot be controlled by an encoder.
- the signal of the channel to be reproduced by the old standard-compatible reproducing apparatus is encoded with a smaller number of bits than the maximum number of bits that can be allocated in the frame.
- the encoding method of the channel signal which is not reproduced by the old standard-compatible reproducing apparatus has higher encoding efficiency than the encoding method of the old standard. It is possible to reduce the resulting sound quality deterioration.
- FIGS. 1A to 1H show the state of quantization noise generated when encoding and decoding a general stereo signal and reproducing the signal in these methods.
- FIGS. 1A and 1B show the time axis waveforms of the left channel component (L) of the stereo signal and the right channel component (R) of the stereo signal, respectively
- FIGS. 1C and 1D show the L and R channel components ( L-R) / 2 and (L-R) / 2 represent the time axis waveforms of the signals converted to channels, respectively.
- (L + R) / 2 is represented by A
- (L-I / 2 is represented by B.
- the level of quantization noise often depends on the original signal level. In this case, the signal level of N2 is much smaller than N1.
- FIGS. 1G and 1H show how the channels of the stereo signal are separated from the signal waveforms of (A + Nl) and (B + N2). By adding the signals (A + N1) and (B + N2), the R channel component disappears and only the L component can be extracted. Also, the signal of (A + N1) to (B + N2) By subtracting, the L channel component disappears and only the R channel component can be extracted.
- the quantization noise components N 1 and N 2 are (N 1 + N2) or (N 1—N 2) However, since the level of N 2 is extremely small compared to N 1, there is no particular problem in hearing.
- FIGS. 2A and 2H similarly show the state of quantization noise for a stereo signal in which the signal level of the right channel (R) is much smaller than the signal level of the left channel (L).
- Figures 2A and 2B show the time axis waveforms of the left channel component (L) of the stereo signal and the right channel component (R) of the stereo signal, respectively.
- Figures 2C and 2D show the L and R channel components, respectively. The time axis waveform of the signal converted to (L + R) / 2 and (LR) / 2 is shown.
- (L + R) / 2 is represented by A
- (L-R) / 2 is represented by B, as in the example of FIG.
- the time axis waveform of each component is shown.
- FIGs 2G and 2H also show the situation where each channel of the stereo signal is separated from the signal waveforms of (A + Nl) and (B + N2), as in Figure 1.
- the R channel component disappears, only the L component can be extracted
- (A + N By subtracting the signal of (B + N 2) from 1), the L channel component disappears and only the R channel component can be extracted.
- the quantization noise components N 1 and N 2 are (N 1 + N
- the present invention has been made in view of such circumstances, and has been developed in view of encoding / decoding that realizes multi-channel by extension of a new standard while enabling reproduction by an old standard-compatible reproducing apparatus.
- An object of the present invention is to provide an encoding method and apparatus, a decoding apparatus, and a recording medium capable of minimizing quantization noise generated by encoding and reducing sound quality deterioration.
- the present invention provides an encoding / decoding method for realizing multi-channel by expansion of a new standard while enabling reproduction by an old standard-compatible reproduction apparatus, for example, and optimally selects a channel signal of an expansion part according to an input signal.
- the sound quality degradation is reduced by minimizing the quantization noise generated by encoding.
- An encoding method generates a first signal from signals of a plurality of input channels, obtains signal levels of some of the plurality of input channels and other channels, and, based on the signal levels, , A second signal consisting only of signals of some channels and signals of multiple input channels , And encodes the first signal and the selected second signal.
- An encoding device includes: first signal generation means for generating a first signal from signals of a plurality of input channels; Second signal generation means for selecting any one of a second signal consisting only of signals of some channels and a second signal generated from signals of a plurality of input channels, based on the first signal, Encoding means for encoding the selected second signal.
- the decoding method separates a first coded signal, a second coded signal, and configuration information indicating a configuration state of a channel signal included in the second coded signal from a code sequence. Then, the separated first and second encoded signals are respectively decoded to generate first and second signals, and a plurality of channel signals are generated from the first and second signals based on the configuration information. Select the restoration process to perform.
- the decoding method according to the present invention is based on a first coded signal generated and coded from signals of a plurality of channels, and a signal level of a part of channels of the plurality of channels and a signal level of another channel. From a code string including a second signal composed of only some channel signals or a second encoded signal selected and encoded from a second signal generated from a plurality of channel signals. The first and second encoded signals are separated, the separated first and second encoded signals are respectively decoded, and signals of a plurality of channels are restored from the decoded first and second signals.
- the decoding device is configured such that a first coded signal, a second coded signal, and a channel signal included in the second coded signal are configured from the code sequence.
- Separation means for separating the configuration information indicating the configuration state, decoding means for decoding the separated first and second encoded signals, respectively, to generate first and second signals, and based on the configuration information.
- control means for selecting restoration processing for generating a plurality of channel signals from the first and second signals.
- a decoding device is configured to generate a first coded signal generated and coded from signals of a plurality of channels, and a signal level of some of the plurality of channels and signal levels of other channels. From a code string including a second coded signal selected and coded from either a second signal composed of only some channel signals or a second signal generated from a plurality of channel signals. Separation means for separating the first and second encoded signals; decoding means for decoding the separated first and second encoded signals, respectively; and decoding means for decoding the first and second signals. Restoration means for restoring signals of a plurality of channels.
- FIGS. 1A to 1H are diagrams used to describe the state of quantization noise generated when a general stereo signal is encoded and decoded by a conventional technique and reproduced.
- FIG. 9 is a diagram used for describing the state of quantization noise.
- FIG. 3 shows an embodiment of a recording / reproducing apparatus for compressed data according to the present invention.
- 1 is a block circuit diagram illustrating a configuration example of a recording / reproducing apparatus according to an embodiment.
- FIG. 4 is a block circuit diagram showing a specific configuration example of the encoding device.
- FIG. 5 is a block circuit diagram showing a specific configuration example of the conversion circuit.
- FIG. 6 is a block circuit diagram showing a specific configuration example of the signal component encoding circuit.
- FIG. 7 is a block circuit diagram showing a specific configuration example of the decoding device.
- FIG. 8 is a block circuit diagram showing a specific configuration example of the inverse conversion circuit.
- FIG. 9 is a block diagram showing a specific configuration example of the signal component decoding circuit.
- FIG. 10 is a diagram for explaining a basic encoding method.
- FIG. 11 is a diagram for explaining a configuration of a code sequence of a frame encoded by a basic encoding method.
- FIG. 12 is a diagram showing an example in which L and R channels are arranged for each frame.
- FIG. 13 is a diagram illustrating an example of arranging (L + R) Z2 channels in a frame.
- FIG. 14 is a diagram for explaining an encoding method for encoding a signal component by dividing it into a tone component and a noise component.
- FIG. 15 is a diagram for explaining the configuration of a code sequence of a frame encoded by an encoding method in which a signal component is encoded into tone components and noise components.
- FIG. 16 is a block circuit diagram showing a specific configuration example of a signal component encoding circuit that encodes a signal component by dividing it into a tone component and a noise component.
- Fig. 17 shows a specific configuration example of a signal component decoding circuit that decodes an encoded signal by dividing a signal component into a tone component and a noise component.
- FIG. 18 is a diagram for explaining a recording format when recording a code string of the A codec.
- FIG. 19 is a diagram for explaining a recording format when recording the code strings of the A codec and the B codec.
- FIG. 20 is a diagram for explaining a recording format that realizes that the codec of the A codec and the B codec is not mistakenly reproduced by the old-standard-compliant playback device when the codec is recorded.
- FIG. 21 is a diagram for explaining a configuration of a code string for arranging signals of the A codec and the B codec in a frame.
- FIG. 22 is a block circuit diagram showing a specific configuration of an encoding device that generates a code string in which signals of the A codec and the B codec are arranged in a frame.
- FIG. 23 is a flowchart illustrating a processing example of an encoding device that generates a code sequence for arranging the signals of the A codec and the B codec in a frame.
- FIG. 24 is a block circuit diagram showing a specific configuration of a signal component decoding device that decodes a code string for arranging A codec and B codec signals in a frame.
- FIG. 25 is a flowchart illustrating a processing example of a signal component decoding device that decodes a code string in which signals of the A codec and the B codec are arranged in a frame.
- FIG. 26 is a diagram for explaining the configuration of the code string according to the embodiment of the present invention in which the channel configuration data and the A-channel and B-channel signals are arranged in a frame.
- FIG. 27 is a block circuit diagram showing a specific configuration of a coding apparatus for generating a code string according to the embodiment of the present invention in which channel configuration data and signals of A and B channels are arranged in a frame. is there.
- FIG. 28 is a flowchart showing the flow of processing of the control circuit of the encoding device for generating a code sequence according to the embodiment of the present invention.
- FIG. 29 is a flowchart showing a processing example of the coding apparatus according to the embodiment of the present invention for generating a code string in which a channel configuration data and A-channel and B-channel signals are arranged in a frame. .
- FIG. 30 is a block circuit diagram showing a specific configuration of an encoding device according to an embodiment of the present invention that decodes a code string in which channel configuration data and A-channel and B-channel signals are arranged in a frame. is there.
- FIG. 31 is a flowchart showing the flow of the process performed by the decoding device for decoding the code string according to the embodiment of the present invention.
- FIG. 32 is a flowchart showing a flow of a process of determining a decoding mode in the decoding device for decoding a code string according to the embodiment of the present invention.
- FIGS. 33A to 33H illustrate encoding and decoding of a stereo signal in which the signal level of the right channel (R) is much smaller than the signal level of the left channel (L) according to the embodiment of the present invention.
- FIG. 9 is a diagram used to explain the state of quantization noise that occurs when reproduction is performed.
- BEST MODE FOR CARRYING OUT THE INVENTION BEST MODE FOR CARRYING OUT THE INVENTION
- FIG. 3 shows a schematic configuration of a compressed data recording and / or reproducing apparatus to which an embodiment of the present invention is applied.
- a magneto-optical disk 1 driven by a spindle motor (M) 51 is first used as a recording medium.
- M spindle motor
- the recording track of the magneto-optical disk 1 is traced with a laser beam by the optical head 53 and magneto-optical reproduction is performed.
- the optical head 53 includes, for example, a laser light source such as a laser diode, an optical component such as a collimating lens, an objective lens, a polarizing beam splitter, and a cylindrical lens, and a photodetector having a light receiving section having a predetermined pattern. It is composed of The optical head 53 is provided at a position facing the magnetic head 54 via the magneto-optical disk 1.
- a magnetic head 54 is driven by a magnetic head drive circuit 66 of a recording system to be described later to apply a modulation magnetic field corresponding to the recording data.
- the target track of the magneto-optical disk 1 is irradiated with laser light by the optical head 53 to perform thermomagnetic recording by a magnetic field modulation method.
- the optical head 53 detects the reflected light of the laser beam applied to the target track, for example, detects a focus error by a so-called astigmatism method, and detects a tracking error by a so-called push-pull method, for example. .
- the optical head 53 detects the above-mentioned focus error and tracking error, and at the same time, changes the polarization angle (Kerr rotation angle) of the reflected light of the laser beam from the target track. To Upon detection, a reproduction signal is generated.
- the output of the optical head 53 is supplied to an RF circuit 55.
- the RF circuit 55 extracts the focus error signal and the tracking error signal from the output of the optical head 53 and supplies the signals to the servo control circuit 56.
- the servo control circuit 56 includes, for example, a focus servo control circuit, a tracking servo control circuit, a spindle motor servo control circuit, a thread servo control circuit, and the like.
- the focus servo control circuit performs focus control of the optical system of the optical head 53 so that the focus error signal becomes zero.
- the tracking servo control circuit controls the tracking of the optical system of the optical head 53 so that the tracking error signal becomes zero.
- the spindle motor servo control circuit controls the spindle motor 51 so as to rotate the magneto-optical disk 1 at a predetermined rotation speed (for example, a constant linear speed).
- the thread servo control circuit moves the optical head 53 and the magnetic head 54 to the target track position of the magneto-optical disk 1 specified by the system controller 57.
- the servo control circuit 56 that performs such various control operations sends information indicating the operation state of each unit controlled by the servo control circuit 56 to the system controller 57.
- a key input operation unit 58 and a display unit (display) 59 are connected to the system controller 57.
- the system controller 57 controls the recording system and the reproduction system based on the operation input information based on the operation input information from the key input operation unit 58.
- the system controller Reference numeral 57 denotes an optical head 53 and a magnetic head 54 based on sector-based address information reproduced from a recording track of the magneto-optical disk 1 by a header time, a subcode Q data, or the like. Manages the recording position and playback position on the above recording track. Further, the system controller 57 performs control to display the reproduction time on the display section 59 based on the data compression ratio of the present compressed data recording / reproducing apparatus and the reproduction position information on the recording track.
- This playback time display is based on the so-called header time from the recording track of the magneto-optical disk 1 or the sector-based address information (absolute time information) played back by the so-called subcode Q-delay.
- the actual time information is obtained by multiplying the reciprocal of the compression ratio (for example, 4 for 1Z4 compression), and this is displayed on the display unit 59.
- the preformatted absolute time information is read to perform data compression. By multiplying the reciprocal of the rate, it is possible to display the current position with the actual recording time.
- the analog audio input signal A in from the input terminal 60 is supplied to the A / D converter 62 via the low-pass filter (LPF) 61, A / D converter
- the 62 quantizes the analog audio input signal A in.
- the digital audio signal obtained from the A / D converter 62 is supplied to an ATC (Adaptive Transform Coding) encoder 63.
- the digital audio input signal D in from the input terminal 67 is connected to the digital input / output interface circuit (digital input) 68 via the ATC. It is supplied to the encoder 63.
- the ATC encoder 63 compresses the input signal Ain by the A / D converter 62 into digital audio PCM data at a predetermined transfer rate and performs bit compression according to a predetermined data compression rate. (Data compression) Processing is performed.
- the compressed data (ATC data) output from the ATC encoder 63 is supplied to the memory 64. For example, if the data compression rate is 1/8, the data transfer rate here is the data transfer rate of the standard CD-DA format (75 sectors / Second) (9.375 sector / second).
- the writing and reading of data are controlled by the system controller 57, and the ATC data supplied from the ATC encoder 63 is temporarily stored. It is used as a buffer memory for recording on a disk. That is, for example, when the data compression ratio is 1/8, the data rate of the compressed audio data supplied from the ATC encoder 63 is the data transfer rate of the standard CD_DA format. This is reduced to 1/8 of (75 sectors / second), that is, 9.375 sectors / second, and the compressed data is continuously written to the memory 64. As for the compressed data (ATC data), it is sufficient to record one sector per eight sectors as described above. However, such recording every eight sectors is practically impossible, so It is designed to record consecutive sectors.
- This recording uses the same data transfer rate (75 sectors) as the standard CD-DA format, with the recording unit consisting of a class consisting of a predetermined plurality of sectors (for example, 32 sectors and more than 10 sectors) throughout the idle period. Evening / second).
- the data compression rate of 1/8 that was continuously written at a low transfer rate of 9.375 ( 75/8) sectors / second corresponding to the above bit compression rate
- ATC audio data is read out in bursts at a transfer rate of 75 sectors / sec as described above.
- the overall data transfer rate, including the recording pause period, is as low as 9.375 sectors / sec.
- the instantaneous data transfer rate within the time of the recording operation performed at the time is the standard 75 sectors / second described above.
- the disk rotation speed is the same speed (constant linear speed) as the standard CD-DA format
- recording with the same recording density and storage pattern as the CD-DA format is performed.
- the ATC audio data that is burst-read from the memory 64 at the above (instantaneous) transfer rate of 75 sectors / second, that is, the recorded data, is supplied to the encoder 65.
- the unit of continuous recording in one recording is a class set consisting of a plurality of sectors (for example, 32 sectors) and before and after the class set. It has several sectors for class connection at the location.
- the class connection sector is set to be longer than the length of the in-leave in the encoder 65 so that even if the in-leave is left, it does not affect the de- night of other classes.
- the encoder 65 performs encoding processing for error correction (parity addition and in-recovery processing) and EFM encoding processing on the recording data supplied in a burst manner from the memory 64 as described above. And so on.
- the recording data subjected to the encoding process by the encoder 65 is supplied to the magnetic head drive circuit 66.
- This magnetic head drive circuit The path 66 is connected to a magnetic head 54, and drives the magnetic head 54 so that a modulation magnetic field corresponding to the recording data is applied to the magneto-optical disk 1.
- the system controller 57 performs the above-described memory control on the memory 64, and stores the above-described recording data, which is burst-read from the memory 64 by this memory control, on the recording track of the magneto-optical disk 1.
- the recording position is controlled so as to continuously record.
- the control of the recording position is performed by controlling the recording position of the recording data read out from the memory 64 by the system controller 57 in a burst manner and designating the recording position on the recording track of the magneto-optical disk 1. This is done by supplying signals to the servo control circuit 56.
- This reproducing system will be described. This reproducing system is for reproducing the recorded data continuously recorded on the recording tracks of the magneto-optical disk 1 by the recording system described above.
- a decoder 71 is provided, in which a reproduction output obtained by tracing one recording track with a laser beam is binarized by an RF circuit 55 and supplied. At this time, not only the magneto-optical disk but also the same read-only optical disk as the so-called compact disk (CD: Compact Disc) can be read.
- CD Compact Disc
- the decoder 71 corresponds to the encoder 65 in the recording system described above, and performs decoding processing as described above for error correction and EFM on the reproduced output binarized by the RF circuit 55. Performs processing such as decoding, and plays back the ATC audio data with a data compression rate of 1/8 above at a transfer rate of 5 sectors / second, which is faster than the regular transfer rate.
- the reproduced data obtained by the decoder 71 is stored in a memory Supplied to 72.
- the memory 72 is controlled by the system controller 57 to write and read data overnight, and the playback data supplied from the decoder 71 at a transfer rate of 75 sectors / second is 75 sectors / second. It is written in burst at the transfer speed of.
- the memory 72 stores the reproduction data written in a burst at the transfer rate of 75 sectors / sec. In the memory 72 corresponding to the data compression rate of 1/8. It is read continuously at a transfer rate of seconds.
- the system controller 57 writes the playback data to the memory 72 at a transfer rate of 75 sectors / second, and simultaneously writes the playback data from the memory 72 at the transfer rate of 9.3 75 sectors / second. Performs memory control to read continuously. Further, the system controller 57 performs the above-mentioned memory control for the memory 72, and the reproduction data written in a burst from the memory 72 by this memory control is recorded on the recording track of the magneto-optical disk 1.
- the playback position is controlled so that playback is continued from the beginning.
- the playback position is controlled by controlling the playback position of the playback data read out from the memory 72 in a burst manner by the system controller 57 and recording tracks of the magneto-optical disk 1 or the magneto-optical disk 1. This is performed by supplying a control signal designating the above playback position to the servo control circuit 56.
- the ATC audio data obtained as read data continuously read out from the memories 72 to 9.375 sectors / second is supplied to the ATC decoder 73.
- the ATC decoder 73 corresponds to the ATC encoder 63 of the recording system. For example, by expanding the ATC data by 8 times (bit expansion), the 16-bit digital data is expanded. Play audio data.
- the digital audio data from the ATC decoder 73 is supplied to a D / A converter 74.
- the D / A converter 74 converts the digital audio data supplied from the ATC decoder 73 into an analog signal, and forms an analog audio output signal Aout.
- the analog audio signal Aout obtained by the D / A converter 74 is output from an output terminal 76 via a one-pass filter (LPF) 75.
- LPF one-pass filter
- the input signal waveform 110a is converted by the conversion circuit 111a.
- the signal frequency component 110b is converted to a signal frequency component 110b, and each obtained frequency component 110b is encoded by a signal component encoding circuit 111b.
- a code string 110d is generated from the coded signal 110c generated by the conversion circuit 111b.
- the input signal 120a is divided into two bands by a band division filter 112a as shown in FIG. 120c is converted into spectrum signal components 120d and 120e by forward spectrum conversion circuits 112b and 112c using MDCT or the like.
- the input signal 120a corresponds to the signal waveform 110a in FIG. 4
- the spectrum signal component 120 d and 120 e correspond to the signal frequency component 110 b in FIG.
- the bandwidth of the signals 120 b and 120 c divided into the above two bands is equal to the input signal 1
- the conversion circuit 111a may be many other than this specific example.
- a conversion circuit that directly converts an input signal into a spectrum signal by MDCT may be used. It may be converted by DCT.
- DCT digital coherence tomography
- each signal component 130a is normalized by the normalization circuit 113a for each predetermined band, and the quantization accuracy
- the decision circuit 1 1 3 b calculates the quantization accuracy information 1 3 0 c from the signal component 1 3 0 a and calculates the quantization accuracy information 1
- the quantizing circuit 113c quantizes the normalized signal 130b from the normalizing circuit 113a.
- Each signal component 130a corresponds to the signal frequency component 110b in FIG. 4, and the output signal 130d of the quantization circuit 113c corresponds to the encoded signal 111 in FIG. Corresponds to 0c.
- the output signal 130d includes, in addition to the quantized signal components, normalization coefficient information at the time of the normalization and the quantization accuracy information.
- a decoding device that reproduces an audio signal from a code sequence generated by the above-described coding device (decoder 7 in the example of FIG. 3).
- the code sequence decomposition circuit 114a extracts the code 144b of each signal component from the code sequence 140a by the code sequence decomposition circuit 114a.
- Each signal component 140c is restored from 0b by the signal component decoding circuit 114b, and from the restored signal component 140c, the acoustic waveform signal 114 is restored by the inverse transform circuit 114c. 0 d is reproduced.
- the inverse conversion circuit 114c of this decoding device is configured as shown in FIG. 8, and corresponds to the conversion circuit shown in FIG.
- the inverse spectrum conversion circuits 115a and 115b respectively invert the supplied input signals 150a and 150b respectively.
- the signal of each band is restored by performing spectrum conversion, and the band synthesis filter 115c synthesizes each of these band signals.
- the output signal 150 e of the band synthesis filter 115 c corresponds to the acoustic waveform signal 140 d of FIG.
- the signal component decoding circuit 114b of FIG. 7 is configured as shown in FIG. 9, and the code 140b from the code string decomposition circuit 114a of FIG. It performs inverse quantization and inverse normalization on the vector signal.
- the input code 160a is inversely quantized by the inverse quantization circuit 116a, and the inverse quantization is performed by the inverse normalization circuit 116b.
- the signal 160b obtained by the conversion is inversely normalized to output a signal component 160c.
- each spectrum component shown in FIG. 10 is obtained by converting the absolute value of the spectrum component by MDCT into [dB]. That is, in this signal encoding device, the input signal is converted into 64 spectrum signals for each predetermined conversion block, which is represented by eight symbols shown in [1] to [8] in the figure. It is normalized and quantized collectively in a band (hereinafter referred to as an encoding unit). At this time, if the quantization accuracy is changed for each of the coding units according to the manner of distribution of the frequency components, it is possible to perform audio-efficient coding with minimum deterioration of sound quality.
- FIG. 11 shows a configuration example of a code string in the case of encoding by the above-described method.
- the data for restoring the spectrum signal of each conversion block contains information encoded corresponding to a frame composed of a predetermined number of bits. Have been. At the beginning of each frame (header part), first, information obtained by coding control data such as the number of coded units being coded with a fixed number of bits is used. The information obtained by coding the quantization accuracy data and the normalization coefficient data of the unit from the low-frequency side coding unit is finally obtained for each coding unit. Information obtained by encoding the spectral coefficient data normalized and quantized based on the quantization accuracy data from the low-frequency side is arranged.
- the number of bits actually required to restore the spectrum signal of this conversion block is determined by the number of encoded units described above and each It is determined by the number of quantization bits indicated by the quantization accuracy information of the coding unit, and the amount may be different for each frame. Only the required number of bits from the beginning of each frame has meaning during playback, and the remaining area of each frame becomes an empty area and does not affect the playback signal. Normally, more bits are used effectively to improve the sound quality so that the free space in each frame is as small as possible.
- each conversion block in correspondence with a frame of a fixed number of bits, for example, when this code string is recorded on a recording medium such as an optical disc, an arbitrary conversion Since the recording position of the block can be easily calculated, it is possible to easily realize so-called random access in which reproduction is performed from an arbitrary position.
- FIGS. 12 and 13 show an example of a recording format when the data of the frame shown in FIG. 11 is arranged in a recording medium or the like in a time-series manner, for example.
- FIG. 12 shows an example in which, for example, two channels of L (left) and R (right) signals are alternately arranged for each frame.
- FIG. 13 shows that two-channel signals of L and L are (L + In this example, a single-channel signal (monaural signal generated from two channels, L and R) generated by R) / 2 is arranged for each frame.
- the recording format as shown in Fig. 12 is called, for example, the standard time mode, as shown in Fig. 13, the recording format that enables long-time recording and reproduction of signals with a small number of channels is as described above. It can be called a long time mode that can record and play twice as long as the standard time mode. Also, in the example of FIG. 12 as well, if only one monaural channel is recorded for each frame instead of two channels of L and R, it is better than when two channels of L and R are recorded. This means that a signal with twice the time can be recorded, and this case can also be called a long-time mode.
- FIG. 11 only the method described in FIG. 11 has been described as an encoding method. However, the encoding method described in FIG. 11 can further improve the encoding efficiency. .
- those with a high appearance frequency are assigned a relatively short code length, and those with a low appearance frequency are assigned a relatively long code length.
- encoding efficiency can be improved.
- the predetermined conversion block at the time of encoding the input signal that is, the time block length for spectrum conversion
- sub-information such as quantization accuracy information and normalization coefficient information can be obtained. Since the amount can be reduced relatively per block and the frequency resolution also increases, the quantization accuracy on the frequency axis can be more finely controlled and the coding efficiency can be increased .
- FIG. 14 shows a state in which three tone components, which are grouped as tone signal components, are separated from the spectrum signal components, and the signal components constituting each of these tone components are: It is encoded together with the position data on the frequency axis of each tone component.
- the spectral coefficient (a non-tone spectral signal component) in the encoding unit can be quantized with a relatively small number of steps without deteriorating the perceived sound quality.
- FIG. 14 shows only a relatively small number of spectral signal components for simplicity, the actual tone component is composed of several tens of spectral signal components. Since the energy is concentrated on several signal components in the unit, the increase in the amount of data due to the separation of such tone components is relatively small, and by separating these tone components, the overall Efficiency can be improved.
- FIG. 15 shows an example of the configuration of a code string when the encoding is performed by the method described with reference to FIG.
- a synchronization signal and a coded unit Information obtained by encoding control data such as numbers with a predetermined number of bits is arranged, and then information obtained by encoding tone component data, which is data on tone components, is arranged.
- the tone component data first, information obtained by encoding the number of each signal component in the tone component, secondly, positional information on the frequency axis of each tone component, and then quantization accuracy data in the tone component.
- the normalized coefficient data Information in which the normalized, quantized and toned signal components (spectrum coefficient data) are encoded, respectively.
- the data of the remaining signal obtained by subtracting the above-mentioned tonic signal component from the original spectral signal component (also referred to as a noisy signal component) is shown.
- the encoded information is arranged. This includes a quantization precision data and a normalization coefficient data of each encoding unit and a scan normalized and quantized based on the above-described normalization coefficient data and the quantization precision data for each encoding unit.
- Information obtained by encoding the vector coefficient data (signal components other than tone components) from the encoding unit on the lower frequency side is arranged.
- the spectrum signal components (coefficient data) of tone characteristics and other signal components have been subjected to variable-length coding.
- FIG. 16 shows a specific example of the signal component encoding circuit 11 lb of FIG. 4 in the case of separating a tone signal component from each of the above signal components.
- the signal component 170a (110b) supplied from the conversion circuit 111a in FIG. Sent to 7a.
- the signal component 170a is divided into tone signal components and other signal components (non-tone signal components), and the tone signal component 170b is a tone component encoding circuit.
- the signal component 170c of the non-tone component is sent to the non-tone component encoding circuit 117c.
- the tone component encoding circuit 117b and the non-tone component encoding circuit 117c encode the supplied signal components and output the obtained output signals 170d and 170e, respectively. I do.
- the tone component encoding circuit 11 ⁇ b also generates each piece of information constituting the tone component data of FIG. 15 at the same time as encoding the above-described tone component signal component.
- the configurations for signal encoding in the tone component encoding circuit 117b and the non-tone component encoding circuit 117c are the same as those in FIG.
- FIG. 17 shows a specific example of the signal component decoding circuit 114 b of FIG. 7 in the case where the tonal signal component is separated from each of the above signal components.
- the code 140 b supplied from the code string decomposing circuit 114 a of FIG. It consists of a non-tone signal component 180b, and these data and signal component are sent to the corresponding tone component decoding circuit 1 18a and non-tone component decoding circuit 1 18b, respectively.
- the tone component decoding circuit 118a decodes the tonal signal component from the tone component data as shown in FIG. 15 and obtains the obtained tonal signal component 180c Is output. Further, the non-tone component decoding circuit 118b decodes the non-tone signal component and outputs the obtained non-tone signal component 180d.
- Both the tonal signal component 180c and the non-tonal signal component 180d are sent to the spectrum signal synthesizing circuit 118c.
- the spectrum signal synthesizing circuit 118c synthesizes the tone-based signal component and the non-tonic signal component based on the position data, and obtains the obtained signal component 180e. Is output. Note that the configurations for signal decoding in the tone component decoding circuit 118a and the non-tone component decoding circuit 118b are the same as those in FIG.
- FIG. 18 shows an example of a format when the signal encoded as described above is recorded on a magneto-optical disk, for example.
- a total of, for example, four (four songs) audio signal data are recorded.
- management data used when recording and reproducing the audio signal data are recorded together with the audio signal data for all four.
- the first data number and the last data number are recorded at addresses 0 and 1 of the management data area, respectively.
- 1 is recorded as the value of the first data number
- 4 is recorded as the value of the last data number. From this, it can be seen that four audio signal data from the first to the fourth are recorded on this disc.
- the old standard or A-code a certain coding method
- the recording format on the disc has been standardized using this method.
- a more efficient coding method hereinafter, referred to as a new standard or a B-codec
- the signal encoded by the B codec can be recorded on the same type of disc as the signal recorded by the A codec. If the signal from the B-codec can be recorded in the same way as in the case of the A-codec, it is possible to record a signal for a longer time on the disc or to record a signal with higher sound quality. Therefore, the use of the disc is widened and convenient.
- A-codec When the encoding method described with reference to FIG. 11 described above is considered as an A-codec, for example, as described above, a relatively short time is applied to a quantized spectrum signal having a high frequency of appearance.
- a coding method using so-called variable length coding technology which allocates a code length and allocates a relatively long code length to those with a low frequency of occurrence, can be considered as a B-codec.
- the amount of sub-information such as quantization accuracy information and normalization coefficient information is relatively reduced per block by increasing the transform block length when encoding the input signal.
- Such an encoding method can be considered as a B codec.
- a coding method that separates and encodes a spectrum signal component into a tone component and a non-tone component can be considered as a B codec.
- a combination of these highly efficient coding methods can be considered a B codec.
- the area for recording the address information (start address and end address) of each audio signal data as shown in FIG. 18 is used.
- One of the spare areas provided next to the area is used as an area for codec specification information.
- the codec specification information indicates that the audio signal data specified by the address information consisting of the start address and the end address is encoded based on the old standard (A codec).
- a codec old standard
- B codec new standard
- audio signal data encoded by the A codec and audio signal data encoded by the B codec can be mixed and recorded on the same disc, and the disc conforms to the new standard (B codec). It can be played back by a compatible playback device (hereinafter referred to as a new standard compatible playback device).
- a disc on which a mixture of A-codec and B-codec data was recorded was recorded on the A-codec, that is, whether the disc was recorded using the old standard or the B-codec, that is, the new standard. It is not possible to visually determine whether the record was made at. Therefore, there is a possibility that the user will play this disc on an old standard compatible playback device.
- the playback device conforming to the old standard does not check the content of address 2, which is always set to the value 0 as shown in FIG. 18 in the old standard, and the signal recorded on the disc is Since reproduction is attempted by interpreting all as being caused by the A-codec, there is a high risk of being unable to reproduce or generating cluttered and distracting noises and causing confusion to the user.
- a message signal according to the old standard (A-code) is recorded in advance on the recording medium, and when recording according to the new standard, the contents of the reproduction management information are manipulated.
- the above-mentioned message signal is played back when played back by a playback device compatible with the old standard, so that it can be easily recorded by an inexpensive recording device compatible with the new standard.
- the message signal is played back in accordance with the part recorded in the new standard, so that which songs are actually recorded in the old standard is compliant with the old standard
- FIG. 20 shows an example in which recording is performed on a disc by the method described in the specification and drawings of Japanese Patent Application Publication No. 10-22935.
- the management data related to the new standard (codec B) is recorded separately from the management data related to the old standard (codec A).
- the old standard-compatible playback device firstly starts with the old standard start data number at address 0 and the old standard end data number at address 1 (these are the start data number and last data number in FIG. 18). (Corresponding to).
- the data recorded on this disc is determined from the old standard head data number and the old standard final data number. It can be interpreted that there is only one number from day 1 to day 1.
- the playback device conforming to the old standard uses address 5 (ie, address storage location information) in accordance with the old standard. ) To find the location in the management area where the address' data is stored.
- the playback device conforming to the old standard checks the contents from the address (address 116) indicated by the address storage position information of the address 5, thereby recording the audio signal of data number 0. Know the location (address 200,000).
- the old standard-compliant playback device ignores the codec designation information recorded at address 118, but is described in the specification and drawings of Japanese Patent Application Laid-Open Publication No. H10-22953.
- the audio signal of data number 0 is actually encoded by the A codec.
- the content of the audio signal with data number 0 is "Please use a B codec compatible player to reproduce the signal on this disc.” The confusion of the user of the compatible playback device can be avoided.
- a playback device that supports both the old standard and the new standard ie, a playback device that supports the new standard
- the playback device conforming to the new standard knows that this disc may be recorded based on the new standard (B codec) whose mode designation information value is 1. Therefore, based on the specification when the mode specification information is 1, the new standard-compatible playback device ignores the old standard head data number at address 0 and the old standard last data number at address 1, and replaces the new standard head address at address 3.
- the data to be played back on this disc is interpreted as four data numbers 2 to 5 for playback. That is, in this case, the message (the signal of the data number 0) for the playback device conforming to the old standard is not played back. However, this message can be played back by the new standard compatible playback device in order to pay attention to the user of this disc. In this case, the value of the new standard head data number at address 3 is set to 1 It is good.
- the data is recorded on the disc by the playback device conforming to the new standard. In addition to being able to reproduce a desired audio signal, the old-standard-compliant playback device plays back only a message of caution regarding disc playback, thereby avoiding unnecessary confusion to the user.
- the message signal that can be reproduced by the old-standard-compliant reproducing device is not the signal itself that one really wants to reproduce.
- Japanese Patent Application No. 9-42515 uses, for example, a recording format shown in FIG.
- the total number of bits that can be assigned to each frame is larger than Allocate a small number of bits to the minority channel.
- encoding is performed with a smaller number of bits than the total number of bits that can be allocated to each frame so that an empty recording area is formed in the frame.
- the number of bits used for the coding method of the B codec is relatively small, and the coding method of the A codec is used. Since a relatively large number of bits can be used, it is possible to minimize the degree of the sound quality deterioration.
- a signal of a channel that is not reproduced by the old standard-compatible reproducing apparatus that is, a signal of the B codec
- the signal reproduced by the old standard-compatible reproduction device can be converted to a multi-channel as described above. It is possible to minimize the decrease in sound quality due to the reduced number of bits allocated.
- various methods for actually increasing the coding efficiency such as increasing the length of the transform block, adopting a variable length code, and separating the signal component of the tone characteristic.
- FIG. 21 shows a specific example of a code string obtained by applying the technology described in the specification and the drawings of Japanese Patent Application No. 9-42515 described above.
- each frame composed of a fixed number of bits is divided into two regions, and the region 1 in Fig. 21 is divided into two regions.
- the signal of the (L + R) / 2 channel is encoded and recorded by the above-mentioned A codec encoding method, and the area 2 and the area shaded in the figure are recorded.
- the signal of the (L ⁇ R) / 2 channel is encoded and recorded by the encoding method of the B codec.
- the area 2 and the area 4 correspond to the empty recording area.
- the encoding method of the A codec is, for example, the encoding method described with reference to FIG. 11 described above.
- the encoding method of the B codec is, for example, a signal converted to a spectrum signal component with a conversion block length twice as long as that of the A codec, using the encoding method shown in Fig. 15 as an example. Can be listed.
- the conversion block length of the B codec at this time is twice as long as the conversion block length of the A codec, so that the code corresponding to the conversion block is recorded over two frames.
- the coding method of the A codec employs a fixed-length coding method. Therefore, a code sequence (hereinafter, A codec) obtained by the coding method of the A codec is used.
- a codec a code sequence obtained by the coding method of the A codec.
- the number of bits used by a codec sequence can be easily calculated. If the number of bits used by the A codec code string can be calculated in this way, the head position of the code string by the B codec coding method (hereinafter referred to as the B codec code string) can be easily determined. You can know.
- the B codec code string may be started from the end of the frame.
- the head position of the B codec code string can be easily known.
- the start position of the B codec code string can be easily calculated.
- a playback device that supports both the A codec and the B codec (a playback device that supports the new standard) can process both code strings quickly and in parallel. It becomes possible.
- the coding method of the A codec includes information on the number of coding units as shown in FIG. 11, as described above, an area for recording signals of other channels (an empty recording area) is provided. If the band of the channel to be coded by the coding method of the A codec is narrowed in order to secure, for example, the quantization accuracy data and the normalization coefficient data on the high frequency side can be omitted. It is convenient. Even in this case, the number of bits used for encoding by the encoding method of the A codec can be easily calculated.
- the signal of the (L + R) / 2 channel is recorded as an A codec code sequence, and the signal of the (L ⁇ R) / 2 channel is recorded as B Since it is recorded as a codec code string, for example, if only the area where the signal of the A codec is recorded is reproduced and decoded, it is possible to reproduce (L + R) / 2 monaural signals. Then, both the area where the signal of the A codec is recorded and the area where the signal of the B codec are recorded are reproduced and decoded, and the sum of them is calculated.
- a (right) channel signal can be generated, and if the difference is calculated, an L (left) channel signal can be generated, enabling stereo playback.
- the old-standard-compliant playback apparatus ignores the area coded by the coding method of the B codec.
- the monaural signal can be reproduced from the recording medium on which the code string is recorded.
- the recording medium on which the code string shown in FIG. A playback device equipped with a decoding circuit for decoding a deck code and a decoding circuit for decoding a B codec (a playback device conforming to the new standard) can reproduce stereo signals. In this way, after the old standard-compatible playback device has already spread, even if the encoding method shown in Fig.
- FIG. 21 shows a specific configuration of an encoding device that generates the above-described code sequence of FIG. 21.
- the input signal 190 a of the L channel and the input signal 190 b of the R channel are converted into a signal 19 9 corresponding to (L + R) / 2 by the channel conversion circuit 119 a. It is converted to a signal 190d corresponding to 0c and (L-R) / 2.
- the (L + R) / 2 signal 190c is sent to the first encoder circuit 119b, and the (L-R) / 2 signal 190d is sent to the second encoder circuit 119c.
- the first encoding circuit 119b corresponds to the signal component encoding circuit 111b of FIG. 4 having the configuration of FIG. 6, and the encoding method of the A codec is applied.
- the second encoding circuit 119c has a conversion block length twice as long as that of the first encoding circuit 119b, and the signal component encoding circuit shown in FIG. This is equivalent to 1 1 1 b, and the coding method of the above-mentioned B codec is applied.
- Both the A codec code string 190 e of the first coding circuit 119 b and the B codec code string 190 f of the second coding circuit 119 c generate code strings. It is supplied to the circuit 1 19 d.
- This code string generation circuit 119d generates the code string shown in FIG. 21 from the code strings 190e and 190f, and outputs it as an output code string signal 190g.
- FIG. 23 shows a flow of processing when the code sequence generation circuit 1 19 d of FIG. 22 generates the code sequence of FIG. 21.
- step S101 the frame number F is initialized to 1, and in step S102, the A codec code sequence 190 from the first coding circuit 119b is initialized. Receive e.
- step S103 it is determined whether or not the frame number F is even. If it is not even, the process proceeds to step S106, and if it is even, the process proceeds to step S104.
- step S104 the B codec code string 190f from the second encoding circuit 119c is received.
- step S105 the code sequence of FIG. 21 is synthesized from the code sequences 190e and 190f.
- step S106 it is determined whether or not the processing for all frames has been completed.
- the processing in FIG. 23 ends, and when not, in step S107, the processing in FIG.
- the frame number F is incremented by one, the process returns to step S102, and the above process is repeated.
- the frame number F starts from 1, but the processing unit of the encoding method of the B codec is two frames, which is twice the encoding method of the A codec. Column generation is also performed every two frames.
- FIG. 24 shows an image generated using the encoding method of the present invention described above.
- the specific configuration of the decoding device of the new standard-compatible playback device that decodes the encoded code string of FIG. 21 is shown.
- the input code sequence 200a which is the code sequence in FIG. 21, is separated into an A codec code sequence 200b and a B codec code sequence 200c by a code sequence separation circuit 120a.
- the A codec code string 200b is sent to the first decoding circuit 120b, and the B codec code string 200c is sent to the second decoding circuit 120c.
- the first decoding circuit 120b corresponds to the signal component decoding circuit 114b of FIG. 7 having the configuration of FIG. 9, and decodes an A-codec code.
- the second decoding circuit 120c has a conversion block length twice that of the second decoding circuit 120b, and corresponds to the signal component decoding circuit 114b of FIG. 7 having the configuration of FIG. It decodes the code of the B codec.
- the signal 200d decoded by the first decoding circuit 120b is equivalent to the (L + R) / 2 signal 190c
- the signal 200e decoded by the second decoding circuit 120c is (L — R) / 2 signal is equivalent to 1 90 d.
- the processing delay time differs between the (L + R) / 2 signal 200 d and the (L ⁇ R) / 2 signal 200 e because the conversion block lengths are different. Therefore, the (L + R) / 2 signal 200 d from the first decoding circuit 120 is sent to the memory circuit 120 d, and the (L—R) / 2 signal 200 e from the second decoding circuit 120 c is sent to the memory circuit 120 d. The difference is supplied to the memory circuit 120 e, and the memory circuit 120 d and 120 e absorb the processing delay time difference.
- the (L + R) / 2 signal 200f and the (L—) Z2 signal 200g passed through the memory circuits 120d and 120e, respectively, are sent to the channel conversion circuit 120f.
- This channel conversion circuit 120 f generates an L-channel signal 200 h by adding the (L + R) / 2 signal 200 f and the (L—R) / 2 signal 200 g, and generates (L + By subtracting (L-R) / 2 signal 200 g from R) / 2 signal 200 f, R channel signal 200 i is generated and these L channel and R channel signals are output.
- FIG. 25 shows the flow of processing when the code string separation circuit 120a in FIG. 24 separates the code string in FIG.
- step S201 the frame number F is initialized to 1, and in step S202, the A codec code sequence to be sent to the first decoding circuit 120b is separated and transmitted. .
- step S203 it is determined whether or not the frame number F is an odd number. If the frame number F is not an odd number, the process proceeds to step S205, and if it is an odd number, the process proceeds to step S204.
- step S204 the B codec code string to be sent to the second decoding circuit 120c is separated and transmitted.
- step S205 it is determined whether or not the processing for all the frames has been completed.
- the processing in FIG. 25 has been completed. Is incremented by 1 and the process returns to step S202, and the above-described process is repeated.
- the frame number F starts from 1, but the processing unit of the encoding method of the B codec is two frames, which is twice the encoding method of the A codec. This is done every two frames.
- B-codec signal an additional channel signal
- a stereo signal is recorded.
- quantization noise generated by encoding may cause a problem.
- a method according to an embodiment of the present invention for solving this problem will be described.
- FIG. 26 shows a specific example of a code string according to an embodiment obtained by applying the method of the present invention.
- the old standard only allowed recording of one channel signal (audio signal) of (L + R) / 2, whereas the channel configuration data and the additional channel
- the new standard shows an example of a format that enables recording of two-channel signals by combining signals with (L + R) / 2 and additional channels by recording signals.
- a channel corresponding to (L + R) / 2 is referred to as an A channel
- a channel corresponding to an additional channel is referred to as a B channel.
- the two channels are actually recorded because the space for recording the channel configuration data is secured after the A channel signal, and the channel configuration data has a value other than 0. Is the case.
- the signal of the B channel is one of (L ⁇ R) / 2, L, and R, and the value of the channel configuration data indicates which of these is recorded.
- the signal of (L + R) / 2 when the value of the channel configuration data is 2, the signal of L, and when the value of the channel configuration data is 3, the signal of R is Each is recorded.
- the user can receive only the desired audio signal of either the A channel or the B channel. It will be possible to hear.
- the channel configuration data is set to a value of 0, it is also possible to record only the A channel or the B channel without using all the bits allocated to the frame.
- FIG. 27 shows a configuration example of an encoding device according to an embodiment of the present invention that generates the code sequence shown in FIG.
- an input signal 210a is an L-channel and an R-channel signal, and the input signal 210a is converted to an A-channel, that is, (L + R) by a channel conversion circuit 121a.
- the signal is converted into a signal 210b corresponding to / 2 and a signal 210c corresponding to the B channel.
- the A channel signal 210b is sent to the first encoding circuit 121b, and the B channel signal 210c is sent to the second encoding circuit 121c.
- the first encoding circuit 121b encodes the A channel signal 210b
- the second encoding circuit 121c encodes the B channel signal 210c.
- the A-channel code sequence 210 d from the first coding circuit 121b and the B-channel code sequence 210e from the second coding circuit 121c are both code sequence generation circuits. Supplied to 1 2 1 d.
- the code sequence generation circuit 121 d generates the code sequence shown in FIG. 26 from the code sequences 210 d and 210 e and outputs it as an output code sequence signal 210 h.
- a control circuit 121 e is provided.
- This control circuit 1 2 1 e converts the channel conversion circuit 1 2 1 a to the code string generation circuit 1 2 1 d according to the input signal 2 10 f specifying the encoding mode and the input signal 2 10 a.
- the code string of FIG. 26 is generated as shown in the flowcharts of FIGS. 28 and 29 described later.
- Control signal 210g which is controlled in such a manner as to send the signal to each component.
- the control signal 210g contains information indicating whether to encode in stereo mode or monaural mode, and information indicating in what mode to encode the B channel, that is, (L-R) / 2, R , L, which indicates which signal to encode.
- FIG. 28 shows a processing flow of the control circuit 122 e in the configuration of FIG.
- step S301 (L + R) / 2 is set for the A channel.
- step S302 it is determined whether or not the stereo mode is set based on the input signal 21Of of FIG. 27. Ends the processing. In the case of the stereo mode, go to step S303.
- step S303 the signal energy of each frame of the input signal 210a in FIG. 27 is obtained for each channel ⁇ , the energy of the L channel is set to E1, and the energy of the R channel is set to Er.
- step S304 the energies E1 and Er are compared, and the ratio of E1 to Er (E1 / Er) is smaller than, for example, 30 dB, that is, the energy of the R channel is reduced to the energy of the L channel.
- the process proceeds to step S305, and if it is 30 dB or more, the R channel is set to the B channel in step S308.
- step S305 similarly, the energies E1 and Er are compared, and the ratio of Er to E1 (Er / El) is smaller than 30 dB, that is, the energy of the L channel is smaller than the energy of the R channel.
- (L-R) / 2 is set for the B channel in step S306, and if it is large, the L channel is set for the B channel in step S307.
- the channel information set here is output as the control signal 210g in FIG.
- the channel ratio of the B channel is selected by comparing the energy ratio of the L and R channels with the value of 30 dB. It is good, and the value of 30 dB may be changed according to the level of quantization noise and the like, and may be a value of 10 dB, for example.
- FIG. 29 shows a flow of processing when generating a code string as shown in FIG. 26 based on the control signal 210g in the configuration of FIG. In the example of FIG. 29, it is assumed that, for example, 200 bytes are assigned to each frame.
- step S401 it is determined whether or not recording and reproduction are to be performed in stereo as described above.
- the mode designation signal 210 of FIG. 27 indicates the stereo mode
- the processing after step S402 is performed. The process proceeds to S405.
- step S402 when encoding in the stereo mode is instructed in step S401, the above A channel, that is, (L + R) is used using 150 bytes. ) / 2 signal is encoded.
- the channel configuration data is generated and encoded using one byte.
- step S404 the B-channel signal is encoded using 49 bytes.
- the channel configuration data is 1 when the B channel is set to (L-R) / 2, 2 when the L channel is set, and 2 when the R channel is set. Are coded as 3 respectively.
- step S405 the signal of the A channel, that is, (L + R) / 2, is encoded using 200 bytes.
- FIG. 30 shows a specific example of a decoding device according to the present embodiment that decodes a code string as shown in FIG.
- the input code string 220a which is the code string in FIG. 26, is converted into an A channel code string 220b and a B channel code string 220c by a code string separation circuit 122a. Separated.
- the A channel code sequence 220b corresponds to the A channel code sequence 210d
- the B channel code sequence 220c corresponds to the B channel code sequence 210e.
- the A channel code sequence 222b is sent to the first decoding circuit 122b
- the B channel code sequence 220c is sent to the second decoding circuit 122c.
- the first decoding circuit 122b decodes the code string 220b of the A channel
- the second decoding circuit 122c decodes the code string 220c of the B channel.
- the A channel signal 220 d decoded by the first decoding circuit 122 b and the B channel signal 220 e decoded by the second decoding circuit 122 c respectively Since the byte length is different, there is a difference in the processing delay time. Therefore, the A channel signal 220 d from the first decoding circuit 122 is stored in the memory circuit 122 d, and the B channel signal 220 e from the second decoding circuit 122 c is stored in the memory circuit 122 d. The data is supplied to a circuit 122e, and the memory circuit 122d and 122e absorb the processing delay time difference. The A channel signal 220 f and the B channel signal 220 g that have passed through the memory circuits 122 d and 122 e are sent to the channel conversion circuit 122 f.
- the channel conversion circuit 1 2 2 f is connected to the A channel, that is, (L + R) / 2 Generates an audio signal from the signal 220 f and the B-channel signal 220 g and outputs them.
- the channel configuration data is also separated from the input code string 220a by the code string decomposition circuit 122a.
- the above-described channel separation data is separated and the above-described configuration is performed from the code string separation circuit 122a to the channel conversion circuit 122f.
- a control signal 22 0 h for performing such a decoding processing operation is generated and sent to each component.
- the monaural mode only the A-channel code 222b is output from the code string separation 122a, and the monaural signal is formed in the configuration after the first decoding circuit 122b. Is played.
- FIG. 31 shows a processing flow of the configuration of FIG. 30 for decoding the code string shown in FIG. 26.
- step S501 the first code string of the input code string 220a, that is, the number of bytes L1 of the A-channel code string 220b is obtained by calculation.
- step S502 it is determined whether or not the number of bytes L1 is smaller than 200.
- Step S502 determines whether the mode is the monaural mode or the stereo mode. That is, in step S502, determination is made based on the number of bytes in the code string to determine whether the data is recorded in the old standard or in the new standard. By judging from the number of bytes of the code string in this way, it becomes possible to change the mode for each frame or every several frames.
- the discrimination between the monaural mode and the stereo mode can be embedded as mode designation information in the management data as shown in FIG.
- step S503 it is determined whether or not the value of the channel configuration data is 0. If it is 0, the process proceeds to step S504, and if it is not 0, the process proceeds to step S505.
- step S504 a control signal 220h for decoding the signals of the A channel and the B channel is generated and sent to each component, whereby, as described above, the A channel code sequence 220 b is decoded by the first decoding circuit 122b, and the B-channel code sequence 220c is decoded by the second decoding circuit 122c.
- step S505 since the mode is the monaural mode, a control signal 220h for decoding the A channel signal is generated and sent to each component, and as described above, the A channel code sequence 220 Only b is decoded by the first decoding circuit 122 b.
- FIG. 32 shows a processing example of a method of determining the channel setting when the code string separation circuit 122 a in FIG. 30 is in the stereo mode of decoding.
- step S601 it is determined whether or not the channel configuration data is zero. If the channel configuration data is 0, the process proceeds to step S602, in which the A channel is output together with the L channel and the R channel in step S602, and the control signal 220h in FIG. 30 is output. Generate
- step S601 determines whether the channel configuration data is 1 or not. If it is determined that the channel configuration data is 1, the process proceeds to step S604.
- step S604 the control signal 222h of FIG. 30 is generated assuming that (A + B) is output as the L channel and (A—B) is output as the R channel.
- step S605 it is determined whether or not the channel configuration data is 2. If it is determined in step S 605 that the channel configuration data is 2, the process proceeds to step S 606, and in step S 606, the B channel is set as the L channel and the (2 A) is set as the R channel. — Generate the control signal 220h of Fig. 30 assuming that the channels of B) are output respectively. If it is determined in step S605 that the channel configuration data is not 2, the process proceeds to step S607. In this step S607, it is determined whether or not the channel configuration data is three.
- step S607 If it is determined in step S607 that the channel configuration data is 3, the flow advances to step S608, and in this step S608, the L channel is set as (2A-B): R
- the control signal 220h of FIG. 30 is generated assuming that the B channel is output as a channel. If it is determined in step S607 that the channel configuration data is not 3, the process ends.
- the signals of the L and R channels are output from the channel conversion circuit 122 in the stereo mode.
- monaural mode a monaural signal is output.
- the monaural signal is output by outputting the A channel signal together with the L channel and the R channel as described above. can get.
- the channel configuration data is 1, as described above, an L-channel signal is obtained at (A + B) and an R-channel signal is obtained at (A-B). If the channel configuration data is 2, the L channel signal is obtained from the B channel, and the R channel signal is obtained at (2 AB). Further, when the channel configuration data is 3, an L-channel signal is obtained at (2A-B), and an R-channel signal is obtained from the B channel.
- FIGS. 33A and 33B shown in FIGS. 1 and 2 An example of processing a stereo signal as shown in FIGS. 33A and 33B shown in FIGS. 1 and 2 using the method of the embodiment of the present invention will be described.
- Figures 33A and 33B show the time axis waveforms of the left channel component (L) of the stereo signal and the right channel component (R) of the stereo signal, respectively.
- Figures 33C and 33D show the L and R components, respectively.
- FIGS. 33E and 33F show that the above (L + R) / 2 A channel and R 2 B channel signals are respectively coded and decoded by the above-described high efficiency coding method according to the embodiment of the present invention.
- Each axis waveform is shown.
- Each can be represented.
- Fig. 33 G and 33H are obtained from the signal waveforms of (A + N1) and (B + N2). This figure shows how the left and right channels of a stereo signal are separated.
- the L channel is generated by (2A-B), and the R channel is the B channel itself.
- the playback device conforming to the old standard can reproduce a small number of channels
- the playback device conforming to the new standard can reproduce a larger number of channels, and optimizes channel conversion to minimize sound quality deterioration.
- the encoding method, the decoding method, and the encoded recorded medium are all included in the method of the present invention.
- the whole is divided into two bands, then the spectrum is transformed, and the spectrum coefficients are normalized and quantized and encoded with a fixed length.
- the second encoding method After dividing the whole into two bands, the spectrum is transformed, the spectrum coefficients are separated into tone components and other components, and each is normalized and quantized and coded with variable length
- various other encoding methods are conceivable.
- a first encoding method a time-series signal that is divided into bands and then decimated according to the bandwidth is used.
- the second encoding method is to perform spectrum conversion on the time-series signal of the entire band, and normalize and quantize the spectrum coefficients.
- a method of quantizing and coding may be adopted.
- by adopting a method with the highest possible encoding efficiency as described above the deterioration in sound quality when played back by a playback device conforming to the old standard is minimized. It is desirable.
- the method of the present invention is also applicable to a case where a signal reproduced by an old-standard-compliant reproducing apparatus is, for example, an image signal. That is, for example, when a luminance signal is encoded as a code string of the old standard, the color difference signal and the hue signal can be added to the code string by using the method of the present invention.
- the channel in the present invention includes a luminance signal, a color difference signal, and a hue signal in the case of a video.
- the method of the present invention can be applied to the case where the bit stream is transmitted. Also, as long as the recording medium can be randomly accessed, it is needless to say that not only a recording medium such as an optical disk but also a semiconductor memory or the like can be used.
- the signal encoding method and apparatus of the present invention the first signal generated from the signals of the plurality of input channels is encoded and some of the channels are encoded. And a second signal composed of only signals of some channels or a second signal generated from signals of a plurality of input channels, based on the signal levels of the other channels.
- the first and second signals are encoded using different encoding methods, for example, while enabling playback by the old standard-compatible playback device,
- quantization noise caused by encoding can be minimized and sound quality deterioration can be reduced.
- the first encoded signal generated and encoded from the signals of the plurality of channels and the second signal or the second signal including only the signals of some of the channels are provided.
- Decoding a code string including a second coded signal obtained by selecting and coding any one of the second signals generated from the signals of the plurality of channels, and selecting the second signal from the code string The selection information that specifies the situation is extracted to control the decoding, and the first and second encoded signals are decoded by different decoding methods at the time of decoding.
- the playback by the device is possible, and the signal extended by the new standard can be played back, the quantization noise generated by encoding and decoding is minimized, and the deterioration of sound quality can be reduced.
- a plurality of input channels A first coded signal obtained by coding the first signal generated from the signal, and a second signal generated from only a part of channel signals or a second signal generated from a plurality of input channel signals.
- a second coded signal selected and coded, a series of coded parameters thereof, and a code string having selection information of the second signal are recorded, and the first and second coded signals are recorded. Due to the different signal encoding methods, for example, it is possible to reproduce by the old standard compatible playback device and also to reproduce the signal extended by the new standard, and minimize the quantization noise generated by encoding and decoding. To minimize sound quality degradation.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Stereo-Broadcasting Methods (AREA)
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP98912767A EP0924962B1 (en) | 1997-04-10 | 1998-04-10 | Encoding method and device, decoding method and device, and recording medium |
US09/202,157 US6741965B1 (en) | 1997-04-10 | 1998-04-10 | Differential stereo using two coding techniques |
JP54260898A JP3887827B2 (ja) | 1997-04-10 | 1998-04-10 | 符号化方法及び装置、復号化方法及び装置、並びに記録媒体 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP9244897 | 1997-04-10 | ||
JP9/92448 | 1997-04-10 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1998046045A1 true WO1998046045A1 (fr) | 1998-10-15 |
Family
ID=14054698
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP1998/001672 WO1998046045A1 (fr) | 1997-04-10 | 1998-04-10 | Procede et dispositif de codage, procede et dispositif de decodage et support d'enregistrement |
Country Status (5)
Country | Link |
---|---|
US (1) | US6741965B1 (ja) |
EP (1) | EP0924962B1 (ja) |
JP (1) | JP3887827B2 (ja) |
CN (1) | CN1205842C (ja) |
WO (1) | WO1998046045A1 (ja) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2003032296A1 (fr) * | 2001-10-03 | 2003-04-17 | Sony Corporation | Appareil et procede de codage, appareil et procede de decodage et appareil et procede d'enregistrement de support d'enregistrement |
JP2006195471A (ja) * | 2005-01-13 | 2006-07-27 | Samsung Electronics Co Ltd | 多チャンネル信号符号化/復号化方法及びその装置 |
WO2006118179A1 (ja) * | 2005-04-28 | 2006-11-09 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置および音声符号化方法 |
WO2006118178A1 (ja) * | 2005-04-28 | 2006-11-09 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置および音声符号化方法 |
JP2007219542A (ja) * | 2003-04-30 | 2007-08-30 | Coding Technologies Ab | 複素指数変調フィルタバンクを基にした新型プロセッシングおよび適応型時間信号伝達方法 |
JP2009010841A (ja) * | 2007-06-29 | 2009-01-15 | Kenwood Corp | ステレオ復調装置及びステレオ復調方法 |
JP2011227256A (ja) * | 2010-04-19 | 2011-11-10 | Toshiba Corp | 信号補正装置 |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20030040203A (ko) * | 2001-05-11 | 2003-05-22 | 마쯔시다덴기산교 가부시키가이샤 | 부호화 장치, 복호화 장치 및 방송 시스템 |
US6934677B2 (en) | 2001-12-14 | 2005-08-23 | Microsoft Corporation | Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands |
US7240001B2 (en) | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
US7299190B2 (en) | 2002-09-04 | 2007-11-20 | Microsoft Corporation | Quantization and inverse quantization for audio |
JP4676140B2 (ja) | 2002-09-04 | 2011-04-27 | マイクロソフト コーポレーション | オーディオの量子化および逆量子化 |
US7502743B2 (en) | 2002-09-04 | 2009-03-10 | Microsoft Corporation | Multi-channel audio encoding and decoding with multi-channel transform selection |
US7460990B2 (en) | 2004-01-23 | 2008-12-02 | Microsoft Corporation | Efficient coding of digital media spectral data using wide-sense perceptual similarity |
SE0400998D0 (sv) * | 2004-04-16 | 2004-04-16 | Cooding Technologies Sweden Ab | Method for representing multi-channel audio signals |
US7895012B2 (en) * | 2005-05-03 | 2011-02-22 | Hewlett-Packard Development Company, L.P. | Systems and methods for organizing and storing data |
US7539612B2 (en) | 2005-07-15 | 2009-05-26 | Microsoft Corporation | Coding and decoding scale factor information |
CN101427307B (zh) * | 2005-09-27 | 2012-03-07 | Lg电子株式会社 | 编码/解码多声道音频信号的方法和装置 |
KR100888474B1 (ko) | 2005-11-21 | 2009-03-12 | 삼성전자주식회사 | 멀티채널 오디오 신호의 부호화/복호화 장치 및 방법 |
JP2007179641A (ja) * | 2005-12-27 | 2007-07-12 | Sony Corp | 情報処理装置、コンテンツ再生可否判定通知方法及びコンテンツ再生可否判定通知プログラム |
US7953604B2 (en) | 2006-01-20 | 2011-05-31 | Microsoft Corporation | Shape and scale parameters for extended-band frequency coding |
US8190425B2 (en) | 2006-01-20 | 2012-05-29 | Microsoft Corporation | Complex cross-correlation parameters for multi-channel audio |
US7831434B2 (en) * | 2006-01-20 | 2010-11-09 | Microsoft Corporation | Complex-transform channel coding with extended-band frequency coding |
US8254717B2 (en) * | 2006-04-21 | 2012-08-28 | Tp Vision Holding B.V. | Picture enhancement by utilizing quantization precision of regions |
US7885819B2 (en) | 2007-06-29 | 2011-02-08 | Microsoft Corporation | Bitstream syntax for multi-process audio decoding |
US20090319279A1 (en) * | 2008-06-19 | 2009-12-24 | Hongwei Kong | Method and system for audio transmit loopback processing in an audio codec |
CN102157151B (zh) * | 2010-02-11 | 2012-10-03 | 华为技术有限公司 | 一种多声道信号编码方法、解码方法、装置和系统 |
US9679572B2 (en) | 2013-04-23 | 2017-06-13 | The Korea Development Bank | Method and apparatus for encoding/decoding scalable digital audio using direct audio channel data and indirect audio channel data |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6452400U (ja) * | 1987-09-28 | 1989-03-31 | ||
JPH01247000A (ja) * | 1988-03-29 | 1989-10-02 | Asahi Optical Co Ltd | ディジタル音声再生装置 |
JPH02210666A (ja) * | 1989-02-09 | 1990-08-22 | Pioneer Electron Corp | 記録再生方式 |
JPH07221717A (ja) * | 1994-02-01 | 1995-08-18 | Graphics Commun Lab:Kk | Pcmオーディオ信号の変換方法と装置 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
CA2090052C (en) * | 1992-03-02 | 1998-11-24 | Anibal Joao De Sousa Ferreira | Method and apparatus for the perceptual coding of audio signals |
EP0559348A3 (en) * | 1992-03-02 | 1993-11-03 | AT&T Corp. | Rate control loop processor for perceptual encoder/decoder |
AU673085B2 (en) * | 1993-05-31 | 1996-10-24 | Sony Corporation | Apparatus and method for coding or decoding signals, and recording medium |
JP3175446B2 (ja) * | 1993-11-29 | 2001-06-11 | ソニー株式会社 | 情報圧縮方法及び装置、圧縮情報伸張方法及び装置、圧縮情報記録/伝送装置、圧縮情報再生装置、圧縮情報受信装置、並びに記録媒体 |
JP3277692B2 (ja) * | 1994-06-13 | 2002-04-22 | ソニー株式会社 | 情報符号化方法、情報復号化方法及び情報記録媒体 |
US6041295A (en) * | 1995-04-10 | 2000-03-21 | Corporate Computer Systems | Comparing CODEC input/output to adjust psycho-acoustic parameters |
TW429700B (en) * | 1997-02-26 | 2001-04-11 | Sony Corp | Information encoding method and apparatus, information decoding method and apparatus and information recording medium |
-
1998
- 1998-04-10 JP JP54260898A patent/JP3887827B2/ja not_active Expired - Fee Related
- 1998-04-10 EP EP98912767A patent/EP0924962B1/en not_active Expired - Lifetime
- 1998-04-10 CN CNB988007819A patent/CN1205842C/zh not_active Expired - Fee Related
- 1998-04-10 WO PCT/JP1998/001672 patent/WO1998046045A1/ja active IP Right Grant
- 1998-04-10 US US09/202,157 patent/US6741965B1/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6452400U (ja) * | 1987-09-28 | 1989-03-31 | ||
JPH01247000A (ja) * | 1988-03-29 | 1989-10-02 | Asahi Optical Co Ltd | ディジタル音声再生装置 |
JPH02210666A (ja) * | 1989-02-09 | 1990-08-22 | Pioneer Electron Corp | 記録再生方式 |
JPH07221717A (ja) * | 1994-02-01 | 1995-08-18 | Graphics Commun Lab:Kk | Pcmオーディオ信号の変換方法と装置 |
Non-Patent Citations (2)
Title |
---|
JAMES D. JOHNSTON: "Perceptual Transform Coding of Wide-band Stereo Signals", ICASSP, vol. 89, pages 1993 - 1995 |
See also references of EP0924962A4 |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7318026B2 (en) | 2001-10-03 | 2008-01-08 | Sony Corporation | Encoding apparatus and method, decoding apparatus and method, and recording medium recording apparatus and method |
WO2003032296A1 (fr) * | 2001-10-03 | 2003-04-17 | Sony Corporation | Appareil et procede de codage, appareil et procede de decodage et appareil et procede d'enregistrement de support d'enregistrement |
JP2007219542A (ja) * | 2003-04-30 | 2007-08-30 | Coding Technologies Ab | 複素指数変調フィルタバンクを基にした新型プロセッシングおよび適応型時間信号伝達方法 |
JP4602375B2 (ja) * | 2003-04-30 | 2010-12-22 | ドルビー インターナショナル アクチボラゲット | ステレオ信号を符号化する符号化装置、符号化方法、復号器、復号化方法および符号化方法又は復号化方法を実行させるためのコンピュータプログラム |
JP2006195471A (ja) * | 2005-01-13 | 2006-07-27 | Samsung Electronics Co Ltd | 多チャンネル信号符号化/復号化方法及びその装置 |
US8428956B2 (en) | 2005-04-28 | 2013-04-23 | Panasonic Corporation | Audio encoding device and audio encoding method |
WO2006118179A1 (ja) * | 2005-04-28 | 2006-11-09 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置および音声符号化方法 |
WO2006118178A1 (ja) * | 2005-04-28 | 2006-11-09 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置および音声符号化方法 |
CN101167126B (zh) * | 2005-04-28 | 2011-09-21 | 松下电器产业株式会社 | 语音编码装置和语音编码方法 |
US8433581B2 (en) | 2005-04-28 | 2013-04-30 | Panasonic Corporation | Audio encoding device and audio encoding method |
JP2009010841A (ja) * | 2007-06-29 | 2009-01-15 | Kenwood Corp | ステレオ復調装置及びステレオ復調方法 |
JP2011227256A (ja) * | 2010-04-19 | 2011-11-10 | Toshiba Corp | 信号補正装置 |
US8532309B2 (en) | 2010-04-19 | 2013-09-10 | Kabushiki Kaisha Toshiba | Signal correction apparatus and signal correction method |
Also Published As
Publication number | Publication date |
---|---|
CN1205842C (zh) | 2005-06-08 |
US6741965B1 (en) | 2004-05-25 |
JP3887827B2 (ja) | 2007-02-28 |
EP0924962A4 (en) | 2006-05-31 |
EP0924962A1 (en) | 1999-06-23 |
EP0924962B1 (en) | 2012-12-12 |
CN1228236A (zh) | 1999-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3887827B2 (ja) | 符号化方法及び装置、復号化方法及び装置、並びに記録媒体 | |
KR100565017B1 (ko) | 정보부호화방법과장치와정보복호화방법과장치및정보기록매체 | |
US6169973B1 (en) | Encoding method and apparatus, decoding method and apparatus and recording medium | |
JP3496411B2 (ja) | 情報符号化方法及び復号化装置 | |
JP3123286B2 (ja) | ディジタル信号処理装置又は方法、及び記録媒体 | |
JPH1132399A (ja) | 符号化方法及び装置、並びに記録媒体 | |
US6356211B1 (en) | Encoding method and apparatus and recording medium | |
JP3123290B2 (ja) | 圧縮データ記録装置及び方法、圧縮データ再生方法、記録媒体 | |
KR100742260B1 (ko) | 부호화장치 및 방법, 기록매체와 복호장치 및 방법 | |
EP0963055B1 (en) | Method and apparatus for transforming coded information signals | |
JP4211166B2 (ja) | 符号化装置及び方法、記録媒体、並びに復号装置及び方法 | |
US7257226B1 (en) | Information management method and information management apparatus | |
US6963860B1 (en) | Information recording and reproducing system and method and distribution medium | |
JP4470304B2 (ja) | 圧縮データ記録装置、記録方法、圧縮データ記録再生装置、記録再生方法および記録媒体 | |
EP0971350A1 (en) | Information encoding device and method, information decoding device and method, recording medium, and provided medium | |
JP2000036755A (ja) | 符号変換方法及び装置、並びにプログラム供給媒体 | |
KR100563161B1 (ko) | 부호화 방법 및 장치, 복호화 방법 및 장치,기록 매체 | |
JP3175456B2 (ja) | ディジタル信号処理装置 | |
JP3854313B2 (ja) | 複数の情報信号のエンコーディング | |
JP4123632B2 (ja) | 情報符号化方法及び装置、並びにプログラム供給媒体 | |
JP4066740B2 (ja) | 信号再生装置及び方法、信号記録装置及び方法、並びに符号列生成方法及び装置 | |
JPH10302405A (ja) | 情報符号化方法及び装置、情報復号化方法及び装置、並びに情報記録媒体 | |
Smyth | An Overview of the Coherent Acoustics Coding System | |
JPH1022935A (ja) | 情報符号化方法、情報復号化装置及び情報記録媒体 | |
JP2000036753A (ja) | 符号変換方法及び装置、並びにプログラム供給媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 98800781.9 Country of ref document: CN |
|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): CN JP KR US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1998912767 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1019980710131 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09202157 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 1998912767 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1019980710131 Country of ref document: KR |
|
WWG | Wipo information: grant in national office |
Ref document number: 1019980710131 Country of ref document: KR |