WO1995001680A1 - Dispositif de codage de signaux numeriques, son dispositif de decodage, et son support d'enregistrement - Google Patents
Dispositif de codage de signaux numeriques, son dispositif de decodage, et son support d'enregistrement Download PDFInfo
- Publication number
- WO1995001680A1 WO1995001680A1 PCT/JP1994/001056 JP9401056W WO9501680A1 WO 1995001680 A1 WO1995001680 A1 WO 1995001680A1 JP 9401056 W JP9401056 W JP 9401056W WO 9501680 A1 WO9501680 A1 WO 9501680A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- component
- encoding
- decoding
- components
- Prior art date
Links
- 238000006243 chemical reaction Methods 0.000 claims abstract description 110
- 238000000926 separation method Methods 0.000 claims abstract description 12
- 238000010606 normalization Methods 0.000 claims description 54
- 238000000034 method Methods 0.000 claims description 51
- 230000005236 sound signal Effects 0.000 claims description 15
- 230000001256 tonic effect Effects 0.000 claims description 12
- 230000015572 biosynthetic process Effects 0.000 claims description 10
- 238000003786 synthesis reaction Methods 0.000 claims description 10
- 230000002194 synthesizing effect Effects 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 230000002542 deteriorative effect Effects 0.000 abstract description 2
- 238000001228 spectrum Methods 0.000 description 130
- 238000013139 quantization Methods 0.000 description 42
- 230000003595 spectral effect Effects 0.000 description 22
- 238000010586 diagram Methods 0.000 description 21
- 230000002093 peripheral effect Effects 0.000 description 21
- 238000012545 processing Methods 0.000 description 17
- 238000000605 extraction Methods 0.000 description 11
- 230000009466 transformation Effects 0.000 description 7
- 241000282326 Felis catus Species 0.000 description 3
- 230000000873 masking effect Effects 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B14/00—Transmission systems not characterised by the medium used for transmission
- H04B14/02—Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation
- H04B14/04—Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation using pulse code modulation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/667—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using a division in frequency subbands
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Definitions
- the present invention encodes a digital signal such as input digital data by high-efficiency encoding.
- BACKGROUND ART Conventionally, there have been various methods for high-efficiency coding of signals such as audio and voice, but a typical method is to block audio signals on a time axis without blocking in a certain unit time.
- Sub-band coding which is a non-blocking frequency band division method that divides and divides a signal into frequency bands, and blocks signals on the time axis in a certain unit time and converts each block.
- a so-called conversion coding which is a block frequency band division method in which the signal is converted into a signal on the frequency axis (spectrum conversion), divided into a plurality of frequency bands, and encoded for each band, can be cited.
- the above-mentioned band division coding and conversion code In this case, for example, after performing band division by the above-described band division coding, a signal for each band is converted to a signal on the frequency axis. Then, the spectrum is converted into a spectrum, and coding is performed for each band subjected to the spectrum conversion.
- a filter for band division used in the above-mentioned band division coding or the above-mentioned combination high-efficiency coding method for example, a so-called
- ICASSP 83 BOSTON Polyphase Quadrature filters-A new subband coding technique Joseph H. Rothweiler describes a filter division method of equal bandwidth.
- an input audio signal is divided into blocks in a predetermined unit time (frame), and a discrete Fourier transform (DFT), a cosine transform (DCT), a modi?
- DFT discrete Fourier transform
- DCT cosine transform
- MDCT DCT transformation
- the band in which the quantization noise is generated can be controlled, and the properties such as the masking effect are used.
- quantization is performed here. If the normalization is performed for each band beforehand, for example, using the maximum value of the absolute value of the signal component in that band, more efficient coding can be performed.
- a frequency division width for quantizing each frequency component obtained by dividing the frequency band for example, band division is performed in consideration of human auditory characteristics. That is, an audio signal may be divided into a plurality of bands (for example, 25 bands) with a bandwidth such that the higher the band generally called the critical band, the wider the bandwidth.
- a predetermined bit allocation for each band or an encoding by an adaptive bit allocation (bit allocation) for each band is required. Done. For example, when the coefficient data obtained by the MDCT processing is encoded by the bit allocation, the MDCT coefficient data of each band obtained by the MDCT processing of each block is calculated as follows. Encoding will be performed with the adaptive number of allocated bits.
- bit allocation is performed based on the signal size for each band.
- the quantization noise spectrum becomes flat and the noise energy is minimized, but the actual noise sensation is not optimal because the masking effect is not used in terms of hearing.
- ICASSP 1980 The critical band coder --digital encoding of the perceptual requirements of the auditory system .A. Kransner MIT uses auditory masking. It describes a technique for obtaining a required signal-to-noise ratio for each band and performing fixed bit allocation. However, in this method, even when the characteristics are measured with a sine wave input, the characteristic values are not so good because the bit allocation is fixed.
- the conventional signal encoding device will be described with reference to FIGS. Will be explained.
- the acoustic signal waveform supplied via the terminal 100 is converted into a signal frequency component by the conversion circuit 101, and then each component is coded by the signal component coding circuit 102. Then, a code sequence is generated by the code sequence generation circuit 103 and output from the terminal 104.
- FIG. 13 shows a specific configuration of the conversion circuit 101 of FIG.
- the signal supplied via terminal 200 (the signal via terminal 100 in FIG. 12) is divided into three signals by two-stage band division filters 201 and 202. Divided into bands.
- the band division filter 201 the signal passing through the terminal 200 is thinned out to 12, and in the band division filter 202, one of the signals thinned out to 12 by the above band division filter 201 is further processed. It is decimated to 1 (the signal at terminal 2000 is decimated to 1/4). That is, the bandwidth of the two signals from the band division filter 202 is 1Z4, which is the bandwidth of the signal from the terminal 200.
- the signals of each band divided into three bands as described above by the band division filters 201 and 202 are respectively forward spectrum transform circuits 203 and 2 for performing spectrum transformation such as MDCT.
- the spectrum signal component is made by 04 and 205.
- the outputs of the forward spectrum conversion circuits 203, 204, and 205 are sent to the signal component encoding circuit 102 of FIG.
- FIG. 14 shows a specific configuration of the signal component encoding circuit 102 of FIG.
- the output from the signal component encoding circuit 102 supplied to the terminal 300 is supplied to a predetermined band by the normalizing circuit 301. After normalization is performed every time, it is sent to the quantization circuit 303. The signal supplied to the terminal 300 is sent to the quantization accuracy determination circuit 302.
- the quantization circuit 303 converts the signal from the normalization circuit 301 into a signal based on the quantization accuracy calculated by the quantization accuracy determination circuit 303 from the signal via the terminal 300.
- the quantization is applied to the data.
- the output from the quantization circuit 303 is output from the terminal 304 and sent to the code string generation circuit 103 in FIG.
- the output signal from the terminal 304 includes, in addition to the signal component quantized by the quantization circuit 303, the normalization coefficient information in the normalization circuit 301 and the quantization accuracy.
- the quantization accuracy information in the decision circuit 302 is also included.
- FIG. 15 shows a schematic configuration of a decoding device that decodes and outputs an audio signal from the code sequence generated by the coding device having the configuration of FIG. 12. In FIG.
- each signal component is extracted from the code string generated by the configuration shown in FIG. 12 and supplied by the code string decomposition circuit 401. From these codes, each signal component is restored by a signal component decoding circuit 402, and then an inverse conversion corresponding to the conversion of the conversion circuit 101 in FIG. 12 is performed by an inverse conversion circuit 403. Will be applied. As a result, an acoustic waveform signal is obtained, and this acoustic waveform signal is output from the terminal 404.
- FIG. 16 shows a specific configuration of the inverse conversion circuit 403 in FIG.
- the configuration in Fig. 16 corresponds to the configuration example of the conversion circuit shown in Fig. 13.
- the signals supplied from the signal component decoding circuit 402 via the terminals 501, 502, and 503 are the inverse signals corresponding to the forward spectrum conversion in FIG. 13, respectively.
- the conversion is performed by an inverse spectrum conversion circuit 504, 505, 506 which performs the spectrum conversion.
- the signals in each band obtained by these inverse spectrum conversion circuits 504, 505, 506 are synthesized by a two-stage band synthesis filter.
- the outputs of the inverse spectrum conversion circuits 505 and 506 are sent to the band combining filter 507 to be combined, and the output of the band combining filter 507 is combined with the above-mentioned inverse spectrum.
- the output of the conversion circuit 504 is synthesized by the band synthesis filter 508.
- the output of the band synthesis filter 508 will be output from terminal 509 (terminal 404 in Fig. 15) o
- FIG. 17 is a diagram for describing an encoding method conventionally performed in the encoding apparatus shown in FIG.
- the spectrum signal was obtained by the conversion circuit shown in Fig. 13.
- Fig. 17 shows that the level of the absolute value of the spectrum signal obtained by MDCT is converted to a dB value. It is shown.
- the input signal is converted into 64 spectrum signals for each predetermined time block, which corresponds to five predetermined bandwidths indicated by b1 to b5 in the diagram of FIG. Normalization and quantization are performed collectively for each group (this is called an encoding unit here).
- the bandwidth of each coding unit is narrow on the low frequency side and wide on the high frequency side, so that it is possible to control the generation of quantization noise that matches the auditory characteristics.
- the frequency component is quantized.
- the band to be converted is fixed.
- Many bits must be allocated to the many spectra to which they belong.
- the normalization is performed collectively for each predetermined band, for example, in the band b3 in the figure where the signal includes a tonal component
- the numerical value is normalized based on the large normalization coefficient value determined by the tone component.
- noise included in a tone-type acoustic signal in which spectral energy is concentrated at a specific frequency is much more audible than noise added to an acoustic signal whose energy is distributed smoothly over a wide frequency band. It is easy to touch and it is a big obstacle to hearing.
- the spectral component having a large energy that is, the tone component
- the spectral component is returned to a waveform signal on the time axis, and the preceding and following blocks are blocked.
- the distortion between blocks becomes large, and when combined with the waveform signal of an adjacent time block, large connection distortion occurs, which again causes a great hearing problem.
- quantization must be performed with a sufficient number of bits in order to encode the tone component, but if the quantization accuracy is determined for each predetermined band as described above, the tone It is necessary to assign a large number of bits to a large number of spectrums in the encoding unit including the component and perform quantization, which results in poor encoding efficiency. Therefore, conventionally, the sound quality is not degraded especially for a tonal sound signal. It has been difficult to increase the coding efficiency.
- the present invention relates to a signal encoding device capable of increasing the encoding efficiency without deteriorating the sound quality, particularly for a tonal sound signal, and a signal encoding device and the like.
- the purpose of the present invention is to provide a recording medium on which the signal processed in step 1 is recorded and a signal decoding device for decoding an encoded signal reproduced from the recording medium or transmitted from the signal encoding device. is there
- a signal encoding apparatus includes: a conversion unit configured to convert an input signal into a frequency component; and a first unit configured to convert an output of the conversion unit into a tone component.
- Means, and the first encoding means encodes each signal component of the first signal into a different code length.
- the signal encoding device of the present invention performs the following. That is, the first encoding means normalizes the amplitude information of each tonic component of the first signal by a normalization coefficient when encoding the first signal. Encoding from.
- each frequency component of each tone component is encoded by a plurality of conversion rules. Which one of the plurality of conversion rules is used for coding is determined by the relative positional relationship between the maximum frequency component of the tonic component and each frequency component on the frequency axis.
- the above conversion rule is applied to the maximum frequency component.
- the conversion rule is to convert those with larger amplitude value information to shorter codes.
- a conversion rule applied to each of the other frequency components of the maximum frequency component is to convert a code having smaller amplitude value information into a shorter code.
- the input signal is a sound signal.
- the first encoding means of the signal encoding device of the present invention encodes the amplitude information of each tone component of the first signal by normalizing and quantizing the amplitude information with a normalization coefficient, and encodes the code. For this purpose, the amplitude information of the maximum frequency component is omitted.
- the signal encoding device of the present invention performs the following. That is, the separating means separates the first signal while allowing the tone components to overlap each other on the frequency axis. The smaller the value of the above-mentioned normalization coefficient, the more accurately it is set.
- the input signal is also an acoustic signal.
- the recording medium of the present invention is a medium in which a first signal composed of tone components coded to different lengths and a second signal composed of other components are recorded.
- the recording medium of the present invention is as follows. That is, the amplitude information of each tone component of the first signal is encoded by being normalized by the normalization coefficient. Further, each frequency component of each tone component is encoded by a plurality of conversion rules. Which of the plurality of conversion rules is used for coding is determined by the relative positional relationship between the maximum frequency component of the tone component and each frequency component on the frequency axis.
- the conversion rule applied to the maximum frequency component is as follows: It converts a code with a large amplitude value into a shorter code.
- the conversion rule applied to each of the other frequency components of the local maximum frequency component is to convert a code having smaller amplitude value information into a shorter code.
- the recorded signal is an acoustic signal.
- the recording medium of the present invention records a first signal composed of a tone component and a second signal composed of other components, and normalizes and quantizes the amplitude information of the tone component of the first signal.
- information other than information obtained by normalizing and quantizing the amplitude information of the maximum frequency component is recorded.
- the tone component of the first signal is recorded overlapping on the frequency axis.
- the signal decoding apparatus of the present invention comprises a first decoding means for decoding a first signal composed of tone components each having a different length, and a second decoding means composed of other components.
- Second decoding means for decoding the signals of the respective signals, and combining and inverse converting means for synthesizing the respective signals and performing an inverse conversion, or inversely converting the respective signals and synthesizing the respective signals.
- the signal decoding device of the present invention is as follows. That is, the amplitude information of each tone component of the first signal is normalized by a normalization coefficient and encoded. Each frequency component of each tone component is encoded by a plurality of conversion rules.
- Which one of the plurality of conversion rules is used for encoding is determined by the frequency of the maximum frequency component of the tone component and each frequency component. It is determined by the relative positional relationship on several axes.
- the conversion rule applied to the local maximum frequency component is to convert a code having larger amplitude value information into a shorter code.
- the conversion rule applied to components other than the local maximum frequency component is to convert a code having smaller amplitude value information into a shorter code.
- the output signal is an acoustic signal.
- the signal decoding apparatus of the present invention decodes a first signal composed of an encoded tone component without including information obtained by normalizing and quantizing the amplitude information of the maximum frequency component.
- the tone component of the first signal is coded redundantly on the frequency axis. Further, the smaller the value of the normalization coefficient for the above-mentioned normalization, the more accurately the normalization coefficient is set.
- the present invention separates and encodes an input audio signal into a signal component in which energy is concentrated at a specific frequency (tone component) and a component in which energy is smoothly distributed over a wide band (components other than tone component).
- tone component a component in which energy is smoothly distributed over a wide band
- more efficient encoding is realized by effectively applying a variable-length code to a tone component signal.
- the spectral coefficient having the maximum absolute value is encoded, for example, by encoding only positive and negative code information, thereby realizing more efficient encoding.
- FIG. 2 is a block circuit diagram showing a schematic configuration of the decoding device according to the present invention.
- FIG. 3 is a flowchart showing a processing flow in the signal component separation circuit according to the present invention.
- FIG. 4 is a diagram for explaining separation of tone components in signal encoding according to the present invention.
- FIG. 5 is a diagram showing a noise component obtained by removing a tonal component from an original spectrum signal in the signal encoding of the present invention.
- FIG. 6 is a diagram illustrating an example of a spectrum signal.
- FIG. 7 is a diagram showing a signal after subtracting a signal obtained by encoding and decoding one tone component from the spectrum signal of FIG.
- FIG. 8 is a diagram for explaining a conversion rule for a spectrum of a tone component in the present invention.
- FIG. 9 is a block diagram showing a specific configuration of the tone component encoding circuit of FIG.
- FIG. 10 is a block circuit diagram showing a specific configuration of the tone component decoding circuit of FIG.
- FIG. 11 is a diagram for explaining recording of a code string obtained by encoding according to the signal encoding of the present invention.
- FIG. 12 is a block circuit diagram showing a schematic configuration of a conventional encoding device.
- FIG. 13 is a block circuit diagram showing a specific configuration of a conversion circuit applied to the present invention and a conventional encoding device.
- FIG. 14 is a block circuit diagram showing a specific configuration of a signal component encoding circuit applied to the present invention and a conventional encoding device.
- FIG. 15 is a block circuit diagram showing a schematic configuration of a conventional decoding device.
- FIG. 16 is a block circuit diagram showing a specific configuration of an inverse conversion circuit applied to the present invention and a conventional decoding device.
- FIG. 17 is a diagram for explaining an encoding method according to the related art.
- FIG. 18 is a block circuit diagram showing another example of the synthesis inverse transform unit included in the decoding device according to the present invention.
- FIG. 19 is a block circuit diagram showing another embodiment of the encoding device according to the present invention.
- FIG. 2 OA is a code table showing a conversion rule for the maximum spectrum coefficient.
- FIG. 20B is a code table showing conversion rules of peripheral spectrum coefficients when the same conversion rule is used for all peripheral spectrum coefficients.
- FIG. 1 shows a schematic configuration of a signal encoding device according to an embodiment of the present invention.
- an acoustic waveform signal is supplied to a terminal 600.
- This acoustic signal waveform is converted to a signal frequency component by the conversion circuit 601, and then sent to the signal component separation circuit 602.
- the signal frequency component obtained by the conversion circuit 601 is divided into a tone component having a steep spectrum distribution and a signal frequency component other than that, that is, a flat component. It is separated into a noise component having a strong spectral distribution. Of these separated frequency components, the tone component having the steep spectrum distribution is the tone component encoding circuit 603, and the other signal frequency components are the noise components.
- Each is encoded by the component encoding circuit 604.
- the signal output from the tone component encoding circuit 603 is further subjected to variable length encoding by the variable length encoding circuit 610.
- the output from the variable-length encoding circuit 610 and the noise component encoding circuit 604 is output as a code string generated by a code string generation circuit 605.
- the ECC encoder 606 adds an error collection code to the code string from the code string generation circuit 605.
- the output from the ECC encoder 606 is modulated by the EFM circuit 607 and supplied to the recording head 608.
- the recording head 608 records the code string output from the EFM circuit 607 on the disk 609.
- the encoding according to the present invention works particularly effectively when energy is concentrated at a specific frequency. It is convenient to adopt a method of converting the frequency components into frequency components by the above-described spectrum conversion that can be obtained with a relatively small amount of calculation.
- tone component encoding circuit 603 and the noise component encoding circuit 604 can be basically realized by the same configuration as that of FIG. 14 described above.
- FIG. 2 shows a schematic configuration of a signal decoding device according to an embodiment of the present invention for decoding a signal encoded by the encoding device of FIG.
- the code string reproduced from the disk 609 via the reproduction head 708 is supplied to the EFM demodulation circuit 709.
- the EFM demodulation circuit 709 demodulates the input code string.
- the demodulated code string is supplied to the ECC decoder 710, where error correction is performed.
- the code string decomposition circuit 701 recognizes which part of the code string is a tone component code based on the number of tone component information in the error-corrected code string, and converts the input code string into a tone. And a noise component code. Further, the code string separating circuit 701 separates the positional information of the tone component from the input code string, and outputs the information to the synthesis circuit 704 at the subsequent stage.
- the above tone component code is a variable length decoding circuit 7.
- tone component decoding circuit 7 After variable length decoding by 1 5, tone component decoding circuit 7 0
- the noise code is sent to the noise component decoding circuit 703, where the dequantization and normalization are canceled and decoded. Then, these tone component decoding circuits 720 and noise characteristics
- the decoded signal from the component decoding circuit 703 is supplied to a synthesis circuit 704 that performs synthesis corresponding to the separation in the signal component separation circuit 602 in FIG.
- the combining circuit 704 converts the decoded signal of the tonic component into a predetermined signal of the decoded signal of the noisy component based on the positional information of the tonic component supplied from the code string separating circuit 701.
- the noise component and the tone component are synthesized on the frequency axis by adding to the position of.
- the synthesized decoded signal is converted by an inverse conversion circuit 705 that performs an inverse conversion corresponding to the conversion by the conversion circuit 601 in FIG. It is returned to the upper waveform signal.
- the output waveform signal from the inverse conversion circuit 705 is output from the terminal 707.
- the composition inverse transformation unit 711 in FIG. 2 has the configuration shown in FIG.
- the inverse transform circuit 712 constituting the synthesis inverse transform section 711 shown in FIG. 18 converts the decoded signal of the noise component on the frequency axis from the noise component decoding circuit 703 on the time axis. Inversely converted to a noise component signal.
- the inverse transform circuit 713 converts the decoded signal of the tonal component from the tonal component decoding circuit 702 into the frequency axis indicated by the position information of the tonal component supplied from the code sequence separation circuit 71. And inversely transform it to generate a tonal component signal on the time axis.
- the synthesizing circuit 714 combines the noise component signal on the time axis from the inverse transform circuit 712 with the tone component signal on the time axis from the inverse transform circuit 713 to form the original waveform signal. To play.
- FIG. 3 shows a specific processing flow for separating the tone component in the signal component separation circuit 62 of the encoding apparatus in FIG.
- I indicates the number of the spectrum signal
- N indicates the total number of spectrum signals
- P and R indicate predetermined coefficients.
- the absolute value of a certain spectral signal is locally larger than that of the other spectral component
- the absolute value of the absolute value of the spectral component is the time block (in the case of the vector transformation).
- the absolute value of the spectrum signal in the block is larger than a predetermined value compared to the maximum value of the spectrum signal, and furthermore, the spectrum and the neighboring spectrum (for example, the spectrum on both sides) ), If the sum of the energies indicates a predetermined ratio or more to the energy within a predetermined band including the spectrum, the spectrum signal and, for example, the spectrum signal on both sides thereof Is considered to be a tone component.
- the predetermined band for comparing the ratio of the energy distribution can be narrow in the low band and wide in the high band, for example, in accordance with the critical bandwidth in consideration of the characteristics of hearing.
- step S1 the maximum spectrum absolute value is substituted for a variable AO, and in step S2, the number I of the spectrum signal is set to 1.
- step S3 the absolute value of a certain spectrum in a certain time block is assigned to a variable A.
- step S4 it is determined whether or not the above-mentioned spectral absolute value is a local maximum absolute value vector that is larger than other spectral components when viewed locally, and is not a local maximum absolute value vector. If (No), the process proceeds to step S10, and if it is the maximum absolute value vector (Yes), the process proceeds to step S5.
- step S5 the variable A of the maximum absolute value vector and the variable A of the maximum vector absolute value in the time block including the maximum absolute value vector. And a coefficient P indicating a predetermined size Perform a size comparison (AZA> P), then AZA. If is greater than P (Ye s), go to step S6, AZA. If is equal to or less than P (NO), the flow proceeds to step S10.
- step S6 the energy value of the spectrum adjacent to the spectrum of the absolute value of the spectrum (maximum absolute value spectrum) (for example, the sum of the energies of the spectra on both sides) Is substituted into a variable X, and in the next step S7, the energy value in a predetermined band including the maximum absolute value spectrum and its neighboring spectrum is substituted into a variable Y.
- maximum absolute value spectrum for example, the sum of the energies of the spectra on both sides
- step S8 a magnitude comparison (XZY> R) of a ratio of the variable X of the energy value to the variable Y of the energy value in a predetermined band and a coefficient R indicating a predetermined ratio is performed.
- R a magnitude comparison of a ratio of the variable X of the energy value to the variable Y of the energy value in a predetermined band and a coefficient R indicating a predetermined ratio.
- step S9 the energy in the local absolute value spectrum and the neighboring spectrum show a predetermined ratio or more to the energy in a predetermined band including those spectra.
- the signal of the maximum absolute value spectrum and the signals of, for example, two low-frequency and two high-frequency sides adjacent to each other are regarded as tone components, and the fact is registered. .
- Signal component separation The circuit 602 supplies the frequency component determined to be a tone component by the above-described processing to the tone component encoding circuit 603, and uses the other frequency components as noise components as a noise component encoding circuit. Supply to 604. Further, the signal component separation circuit 602 supplies the number of frequency information determined to be a tone component and its position information to the code string generation circuit 605.
- FIG. 4 shows an example in which the tone component is separated from the frequency component as described above.
- TC A, TCB, TCC it is four tone characteristic components indicated by TC D have been extracted.
- the tone components are concentrated and distributed in a small number of spectrum signals as in the example of FIG. 4, even if these components are quantized with high accuracy, there are too many as a whole. The number of bits is not required. For this reason, the encoding efficiency can be improved by normalizing and quantizing the tone components once, but since the number of the spectrum signals constituting the tone components is relatively small, the normalization is performed.
- the apparatus may be simplified by omitting quantization and requantization processing.
- FIG. 5 shows an example in which the noise component is obtained when the tone component is removed from the original spectrum signal (to 0).
- the tone component is removed (set to 0) from the original spectrum signal in each of the bands b1 to b5 as described above.
- the normalization coefficient in each coding unit has a small value, and therefore, the quantization noise generated even with a small number of bits can be reduced.
- efficient encoding can be performed by separating the tone component and setting the signal around the tone component and the signal around the tone component to 0, and then encoding the noise component. It is also possible to take a method of encoding a signal obtained by subtracting a decoded signal obtained by encoding a tone component from a torque signal.
- a signal encoding apparatus will be described with reference to FIG.
- the same components as those in FIG. 1 are denoted by the same reference numerals, and description thereof is omitted.
- the spectrum signal obtained by the conversion circuit 601 is supplied to the tonal component extraction circuit 802 via the switch 801 controlled by the switch control circuit 808.
- the tone component extraction circuit 802 determines the tone component by the processing of FIG. 3 described above, and supplies only the determined tone component to the tone component encoding circuit 603. Further, the tone component extraction circuit 802 outputs the number of tone component information and the center position information thereof to the coded sequence generation circuit 605.
- the tone component encoding circuit 603 performs normalization and quantization on the input tone component, and converts the normalized and quantized tone component into a variable length encoding circuit 610 and a local decoder. 804.
- the variable-length encoding circuit 610 performs variable-length encoding on the normalized and quantized tone component, and supplies the obtained variable-length code to the code sequence generating circuit 605.
- the local decoder 804 dequantizes and denormalizes the normalized and quantized tone component, and decodes the original tone component signal. However, at this time, the decoded signal contains quantization noise.
- the output from the local decoder 804 is supplied to the adder 805 as the first decoded signal.
- adder 8 0 5 is supplied with the original spectrum signal from the conversion circuit 61 through a switch 806 controlled by a switch control circuit 808.
- the adder 805 subtracts the first decoded signal from the original spectrum signal and outputs the first difference signal.
- the switch controlled by the switch control circuit 808 uses the first difference signal as a noise component.
- the signal is supplied to the noise component encoding circuit 604 via 807.
- the first difference signal is supplied to the tone component extraction circuit 802 via the switch 801.
- the tone component extraction circuit 802, the tone component encoding circuit 603, and the local decoder 804 perform the same processing as described above, and the obtained second decoded signal is supplied to the adder 805. You.
- the first difference signal is supplied to the adder 805 through the switch 806.
- the adder 805 subtracts the second decoded signal from the first differential signal and outputs a second differential signal.
- the second difference signal is used as the noise component via the switch 807 to generate the noise component. It is supplied to the encoding circuit 604.
- the same processing as described above is performed in the tonal component extracting circuit 802, the tonal component encoding circuit 603, and the local decoder 8. 0 4, performed by adder 805.
- the switch control circuit 808 holds a threshold value for the number of tone component information, and extracts the tone component when the number of tone component information obtained from the tone component extraction circuit exceeds this threshold. Switch 807 to terminate the encoding and decoding processes. Control. Further, when the tone component is no longer extracted in the tone component encoding circuit 603, the extraction, encoding, decoding, and difference processing of the tone component can be terminated.
- FIG. 6 and FIG. 7 are diagrams for explaining such a method.
- FIG. 7 is obtained by subtracting a signal obtained by encoding and decoding the tone component of ⁇ from the spectrum signal of FIG.
- the encoding accuracy of the spectrum signal can be improved, and this is repeated. By doing so, highly accurate encoding can be performed.
- the coding accuracy can be sufficiently high even if the upper limit of the number of bits for quantizing the tone component is set low, and accordingly, the number of quantization bits is large.
- the number of bits for recording the data can be reduced.
- the method of extracting the tonal components in multiple stages as described above is not limited to the case where a signal equivalent to that obtained by encoding and decoding the tonal components is subtracted from the original spectrum signal.
- the present invention is also applicable when the spectrum signal of the extracted tonal component is set to 0.
- the expression such as “signal from which tonal component is separated” and the like include both.
- efficient encoding can be realized by decomposing the original waveform signal into tone components and noise components and performing encoding.
- efficient encoding can be performed.
- each tone component has a spectral coefficient at which the absolute value is maximized. (This is called the maximum spectral coefficient here) and the surrounding spectral coefficients (here called the peripheral spectral coefficients) concentrate energy.
- the distribution of values when quantizing each coefficient is biased, and the relative position on the frequency axis is between the maximum spectrum coefficient and the peripheral spectrum coefficient.
- the manner of distribution varies greatly depending on the relationship. That is, if the spectrum coefficients constituting each tone component are normalized by a normalization coefficient determined by the maximum spectrum coefficient, that is, for example, each spectrum coefficient constituting the tone component is converted to the corresponding tone coefficient.
- the maximum spectrum coefficient after quantization becomes a value close to +1 or -11, whereas the tone component is originally the spectrum. Since the torque coefficient has a characteristic of rapidly decreasing around the maximum spectrum coefficient, the peripheral spectrum coefficients after quantization are more distributed to values close to zero.
- DA Huf fman A Method f or Construction on Minimum Redundancy Codes, Proc.I. RE, 40, p. 1098 (1952) Can be efficiently encoded by a so-called variable length code that assigns
- each tone component is separated into a maximum spectrum coefficient and a peripheral spectrum coefficient, and a different variable length code is applied to each of them. It tries to realize efficient coding.
- the surrounding spectrum coefficients are normalized and quantified.
- the distribution of values in the case of digitization is greatly affected by the relative positional relationship between the surrounding spectrum coefficient and the maximum spectrum coefficient on the frequency axis. Therefore, the peripheral spectrum coefficient is further classified into several groups according to the relative position on the frequency axis with respect to the maximum spectrum coefficient, and a different variable length code is set for each classified group.
- the conversion may be performed according to the conversion rule.
- a method of classifying the relative position a method of performing classification based on an absolute value of a difference on a frequency axis from the maximum spectrum coefficient can be used.
- Conversion to variable-length code is performed using a total of three conversion rules, the conversion rules for the peripheral spectrum coefficients and the conversion rules for the peripheral spectrum coefficients shown as ECa and ECe in the figure. Can be.
- all the peripheral spectrum coefficients may be subjected to variable-length coding according to the same conversion rule.
- FIG. 20A shows an example of a code table that shows the conversion rule for the maximum spectrum coefficient.
- FIG. 20B shows an example of a code table showing the conversion rules of the peripheral spectrum coefficients when the same conversion rule is used for all the peripheral spectral coefficients.
- the maximum spectrum coefficient after normalization and quantization that is, the quantization value of the maximum spectrum is a value close to +1 or 11 as described above. Therefore, as shown in FIG. 20A, these +1 and ⁇ 1 have shorter code lengths than the code lengths assigned to other values. By assigning 00 and 01, the maximum spectrum coefficient can be efficiently encoded.
- the peripheral spectrum coefficient after normalization and quantization that is, the quantized value of the peripheral spectrum is close to 0 as described above. Therefore, as shown in FIG. 20, if 0 is assigned to this 0, which is a shorter code length than the code lengths assigned to other values, the peripheral spectrum coefficient can be efficiently obtained. Can be encoded.
- a plurality of code tables for the maximum spectrum coefficients and a plurality of code tables for the peripheral spectrum coefficients are provided. If the corresponding code table is selected according to the determined quantization precision, more efficient coding can be performed.
- FIG. 9 shows a specific example of the variable length coding circuit 6 10 of FIG.
- the tone component input to the terminal 800 is classified by the control circuit 801 based on the relative position on the frequency axis with the maximum spectrum component. Is sent to one of the peripheral coefficient encoding circuit 802, the peripheral spectral coefficient encoding circuit 803, and the peripheral spectral coefficient encoding circuit 804. Is encoded based on the corresponding conversion rule. The encoded output from each of the encoding circuits 802, 803, 804 is output from an output terminal 805 via a control circuit 801.
- FIG. 10 shows a specific example of the variable length decoding circuit 715 of FIG.
- the tone component inputted to the input terminal 900 is The codes are divided by the control circuit 91 in accordance with the classification in FIG. 9 above, and the corresponding maximum spectrum coefficient decoding circuit 9 02 and peripheral spectrum coefficient decoding circuit 9 0 3 and sent to one of the peripheral spectrum coefficient decoding circuits 9 04, and are decoded by each of these circuits based on the inverse conversion rule corresponding to the above-described conversion rule.
- the decoded output from each of the decoding circuits 902, 903, 904 is output from an output terminal 905 via a control circuit 901.
- FIG. 11 shows an example in which the spectral signal of FIG. 4 is encoded by the encoding device of the present embodiment. This code string is recorded on the recording medium.
- the number of tone component information t nc (for example, 4 in the example of FIG. 11) is recorded on the recording medium
- noise component information ncnc 2 , nc 3, nc 4 , and nc 5 are recorded in this order.
- the tonality component information tc A, tc B , tcc, and tc D have center position information CP (for example, 15 ton in the case of the tonality component tc B ) indicating the position of the center spectrum of the tonality component. ), Quantization accuracy information indicating the number of bits for quantization (for example, 6 in the case of the tone component tc B ), and normalization coefficient information, after being normalized and quantized, variable-length coding was done
- each signal component information SC e , SC b , S Cc, SC d , and SC e is recorded together with each signal component information SC e , SC b , S Cc, SC d , and SC e .
- the conversion rule for variable-length coding is determined for each quantization precision, and the decoding device decodes the variable-length code with reference to the quantization precision information.
- the quantization accuracy is fixedly determined by the frequency, it is not necessary to record the quantization accuracy information.
- the position of the center spectrum of each tone component is used as the position information of the tone component, but the lowest frequency spectrum of each tone component is used. Tol position (for example, tonal component
- quantization accuracy information for example, 2 for noisy nc
- signal component information S d, SC 2 , ⁇ It is recorded along with the SC 8.
- the quantization accuracy information is 0, no actual encoding is performed in the encoding unit. If the quantization precision is fixedly determined by the band, it is not necessary to record the quantization precision information.
- FIG. 11 shows an embodiment of the types and order of information recorded on a recording medium.
- the signal encoding apparatus of the present embodiment enables efficient encoding by giving the amplitude information of only the maximal spectrum of each tone component only by the normalization coefficient information. ing. That is, the tone component encoding circuit 603 performs normalization and quantization on frequency components of each tone component other than the maximum spectrum. In the tone component encoding circuit 603, normalization and quantization are performed on all tone components including the maximum spectrum, and the code sequence generation circuit in the subsequent stage is performed. It is also possible not to output the quantized value corresponding to the maximum spectrum in the path 605.
- the signal component information SC C will include only the sign indicating positive or negative.
- the signal decoding apparatus uses the normalization coefficient. From this, an approximate value of the amplitude information of the maximum spectrum can be obtained. Therefore, for example, when the spectrum information is realized by MDCT, DCT, or the like, the local maximum spectrum can be obtained from the sign indicating the positive or negative and the normalization coefficient information, and the spectrum can be obtained, for example. When the torque information is realized by DFT or the like, the local maximum spectrum can be obtained from only the phase component, and the information obtained by quantizing the amplitude information with respect to the local maximum spectrum can be obtained. Recording can be omitted. This method is particularly effective when the normalization coefficient can be obtained with high accuracy.
- the accuracy of the normalization coefficient is not sufficient, the accuracy of the maximum spectrum coefficient may not be sufficiently ensured in some cases.
- this problem can be solved by using a method of extracting a tone component over multiple stages using the configuration shown in FIG. As shown in FIGS. 6 and 7, according to this method, there is a high possibility that a frequency component duplicated on the frequency axis is extracted a plurality of times as a tone component. It is recommended that the normalization coefficient be set non-linearly, for example, by setting it at regular intervals on a logarithmic scale so that the smaller the normalization coefficient, the better the accuracy.
- the decoding device synthesizes the plurality of frequency components, Even when the accuracy of the two normalization coefficients is not sufficient, a certain degree of accuracy can be ensured. Further, in the above description, description has been made mainly of an example in which an audio signal is encoded by the signal encoding device of the embodiment of the present invention. However, in the encoding of the present invention, encoding of a general waveform signal is also performed. It is possible to apply. However, the encoding according to the present invention is particularly effective in efficiently encoding an audio signal whose tone component has a significant auditory meaning.
- the disk 609 of the above embodiment can be, for example, a magneto-optical recording medium, an optical recording medium, a phase-change optical recording medium, or the like.
- a recording medium in place of the disk 609 a semiconductor memory, an IC card, or the like can be used in addition to a tape-shaped recording medium.
- the input signal is converted into a frequency component, and the converted output is converted into a first component composed of a tonic component.
- each signal component of the first signal is encoded into a different code length, whereby Of the signals decomposed into noise components and noise components, it is possible to encode the tonal components very efficiently, and to improve the encoding efficiency for the entire signal waveform. Therefore, this pressure If the compressed signal is recorded on a recording medium, the recording capacity can be used effectively, and a good signal, for example, an audio signal can be obtained by encoding a signal obtained by reproducing the recording medium. it can
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Description
Claims
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PL94307740A PL173718B1 (pl) | 1993-06-30 | 1994-06-29 | Sposób i urządzenie do kodowania sygnałów cyfrowych |
DE69428030T DE69428030T2 (de) | 1993-06-30 | 1994-06-29 | Digitales signalkodierungsgerät, dazugehöriges dekodiergerät und aufzeichnungsträger |
RU95106457A RU2131169C1 (ru) | 1993-06-30 | 1994-06-29 | Устройство кодирования сигнала, устройство декодирования сигнала, носитель записи и способ кодирования и декодирования |
EP94919822A EP0663739B1 (en) | 1993-06-30 | 1994-06-29 | Digital signal encoding device, its decoding device, and its recording medium |
BR9405445-2A BR9405445A (pt) | 1993-06-30 | 1994-06-29 | Aparelho codificador e decodificador de sinal apropriado para codificar um sinal de entrada e decodificar um sinal codificado, suporte de gravação onde sinais codificados são gravados, e processo de codificação e de decodificação de sinal para codificar um sinal de entrada e decodificar um sinal codificado. |
US08/392,756 US5765126A (en) | 1993-06-30 | 1994-06-29 | Method and apparatus for variable length encoding of separated tone and noise characteristic components of an acoustic signal |
JP50340095A JP3721582B2 (ja) | 1993-06-30 | 1994-06-29 | 信号符号化装置及び方法並びに信号復号化装置及び方法 |
PL94322680A PL174314B1 (pl) | 1993-06-30 | 1994-06-29 | Sposób i urządzenie do dekodowania sygnałów cyfrowych |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP18332293 | 1993-06-30 | ||
JP5/183322 | 1993-06-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1995001680A1 true WO1995001680A1 (fr) | 1995-01-12 |
Family
ID=16133682
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP1994/001056 WO1995001680A1 (fr) | 1993-06-30 | 1994-06-29 | Dispositif de codage de signaux numeriques, son dispositif de decodage, et son support d'enregistrement |
Country Status (10)
Country | Link |
---|---|
US (1) | US5765126A (ja) |
EP (2) | EP0663739B1 (ja) |
JP (1) | JP3721582B2 (ja) |
KR (1) | KR100368854B1 (ja) |
CN (2) | CN1099777C (ja) |
BR (1) | BR9405445A (ja) |
DE (2) | DE69432538T2 (ja) |
PL (2) | PL174314B1 (ja) |
RU (1) | RU2131169C1 (ja) |
WO (1) | WO1995001680A1 (ja) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5758316A (en) * | 1994-06-13 | 1998-05-26 | Sony Corporation | Methods and apparatus for information encoding and decoding based upon tonal components of plural channels |
US5870703A (en) * | 1994-06-13 | 1999-02-09 | Sony Corporation | Adaptive bit allocation of tonal and noise components |
JP2003324355A (ja) * | 2002-05-07 | 2003-11-14 | Sony Corp | 符号化方法及び装置、復号方法及び装置、並びにプログラム及び記録媒体 |
US8631060B2 (en) | 2007-12-13 | 2014-01-14 | Qualcomm Incorporated | Fast algorithms for computation of 5-point DCT-II, DCT-IV, and DST-IV, and architectures |
JP2016500839A (ja) * | 2012-10-10 | 2016-01-14 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | スペクトルパターンを利用することによってシヌソイドおよびスイープを効率的に合成するための装置および方法 |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW327223B (en) * | 1993-09-28 | 1998-02-21 | Sony Co Ltd | Methods and apparatus for encoding an input signal broken into frequency components, methods and apparatus for decoding such encoded signal |
US6167093A (en) * | 1994-08-16 | 2000-12-26 | Sony Corporation | Method and apparatus for encoding the information, method and apparatus for decoding the information and method for information transmission |
US5774837A (en) * | 1995-09-13 | 1998-06-30 | Voxware, Inc. | Speech coding system and method using voicing probability determination |
JP3282661B2 (ja) * | 1997-05-16 | 2002-05-20 | ソニー株式会社 | 信号処理装置および方法 |
PT887958E (pt) * | 1997-06-23 | 2003-06-30 | Liechti Ag | Metodo para a compressao de gravacoes de ruido ambiente metodo para a deteccao de elementos de programa no mesmo dispositivos e programa de computadorpara tal |
US6269332B1 (en) * | 1997-09-30 | 2001-07-31 | Siemens Aktiengesellschaft | Method of encoding a speech signal |
US6965697B1 (en) * | 1998-07-15 | 2005-11-15 | Sony Corporation | Coding apparatus and method, decoding apparatus and method, data processing system, storage medium, and signal |
US6311154B1 (en) | 1998-12-30 | 2001-10-30 | Nokia Mobile Phones Limited | Adaptive windows for analysis-by-synthesis CELP-type speech coding |
US6298322B1 (en) * | 1999-05-06 | 2001-10-02 | Eric Lindemann | Encoding and synthesis of tonal audio signals using dominant sinusoids and a vector-quantized residual tonal signal |
US20020049586A1 (en) * | 2000-09-11 | 2002-04-25 | Kousuke Nishio | Audio encoder, audio decoder, and broadcasting system |
US6801887B1 (en) | 2000-09-20 | 2004-10-05 | Nokia Mobile Phones Ltd. | Speech coding exploiting the power ratio of different speech signal components |
WO2003036619A1 (en) * | 2001-10-19 | 2003-05-01 | Koninklijke Philips Electronics N.V. | Frequency-differential encoding of sinusoidal model parameters |
EP1493146B1 (en) * | 2002-04-11 | 2006-08-02 | Matsushita Electric Industrial Co., Ltd. | Encoding and decoding devices, methods and programs |
US6973579B2 (en) | 2002-05-07 | 2005-12-06 | Interdigital Technology Corporation | Generation of user equipment identification specific scrambling code for the high speed shared control channel |
KR100462611B1 (ko) * | 2002-06-27 | 2004-12-20 | 삼성전자주식회사 | 하모닉 성분을 이용한 오디오 코딩방법 및 장치 |
MXPA05005601A (es) * | 2002-11-29 | 2005-07-26 | Koninklije Philips Electronics | Codificacion de audio. |
SG161223A1 (en) | 2005-04-01 | 2010-05-27 | Qualcomm Inc | Method and apparatus for vector quantizing of a spectral envelope representation |
ES2705589T3 (es) | 2005-04-22 | 2019-03-26 | Qualcomm Inc | Sistemas, procedimientos y aparatos para el suavizado del factor de ganancia |
CN1831940B (zh) * | 2006-04-07 | 2010-06-23 | 安凯(广州)微电子技术有限公司 | 基于音频解码器的音调和节奏调节方法 |
KR101261524B1 (ko) | 2007-03-14 | 2013-05-06 | 삼성전자주식회사 | 노이즈를 포함하는 오디오 신호를 저비트율로부호화/복호화하는 방법 및 이를 위한 장치 |
US8548815B2 (en) | 2007-09-19 | 2013-10-01 | Qualcomm Incorporated | Efficient design of MDCT / IMDCT filterbanks for speech and audio coding applications |
RU2451998C2 (ru) * | 2007-09-19 | 2012-05-27 | Квэлкомм Инкорпорейтед | Эффективный способ проектирования набора фильтров для mdct/imdct в приложениях для кодирования речи и аудиосигналов |
RU2464540C2 (ru) * | 2007-12-13 | 2012-10-20 | Квэлкомм Инкорпорейтед | Быстрые алгоритмы для вычисления 5-точечного dct-ii, dct-iv и dst-iv, и архитектуры |
EP2229677B1 (en) | 2007-12-18 | 2015-09-16 | LG Electronics Inc. | A method and an apparatus for processing an audio signal |
CA2715432C (en) | 2008-03-05 | 2016-08-16 | Voiceage Corporation | System and method for enhancing a decoded tonal sound signal |
US8498874B2 (en) * | 2009-09-11 | 2013-07-30 | Sling Media Pvt Ltd | Audio signal encoding employing interchannel and temporal redundancy reduction |
WO2011045465A1 (en) * | 2009-10-12 | 2011-04-21 | Nokia Corporation | Method, apparatus and computer program for processing multi-channel audio signals |
EP2523473A1 (en) * | 2011-05-11 | 2012-11-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating an output signal employing a decomposer |
JP6021498B2 (ja) | 2012-08-01 | 2016-11-09 | 任天堂株式会社 | データ圧縮装置、データ圧縮プログラム、データ圧縮システム、データ圧縮方法、データ伸張装置、データ圧縮伸張システム、および圧縮データのデータ構造 |
RU2573248C2 (ru) * | 2013-10-29 | 2016-01-20 | Федеральное государственное бюджетное образовательное учреждение высшего профессионального образования Московский технический университет связи и информатики (ФГОБУ ВПО МТУСИ) | Способ измерения спектра информационных акустических сигналов телерадиовещания и устройство для его осуществления |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS61201526A (ja) * | 1985-02-27 | 1986-09-06 | テレフンケン・フエルンゼー・ウント・ルントフンク・ゲゼルシヤフト・ミツト・ベシユレンクテル・ハフツング | オーデイオ信号の伝送方法 |
Family Cites Families (61)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3973081A (en) * | 1975-09-12 | 1976-08-03 | Trw Inc. | Feedback residue compression for digital speech systems |
US4170719A (en) * | 1978-06-14 | 1979-10-09 | Bell Telephone Laboratories, Incorporated | Speech transmission system |
US4184049A (en) * | 1978-08-25 | 1980-01-15 | Bell Telephone Laboratories, Incorporated | Transform speech signal coding with pitch controlled adaptive quantizing |
US4455649A (en) * | 1982-01-15 | 1984-06-19 | International Business Machines Corporation | Method and apparatus for efficient statistical multiplexing of voice and data signals |
US4535472A (en) * | 1982-11-05 | 1985-08-13 | At&T Bell Laboratories | Adaptive bit allocator |
JPS59223032A (ja) * | 1983-06-01 | 1984-12-14 | Sony Corp | ディジタル信号伝送装置 |
US4748579A (en) * | 1985-08-14 | 1988-05-31 | Gte Laboratories Incorporated | Method and circuit for performing discrete transforms |
JPH0734291B2 (ja) * | 1986-07-28 | 1995-04-12 | 株式会社日立製作所 | デイジタル信号記録再生システム |
US4797926A (en) * | 1986-09-11 | 1989-01-10 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech vocoder |
JPH061916B2 (ja) * | 1986-10-28 | 1994-01-05 | 日本電気株式会社 | 帯域分割符号化復号化装置 |
EP0267344B1 (en) * | 1986-10-30 | 1993-09-01 | International Business Machines Corporation | Process for the multi-rate encoding of signals, and device for carrying out said process |
DE3639753A1 (de) * | 1986-11-21 | 1988-06-01 | Inst Rundfunktechnik Gmbh | Verfahren zum uebertragen digitalisierter tonsignale |
JP2751201B2 (ja) * | 1988-04-19 | 1998-05-18 | ソニー株式会社 | データ伝送装置及び受信装置 |
NL8700985A (nl) * | 1987-04-27 | 1988-11-16 | Philips Nv | Systeem voor sub-band codering van een digitaal audiosignaal. |
US4827336A (en) * | 1987-12-18 | 1989-05-02 | General Electric Company | Symbol code generation processing from interframe DPCM of TDM'd spatial-frequency analyses of video signals |
IL89672A (en) * | 1988-04-29 | 1994-04-12 | Motorola Inc | Spectral efficient method of transmitting information signal |
JP2638091B2 (ja) * | 1988-06-24 | 1997-08-06 | ソニー株式会社 | データ伝送方法 |
CA2002015C (en) * | 1988-12-30 | 1994-12-27 | Joseph Lindley Ii Hall | Perceptual coding of audio signals |
US5109417A (en) * | 1989-01-27 | 1992-04-28 | Dolby Laboratories Licensing Corporation | Low bit rate transform coder, decoder, and encoder/decoder for high-quality audio |
US5142656A (en) * | 1989-01-27 | 1992-08-25 | Dolby Laboratories Licensing Corporation | Low bit rate transform coder, decoder, and encoder/decoder for high-quality audio |
US4932062A (en) * | 1989-05-15 | 1990-06-05 | Dialogic Corporation | Method and apparatus for frequency analysis of telephone signals |
EP0405591B1 (en) * | 1989-06-30 | 1997-10-01 | Nec Corporation | Varaible length block coding with changing characteristics of input samples |
JP2844695B2 (ja) * | 1989-07-19 | 1999-01-06 | ソニー株式会社 | 信号符号化装置 |
US5054075A (en) * | 1989-09-05 | 1991-10-01 | Motorola, Inc. | Subband decoding method and apparatus |
JPH03109824A (ja) * | 1989-09-22 | 1991-05-09 | Sony Corp | ディジタル信号符号化装置 |
JP2906483B2 (ja) * | 1989-10-25 | 1999-06-21 | ソニー株式会社 | ディジタル音声データの高能率符号化方法及びディジタル音声データの復号化装置 |
US5115240A (en) * | 1989-09-26 | 1992-05-19 | Sony Corporation | Method and apparatus for encoding voice signals divided into a plurality of frequency bands |
JPH03117919A (ja) * | 1989-09-30 | 1991-05-20 | Sony Corp | ディジタル信号符号化装置 |
US5185800A (en) * | 1989-10-13 | 1993-02-09 | Centre National D'etudes Des Telecommunications | Bit allocation device for transformed digital audio broadcasting signals with adaptive quantization based on psychoauditive criterion |
US5040217A (en) * | 1989-10-18 | 1991-08-13 | At&T Bell Laboratories | Perceptual coding of audio signals |
JPH03132228A (ja) * | 1989-10-18 | 1991-06-05 | Victor Co Of Japan Ltd | 直交変換信号符号化復号化方式 |
JP2870050B2 (ja) * | 1989-10-18 | 1999-03-10 | ソニー株式会社 | ディジタルデータの高能率符号化方法 |
DE69028176T2 (de) * | 1989-11-14 | 1997-01-23 | Nippon Electric Co | Adaptive Transformationskodierung durch optimale Blocklängenselektion in Abhängigkeit von Unterschieden zwischen aufeinanderfolgenden Blöcken |
US5081681B1 (en) * | 1989-11-30 | 1995-08-15 | Digital Voice Systems Inc | Method and apparatus for phase synthesis for speech processing |
JP2827410B2 (ja) * | 1990-03-14 | 1998-11-25 | ソニー株式会社 | デイジタルデータの高能率符号化方法 |
JP2913731B2 (ja) * | 1990-03-07 | 1999-06-28 | ソニー株式会社 | ディジタルデータの高能率符号化方法 |
JP2861238B2 (ja) * | 1990-04-20 | 1999-02-24 | ソニー株式会社 | ディジタル信号符号化方法 |
US5367608A (en) * | 1990-05-14 | 1994-11-22 | U.S. Philips Corporation | Transmitter, encoding system and method employing use of a bit allocation unit for subband coding a digital signal |
US5222289A (en) * | 1990-07-10 | 1993-06-29 | Gemcor Engineering Corp. | Method and apparatus for fastening |
JP3141241B2 (ja) * | 1990-08-24 | 2001-03-05 | ソニー株式会社 | ディスク記録装置及びディスク再生装置 |
US5244705A (en) * | 1990-08-24 | 1993-09-14 | Sony Corporation | Disc-shaped recording medium |
US5049992A (en) * | 1990-08-27 | 1991-09-17 | Zenith Electronics Corporation | HDTV system with receivers operable at different levels of resolution |
US5226084A (en) * | 1990-12-05 | 1993-07-06 | Digital Voice Systems, Inc. | Methods for speech quantization and error correction |
US5134475A (en) * | 1990-12-11 | 1992-07-28 | At&T Bell Laboratories | Adaptive leak hdtv encoder |
JP2853717B2 (ja) * | 1991-03-08 | 1999-02-03 | 日本電気株式会社 | 可変ビット型adpcmトランスコーダ |
EP0506394A2 (en) * | 1991-03-29 | 1992-09-30 | Sony Corporation | Coding apparatus for digital signals |
BR9204799A (pt) * | 1991-03-29 | 1993-07-13 | Sony Corp | Processo de codificacao para um sinal digital |
JP3395192B2 (ja) * | 1991-05-25 | 2003-04-07 | ソニー株式会社 | ディジタルオーディオ信号再生装置、ディスクプレーヤの再生ポーズ回路及びディスク再生装置の再生制御回路 |
GB2257606B (en) * | 1991-06-28 | 1995-01-18 | Sony Corp | Recording and/or reproducing apparatuses and signal processing methods for compressed data |
DE69232251T2 (de) * | 1991-08-02 | 2002-07-18 | Sony Corp., Tokio/Tokyo | Digitaler Kodierer mit dynamischer Quantisierungsbitverteilung |
GB2258372B (en) * | 1991-08-02 | 1995-05-31 | Sony Corp | Apparatus for and methods of recording and/or reproducing digital data |
JP3178026B2 (ja) * | 1991-08-23 | 2001-06-18 | ソニー株式会社 | ディジタル信号符号化装置及び復号化装置 |
JP3141450B2 (ja) * | 1991-09-30 | 2001-03-05 | ソニー株式会社 | オーディオ信号処理方法 |
DE69227570T2 (de) * | 1991-09-30 | 1999-04-22 | Sony Corp., Tokio/Tokyo | Verfahren und Anordnung zur Audiodatenkompression |
JP3134455B2 (ja) * | 1992-01-29 | 2001-02-13 | ソニー株式会社 | 高能率符号化装置及び方法 |
JP3104400B2 (ja) * | 1992-04-27 | 2000-10-30 | ソニー株式会社 | オーディオ信号符号化装置及び方法 |
KR0150955B1 (ko) * | 1992-05-27 | 1998-10-15 | 강진구 | 비트고정을 위한 영상압축방법과 신장방법 및 그 장치 |
JPH0629934A (ja) * | 1992-07-10 | 1994-02-04 | Nippon Hoso Kyokai <Nhk> | 適応差分符号化伝送方法 |
JP3508146B2 (ja) * | 1992-09-11 | 2004-03-22 | ソニー株式会社 | ディジタル信号符号化復号化装置、ディジタル信号符号化装置及びディジタル信号復号化装置 |
JP3343962B2 (ja) * | 1992-11-11 | 2002-11-11 | ソニー株式会社 | 高能率符号化方法及び装置 |
JP3186413B2 (ja) * | 1994-04-01 | 2001-07-11 | ソニー株式会社 | データ圧縮符号化方法、データ圧縮符号化装置及びデータ記録媒体 |
-
1994
- 1994-06-29 CN CN94190550A patent/CN1099777C/zh not_active Expired - Lifetime
- 1994-06-29 PL PL94322680A patent/PL174314B1/pl unknown
- 1994-06-29 JP JP50340095A patent/JP3721582B2/ja not_active Expired - Lifetime
- 1994-06-29 PL PL94307740A patent/PL173718B1/pl unknown
- 1994-06-29 RU RU95106457A patent/RU2131169C1/ru active
- 1994-06-29 EP EP94919822A patent/EP0663739B1/en not_active Expired - Lifetime
- 1994-06-29 DE DE69432538T patent/DE69432538T2/de not_active Expired - Lifetime
- 1994-06-29 EP EP00125009A patent/EP1083674B1/en not_active Expired - Lifetime
- 1994-06-29 DE DE69428030T patent/DE69428030T2/de not_active Expired - Lifetime
- 1994-06-29 US US08/392,756 patent/US5765126A/en not_active Expired - Lifetime
- 1994-06-29 WO PCT/JP1994/001056 patent/WO1995001680A1/ja active IP Right Grant
- 1994-06-29 BR BR9405445-2A patent/BR9405445A/pt not_active IP Right Cessation
- 1994-06-29 KR KR1019950700784A patent/KR100368854B1/ko not_active IP Right Cessation
-
2002
- 2002-03-28 CN CN021085080A patent/CN1217502C/zh not_active Expired - Lifetime
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS61201526A (ja) * | 1985-02-27 | 1986-09-06 | テレフンケン・フエルンゼー・ウント・ルントフンク・ゲゼルシヤフト・ミツト・ベシユレンクテル・ハフツング | オーデイオ信号の伝送方法 |
Non-Patent Citations (1)
Title |
---|
See also references of EP0663739A4 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5758316A (en) * | 1994-06-13 | 1998-05-26 | Sony Corporation | Methods and apparatus for information encoding and decoding based upon tonal components of plural channels |
US5870703A (en) * | 1994-06-13 | 1999-02-09 | Sony Corporation | Adaptive bit allocation of tonal and noise components |
JP2003324355A (ja) * | 2002-05-07 | 2003-11-14 | Sony Corp | 符号化方法及び装置、復号方法及び装置、並びにプログラム及び記録媒体 |
US8631060B2 (en) | 2007-12-13 | 2014-01-14 | Qualcomm Incorporated | Fast algorithms for computation of 5-point DCT-II, DCT-IV, and DST-IV, and architectures |
JP2016500839A (ja) * | 2012-10-10 | 2016-01-14 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | スペクトルパターンを利用することによってシヌソイドおよびスイープを効率的に合成するための装置および方法 |
US9570085B2 (en) | 2012-10-10 | 2017-02-14 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns |
JP2018036668A (ja) * | 2012-10-10 | 2018-03-08 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | スペクトルパターンを利用することによってシヌソイドおよびスイープを効率的に合成するための装置および方法 |
Also Published As
Publication number | Publication date |
---|---|
KR950703231A (ko) | 1995-08-23 |
BR9405445A (pt) | 1999-09-08 |
JP3721582B2 (ja) | 2005-11-30 |
RU2131169C1 (ru) | 1999-05-27 |
EP1083674A3 (en) | 2001-04-11 |
PL173718B1 (pl) | 1998-04-30 |
DE69428030T2 (de) | 2002-05-29 |
EP1083674B1 (en) | 2003-04-16 |
KR100368854B1 (ko) | 2003-05-17 |
PL174314B1 (pl) | 1998-07-31 |
DE69428030D1 (de) | 2001-09-27 |
PL307740A1 (en) | 1995-06-12 |
DE69432538T2 (de) | 2004-04-01 |
US5765126A (en) | 1998-06-09 |
EP0663739A1 (en) | 1995-07-19 |
CN1217502C (zh) | 2005-08-31 |
EP0663739A4 (en) | 1998-09-09 |
EP1083674A2 (en) | 2001-03-14 |
DE69432538D1 (de) | 2003-05-22 |
CN1440144A (zh) | 2003-09-03 |
CN1113096A (zh) | 1995-12-06 |
CN1099777C (zh) | 2003-01-22 |
EP0663739B1 (en) | 2001-08-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3336617B2 (ja) | 信号符号化又は復号化装置,及び信号符号化又は復号化方法,並びに記録媒体 | |
US5765126A (en) | Method and apparatus for variable length encoding of separated tone and noise characteristic components of an acoustic signal | |
JP3203657B2 (ja) | 情報符号化方法及び装置,情報復化方法及び装置,情報伝送方法,並びに情報記録媒体 | |
JP3371590B2 (ja) | 高能率符号化方法及び高能率復号化方法 | |
JP3277692B2 (ja) | 情報符号化方法、情報復号化方法及び情報記録媒体 | |
EP0645769A2 (en) | Signal encoding or decoding apparatus and recording medium | |
JP3318931B2 (ja) | 信号符号化装置、信号復号化装置及び信号符号化方法 | |
JPH0846518A (ja) | 情報符号化方法及び復号化方法、情報符号化装置及び復号化装置、並びに情報記録媒体 | |
US6995699B2 (en) | Encoding method, and encoding apparatus, and decoding method and decoding apparatus | |
JP3685823B2 (ja) | 信号符号化方法及び装置、並びに信号復号化方法及び装置 | |
JP3475985B2 (ja) | 情報符号化装置および方法、情報復号化装置および方法 | |
JP3465697B2 (ja) | 信号記録媒体 | |
JP3465698B2 (ja) | 信号復号化方法及び装置 | |
JPH09135173A (ja) | 符号化装置および符号化方法、復号化装置および復号化方法、伝送装置および伝送方法、並びに記録媒体 | |
MXPA95004960A (en) | Method and information coding device, method and information decoding device, method of transmission of information, and means of registration of information | |
JPH11220402A (ja) | 情報符号化装置および方法、並びに提供媒体 | |
JPH10107640A (ja) | 信号再生装置および方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): BR CN JP KR PL RU US VN |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): DE FR GB |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1994919822 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 08392756 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 1994919822 Country of ref document: EP |
|
WWG | Wipo information: grant in national office |
Ref document number: 1994919822 Country of ref document: EP |