WO2005031705A1 - サンプリングレート変換装置、符号化装置、復号化装置、およびこれらの方法 - Google Patents

サンプリングレート変換装置、符号化装置、復号化装置、およびこれらの方法 Download PDF

Info

Publication number
WO2005031705A1
WO2005031705A1 PCT/JP2004/014215 JP2004014215W WO2005031705A1 WO 2005031705 A1 WO2005031705 A1 WO 2005031705A1 JP 2004014215 W JP2004014215 W JP 2004014215W WO 2005031705 A1 WO2005031705 A1 WO 2005031705A1
Authority
WO
WIPO (PCT)
Prior art keywords
spectrum
signal
code
frequency
frequency band
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2004/014215
Other languages
English (en)
French (fr)
Japanese (ja)
Inventor
Masahiro Oshikiri
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to CN2004800256756A priority Critical patent/CN1849647B/zh
Priority to US10/573,812 priority patent/US7756711B2/en
Priority to EP04788282A priority patent/EP1669981A4/en
Publication of WO2005031705A1 publication Critical patent/WO2005031705A1/ja
Anticipated expiration legal-status Critical
Priority to US12/708,290 priority patent/US8195471B2/en
Priority to US13/463,653 priority patent/US8374884B2/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Definitions

  • Sampling rate conversion device encoding device, decoding device, and methods thereof
  • the present invention relates to a sat transform apparatus, an encoding apparatus, a decoding apparatus, and methods thereof.
  • ADSL Asymmetric Digital Subscriber Line
  • W-CDMA Wideband-Code Division Multiple Access
  • ITU dnnationalization is a representative scheme for encoding narrowband signals.
  • G. 726, G. 729 equal power S standardized in the Telecommunication Union). Also, ITU-T (International) is a typical method for encoding wideband signals.
  • the scalable function is a function that can decode the audio signal even with some power of the coding code.
  • it is necessary to decode high-quality voice signals using all coding codes in good channel conditions, and transmit only part of the coding code in bad channel conditions. Can reduce the frequency of occurrence of packet loss.
  • effects such as the efficiency of network resources at the time of communication between multiple points can be obtained.
  • a signal of sampling rate power S8 kHz is encoded using the method of G. 726, G. 729, etc. standardized by ITU-T, and the error signal is further added at a sampling rate of 16 kHz.
  • coding it is possible to realize quality improvement and scalability by extending the signal band.
  • FIG. 1 is a block diagram showing a typical configuration of a coding device that performs scalable coding.
  • the number of layers N 3
  • the audio signal (audio signal, audio signal, etc.) input to the downsampling unit 12 through the input terminal 11 is downsampled from 32 kHz to 16 kHz, and the first layer code input unit 13 Given to The first layer coding unit 13 determines the first coding code so as to minimize the perceptible distortion between the input acoustic signal and the decoded signal generated after the coding.
  • This first encoded code is sent to the multiplexing unit 26 and also sent to the first layer decoding unit 14.
  • First layer decoding section 14 generates a first layer decoded signal using the first encoded code.
  • the up-sampling unit 15 up-samples the sampling frequency of the first layer decoded signal from 16 kHz to 24 kHz, and supplies this signal to the subtractor 18 and the adder 21.
  • the acoustic signal input to the downsampling unit 16 via the input terminal 11 is downsampled from 32 kHz to 24 kHz and supplied to the delay unit 17.
  • the delay unit 17 delays the down-sampled signal by a predetermined time length.
  • the subtractor 18 obtains the difference between the output signal of the delay unit 17 and the output signal of the up-sampling unit 15, generates a second layer residual signal, and is given to the second layer code input unit 19.
  • Second layer mark The coding unit 19 encodes the second layer residual signal so as to aurally improve the quality, determines a second coding code, and multiplexes the second coding code. 26 and second layer decoding unit 20 are provided.
  • the second layer decoding unit 20 performs a decoding process using the second code and the code to generate a second layer decoded residual signal.
  • the adder 21 takes the sum of the first layer decoded signal and the second layer decoded residual signal described above to generate a second layer decoded signal.
  • the up sampling unit 22 upsamples the sampling frequency of the second layer decoded signal from 24 kHz to 32 kHz and supplies this signal to the subtractor 24.
  • the acoustic signal input to the delay unit 23 through the input terminal 11 is delayed by a predetermined time length, and is provided to the subtractor 24.
  • the subtractor 24 takes the difference between the output signal of the delay unit 23 and the output signal of the upsampling unit 22 to generate a third layer residual signal.
  • This third layer residual signal is applied to third layer code block 25.
  • the third layer coding unit 25 performs code coding so that the third layer residual signal can be aurally improved in quality, determines the third coding code, and outputs the result to the multiplexing unit 26. Give that code a code.
  • the multiplexing unit 26 multiplexes the encoded code obtained from the first layer coding unit 13, the second layer coding unit 19, and the third layer coding unit 25, and outputs the multiplexed code through the output terminal 27. Do.
  • Patent Document 1 Japanese Patent Application Laid-Open No. 2000-68948
  • An object of the present invention is to provide a sampling rate conversion device and an encoding device capable of reducing the circuit scale and reducing the amount of processing operation of the code, and decoding the signal encoded by the encoding device. Apparatus, and methods of providing these. Means to solve the problem
  • the effective frequency band of the spectrum is expanded in the frequency domain to obtain a signal in the time domain.
  • ⁇ ⁇ get an equivalent signal to the case of up-sampling.
  • the sampling rate conversion device of the present invention converts the input time domain signal into the frequency domain to obtain the first spectrum, and extends the frequency band of the first spectrum obtained; And an insertion means for inserting the second spectrum into the expanded frequency band of the first space after expansion.
  • the signal equivalent to the up-sampled signal in the time domain is converted by converting the input time domain signal into the frequency domain signal and extending the frequency band of the obtained spectrum. You can get In addition, the circuit scale of the encoding device can be reduced, and the amount of processing operation of the code can also be reduced.
  • the code device of the present invention is a conversion means for obtaining a first spectrum of Na point by frequency analysis of an input sampling frequency signal of Fx with analysis length 2 ⁇ Na, and a first spectrum obtained
  • An extension means for extending the frequency band of Petal to Nb points and a second spectrum to be inserted in the expanded frequency band of the first spectrum after extension are specified, and a code representing the second spectrum is specified.
  • the encoding apparatus of the present invention has a configuration as described above, wherein the second spectrum is generated based on the first vector.
  • the second spectrum is obtained by performing frequency analysis on an input signal with a sampling frequency of Fy at a point 2'Nb, Na ⁇ k ⁇ It adopts a configuration that is determined to be similar to the spectrum included in the Nb frequency band.
  • the expanded spectrum can be determined based on the spectrum of the original signal. Since it is possible to obtain an extended spectrum with higher accuracy.
  • the coding means divides the frequency band of Na ⁇ k ⁇ Nb into two or more sub-bands, and
  • a configuration is adopted in which a coded code representing two spectra is output.
  • the encoding apparatus of the present invention has a configuration as described above, wherein the signal with the sampling frequency Fx is a signal decoded by a lower layer in a layer code.
  • the present invention can be applied to hierarchical coding configured by coding units of multiple layers, and hierarchical coding can be realized with only minimum sampling transformation.
  • the decoding apparatus of the present invention is an acquisition unit that analyzes a signal of sampling frequency Fx with analysis length 2'Na to acquire a first spectrum of a frequency band of 0 ⁇ k ⁇ Na, and a code Decoding means for receiving a second code and decoding a second spectrum in the Na ⁇ k ⁇ Nb frequency band, and combining the first and second spectra to obtain a 0 ⁇ ⁇ k ⁇ Nb frequency band And a conversion means for converting a spectrum included in a frequency band of 0 ⁇ k ⁇ Nb into a time domain signal.
  • the decoding apparatus of the present invention has a configuration as described above, wherein the second spectrum is generated based on a spectrum of a frequency band of 0 ⁇ k ⁇ Na.
  • the width of the frequency band of the combined spectrum obtained by the generation means is equal to a predetermined width after the combination.
  • the apparatus further comprises: a force for inserting a specified value in the high band part of the spectrum or means for discarding the high band part of the combined spectrum.
  • the decoding apparatus of the present invention has a configuration as described above, wherein the signal with the sampling frequency of Fx is a signal decoded by a lower layer in a layer code.
  • the circuit scale of the encoding device can be reduced, and the amount of processing operation for encoding can also be reduced.
  • FIG. 1 is a block diagram showing a typical configuration of a coder performing scalable coding.
  • FIG. 2 is a block diagram showing the main configuration of a spectrum coder according to a first embodiment.
  • FIG. 3A is a diagram showing the first spectrum
  • FIG. 3B is a diagram showing the spectrum after the effective frequency band is extended.
  • FIG. 4A Diagram for explaining in principle the effect of processing to extend the effective frequency band of extra space
  • FIG. 5 shows the main configuration of the wireless transmission device according to the first embodiment-[FIG. 6] shows the internal configuration of the code device according to the first embodiment-[FIG. 7] according to the first embodiment
  • Fig. 8 shows an internal configuration of a spectral code portion-[Fig. 8]
  • FIG. 9 shows the main configuration of the radio receiving apparatus according to the first embodiment-[FIG. 10] shows the internal configuration of the decoding apparatus according to the first embodiment-[FIG. 11] according to the first embodiment 12 shows the internal configuration of the spectrum decoding unit-[FIG. 12]
  • FIGS. 12A and 12B show the process performed by the band expanding unit according to the first embodiment.
  • FIG. 13 A diagram showing how a spectrum is generated through the processing in the coupling unit and time domain conversion unit according to the first embodiment.
  • FIG. 14A A block diagram showing the main configuration of the transmitting side when the coding apparatus according to Embodiment 1 is applied to a wired communication system.
  • FIG. 14B A block diagram showing the main configuration of the receiving side when the decoding apparatus according to Embodiment 1 is applied to a wired communication system.
  • FIG. 15 A main configuration of the decoding apparatus according to Embodiment 2 is shown-[FIG. 16] An internal configuration of a spectrum decoding unit according to Embodiment 2 is shown-[FIG. 17] Embodiment 2 Figure for explaining in more detail the processing of the correction unit according to
  • FIG. 18 A diagram for illustrating the process of the correction unit according to the second embodiment in more detail.
  • FIG. 19 A diagram for further explaining the operation of the spectrum decoding unit according to Embodiment 2.
  • FIG. 20A A diagram for further explaining the operation of the spectrum decoding unit according to Embodiment 2.
  • 20B A diagram for further explaining the operation of the spectrum decoding unit according to Embodiment 2.
  • FIG. 21 A diagram showing the main configuration of a communication system according to Embodiment 3.
  • FIG. 22 A diagram showing the main configuration of a communication system according to Embodiment 4.
  • FIG. 2 is a block diagram showing a main configuration of spectrum coder apparatus 100 according to Embodiment 1 of the present invention.
  • Spectrum coding apparatus 100 has sampling rate conversion section 101, input terminal 102, spectrum information identification section 106, and output terminal 107.
  • the sampling rate converter 101 further includes a frequency domain converter 103, a band expander 104, and an extended spectrum adder 105.
  • a signal sampled at the sampling rate Fx is input to the spectrum coding apparatus 100 via the input terminal 102.
  • Frequency domain conversion section 103 frequency-analyzes this signal by analysis length 2′Na.
  • a signal in the intermediate domain is converted to a signal in the frequency domain (frequency domain conversion) to calculate a first extra space Sl (k) (0 ⁇ k ⁇ Na).
  • the obtained first spectrum Sl (k) is given to the band expanding unit 104.
  • frequency analysis uses a modified discrete cosine transform (MDCT).
  • MD CT analysis is performed by superposing adjacent frames and analysis frames in half by half, and using the orthogonal basis where the first half of the analysis frame is an odd function and the second half is an even function, distortion between frames is canceled. It has the characteristic of being As a frequency analysis method, it is also possible to use discrete Fourier transform (DFT), discrete cosine transform (DCT) or the like.
  • DFT discrete Fourier transform
  • DCT discrete cosine transform
  • Extended spectrum assigning section 105 assigns an extended spectrum S1, (k) (Na ⁇ k ⁇ Nb) to which external power is also input to the frequency band expanded by band expanding section 104, and spectrum information identifying section 106. Output to
  • Spectrum information identification section 106 sets information required for identifying expanded spectrum S 1, (k) among the spectrums given from extended spectrum application section 105 as a code and output terminal 107. Output through.
  • This code ⁇ code is information representing the sub-band energy of the extended spectrum S1 '(k), information representing the effective frequency band, and the like. Details of this will also be described later.
  • FIG. 3A shows a first spectrum Sl (k) provided by frequency domain conversion section 103
  • FIG. 3B shows spectrum Sl (k) after the effective frequency band is expanded in band expansion section 104.
  • the band expanding unit 104 secures an area in which new spectrum information can be stored in a frequency band in which the frequency k of the first spectrum Sl (k) is expressed in the range of Na ⁇ k ⁇ Nb.
  • the size of this novel region is represented by Nb-Na.
  • Nb is a sampling rate Fx of an externally applied signal through input terminal 102.
  • the relationship between the analysis length 2'Na of the frequency domain transform unit 103 and the sampling rate Fy of the signal decoded by the decoding unit (not shown) is determined. Specifically, Nb is set by the following equation.
  • the sampling rate F y of the signal decoded by the decoding unit is determined by the following equation.
  • FIGS. 4A and 4B are diagrams for explaining in principle the effect of the process of extending the effective frequency band of the spectrum performed in band expanding section 104.
  • FIG. FIG. 4A shows a spectrum Sa (k) obtained when the signal of the sampling rate Fx is frequency-analyzed by the analysis length 2′Na.
  • the horizontal axis represents frequency, and the vertical axis represents spectral intensity.
  • the Nyquist theorem power of the effective frequency band of the signal is also 0 ⁇ FxZ2.
  • the range of the frequency index k is 0 ⁇ k ⁇ Na
  • the frequency resolution of the spectrum Sa (k) is FxZ (2′Na).
  • the spectrum Sb (k) obtained by frequency analysis with analysis length 2 ⁇ Nb is shown in Figure 4B
  • the effective frequency band of the signal is extended to 0-FyZ 2
  • the range of frequency index k is 0 ⁇ k ⁇ Nb.
  • the frequency resolution FyZ (2′Nb) of the scan vector Sb (k) is equal to FxZ (2′Na). That is The spectrum Sa (k) in the band 0 ⁇ k ⁇ Na and the spectrum Sb (k) are equal.
  • the spectrum Sb (k) when the band of the spectrum Sa (k) (0 ⁇ k ⁇ Na) is expanded to Nb is analyzed after the signal of sampling Fx is upsampled to the sampling Fy. It means that it agrees with the spectrum obtained by frequency analysis with a length 2 ⁇ ⁇ b.
  • sampling rate conversion section 101 converts an input time domain signal into a frequency domain signal, and extends the effective frequency band of the obtained spectrum to obtain a signal in the time domain.
  • a spectrum equivalent to the spectrum obtained by frequency conversion of the upsampled signal can be obtained.
  • sampling rate conversion unit 101 Since the signal output from sampling rate conversion unit 101 is a signal in the frequency domain, when a signal in the time domain is required, a time domain conversion unit is provided to perform reconversion to the time domain. Do. In the above example, since sampling rate conversion section 101 is installed in spectral code device 100, the signal in the frequency domain is input to spectrum information identification section 106 without being converted back to the time domain signal. , The encoding code is generated
  • the spectral information identification unit 106 outputs The code rate of the coded code is different. That is, part of the processing in the sampling rate conversion unit 101 has a great influence on coding. This means that the spectral code device 100 simultaneously realizes the conversion of the sampling rate of the input signal and the code ⁇ .
  • the process performed by force spectrum information identification unit 106 described with the case where extended spectrum is applied to the original spectrum in extended spectrum application unit 105 as an example is as follows: Since it is necessary to output the information necessary for specifying the extended spectrum as a code, it is sufficient if the extended spectrum to be given is specified, and the extended spectrum is not necessarily given. It doesn't have to be. Furthermore, although upsampling has been described as an example of sampling rate conversion, the above principle can also be applied to downsampling.
  • FIG. 5 is a block diagram showing the main configuration of radio transmitting apparatus 130 when encoding apparatus 120 according to the present embodiment is mounted on the transmitting side of the radio communication system.
  • the wireless transmission device 130 includes an encoding device 120, an input device 131, an AZD conversion device 132, an RF modulation device 133, and an antenna 134.
  • the input device 131 converts the sound wave W11 heard by the human ear into an analog signal which is an electrical signal, and outputs the analog signal to the AZD conversion device 132.
  • the AZD converter 132 converts this analog signal into a digital signal and outputs it to the coder 120 (signal Sl).
  • the encoder 120 encodes the input digital signal S1 to generate an encoded signal, and outputs the encoded signal to the RF modulator 133 (signal S2).
  • the RF modulator 133 modulates the coded signal S 2 to generate a modulated coded signal, and outputs the modulated coded signal to the antenna 134.
  • the antenna 134 transmits the modulation coded signal as the radio wave W12.
  • FIG. 6 is a block diagram showing an internal configuration of the above-mentioned coder device 120. As shown in FIG. Here, the case of hierarchical coding (scalable coding) will be described as an example.
  • Code decoding apparatus 120 includes input terminal 121, downsampling section 122, first layer code decoding section 123, first layer decoding section 124, delaying section 126, spectrum coding section 100a, multiplexing It has a part 127 and an output terminal 128.
  • the acoustic signal S1 of the sampling rate Fy is input to the input terminal 121.
  • the downsampling unit 122 downsamples the signal S1 input through the input terminal 121 to generate and output a signal of the sampling rate Fx.
  • the first layer code unit 123 codes this down-sampled signal, and outputs the obtained code unit code to the multiplexing unit (multiplexer) 127, and also performs first layer decoding. Also output to the part 124.
  • the first layer decoding unit 124 generates a decoded signal of the first layer based on the coded code.
  • the delay unit 126 gives a delay of a predetermined length to the signal S 1 input via the input terminal 121.
  • the magnitude of this delay is equal to the time delay that occurs when the signal passes through the downsampling unit 122, the first layer coding unit 123, and the first layer decoding unit 124.
  • Spectrum coding section 100 a is a sample signal output from first layer decoding section 124. Spectrum coding is performed using the signal S3 of the coding rate Fx and the signal S4 of the sampling rate Fy output from the delay unit 126, and the generated code code S5 is output to the multiplexing unit 127.
  • the multiplexing unit 127 multiplexes the code obtained with the first layer code 123 and the code S5 obtained with the spectral code 10a, and outputs the output terminal 128 as an output code S2. Output through.
  • This output code S2 is given to the RF modulator 133
  • FIG. 7 is a block diagram showing an internal configuration of the above-described spectrum coding unit 100a.
  • the spectrum coding unit 100a has the same basic configuration as the spectrum coding unit 100 shown in FIG. 2, and the same components are denoted by the same reference numerals. Omit.
  • a feature of spectrum encoding section 100a is that extended spectrum 31 '&) ⁇ & ⁇ 1 ⁇ ⁇ ? ⁇ ) Is applied using the input signal S3 at sampling rate Fy. That's right. According to this, since the target signal for determining the extended spectrum Sl '(k) is given, the accuracy of the extended spectrum Sl' (k) is improved, and as a result, the effect of leading to the quality improvement is obtained.
  • the frequency domain conversion unit 112 performs frequency analysis on the signal S4 of the sampling rate Fy input through the input terminal 111 with an analysis length of 2 ⁇ Nb, and the second spectrum S 2 (k) (0 ⁇ k ⁇ Nb Ask for).
  • the relationship represented by (Expression 1) holds for the sampling frequencies Fx and Fy and the analysis lengths Na and Nb.
  • Spectrum information identifying section 106 determines a coding code representing extended spectrum SI ′ (k).
  • the expanded spectrum S1 ′ (k) is determined using the second vector S2 (k) obtained by the frequency domain conversion unit 112.
  • the spectrum information identification unit 106 determines the code ⁇ code through two steps of determining the shape of the expanded spectrum S 1, (k) and determining the gain of the expanded spectrum S 1, (k).
  • the expanded spectrum Sl '(k) is determined using the band 0 ⁇ k ⁇ Na of the first spectrum Sl (k).
  • the first spectrum Sl (k) separated by a fixed value C on the frequency axis is copied to the expanded spectrum S1, (k).
  • C is a predetermined fixed value, and it is necessary to satisfy the condition of C ⁇ Na.
  • information for representing the shape of the extended spectrum Sl '(k) is not output as a code.
  • the fixed value C has a predetermined range T.
  • the value T of the variable T when similar may be output as part of the sign code.
  • the expanded spectrum S 1, (k) is expressed by the following equation.
  • the gain of the extended spectrum Sl, (k) is determined to coincide with the band of the band Na ⁇ k ⁇ Nb of the second spectrum S2 (k). Specifically, the deviation V of the beam is calculated according to the following equation, and an index obtained by quantizing this value is output as a code ⁇ code from the output terminal 107.
  • the extended spectrum SI ′ (k) may be divided into a plurality of sub-bands, and the code and code may be determined independently for each sub-band.
  • the expanded spectrum SI ′ (k) may be divided into a plurality of sub-bands, and the code and code may be determined independently for each sub-band.
  • T In the step of determining the shape of Sl, (k), T, represented by (Eq. 4) may be determined for each subband and output as a code ⁇ code, or a common T 'may be output. Only one decision may be made and output as a code / code. Then, in the step of determining the gain of the expanded spectrum Sl '(k), the deviation V (j) of the beam is calculated for each subband, and this value is calculated. The index obtained by denitrification is output as a code code via the output terminal 107. The amount of variation of power per subband is expressed by the following equation.
  • j represents a subband number
  • BL (j) is a frequency index corresponding to the minimum frequency of the j-th subband
  • BH (j) is a frequency corresponding to the maximum frequency of the j-th subband Represents an index.
  • a scalable function can be realized by outputting a code / code for each sub-band as described above.
  • the mode for LPC analysis of the signal of the sampling rate Fy as shown in FIG. But it may be 10 Ob). That is, the signal of the sampling rate Fy can be subjected to LPC analysis to obtain LPC coefficients, and the LPC coefficients can be used to determine the extended spectrum Sl '(k). In this configuration, LPC coefficients can be DFT converted into spectral information, and this spectrum can be used to determine the extended spectrum S1 '(k).
  • the circuit scale of the coding apparatus can be reduced, and the amount of processing calculation of code ⁇ can also be reduced.
  • LPF low pass filter
  • sampling rate conversion alone causes a delay of 5 ms or more.
  • the occurrence of such a delay is a problem when it is applied to two-way voice communication, the reaction of the other party seems to be delayed.
  • the cutoff characteristic can be made steep even if the order is relatively small, and the delay does not increase as much as the FIR filter.
  • IIR filters it is impossible to design filters such as FIR filters in which the amount of delay occurring at all frequencies is constant. This means that, in the scalable code system, when the input signal power is also subjected to the sampling rate conversion, the input signal must be given a certain amount of delay according to the time delay of the signal after the sampling rate conversion.
  • the IIR LPF is used, the amount of delay with respect to the frequency is not constant, which causes the problem that the subtraction processing can not be performed accurately.
  • the coding apparatus according to the present embodiment can solve these problems that occur in scalable coding systems.
  • FIG. 9 is a block diagram showing a main configuration of radio reception apparatus 180 that receives a signal transmitted from radio transmission apparatus 130. Referring to FIG. 9
  • the wireless receiver 180 includes an antenna 181, an RF demodulator 182, a decoder 170, a DZA converter 183, and an output device 184.
  • the antenna 181 receives a digital encoded acoustic signal as the radio wave W12, generates a digital reception code ⁇ acoustic signal of the electric signal, and provides the same to the RF demodulator 182.
  • the RF demodulator 182 demodulates the received coded acoustic signal from the antenna 181 to generate a demodulated coded acoustic signal S 11 and supplies it to the decoder 170.
  • Decoding device 170 receives the digital demodulation coded acoustic signal S11 from RF demodulator 182, performs decoding processing to generate digital decoded acoustic signal S12, and outputs it to DZA converter 183. give.
  • the DZA converter 183 converts the digital decoded audio signal S12 from the decoder 170 into an analog decoded audio signal and supplies it to the output device 184.
  • the output device 184 converts an analog decoded audio signal, which is an electrical signal, into air vibration and outputs it as a sound wave W13 so as to be heard by the human ear.
  • FIG. 10 is a block diagram showing an internal configuration of the decoding device 170 described above. Again, the floor The case of decoding a layer-coded signal will be described as an example.
  • the decoding device 170 includes an input terminal 171, a separating unit 172, a first layer decoding unit 173, a vector decoding unit 150, and an output terminal 176.
  • the code S 11 hierarchically coded from the RF demodulator 182 is input to the input terminal 171.
  • the separation unit 172 separates the demodulated coded acoustic signal S11 input through the input terminal 171, and uses the code code for the first layer decoding unit 173 and the code code for the spectrum decoding unit 150.
  • Generate First layer decoding section 173 decodes the decoded signal of sampling rate Fx using the encoded code obtained by demultiplexing section 172, and outputs decoded signal S 13 to spectrum decoding section 150.
  • Spectrum decoding unit 150 performs spectrum decoding, which will be described later, on code S 14 separated by separation unit 172 and signal S 13 of sampling rate Fx generated by first layer decoding unit 173. To generate a decoded signal S12 of the sampling rate Fy and output it via the output terminal 176.
  • FIG. 11 is a block diagram showing an internal configuration of spectrum decoding section 150 described above.
  • Spectrum decoding section 150 includes input terminals 152 and 153, frequency domain conversion section 154, band extension section 155, decoding section 156, coupling section 157, time domain conversion section 158, and output terminal 1 59. Have.
  • the signal S 13 sampled at the sampling rate Fx is input to the input terminal 152. Further, a code ⁇ code S14 related to the extended spectrum Sl '(k) is input to the input terminal 153.
  • the frequency domain conversion unit 154 performs frequency analysis on the time domain signal S13 input from the input terminal 152 with an analysis length of 2′Na to calculate a first spectrum Sl (k).
  • the frequency analysis method uses a modified discrete cosine transform (MDCT). MDCT performs analysis by overlapping the previous and next adjacent frames with the analysis frame by half, and using the orthogonal basis where the first half of the analysis frame is an odd function and the second half is an even function. It is characterized by being The first spectrum Sl (k) thus obtained is given to the band extending unit 155.
  • MDCT discrete Fourier transform
  • DCT discrete cosine transform
  • the region where the spectrum can be given is secured so that the band of the first spectrum Sl (k) is 0 ⁇ k ⁇ Nb.
  • the first spectrum Sl (k) whose band has been expanded is output to the combining unit 157.
  • decoding section 156 decodes code code S14 related to expanded spectrum Sl '(k) input through input terminal 153 to obtain expanded spectrum Sl' (k), and combines them. Output to section 157.
  • the combining unit 157 combines the first spectrum Sl (k) given by the band extending unit 155 with the extended spectrum Sl ′ (k). This coupling is realized by inserting the extended spectrum Sl '(k) in the band Na ⁇ k ⁇ Nb of the first spectrum Sl (k). The first spectrum S l (k) obtained by this processing is output to the time domain conversion unit 158.
  • the time domain conversion unit 158 performs time domain conversion processing corresponding to the inverse conversion of the frequency domain conversion performed by the spectral encoding unit 100a, and performs multiplication and superposition addition of appropriate window functions to obtain time. Generate signal S12 of the region. The signal S12 in the time domain generated in this manner is output as a decoded signal through the output terminal 159.
  • FIG. 12A shows the first spectrum Sl (k) provided by the frequency domain transform unit 154.
  • FIG. 12B shows a spectrum obtained as a result of the processing of the band expanding unit 155, and an area where new spectrum information can be stored is secured in a band in which the frequency k is represented by Na ⁇ k ⁇ Nb.
  • the size of this novel region is expressed as Nb-Na.
  • Nb represents the relationship between the sampling rate Fx of the signal supplied from the input terminal 152, the analysis length 2 ⁇ Na of the frequency domain conversion unit 154, and the sampling rate Fy of the signal decoded by the spectrum decoding unit 150. Depending on the following equation, Nb can be set.
  • the sampling rate Fy of the signal decoded by the spectral decoding unit 150 is determined by the following equation. [Number 8]
  • FIG. 13 is a diagram showing how a spectrum is processed by combining section 157 and time domain conversion section 158 and a decoded signal is generated.
  • Coupling section 157 inserts extension spectrum 31 and &) ⁇ & ⁇ 1 ⁇ ⁇ ? ⁇ ) Into the band of Na ⁇ k ⁇ Nb of the first spectrum Sl (k) whose band is extended. And sends the first spectrum 31 &) (0 ⁇ 1 ⁇ ⁇ ? ⁇ ) Obtained by this process to the time domain conversion unit 158.
  • the decoding apparatus of the present embodiment it is possible to decode a signal encoded by the coding apparatus according to the present embodiment.
  • the encoding apparatus or the decoding apparatus according to the present embodiment is applied to a wireless communication system
  • the coding apparatus or the decoding apparatus according to the present embodiment is described.
  • the standardization device can also be applied to a wired communication system as described below.
  • FIG. 14A is a block diagram showing the main configuration on the transmitting side when the coding apparatus according to the present embodiment is applied to a wired communication system.
  • the same components as those shown in FIG. 5 are denoted by the same reference numerals, and the description thereof will be omitted.
  • Wired transmission apparatus 140 includes coding apparatus 120, input apparatus 131, and AZD conversion apparatus 13. 2 and the output is connected to the network N1.
  • the input terminal of the AZD conversion device 132 is connected to the output terminal of the input device 131.
  • the input terminal of the code device 120 is connected to the output terminal of the AZD converter 132.
  • the output terminal of the code device 120 is connected to the network N1.
  • the input device 131 converts the sound wave W11 that can be heard by the human ear into an analog signal that is an electrical signal and supplies the analog signal to the AZD conversion device 132.
  • the AZD converter 132 converts an analog signal into a digital signal and supplies the digital signal to the coder 120.
  • the coder unit 120 codes the input digital signal to generate a code, and outputs the code to the network N1.
  • FIG. 14B is a block diagram showing the main configuration on the receiving side when the decoding apparatus according to the present embodiment is applied to a wired communication system.
  • the same components as shown in FIG. 9 are assigned the same reference numerals and descriptions thereof will be omitted.
  • Wired receiver 190 has receiver 191 connected to network N 1, decoder 170, DZA converter 183, and output device 184.
  • An input terminal of the receiving device 191 is connected to the network N1.
  • the input terminal of the decoder 170 is connected to the output terminal of the receiver 191.
  • the input terminal of the DZA converter 183 is connected to the output terminal of the decoder 170.
  • the input terminal of the output device 184 is connected to the output terminal of the DZA converter 183.
  • Receiving device 191 receives the digital coded audio signal from network N 1, generates a digital received audio signal, and provides it to decoding device 170.
  • Decoding apparatus 170 receives the received acoustic signal from receiving apparatus 191, performs decoding processing on the received acoustic signal, generates a digital decoded audio signal, and provides DZA converting apparatus 183.
  • the DZA conversion unit 183 converts the digital decoded speech signal from the decoding unit 170 to generate an analog decoded speech signal and supplies it to the output unit 184.
  • the output device 184 converts an analog decoded acoustic signal, which is an electrical signal, into air vibration and outputs it as a sound wave W13 so as to be heard by the human ear.
  • FIG. 15 is a block diagram showing the main configuration of decoding apparatus 270 according to Embodiment 2 of the present invention.
  • Decoding apparatus 270 has the same basic configuration as decoding apparatus 170 shown in FIG. 10, and the same components are denoted by the same reference numerals and the description thereof will be omitted. .
  • the feature of the present embodiment is that the maximum frequency index Nb of the first spectrum Sl (k) (0 ⁇ k ⁇ Nb) after combination is corrected to a desired value Nc, so that the desired sampling rate can be obtained. To generate a decoded signal.
  • Spectrum decoding section 250 includes coding code S 14 separated by separation section 172, signal S 13 of sampling rate F x generated by first layer decoding section 173, and input terminal 27 1. Spectral decoding is performed using the coefficient Nc (signal S21) input via the interface. Then, the obtained decoded signal of the sampling rate Fy is output through the output terminal 176.
  • FIG. 16 is a block diagram showing an internal configuration of spectrum decoding section 250 described above.
  • Coefficient Nc input through input terminal 271 is applied to correction unit 251 and time domain conversion unit 158 a.
  • the correction unit 251 determines an effective band of the first spectrum Sl (k) (0 ⁇ k ⁇ Nb) given by the combining unit 157 based on the coefficient Nc given by the input terminal 271 (signal S21). Correct to 0 ⁇ k ⁇ N c. Then, the first spectrum Sl (k) (0 ⁇ k ⁇ Nc) after band correction is given to the time domain conversion unit 158a.
  • the time domain conversion unit 158 a performs the first spectrum S l (k) (0 ⁇ k ⁇ Nc) given from the correction unit 251 under the analysis length 2 ⁇ Nc according to the coefficient Nc given via the input terminal 271. ) Is subjected to conversion processing, multiplication and superposition addition of appropriate window functions are performed, and a signal of time domain is generated and output through an output terminal 159.
  • FIGS. 17 and 18 are diagrams for describing the process of the correction unit 251 in more detail.
  • FIG. 17 shows the process of the correction unit 251 in the case of Nc Nb.
  • the band of the first spectrum Sl (k) (signal S21) given by the coupling unit 157 is 0 ⁇ k ⁇ Nb. That
  • the correction unit 251 deletes the spectrum in the range of Nc ⁇ k ⁇ Nb so that the band of the first spectrum Sl (k) is 0 ⁇ k ⁇ Nc.
  • the first spectrum Sl (k) (0 ⁇ k ⁇ Nc) (signal S22) obtained as a result is supplied to the time domain conversion unit 158a, and the time domain decoded signal S23 is generated.
  • FIG. 18 similarly shows the processing in the case of force Nc> Nb, which is the processing of the correction unit 251.
  • the band of the first spectrum Sl (k) (signal S25) given from the coupling unit 251 is 0 ⁇ k ⁇ Nb as in FIG.
  • the correction unit 251 extends the Nb ⁇ k ⁇ Nc band so that the band of the first spectrum Sl (k) is 0 ⁇ k ⁇ Nc, and assigns a specific value (for example, a zero value) to the region.
  • the first spectrum Sl (k) (0 ⁇ k ⁇ Nc) (signal S26) obtained as a result is applied to the time domain conversion unit 158a, and a time domain decoded signal S27 is generated.
  • spectrum decoding section 250 will be further described using FIG. 19, FIG. 20A and FIG. 20B.
  • the code / code input via the input terminal 153 fluctuates for each frame. That is, in the band of the first spectrum Sl (k) output from the combining unit 157, 0 ⁇ k ⁇ Na (band R1), 0 ⁇ k ⁇ Nb1 (band R2), 0 ⁇ as shown in FIG. It is assumed that there are three bands of k ⁇ Nb2 (band R3) (however, NaNb ⁇ Nb2), and one of these bands is selected for each frame.
  • FIG. 20A is a diagram for explaining the operation of spectrum decoding section 250 when coefficient Nc is equal to Nb2
  • FIG. 20B is a diagram for explaining the operation of spectrum decoding section 250 when coefficient Nc is equal to Nbl. is there.
  • the band of the spectrum obtained in the i-th frame is any one of Rl, R2, and R3.
  • processing 1 inserts a zero value into the band of Nbl ⁇ k ⁇ Nb2
  • processing 2 inserts a zero value into the band of Na ⁇ k ⁇ Nb2
  • processing 3 removes the band of Nbl ⁇ k ⁇ Nb2
  • processing 4 represents the processing of inserting a zero value into the band of Na ⁇ k ⁇ Nbl.
  • the correcting unit 251 performs the first spectrum Sl (k) (0 ⁇ k ⁇ Nb2) is output to the time domain conversion unit 158a.
  • the spectral band is R2, that is, the band of the first spectrum S l (k) is 0 ⁇ k ⁇ Nb 1. Therefore, the correction unit 251
  • the first spectrum Sl (k) (0 ⁇ k ⁇ Nb2) is converted to a time domain conversion unit after the band of the one-vector Sl (k) is expanded to Nb2 and a zero value is inserted in the band Output to 158a.
  • the correction unit 251 selects the first spectrum Sl. After expanding the band of (k) to Nb2 and inserting a zero value in the range of Na ⁇ k ⁇ Nb2, the first spectrum Sl (k) (0 ⁇ k ⁇ Nb2) is converted to the time domain conversion unit 158a. Output to
  • the band of the spectrum is R2 in the second frame to the fourth frame and the ninth frame, that is, the band of the first spectrum Sl (k) is 0 ⁇ k ⁇ Nbl
  • the first spectrum S l (k) (0 ⁇ k ⁇ Nb 1) is output to the time domain conversion unit 158 a without any processing.
  • the spectral band is R3, that is, the band of the first spectrum Sl (k) is 0 ⁇ k ⁇ Nb2, and therefore the correction unit 251 After removing the band of Nbl) k ⁇ Nb2, the first spectrum Sl (k) (0 ⁇ k ⁇ Nbl) is output to the time domain conversion unit 158a.
  • the correction unit 251 determines the first spectrum Sl After expanding the band of (k) to Nbl and inserting a zero value into the band of Na ⁇ k ⁇ Nbl, the first spectrum Sl (k) (0 ⁇ k ⁇ Nbl) is converted to the time domain conversion unit 158a. Output to
  • FIG. 21 is a diagram showing the main configuration of a communication system according to Embodiment 3 of the present invention.
  • the feature of the present embodiment is to cope with the case where the effective frequency band of the first extra frame Sl (k) received on the receiving side fluctuates with time depending on the condition of the communication network (communication environment). It is to be.
  • Layer code section 301 performs the layer coding process described in Embodiment 1 on the input signal of sampling rate Fy, and generates a scalable coded code.
  • the generated code ⁇ code power band 0 ⁇ ⁇ ⁇ ⁇ k ⁇ information about Ne (R 31), band Ne ⁇ k ⁇ Nf (R 32), and band N f ⁇ k ⁇ Ng (R 33) It shall be composed.
  • the hierarchical code unit 301 supplies the code unit code to the network control unit 302.
  • the network control unit 302 transfers, to the hierarchy decoding unit 303, the coding sequence provided by the hierarchy coding unit 301.
  • the network control unit 302 discards part of the coding code transferred to the hierarchical decoding unit 303 according to the state of the network. Therefore, the coded code input to the hierarchical decoding unit 303 is the coded code composed of the information R31 to R33 and the coded code of the information R33 when there is no code to be discarded.
  • Hierarchical decoding section 303 applies the hierarchical decoding method shown in Embodiment 1 or 2 to a given coded code to generate a decoded signal.
  • the sampling rate of the decoded signal can be set by a desired coefficient Nc, and the sampling rate Fz of the decoded signal is FyNc /. It becomes Ng.
  • Embodiment 4 even if the effective frequency band of the first extraneous signal Sl (k) received on the receiving side varies with time depending on the conditions of the communication network, The side can stably obtain a decoded signal of a desired sampling rate.
  • FIG. 22 is a diagram showing the main configuration of a communication system according to Embodiment 4 of the present invention.
  • the feature of the present embodiment is that one code ⁇ code generated by one layer code unit can be decoded for a plurality of layer decoding units having different sampling rates (different in decoding ability) that can respectively decode. Even if they are transmitted simultaneously, the receiving side is to respond to this and obtain decoded signals of different sampling rates.
  • Layer code section 401 applies the coding process shown in the first embodiment to the input signal of sampling rate Fy to generate a scalable coded code.
  • the generated code ⁇ code is information on band 0 ⁇ ⁇ k ⁇ Nh (R41), information on band Nh ⁇ k ⁇ Ni (R42), information on band Ni ⁇ k ⁇ Nj (R43) ⁇ It shall consist of this.
  • Layer code section 401 applies this encoded code to first layer decoding section 402-1, second layer decoding section 402-2 and third layer decoding section 402-3. .
  • the first layer decoding unit 402-1, the second layer decoding unit 402-2, and the third layer decoding unit 402-2 are implemented for the given encoded code.
  • a decoded signal is generated by applying the hierarchical decoding scheme shown in the first embodiment or the second embodiment.
  • the sampling rate F2 of this decoded signal is Fy'NiZNj.
  • the sampling rate F3 of this decoded signal is Fy'NhZNj.
  • the transmitting side can transmit the code without considering the decoding capability of the receiving side, it is possible to reduce the load on the communication network. Can.
  • decoded signals of these multiple types of sampling rates have a simple configuration and few
  • V can be generated with the amount of operation.
  • the code device or the decoding device according to the present invention can also be installed in a communication terminal device and a base station device in a mobile communication system, and thereby has the same operation effect as described above.
  • a communication terminal apparatus and a base station apparatus can be provided.
  • the encoding apparatus and the decoding apparatus according to the present invention have an effect of realizing scalable encoding with a simple configuration and a small amount of calculation, and can be applied to the use of a communication system such as an IP network.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
PCT/JP2004/014215 2003-09-30 2004-09-29 サンプリングレート変換装置、符号化装置、復号化装置、およびこれらの方法 Ceased WO2005031705A1 (ja)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CN2004800256756A CN1849647B (zh) 2003-09-30 2004-09-29 采样速率转换装置、编码装置、解码装置及它们的方法
US10/573,812 US7756711B2 (en) 2003-09-30 2004-09-29 Sampling rate conversion apparatus, encoding apparatus decoding apparatus and methods thereof
EP04788282A EP1669981A4 (en) 2003-09-30 2004-09-29 DEVICE FOR IMPLEMENTING THE ABDESTRATE, CODING DEVICE, DECODING DEVICE AND METHOD THEREFOR
US12/708,290 US8195471B2 (en) 2003-09-30 2010-02-18 Sampling rate conversion apparatus, coding apparatus, decoding apparatus and methods thereof
US13/463,653 US8374884B2 (en) 2003-09-30 2012-05-03 Decoding apparatus and decoding method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2003341717A JP4679049B2 (ja) 2003-09-30 2003-09-30 スケーラブル復号化装置
JP2003-341717 2003-09-30

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US11/573,812 A-371-Of-International US7520604B2 (en) 2004-08-16 2004-10-21 Multi-purpose eyeglasses
US12/708,290 Continuation US8195471B2 (en) 2003-09-30 2010-02-18 Sampling rate conversion apparatus, coding apparatus, decoding apparatus and methods thereof

Publications (1)

Publication Number Publication Date
WO2005031705A1 true WO2005031705A1 (ja) 2005-04-07

Family

ID=34386230

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2004/014215 Ceased WO2005031705A1 (ja) 2003-09-30 2004-09-29 サンプリングレート変換装置、符号化装置、復号化装置、およびこれらの方法

Country Status (5)

Country Link
US (3) US7756711B2 (enExample)
EP (2) EP1669981A4 (enExample)
JP (1) JP4679049B2 (enExample)
CN (2) CN103177730B (enExample)
WO (1) WO2005031705A1 (enExample)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2500043C2 (ru) * 2004-11-05 2013-11-27 Панасоник Корпорэйшн Кодер, декодер, способ кодирования и способ декодирования
FR2888699A1 (fr) 2005-07-13 2007-01-19 France Telecom Dispositif de codage/decodage hierachique
US8295507B2 (en) 2006-11-09 2012-10-23 Sony Corporation Frequency band extending apparatus, frequency band extending method, player apparatus, playing method, program and recording medium
JPWO2008084688A1 (ja) * 2006-12-27 2010-04-30 パナソニック株式会社 符号化装置、復号装置及びこれらの方法
JP4708446B2 (ja) * 2007-03-02 2011-06-22 パナソニック株式会社 符号化装置、復号装置およびそれらの方法
JP5294713B2 (ja) * 2007-03-02 2013-09-18 パナソニック株式会社 符号化装置、復号装置およびそれらの方法
US9327193B2 (en) 2008-06-27 2016-05-03 Microsoft Technology Licensing, Llc Dynamic selection of voice quality over a wireless system
KR101381513B1 (ko) 2008-07-14 2014-04-07 광운대학교 산학협력단 음성/음악 통합 신호의 부호화/복호화 장치
EP2447943A4 (en) * 2009-06-23 2013-01-09 Nippon Telegraph & Telephone CODING METHOD, DECODING METHOD AND PROGRAM FOR USING THESE METHODS
BE1019445A3 (fr) * 2010-08-11 2012-07-03 Reza Yves Procede d'extraction d'information audio.
AU2011288406B2 (en) 2010-08-12 2014-07-31 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Resampling output signals of QMF based audio codecs
CN102610231B (zh) * 2011-01-24 2013-10-09 华为技术有限公司 一种带宽扩展方法及装置
US9767822B2 (en) 2011-02-07 2017-09-19 Qualcomm Incorporated Devices for encoding and decoding a watermarked signal
US9767823B2 (en) 2011-02-07 2017-09-19 Qualcomm Incorporated Devices for encoding and detecting a watermarked signal
CN103650037B (zh) * 2011-07-01 2015-12-09 杜比实验室特许公司 采样率可分级的无损音频编码
US8711943B2 (en) * 2011-07-21 2014-04-29 Luca Rossato Signal processing and tiered signal encoding
EP2777042B1 (en) * 2011-11-11 2019-08-14 Dolby International AB Upsampling using oversampled sbr
US9905236B2 (en) 2012-03-23 2018-02-27 Dolby Laboratories Licensing Corporation Enabling sampling rate diversity in a voice communication system
GB201210373D0 (en) * 2012-06-12 2012-07-25 Meridian Audio Ltd Doubly compatible lossless audio sandwidth extension
CN103971691B (zh) * 2013-01-29 2017-09-29 鸿富锦精密工业(深圳)有限公司 语音信号处理系统及方法
EP4336500B8 (en) * 2014-04-17 2025-11-26 VoiceAge EVS LLC Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates
EP2980794A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder using a frequency domain processor and a time domain processor
EP2980795A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor
TWI566241B (zh) * 2015-01-23 2017-01-11 宏碁股份有限公司 語音信號處理裝置及語音信號處理方法
US20170054510A1 (en) * 2015-08-17 2017-02-23 Multiphy Ltd. Electro-optical finite impulse response transmit filter
EP3382702A1 (en) 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for determining a predetermined characteristic related to an artificial bandwidth limitation processing of an audio signal
CN107886966A (zh) * 2017-10-30 2018-04-06 捷开通讯(深圳)有限公司 终端及其优化语音命令的方法、存储装置
US10824917B2 (en) 2018-12-03 2020-11-03 Bank Of America Corporation Transformation of electronic documents by low-resolution intelligent up-sampling

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998057436A2 (en) 1997-06-10 1998-12-17 Lars Gustaf Liljeryd Source coding enhancement using spectral-band replication
JP2001356788A (ja) * 2000-06-14 2001-12-26 Kenwood Corp 周波数補間装置、周波数補間方法及び記録媒体

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE4343366C2 (de) * 1993-12-18 1996-02-29 Grundig Emv Verfahren und Schaltungsanordnung zur Vergrößerung der Bandbreite von schmalbandigen Sprachsignalen
US5610942A (en) * 1995-03-07 1997-03-11 Chen; Keping Digital signal transcoder and method of transcoding a digital signal
JP3139602B2 (ja) * 1995-03-24 2001-03-05 日本電信電話株式会社 音響信号符号化方法及び復号化方法
JP3283413B2 (ja) * 1995-11-30 2002-05-20 株式会社日立製作所 符号化復号方法、符号化装置および復号装置
DE19706516C1 (de) * 1997-02-19 1998-01-15 Fraunhofer Ges Forschung Verfahren und Vorricntungen zum Codieren von diskreten Signalen bzw. zum Decodieren von codierten diskreten Signalen
JP4132154B2 (ja) * 1997-10-23 2008-08-13 ソニー株式会社 音声合成方法及び装置、並びに帯域幅拡張方法及び装置
EP0957579A1 (en) * 1998-05-15 1999-11-17 Deutsche Thomson-Brandt Gmbh Method and apparatus for sampling-rate conversion of audio signals
JP2000068943A (ja) 1998-08-17 2000-03-03 Hitachi Ltd 光伝送装置
KR20000047944A (ko) * 1998-12-11 2000-07-25 이데이 노부유끼 수신장치 및 방법과 통신장치 및 방법
US6226616B1 (en) 1999-06-21 2001-05-01 Digital Theater Systems, Inc. Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility
DE19947019A1 (de) * 1999-09-30 2001-06-07 Infineon Technologies Ag Verfahren und Vorrichtung zum Erzeugen von spreizcodierten Signalen
EP1298643B1 (en) * 2000-06-14 2005-05-11 Kabushiki Kaisha Kenwood Frequency interpolating device and frequency interpolating method
US7623496B2 (en) * 2001-04-24 2009-11-24 Intel Corporation Managing bandwidth in network supporting variable bit rate
US6895375B2 (en) * 2001-10-04 2005-05-17 At&T Corp. System for bandwidth extension of Narrow-band speech
KR100935961B1 (ko) 2001-11-14 2010-01-08 파나소닉 주식회사 부호화 장치 및 복호화 장치
JP3926726B2 (ja) * 2001-11-14 2007-06-06 松下電器産業株式会社 符号化装置および復号化装置
US20030108108A1 (en) 2001-11-15 2003-06-12 Takashi Katayama Decoder, decoding method, and program distribution medium therefor
JP2003216199A (ja) * 2001-11-15 2003-07-30 Matsushita Electric Ind Co Ltd 復号装置、復号方法及びプログラム供給媒体
JP2003241799A (ja) * 2002-02-15 2003-08-29 Nippon Telegr & Teleph Corp <Ntt> 音響符号化方法、復号化方法、符号化装置、復号化装置及び符号化プログラム、復号化プログラム
FI116498B (fi) * 2002-09-23 2005-11-30 Nokia Corp Kaistanleveyden mukauttaminen
KR100499047B1 (ko) * 2002-11-25 2005-07-04 한국전자통신연구원 서로 다른 대역폭을 갖는 켈프 방식 코덱들 간의 상호부호화 장치 및 그 방법
US20040138876A1 (en) * 2003-01-10 2004-07-15 Nokia Corporation Method and apparatus for artificial bandwidth expansion in speech processing
KR100917464B1 (ko) * 2003-03-07 2009-09-14 삼성전자주식회사 대역 확장 기법을 이용한 디지털 데이터의 부호화 방법,그 장치, 복호화 방법 및 그 장치
US7272567B2 (en) * 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
SG163556A1 (en) * 2005-04-01 2010-08-30 Qualcomm Inc Systems, methods, and apparatus for wideband speech coding

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998057436A2 (en) 1997-06-10 1998-12-17 Lars Gustaf Liljeryd Source coding enhancement using spectral-band replication
JP2001356788A (ja) * 2000-06-14 2001-12-26 Kenwood Corp 周波数補間装置、周波数補間方法及び記録媒体

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
E. MOULINES; J. LAROCHE: "Non-parametric techniques for pitch-scale and time-scale modification of speech", SPEECH COMMUNICATION, vol. 16, 1995, pages 175 - 205, XP004024959, DOI: doi:10.1016/0167-6393(94)00054-E
OSHIKIRI M. ET AL.: "Jikan-shuhasu ryoiki no keisu no tekio sentaku vector ryoshika o mochiita 10 kHz taiiki scalable fugoka hoshiki", FIT 2003 JOHO KAGAKU GIJUTSU FORUM KOEN RONBUNSHU, vol. F-017, 25 August 2003 (2003-08-25), pages 239 - 240, XP002986229 *
OSHIKIRI M. ET AL.: "Pitch filtering ni yoru taiiki kakucho gijutsu o mochiita 7/10/15kHz taiiki scalable onsei fugoka hoshiki", THE ACOUSTICAL SOCIETY OF JAPAN 2004 NEN SHUNKI KENKYU HAPPYOKAI KOEN RONBUNSHU I, vol. 3-11-4, 17 March 2004 (2004-03-17), pages 327 - 328, XP002986230 *
See also references of EP1669981A4
TAKAMIZAWA Y. ET AL.: "MPEG-4 audio taiiki kakucho decoder software no kaihatsu", 2003 NEN THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS SOGO TAIKAI KOEN RONBUNSHU, vol. D-14-10, 3 March 2003 (2003-03-03), pages 177, XP002986228 *

Also Published As

Publication number Publication date
CN1849647B (zh) 2013-04-10
US7756711B2 (en) 2010-07-13
EP1669981A1 (en) 2006-06-14
JP2005107255A (ja) 2005-04-21
EP2172931A1 (en) 2010-04-07
CN103177730A (zh) 2013-06-26
EP1669981A4 (en) 2008-06-18
US8374884B2 (en) 2013-02-12
US8195471B2 (en) 2012-06-05
CN103177730B (zh) 2015-12-09
US20120221342A1 (en) 2012-08-30
CN1849647A (zh) 2006-10-18
JP4679049B2 (ja) 2011-04-27
US20060280271A1 (en) 2006-12-14
US20100161321A1 (en) 2010-06-24

Similar Documents

Publication Publication Date Title
JP4679049B2 (ja) スケーラブル復号化装置
JP4822843B2 (ja) スペクトル符号化装置、スペクトル復号化装置、音響信号送信装置、音響信号受信装置、およびこれらの方法
JP5013863B2 (ja) 符号化装置、復号化装置、通信端末装置、基地局装置、符号化方法及び復号化方法
EP1798724B1 (en) Encoder, decoder, encoding method, and decoding method
KR101835156B1 (ko) 신호 처리 장치 및 방법, 및 프로그램
JP5048697B2 (ja) 符号化装置、復号化装置、符号化方法、復号化方法、プログラムおよび記録媒体
US8738372B2 (en) Spectrum coding apparatus and decoding apparatus that respectively encodes and decodes a spectrum including a first band and a second band
US6415251B1 (en) Subband coder or decoder band-limiting the overlap region between a processed subband and an adjacent non-processed one
JP2000305599A (ja) 音声合成装置及び方法、電話装置並びにプログラム提供媒体
WO2005027095A1 (ja) 符号化装置および復号化装置
KR20060135699A (ko) 신호 복호화 장치 및 신호 복호화 방법
KR100389487B1 (ko) 정보부호화방법및장치,정보복호화방법및장치,및정보전송방법
JP5031006B2 (ja) スケーラブル復号化装置及びスケーラブル復号化方法
JP2006126826A (ja) オーディオ信号符号化/復号化方法及びその装置
JP3594829B2 (ja) Mpegオーディオの復号化方法

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200480025675.6

Country of ref document: CN

AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 1655/DELNP/2006

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2004788282

Country of ref document: EP

Ref document number: 2006280271

Country of ref document: US

Ref document number: 10573812

Country of ref document: US

WWP Wipo information: published in national office

Ref document number: 2004788282

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 10573812

Country of ref document: US