WO2013143221A1 - 信号编码和解码的方法和设备 - Google Patents
信号编码和解码的方法和设备 Download PDFInfo
- Publication number
- WO2013143221A1 WO2013143221A1 PCT/CN2012/075924 CN2012075924W WO2013143221A1 WO 2013143221 A1 WO2013143221 A1 WO 2013143221A1 CN 2012075924 W CN2012075924 W CN 2012075924W WO 2013143221 A1 WO2013143221 A1 WO 2013143221A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frequency domain
- signal
- domain signal
- frequency
- decoded
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 78
- 230000009466 transformation Effects 0.000 claims description 10
- 238000001453 impedance spectrum Methods 0.000 claims 2
- 238000001228 spectrum Methods 0.000 claims 2
- 230000003044 adaptive effect Effects 0.000 description 46
- 238000004458 analytical method Methods 0.000 description 13
- 230000005236 sound signal Effects 0.000 description 12
- 230000000694 effects Effects 0.000 description 11
- 238000012545 processing Methods 0.000 description 10
- 230000005540 biological transmission Effects 0.000 description 7
- 238000004891 communication Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L27/00—Modulated-carrier systems
- H04L27/26—Systems using multi-frequency codes
- H04L27/2601—Multicarrier modulation systems
- H04L27/2602—Signal structure
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L5/00—Arrangements affording multiple use of the transmission path
- H04L5/02—Channels characterised by the type of signal
- H04L5/06—Channels characterised by the type of signal the signals being represented by different frequencies
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Definitions
- Embodiments of the present invention relate to the field of domain communications and, more particularly, to a method and apparatus for signal encoding and decoding. Background technique
- encoding techniques are used at the transmitting end to compress the signals to be transmitted to improve transmission efficiency, and corresponding decoding techniques are used at the receiving end to recover the transmitted signals.
- the signal can be time domain coded and/or frequency domain coded depending on the characteristics of the signal, the transmission conditions, and the like. According to certain rules, different coding ratios are allocated for the time domain signal or the frequency domain signal, and it is desirable to characterize the signal to be transmitted with as few coded bits as possible. Therefore, it is necessary to allocate the coded bits reasonably so that the output signal is recovered as little as possible by decoding at the receiving end.
- the speech can have a better codec effect, but for music, the encoding and decoding effect is relatively poor.
- the input signal is encoded by a time domain coding method using partial bits; and, based on the input signal.
- the frequency domain signal is encoded using the remaining bits, the signal characteristics are usually not considered, and the frequency domain signal is bit-allocated in a uniform manner, which results in poor encoding of the partial frequency domain signal.
- the frequency domain signal is simply recovered by using a decoding technique corresponding to the encoding technique, and the undecoded frequency domain signal is filled with noise, and then inverse frequency domain transform and time domain synthesis are performed. Processing to obtain an output signal.
- the noise fill introduces additional noise in some of the signals, reducing the quality of the output signal.
- Embodiments of the present invention provide a method and a device for encoding and decoding a signal, which can optimize bit allocation of a frequency domain signal during encoding to achieve a better coding effect by using the same bit, and can be decoded during decoding.
- the frequency domain excitation signal is expanded to achieve a better output signal.
- a method for signal encoding comprising: obtaining a frequency domain signal according to an input signal; assigning a predetermined bit to the frequency domain signal according to a predetermined allocation rule; and frequency domain signal having a bit allocation When the highest frequency is greater than a predetermined value, the bit allocation of the frequency domain signal is adjusted; the frequency domain signal is encoded according to the bit allocation of the frequency domain signal.
- a method for signal decoding comprising: obtaining a decoded frequency domain signal from a received bitstream; and if the decoded frequency domain signal satisfies a predetermined condition, The undecoded frequency domain signal is predicted according to the decoded frequency domain signal; and the final output time domain signal is obtained according to the decoded frequency domain signal and the predicted undecoded frequency domain signal.
- an apparatus for signal encoding comprising: a frequency domain transform unit that obtains a frequency domain signal according to an input signal; and a bit allocation unit that allocates a predetermined bit to the frequency domain according to a predetermined allocation rule a bit adjustment unit that adjusts a bit allocation of the frequency domain signal when a highest frequency of a frequency domain signal having a bit allocation is greater than or equal to a predetermined value; and a frequency domain coding unit that allocates a frequency domain according to a bit of the frequency domain signal The signal is encoded.
- an apparatus for signal decoding comprising: a decoding unit, obtaining a decoded frequency domain signal from a received bitstream; and a spreading unit, configured to predict an undecoded frequency domain a signal, when the decoded frequency domain signal satisfies a predetermined condition, predicting an undecoded frequency domain signal according to the decoded frequency domain signal; and outputting the unit according to the decoded frequency domain signal and the predicted frequency The domain signal is used to obtain the final output time domain signal.
- the bit allocation of the frequency domain signal is adjusted by encoding according to the highest frequency of the frequency domain signal with bit allocation, and the frequency domain coding is performed with the same number of bits. Good coding effect;
- the decoded frequency domain signal is used as a guide to set the undecoded frequency domain signal, so that the output signal achieves better results.
- FIG. 1 illustrates a method of encoding a signal in accordance with an embodiment of the present invention
- FIG. 3 illustrates a method of decoding a signal in accordance with an embodiment of the present invention
- FIG. 4 illustrates a method of obtaining a decoded frequency domain signal from a received bit stream in a time-frequency joint decoding method
- Figure 5 illustrates an exemplary implementation of an encoding device and/or a decoding device in accordance with the present invention
- FIG. 6 illustrates an encoding device that encodes a signal in accordance with an embodiment of the present invention
- FIG. 8 illustrates an apparatus for decoding a signal in accordance with an embodiment of the present invention
- Figure 9 illustrates a block diagram of a decoding unit in time-frequency joint decoding. detailed description
- the coding technical solution and the decoding technical solution of the present invention can be applied to transmission and reception in various communication systems, such as: GSM, Code Division Multiple Access (CDMA), Wideband Code Division WCDMA (Wandaband Code Division Multiple Access Wireless), General Packet Radio Service (GPRS), Long Term Evolution (LTE), etc.
- GSM Global System for Mobile Communications
- CDMA Code Division Multiple Access
- WCDMA Wideband Code Division WCDMA
- GPRS General Packet Radio Service
- LTE Long Term Evolution
- Coding technology solutions and decoding technology solutions widely used in a variety of electronic devices, such as: mobile phones, wireless devices, personal data assistants (PDA), handheld or portable computers, GPS receivers / navigators, cameras, audio / video Players, camcorders, video recorders, surveillance equipment, etc. usually,
- PDA personal data assistants
- Such an electronic device includes an audio encoder or an audio decoder, and the audio encoder or decoder can be directly implemented by a digital circuit or a chip such as a DSP (digital signal processor), or the software code drives the processor to execute a process in the software code. achieve.
- DSP digital signal processor
- an audio time domain signal is first transformed into a frequency domain signal, and then a coded bit is allocated to an audio frequency domain signal for encoding, and the encoded signal is transmitted to a decoding end through a communication system.
- the decoding end decodes and recovers the encoded signal.
- FIG. 1 illustrates a method 100 of encoding a signal in accordance with an embodiment of the present invention. As shown in Figure 1, the method includes:
- the frequency domain signal is obtained according to the input signal.
- the input signals may be of various types such as image signals, data signals, audio signals, video signals, text signals, and the like.
- the frequency domain signal can be obtained by frequency-domain transforming the input signal by using an algorithm such as Fast Fourier Transform (FFT) or Discrete Cosine Transform (DCT).
- FFT Fast Fourier Transform
- DCT Discrete Cosine Transform
- the type of input signal and the frequency domain transform algorithm do not constitute a limitation of the present invention.
- the predetermined bit tot-bit is a bit to be used for frequency domain coding of a frequency domain signal.
- the predetermined allocation rule may be, for example, allocating a larger number of bits in a predetermined bit to a low frequency band signal in a frequency domain signal, and allocating remaining bits in a predetermined bit to a larger energy than the low frequency band signal Frequency band.
- the more bits may be allocated in the low band signal either equally for all low frequency bands or according to the energy distribution of the low band signals.
- the reason for allocating more bits for low-band signals is that, in speech-audio signals, for example, low-band signals typically contain more sensitive information from the human ear.
- the frequency domain signal is usually divided into subbands at equal intervals in frequency, or subbands are divided according to frequency domain coefficients, for example, one subband per 16 frequency domain coefficients. For example, for a wideband signal of 20 ms-frame, 160 coefficients in the frequency range of 0 ⁇ 4 kHz are divided into 10 sub-bands, wherein there are 5 sub-bands in the frequency range of 0 ⁇ 2 kHz and 5 sub-bands in the frequency range of 2 ⁇ 4 kHz. Then, bit allocation is performed for each subband.
- the number of bits of the IF bit is allocated, and the predetermined bit tot bit is subtracted from the IF bit to obtain the remaining bit rest bit, and according to the frequency range of 2 ⁇ 4 kHz
- the envelope size of each subband allocates the remaining bit rest bits to subbands in the frequency range of 2 ⁇ 4 kHz, each subband being 5 bits. Determine the number of subbands with bit allocation and bits according to the size of the rest_bits and the envelope of each subband
- the subband of the highest frequency band is allocated last-bin, and the remainder that cannot be divisible by 5 is equally distributed to each subband in the range of 0 ⁇ 2 kHz.
- the predetermined value B may be set according to an empirical value; in one embodiment, the number of bits of the predetermined bit tot_bit and the resolution of the frequency domain signal may be used (for example, there are 320 frequency domains in the 0 ⁇ 8 kHz bandwidth range) Coefficient) to determine the predetermined value B. In the case of a fixed bandwidth, the number of bits of the predetermined bit tot_bit is higher, and the predetermined value B is higher; the number of bits of the predetermined bit is fixed. When the bit is fixed, the higher the resolution of the frequency domain signal, the higher the predetermined value B.
- the predetermined value B can be determined only based on the number of bits tot_bit of the predetermined bit, and the more the number of bits tot_bit of the predetermined bit, the higher the predetermined value B.
- the predetermined value B is a preset upper limit frequency value. For example, empirically, after frequency domain transform of an input signal, a frequency domain signal having a frequency greater than the predetermined value is usually not allocated bits. Therefore, in a specific practice, the predetermined value B can be set to a frequency value lower than the highest frequency value of the frequency domain signal by a certain frequency, for example, set to 2.9 kHz, 3.2 kHz, 3.5 kHz, or the like. In other embodiments, the predetermined value B may also be determined based on other factors such as the frame length, the transform method employed, or the transform window length.
- the predetermined value B may be an index number of 20 subbands in a frequency range of 0 to 8 kHz, and the highest frequency of the frequency domain signal having bit allocation is also It can be represented by the index number of the subband in which the highest frequency is located.
- the predetermined value B and the highest frequency of the bit-assigned frequency domain signal are not limited to the frequency value, and may also be the index number of the sub-band. After reading the disclosure of the embodiments of the present invention, the engineering technician knows based on the practical conditions how to determine whether the highest frequency of the frequency domain signal having the bit allocation is greater than a predetermined value.
- the adjustment of the bit allocation of the frequency domain signal is described below. Depending on the type of the signal or the frequency domain characteristics, etc., it is possible to reduce a portion of the frequency domain signal that contributes less to the output of the decoding side, and correspondingly increase the bit frequency of the highest frequency with bit allocation and the frequency domain signal in the vicinity thereof. That is, the adjusting the bit allocation of the frequency domain signal may include: reducing the allocation of the frequency band of the frequency domain signal to which more bits are allocated. The number of bits, and the maximum frequency with bit allocation and the number of bits allocated by the frequency domain signals in the vicinity thereof are increased. For audio signals, the frequency band to which more bits are allocated is, for example, a low frequency band of 0 to 2 kHz. The following is an illustration of the adjustment of the bit allocation to the frequency domain signal.
- Adjustment example 1 The highest frequency with bit allocation is 4 kHz. If a sub-band bit in the range of 2 kHz to 4 kHz is allocated 0, 5 bits are allocated to this band until all sub-bands in the range of 2 kHz to 4 kHz are allocated to the number of bits. Assume that the additional number of bits in the range of 2 ⁇ 4 kHz is N blt . At this time, it is necessary to reduce N blt bits from the sub-bands in the range of 0 to 2 kHz.
- the algorithm used is, for example: reducing 1 bit per subband from all subbands (5 subbands) in the range of 0 ⁇ 2 kHz; then subtracting one of the highest frequency subbands; from the remaining 4 subbands Each sub-band is further reduced by 1 bit, then reduced by a sub-high frequency sub-band, and so on, until the reduced number of bits equals N blt .
- Adjustment example 2 Add J bits to all subbands of the allocated bits in the range of 2 kHz to 4 kHz. If the number of subbands with bit allocation in the range of 2 to 4 kHz is K, then additional bits in the range of 2 to 4 kHz are added at this time.
- an algorithm that can be used is: averaging N blt /5 bits per subband from all subbands (5 subbands) in the range of 0 to 2 kHz.
- the algorithm used can be: Adjust the algorithm in Example 1 and adjust any of the algorithms in Example 2.
- FIG. 2 illustrates a time-frequency joint encoding method 200 in accordance with an embodiment of the present invention.
- 220, 230, 240 are the same as 120, 130, 140 in Fig. 1, respectively. 2 differs from FIG. 1 in that steps 250, 260 are added, and 110 in FIG. 1 is replaced with 211 and 212.
- steps 250, 260 are added, and 110 in FIG. 1 is replaced with 211 and 212.
- the differences between Fig. 2 and Fig. 1 will be described hereinafter, and the same points will not be repeated.
- obtaining a first time domain signal and a second time domain signal by performing time domain analysis on the input signal For example, linear inputive coding (LPC) analysis and processing of the input signal yields one of Line Spectral Frequency (LSF) parameters and Immittance Spectral Frequency (ISF) parameters, and is also disabled.
- LSF Line Spectral Frequency
- ISF Immittance Spectral Frequency
- the difference signal res and the adaptive codebook contribute to exc_pit.
- the LSF parameter or ISF parameter is used to indicate the frequency domain characteristics of the coefficients (i.e., LPC coefficients) used in the LPC analysis.
- the residual signal res and the adaptive codebook contribution exc_pit are included in the first time domain signal, and the adaptive codebook contribution exc_pit is included in the second time domain signal.
- the residual signal res and the adaptive codebook contribution exc_pit in the first time domain signal are respectively subjected to frequency domain transform, and then the residual signal f_res in the frequency domain and the adaptive codebook in the frequency domain contribute f_
- the correlation of exc_pit is used to determine whether the adaptive codebook contribution contributes to the output signal. If the adaptive codebook contribution contributes to the output signal, the frequency domain adaptive codebook contribution f_exc_pit is subtracted from the frequency domain residual signal f_res to obtain the frequency domain difference signal f_diff, and The difference signal f_diff is used as the frequency domain signal. If the adaptive codebook contribution does not contribute to the output signal, the residual signal f_res in the frequency domain is directly used as the difference signal f-diff, that is, the frequency domain signal.
- the frequency domain signals are encoded by the same 220, 230, 240 as 120, 130, 140 in Fig. 1, and the encoded frequency domain signals are obtained.
- the time domain signal may be encoded using any time domain coding method such as predictive coding, Pulse Code Modulation (PCM) coding, etc., and the time domain coding method employed does not constitute a limitation of the present invention.
- PCM Pulse Code Modulation
- the adaptive codebook contribution contributes to the output signal, the adaptive codebook contribution needs to be obtained at the decoding end, so the adaptive codebook contribution exc_pit in the second time domain signal is encoded to be transmitted as a bit stream to the reception. end.
- the adaptive codebook contribution does not contribute to the output signal, that is, the output of the decoder does not require an adaptive codebook contribution, then the time domain coding of the portion is not required, thereby improving coding efficiency.
- the adaptive codebook contribution contributes to the output signal means decoding The terminal cannot obtain a high quality output signal based only on the encoded frequency domain signal.
- the frequency domain signal to be subjected to frequency domain coding may include other signals, such as a flag flag indicating whether the adaptive codebook contribution contributes to the output signal, in addition to the difference signal f-diff.
- the second time domain signal to be time domain coded may include, in addition to the adaptive codebook contribution exc_pit, other information needed for decoding.
- the bit allocation of the frequency domain signal is adjusted according to the highest frequency of the frequency domain signal with bit allocation, and combined with the time domain coding, thereby achieving a better coding effect.
- FIG. 3 illustrates a method 300 of decoding a signal in accordance with an embodiment of the present invention.
- the method 300 includes:
- the decoded frequency domain signal is obtained from the received bitstream by using a frequency domain decoding method corresponding to the frequency domain coding method.
- the decoded frequency domain signal is obtained from the received bitstream by performing frequency domain decoding on the frequency domain information in the bitstream to obtain a first frequency domain signal; according to the first frequency domain The signal determines whether there is a time domain coded signal contributing to the output signal in the bit stream; when it is determined that there is a time domain coded signal contributing to the output signal in the bit stream, time domain coded signal is time domain decoded and frequency domain Transforming to obtain a second frequency domain signal, and synthesizing the first frequency domain signal and the second frequency domain signal to obtain the decoded frequency domain signal, which will be described in further detail below in conjunction with FIG.
- the decoded frequency domain signal When the decoded frequency domain signal satisfies a predetermined condition, predicting the undecoded frequency domain signal according to the decoded frequency domain signal.
- the decoded frequency domain signal satisfies predetermined conditions, including: the highest frequency of the decoded frequency domain signal is greater than a predetermined value, and the decoded frequency domain signal includes a frequency domain transformed contribution to the output signal. At least one of the time domain coded signals.
- the decoded frequency domain signal may be applied first, including a frequency domain transformed time domain coded signal that contributes to the output signal, and then the highest frequency of the decoded frequency domain signal is greater than
- the judgment condition of the predetermined value, or the order of the reverse order, may also be used only as described above in connection with 130 of FIG. 1, the predetermined value is a reservation according to frequency domain coding.
- the number of bits is determined by the bit-to-bit and resolution of the frequency domain signal. According to practical needs, the predetermined value can be set to a frequency value lower than the highest frequency value of the frequency domain signal by a certain frequency.
- the predetermined value may be an index number of the subband, and the highest frequency of the frequency domain signal having the bit allocation at this time also uses the subband of the highest frequency domain.
- the index number indicates.
- the value of the predetermined value of the decoding end may be the same as or different from the value of the predetermined value of the encoding end.
- the bit stream is decoded at 310 to obtain a time domain coded signal that may include a frequency domain transform and contributes to an output signal, and the frequency domain transformed
- the time domain coded signal contributing to the output signal is, for example, a signal obtained by time domain decoding and frequency domain transform of time domain coded information contained in the bit stream, such as contribution to an adaptive codebook.
- the time domain coded signal that contributes to the output signal after the frequency domain transformation may be in addition to the adaptive codebook. Other signals than contributions.
- the decoded frequency domain signal includes an adaptive codebook contribution
- whether the decoded frequency domain signal includes a frequency may be learned according to whether the adaptive codebook contributes a flag flag that contributes to the output signal.
- the decoded frequency domain signal includes a time domain coded signal that contributes to the output signal after frequency domain transformation, which means that it is difficult to obtain high quality output by frequency domain decoding only, and the characteristics of the audio signal are Simply setting the undecoded frequency domain signal to noise degrades the output signal quality, requiring prediction of the undecoded frequency domain signal.
- a frequency domain signal of a frequency band selected from a highest frequency of the decoded frequency domain signal to a low frequency may be selected according to the selected
- the frequency domain signal is used to predict the undecoded frequency domain signal. For example, for a signal with a frame length of 20 ms and a sample rate of 12.8 kHz, the frequency domain coefficient is 256 and the bandwidth is 6.4 kHz. At 7.6 kbps, there are 16 subbands for every 16 coefficients, and a total of 16 subbands are reserved. The value is set to 10 (4 kHz).
- the undecoded frequency domain coefficients in the range of 4 to 6.4 kHz are predicted by the frequency domain coefficients decoded in the range of 1.6 to 4 kHz.
- the undecoded frequency domain signal can be predicted by performing normalization processing, envelope processing, or the like on the selected frequency domain signal.
- the implementation of the normalization process and the envelope process is a means known to those skilled in the art and will not be described in detail herein.
- the method can predict the undecoded frequency domain signal.
- the undecoded frequency domain signal can also be predicted according to the frequency domain signal of the fixed frequency band in the decoded frequency domain signal.
- the ISF parameter or the LSF parameter from the encoding end may be adopted.
- the predicted undecoded frequency domain coefficients are corrected.
- the formant position is estimated by LSF parameters or ISF parameters; the frequency domain coefficients with larger amplitudes are scaled at each estimated formant position.
- a threshold which may be set according to characteristics of the time domain analysis of the encoding end
- decrease near the position of the formant The magnitude of the predicted frequency domain coefficient.
- noise is used to predict the undecoded frequency domain signal.
- the decoded frequency domain signal is obtained by decoding, and the undecoded frequency domain signal is predicted, thereby obtaining the frequency domain signal in the entire frequency band, by performing, for example, Inverse Fast Fourier Transform (IFFT)
- IFFT Inverse Fast Fourier Transform
- the inverse frequency transform or the like is processed to obtain an output signal in the time domain.
- the ISF parameter or the LSF parameter is transformed to obtain an LPC coefficient, and the LPC coefficient is used to perform time domain synthesis on the signal obtained after inverse frequency domain transformation to obtain a time domain of the final output. signal.
- IFFT Inverse Fast Fourier Transform
- the output signal is better. effect.
- the decoding method according to an embodiment of the present invention is applied in a time-frequency joint decoding scheme.
- the subsequent operations are the same as those described in connection with Fig. 3 except for the step of obtaining the decoded frequency domain signal (310) from the received bit stream. Therefore, only how to obtain the decoded frequency domain signal in the time-frequency joint decoding method will be described below.
- the method 410 includes: 411: Demultiplex the bitstream into a first set of bits and a second set of bits. Upon decoding at the receiving end, upon receiving the bitstream, the bitstream is demultiplexed into a first set of bits and a second set of bits using a demultiplexing technique corresponding to the multiplexing technique of 260 of FIG.
- the first set of bits includes frequency domain information to be subjected to frequency domain decoding as described below
- the second set of bits includes a time domain coded signal that contributes to the output signal to be subjected to the following time domain decoding.
- the first set of bits includes, for example, a difference signal f-diff, a flag flag indicating whether the adaptive codebook contribution contributes to the output signal, and the like.
- the second set of bits includes an adaptive codebook contribution, for example, when the adaptive codebook contribution contributes to the output signal. It is noted that the first set of bits and the second set of bits may also include other signals corresponding to the encoding of the signals.
- the fourth12 Perform frequency domain decoding on the first group of bits to obtain a first frequency domain signal, and determine, according to the first frequency domain signal, whether a time domain coded signal that contributes to the output signal exists in the bit stream.
- the first set of bits is decoded by a decoding method corresponding to the frequency domain encoding method at the encoding end to obtain a first frequency domain signal.
- the first frequency domain signal includes, for example, a decoded difference signal f-diff, and a flag flag indicating whether the adaptive codebook contribution contributes to the output signal.
- the second set of bits is decoded by a decoding method corresponding to the time domain encoding method of the encoding end to obtain a decoded time domain signal. Specifically, when it is determined that there is a time domain coded signal contributing to the output signal in the bit stream, the time domain coded signal in the second group of bits is time domain decoded.
- the frequency is synthesized by adding the difference signal f_diff in the first frequency domain signal and the adaptive codebook contribution in the second frequency domain signal. Domain signal.
- the difference signal f_diff in the first frequency domain signal is directly used as the frequency domain signal.
- the present invention further provides an encoding device and a decoding device, which may be located in a terminal device, a network device, or a testing device.
- the compilation The code device or the decoding device may be implemented by a hardware circuit or by software in conjunction with hardware.
- Figure 5 illustrates an exemplary implementation of an encoding device and/or a decoding device in accordance with the present invention.
- the encoding device or decoding device 530 is invoked by a processor 510 via the input/output interface 520 to effect encoding or decoding of the audio signal with the aid of the memory 540.
- the encoding device or decoding device 530 can perform various methods and processes in the above method embodiments.
- FIG. 6 illustrates an encoding device 600 that encodes a signal in accordance with an embodiment of the present invention.
- the encoding device 600 includes: a frequency domain transform unit 610 that obtains a frequency domain signal according to an input signal; a bit allocation unit 620 that allocates a predetermined bit to the frequency domain signal according to a predetermined allocation rule; and a bit adjustment unit 630 that has a bit allocation When the highest frequency of the frequency domain signal is greater than or equal to a predetermined value, the bit allocation of the frequency domain signal is adjusted; the frequency domain encoding unit 640 encodes the frequency domain signal according to the adjusted bit allocation.
- the frequency domain transform unit 610 can obtain a frequency domain signal based on the input signal.
- the input signal can be various types of signals such as image signals, data signals, audio signals, video signals, text signals, and the like.
- the frequency domain signal can be obtained by performing frequency domain transform on the input signal by using an algorithm such as FFT or DCT.
- the type of input signal and the frequency domain transform algorithm do not constitute a limitation of the present invention.
- Bit allocation unit 620 can assign a predetermined bit tot-bit to the frequency domain signal in accordance with a predetermined allocation rule.
- the tot bit is the number of bits to be used to encode the frequency domain signal.
- the predetermined allocation rule may be, for example, allocating a larger number of the predetermined bits to a low frequency band signal in a frequency domain signal, and allocating remaining bits in the predetermined bit to energy other than the low frequency band signal Large frequency band.
- the more bits may be allocated in the low frequency band signal for all low frequency bands or according to the energy distribution of the low frequency band signals.
- the reason for allocating more bits for low-band signals is that audio signals such as speech are mainly concentrated in the low frequency range in the frequency domain, and allocating more bits to them can improve the efficiency of frequency domain coding.
- the frequency domain signal in the frequency range of 0 to 4 kHz is divided into 10 subbands, wherein the frequency range of 0 to 2 kHz is as described above in connection with 120 of FIG. There are 5 sub-bands inside, and there are 5 sub-bands in the frequency range of 2 ⁇ 4kHz. Then, bit allocation is performed for each subband. A number of bits of the IF-bit are allocated for the low-frequency frequency domain signals in the frequency range of 0 to 2 kHz.
- the remaining bits rest-bit (tot-bit minus IF-bit) are allocated to sub-bands in the frequency range of 2 ⁇ 4 kHz according to the envelope of each sub-band in the frequency range of 2 ⁇ 4 kHz. Specifically, according to rest_bits The subband of the high frequency band is last-bin, and the remainder that cannot be divisible by 5 is equally distributed to each subband in the range of 0 ⁇ 2 kHz.
- the bit adjustment unit 630 may adjust the bit allocation of the frequency domain signal when the highest frequency of the frequency-domain signal having the bit allocation is greater than or equal to a predetermined value B.
- the predetermined value B is determined based on the number of bits tot_bit of the predetermined bit and the resolution of the frequency domain signal (e.g., 4 kHz).
- the predetermined value is a preset upper limit frequency value.
- the predetermined value B may be a frequency value lower than a highest frequency value (e.g., 4 kHz) of the frequency domain signal by a certain frequency, for example, 2.9 kHz, 3.2 kHz, 3.5 kHz, or the like.
- the predetermined value B may be an index number (for example, 7 or 8) of 10 subbands in a frequency range of 0 to 4 kHz,
- the highest frequency of the frequency domain signal having the bit allocation is also represented by the index number iridex of the subband in which the highest frequency is located.
- the bit adjustment unit 630 may adjust the bit allocation of the frequency domain signal by the bit allocation unit 620 according to a predetermined allocation rule when the highest frequency is greater than or equal to a predetermined value. Depending on the type of the input signal or the frequency domain characteristics of the frequency domain signal, etc., it is possible to reduce a portion of the frequency domain signal that contributes less to the output of the decoding end, and correspondingly increase the highest frequency with bit allocation and the frequency in the vicinity thereof. Bit allocation of the domain signal. As an example, the bit adjustment unit 630 may reduce the number of bits allocated by the frequency band to which more bits are allocated in the frequency domain signal, and increase the highest frequency with bit allocation and the number of bits allocated by the frequency domain signal in the vicinity thereof. . For audio signals, the frequency band to which more bits are allocated is, for example, a low frequency band of 0 to 2 kHz.
- the frequency domain encoding unit 640 encodes the frequency domain signal according to the adjusted bit allocation.
- the method of encoding the frequency domain signal may be, for example, transform coding, subband coding, or the like. Further, when the highest frequency is less than a predetermined value, the bit adjustment unit 630 does not adjust the bit allocation of the frequency domain signal. at this time, code.
- the time-frequency joint coding apparatus 700 includes: a time domain analysis unit 711, which obtains a first time domain signal and a second time domain signal by performing time domain analysis on the input signal; and the frequency domain transform unit 712 performs the first time domain signal Frequency domain transform and processing to obtain a frequency domain signal; bit allocation unit 720, assigning a predetermined bit to the frequency domain signal according to a predetermined allocation rule; and bit adjusting unit 730, the highest frequency of the frequency domain signal having bit allocation is greater than or equal to a predetermined value Adjusting the bit allocation of the frequency domain signal; the frequency domain encoding unit 740 encodes the frequency domain signal according to the adjusted bit allocation; the time domain encoding unit 750 encodes the second time domain signal; the bit multiplexing unit 760.
- the coded frequency domain signal and the encoded second time domain signal are multiplexed into a bit stream.
- bit allocation unit 720, the bit adjustment unit 730, and the frequency domain coding unit 740 are the same as the bit allocation unit 620, the bit adjustment unit 630, and the frequency domain coding unit 640 in FIG. 6, respectively.
- 7 is different from FIG. 6 in that the time domain coding unit 750, the bit multiplexing unit 760 are added, and the frequency domain transform unit 610 in FIG. 6 is replaced by the time domain analysis unit 711 and the frequency domain transform unit 712.
- the differences between Fig. 7 and Fig. 6 will be described hereinafter, and the same points will not be repeated.
- the time domain analyzing unit 711 obtains the first time domain signal and the second time domain signal by performing time domain analysis on the input signal. For example, LPC analysis and processing of the input signal yields ISF parameters (or LSF parameters), residual signal res, and adaptive codebook contribution exc_pit. The residual signal res and the adaptive codebook contribution exc_pit are used as the first time domain signal, and the adaptive codebook contribution exc_pit is used as the second time domain signal.
- ISF parameters or LSF parameters
- residual signal res and the adaptive codebook contribution exc_pit are used as the first time domain signal
- the adaptive codebook contribution exc_pit is used as the second time domain signal.
- the frequency domain transform unit 712 can obtain the frequency domain signal by performing frequency domain transform and processing on the first time domain signal.
- the residual signal res and the adaptive codebook contribution exc_pit in the first time domain signal are respectively subjected to frequency domain transform, and then the residual signal f_ res in the frequency domain and the adaptive codebook contribution in the frequency domain are f – The correlation of exc_pit to determine if the adaptive codebook contribution contributes to the output signal. If the adaptive codebook contribution contributes to the output signal, the frequency domain adaptive codebook contribution f_exc_pit is subtracted from the frequency domain residual signal f_res to obtain the frequency domain difference signal f_diff, and The difference signal f_diff is included in the Frequency domain signal.
- the residual signal f_res of the frequency domain is directly used as the difference signal f_diff for transmission as a frequency domain signal.
- the frequency domain signal may include other signals in addition to the difference signal f-diff, such as a flag flag indicating whether the adaptive codebook contribution contributes to the output signal.
- bit allocation unit 720 and the bit adjustment unit in FIG. 7 are utilized.
- the frequency domain coding unit 740 encodes the frequency domain signal to obtain the encoded frequency domain signal.
- the time domain encoding unit 750 can encode the second time domain signal.
- the time domain signal may be encoded using a time domain coding method such as predictive coding, pulse code modulation, and the like.
- a time domain coding method such as predictive coding, pulse code modulation, and the like.
- an adaptive codebook contribution is required at the decoding end, so the adaptive codebook contribution exc_pit in the second time domain signal is encoded for transmission to the receiving end.
- the bit multiplexing unit 760 can multiplex the encoded frequency domain signal and the encoded second time domain signal into a bit stream.
- the bit allocation of the frequency domain signal is adjusted according to the highest frequency of the frequency domain signal having the bit allocation, and combined with the time domain coding, thereby achieving better coding. effect.
- FIG. 8 illustrates a decoding device 800 that decodes a signal in accordance with an embodiment of the present invention.
- the decoding device 800 includes: a decoding unit 810, which obtains a decoded frequency domain signal from a received bitstream; a spreading unit 820, configured to predict an undecoded frequency domain signal, where the decoded frequency domain signal satisfies In the case of a predetermined condition, the undecoded frequency domain signal is predicted based on the decoded frequency domain signal; and the output unit 830 obtains the final output time domain signal based on the decoded frequency domain signal and the predicted frequency domain signal.
- the decoding unit 810 can obtain the decoded frequency domain signal from the received bit stream.
- the decoded frequency domain signal is obtained from the received bit stream by using a frequency domain decoding method corresponding to the frequency domain coding method.
- the decoding unit 810 can obtain the decoded frequency domain signal from the received bit stream by performing frequency domain decoding on the frequency domain information in the bitstream to obtain the first frequency domain signal;
- a frequency domain signal determines whether there is a time domain coded signal contributing to the output signal in the bit stream; when it is determined that there is a time domain coded signal contributing to the output signal in the bit stream, time domain coded signal is time domain decoded And frequency domain transform to obtain a second frequency domain signal, and synthesizing the first frequency domain signal and the second frequency domain signal to obtain the decoded frequency domain signal, which This will be described in detail below in conjunction with FIG.
- the spreading unit 820 can be used to predict undecoded frequency domain signals.
- the spreading unit 820 can predict the undecoded frequency domain signal based on the decoded frequency domain signal.
- the decoded frequency domain signal satisfies predetermined conditions, including: the highest frequency of the decoded frequency domain signal is greater than a predetermined value, and the decoded frequency domain signal includes a frequency domain transformed contribution to the output signal. At least one of the time domain coded signals. In practice, you can make choices as needed. The resolution of the tot-bit and frequency domain signals is determined.
- the predetermined value can be set to a frequency value lower than the highest frequency value of the frequency domain signal by a certain frequency according to practical needs.
- the predetermined value may be an index number of the subband, and the highest frequency of the frequency domain signal having the bit allocation at this time also uses the subband of the highest frequency domain. The index number indicates.
- the decoded frequency domain signal obtained by decoding the bit stream in the decoding unit 810 may include obtaining time domain decoding and frequency domain transform in the time domain information included in the bit stream.
- the signal which for example contributes to the adaptive codebook.
- Whether the frequency domain signal includes a time domain coded signal that contributes to the output signal after the frequency domain transformation may be known according to whether the adaptive codebook contributes a flag flag that contributes to the output signal.
- the time domain coded signal that contributes to the output signal after the frequency domain transformation may be other signals.
- the decoded frequency domain signal includes a signal obtained by performing time domain decoding and frequency domain transform on the time domain information included in the bit stream, which indicates that the undecoded frequency domain signal includes information useful for output, thereby requiring Predicting the undecoded frequency domain signal and simply setting the undecoded frequency domain signal to noise degrades the output signal quality.
- the spreading unit 820 may set the undecoded frequency domain signal to noise when the decoded frequency domain signal does not satisfy the predetermined condition.
- the spreading unit 820 may start a frequency domain signal of a frequency band selected from a highest frequency of the decoded frequency domain signal to a low frequency. And selecting the selected frequency domain signal as described above to predict the undecoded frequency domain signal based on the selected frequency domain signal.
- the output frequency domain signal may, for example, predict the undecoded frequency domain signal based on the frequency domain signal of the fixed frequency band in the decoded frequency domain signal.
- the output unit 830 can obtain the final output time domain signal based on the decoded frequency domain signal and the predicted frequency domain signal. After the undecoded frequency domain signal is predicted, the frequency domain signal in the entire frequency band is obtained, and the frequency domain inverse transform of the entire bandwidth is performed by using an inverse transform of the frequency domain transform used in the encoding, Thereby the output signal of the time domain is obtained. As described above, the output unit can obtain the final output time domain signal for output by performing time domain synthesis on the signal after inverse frequency domain transformation using the LPC coefficient obtained from the ISF parameter (or LSF parameter).
- the undecoded frequency domain signal is set by using the decoded frequency domain signal as a guide, so that the output signal reaches the output signal. Good results.
- Figure 9 illustrates a block diagram of decoding unit 910 in time-frequency joint decoding.
- the decoding unit 910 includes: a demultiplexing unit 911, which demultiplexes the bit stream into a first group of bits and a second group of bits; and a frequency domain decoding unit 912 that performs frequency domain decoding on the first group of bits to obtain a first frequency domain signal.
- the time domain decoding unit 913 when determining that there is a time domain coded signal contributing to the output signal in the bit stream Performing time domain decoding on the second group of bits; the frequency domain transforming unit 914 performs frequency domain transform on the decoded time domain signal to obtain a second frequency domain signal; the synthesizing unit 915, the first frequency domain signal and the second The frequency domain signal is synthesized to obtain a decoded frequency domain signal.
- the disclosed apparatus and method may be implemented in other ways.
- the device embodiments described above are only schematic.
- the division of the unit is only a logical function division.
- there may be another division manner for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not executed.
- each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
- the functions, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium.
- the technical solution of the present invention which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including
- the instructions are used to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
- the foregoing storage medium includes: a U disk, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and the like, which can store program codes. .
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (17)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ES12873219.5T ES2655832T3 (es) | 2012-03-29 | 2012-05-23 | Método y dispositivo de codificación y descodificación de señal |
EP19191869.7A EP3664085B1 (en) | 2012-03-29 | 2012-05-23 | Signal coding and decoding methods and devices |
MX2014011605A MX339652B (es) | 2012-03-29 | 2012-05-23 | Metodos y dispositivos de codificacion y descodificacion de señal. |
JP2015502053A JP6006400B2 (ja) | 2012-03-29 | 2012-05-23 | 信号符号化の方法および装置 |
EP17160983.7A EP3249645B1 (en) | 2012-03-29 | 2012-05-23 | Signal coding and decoding methods and devices |
CA2866202A CA2866202C (en) | 2012-03-29 | 2012-05-23 | Signal coding and decoding methods and devices |
EP12873219.5A EP2809009B1 (en) | 2012-03-29 | 2012-05-23 | Signal encoding and decoding method and device |
MYPI2014002473A MY189975A (en) | 2012-05-21 | 2012-05-23 | Signal coding and decoding methods and devices |
SG11201405216SA SG11201405216SA (en) | 2012-03-29 | 2012-05-23 | Signal coding and decoding methods and devices |
KR1020147026193A KR101621641B1 (ko) | 2012-03-29 | 2012-05-23 | 신호 코딩 및 디코딩 방법 및 장치 |
BR112014023577A BR112014023577B8 (pt) | 2012-03-29 | 2012-05-23 | Método e dispositivo de codificação de sinal de áudio e método e dispositivo de decodificação de sinal de áudio. |
RU2014142255/08A RU2592412C2 (ru) | 2012-03-29 | 2012-05-23 | Способы и устройства кодирования и декодирования сигналов |
ZA2014/06424A ZA201406424B (en) | 2012-03-29 | 2014-09-01 | Signal coding and decoding methods and devices. |
US14/496,986 US9537694B2 (en) | 2012-03-29 | 2014-09-25 | Signal coding and decoding methods and devices |
US15/358,649 US9786293B2 (en) | 2012-03-29 | 2016-11-22 | Signal coding and decoding methods and devices |
US15/684,079 US9899033B2 (en) | 2012-03-29 | 2017-08-23 | Signal coding and decoding methods and devices |
US15/864,147 US10600430B2 (en) | 2012-03-29 | 2018-01-08 | Signal decoding method, audio signal decoder and non-transitory computer-readable medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210087702.9A CN103368682B (zh) | 2012-03-29 | 2012-03-29 | 信号编码和解码的方法和设备 |
CN201210087702.9 | 2012-03-29 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/496,986 Continuation US9537694B2 (en) | 2012-03-29 | 2014-09-25 | Signal coding and decoding methods and devices |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013143221A1 true WO2013143221A1 (zh) | 2013-10-03 |
Family
ID=49258139
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2012/075924 WO2013143221A1 (zh) | 2012-03-29 | 2012-05-23 | 信号编码和解码的方法和设备 |
Country Status (15)
Country | Link |
---|---|
US (4) | US9537694B2 (zh) |
EP (3) | EP2809009B1 (zh) |
JP (2) | JP6006400B2 (zh) |
KR (1) | KR101621641B1 (zh) |
CN (3) | CN103368682B (zh) |
BR (1) | BR112014023577B8 (zh) |
CA (2) | CA2866202C (zh) |
ES (3) | ES2655832T3 (zh) |
MX (1) | MX339652B (zh) |
PL (1) | PL3664085T3 (zh) |
PT (1) | PT3249645T (zh) |
RU (1) | RU2592412C2 (zh) |
SG (2) | SG11201405216SA (zh) |
WO (1) | WO2013143221A1 (zh) |
ZA (1) | ZA201406424B (zh) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10923204B2 (en) | 2010-08-20 | 2021-02-16 | Attopsemi Technology Co., Ltd | Fully testible OTP memory |
US10916317B2 (en) | 2010-08-20 | 2021-02-09 | Attopsemi Technology Co., Ltd | Programmable resistance memory on thin film transistor technology |
US9818478B2 (en) | 2012-12-07 | 2017-11-14 | Attopsemi Technology Co., Ltd | Programmable resistive device and memory using diode as selector |
US10586832B2 (en) | 2011-02-14 | 2020-03-10 | Attopsemi Technology Co., Ltd | One-time programmable devices using gate-all-around structures |
CN103368682B (zh) | 2012-03-29 | 2016-12-07 | 华为技术有限公司 | 信号编码和解码的方法和设备 |
CN105374363B (zh) * | 2014-08-25 | 2019-06-04 | 广东美的集团芜湖制冷设备有限公司 | 音频信号编码方法和系统 |
US11062786B2 (en) | 2017-04-14 | 2021-07-13 | Attopsemi Technology Co., Ltd | One-time programmable memories with low power read operation and novel sensing scheme |
US10726914B2 (en) | 2017-04-14 | 2020-07-28 | Attopsemi Technology Co. Ltd | Programmable resistive memories with low power read operation and novel sensing scheme |
US11615859B2 (en) | 2017-04-14 | 2023-03-28 | Attopsemi Technology Co., Ltd | One-time programmable memories with ultra-low power read operation and novel sensing scheme |
US10535413B2 (en) | 2017-04-14 | 2020-01-14 | Attopsemi Technology Co., Ltd | Low power read operation for programmable resistive memories |
JP6934648B2 (ja) * | 2017-07-03 | 2021-09-15 | 東日本旅客鉄道株式会社 | トロリ線曲げ工具 |
RU2744485C1 (ru) * | 2017-10-27 | 2021-03-10 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Ослабление шума в декодере |
US10770160B2 (en) | 2017-11-30 | 2020-09-08 | Attopsemi Technology Co., Ltd | Programmable resistive memory formed by bit slices from a standard cell library |
CN118053437A (zh) * | 2022-11-17 | 2024-05-17 | 抖音视界有限公司 | 音频编码方法、解码方法、装置、设备及存储介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101055720A (zh) * | 2005-12-07 | 2007-10-17 | 三星电子株式会社 | 对音频信号编码和解码的方法和设备 |
CN101494054A (zh) * | 2009-02-09 | 2009-07-29 | 深圳华为通信技术有限公司 | 一种音频码率控制方法及系统 |
CN101523485A (zh) * | 2006-10-02 | 2009-09-02 | 卡西欧计算机株式会社 | 音频编码装置、音频解码装置、音频编码方法、音频解码方法和信息记录介质 |
Family Cites Families (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5517511A (en) * | 1992-11-30 | 1996-05-14 | Digital Voice Systems, Inc. | Digital transmission of acoustic signals over a noisy communication channel |
JP3131542B2 (ja) * | 1993-11-25 | 2001-02-05 | シャープ株式会社 | 符号化復号化装置 |
KR970011727B1 (en) | 1994-11-09 | 1997-07-14 | Daewoo Electronics Co Ltd | Apparatus for encoding of the audio signal |
JP3521596B2 (ja) | 1996-01-30 | 2004-04-19 | ソニー株式会社 | 信号符号化方法 |
JP3519859B2 (ja) * | 1996-03-26 | 2004-04-19 | 三菱電機株式会社 | 符号器及び復号器 |
FI970553A (fi) * | 1997-02-07 | 1998-08-08 | Nokia Mobile Phones Ltd | Audiokoodausmenetelmä ja -laite |
US6356211B1 (en) | 1997-05-13 | 2002-03-12 | Sony Corporation | Encoding method and apparatus and recording medium |
KR100335609B1 (ko) * | 1997-11-20 | 2002-10-04 | 삼성전자 주식회사 | 비트율조절이가능한오디오부호화/복호화방법및장치 |
KR100304092B1 (ko) * | 1998-03-11 | 2001-09-26 | 마츠시타 덴끼 산교 가부시키가이샤 | 오디오 신호 부호화 장치, 오디오 신호 복호화 장치 및 오디오 신호 부호화/복호화 장치 |
US6226616B1 (en) | 1999-06-21 | 2001-05-01 | Digital Theater Systems, Inc. | Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility |
US6621935B1 (en) * | 1999-12-03 | 2003-09-16 | Microsoft Corporation | System and method for robust image representation over error-prone channels |
JP2001255882A (ja) | 2000-03-09 | 2001-09-21 | Sony Corp | 音声信号処理装置及びその信号処理方法 |
SE0001926D0 (sv) * | 2000-05-23 | 2000-05-23 | Lars Liljeryd | Improved spectral translation/folding in the subband domain |
ATE320651T1 (de) | 2001-05-08 | 2006-04-15 | Koninkl Philips Electronics Nv | Kodieren eines audiosignals |
US7333929B1 (en) * | 2001-09-13 | 2008-02-19 | Chmounk Dmitri V | Modular scalable compressed audio data stream |
CN1127054C (zh) * | 2001-11-02 | 2003-11-05 | 北京阜国数字技术有限公司 | 用于知觉音频编码的信号处理方法 |
US20030187663A1 (en) * | 2002-03-28 | 2003-10-02 | Truman Michael Mead | Broadband frequency translation for high frequency regeneration |
DE10328777A1 (de) * | 2003-06-25 | 2005-01-27 | Coding Technologies Ab | Vorrichtung und Verfahren zum Codieren eines Audiosignals und Vorrichtung und Verfahren zum Decodieren eines codierten Audiosignals |
US7349842B2 (en) * | 2003-09-29 | 2008-03-25 | Sony Corporation | Rate-distortion control scheme in audio encoding |
US7672838B1 (en) * | 2003-12-01 | 2010-03-02 | The Trustees Of Columbia University In The City Of New York | Systems and methods for speech recognition using frequency domain linear prediction polynomials to form temporal and spectral envelopes from frequency domain representations of signals |
US7586924B2 (en) * | 2004-02-27 | 2009-09-08 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for coding an information signal into a data stream, converting the data stream and decoding the data stream |
EP1873753A1 (en) * | 2004-04-01 | 2008-01-02 | Beijing Media Works Co., Ltd | Enhanced audio encoding/decoding device and method |
KR100723400B1 (ko) | 2004-05-12 | 2007-05-30 | 삼성전자주식회사 | 복수의 룩업테이블을 이용한 디지털 신호 부호화 방법 및장치 |
JP4809234B2 (ja) * | 2004-09-17 | 2011-11-09 | パナソニック株式会社 | オーディオ符号化装置、復号化装置、方法、及びプログラム |
KR20070084002A (ko) | 2004-11-05 | 2007-08-24 | 마츠시타 덴끼 산교 가부시키가이샤 | 스케일러블 복호화 장치 및 스케일러블 부호화 장치 |
PL1866912T3 (pl) * | 2005-03-30 | 2011-03-31 | Koninl Philips Electronics Nv | Kodowanie wielokanałowego sygnału audio |
EP1949369B1 (en) * | 2005-10-12 | 2012-09-26 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding audio data and extension data |
JP2007149151A (ja) * | 2005-11-24 | 2007-06-14 | Funai Electric Co Ltd | 光ディスク再生装置、音声信号出力装置及びavシステム |
BRPI0707135A2 (pt) | 2006-01-18 | 2011-04-19 | Lg Electronics Inc. | aparelho e método para codificação e decodificação de sinal |
JP2007264154A (ja) * | 2006-03-28 | 2007-10-11 | Sony Corp | オーディオ信号符号化方法、オーディオ信号符号化方法のプログラム、オーディオ信号符号化方法のプログラムを記録した記録媒体及びオーディオ信号符号化装置 |
ES2312142T3 (es) * | 2006-04-24 | 2009-02-16 | Nero Ag | Aparato avanzado para codificar datos de audio digitales. |
KR20070115637A (ko) * | 2006-06-03 | 2007-12-06 | 삼성전자주식회사 | 대역폭 확장 부호화 및 복호화 방법 및 장치 |
KR101565919B1 (ko) * | 2006-11-17 | 2015-11-05 | 삼성전자주식회사 | 고주파수 신호 부호화 및 복호화 방법 및 장치 |
RU2406165C2 (ru) * | 2007-02-14 | 2010-12-10 | ЭлДжи ЭЛЕКТРОНИКС ИНК. | Способы и устройства для кодирования и декодирования объектно-базированных аудиосигналов |
EP2571024B1 (en) * | 2007-08-27 | 2014-10-22 | Telefonaktiebolaget L M Ericsson AB (Publ) | Adaptive transition frequency between noise fill and bandwidth extension |
KR100970446B1 (ko) | 2007-11-21 | 2010-07-16 | 한국전자통신연구원 | 주파수 확장을 위한 가변 잡음레벨 결정 장치 및 그 방법 |
CN101436407B (zh) | 2008-12-22 | 2011-08-24 | 西安电子科技大学 | 音频编解码方法 |
UA99878C2 (ru) * | 2009-01-16 | 2012-10-10 | Долби Интернешнл Аб | Гармоническое преобразование, усовершенствованное перекрестным произведением |
EP2273493B1 (en) * | 2009-06-29 | 2012-12-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Bandwidth extension encoding and decoding |
CN101958119B (zh) * | 2009-07-16 | 2012-02-29 | 中兴通讯股份有限公司 | 一种改进的离散余弦变换域音频丢帧补偿器和补偿方法 |
CA2778368C (en) | 2009-10-20 | 2016-01-26 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for encoding an audio information, method for decoding an audio information and computer program using an iterative interval size reduction |
CN102081927B (zh) * | 2009-11-27 | 2012-07-18 | 中兴通讯股份有限公司 | 一种可分层音频编码、解码方法及系统 |
MY153845A (en) * | 2010-01-12 | 2015-03-31 | Fraunhofer Ges Forschung | Audio encoder, audio decoder, method for encoding and audio information, method for decoding an audio information and computer program using a hash table describing both significant state values and interval boundaries |
US8751225B2 (en) * | 2010-05-12 | 2014-06-10 | Electronics And Telecommunications Research Institute | Apparatus and method for coding signal in a communication system |
US9047875B2 (en) * | 2010-07-19 | 2015-06-02 | Futurewei Technologies, Inc. | Spectrum flatness control for bandwidth extension |
CN102208188B (zh) | 2011-07-13 | 2013-04-17 | 华为技术有限公司 | 音频信号编解码方法和设备 |
CN103368682B (zh) | 2012-03-29 | 2016-12-07 | 华为技术有限公司 | 信号编码和解码的方法和设备 |
-
2012
- 2012-03-29 CN CN201210087702.9A patent/CN103368682B/zh active Active
- 2012-03-29 CN CN201910973689.9A patent/CN110706715B/zh active Active
- 2012-03-29 CN CN201610881546.1A patent/CN106409299B/zh active Active
- 2012-05-23 SG SG11201405216SA patent/SG11201405216SA/en unknown
- 2012-05-23 ES ES12873219.5T patent/ES2655832T3/es active Active
- 2012-05-23 EP EP12873219.5A patent/EP2809009B1/en active Active
- 2012-05-23 PT PT171609837T patent/PT3249645T/pt unknown
- 2012-05-23 CA CA2866202A patent/CA2866202C/en active Active
- 2012-05-23 MX MX2014011605A patent/MX339652B/es active IP Right Grant
- 2012-05-23 JP JP2015502053A patent/JP6006400B2/ja active Active
- 2012-05-23 WO PCT/CN2012/075924 patent/WO2013143221A1/zh active Application Filing
- 2012-05-23 ES ES19191869T patent/ES2927563T3/es active Active
- 2012-05-23 KR KR1020147026193A patent/KR101621641B1/ko active IP Right Grant
- 2012-05-23 PL PL19191869.7T patent/PL3664085T3/pl unknown
- 2012-05-23 BR BR112014023577A patent/BR112014023577B8/pt active Search and Examination
- 2012-05-23 SG SG10201701275XA patent/SG10201701275XA/en unknown
- 2012-05-23 EP EP17160983.7A patent/EP3249645B1/en active Active
- 2012-05-23 EP EP19191869.7A patent/EP3664085B1/en active Active
- 2012-05-23 ES ES17160983T patent/ES2770831T3/es active Active
- 2012-05-23 CA CA2994705A patent/CA2994705C/en active Active
- 2012-05-23 RU RU2014142255/08A patent/RU2592412C2/ru active
-
2014
- 2014-09-01 ZA ZA2014/06424A patent/ZA201406424B/en unknown
- 2014-09-25 US US14/496,986 patent/US9537694B2/en active Active
-
2016
- 2016-09-08 JP JP2016175647A patent/JP6323881B2/ja active Active
- 2016-11-22 US US15/358,649 patent/US9786293B2/en active Active
-
2017
- 2017-08-23 US US15/684,079 patent/US9899033B2/en active Active
-
2018
- 2018-01-08 US US15/864,147 patent/US10600430B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101055720A (zh) * | 2005-12-07 | 2007-10-17 | 三星电子株式会社 | 对音频信号编码和解码的方法和设备 |
CN101523485A (zh) * | 2006-10-02 | 2009-09-02 | 卡西欧计算机株式会社 | 音频编码装置、音频解码装置、音频编码方法、音频解码方法和信息记录介质 |
CN101494054A (zh) * | 2009-02-09 | 2009-07-29 | 深圳华为通信技术有限公司 | 一种音频码率控制方法及系统 |
Non-Patent Citations (1)
Title |
---|
See also references of EP2809009A4 * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10600430B2 (en) | Signal decoding method, audio signal decoder and non-transitory computer-readable medium | |
JP7010885B2 (ja) | 音声または音響符号化装置、音声または音響復号装置、音声または音響符号化方法及び音声または音響復号方法 | |
JP6574820B2 (ja) | 高周波帯域信号を予測するための方法、符号化デバイス、および復号デバイス | |
CN105874534B (zh) | 编码装置、解码装置、编码方法、解码方法及程序 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12873219 Country of ref document: EP Kind code of ref document: A1 |
|
REEP | Request for entry into the european phase |
Ref document number: 2012873219 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012873219 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2866202 Country of ref document: CA |
|
ENP | Entry into the national phase |
Ref document number: 20147026193 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2015502053 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2014/011605 Country of ref document: MX |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112014023577 Country of ref document: BR |
|
WWE | Wipo information: entry into national phase |
Ref document number: IDP00201406385 Country of ref document: ID |
|
ENP | Entry into the national phase |
Ref document number: 2014142255 Country of ref document: RU Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 112014023577 Country of ref document: BR Kind code of ref document: A2 Effective date: 20140923 |