EP2808867A1 - Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur - Google Patents
Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur Download PDFInfo
- Publication number
- EP2808867A1 EP2808867A1 EP14175174.3A EP14175174A EP2808867A1 EP 2808867 A1 EP2808867 A1 EP 2808867A1 EP 14175174 A EP14175174 A EP 14175174A EP 2808867 A1 EP2808867 A1 EP 2808867A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- frame
- sub
- amplitude value
- time envelope
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 230000001052 transient effect Effects 0.000 title claims abstract description 265
- 238000000034 method Methods 0.000 title claims abstract description 66
- 238000012986 modification Methods 0.000 claims description 47
- 230000004048 modification Effects 0.000 claims description 47
- 238000005070 sampling Methods 0.000 claims description 27
- 238000004364 calculation method Methods 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims 1
- 230000000875 corresponding effect Effects 0.000 description 26
- 230000003247 decreasing effect Effects 0.000 description 26
- 230000000694 effects Effects 0.000 description 7
- 230000005284 excitation Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/03—Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Definitions
- the present invention relates to the field of communication technologies, and in particular, to a transient signal encoding method and device, decoding method and device, and processing system.
- bandwidth extension technology has developed significantly, and has found commercial applications in several fields, including acoustic enhancement of bass loudspeakers and high frequency enhancement of coded voice and audio.
- the encoding technology of low-frequency band information adopts existing encoding and decoding algorithms; and during the process of encoding and decoding high-frequency band information, a small number of bits are generally adopted to encode the high-frequency band information, and the high-frequency band information is recovered at a decoding end by using the correlation between the high-frequency and low-frequency bands.
- a transient signal has the following characteristics different from those of a non-transient signal: in the time domain, the signal energy of the transient signal has a large instant change; while in the frequency domain, the frequency spectrum of the transient signal is smooth.
- the time envelope of the transient signal is not modified, and due to the influence of the processing in the signal encoding process, such as process by frame by frame, time-frequency transform, and frequency envelope, the transient signal is likely to generate a pre-echo; therefore, the prior art has the disadvantage that the effect of the transient signal recovered at the decoding end is not satisfactory.
- the present invention is directed to a transient signal encoding method and device, decoding method and device, and processing system, which are configured to improve the quality of recovery of transient signals.
- the present invention provides a transient signal encoding method, where the method includes:
- the present invention provides a transient signal decoding method, where the method includes:
- the present invention provides a transient signal encoding device, where the device includes:
- the present invention provides a transient signal decoding device, where the device includes:
- the present invention provides a transient signal processing system, where the system includes:
- the present invention provides another transient signal processing system, where the system includes:
- the time envelope is modified according to characteristics of the transient signal, such that the difference between the amplitude value of the time envelope having the maximal amplitude value and the amplitude values of the time envelopes of the other sub-frames before the sub-frame corresponding to the time envelope having the maximal amplitude value is more distinct, thereby improving the effect of recovery of the transient signal.
- FIG. 1 is a flow chart of a transient signal encoding method according to a first embodiment of the present invention. As shown in FIG. 1 , the method includes the following steps.
- Step 11 a sub-frame where a time envelope having a maximal amplitude value (that is, a maximal time envelope) is located is obtained from time envelopes of all sub-frames of an input transient signal, in which the sub-frame is the reference sub-frame described in the embodiments of the present invention.
- the input signals may be classified, for example, the input signals may be classified into transient signals and non-transient signal, so as to adopt different encoding technologies for different types of signals.
- This embodiment mainly relates to processing of the transient signals.
- a method for obtaining a time envelope includes: dividing an input signal into one or more sub-frames; obtaining energy information of each sub-frame, for example, the energy of each sub-frame and the square root of energy information of each sub-frame, to obtain the energy information; and schematically expressing waveform characteristics or amplitude trends of the input time-domain signal by using the obtained energy information.
- the time envelope may be modified according to the characteristics of the transient signal, such that in the modified time envelope, the difference between the amplitude values of time envelopes of the sub-frames included in the transient signal is more distinct, which is specifically represented in that the difference between the amplitude value of the time envelope having the maximal amplitude value and the amplitude values of other time envelopes is more distinct, so as to highlight the characteristics of the transient signal.
- Step 13 an amplitude value of the time envelope of each sub-frame before the reference sub-frame is adjusted in such a way that a first difference is greater than a preset first threshold, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope.
- the first threshold may be determined by the following method: decreasing the amplitude value of the time envelope of each sub-frame before the reference sub-frame to 1/8 to 1/2 of the original amplitude value, obtaining a difference between the adjusted amplitude values of the time envelopes of the sub-frames and the amplitude value of the time envelope of the reference sub-frame, and using the difference as the first threshold.
- Step 15 the adjusted time envelope is written into bitstream.
- the adjustment of the time envelope may further include: calculating an average amplitude value of the time envelopes of each sub-frame after the reference sub-frame; and adjusting the amplitude value of the time envelope of each sub-frame after the reference sub-frame in such a way that a second difference is greater than a preset second threshold when the average amplitude value is lower than or equal to a preset reference value, in which the second difference is a difference between the amplitude value of the time envelope of each sub-frame after the reference sub-frame and the amplitude value of the maximal time envelope.
- the preset reference value may be selected to be 1/3 to 3/5 of the amplitude value of the time envelope of the reference sub-frame; and the second threshold may be determined by the following method: decreasing the amplitude value of the time envelope of each sub-frame after the reference sub-frame to 1/8 to 1/2 of the original amplitude value, obtaining a difference between the adjusted amplitude values of the time envelopes of the sub-frames and the amplitude value of the time envelope of the reference sub-frame, and using the difference as the second threshold.
- the adjustment of the time-domain signal in the technical solution may further include:
- the third threshold may be selected from the range satisfying the following condition: the average energy of the adjusted time envelope of each sub-frame of the transient signal is equivalent to the average energy of the adjusted time envelope of each sub-frame, for example, the former is 0.8 to 1.2 times the latter.
- the time envelope corresponding to the transient signal needs to be encoded more finely.
- the time envelope of the transient signal can be modified according to the characteristics of the transient signal distinguished from the non-transient signal, such that the difference between the amplitude values of the time envelopes of the sub-frames included by the transient signal is more distinct, thereby improving the quality of the transient signal recovered at the decoding end.
- the time envelope of the transient signal is modified according to the characteristics of the transient signal, the difference between the amplitude values of the time envelopes of the sub-frames of the transient signal is enlarged, and the modified time envelope information is sent to the decoding end; and therefore, the position information of the transient signal is encoded and the encoded position information is sent to the decoding end without consuming any number of bits, that is, the technical effect of improving the quality of the transient signal recovered at the decoding end can be realized without increasing the number of bits required by the encoding end.
- FIG. 2 is a flow chart of a transient signal encoding method according to a second embodiment of the present invention. As shown in FIG. 2 , the method includes the following steps.
- Step 21 an input signal is decomposed into a low-frequency band signal and a high-frequency band signal; and as for the low-frequency band signal, Step 23 is performed, and as for the high-frequency band signal, Step 25 is performed.
- Step 23 parameters of the low-frequency band signal in the input signal are input into a bitstream; and Step 217 is performed.
- the parameters of the low-frequency band signal are input into the bitstream through an encoder.
- Step 25 a signal type of the input signal (the high-frequency signal) is determined, and signal type information is input into the bitstream, in which the signal type information is configured to indicate whether the input signal (that is, the signal being currently encoded) is a transient signal or a non-transient signal.
- Step 25 may include Steps 2501 to 2509 (not shown).
- Step 2501 a long frame is formed with a preset number of consecutive frames in the high-frequency band signal, and an average energy of the long frame is calculated.
- gain is the average energy of the long frame
- x [ i ] is a signal value of an ith sampling point of the time-domain signal
- N is the total number of sampling points of the whole long frame.
- Step 2503 the long frame is divided into several sub-frames, and an average energy of each sub-frame is calculated.
- each frame has a frame length of 5 ms, then the frame length of a long frame is 15 ms; the frame length of a long frame includes 480 sampling points, and if a long frame is divided into 12 sub-frames, the frame length of each sub-frame is 40 sampling points.
- An average energy sub _ gain [ i ] of each sub-frame is calculated.
- Step 2505 a third difference and a fourth difference are calculated respectively, in which the third difference is a maximal difference between the average energy of each sub-frame and the average energy of the long frame, and the third difference is calculated according to Formula (2); and the fourth difference is a maximal difference between average energies of two consecutive sub-frames, and the fourth difference is calculated according to Formula (3).
- max_ deviation max sub_gain i , gain
- Sub_gain [ i ] represents the average energy of each sub-frame
- gain represents the average energy of the long frame
- max_deviation represents a maximal difference between the average energy of each sub-frame and the average energy of the long frame, that is, the third difference in the embodiments of the present invention.
- max_ rise max ⁇ sub_gain i , sub_gain ⁇ i + 1
- sub_gain [ i ] and sub - gain [ i+1 ] represent the average energies of two consecutive sub-frames respectively
- max_rise represents a maximal difference between the average energies of two consecutive sub-frames in a long frame, that is, the fourth difference in the embodiments of the present invention.
- Step 2507 the average energy of the long frame is compared with a fourth threshold, the third difference is compared with a fifth threshold, and the fourth difference is compared with a sixth threshold, and if the average energy of the long frame is greater than the fourth threshold, the third difference is greater than the fifth threshold, and the fourth difference is greater than the sixth threshold (that is, Formula (4) is satisfied), it is determined that the high-frequency band signal is a transient signal; otherwise, it is determined that the high-frequency band signal is a non-transient signal.
- ⁇ 1 represents the fourth threshold
- ⁇ 2 represents the fifth threshold
- ⁇ 3 represents the sixth threshold.
- the values of ⁇ 1, ⁇ 2, and ⁇ 3 are correlated to the amplitude of the input transient signal, and when the overall amplitude of the transient signal is large, the values of ⁇ 1, ⁇ 2, and ⁇ 3 are large; and when the overall amplitude of the transient signal is small, the values of ⁇ 1, ⁇ 2, and ⁇ 3 are small.
- the values of ⁇ 1, ⁇ 2, and ⁇ 3 are in the ranges of 5 ⁇ 1 ⁇ 10, 2 ⁇ 2 ⁇ 5 1 ⁇ 3 ⁇ 3
- Step 2509 the obtained category information is input into a bitstream, and the category information includes transient signal information and non-transient signal information; and Step 217 is performed.
- Step 27 is performed; and as for a non-transient signal, the time envelope and the frequency-domain envelope of the non-transient signal can be obtained by using a method in the prior art, which will not be repeated herein.
- the method for classifying the input signal may be used in combination with the modification of the time envelope according to the present invention; moreover, when the time envelope of each sub-frame of the transient signal is not modified, the method for classifying the input signal may be used in combination with the method for encoding the transient signal in the prior art, and at this time, the accuracy of the identification of the transient signal can also be improved, thereby improving the effect of recovery of the transient signal at the decoding end.
- Step 26 the time envelope of each sub-frame of the input signal is calculated respectively, and if the signal type of the input signal is a transient signal, Step 27 is performed; and if the signal type of the input signal is a non-transient signal, Step 29 is performed.
- Step 27 the time envelope of the transient signal is modified.
- Step 27 may include Step 2701 to Step 2719.
- FIG. 3 is a block diagram of an embodiment of an encoding end modifying a time envelope of a transient signal according to the second embodiment of the present invention. As shown in FIG. 3 , the modification performed on the time envelope of the transient signal includes Step 2701 to Step 2719.
- Step 2701 the time envelope of each sub-frame of the transient signal is calculated, so as to obtain the time envelope tEhv [ i ] of each sub-frame.
- Step 2703 by searching in the time envelopes of the sub-frames obtained in Step 2701, a sub-frame where the maximal time envelope is located and position information corresponding to the sub-frame are obtained, in which the sub-frame is the reference sub-frame in the embodiments of the present invention, and for the convenience of illustration, the position information of the reference sub-frame is represented as pos in the following.
- Step 2705 the position information (i) of the current sub-frame is compared with the position information (pos) of the reference sub-frame, and if the current sub-frame is before the reference sub-frame (that is, i ⁇ pos), Step 2707 is performed; otherwise, Step 2709 is performed.
- Step 2707 modification of decreasing the amplitude value is performed on the time envelope of the current sub-frame, so as to obtain a first modified envelope
- Step 2719 is performed.
- the proportion by which the amplitude value is decreased may be determined according to the difference between the amplitude values of the time envelopes corresponding to the sub-frames and the amplitude value of the time envelope corresponding to the reference sub-frame, and if the difference is large, a small proportion by which the amplitude value is decreased may be selected; otherwise, a large proportion by which the amplitude value is decreased may be selected.
- Step 2711 the average value avrg pos + 1 N of the time envelope of each sub-frame after the reference sub-frame is compared with a preset reference value, in which the preset reference value in this embodiment is 1/2 of the time envelope corresponding to the reference sub-frame, that is, 1 2 ⁇ tEnv pos , and if avrg pos + 1 N ⁇ 1 2 ⁇ tEnv pos , Step 2713 is performed; otherwise, the time envelope of the current sub-frame is not modified, and Step 2719 is performed.
- the sub-frames may be modified. If the difference between the average value of the time envelope of each sub-frame after the reference sub-frame and the preset reference value is small, it indicates that the reference sub-frame corresponding to the maximal time envelope of the original signal is not abruptly changed with respect to the sub-frame thereafter, and at this time, the sub-frames may not be modified.
- the preset reference value is 1/3 to 3/5 of the maximal time envelope of the transient signal.
- Step 2713 the position information of the current sub-frame is compared with the position information of the reference sub-frame, so as to determine whether the current sub-frame is the reference sub-frame, and if yes, Step 2715 is performed; otherwise, Step 2717 is performed.
- Step 2715 modification of increasing the amplitude value is performed on the time envelope corresponding to the reference sub-frame, so as to obtain a second modified envelope; and Step 2719 is performed.
- Step 2717 modification of decreasing the amplitude value is performed on the time envelope of the current sub-frame, so as to obtain a third modified envelope, and Step 2719 is performed.
- the proportion by which the amplitude value is decreased may be determined according to the difference between the amplitude values of the time envelopes corresponding to the sub-frames and the amplitude value of the time envelope corresponding to the reference sub-frame, and if the difference is large, a small proportion by which the amplitude value is decreased may be selected; otherwise, a large proportion by which the amplitude value is decreased may be selected.
- Step 2719 the first modified envelope obtained in Step 2707, the second modified envelope obtained in Step 2715, and the third modified envelope obtained in Step 2717 are combined, to obtain the modified time envelope of the transient signal.
- Step 2701 to Step 2719 the modification of the time envelope of the transient signal is completed, and the modified time envelope of the transient signal is obtained.
- Step 211 time-frequency transform is performed on the high-frequency band signal in the input signal, so as to obtain a frequency-domain signal of the high-frequency band signal.
- the time-domain signal corresponding to the transient signal is transformed to the frequency domain through a transform method such as fast Fourier transform (FFT) and modified discrete cosine transform (MDCT), so as to obtain the frequency-domain signal corresponding to the transient signal in the frequency domain.
- FFT fast Fourier transform
- MDCT modified discrete cosine transform
- Step 211 and Step 25 No limitation is imposed on the time sequence of Step 211 and Step 25.
- Step 213 the frequency-domain envelope of each sub-band of the frequency-domain signal is calculated, so as to obtain the frequency-domain envelope of the high-frequency band signal.
- the frequency-domain envelope in the embodiments of the present invention refers to: dividing the frequency-domain signal into one or more sub-bands, obtaining energy information of each sub-band or obtaining the square root of the energy information of each sub-band, and schematically expressing spectral waveform characteristics or amplitude trends of the frequency-domain signal by using the obtained energy information or the obtained square root of the energy information. Therefore, the frequency-domain signal is divided into one or more sub-bands, and the energy information of each sub-band or the square root of the energy information of each sub-band is obtained, and the frequency-domain envelope of each sub-band of the frequency-domain signal is obtained by using the obtained energy information or the obtained square root of the energy information.
- Step 215 the obtained frequency-domain envelope of the high-frequency band signal is quantified, and then is added in the bitstream; and Step 217 is performed.
- Step 217 the bitstream added with the parameters of the low-frequency band signal, the signal type information of the high-frequency band signal, the frequency-domain envelope and the modified time envelope are sent to the decoding end, in which the signal type information is configured to indicate whether the signal being currently encoded is a transient signal or a non-transient signal, such that the decoding end can determine the type of the decoded current signal according to the signal type information.
- identification of the transient signal is performed by combining information of several consecutive frames in the high-frequency band signal, and therefore, the accuracy of the identification of the transient signal is improved, and the transient signal can be separated from the input high-frequency band signal more accurately; moreover, in this embodiment, the time envelope corresponding to the separated transient signal is modified, such that the difference between the amplitude values of the time envelopes of the sub-frame of the transient signal is more distinct, thereby improving the quality of the transient signal recovered at the decoding end.
- FIG. 4 is a flow chart of a transient signal decoding method according to a third embodiment of the present invention. As shown in FIG. 4 , the method includes the following steps.
- Step 41 a sub-frame where a time envelope having a maximal amplitude value (that is, a maximal time envelope) is located is obtained from time envelopes of all sub-frames of a pre-obtained signal having a signal type of a transient signal, in which the sub-frame is the reference sub-frame described in the embodiments of the present invention.
- the modification of the time envelope of the transient signal may be performed at the encoding end or the decoding end.
- the time envelope is modified according to the characteristics of the transient signal at the decoding end, such that in the modified time envelope, the difference between the amplitude value of the time envelope having the maximal amplitude value of the sub-frames of the transient signal and the amplitude values of other time envelopes is more distinct, so as to highlight the characteristics of the transient signal.
- Step 43 an amplitude value of the time envelope of each sub-frame before the reference sub-frame is adjusted in such a way that a first difference is greater than a preset first threshold, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope.
- the first threshold may be determined by the following method: decreasing the amplitude value of the time envelope of each sub-frame before the reference sub-frame to 1/8 to 1/2 of the original amplitude value, obtaining a difference between the adjusted amplitude values of the time envelopes of the sub-frames and the amplitude value of the time envelope of the reference sub-frame, and using the difference as the first threshold.
- the adjustment of the time envelope may further include: calculating an average amplitude value of the time envelopes of each sub-frame after the reference sub-frame; and adjusting the amplitude value of the time envelope of each sub-frame after the reference sub-frame in such a way that a second difference is greater than a preset second threshold when the average amplitude value is lower than or equal to a preset reference value, in which the second difference is a difference between the amplitude value of the time envelope of each sub-frame after the reference sub-frame and the amplitude value of the maximal time envelope.
- the preset reference value may be selected to be 1/3 to 3/5 of the amplitude value of the time envelope of the reference sub-frame; and the second threshold may be determined by the following method: decreasing the amplitude value of the time envelope of each sub-frame after the reference sub-frame to 1/8 to 1/2 of the original amplitude value, obtaining a difference between the adjusted amplitude values of the time envelopes of the sub-frames and the amplitude value of the time envelope of the reference sub-frame, and using the difference as the second threshold.
- the adjustment of the time-domain signal in the technical solution may further include:
- the third threshold may be selected from the range satisfying the following condition: the average energy of the adjusted time envelope of each sub-frame of the transient signal is equivalent to the average energy of the adjusted time envelope of each sub-frame, for example, the former is 0.8 to 1.2 times the latter.
- Step 45 a pre-obtained time-domain signal is modified according to the adjusted time envelope, so as to obtain a recovered transient signal.
- the bitstream from the encoding end is decoded, to obtain the frequency-domain envelope of each sub-band of the signal having a signal type of a transient signal.
- a frequency-domain excitation signal is obtained from normalized low-frequency-band frequency-domain signals or random noises, a frequency-domain signal is generated according to the frequency-domain excitation signal and the frequency-domain envelope, and frequency-time transform is performed on the frequency-domain signal to obtain the time-domain signal. Then, the time-domain signal is modified according to the modified time envelope, such that the transient signal is recovered at the decoding end.
- the time envelope of the transient signal is modified at the decoding end, such that in the modified time envelope, the difference between the amplitude value of the time envelope having the maximal amplitude value and the amplitude values of other time envelopes is more distinct, so as to highlight the characteristics of the transient signal, thereby improving the quality of the transient signal recovered at the decoding end.
- FIG. 5 is a flow chart of a transient signal decoding method according to a fourth embodiment of the present invention. As shown in FIG. 5 , the method includes the following steps.
- Step 51 a bitstream from an encoding end is decoded, to obtain a time envelope and signal type information of a high-frequency band signal, and if the signal type is a transient signal, Step 52 is performed; and if the signal type is a non-transient signal, Step 518 is performed.
- Step 52 when the obtained signal type information indicates that the signal type is a transient signal, the time envelope is modified, so as to obtain a modified time envelope; and Step 518 is performed.
- Step 52 may include Step 5201-Step 5219.
- FIG. 6 is a block diagram of an embodiment of a decoding end modifying a time envelope of a transient signal according to the fourth embodiment of the present invention. As shown in FIG. 6 , when the current signal type is a transient signal, the modification performed on the time envelope by the decoding end includes Step 5201 to Step 5219.
- Step 5201 the bitstream from the encoding end is decoded, to obtain a time envelope of each sub-frame of the high-frequency band signal and signal type information. If the signal type information indicates that the type of the current signal in the bitstream is a transient signal, Step 5203 is performed, to modify the time envelope; and if the signal type information indicates that the type of the current signal in the bitstream is a non-transient signal, the signal is decoded by using a decoding method in the prior art to recover the non-transient signal, which will not be repeated herein.
- Step 5203 by searching in the time envelopes of the sub-frames obtained in Step 5201, a sub-frame where the maximal time envelope is located and position information corresponding to the sub-frame are obtained, in which the sub-frame is the reference sub-frame in the embodiments of the present invention, and for the convenience of illustration, the position information of the reference sub-frame is represented as pos in the following.
- Step 5205 the position information (i) of the current sub-frame is compared with the position information (pos) of the reference sub-frame, and if the current sub-frame is before the reference sub-frame (that is, i ⁇ pos), Step 5207 is performed; otherwise, Step 5209 is performed.
- Step 5207 modification of decreasing the amplitude value is performed on the time envelope of the current sub-frame, so as to obtain a first modified envelope, and Step 5219 is performed.
- Step 5211 the average value avrg pos + 1 N of the time envelope of each sub-frame after the reference sub-frame is compared with a preset reference value, in which the preset reference value in this embodiment is 1/4 of the time envelope corresponding to the reference sub-frame, that is, 1 2 ⁇ tEnv pos , and if avrg pos + 1 N ⁇ 3 5 ⁇ tEnv pos , Step 5213 is performed; otherwise, the time envelope of the current sub-frame is not modified, and Step 5219 is performed.
- Step 5213 the position information of the current sub-frame is compared with the position information of the reference sub-frame, so as to determine whether the current sub-frame is the reference sub-frame, and if yes, Step 5215 is performed; otherwise, Step 5217 is performed.
- Step 5215 modification of increasing the amplitude value is performed on the time envelope corresponding to the reference sub-frame, so as to obtain a second modified envelope; and Step 5219 is performed.
- Step 5217 modification of decreasing the amplitude value is performed on the time envelope of the current sub-frame, so as to obtain a third modified envelope, and Step 5219 is performed.
- Step 5219 the first modified envelope obtained in Step 5207, the second modified envelope obtained in Step 5215, the third modified envelope obtained in Step 5217, and the time envelope that does not meet the modification conditions in Step 5211 and is not subjected to time-domain modification are combined, to obtain the modified time envelope of the transient signal.
- Step 5201 to Step 5219 the modification of the time envelope of the transient signal is completed, and the modified time envelope of the transient signal is obtained.
- Step 53 the bitstream from the encoding end is decoded, to obtain the low-frequency band signal; and Step 519 is performed.
- the low-frequency band signal in the bitstream is decoded by a decoder.
- Step 51 and Step 53 No limitation is imposed on the time sequence of Step 51 and Step 53.
- Step 55 a frequency-domain excitation signal of the high-frequency band signal is generated.
- the frequency-domain excitation signal of the high-frequency band signal is obtained from normalized low-frequency-band frequency-domain signals or random noises.
- Step 57 the bitstream from the encoding end is decoded, to obtain the frequency-domain envelope of each sub-band of the high-frequency band signal.
- Step 55 No limitation is imposed on the time sequence of Step 55 and Step 57.
- Step 59 the frequency-domain excitation signal is modified by using the frequency-domain envelope of each sub-band of the high-frequency band signal.
- the objective of the modification is to enable the energy of the recovered frequency spectrum to be equivalent to the energy of the real high-frequency band spectrum.
- exc [ i ] represents the frequency-domain excitation signal
- fEnv [ j ] represents the frequency-domain envelope
- spectrum [ i ] represents the high-frequency-band frequency-domain signal.
- Step 513 frequency-time transform is performed on the generated high-frequency-band frequency-domain signal.
- Step 515 the time-domain signal is generated. If the type of the high-frequency band signal is a transient signal, Step 516 is performed, and if the high-frequency band signal is a non-transient signal, Step 517 is performed.
- Step 516 the time-domain signal having a signal type of a transient signal is adjusted, to obtain the adjusted time-domain signal signal' [ i ] .
- a preset number of sampling points in the reference sub-frame are selected; and signal amplitude of each of the selected sampling points is adjusted in such a way that a fifth difference is greater than a seventh threshold, in which the fifth difference is a difference between the signal amplitude value of each of the selected sampling points and a maximal amplitude value of the reference sub-frame.
- the seventh threshold may be selected from the following range: decreasing the amplitudes of the selected sampling points to be 1/2 of the original amplitudes, and obtaining the differences between the adjusted amplitudes of the sampling points and the maximal amplitude among the amplitudes of the sampling points included in the reference sub-frame.
- a preset number of sampling points included in the sub-frame where the time envelope having the maximal amplitude value is located are selected, and the signal amplitudes of the sampling points are decreased, so as to adjust the time-domain signal.
- the specific method for adjustment of the time-domain signal and the preset number of sampling points required to be adjusted are mainly dependent upon the characteristics of the original input signal.
- a preset number of sampling points included in the sub-frame where the time envelope having the maximal amplitude value is located are selected sequentially, for example, the sampling points in the first 1/4 sub-frame length included in the time-domain signal corresponding to the reference sub-frame where the time envelope having the maximal amplitude value is located are selected, and the amplitude values of the selected sampling points are divided by 2.
- bit positions can be used to carry the flag information to the decoding end, for example, when the encoding end has a bit for transmitting the flag information, the decoding end can determine whether to adjust the preset number of sampling points according to the flag bit; when the encoding end has multiple bit positions for carrying the flag information, the decoding end can determine which sampling points need to be adjusted according to the received flag bits; and when the encoding end has sufficient bit positions for carrying the flag information, the decoding end can determine whether each sampling point needs to be adjusted according to the received flag information.
- the method for adjusting the time-domain signal may be used in combination with the modification of the time envelope according to the present invention; moreover, when the time envelope of each sub-frame of the transient signal is not modified, the method for adjusting the time-domain signal may be used in combination with the method for encoding the transient signal in the prior art, and at this time, the characteristics of the transient signal can also be highlighted, thereby improving the effect of recovery of the transient signal.
- Step 517 the obtained time-domain signal signal' [ i ] is normalized.
- Step 518 by using the modified time envelope obtained in Step 52, the normalized time-domain signal having a signal type of a transient signal is modified, so as to obtain a recovered transient signal; and by using the time envelope signal having a signal type of non-transient signal obtained in Step 51, the corresponding time-domain signal is modified, so as to obtain a recovered non-transient signal.
- the normalized time-domain signal having a signal type of a transient signal may be modified according to Formula (6):
- signal i signal ⁇ i * tEnv j / tEnv j ⁇ ⁇
- signal' [ i ] represents the modified time-domain signal
- tEnv [ j ] represents the modified time envelope
- tEnv [ j ] ' represents the time envelope of the modified time-domain signal ( signal' [ i ])
- signal [ i ] represents the time-domain signal of the high-frequency band signal.
- Step 519 the recovered low-frequency band signal and high-frequency band signal are combined, to obtain the output wide-frequency band signal, in which the recovered high-frequency band signal includes the recovered transient signal and the recovered non-transient signal.
- Step 51 the time sequence of Step 51, Step 57, and Step 53.
- the time envelope corresponding to the transient signal in the high-frequency band signal obtained through decoding at the decoding end is modified, such that the difference between the amplitude values of the time envelopes of all sub-frames corresponding to the transient signal is more distinct, thereby improving the quality of the transient signal recovered at the decoding end; moreover, in this embodiment, before the time-domain signal is modified by using the time envelope, the amplitudes of the sampling points before the time-domain signal of the sub-frame having the maximal time envelope are decreased, so as to highlight the characteristics of the transient signal, thereby significantly improving the output effect of the transient signal in the output signal.
- FIG. 7 is a schematic structural view of a transient signal encoding device according to a fifth embodiment of the present invention.
- the transient signal encoding device of this embodiment includes: a reference sub-frame obtaining module 71, a first amplitude value adjusting module 72, and a bitstream writing module 73.
- the reference sub-frame obtaining module 71 is configured to obtain a reference sub-frame where a time envelope having a maximal amplitude value (that is, a maximal time envelope) is located from time envelopes of all sub-frames of an input transient signal.
- the first amplitude value adjusting module 72 is configured to adjust an amplitude value of the time envelope of each sub-frame before the reference sub-frame in such a way that a first difference is greater than a preset first threshold, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope.
- the first threshold may be determined by the following method: decreasing the amplitude value of the time envelope of each sub-frame before the reference sub-frame to 1/8 to 1/2 of the original amplitude value, obtaining a difference between the adjusted amplitude values of the time envelopes of the sub-frames and the amplitude value of the time envelope of the reference sub-frame, and using the difference as the first threshold.
- the bitstream writing module 73 is configured to write the adjusted time envelope into bitstream.
- the transient signal encoding device of this embodiment further includes: an average amplitude value calculation module 74, a second amplitude value adjusting module 75, and a third amplitude value adjusting module 76.
- the average amplitude value calculation module 74 is configured to calculate an average amplitude value of the time envelopes of each sub-frame after the reference sub-frame.
- the second amplitude value adjusting module 75 is configured to adjust the amplitude value of the time envelope of each sub-frame after the reference sub-frame in such a way that a second difference is greater than a preset second threshold when the average amplitude value is lower than or equal to a preset reference value, in which the second difference is a difference between the amplitude value of the time envelope of each sub-frame after the reference sub-frame and the amplitude value of the maximal time envelope.
- the preset reference value may be selected to be 1/3 to 3/5 of the amplitude value of the time envelope of the reference sub-frame; and the second threshold may be determined by the following method: decreasing the amplitude value of the time envelope of each sub-frame after the reference sub-frame to 1/8 to 1/2 of the original amplitude value, obtaining a difference between the adjusted amplitude values of the time envelopes of the sub-frames and the amplitude value of the time envelope of the reference sub-frame, and using the difference as the second threshold.
- the third amplitude value adjusting module 76 is configured to adjust an amplitude value of the time envelope of the reference sub-frame in such a way that an average energy of the adjusted time envelope of each sub-frame of the transient signal is greater than a preset third threshold, after the amplitude value of the time envelope of each sub-frame other than the reference sub-frame is adjusted.
- the third threshold may be selected from the range satisfying the following condition: the average energy of the adjusted time envelope of each sub-frame of the transient signal is equivalent to the average energy of the adjusted time envelope of each sub-frame, for example, the former is 0.8 to 1.2 times the latter.
- the first amplitude value adjusting module can modify the time envelope of the transient signal according to the characteristics of the transient signal, such that the difference between the amplitude values of the time envelopes of the sub-frames included by the transient signal is more distinct, thereby improving the quality of the transient signal recovered at the decoding end.
- FIG. 8 is a schematic structural view of a transient signal encoding device according to a sixth embodiment of the present invention. Different from the embodiment in FIG. 7 , the transient signal encoding device of this embodiment further includes a signal type determination module 77.
- the signal type determination module 77 is configured to determine a signal type of the input signal, and write signal type information in the encoding bitstream, in which the signal type includes a transient signal or a non-transient signal.
- the signal type determination module 77 may include a long frame average energy calculation unit 771, a sub-frame average energy calculation unit 772, a difference calculation unit 773, and a signal type determination unit 774.
- the long frame average energy calculation unit 771 is configured to form a long frame with a preset number of consecutive frames in the input signal and calculate an average energy of the long frame.
- the sub-frame average energy calculation unit 772 is configured to divide the long frame into multiple sub-frames and calculate an average energy of each sub-frame.
- the difference calculation unit 773 is configured to calculate a third difference and a fourth difference respectively, in which the third difference is a maximal difference between the average energy of each sub-frame and the average energy of the long frame, and the fourth difference is a maximal difference between average energies of two consecutive sub-frames.
- the signal type determination unit 774 is configured to determine that the input signal is a transient signal when the average energy of the long frame is greater than a fourth threshold, the third difference is greater than a fifth threshold, and the fourth difference is greater than a sixth threshold; otherwise, determine that the input signal is a non-transient signal.
- identification of the transient signal is performed by combining information of several consecutive frames in the high-frequency band signal, and therefore, the accuracy of the identification of the transient signal is improved, and the transient signal can be separated from the input high-frequency band signal more accurately; moreover, in this embodiment, the time envelope corresponding to the separated transient signal is modified, such that the difference between the amplitude values of the time envelopes of the sub-frame of the transient signal is more distinct, thereby improving the quality of the transient signal recovered at the decoding end.
- FIG. 9 is a schematic structural view of a transient signal decoding device according to a seventh embodiment of the present invention.
- the transient signal encoding device of this embodiment includes: a reference sub-frame obtaining module 91, a first amplitude value adjusting module 92, and a time-domain signal modification module 93.
- the reference sub-frame obtaining module 91 is configured to obtain a reference sub-frame where a time envelope having a maximal amplitude value (that is, a maximal time envelope) is located from time envelopes of all sub-frames of a pre-obtained signal having a signal type of a transient signal.
- the first amplitude value adjusting module 92 is configured to adjust an amplitude value of the time envelope of each sub-frame before the reference sub-frame in such a way that a first difference is greater than a preset first threshold, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope.
- the time-domain signal modification module 93 is configured to modify a pre-obtained time-domain signal according to the adjusted time envelope, so as to obtain a recovered transient signal.
- the time envelope of the transient signal is modified by the time envelope modification module at the decoding end, such that in the modified time envelope, the difference between the amplitude value of the time envelope having the maximal amplitude value and the amplitude values of other time envelopes is more distinct, so as to highlight the characteristics of the transient signal, thereby improving the quality of the transient signal recovered at the decoding end.
- FIG. 10 is a schematic structural view of a transient signal decoding device according to an eighth embodiment of the present invention. Different from the embodiment in FIG. 9 , the transient signal decoding device of this embodiment further includes: an average amplitude value calculation module 94, a second amplitude value adjusting module 95, and a third amplitude value adjusting module 96.
- the average amplitude value calculation module 94 is configured to calculate an average amplitude value of the time envelopes of each sub-frame after the reference sub-frame.
- the second amplitude value adjusting module 95 is configured to adjust the amplitude value of the time envelope of each sub-frame after the reference sub-frame in such a way that a second difference is greater than a preset second threshold when the average amplitude value is lower than or equal to a preset reference value, in which the second difference is a difference between the amplitude value of the time envelope of each sub-frame after the reference sub-frame and the amplitude value of the maximal time envelope.
- the third amplitude value adjusting module 96 is configured to adjust an amplitude value of the time envelope of the reference sub-frame in such a way that an average energy of the adjusted time envelope of each sub-frame of the transient signal is greater than a preset third threshold, after the amplitude value of the time envelope of each sub-frame other than the reference sub-frame is adjusted.
- the transient signal decoding device of this embodiment may further include a time-domain signal adjusting module 97.
- the time-domain signal adjusting module 97 is configured to select a preset number of sampling points in the reference sub-frame, and adjust signal amplitude of each of the selected sampling points in such a way that a fifth difference is greater than a seventh threshold, in which the fifth difference is a difference between the signal amplitude value of each of the selected sampling points and a maximal amplitude value of the reference sub-frame.
- the time envelope corresponding to the transient signal in the high-frequency band signal obtained through decoding at the decoding end is modified, such that the difference between the amplitude values of the time envelopes of all sub-frames corresponding to the transient signal is more distinct, thereby improving the quality of the transient signal recovered at the decoding end; moreover, in this embodiment, before the time-domain signal is modified by using the time envelope, the amplitudes of the sampling points before the time-domain signal of the sub-frame having the maximal time envelope are decreased, so as to highlight the characteristics of the transient signal, thereby significantly improving the output effect of the transient signal in the output signal.
- FIG. 11 is a schematic structural view of a transient signal processing system according to a ninth embodiment of the present invention.
- the transient signal processing system of the present invention includes a transient signal encoding device 111 and a transient signal decoding device 112.
- the modification of the time envelope of the transient signal may be performed at the encoding end.
- the transient signal encoding device 111 is configured to obtain a reference sub-frame where a time envelope having a maximal amplitude value (that is, a maximal time envelope) is located from time envelopes of all sub-frames of an input transient signal, adjust an amplitude value of the time envelope of each sub-frame before the reference sub-frame in such a way that a first difference is greater than a preset first threshold, and write the adjusted time envelope into bitstream, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope.
- a time envelope having a maximal amplitude value that is, a maximal time envelope
- the transient signal decoding device 112 is configured to modify a pre-obtained time-domain signal according to the time envelope in the received bitstream, so as to obtain a recovered transient signal.
- the time envelope of the transient signal is modified at the encoding end, and the difference between the amplitude value of the time envelope having the maximal amplitude value among the time envelopes of all sub-frames of the transient signal and the amplitude values of other time envelopes is enlarged, so as to highlight the characteristics of the transient signal, thereby improving the quality of the transient signal recovered at the decoding end.
- transient signal processing system of this embodiment as for the specific detailed structure of the transient signal encoding device 111, reference can be made to the description of the embodiments in FIGs. 7 and 8 , and as for the specific principle of the modification of the time envelope of the transient signal, reference can be made to the description of the embodiments in FIGs. 1 to 3 , which will not be repeated herein.
- the modification of the time envelope of the transient signal may be performed at the decoding end.
- the transient signal encoding device 111 is configured to write a time envelope of each sub-frame of a transient signal in a bitstream.
- the transient signal decoding device 112 is configured to obtain a reference sub-frame where a maximal time envelope having a maximal amplitude value is located from time envelopes of all sub-frames of a signal in the received bitstream, adjust an amplitude value of the time envelope of each sub-frame before the reference sub-frame in such a way that a first difference is greater than a preset first threshold, and modify a pre-obtained time-domain signal according to the adjusted time envelope to obtain a recovered transient signal, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope.
- the time envelope of the transient signal is modified at the decoding end, and the difference between the amplitude value of the time envelope having the maximal amplitude value among the time envelopes of all sub-frames of the transient signal and the amplitude values of other time envelopes is enlarged, so as to highlight the characteristics of the transient signal, thereby improving the quality of the transient signal recovered at the decoding end.
- transient signal processing system of this embodiment as for the specific detailed structure of the transient signal decoding device 112, reference can be made to the description of the embodiments in FIGs. 9 and 10 , and as for the specific principle of the modification of the time envelope of the transient signal, reference can be made to the description of the embodiments in FIGs. 4 to 6 , which will not be repeated herein.
- modules in a device according to an embodiment may be distributed in the device of the embodiment according to the description of the embodiment, or correspondingly disposed in one or more devices different from this embodiment.
- the modules of the above embodiment may be combined into one module, or further divided into multiple sub-modules.
- the program may be stored in a computer readable storage medium.
- the storage medium may be any medium that is capable of storing program codes, such as a ROM, a RAM, a magnetic disk, and an optical disk.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22203295.5A EP4191583A1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal transitoire, procédé et dispositif de décodage, et système de traitement |
EP21184343.8A EP3910630B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signaux vocal ou audio transitoires, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008102470097A CN101770776B (zh) | 2008-12-29 | 2008-12-29 | 瞬态信号的编码方法和装置、解码方法和装置及处理系统 |
EP09837373.1A EP2352145B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP09837373.1A Division-Into EP2352145B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
EP09837373.1A Division EP2352145B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP22203295.5A Division EP4191583A1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal transitoire, procédé et dispositif de décodage, et système de traitement |
EP21184343.8A Division EP3910630B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signaux vocal ou audio transitoires, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
Publications (1)
Publication Number | Publication Date |
---|---|
EP2808867A1 true EP2808867A1 (fr) | 2014-12-03 |
Family
ID=42316246
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP22203295.5A Pending EP4191583A1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal transitoire, procédé et dispositif de décodage, et système de traitement |
EP14175174.3A Ceased EP2808867A1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
EP21184343.8A Active EP3910630B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signaux vocal ou audio transitoires, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
EP09837373.1A Active EP2352145B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP22203295.5A Pending EP4191583A1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal transitoire, procédé et dispositif de décodage, et système de traitement |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP21184343.8A Active EP3910630B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signaux vocal ou audio transitoires, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
EP09837373.1A Active EP2352145B1 (fr) | 2008-12-29 | 2009-12-29 | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur |
Country Status (11)
Country | Link |
---|---|
US (1) | US8063809B2 (fr) |
EP (4) | EP4191583A1 (fr) |
JP (2) | JP5281169B2 (fr) |
KR (1) | KR101168645B1 (fr) |
CN (1) | CN101770776B (fr) |
ES (2) | ES2948521T3 (fr) |
FI (1) | FI3910630T3 (fr) |
HU (1) | HUE062878T2 (fr) |
PL (1) | PL3910630T3 (fr) |
PT (1) | PT3910630T (fr) |
WO (1) | WO2010078816A1 (fr) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101763856B (zh) * | 2008-12-23 | 2011-11-02 | 华为技术有限公司 | 信号分类处理方法、分类处理装置及编码系统 |
CN101770776B (zh) | 2008-12-29 | 2011-06-08 | 华为技术有限公司 | 瞬态信号的编码方法和装置、解码方法和装置及处理系统 |
JP5754899B2 (ja) | 2009-10-07 | 2015-07-29 | ソニー株式会社 | 復号装置および方法、並びにプログラム |
JP5850216B2 (ja) | 2010-04-13 | 2016-02-03 | ソニー株式会社 | 信号処理装置および方法、符号化装置および方法、復号装置および方法、並びにプログラム |
JP5609737B2 (ja) | 2010-04-13 | 2014-10-22 | ソニー株式会社 | 信号処理装置および方法、符号化装置および方法、復号装置および方法、並びにプログラム |
JP6075743B2 (ja) | 2010-08-03 | 2017-02-08 | ソニー株式会社 | 信号処理装置および方法、並びにプログラム |
US8762158B2 (en) * | 2010-08-06 | 2014-06-24 | Samsung Electronics Co., Ltd. | Decoding method and decoding apparatus therefor |
CN102436820B (zh) | 2010-09-29 | 2013-08-28 | 华为技术有限公司 | 高频带信号编码方法及装置、高频带信号解码方法及装置 |
JP5707842B2 (ja) | 2010-10-15 | 2015-04-30 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
CN102466764B (zh) * | 2010-11-03 | 2015-08-19 | 北京普源精电科技有限公司 | 一种频谱超限测量模板的生成方法和装置 |
DE102011011530B4 (de) * | 2011-02-17 | 2013-05-08 | Karlsruher Institut für Technologie | Verfahren zur Reduktion von Ultraschalldaten |
JP5807453B2 (ja) * | 2011-08-30 | 2015-11-10 | 富士通株式会社 | 符号化方法、符号化装置および符号化プログラム |
JP6200034B2 (ja) * | 2012-04-27 | 2017-09-20 | 株式会社Nttドコモ | 音声復号装置 |
EP2972503A4 (fr) * | 2013-03-14 | 2016-10-26 | Inova Ltd | Codeurs de source pouvant être configurés pour systèmes sismiques |
WO2014210284A1 (fr) | 2013-06-27 | 2014-12-31 | Dolby Laboratories Licensing Corporation | Syntaxe de flux binaire pour codage de voix spatial |
JP6242489B2 (ja) * | 2013-07-29 | 2017-12-06 | ドルビー ラボラトリーズ ライセンシング コーポレイション | 脱相関器における過渡信号についての時間的アーチファクトを軽減するシステムおよび方法 |
US9666202B2 (en) | 2013-09-10 | 2017-05-30 | Huawei Technologies Co., Ltd. | Adaptive bandwidth extension and apparatus for the same |
JP6531649B2 (ja) | 2013-09-19 | 2019-06-19 | ソニー株式会社 | 符号化装置および方法、復号化装置および方法、並びにプログラム |
CN105849801B (zh) | 2013-12-27 | 2020-02-14 | 索尼公司 | 解码设备和方法以及程序 |
CN106409303B (zh) * | 2014-04-29 | 2019-09-20 | 华为技术有限公司 | 处理信号的方法及设备 |
US9697843B2 (en) * | 2014-04-30 | 2017-07-04 | Qualcomm Incorporated | High band excitation signal generation |
WO2016093508A1 (fr) | 2014-12-08 | 2016-06-16 | 엘지전자 주식회사 | Procédé pour recevoir des informations de commande dans un système de communication sans fil, et appareil associé |
US9595269B2 (en) * | 2015-01-19 | 2017-03-14 | Qualcomm Incorporated | Scaling for gain shape circuitry |
BR112017018145B1 (pt) * | 2015-02-26 | 2023-11-28 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E. V | Aparelho e método para processamento de um sinal de áudio para obter um sinal de áudio processado utilizando um envelope de domínio de tempo alvo |
CN106126164B (zh) * | 2016-06-16 | 2019-05-17 | Oppo广东移动通信有限公司 | 一种音效处理方法及终端设备 |
US10381020B2 (en) * | 2017-06-16 | 2019-08-13 | Apple Inc. | Speech model-based neural network-assisted signal enhancement |
JP7426772B2 (ja) | 2018-07-25 | 2024-02-02 | 株式会社プロテリアル | 巻磁心の製造方法および巻磁心 |
CN112629637B (zh) * | 2020-11-27 | 2021-10-26 | 华南理工大学 | 一种高频底座力天平信号的时域校准方法 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040196913A1 (en) * | 2001-01-11 | 2004-10-07 | Chakravarthy K. P. P. Kalyan | Computationally efficient audio coder |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5199082A (en) * | 1989-09-11 | 1993-03-30 | U.S. Philips Corp. | Method of detecting an amplitude transient in a field of elements having a multivalent amplitude distribution, device suitable for performing the method, and video system including the device |
US5960390A (en) * | 1995-10-05 | 1999-09-28 | Sony Corporation | Coding method for using multi channel audio signals |
US5659622A (en) | 1995-11-13 | 1997-08-19 | Motorola, Inc. | Method and apparatus for suppressing noise in a communication system |
US5825320A (en) * | 1996-03-19 | 1998-10-20 | Sony Corporation | Gain control method for audio encoding device |
DE19647399C1 (de) * | 1996-11-15 | 1998-07-02 | Fraunhofer Ges Forschung | Gehörangepaßte Qualitätsbeurteilung von Audiotestsignalen |
US6314369B1 (en) | 1998-07-02 | 2001-11-06 | Kabushikikaisha Equos Research | Communications navigation system, and navigation base apparatus and navigation apparatus both used in the navigation system |
US6122610A (en) | 1998-09-23 | 2000-09-19 | Verance Corporation | Noise suppression for low bitrate speech coder |
US6314396B1 (en) * | 1998-11-06 | 2001-11-06 | International Business Machines Corporation | Automatic gain control in a speech recognition system |
CN1154975C (zh) * | 2000-03-15 | 2004-06-23 | 皇家菲利浦电子有限公司 | 用于声频编码的拉盖尔函数 |
MXPA03010237A (es) | 2001-05-10 | 2004-03-16 | Dolby Lab Licensing Corp | Mejoramiento del funcionamiento de transitorios en sistemas de codificacion de audio de baja tasa de transferencia de bitios mediante la reduccion del pre-ruido. |
CN1165036C (zh) * | 2001-11-02 | 2004-09-01 | 北京阜国数字技术有限公司 | 一种基于自适应阀值和典型样本预测的块长选择方法 |
JP2003216188A (ja) * | 2002-01-25 | 2003-07-30 | Matsushita Electric Ind Co Ltd | オーディオ信号符号化方法、符号化装置、及び記憶媒体 |
JP2003233395A (ja) * | 2002-02-07 | 2003-08-22 | Matsushita Electric Ind Co Ltd | オーディオ信号の符号化方法及び装置、並びに符号化及び復号化システム |
JP2003271191A (ja) | 2002-03-15 | 2003-09-25 | Toshiba Corp | 音声認識用雑音抑圧装置及び方法、音声認識装置及び方法並びにプログラム |
JP4083449B2 (ja) * | 2002-03-19 | 2008-04-30 | 日鉱金属株式会社 | CdTe単結晶の製造方法 |
SG108862A1 (en) | 2002-07-24 | 2005-02-28 | St Microelectronics Asia | Method and system for parametric characterization of transient audio signals |
JP4101123B2 (ja) * | 2003-06-19 | 2008-06-18 | シャープ株式会社 | 符号化装置及び符号化方法 |
US7672838B1 (en) * | 2003-12-01 | 2010-03-02 | The Trustees Of Columbia University In The City Of New York | Systems and methods for speech recognition using frequency domain linear prediction polynomials to form temporal and spectral envelopes from frequency domain representations of signals |
DE102004009954B4 (de) | 2004-03-01 | 2005-12-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Verarbeiten eines Multikanalsignals |
WO2006051451A1 (fr) | 2004-11-09 | 2006-05-18 | Koninklijke Philips Electronics N.V. | Codage et decodage audio |
US7627481B1 (en) * | 2005-04-19 | 2009-12-01 | Apple Inc. | Adapting masking thresholds for encoding a low frequency transient signal in audio data |
KR100803205B1 (ko) * | 2005-07-15 | 2008-02-14 | 삼성전자주식회사 | 저비트율 오디오 신호 부호화/복호화 방법 및 장치 |
JP2007079306A (ja) * | 2005-09-15 | 2007-03-29 | Victor Co Of Japan Ltd | 音声信号処理装置及び音声信号処理方法 |
US7546237B2 (en) | 2005-12-23 | 2009-06-09 | Qnx Software Systems (Wavemakers), Inc. | Bandwidth extension of narrowband speech |
ATE548728T1 (de) * | 2007-03-02 | 2012-03-15 | Ericsson Telefon Ab L M | Nichtkausales nachfilter |
CN101308655B (zh) | 2007-05-16 | 2011-07-06 | 展讯通信(上海)有限公司 | 一种音频编解码方法与装置 |
EP2104096B1 (fr) * | 2008-03-20 | 2020-05-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Appareil et procédé de conversion d'un signal audio en une représentation paramétrée, appareil et procédé de modification d'une représentation paramétrée, appareil et procédé de synthèse d'une représentation paramétrée d'un signal audio |
CN101770776B (zh) | 2008-12-29 | 2011-06-08 | 华为技术有限公司 | 瞬态信号的编码方法和装置、解码方法和装置及处理系统 |
-
2008
- 2008-12-29 CN CN2008102470097A patent/CN101770776B/zh not_active Ceased
-
2009
- 2009-12-29 EP EP22203295.5A patent/EP4191583A1/fr active Pending
- 2009-12-29 WO PCT/CN2009/076194 patent/WO2010078816A1/fr active Application Filing
- 2009-12-29 EP EP14175174.3A patent/EP2808867A1/fr not_active Ceased
- 2009-12-29 KR KR1020117011364A patent/KR101168645B1/ko active IP Right Grant
- 2009-12-29 ES ES21184343T patent/ES2948521T3/es active Active
- 2009-12-29 PT PT211843438T patent/PT3910630T/pt unknown
- 2009-12-29 JP JP2011539886A patent/JP5281169B2/ja active Active
- 2009-12-29 ES ES09837373.1T patent/ES2540075T3/es active Active
- 2009-12-29 HU HUE21184343A patent/HUE062878T2/hu unknown
- 2009-12-29 PL PL21184343.8T patent/PL3910630T3/pl unknown
- 2009-12-29 EP EP21184343.8A patent/EP3910630B1/fr active Active
- 2009-12-29 FI FIEP21184343.8T patent/FI3910630T3/fi active
- 2009-12-29 EP EP09837373.1A patent/EP2352145B1/fr active Active
-
2011
- 2011-06-29 US US13/172,652 patent/US8063809B2/en active Active
-
2013
- 2013-05-23 JP JP2013108638A patent/JP6110212B2/ja active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040196913A1 (en) * | 2001-01-11 | 2004-10-07 | Chakravarthy K. P. P. Kalyan | Computationally efficient audio coder |
Non-Patent Citations (3)
Title |
---|
BALÁZS KOEVESI ET AL: "Pre-Echo Reduction in the ITU-T G.729.1 Embedded Coder", 16TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 1 August 2008 (2008-08-01), pages 1 - 5, XP055022343, Retrieved from the Internet <URL:http://www.eurasip.org/Proceedings/Eusipco/Eusipco2008/papers/1569105409.pdf> [retrieved on 20120320] * |
HERRE J ET AL: "Enhancing the Performance of Perceptual Audio Coders by Using Temporal Noise Shaping (TNS)", PREPRINTS OF PAPERS PRESENTED AT THE AES CONVENTION, XX, XX, 8 November 1996 (1996-11-08), pages 1 - 24, XP002102636 * |
MARTIN LINK: "An Attack Processing of Audio Signals for Optimizing the Temporal Characteristics of a Low Bit-Rate Audio Coding System", 95TH AES CONVENTION, 1 October 1993 (1993-10-01), pages 1 - 12, XP055022383, Retrieved from the Internet <URL:http://www.aes.org/tmpFiles/elib/20120320/6536.pdf> [retrieved on 20120320] * |
Also Published As
Publication number | Publication date |
---|---|
ES2540075T3 (es) | 2015-07-08 |
US20110251846A1 (en) | 2011-10-13 |
US8063809B2 (en) | 2011-11-22 |
KR101168645B1 (ko) | 2012-07-25 |
HUE062878T2 (hu) | 2023-12-28 |
EP2352145A1 (fr) | 2011-08-03 |
EP3910630A1 (fr) | 2021-11-17 |
WO2010078816A1 (fr) | 2010-07-15 |
CN101770776B (zh) | 2011-06-08 |
EP2352145B1 (fr) | 2015-04-01 |
JP2012511184A (ja) | 2012-05-17 |
ES2948521T3 (es) | 2023-09-13 |
EP4191583A1 (fr) | 2023-06-07 |
JP5281169B2 (ja) | 2013-09-04 |
EP2352145A4 (fr) | 2012-05-02 |
PL3910630T3 (pl) | 2023-08-21 |
KR20110084962A (ko) | 2011-07-26 |
FI3910630T3 (fi) | 2023-07-18 |
EP3910630B1 (fr) | 2023-04-19 |
JP2013156667A (ja) | 2013-08-15 |
CN101770776A (zh) | 2010-07-07 |
PT3910630T (pt) | 2023-07-19 |
JP6110212B2 (ja) | 2017-04-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2808867A1 (fr) | Procédé et dispositif de codage de signal vocal transitoire, procédé et dispositif de décodage, système de traitement et support de stockage lisible par ordinateur | |
KR101378696B1 (ko) | 협대역 신호로부터의 상위대역 신호의 결정 | |
EP2737479B1 (fr) | Amélioration adaptative de l'intelligibilité vocale | |
CN107731237B (zh) | 时域帧错误隐藏设备 | |
JP2018045243A (ja) | 低レートcelpデコーダに関する非音声コンテンツの向上 | |
EP3136386B1 (fr) | Appareil et procédé pour générer un signal amélioré en fréquence à l'aide d'une mise en forme du signal d'amélioration | |
US20160104499A1 (en) | Signal processing device and signal processing method | |
WO2015027168A1 (fr) | Procédé et système d'amélioration de l'intelligibilité de la parole dans des environnements bruyants | |
CN107507610B (zh) | 一种基于元音基频信息的汉语声调识别方法 | |
WO2017193551A1 (fr) | Procédé de codage de signal multicanal, et codeur | |
US12009000B2 (en) | Apparatus and method for comfort noise generation mode selection | |
Deepa et al. | The Influence of Speech Enhancement Algorithm in Speech Compression with Voice Excited Linear Predictive Coding | |
Abid et al. | The effect chirp term in audio compression using a Gammachirp wavelet | |
Waheeduddin | A Novel Robust Mel-Energy Based Voice Activity Detector for Nonstationary Noise and Its Application for Speech Waveform Compression | |
Syed | A Novel Robust Mel-Energy Based Voice Activity Detector for Nonstationary Noise and Its Application for Speech Waveform Compression | |
Nelson et al. | Tuning Time-Frequency methods for the detection of metered HF speech | |
JP2006323265A (ja) | 明瞭度評価装置、明瞭度評価方法、及び明瞭度評価プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20140701 |
|
AC | Divisional application: reference to earlier application |
Ref document number: 2352145 Country of ref document: EP Kind code of ref document: P |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR |
|
R17P | Request for examination filed (corrected) |
Effective date: 20150603 |
|
RBV | Designated contracting states (corrected) |
Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR |
|
17Q | First examination report despatched |
Effective date: 20160822 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
APBK | Appeal reference recorded |
Free format text: ORIGINAL CODE: EPIDOSNREFNE |
|
APBN | Date of receipt of notice of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNNOA2E |
|
APBR | Date of receipt of statement of grounds of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNNOA3E |
|
APAF | Appeal reference modified |
Free format text: ORIGINAL CODE: EPIDOSCREFNE |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: CRYSTAL CLEAR CODEC, LLC |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: CRYSTAL CLEAR CODEC, LLC Owner name: CRYSTAL CLEAR CODEC SPOLKA Z O.O. |
|
APBT | Appeal procedure closed |
Free format text: ORIGINAL CODE: EPIDOSNNOA9E |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20221025 |