US20170053655A1 - Linear prediction coefficient conversion device and linear prediction coefficient conversion method - Google Patents
Linear prediction coefficient conversion device and linear prediction coefficient conversion method Download PDFInfo
- Publication number
- US20170053655A1 US20170053655A1 US15/306,292 US201515306292A US2017053655A1 US 20170053655 A1 US20170053655 A1 US 20170053655A1 US 201515306292 A US201515306292 A US 201515306292A US 2017053655 A1 US2017053655 A1 US 2017053655A1
- Authority
- US
- United States
- Prior art keywords
- linear prediction
- sampling frequency
- power spectrum
- coefficients
- prediction coefficients
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000006243 chemical reaction Methods 0.000 title claims abstract description 44
- 238000000034 method Methods 0.000 title claims description 42
- 238000005070 sampling Methods 0.000 claims abstract description 97
- 238000001228 spectrum Methods 0.000 claims abstract description 90
- 230000015572 biosynthetic process Effects 0.000 abstract description 42
- 238000003786 synthesis reaction Methods 0.000 abstract description 42
- 238000004364 calculation method Methods 0.000 description 32
- 238000006073 displacement reaction Methods 0.000 description 11
- 238000000354 decomposition reaction Methods 0.000 description 8
- 238000013213 extrapolation Methods 0.000 description 5
- XPYGGHVSFMUHLH-UUSULHAXSA-N falecalcitriol Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(O)(C(F)(F)F)C(F)(F)F)C)=C\C=C1\C[C@@H](O)C[C@H](O)C1=C XPYGGHVSFMUHLH-UUSULHAXSA-N 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 230000003595 spectral effect Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 101100350185 Caenorhabditis elegans odd-1 gene Proteins 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/13—Residual excited linear prediction [RELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
Definitions
- the present invention relates to a linear prediction coefficient conversion device and a linear prediction coefficient conversion method.
- An autoregressive all-pole model is a method that is often used for modeling of a short-term spectral envelope in speech and audio coding, where an input signal is acquired for a certain collective unit or a frame with a specified length, a parameter of the model is encoded and transmitted to a decoder together with another parameter as transmission information.
- the autoregressive all-pole model is generally estimated by linear prediction and represented as a linear prediction synthesis filter.
- ITU-T Recommendation G.718 One of the latest typical speech and audio coding techniques is ITU-T Recommendation G.718.
- the Recommendation describes a typical frame structure for coding using a linear prediction synthesis filter, and an estimation method, a coding method, an interpolation method, and a use method of a linear prediction synthesis filter in detail. Further, speech and audio coding on the basis of linear prediction is also described in detail in Patent Literature 2.
- FIG. 1 shows an example where the internal sampling frequency changes.
- the internal sampling frequency is 16,000 Hz in a frame i, and it is 12,800 Hz in the previous frame i ⁇ 1.
- the linear prediction synthesis filter that represents the characteristics of an input signal in the previous frame i ⁇ 1 needs to be estimated again after re-sampling the input signal at the changed internal sampling frequency of 16,000 Hz, or converted to the one corresponding to the changed internal sampling frequency of 16,000 Hz.
- the reason that the linear prediction synthesis filter needs to be calculated at a changed internal sampling frequency is to obtain the correct internal state of the linear prediction synthesis filter for the current input signal and to perform interpolation in order to obtain a model that is temporarily smoother.
- LSF coefficients are input as a parameter representing the linear prediction synthesis filter. It may be LSP coefficients, ISF coefficients, ISP coefficients or reflection coefficients, which are generally known as parameters equivalent to linear prediction coefficients.
- linear prediction coefficients are calculated in order to obtain a power spectrum Y( ⁇ ) of the linear prediction synthesis filter at the first internal sampling frequency ( 001 ). This step can be omitted when the linear prediction coefficients are known.
- the power spectrum Y( ⁇ ) of the linear prediction synthesis filter which is determined by the obtained linear prediction coefficients, is calculated ( 002 ).
- the obtained power spectrum is modified to a desired power spectrum Y′( ⁇ ) ( 003 ).
- Autocorrelation coefficients are calculated from the modified power spectrum ( 004 ).
- Linear prediction coefficients are calculated from the autocorrelation coefficients ( 005 ).
- the relationship between the autocorrelation coefficients and the linear prediction coefficients is known as the Yule-Walker equation, and the Levinson-Durbin algorithm is well known as a solution of that equation.
- This algorithm is effective in conversion of a sampling frequency of the above-described linear prediction synthesis filter. This is because, although a signal that is temporally ahead of a signal in a frame to be encoded, which is called a look-ahead signal, is generally used in linear prediction analysis, the look-ahead signal cannot be used when performing linear prediction analysis again in a decoder.
- Non Patent Literature 1 ITU-T Recommendation G.718
- Non Patent Literature 2 Speech coding and synthesis, W. B. Kleijn, K. K. Pariwal, et al. ELSEVIER.
- a linear prediction coefficient conversion device is a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, which includes a means for calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a means for calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum, and a means for converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
- this configuration it is possible to effectively reduce the amount of computation.
- the second sampling frequency is F2 (where F1 ⁇ F2).
- One aspect of the present invention can be described as an invention of a device as mentioned above and, in addition, may also be described as an invention of a method as follows. They fall under different categories but are substantially the same invention and achieve similar operation and effects.
- a linear prediction coefficient conversion method is a linear prediction coefficient conversion method performed by a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, the method including a step of calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a step of calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum and a step of converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
- FIG. 1 is a view showing the relationship between switching of an internal sampling frequency and a linear prediction synthesis filter.
- FIG. 2 is a view showing conversion of linear prediction coefficients.
- FIG. 3 is a flowchart of conversion 1.
- FIG. 4 is a flowchart of conversion 2.
- FIG. 5 is a block diagram of an embodiment of the present invention.
- FIG. 6 is a view showing the relationship between a unit circle and a cosine function.
- a response of an Nth order autoregressive linear prediction filter (which is referred to hereinafter as a linear prediction synthesis filter)
- Such generation of an autoregressive model using a known power spectrum can be used also for modification of a linear prediction synthesis filter 1/A(z) in the frequency domain. This is achieved by calculating the power spectrum of a known filter
- ⁇ indicates the (N ⁇ 2) number of frequencies placed at regular intervals at (0, ⁇ ), excluding 0 and ⁇ .
- LSF line spectral frequencies
- the representation by LSF is used in various speech and audio coding techniques for the feature quantity of a linear prediction synthesis filter, and the operation and coding of a linear prediction synthesis filter.
- the LSF uniquely characterizes the Nth order polynomial A(z) by the n number of parameters which are different from linear prediction coefficients.
- the LSF has characteristics such as it easily guarantee the stability of a linear prediction synthesis filter, it is intuitively interpreted in the frequency domain, it is less likely to be affected by quantization errors than other parameters such as linear prediction coefficients and reflection coefficients, it is suitable for interpolation and the like.
- LSF is defined as follows.
- LSF decomposition of the Nth order polynomial A(z) can be represented as follows by using displacement of an integer where ⁇ 0
- LSF of A(z) is a non-trivial root of the positive phase angle of P(z) and Q(z).
- the polynomial A(z) is the minimum phase, that is, when all roots of A(z) are inside the unit circle, the non-trivial roots of P(z) and Q(z) are arranged alternately on the unit circle.
- the number of complex roots of P(z) and Q(z) is m P and m Q , respectively.
- Table 1 shows the relationship of m P and m Q with the order n and displacement ⁇ .
- LSF low noise spectral frequency
- the representation using displacement can handle both of ISF and LSF in a unified way.
- a result obtained by LSF can be applied as it is to given ⁇ 0 or can be generalized.
- one more parameter is required to uniquely represent A(z), and the n-th reflection coefficient (which is referred to hereinafter as ⁇ n ) of A(z) is typically used. This parameter is introduced into LSF decomposition as the next factor.
- LSF of the polynomial A(z) is the roots of R( ⁇ ) and S( ⁇ ) at the angular frequency ⁇ (0, ⁇ ).
- Chebyshev polynomials of the first kind is defined as follows using a recurrence relation
- equation (16) is rewritten in the following form
- the coefficients r 0 and s 0 can be obtained by comparison of the equations (18) and (19) with (20) and (21) on the basis of m P and m Q .
- the coefficients of P(z) can be obtained from the equation (6). This example can be applied also to the polynomial of the equation (23) by using the same equation and using the coefficients of Q(z). Further, the same equation for calculating the coefficients of R(x) and S(x) can easily derive another order n and displacement ⁇ as well.
- One embodiment of the present invention provides an effective calculation method and device for, when converting a linear prediction synthesis filter calculated in advance by an encoder or a decoder at a first sampling frequency to the one at a second sampling frequency, calculating the power spectrum of the linear prediction synthesis filter and modifying it to the second sampling frequency, and then obtaining autocorrelation coefficients from the modified power spectrum.
- a calculation method for the power spectrum of a linear prediction synthesis filter according to one embodiment of the present invention is described hereinafter.
- the calculation of the power spectrum uses the LSF decomposition of the equation (6) and the properties of the polynomials P(z) and Q(z).
- the power spectrum can be converted to the real axis of the unit circle.
- One embodiment of the present invention uses the Chebyshev polynomials as a way to more effectively calculate the power spectrum
- (1) to (4) correspond to (1) to (4) in Table 1, respectively.
- the polynomials R(x) and S(x) may be calculated by the above-described Horner's method. Further, when x to calculate R(x) and S(x) is known, the calculation of a trigonometric function can be omitted by storing x in a memory.
- a ⁇ ( x i ) ⁇ 2 ⁇ 2 ⁇ ( 1 - x i ) ⁇ S 2 ⁇ ( x i ) , i ⁇ ⁇ even 2 ⁇ ( 1 + x i ) ⁇ R 2 ⁇ ( x i ) i ⁇ ⁇ odd
- R k ′ X ⁇ ( 1 ) + ( - 1 ) k ⁇ X ⁇ ( - 1 ) + ( 1 + ( - 1 ) k + 1 ) ⁇ ( - 1 ) ⁇ k / 2 ⁇ ⁇ X ⁇ ( 0 ) + 2 ⁇ ⁇ x ⁇ ⁇ ⁇ ( X ⁇ ( x ) + ( - 1 ) k ⁇ X ⁇ ( - x ) ) ⁇ T k ⁇ ( x ) ( 30 )
- N L 1+(12,800 Hz/16,000 Hz)(N ⁇ 1).
- N is the number of frequencies at a sampling frequency of 16,000 Hz.
- the conversion 1 that is performed in an encoder and a decoder under the above conditions is carried out in the following procedure.
- Step S 004 Derive linear prediction coefficients by the Levinson-Durbin method or a similar method with use of the autocorrelation coefficient obtained in Step S 003 , and obtain a linear prediction synthesis filter at the second sampling frequency (Step S 004 ).
- Step S 005 Convert the linear prediction coefficient obtained in Step S 004 to LSF (Step S 005 ).
- the conversion 2 that is performed in an encoder or a decoder can be achieved in the following procedure, in the same manner as the conversion 1.
- Step S 014 Derive linear prediction coefficients by the Levinson-Durbin method or a similar method with use of the autocorrelation coefficient obtained in Step S 013 , and obtain a linear prediction synthesis filter at the second sampling frequency (Step S 014 ).
- Step S 015 Convert the linear prediction coefficient obtained in Step S 014 to LSF (Step S 015 ).
- FIG. 5 is a block diagram in the example of the present invention.
- a real power spectrum conversion unit 100 is composed of a polynomial calculation unit 101 , a real power spectrum calculation unit 102 , and a real power spectrum extrapolation unit 103 , and further a real autocorrelation calculation unit 104 and a linear prediction coefficient calculation unit 105 are provided. This is to achieve the above-described conversions 1 and 2.
- the real power spectrum conversion unit 100 receives, as an input, LSF representing a linear prediction synthesis filter at the first sampling frequency, and outputs the power spectrum of a desired linear prediction synthesis filter at the second sampling frequency.
- the polynomial calculation unit 101 performs the processing in Steps S 001 , S 011 described above to calculate the polynomials R(x) and S(x) from LSF.
- the real power spectrum calculation unit 102 performs the processing in Steps S 002 or S 012 to calculate the power spectrum.
- the real power spectrum extrapolation unit 103 performs extrapolation of the spectrum, which is performed in Step S 012 in the case of the conversion 2.
- the power spectrum of a desired linear prediction synthesis filter is obtained at the second sampling frequency.
- the real autocorrelation calculation unit 104 performs the processing in Steps S 003 and S 013 to convert the power spectrum to autocorrelation coefficients.
- the linear prediction coefficient calculation unit 105 performs the processing in Steps S 004 and S 014 to obtain linear prediction coefficients from the autocorrelation coefficients. Note that, although this block diagram does not show the block corresponding to S 005 and S 015 , the conversion from the linear prediction coefficients to LSF or another equivalent coefficients can be easily achieved by a known technique.
- the coefficients of the polynomials R(x) and S(x) are calculated using the equations (20) and (21) in Steps S 001 and S 011 of the above-described example, the calculation may be performed using the coefficients of the polynomials of the equations (9) and (10), which can be obtained from the linear prediction coefficients. Further, the linear prediction coefficients may be converted from LSP coefficients or ISP coefficients.
- the power spectrum may be converted to that at the second sampling frequency, and Steps S 001 , S 002 , S 011 and S 012 may be omitted.
- a power spectrum may be deformed, and linear prediction coefficients at the second sampling frequency may be obtained.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Complex Calculations (AREA)
- Picture Signal Circuits (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The purpose of the present invention is to estimate, with a small amount of computation, a linear prediction synthesis filter after conversion of an internal sampling frequency. A linear prediction coefficient conversion device is a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, which includes a means for calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a means for calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum, and a means for converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
Description
- The present invention relates to a linear prediction coefficient conversion device and a linear prediction coefficient conversion method.
- An autoregressive all-pole model is a method that is often used for modeling of a short-term spectral envelope in speech and audio coding, where an input signal is acquired for a certain collective unit or a frame with a specified length, a parameter of the model is encoded and transmitted to a decoder together with another parameter as transmission information. The autoregressive all-pole model is generally estimated by linear prediction and represented as a linear prediction synthesis filter.
- One of the latest typical speech and audio coding techniques is ITU-T Recommendation G.718. The Recommendation describes a typical frame structure for coding using a linear prediction synthesis filter, and an estimation method, a coding method, an interpolation method, and a use method of a linear prediction synthesis filter in detail. Further, speech and audio coding on the basis of linear prediction is also described in detail in
Patent Literature 2. - In speech and audio coding that can handle various input/output sampling frequencies and operate at a wide range of bit rate, which vary from frame to frame, it is generally required to change the internal sampling frequency of an encoder. Because the same operation is required also in a decoder, decoding is performed at the same internal sampling frequency as in the encoder.
FIG. 1 shows an example where the internal sampling frequency changes. In this example, the internal sampling frequency is 16,000 Hz in a frame i, and it is 12,800 Hz in the previous frame i−1. The linear prediction synthesis filter that represents the characteristics of an input signal in the previous frame i−1 needs to be estimated again after re-sampling the input signal at the changed internal sampling frequency of 16,000 Hz, or converted to the one corresponding to the changed internal sampling frequency of 16,000 Hz. The reason that the linear prediction synthesis filter needs to be calculated at a changed internal sampling frequency is to obtain the correct internal state of the linear prediction synthesis filter for the current input signal and to perform interpolation in order to obtain a model that is temporarily smoother. - One method for obtaining another linear prediction synthesis filter on the basis of the characteristics of a certain linear prediction synthesis filter is to calculate a linear prediction synthesis filter after conversion from a desired frequency response after conversion in a frequency domain as shown in
FIG. 2 . In this example, LSF coefficients are input as a parameter representing the linear prediction synthesis filter. It may be LSP coefficients, ISF coefficients, ISP coefficients or reflection coefficients, which are generally known as parameters equivalent to linear prediction coefficients. First, linear prediction coefficients are calculated in order to obtain a power spectrum Y(ω) of the linear prediction synthesis filter at the first internal sampling frequency (001). This step can be omitted when the linear prediction coefficients are known. Next, the power spectrum Y(ω) of the linear prediction synthesis filter, which is determined by the obtained linear prediction coefficients, is calculated (002). Then, the obtained power spectrum is modified to a desired power spectrum Y′(ω) (003). Autocorrelation coefficients are calculated from the modified power spectrum (004). Linear prediction coefficients are calculated from the autocorrelation coefficients (005). The relationship between the autocorrelation coefficients and the linear prediction coefficients is known as the Yule-Walker equation, and the Levinson-Durbin algorithm is well known as a solution of that equation. - This algorithm is effective in conversion of a sampling frequency of the above-described linear prediction synthesis filter. This is because, although a signal that is temporally ahead of a signal in a frame to be encoded, which is called a look-ahead signal, is generally used in linear prediction analysis, the look-ahead signal cannot be used when performing linear prediction analysis again in a decoder.
- As described above, in speech and audio coding with two different internal sampling frequencies, it is preferred to use a power spectrum in order to convert the internal sampling frequency of a known linear prediction synthesis filter. However, because calculation of a power spectrum is complex computation, there is a problem that the amount of computation is large.
- Non Patent Literature 1: ITU-T Recommendation G.718
- Non Patent Literature 2: Speech coding and synthesis, W. B. Kleijn, K. K. Pariwal, et al. ELSEVIER.
- As described above, there is a problem that, in a coding scheme that has a linear prediction synthesis filter with two different internal sampling frequencies, a large amount of computation is required to convert the linear prediction synthesis filter at a certain internal sampling frequency into the one at a desired internal sampling frequency.
- To solve the above problem, a linear prediction coefficient conversion device according to one aspect of the present invention is a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, which includes a means for calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a means for calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum, and a means for converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency. In this configuration, it is possible to effectively reduce the amount of computation.
- Further, in the linear prediction coefficient conversion device according to one aspect of the present invention, the power spectrum corresponding to the second linear prediction coefficients may be obtained by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2), and extrapolating the power spectrum calculated using the first linear prediction coefficients for (N2−N1) number of power spectrum components. In this configuration, it is possible to effectively reduce the amount of computation when the second sampling frequency is higher than the first sampling frequency.
- Further, in the linear prediction coefficient conversion device according to one aspect of the present invention, the power spectrum corresponding to the second linear prediction coefficients may be obtained by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2). In this configuration, it is possible to effectively reduce the amount of computation when the second sampling frequency is lower than the first sampling frequency.
- One aspect of the present invention can be described as an invention of a device as mentioned above and, in addition, may also be described as an invention of a method as follows. They fall under different categories but are substantially the same invention and achieve similar operation and effects.
- Specifically, a linear prediction coefficient conversion method according to one aspect of the present invention is a linear prediction coefficient conversion method performed by a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, the method including a step of calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a step of calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum and a step of converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
- Further, a linear prediction coefficient conversion method according to one aspect of the present invention may obtain the power spectrum corresponding to the second linear prediction coefficients by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2), and extrapolating the power spectrum calculated using the first linear prediction coefficients for (N2−N1) number of power spectrum components.
- Further, a linear prediction coefficient conversion method according to one aspect of the present invention may obtain the power spectrum corresponding to the second linear prediction coefficients by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2).
- It is possible to estimate a linear prediction synthesis filter after conversion of an internal sampling frequency with a smaller amount of computation than the existing means.
-
FIG. 1 is a view showing the relationship between switching of an internal sampling frequency and a linear prediction synthesis filter. -
FIG. 2 is a view showing conversion of linear prediction coefficients. -
FIG. 3 is a flowchart ofconversion 1. -
FIG. 4 is a flowchart ofconversion 2. -
FIG. 5 is a block diagram of an embodiment of the present invention. -
FIG. 6 is a view showing the relationship between a unit circle and a cosine function. - Embodiments of a device, a method and a program are described hereinafter with reference to the drawings. Note that, in the description of the drawings, the same elements are denoted by the same reference symbols and redundant description thereof is omitted.
- First, definitions required to describe embodiments are described hereinafter.
- A response of an Nth order autoregressive linear prediction filter (which is referred to hereinafter as a linear prediction synthesis filter)
-
- can be adapted to the power spectrum Y(ω) by calculating autocorrelation
-
- for a known power spectrum Y(ω) at an angular frequency ω∈[−π, π] and, using the Nth order autocorrelation coefficients, solving linear prediction coefficients a1,a2, . . . , an by the Levinson-Durbin method as a typical method, for example.
- Such generation of an autoregressive model using a known power spectrum can be used also for modification of a linear
prediction synthesis filter 1/A(z) in the frequency domain. This is achieved by calculating the power spectrum of a known filter -
Y(ω)=1/|A(ω)|2 (3) - and modifying the obtained power spectrum Y(ω) by an appropriate method that is suitable for the purpose to obtain the modified power spectrum Y′(ω), then calculating the autocorrelation coefficients of Y′(ω) by the above equation (2), and obtaining the linear prediction coefficients of the modified
filter 1/A′(z) by the Levinson-Durbin algorithm or a similar method. - While the equation (2) cannot be analytically calculated except for simple cases, the rectangle approximation can be used as follows, for example.
-
- where Ω indicates the M number of frequencies placed at regular intervals at the angular frequency [−π,π]. When the symmetric property of Y(−ω)=−Y(ω) is used, the above-mentioned addition only needs to evaluate the angular frequency ω∈[0, π], which corresponds to the upper half of the unit circle. Thus, it is preferred in terms of the amount of computation that the rectangle approximation represented by the above equation (4) is altered as follows
-
- where Ω indicates the (N−2) number of frequencies placed at regular intervals at (0, π), excluding 0 and π.
- Hereinafter, line spectral frequencies (which are referred to hereinafter as LSF) as an equivalent means of expression of linear prediction coefficients are described hereinafter.
- The representation by LSF is used in various speech and audio coding techniques for the feature quantity of a linear prediction synthesis filter, and the operation and coding of a linear prediction synthesis filter. The LSF uniquely characterizes the Nth order polynomial A(z) by the n number of parameters which are different from linear prediction coefficients. The LSF has characteristics such as it easily guarantee the stability of a linear prediction synthesis filter, it is intuitively interpreted in the frequency domain, it is less likely to be affected by quantization errors than other parameters such as linear prediction coefficients and reflection coefficients, it is suitable for interpolation and the like.
- For the purpose of one embodiment of the present invention, LSF is defined as follows.
- LSF decomposition of the Nth order polynomial A(z) can be represented as follows by using displacement of an integer where κ≧0
-
A(z)={P(z)+Q(z)}/2 (6) - where P(z)=A(z)+z−n−κA(z−1) and
-
Q(z)=A(z)−z −n−κ A(z −1) - The equation (6) indicates that P(z) is symmetric and Q(z) is antisymmetric as follows
-
P(z)=z −n−κ P(z −1) -
Q(z)=−z −n−κ Q(z −1) - Such symmetric property is an important characteristic in LSF decomposition.
- It is obvious that P(z) and Q(z) each have a root at z=±1. Those obvious roots are as shown in the table 1 as n and κ. Thus, polynomials representing the obvious roots of P(z) and Q(z) are defined as PT(z) and QT(z), respectively. When P(z) does not have an obvious root, PT(z) is 1. The same applies to Q(z).
- LSF of A(z) is a non-trivial root of the positive phase angle of P(z) and Q(z). When the polynomial A(z) is the minimum phase, that is, when all roots of A(z) are inside the unit circle, the non-trivial roots of P(z) and Q(z) are arranged alternately on the unit circle. The number of complex roots of P(z) and Q(z) is mP and mQ, respectively. Table 1 shows the relationship of mP and mQ with the order n and displacement κ.
- When the complex roots of P(z), which is the positive phase angle, are represented as
-
ω0, ω2, . . . , ω2mP −2 - and the roots of Q(z) are represented as
-
ω1, ω3, . . . , ω2mQ −1, - the positions of the roots of the polynomial A(z), which is the minimum phase, can be represented as follows.
-
0<ω0<ω1< . . . <ωmP +mQ −1<π (7) - In speech and audio coding, displacement κ=0 or κ=1 is used. When κ=0, it is generally called immitance spectral frequency (ISF), and when κ=1, it is generally called LSF in a narrower sense than that in the description of one embodiment of the present invention. Note that, however, the representation using displacement can handle both of ISF and LSF in a unified way. In many cases, a result obtained by LSF can be applied as it is to given κ≧0 or can be generalized.
- When κ=0, the LSF representation only has the (mP+mQ=n−1) number of frequency parameters as shown in Table 1. Thus, one more parameter is required to uniquely represent A(z), and the n-th reflection coefficient (which is referred to hereinafter as γn) of A(z) is typically used. This parameter is introduced into LSF decomposition as the next factor.
-
υ=−(γn+1)/(γn−1) (8) - where γn is the n-th reflection coefficient of A(z) which begins with Q(z), and it is typically γn=an.
- When κ=1, the (mP+mQ=n) number of parameters are obtained by LSF decomposition, and it is possible to uniquely represent A(z). In this case, υ=1.
-
TABLE 1 Case n κ mP MQ Pr (z) Qr (z) ν (1) even 0 n/2 n/2 − 1 1 z2 − 1 −(γn + 1)/ (γn − 1) (2) odd 0 (n − 1)/2 (n − 1)/2 z + 1 z − 1 −(γn + 1)/ (γn − 1) (3) even 1 n/2 n/2 z + 1 z − 1 1 (4) odd 1 (n + 1)/2 (n − 1)/2 1 z2 − 1 1 - In consideration of the fact that non-obvious roots, excluding obvious roots, are a pair of complex numbers on the unit circle and obtain symmetric polynomials, the following equation is obtained.
-
- Likewise,
-
Q(z)/υQ T(z)=z −mQ ((z mQ −z −mQ )+q 1(z mQ −1 −z −mQ +1)+ . . . +q mQ ) (10) - In those polynomials,
-
p1, p2, . . . , pmP -
and -
q1, q2, . . . , qmQ - completely represent P(z) and Q(z) by using given displacement κ and ν that is determined by the order n of A(z). Those coefficients can be directly obtained from the expressions (6) and (8).
- When z=ejω and using the following relationship
-
z k +z −k =e jωk +e −jωk=2 cos ωk - the expressions (9) and (10) can be represented as follows
-
P(ω)=2e −jωmP R(ω)P T(ω) (11) -
Q(ω)=2e −jωmQ υS(ω)Q T(ω) (12) -
where -
R(ω)=cos m P ω+p 1 cos(m P−1)ω+ . . . +p mP /2 (13) -
and -
S(ω)=cos m Q ω+q 1 cos(m Q−1)ω+ . . . +q mQ /2 (14) - Specifically, LSF of the polynomial A(z) is the roots of R(ω) and S(ω) at the angular frequency ω∈(0, π).
- The Chebyshev polynomials of the first kind, which is used in one embodiment of the present invention, is described hereinafter.
- The Chebyshev polynomials of the first kind is defined as follows using a recurrence relation
-
T k+1(x)=2xT k(x)−T k−1(x) k=1,2, . . . (15) - Note that the initial values are T0(x)=1 and T1(x)=x, respectively. For x where [−1, 1], the Chebyshev polynomials can be represented as follows
-
T k(x)=cos {k cos−1 x} k=0,1, . . . (16) - One embodiment of the present invention explains that the equation (15) provides a simple method for calculating cos kω (where k=2,3, . . . ) that begins with cos ω and
cos 0=1. Specifically, with use of the equation (16), the equation (15) is rewritten in the following form -
cos kω=2 cos ω cos(k−1)ω−cos(k−2)ω k=2,3, . . . (17) - When conversion ω=arc cos x is used, the first polynomials obtained from the equation (15) are as follows
-
- When the equations (13) and (14) for x∈[−1,1] are replaced by those Chebyshev polynomials, the following equations are obtained
-
R(x)=T mP (x)+p 1 T mP−1 (x)+ . . . +p mP /2 (18) -
S(x)=T mQ (x)+q 1 T mQ−1 (x)+ . . . +q mQ /2 (19) - When LSFωi is known for i=0,1, . . . , mP+mQ−1, the following equations are obtained using the cosine of LSF xi=cos ωi (LSP)
-
R(x)=r 0(x−x 0)(x−x 2) . . . (x−x 2mP −2) (20) -
S(x)=s 0(x−x 1)(x−x 3) . . . (x−x 2mQ −1) (21) - The coefficients r0 and s0 can be obtained by comparison of the equations (18) and (19) with (20) and (21) on the basis of mP and mQ.
- The equations (20) and (21) are written as
-
R(x)=r 0 x mP +r 1 x mP −1 + . . . +r mP (22) -
S(x)=s 0 x mQ +s 1 x mQ −1 + . . . +s mQ (23) - Those polynomials can be efficiently calculated for a given x by a method known as the Horner's method. The Horner's method obtains R(x)=b0(x) by use of the following recursive relation
-
b k(x)=xb k+1(x)+r k - where the initial value is
-
b mP (x)=r mP - The same applies to S(x).
- A method of calculating the coefficients of the polynomials of the equations (22) and (23) is described hereinafter using an example. It is assumed in this example that the order of A(z) is 16 (n=16). Accordingly, mP=mQ=8 in this case. Series expansion of the equation (18) can be represented in the form of the equation (22) by substitution and simplification by the Chebyshev polynomials. As a result, the coefficients of the polynomial of the equation (22) are represented as follows using the coefficient pi of the polynomial P(z).
-
- The coefficients of P(z) can be obtained from the equation (6). This example can be applied also to the polynomial of the equation (23) by using the same equation and using the coefficients of Q(z). Further, the same equation for calculating the coefficients of R(x) and S(x) can easily derive another order n and displacement κ as well.
- Further, when the roots of the equations (20) and (21) are known, coefficients can be obtained from the equations (20) and (21).
- The outline of processing according to one embodiment of the present invention is described hereinafter.
- One embodiment of the present invention provides an effective calculation method and device for, when converting a linear prediction synthesis filter calculated in advance by an encoder or a decoder at a first sampling frequency to the one at a second sampling frequency, calculating the power spectrum of the linear prediction synthesis filter and modifying it to the second sampling frequency, and then obtaining autocorrelation coefficients from the modified power spectrum.
- A calculation method for the power spectrum of a linear prediction synthesis filter according to one embodiment of the present invention is described hereinafter. The calculation of the power spectrum uses the LSF decomposition of the equation (6) and the properties of the polynomials P(z) and Q(z). By using the LSF decomposition and the above-described Chebyshev polynomials, the power spectrum can be converted to the real axis of the unit circle.
- With the conversion to the real axis, it is possible to achieve an effective method for calculating a power spectrum at an arbitrary frequency in ω∈[0, π]. This is because it is possible to eliminate transcendental functions since the power spectrum is represented by polynomials. Particularly, it is possible to simplify the calculation of the power spectrum at ω=0, ω=π/2 and ω=π. The same simplification is applicable also to LSF where either one of P(z) or Q(z) is zero. Such properties are advantageous compared with FFT, which is generally used for the calculation of the power spectrum.
- It is known that the power spectrum of A(z) can be represented as follows using LSF decomposition.
-
|A(ω)|2 ={|P(ω)|2 +|Q(ω)|2}/4 (26) - One embodiment of the present invention uses the Chebyshev polynomials as a way to more effectively calculate the power spectrum |A(ω)|2 of A(z) compared with the case of directly applying the equation (26). Specifically, the power spectrum |A(ω)|2 is calculated on the real axis of the unit circle as represented by the following equation, by converting a variable to x=cos ω and using LSF decomposition by the Chebyshev polynomials.
-
- (1) to (4) correspond to (1) to (4) in Table 1, respectively.
- The equation (27) is proven as follows.
- The following equations are obtained from the equations (11) and (12).
-
|P(ω)|2=4|R(ω)|2 |P T(ω)|2 -
|Q(ω)|2=4υ2 |S(ω)|2 |Q T(ω)|2 - The factors that represent the obvious roots of P(ω) and Q(ω) are respectively as follows.
-
- Application of the substitution cos ω=x and cos 2ω=2x2−1 to |PT(ω)| and |QT(ω)|, respectively, gives the equation (27).
- The polynomials R(x) and S(x) may be calculated by the above-described Horner's method. Further, when x to calculate R(x) and S(x) is known, the calculation of a trigonometric function can be omitted by storing x in a memory.
- The calculation of the power spectrum of A(z) can be further simplified. First, in the case of calculating with LSF, one of R(x) and S(x) in the corresponding equation (27) is zero. When the displacement is κ=1 and the order n is an even number, the equation (27) is simplified as follows.
-
- Further, in the case of ω={0,π/2,π}, it is simplified when x={1,0,−1}. The equations are as follows when the displacement is κ=1 and the order n is an even number, which are the same as in the above example.
-
|A(ω=0)|2=4R 2(1) -
|A(ω=π/2)|2=2(R 2(0)+S 2(0)) -
|A(ω=π)|2=4S 2(−1) - The similar results can be easily obtained also when the displacement is κ=0 and the order n is an odd number.
- The calculation of autocorrelation coefficients according to one embodiment of the present invention is described below.
- In the equation (5), when a frequency Ω+=Δ,2Δ, . . . , (N−1)Δ where N is an odd number and the interval of frequencies is Δ=π/(N−1) is defined, the calculation of autocorrelation contains the above-described simplified power spectrum at ω=0,π/2,π. Because the normalization of autocorrelation coefficients by 1/N does not affect linear prediction coefficients to be obtained as a result, any positive value can be used.
- Still, however, the calculation of the equation (5) requires cos kω where k=1,2, . . . , n for each of the (N−2) number of frequencies. Thus, the symmetric property of cos kω is used.
-
cos(π−kω)=(−1)k cos kω, ω∈(0, π/2) (28) - The following characteristics are also used.
-
cos(kπ/2)=(1/2)(1+(−1)k+1)(−1)└k/2┘ (29) - where └x┘ indicates the largest integer that does not exceed x. Note that the equation (29) is simplified to 2,0,−2,0,2,0, . . . for k=0,1,2, . . . .
- Further, by conversion to x=cos ω, the autocorrelation coefficients are moved onto the real axis of the unit circle. For this purpose, the variable X(x)=Y(arc cos x) is introduced. This enables the calculation of cos kω by use of the equation (15).
- Given the above, the autocorrelation approximation of the equation (5) can be replaced by the following equation.
-
- where Tk(x)=2xTk−1(x)−Tk−2(x) k=2,3, . . . , n, and T0(x)=1, T1(x)=cos x as described above. When the symmetric property of the equation (28) is taken into consideration, the last term of the equation (30) needs to be calculated only when x∈Λ={cos Δ,cos2Δ, . . . , (N−3)Δ/2}, and the (N−3)/2 number of cosine values can be stored in a memory.
FIG. 6 shows the relationship between the frequency Λ and the cosine function when N=31. - An example of the present invention is described hereinafter. In this example, a case of converting a linear prediction synthesis filter calculated at a first sampling frequency of 16,000 Hz to that at a second sampling frequency of 12,800 Hz (which is referred to hereinafter as conversion 1) and a case of converting a linear prediction synthesis filter calculated at a first sampling frequency of 12,800 Hz to that at a second sampling frequency of 16,000 Hz (hereinafter as conversion 2) are used. Those two sampling frequencies have a ratio of 4:5 and are generally used in speech and audio coding. Each of the
conversion 1 and theconversion 2 of this example is performed on the linear prediction synthesis filter in the previous frame when the internal sampling frequency has changed, and it can be performed in any of an encoder and a decoder. Such conversion is required for setting the correct internal state to the linear prediction synthesis filter in the current frame and for performing interpolation of the linear prediction synthesis filter in accordance with time. - Processing in this example is described hereinafter with reference to the flowcharts of
FIGS. 3 and 4 . - To calculate a power spectrum and autocorrelation coefficients by using a common frequency point in both cases of the
conversions - When the number of frequencies to be used for the calculation of a power spectrum and autocorrelation coefficients in the domain where the sampling frequency is 16,000 Hz is N=31, the interval of frequencies is Δ=π/30, and the number of elements required for the calculation of autocorrelation contained in Λ is (N−3)/2=14.
- The
conversion 1 that is performed in an encoder and a decoder under the above conditions is carried out in the following procedure. - Determine the coefficients of polynomials R(x) and S(x) by using the equations (20) and (21) from roots obtained by displacement κ=0 or κ=1 and LSF which correspond to a linear prediction synthesis filter obtained at a sampling frequency of 16,000 Hz, which is the first sampling frequency (Step S001).
- Calculate the power spectrum of the linear prediction synthesis filter at the second sampling frequency up to 6,400 Hz, which is the Nyquist frequency of the second sampling frequency. Because this cutoff frequency corresponds to ω=(4/5)π at the first sampling frequency, a power spectrum is calculated using the equation (27) at NL=25 number of frequencies on the low side. For the calculation of R(x) and S(x), the Horner's method may be used to reduce the calculation. There is no need to calculate a power spectrum for the remaining 6 (=N−NL) frequencies on the high side (Step S002).
- Calculate autocorrelation coefficients corresponding to the power spectrum obtained in Step S002 by using the equation (30). In this step, N in the equation (30) is set to NL=25, which is the number of frequencies at the second sampling frequency (Step S003).
- Derive linear prediction coefficients by the Levinson-Durbin method or a similar method with use of the autocorrelation coefficient obtained in Step S003, and obtain a linear prediction synthesis filter at the second sampling frequency (Step S004).
- Convert the linear prediction coefficient obtained in Step S004 to LSF (Step S005).
- The
conversion 2 that is performed in an encoder or a decoder can be achieved in the following procedure, in the same manner as theconversion 1. - Determine the coefficients of polynomials R(x) and S(x) by using the equations (20) and (21) from roots obtained by displacement κ=0 or κ=1 and LSF which correspond to a linear prediction synthesis filter obtained at a sampling frequency of 12,800 Hz, which is the first sampling frequency (Step S011).
- Calculate the power spectrum of the linear prediction synthesis filter at the second sampling frequency up to 6,400 Hz, which is the Nyquist frequency of the first sampling frequency, first. This cutoff frequency corresponds to ω=π, and a power spectrum is calculated using the equation (27) at NL=25 number of frequencies. For the calculation of R(x) and S(x), the Horner's method may be used to reduce the calculation. For 6 frequencies exceeding 6,400 Hz at the second sampling frequency, a power spectrum is extrapolated. As an example of extrapolation, the power spectrum obtained at the NL-th frequency may be used (Step S012).
- Calculate autocorrelation coefficients corresponding to the power spectrum obtained in Step S012 by using the equation (30). In this step, N in the equation (30) is set to N=31, which is the number of frequencies at the second sampling frequency (Step S013).
- Derive linear prediction coefficients by the Levinson-Durbin method or a similar method with use of the autocorrelation coefficient obtained in Step S013, and obtain a linear prediction synthesis filter at the second sampling frequency (Step S014).
- Convert the linear prediction coefficient obtained in Step S014 to LSF (Step S015).
-
FIG. 5 is a block diagram in the example of the present invention. A real powerspectrum conversion unit 100 is composed of apolynomial calculation unit 101, a real powerspectrum calculation unit 102, and a real powerspectrum extrapolation unit 103, and further a realautocorrelation calculation unit 104 and a linear predictioncoefficient calculation unit 105 are provided. This is to achieve the above-describedconversions spectrum conversion unit 100 receives, as an input, LSF representing a linear prediction synthesis filter at the first sampling frequency, and outputs the power spectrum of a desired linear prediction synthesis filter at the second sampling frequency. First, thepolynomial calculation unit 101 performs the processing in Steps S001, S011 described above to calculate the polynomials R(x) and S(x) from LSF. Next, the real powerspectrum calculation unit 102 performs the processing in Steps S002 or S012 to calculate the power spectrum. Further, the real powerspectrum extrapolation unit 103 performs extrapolation of the spectrum, which is performed in Step S012 in the case of theconversion 2. By the above process, the power spectrum of a desired linear prediction synthesis filter is obtained at the second sampling frequency. After that, the realautocorrelation calculation unit 104 performs the processing in Steps S003 and S013 to convert the power spectrum to autocorrelation coefficients. Finally, the linear predictioncoefficient calculation unit 105 performs the processing in Steps S004 and S014 to obtain linear prediction coefficients from the autocorrelation coefficients. Note that, although this block diagram does not show the block corresponding to S005 and S015, the conversion from the linear prediction coefficients to LSF or another equivalent coefficients can be easily achieved by a known technique. - Although the coefficients of the polynomials R(x) and S(x) are calculated using the equations (20) and (21) in Steps S001 and S011 of the above-described example, the calculation may be performed using the coefficients of the polynomials of the equations (9) and (10), which can be obtained from the linear prediction coefficients. Further, the linear prediction coefficients may be converted from LSP coefficients or ISP coefficients.
- Furthermore, in the case where a power spectrum at the first sampling frequency or the second sampling frequency is known by some method, the power spectrum may be converted to that at the second sampling frequency, and Steps S001, S002, S011 and S012 may be omitted.
- In addition, in order to assign weights in the frequency domain, a power spectrum may be deformed, and linear prediction coefficients at the second sampling frequency may be obtained.
- 100 . . . real power spectrum conversion unit, 101 . . . polynomial calculation unit, 102 . . . real power spectrum calculation unit, 103 . . . real power spectrum extrapolation unit, 104 . . . real autocorrelation calculation unit, 105 . . . linear prediction coefficient calculation unit
Claims (6)
1. A linear prediction coefficient conversion device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, comprising a circuitry configured to:
calculate, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter;
calculate, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum; and
convert the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
2. The linear prediction coefficient conversion device according to claim 1 , wherein the power spectrum corresponding to the second linear prediction coefficients is obtained by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2), and extrapolating the power spectrum calculated using the first linear prediction coefficients for obtaining (N2−N1) number of power spectrum components.
3. The linear prediction coefficient conversion device according to claim 1 , wherein the power spectrum corresponding to the second linear prediction coefficients is obtained by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2).
4. A linear prediction coefficient conversion method performed by a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, comprising:
a step of calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter;
a step of calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum; and
a step of converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
5. The linear prediction coefficient conversion method according to claim 4 , wherein the power spectrum corresponding to the second linear prediction coefficients is obtained by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2), and extrapolating the power spectrum calculated using the first linear prediction coefficients for obtaining (N2−N1) number of power spectrum components.
6. The linear prediction coefficient conversion method according to claim 4 , wherein the power spectrum corresponding to the second linear prediction coefficients is obtained by calculating a power spectrum using the first linear prediction coefficients at points on the real axis corresponding to N1 number of different frequencies, where N1=1+(F1/F2)(N2−1), when the first sampling frequency is F1 and the second sampling frequency is F2 (where F1<F2).
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014090781 | 2014-04-25 | ||
JP2014-090781 | 2014-04-25 | ||
PCT/JP2015/061763 WO2015163240A1 (en) | 2014-04-25 | 2015-04-16 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2015/061763 A-371-Of-International WO2015163240A1 (en) | 2014-04-25 | 2015-04-16 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/191,083 Continuation US10714107B2 (en) | 2014-04-25 | 2018-11-14 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
US16/191,104 Continuation US10714108B2 (en) | 2014-04-25 | 2018-11-14 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
Publications (2)
Publication Number | Publication Date |
---|---|
US20170053655A1 true US20170053655A1 (en) | 2017-02-23 |
US10163448B2 US10163448B2 (en) | 2018-12-25 |
Family
ID=54332406
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/306,292 Active US10163448B2 (en) | 2014-04-25 | 2015-04-16 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
US16/191,083 Active US10714107B2 (en) | 2014-04-25 | 2018-11-14 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
US16/191,104 Active US10714108B2 (en) | 2014-04-25 | 2018-11-14 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
US16/897,233 Active US11222644B2 (en) | 2014-04-25 | 2020-06-09 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/191,083 Active US10714107B2 (en) | 2014-04-25 | 2018-11-14 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
US16/191,104 Active US10714108B2 (en) | 2014-04-25 | 2018-11-14 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
US16/897,233 Active US11222644B2 (en) | 2014-04-25 | 2020-06-09 | Linear prediction coefficient conversion device and linear prediction coefficient conversion method |
Country Status (22)
Country | Link |
---|---|
US (4) | US10163448B2 (en) |
EP (3) | EP4343763A3 (en) |
JP (4) | JP6018724B2 (en) |
KR (4) | KR101920297B1 (en) |
CN (2) | CN107945812B (en) |
AU (4) | AU2015251609B2 (en) |
BR (1) | BR112016024372B1 (en) |
CA (4) | CA3042069C (en) |
DK (2) | DK3471095T3 (en) |
ES (1) | ES2709329T3 (en) |
FI (1) | FI3471095T3 (en) |
HK (1) | HK1226547B (en) |
HU (1) | HUE066731T2 (en) |
MX (1) | MX352479B (en) |
MY (1) | MY167352A (en) |
PH (1) | PH12016502076B1 (en) |
PL (2) | PL3136384T3 (en) |
PT (2) | PT3471095T (en) |
RU (4) | RU2673691C1 (en) |
TR (1) | TR201901328T4 (en) |
TW (1) | TWI576831B (en) |
WO (1) | WO2015163240A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10897262B2 (en) * | 2017-03-20 | 2021-01-19 | Texas Instruments Incorporated | Methods and apparatus to determine non linearity in analog-to-digital converters |
US11282530B2 (en) * | 2014-04-17 | 2022-03-22 | Voiceage Evs Llc | Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111210837B (en) * | 2018-11-02 | 2022-12-06 | 北京微播视界科技有限公司 | Audio processing method and device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6253172B1 (en) * | 1997-10-16 | 2001-06-26 | Texas Instruments Incorporated | Spectral transformation of acoustic signals |
US20020032562A1 (en) * | 2000-07-05 | 2002-03-14 | Van Den Enden Adrianus Wilhelmus Maria | Method of calculating line spectral frequencies |
KR20050113744A (en) * | 2004-05-31 | 2005-12-05 | 에스케이 텔레콤주식회사 | System and method for construction of voice codebook |
US20060149532A1 (en) * | 2004-12-31 | 2006-07-06 | Boillot Marc A | Method and apparatus for enhancing loudness of a speech signal |
Family Cites Families (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2059726B (en) * | 1979-10-03 | 1984-06-27 | Nippon Telegraph & Telephone | Sound synthesizer |
JPS5853352B2 (en) * | 1979-10-03 | 1983-11-29 | 日本電信電話株式会社 | speech synthesizer |
JP3707116B2 (en) * | 1995-10-26 | 2005-10-19 | ソニー株式会社 | Speech decoding method and apparatus |
JPH09230896A (en) * | 1996-02-28 | 1997-09-05 | Sony Corp | Speech synthesis device |
KR970063031U (en) * | 1996-05-07 | 1997-12-11 | Brake pad of vehicle | |
FI119576B (en) * | 2000-03-07 | 2008-12-31 | Nokia Corp | Speech processing device and procedure for speech processing, as well as a digital radio telephone |
US7739052B2 (en) | 2001-05-18 | 2010-06-15 | International Business Machines Corporation | Pattern discovery techniques for determining maximal irredundant and redundant motifs |
US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
US6829579B2 (en) * | 2002-01-08 | 2004-12-07 | Dilithium Networks, Inc. | Transcoding method and system between CELP-based speech codes |
US20040002856A1 (en) * | 2002-03-08 | 2004-01-01 | Udaya Bhaskar | Multi-rate frequency domain interpolative speech CODEC system |
US7027980B2 (en) * | 2002-03-28 | 2006-04-11 | Motorola, Inc. | Method for modeling speech harmonic magnitudes |
EP1383109A1 (en) * | 2002-07-17 | 2004-01-21 | STMicroelectronics N.V. | Method and device for wide band speech coding |
DE602005009374D1 (en) * | 2004-09-06 | 2008-10-09 | Matsushita Electric Ind Co Ltd | SCALABLE CODING DEVICE AND SCALABLE CODING METHOD |
KR100721537B1 (en) * | 2004-12-08 | 2007-05-23 | 한국전자통신연구원 | Apparatus and Method for Highband Coding of Splitband Wideband Speech Coder |
EP1953737B1 (en) * | 2005-10-14 | 2012-10-03 | Panasonic Corporation | Transform coder and transform coding method |
WO2007120316A2 (en) * | 2005-12-05 | 2007-10-25 | Qualcomm Incorporated | Systems, methods, and apparatus for detection of tonal components |
CN101149927B (en) * | 2006-09-18 | 2011-05-04 | 展讯通信(上海)有限公司 | Method for determining ISF parameter in linear predication analysis |
CN101484935B (en) * | 2006-09-29 | 2013-07-17 | Lg电子株式会社 | Methods and apparatuses for encoding and decoding object-based audio signals |
CN101266797B (en) * | 2007-03-16 | 2011-06-01 | 展讯通信(上海)有限公司 | Post processing and filtering method for voice signals |
CN101030375B (en) * | 2007-04-13 | 2011-01-26 | 清华大学 | Method for extracting base-sound period based on dynamic plan |
JP4691082B2 (en) * | 2007-09-11 | 2011-06-01 | 日本電信電話株式会社 | Linear prediction model order determination apparatus, linear prediction model order determination method, program thereof, and recording medium |
CN101388214B (en) * | 2007-09-14 | 2012-07-04 | 向为 | Speed changing vocoder and coding method thereof |
ES2678415T3 (en) * | 2008-08-05 | 2018-08-10 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and procedure for processing and audio signal for speech improvement by using a feature extraction |
JP4918074B2 (en) * | 2008-08-18 | 2012-04-18 | 日本電信電話株式会社 | Encoding device, encoding method, encoding program, and recording medium |
CN101770777B (en) * | 2008-12-31 | 2012-04-25 | 华为技术有限公司 | Linear predictive coding frequency band expansion method, device and coding and decoding system |
JP4932917B2 (en) | 2009-04-03 | 2012-05-16 | 株式会社エヌ・ティ・ティ・ドコモ | Speech decoding apparatus, speech decoding method, and speech decoding program |
KR101747917B1 (en) * | 2010-10-18 | 2017-06-15 | 삼성전자주식회사 | Apparatus and method for determining weighting function having low complexity for lpc coefficients quantization |
CN102065291B (en) * | 2010-11-09 | 2012-11-21 | 北京工业大学 | Sparse representation model-based image decoding method |
SE536046C2 (en) * | 2011-01-19 | 2013-04-16 | Limes Audio Ab | Method and device for microphone selection |
CN102783034B (en) * | 2011-02-01 | 2014-12-17 | 华为技术有限公司 | Method and apparatus for providing signal processing coefficients |
CN102325090B (en) * | 2011-09-21 | 2014-04-09 | 电子科技大学 | Network flow estimating method |
WO2013068634A1 (en) * | 2011-11-10 | 2013-05-16 | Nokia Corporation | A method and apparatus for detecting audio sampling rate |
CN103366749B (en) * | 2012-03-28 | 2016-01-27 | 北京天籁传音数字技术有限公司 | A kind of sound codec devices and methods therefor |
CN102867516B (en) * | 2012-09-10 | 2014-08-27 | 大连理工大学 | Speech coding and decoding method using high-order linear prediction coefficient grouping vector quantization |
CN103021405A (en) * | 2012-12-05 | 2013-04-03 | 渤海大学 | Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter |
CN103050121A (en) | 2012-12-31 | 2013-04-17 | 北京迅光达通信技术有限公司 | Linear prediction speech coding method and speech synthesis method |
ES2770407T3 (en) * | 2014-01-24 | 2020-07-01 | Nippon Telegraph & Telephone | Linear predictive analytics logging apparatus, method, program and support |
EP2916319A1 (en) * | 2014-03-07 | 2015-09-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Concept for encoding of information |
KR102626320B1 (en) * | 2014-03-28 | 2024-01-17 | 삼성전자주식회사 | Method and apparatus for quantizing linear predictive coding coefficients and method and apparatus for dequantizing linear predictive coding coefficients |
RU2677453C2 (en) * | 2014-04-17 | 2019-01-16 | Войсэйдж Корпорейшн | Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates |
WO2016089749A1 (en) * | 2014-12-01 | 2016-06-09 | Hoarty W Leo | System and method for continuous media segment identification |
-
2015
- 2015-04-16 US US15/306,292 patent/US10163448B2/en active Active
- 2015-04-16 PL PL15783059T patent/PL3136384T3/en unknown
- 2015-04-16 EP EP24155958.2A patent/EP4343763A3/en active Pending
- 2015-04-16 ES ES15783059T patent/ES2709329T3/en active Active
- 2015-04-16 BR BR112016024372-2A patent/BR112016024372B1/en active IP Right Grant
- 2015-04-16 PT PT182054577T patent/PT3471095T/en unknown
- 2015-04-16 RU RU2017141922A patent/RU2673691C1/en active
- 2015-04-16 PT PT15783059T patent/PT3136384T/en unknown
- 2015-04-16 DK DK18205457.7T patent/DK3471095T3/en active
- 2015-04-16 EP EP18205457.7A patent/EP3471095B1/en active Active
- 2015-04-16 CA CA3042069A patent/CA3042069C/en active Active
- 2015-04-16 CA CA3042070A patent/CA3042070C/en active Active
- 2015-04-16 RU RU2016146095A patent/RU2639656C1/en active
- 2015-04-16 DK DK15783059.7T patent/DK3136384T3/en active
- 2015-04-16 KR KR1020187019377A patent/KR101920297B1/en active IP Right Grant
- 2015-04-16 HU HUE18205457A patent/HUE066731T2/en unknown
- 2015-04-16 MX MX2016013797A patent/MX352479B/en active IP Right Grant
- 2015-04-16 CN CN201711257010.3A patent/CN107945812B/en active Active
- 2015-04-16 KR KR1020187032874A patent/KR101957276B1/en active IP Right Grant
- 2015-04-16 KR KR1020167029288A patent/KR101772501B1/en active IP Right Grant
- 2015-04-16 WO PCT/JP2015/061763 patent/WO2015163240A1/en active Application Filing
- 2015-04-16 AU AU2015251609A patent/AU2015251609B2/en active Active
- 2015-04-16 KR KR1020177023413A patent/KR101878292B1/en active IP Right Grant
- 2015-04-16 FI FIEP18205457.7T patent/FI3471095T3/en active
- 2015-04-16 CA CA2946824A patent/CA2946824C/en active Active
- 2015-04-16 EP EP15783059.7A patent/EP3136384B1/en active Active
- 2015-04-16 TR TR2019/01328T patent/TR201901328T4/en unknown
- 2015-04-16 CN CN201580021060.4A patent/CN106233381B/en active Active
- 2015-04-16 MY MYPI2016703890A patent/MY167352A/en unknown
- 2015-04-16 PL PL18205457.7T patent/PL3471095T3/en unknown
- 2015-04-16 CA CA3042066A patent/CA3042066C/en active Active
- 2015-04-16 JP JP2016514893A patent/JP6018724B2/en active Active
- 2015-04-23 TW TW104113035A patent/TWI576831B/en active
-
2016
- 2016-09-30 JP JP2016193991A patent/JP6277245B2/en active Active
- 2016-10-19 PH PH12016502076A patent/PH12016502076B1/en unknown
- 2016-12-24 HK HK16114697A patent/HK1226547B/en unknown
-
2018
- 2018-01-15 JP JP2018004494A patent/JP6715269B2/en active Active
- 2018-06-22 AU AU2018204572A patent/AU2018204572B2/en active Active
- 2018-11-13 RU RU2018139927A patent/RU2694150C1/en active
- 2018-11-14 US US16/191,083 patent/US10714107B2/en active Active
- 2018-11-14 US US16/191,104 patent/US10714108B2/en active Active
-
2019
- 2019-06-27 RU RU2019120080A patent/RU2714390C1/en active
- 2019-12-12 AU AU2019280041A patent/AU2019280041B2/en active Active
- 2019-12-12 AU AU2019280040A patent/AU2019280040B2/en active Active
-
2020
- 2020-06-08 JP JP2020099191A patent/JP6936363B2/en active Active
- 2020-06-09 US US16/897,233 patent/US11222644B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6253172B1 (en) * | 1997-10-16 | 2001-06-26 | Texas Instruments Incorporated | Spectral transformation of acoustic signals |
US20020032562A1 (en) * | 2000-07-05 | 2002-03-14 | Van Den Enden Adrianus Wilhelmus Maria | Method of calculating line spectral frequencies |
KR20050113744A (en) * | 2004-05-31 | 2005-12-05 | 에스케이 텔레콤주식회사 | System and method for construction of voice codebook |
US20060149532A1 (en) * | 2004-12-31 | 2006-07-06 | Boillot Marc A | Method and apparatus for enhancing loudness of a speech signal |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11282530B2 (en) * | 2014-04-17 | 2022-03-22 | Voiceage Evs Llc | Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates |
US10897262B2 (en) * | 2017-03-20 | 2021-01-19 | Texas Instruments Incorporated | Methods and apparatus to determine non linearity in analog-to-digital converters |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11222644B2 (en) | Linear prediction coefficient conversion device and linear prediction coefficient conversion method | |
JP7077378B2 (en) | Information coding concept |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: NTT DOCOMO, INC., JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NAKA, NOBUHIKO;RUOPPILA, VESA;REEL/FRAME:040758/0850 Effective date: 20161014 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |