WO2015008783A1 - 線形予測分析装置、方法、プログラム及び記録媒体 - Google Patents
線形予測分析装置、方法、プログラム及び記録媒体 Download PDFInfo
- Publication number
- WO2015008783A1 WO2015008783A1 PCT/JP2014/068895 JP2014068895W WO2015008783A1 WO 2015008783 A1 WO2015008783 A1 WO 2015008783A1 JP 2014068895 W JP2014068895 W JP 2014068895W WO 2015008783 A1 WO2015008783 A1 WO 2015008783A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- coefficient
- max
- value
- fundamental frequency
- linear prediction
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Definitions
- the present invention relates to a technique for analyzing a digital time series signal such as a voice signal, an acoustic signal, an electrocardiogram, an electroencephalogram, a magnetoencephalogram, and a seismic wave.
- a digital time series signal such as a voice signal, an acoustic signal, an electrocardiogram, an electroencephalogram, a magnetoencephalogram, and a seismic wave.
- Non-Patent Documents 1 and 2). reference. a method of encoding based on a prediction coefficient obtained by linear predictive analysis of an input audio signal or acoustic signal is widely used (for example, Non-Patent Documents 1 and 2). reference.).
- Non-Patent Documents 1 to 3 the prediction coefficient is calculated by the linear prediction analyzer illustrated in FIG.
- the linear prediction analysis apparatus 1 includes an autocorrelation calculation unit 11, a coefficient multiplication unit 12, and a prediction coefficient calculation unit 13.
- the input signal which is a digital audio signal or digital audio signal in the time domain, is processed every N sample frames.
- n represents the sample number of each sample in the input signal, and N is a predetermined positive integer.
- P max is a predetermined positive integer less than N.
- Prediction coefficient calculation unit 13 the coefficient that can be converted by the prediction coefficient calculation unit 13 into linear prediction coefficients from the first order to the P max order that is a predetermined maximum order by using R ′ O (i), for example, by the Levinson-Durbin method or the like.
- Coefficients that can be converted into linear prediction coefficients include PARCOR coefficients K O (1), K O (2), ..., K O (P max ) and linear prediction coefficients a O (1), a O (2), ... , a O (P max ), etc.
- f s the sampling frequency.
- Non-Patent Document 3 describes an example in which a coefficient based on a function other than the above-described exponential function is used.
- the function used here is a function based on a sampling period ⁇ (corresponding to a period corresponding to f s ) and a predetermined constant a, and a fixed coefficient is also used.
- a modified autocorrelation R ′ O obtained by multiplying the autocorrelation R O (i) by a fixed coefficient w O (i). i) was used to find the coefficients that can be converted into linear prediction coefficients. Therefore, it is not necessary to modify the autocorrelation R O (i) by the multiplication of the coefficient w O (i), that is, the autocorrelation R O (i) itself is not the modified autocorrelation R ′ O (i).
- the input signal is such that the peak of the spectrum does not become too large in the spectral envelope corresponding to the coefficient that can be converted to the linear prediction coefficient.
- the spectral envelope corresponding to the coefficient that can be converted into the linear prediction coefficient obtained by the modified autocorrelation R ′ O (i) is expressed by the input signal X O (n ) May be reduced in accuracy, that is, the accuracy of linear prediction analysis may be reduced.
- An object of the present invention is to provide a linear predictive analysis method, apparatus, program, and recording medium with higher analysis accuracy than in the past.
- a prediction coefficient calculation step for obtaining coefficients that can be converted into linear prediction coefficients from the first order to the P max order, and for each order
- the coefficient table for which is acquired is the coefficient table t1, and if the period is long, the coefficient is acquired in the coefficient determination step.
- a coefficient table t2 the coefficient table, at least for some i w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i), for at least a portion of each i of the other i w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i), and w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i) for each remaining i.
- the coefficient table in which the coefficient is acquired in the coefficient determination step is the coefficient table t0, and if the basic frequency is medium, the coefficient table in which the coefficient is acquired in the coefficient determination step is the coefficient table t1, and the basic frequency is low.
- the coefficient table from which the coefficients are obtained in the coefficient determination step As Le t2, at least for some i w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i), w t0 (i) ⁇ about at least a portion of each i of the other i w t1 (i) ⁇ w t2 (i), and w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i) for each remaining i.
- the flowchart for demonstrating the example of a linear prediction analysis method The flowchart for demonstrating the example of the linear prediction analysis method of 2nd embodiment.
- the flowchart for demonstrating the example of the linear prediction analysis method of 2nd embodiment The block diagram for demonstrating the example of the linear prediction analyzer of 3rd embodiment.
- the flowchart for demonstrating a modification The block diagram for demonstrating a modification.
- the block diagram for demonstrating a modification The block diagram for demonstrating the example of the linear prediction analyzer of 4th embodiment.
- the linear prediction analysis apparatus 2 includes, for example, an autocorrelation calculation unit 21, a coefficient determination unit 24, a coefficient multiplication unit 22, and a prediction coefficient calculation unit 23.
- the operations of the autocorrelation calculation unit 21, the coefficient multiplication unit 22, and the prediction coefficient calculation unit 23 are the same as the operations in the autocorrelation calculation unit 11, the coefficient multiplication unit 12, and the prediction coefficient calculation unit 13 of the conventional linear prediction analysis apparatus 1, respectively. is there.
- An input signal X O (n) that is a digital signal such as a digital speech signal, a digital acoustic signal, an electrocardiogram, an electroencephalogram, a magnetoencephalogram, or a seismic wave in a time domain for each frame that is a predetermined time interval is input to the linear predictive analyzer 2. Is done.
- the input signal is an input time series signal.
- the input signal of the current frame X O (n) (n 0,1, ..., N-1) to. n represents the sample number of each sample in the input signal, and N is a predetermined positive integer.
- Is X O (n) (n N, N + 1,..., 2N ⁇ 1).
- the periodicity analysis unit 900 includes a fundamental frequency calculation unit 930, for example.
- the fundamental frequency P is obtained, and information that can identify the fundamental frequency P is output as information about the fundamental frequency. There are various known methods for obtaining the fundamental frequency, and any known method may be used.
- the obtained fundamental frequency P may be encoded to obtain a fundamental frequency code, and the fundamental frequency code may be output as information about the fundamental frequency. Further, the fundamental frequency quantization value ⁇ P corresponding to the fundamental frequency code may be obtained, and the fundamental frequency quantization value ⁇ P may be output as information about the fundamental frequency.
- the fundamental frequency calculation unit 930 a specific example of the fundamental frequency calculation unit 930 will be described.
- Fundamental frequency calculation unit 930, P s1 is a fundamental frequency of the M sub-frames constituting the current frame, ..., a maximum value max (P s1, ..., P sM) of the P sM information capable of identifying the Output as information about the fundamental frequency.
- Nn is a predetermined positive integer that satisfies the relationship Nn ⁇ N
- the fundamental frequency calculation unit 930 also obtains the fundamental frequency P next obtained for the signal interval of the previous frame and stored in the fundamental frequency calculation unit 930, that is, the current frame of the signal interval of the immediately previous frame.
- the fundamental frequency for each of a plurality of subframes may be obtained.
- this is an example where the fundamental frequency calculation unit 930 is operated after the linear prediction analysis apparatus 2 for the same frame.
- FIG. 2 is a flowchart of a linear prediction analysis method performed by the linear prediction analysis apparatus 2.
- the input signal X O (n) (n -Np, -Np + 1, ..., -1, 0,1, ..., N-1, N, ..., N-1 + Nn
- the autocorrelation R O (i) may be calculated using part of the input signals of the previous and subsequent frames.
- Np and Nn are predetermined positive integers that satisfy the relationship of Np ⁇ N and Nn ⁇ N, respectively.
- the autocorrelation may be obtained from the approximated power spectrum by using the MDCT sequence as an approximation of the power spectrum. As described above, any of known techniques used in the world can be used as the autocorrelation calculation method.
- Coefficient w O (i) is a coefficient for obtaining a deformation by modifying the autocorrelation R O (i) the autocorrelation R 'O (i).
- the coefficient w O (i) is also called a lag window w O (i) or a lag window coefficient w O (i) in the field of signal processing. Since the coefficient w O (i) is a positive value, the coefficient w O (i) is larger / smaller than the predetermined value, and the coefficient w O (i) is larger / smaller than the predetermined value. Sometimes expressed. Further, the size of Ragumado w O (i), shall mean the value of the lag window w O (i).
- the information about the fundamental frequency input to the coefficient determination unit 24 is information that specifies the fundamental frequency obtained from all or part of the input signal of the current frame and / or the input signal of a frame near the current frame. That is, the fundamental frequency used for determining the coefficient w O (i) is a fundamental frequency obtained from all or part of the input signal of the current frame and / or the input signal of a frame near the current frame.
- the coefficient determination unit 24 supports the information about the fundamental frequency in all or part of the possible range of the fundamental frequency corresponding to the information about the fundamental frequency for all or some orders from the 0th order to the P max order. As the fundamental frequency is larger, smaller values are determined as coefficients w O (0), w O (1),..., W O (P max ). Further, the coefficient determination unit 24 uses a value having a positive correlation with the fundamental frequency instead of the fundamental frequency, and decreases the coefficient w O (0), w O (1),. It may be determined as w O (P max ).
- the coefficient determination unit 24 determines the coefficient w O (i) using, for example, a monotone non-increasing function for the fundamental frequency corresponding to the input information about the fundamental frequency.
- the coefficient w O (i) is determined by the following equation (1).
- P is a fundamental frequency corresponding to information on the inputted fundamental frequency.
- the coefficient w O (i) is determined by the following equation (2) using ⁇ which is a predetermined value larger than 0.
- ⁇ is a value for adjusting the width of the lag window when the coefficient w O (i) is regarded as the lag window, in other words, the strength of the lag window.
- the predetermined ⁇ is obtained by encoding and decoding a speech signal or an acoustic signal with a coding device including the linear prediction analysis device 2 and a decoding device corresponding to the coding device for a plurality of candidate values of ⁇ , What is necessary is just to determine by selecting as a candidate value with favorable subjective quality and objective quality of a signal and a decoding acoustic signal as (alpha).
- the coefficient w O (i) may be determined by the following equation (2A) using a predetermined function f (P) for the fundamental frequency P.
- the equation for determining the coefficient w O (i) using the fundamental frequency P is not limited to the above equations (1), (2), (2A), and an increase in a value that is positively correlated with the fundamental frequency. Any other expression may be used as long as it can describe a monotonous non-increasing relationship.
- the coefficient w O (i) may be determined by any one of the following formulas (3) to (6).
- a is a real number determined depending on the fundamental frequency
- m is a natural number determined depending on the fundamental frequency.
- a is a value having a negative correlation with the fundamental frequency
- m is a value having a negative correlation with the fundamental frequency.
- ⁇ is a sampling period.
- Equation (3) is a window function in the form called Bartlett window
- Equation (4) is a window function in the format called Binomial window
- Equation (5) is a window function in the form called Triangular in frequency domain window
- (6) is a window function of the form called RectangularRin frequency domain window.
- the coefficient w O (i) may be monotonously decreased with an increase in a value that is positively correlated with the fundamental frequency for only at least some orders i, not for each i of 0 ⁇ i ⁇ P max .
- the magnitude of the coefficient w O (i) may not monotonously decrease as the value having a positive correlation with the fundamental frequency increases.
- the prediction coefficient calculation unit 23 obtains a coefficient that can be converted into a linear prediction coefficient using the modified autocorrelation R ′ O (i) (step S3).
- the prediction coefficient calculation unit 23 modified autocorrelation R 'with O (i), such as by Levinson-Durbin method, PARCOR coefficients K O (1 from the primary P max following to a predetermined maximum order ), K O (2), ..., K O (P max) and the linear prediction coefficients a O (1), a O (2), ..., calculates the a O (P max).
- modified autocorrelation R 'with O (i) such as by Levinson-Durbin method, PARCOR coefficients K O (1 from the primary P max following to a predetermined maximum order ), K O (2), ..., K O (P max) and the linear prediction coefficients a O (1), a O (2), ..., calculates the a O (P max).
- the coefficient w O (i) is multiplied by the autocorrelation to obtain a modified autocorrelation and a coefficient that can be converted into a linear prediction coefficient, resulting in a pitch component even when the input signal has a high fundamental frequency
- the quality is higher than the quality of the decoded speech signal and the decoded acoustic signal obtained by encoding and decoding the speech signal and the acoustic signal with the encoding device including the conventional linear prediction analysis device and the decoding device corresponding to the encoding device. ,good.
- the coefficient determination unit 24 determines the coefficient w O (i) based on a value that is negatively correlated with the fundamental frequency, instead of a value that is positively correlated with the fundamental frequency.
- the functional configuration of the linear prediction analysis apparatus 2 according to the modification of the first embodiment and the flowchart of the linear prediction analysis method performed by the linear prediction analysis apparatus 2 are the same as those in the first embodiment shown in FIGS.
- the linear prediction analysis apparatus 2 of the modified example of the first embodiment is the same as the linear prediction analysis apparatus 2 of the first embodiment, except for the part where the processing of the coefficient determination unit 24 is different.
- Information about the period of the digital speech signal and the digital acoustic signal for each frame is also input to the linear prediction analysis apparatus 2.
- Information about the period is obtained by the periodicity analysis unit 900 outside the linear prediction analysis apparatus 2.
- the periodicity analysis unit 900 includes a cycle calculation unit 940, for example.
- the period calculation unit 940 obtains the period T from all or part of the input signal X O of the current frame and / or the input signals of the frames near the current frame. For example, the period calculation unit 940 obtains the period T of the digital audio signal or digital acoustic signal in the signal section including all or part of the input signal X O (n) of the current frame, and the information that can identify the period T is determined as the period. Is output as information about. There are various known methods for obtaining the period, and any known method may be used. Alternatively, the obtained period T may be encoded to obtain a period code, and the period code may be output as information about the period. Furthermore, the quantization value ⁇ T of the period corresponding to the period code may be obtained, and the period quantization value ⁇ T may be output as information about the period. Hereinafter, a specific example of the period calculation unit 940 will be described.
- Period calculating section 940, T s1 is the period of M sub-frames constituting the current frame, ..., the minimum value min (T s1, ..., T sM) of the T sM for cycle specific information capable Is output as information.
- the period calculation unit 940 obtains the signal section of the previous frame and stores the period T next stored in the period calculation unit 940, that is, a part of the current frame in the signal section of the previous frame.
- input signal X O (n) (n 0, 1, ..., Nn-1) of the output cycle determined for the identifiable information as information about the period.
- the period for each of a plurality of subframes may be obtained.
- the information about the period input to the coefficient determination unit 24 is information that specifies the period obtained from all or part of the input signal of the current frame and / or the input signal of a frame near the current frame. That is, the period used for determining the coefficient w O (i) is a period obtained from all or part of the input signal of the current frame and / or the input signal of the frame near the current frame.
- the coefficient determination unit 24 has a period corresponding to the information about the period in all or a part of a possible range of the period corresponding to the information about the period for all or part of the orders from the 0th order to the P max order. Larger values are determined as coefficients w O (0), w O (1),..., W O (P max ). The coefficient determining unit 24 uses the value in the period positively correlated instead of the period, the coefficient a larger value period is large w O (0), w O (1), ..., w O ( P max ) may be determined.
- the coefficient determination unit 24 determines the coefficient w O (i) using, for example, a monotonic non-decreasing function for the period corresponding to the information about the input period.
- the coefficient w O (i) is determined by the following equation (7).
- T is a period corresponding to information about the input period.
- the coefficient w O (i) is determined by the following equation (8) using ⁇ which is a predetermined value larger than 0.
- ⁇ is a value for adjusting the width of the lag window when the coefficient w O (i) is regarded as the lag window, in other words, the strength of the lag window.
- the predetermined ⁇ is obtained by encoding and decoding a speech signal or an acoustic signal with a coding device including the linear prediction analysis device 2 and a decoding device corresponding to the coding device for a plurality of candidate values of ⁇ , What is necessary is just to determine by selecting as a candidate value with favorable subjective quality and objective quality of a signal and a decoding acoustic signal as (alpha).
- the coefficient w O (i) is determined by the following equation (8A) using a predetermined function f (T) for the period T.
- the formula for determining the coefficient w O (i) using the period T is not limited to the above formulas (7), (8), (8A), and is an increase in a value that is negatively correlated with the fundamental frequency. Any other expression may be used as long as it can describe a monotonous non-decreasing relationship.
- the coefficient w O (i) may be monotonously increased with an increase in a value that is negatively correlated with the fundamental frequency only for at least some orders i, not for each i of 0 ⁇ i ⁇ P max . In other words, depending on the order i, the magnitude of the coefficient w O (i) may not increase monotonously with an increase in the value that is negatively correlated with the fundamental frequency.
- the coefficient w corresponding to the order i for at least a part of the prediction orders i according to a value that is negatively correlated with the fundamental frequency.
- the magnitude of O (i) monotonically increases as the value negatively correlates with the fundamental frequency of the signal interval including all or part of the input signal X O (n) of the current frame.
- Possible coefficients can be found, It is possible to achieve high linear prediction of analytical precision than come. Therefore, the decoded speech signal and decoding obtained by encoding and decoding the speech signal and the acoustic signal with the encoding device including the linear prediction analysis device 2 of the modification of the first embodiment and the decoding device corresponding to the encoding device.
- the quality of the acoustic signal is determined based on the decoded speech signal and the decoded acoustic signal obtained by encoding and decoding the speech signal and the acoustic signal with the encoding device including the conventional linear prediction analysis device and the decoding device corresponding to the encoding device. Better than quality.
- FIG. 9 shows experimental results of MOS evaluation experiments using 24 audio-acoustic signal sources and 24 subjects.
- the six MOS values of “conventional method” and “cutA” in FIG. 9 include the encoding devices for each bit rate described in FIG. 9 including the conventional linear prediction analysis device and the decoding devices corresponding to those encoding devices.
- the MOS value for the decoded audio signal and the decoded audio signal obtained by encoding and decoding the audio / acoustic signal source.
- the six MOS values of “proposed method” and “cutB” in FIG. 9 are included in the encoding devices of the respective bit rates described in FIG. 9 including the linear prediction analysis device of the modification of the first embodiment and those encoding devices.
- a value having a positive correlation with the fundamental frequency or a value having a negative correlation with the fundamental frequency is compared with a predetermined threshold, and the coefficient w O (i) is determined according to the comparison result.
- the second embodiment differs from the first embodiment only in the method of determining the coefficient w O (i) in the coefficient determination unit 24, and is the same as the first embodiment in other points. The following description will focus on the parts that are different from the first embodiment, and redundant description of the same parts as in the first embodiment will be omitted.
- the functional configuration of the linear prediction analysis apparatus 2 according to the second embodiment and the flowchart of the linear prediction analysis method performed by the linear prediction analysis apparatus 2 are the same as those in the first embodiment shown in FIGS.
- the linear prediction analysis apparatus 2 according to the second embodiment is the same as the linear prediction analysis apparatus 2 according to the first embodiment except for a portion where the processing of the coefficient determination unit 24 is different.
- FIG. 1 An example of the processing flow of the coefficient determination unit 24 of the second embodiment is shown in FIG.
- the coefficient determination unit 24 of the second embodiment performs, for example, the processing of each step S41A, step S42, and step S43 in FIG.
- the coefficient determination unit 24 compares a predetermined threshold value with a value that is positively correlated with the fundamental frequency corresponding to the input fundamental frequency information (step S41A).
- the value having a positive correlation with the fundamental frequency corresponding to the input fundamental frequency information is, for example, the fundamental frequency itself corresponding to the input fundamental frequency information.
- the coefficient determination unit 24 determines the coefficient w l (i) according to a predetermined rule when a value that is positively correlated with the fundamental frequency is not equal to or greater than a predetermined threshold, that is, when the fundamental frequency is determined to be low.
- w h (i) and w l (i) are determined so as to satisfy the relationship w h (i) ⁇ w l (i) for at least a part of each i.
- w h (i) and w l (i) it is, for each of at least some i w h (i) ⁇ w l satisfies the relation (i), for the other i w h (i) ⁇ w l (i) is determined so as to satisfy the relationship.
- at least a part of each i is, for example, i other than 0 (that is, 1 ⁇ i ⁇ P max ).
- w h (i) and w l (i) are obtained by calculating w O (i) as w h (i) when the fundamental frequency P is P1 in equation (1), and by using equation (1). It is determined according to a predetermined rule of determining w O (i) as w l (i) when P is P2 (where P1> P2). Further, for example, w h (i) and w l (i) obtains equation (2) w O when ⁇ is ⁇ 1 at (i) a w h (i), the ⁇ in Equation (2) It is determined according to a predetermined rule that w O (i) when ⁇ 2 (where ⁇ 1> ⁇ 2) is determined as w l (i).
- both ⁇ 1 and ⁇ 2 are predetermined in the same manner as ⁇ in the equation (2).
- w h (i) and w l (i) obtained in advance by any of these rules are stored in a table, and whether a value having a positive correlation with the fundamental frequency is equal to or greater than a predetermined threshold value. Therefore, either w h (i) or w l (i) may be selected from the table. Also, each of w h (i) and w l (i), as w h i is increased (i), is determined as the value of w l (i) is reduced.
- a coefficient that can be converted into a linear prediction coefficient that suppresses occurrence of a spectrum peak due to a pitch component even when the fundamental frequency of the input signal is high is obtained.
- a value that is negatively correlated with the fundamental frequency is compared with a predetermined threshold value instead of a value that is positively correlated with the fundamental frequency, and a coefficient is determined according to the comparison result.
- w O (i) is determined.
- the predetermined threshold value in the first modification of the second embodiment is different from the predetermined threshold value compared with the value having a positive correlation with the fundamental frequency in the second embodiment.
- the functional configuration and flowchart of the linear prediction analysis apparatus 2 of the first modification of the second embodiment are the same as FIGS. 1 and 2 as the modification of the first embodiment.
- the linear prediction analysis apparatus 2 of the first modification example of the second embodiment is the same as the linear prediction analysis apparatus 2 of the modification example of the first embodiment, except that the processing of the coefficient determination unit 24 is different.
- FIG. 4 shows an example of the processing flow of the coefficient determination unit 24 of the first modification of the second embodiment.
- the coefficient determination unit 24 according to the first modification of the second embodiment performs, for example, the processes of step S41B, step S42, and step S43 in FIG.
- the coefficient determination unit 24 compares a value that is negatively correlated with the fundamental frequency corresponding to the information about the input period with a predetermined threshold (step S41B).
- the value having a negative correlation with the fundamental frequency corresponding to the information about the input period is, for example, the period corresponding to the information about the input period.
- w h (i) and w l (i) are determined so as to satisfy the relationship w h (i) ⁇ w l (i) for at least a part of i.
- w h (i) and w l (i) satisfy the relationship w h (i) ⁇ w l (i) for at least some i, and w h (i) ⁇ w for other i.
- l Determine to satisfy the relationship (i).
- at least a part of i is, for example, i other than 0 (that is, 1 ⁇ i ⁇ P max ).
- w h (i) and w l (i) are obtained by calculating w O (i) as w h (i) when period T is T1 in equation (7), and period T is calculated in equation (7). It is determined according to a predetermined rule that w O (i) when T2 (where T1 ⁇ T2) is determined as w l (i). Further, for example, w h (i) and w l (i) obtains equation (8) w O when ⁇ is ⁇ 1 at (i) a w h (i), the ⁇ in equation (8) It is determined according to a predetermined rule that w O (i) when ⁇ 2 (where ⁇ 1 ⁇ 2) is determined as w l (i).
- both ⁇ 1 and ⁇ 2 are predetermined in the same manner as ⁇ in the equation (8).
- w h (i) and w l (i) obtained in advance by any of these rules are stored in a table, and whether or not a value having a negative correlation with the fundamental frequency is equal to or less than a predetermined threshold value.
- w h (i) or w l (i) may be selected from the table.
- each of w h (i) and w l (i), as w h i is increased (i), is determined as the value of w l (i) is reduced.
- linear prediction that suppresses the occurrence of spectral peaks caused by pitch components even when the fundamental frequency of the input signal is high. Coefficients that can be converted into coefficients can be obtained, and coefficients that can be converted into linear prediction coefficients that can represent the spectral envelope even when the fundamental frequency of the input signal is low, can be obtained, and are analyzed more than before Highly accurate linear prediction can be realized.
- the coefficient w O (i) is determined using one threshold value, but in the second modification of the second embodiment, the coefficient w O (i) is determined using two or more threshold values. Is.
- a method for determining a coefficient using two threshold values th1 ′ and th2 ′ will be described as an example. It is assumed that the thresholds th1 ′ and th2 ′ satisfy the relationship 0 ⁇ th1 ′ ⁇ th2 ′.
- the functional configuration of the linear prediction analysis apparatus 2 of the second modification of the second embodiment is the same as that of the second embodiment in FIG.
- the linear prediction analysis apparatus 2 of the second modification of the second embodiment is the same as the linear prediction analysis apparatus 2 of the second embodiment, except for the part where the processing of the coefficient determination unit 24 is different.
- the coefficient determination unit 24 compares a value having a positive correlation with the fundamental frequency corresponding to the input fundamental frequency information with the thresholds th1 ′ and th2 ′.
- the value having a positive correlation with the fundamental frequency corresponding to the input fundamental frequency information is, for example, the fundamental frequency itself corresponding to the input fundamental frequency information.
- the coefficient determination unit 24 uses a predetermined rule.
- w h (i), w m (i), and w l (i) satisfy the relationship of w h (i) ⁇ w m (i) ⁇ w l (i) for at least a part of each i.
- Shall be determined as follows.
- at least a part of each i is, for example, each i other than 0 (that is, 1 ⁇ i ⁇ P max ).
- w h (i), w m (i), and w l (i) are w h (i) ⁇ w m (i) ⁇ w l (i) at least for each i, and other i W h (i) ⁇ w m (i) ⁇ w l (i) for at least a part of each i, w h (i) ⁇ w m (i) ⁇ w l for at least a part of each i Decide to satisfy the relationship (i).
- w h (i), w m (i), and w l (i) are obtained by calculating w O (i) as w h (i) when the fundamental frequency P is P1 in equation (1).
- w O (i) when the fundamental frequency P is P2 (where P1> P2) is obtained as w m (i), and in equation (1), the fundamental frequency P is P3 (where P2> P3) It is determined according to a predetermined rule that w O (i) at a given time is determined as w l (i). Further, for example, w h (i), w m (i), and w l (i) are obtained by calculating w O (i) as w h (i) when ⁇ is ⁇ 1 in equation (2). In step (2), w O (i) when ⁇ is ⁇ 2 (where ⁇ 1> ⁇ 2) is obtained as w m (i).
- O (i) is determined as w l (i).
- ⁇ 1, ⁇ 2, and ⁇ 3 are determined in advance in the same manner as ⁇ in Expression (2).
- w h (i), w m (i), and w l (i) obtained in advance by any of these rules are stored in a table, and a value that is positively correlated with the fundamental frequency and a predetermined value are stored.
- One of w h (i), w m (i), and w l (i) may be selected from the table by comparison with a threshold value.
- the coefficient w m (i) between them may be determined using w h (i) and w l (i).
- w h (i), w m (i), and w l (i) are such that the values of w h (i), w m (i), and w l (i) decrease as i increases. It is determined.
- the fundamental frequency of the input signal is high, it is converted into a linear prediction coefficient that suppresses the occurrence of a spectrum peak due to the pitch component. Possible coefficients can be obtained, and even when the fundamental frequency of the input signal is low, coefficients that can be converted into linear prediction coefficients that can express the spectral envelope can be obtained, and analysis accuracy is higher than before Linear prediction can be realized.
- the coefficient w O (i) is determined using one threshold value.
- the coefficient w O ( i) is determined.
- a method for determining a coefficient using two threshold values th1 and th2 will be described as an example. It is assumed that the thresholds th1 and th2 satisfy the relationship 0 ⁇ th1 ⁇ th2.
- the functional configuration of the linear predictive analyzer 2 of the third modification of the second embodiment is the same as that of the first modification of the second embodiment in FIG.
- the linear prediction analysis apparatus 2 of the third modification example of the second embodiment is the same as the linear prediction analysis apparatus 2 of the first modification example of the second embodiment, except for the part where the processing of the coefficient determination unit 24 is different.
- the coefficient determination unit 24 compares the threshold frequency th1 and ⁇ th2 with a value that is negatively correlated with the fundamental frequency corresponding to the information about the input period.
- the value having a negative correlation with the fundamental frequency corresponding to the information about the input period is, for example, the period corresponding to the information about the input period.
- the coefficient determination unit 24 determines the coefficient w l (i) according to a predetermined rule when the value that is negatively correlated with the fundamental frequency is equal to or greater than the threshold th2, that is, when the period is determined to be long.
- w h (i), w m (i), and w l (i) satisfy the relationship w h (i) ⁇ w m (i) ⁇ w l (i) for at least a part of each i.
- Shall be determined as follows.
- at least a part of each i is, for example, each i other than 0 (that is, 1 ⁇ i ⁇ P max ).
- w h (i), w m (i), and w l (i) are w h (i) ⁇ w m (i) ⁇ w l (i) for at least a part of each i, and other i W h (i) ⁇ w m (i) ⁇ w l (i) for at least a part of each i of w, and w h (i) ⁇ w m (i) ⁇ w l (i) for each remaining i To satisfy the relationship.
- w h (i), w m (i), and w l (i) are obtained by calculating w O (i) as w h (i) when period T is T1 in equation (7),
- w O (i) when period T is T2 is obtained as w m (i).
- period T is T3 (where T2 ⁇ T3)
- w O (i) is determined as w l (i).
- w h (i), w m (i), and w l (i) are obtained by calculating w O (i) as w h (i) when ⁇ is ⁇ 1 in equation (8).
- w h (i), w m (i), and w l (i) may be selected from the table by comparison with a threshold value.
- ⁇ is 0 ⁇ ⁇ ⁇ 1, and when the period T takes a small value, the value of ⁇ also decreases, and when the period T takes a large value, the function ⁇ increases. This is a value obtained from the period T by (T).
- w h (i), w m (i), and w l (i) are such that the values of w h (i), w m (i), and w l (i) decrease as i increases. It is determined.
- the occurrence of a spectrum peak due to the pitch component is suppressed even when the fundamental frequency of the input signal is high.
- a coefficient that can be converted into a linear prediction coefficient can be obtained, and a coefficient that can be converted into a linear prediction coefficient that can represent a spectral envelope even when the fundamental frequency of the input signal is low. Can also realize linear prediction with high analysis accuracy.
- the coefficient w O (i) is determined using a plurality of coefficient tables.
- the third embodiment is different from the first embodiment only in the method of determining the coefficient w O (i) in the coefficient determination unit 24, and is the same as the first embodiment in other points.
- the following description will focus on the parts that are different from the first embodiment, and redundant description of the same parts as in the first embodiment will be omitted.
- the linear prediction analysis apparatus 2 according to the third embodiment is different in the processing of the coefficient determination unit 24, and as illustrated in FIG. 5, the linear prediction according to the first embodiment is performed except for a part further including a coefficient table storage unit 25. This is the same as the analyzer 2.
- the coefficient table storage unit 25 stores two or more coefficient tables.
- FIG. 6 shows an example of the processing flow of the coefficient determination unit 24 of the third embodiment.
- the coefficient determination unit 24 according to the third embodiment performs, for example, the processes in steps S44 and S45 in FIG.
- the coefficient determination unit 24 has a value that is positively correlated with the fundamental frequency corresponding to information about the input fundamental frequency or a value that is negatively correlated with the fundamental frequency corresponding to information about the input period. From the two or more coefficient tables stored in the coefficient table storage unit 25, one coefficient corresponding to a value having a positive correlation with the fundamental frequency or a value having a negative correlation with the fundamental frequency Table t is selected (step S44). For example, a value that is positively correlated with the fundamental frequency corresponding to the information about the fundamental frequency is the fundamental frequency corresponding to the information about the fundamental frequency, and is negative with respect to the fundamental frequency corresponding to the information about the input period. The correlated value is a period corresponding to the information about the input period.
- the coefficient determination unit 24 selects the coefficient table t0 as the coefficient table t if the value having a positive correlation with the fundamental frequency is equal to or greater than a predetermined threshold, and otherwise selects the coefficient table t1 as the coefficient table t. Choose as. That is, if the value that is positively correlated with the fundamental frequency is greater than or equal to a predetermined threshold, that is, if it is determined that the fundamental frequency is high, select the coefficient table with the smaller coefficient for each i, If the value having a positive correlation with the fundamental frequency is not equal to or greater than the predetermined threshold value, that is, if it is determined that the fundamental frequency is low, the coefficient table with the larger coefficient for each i is selected.
- the coefficient table selected by the coefficient determination unit 24 when the value that is positively correlated with the fundamental frequency in the two coefficient tables stored in the coefficient table storage unit 25 is the first value.
- the value that is positively correlated with the fundamental frequency in the two coefficient tables stored in the coefficient table storage unit 25 is a second value that is smaller than the first value.
- the coefficient table selected by the coefficient determination unit 24 is a second coefficient table, and the magnitude of the coefficient corresponding to each order i in the second coefficient table is at least a part of each order i in the first coefficient table. It is larger than the magnitude of the coefficient corresponding to each order i.
- the coefficient determination unit 24 selects the coefficient table t0 as the coefficient table t if the value negatively correlated with the fundamental frequency is equal to or smaller than the predetermined threshold value, and otherwise sets the coefficient table t1 as the coefficient table t. select. That is, when a value that is negatively correlated with the fundamental frequency is equal to or less than a predetermined threshold, that is, when it is determined that the cycle is short, a coefficient table with a smaller coefficient for each i is selected, and the fundamental If the value that is negatively correlated with the frequency is not less than or equal to the predetermined threshold value, that is, if it is determined that the period is long, the coefficient table with the larger coefficient for each i is selected.
- the coefficient table selected by the coefficient determination unit 24 when the value negatively correlated with the fundamental frequency in the two coefficient tables stored in the coefficient table storage unit 25 is the first value.
- the value that is negatively correlated with the fundamental frequency in the two coefficient tables stored in the coefficient table storage unit 25 is a second value that is greater than the first value.
- the coefficient table selected by the coefficient determination unit 24 is a second coefficient table, and the magnitude of the coefficient corresponding to each order i in the second coefficient table is at least a part of each order i in the first coefficient table. It is larger than the coefficient size of each order i.
- the coefficient determination unit 24 (1) If the value positively correlated with the fundamental frequency> th2 ', that is, if the fundamental frequency is determined to be high, select the coefficient table t0 as the coefficient table t, (2) When th2 ′ ⁇ a value positively correlated with the fundamental frequency> th1 ′, that is, when it is determined that the fundamental frequency is medium, the coefficient table t1 is selected as the coefficient table t, (3) When th1 ′ ⁇ a value having a positive correlation with the fundamental frequency, that is, when it is determined that the fundamental frequency is low, the coefficient table t2 is selected as the coefficient table t.
- the coefficient determination unit 24 (1) When the value negatively correlated with the fundamental frequency ⁇ th2, that is, when it is determined that the period is long, the coefficient table t2 is selected as the coefficient table t, (2) When th2> value negatively correlated with the fundamental frequency ⁇ th1, that is, when it is determined that the period is medium, the coefficient table t1 is selected as the coefficient table t, (3) If th1> a value that is negatively correlated with the fundamental frequency, that is, if it is determined that the period is short, the coefficient table t0 is selected as the coefficient table t.
- a quantized value of a period is used as a value having a negative correlation with the fundamental frequency, and a coefficient table t is selected according to the quantized value of this period.
- the period T obtained by the period calculation unit 940 is input as to a predetermined positive integer to be satisfied.
- a cycle T that is information about the cycle is input to the coefficient determination unit 24.
- the period T is included in a range of 29 ⁇ T ⁇ 231.
- the coefficient determination unit 24 obtains the index D from the period T specified by the input information about the period T by the following equation (17).
- This index D is a value that has a negative correlation with the fundamental frequency, and corresponds to the quantized value of the period.
- D int (T / 110 + 0.5) (17)
- FIG. 7 is an example of a diagram showing the relationship between the cycle T, the index D, and the cycle quantization value T ′.
- the horizontal axis in FIG. 7 is the period T, and the vertical axis is the quantized value T ′ of the period.
- the period quantization value T ′ D ⁇ 110. Since the period T is 29 ⁇ T ⁇ 231, the index D has a value of 0, 1, or 2.
- w t0 (i) [1.0, 0.999566371, 0.998266613, 0.996104103, 0.993084457, 0.989215493, 0.984507263, 0.978971839, 0.972623467, 0.96547842, 0.957554817, 0.948872864, 0.939454317, 0.929322779, 0.918503404, 0.907022834, 0.894909143]
- w t1 (i) [1.0, 0.999706, 0.998824, 0.997356, 0.995304, 0.992673, 0.989466, 0.985689, 0.98135, 0.976455, 0.971012, 0.965032, 0.958525, 0.951502, 0.943975, 0.935956, 0.927460]
- w t2 (i) [1.0, 0.999926, 0.999706, 0.999338, 0.998824, 0.998163, 0.997356, 0.996403, 0.995304, 0.99406, 0.992672, 0.99114, 0.989465, 0.987647, 0.985688, 0.983588, 0.981348]
- FIG. 8 is a graph showing the magnitudes of the coefficients w t0 (i), w t1 (i), and w t2 (i) of the coefficient table for each i.
- the horizontal axis in FIG. 8 represents the order i
- the vertical axis in FIG. 8 represents the magnitude of the coefficient.
- the coefficient size monotonously decreases as the value of i increases in each coefficient table.
- the relationship of w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i) is satisfied for i ⁇ 1. Yes.
- the coefficient size monotonously increases.
- the plurality of coefficient tables stored in the coefficient table storage unit 25 are not limited to the above example as long as they have such a relationship.
- i 0, it is not necessary to satisfy the relationship of w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i), and w t0 (0), w t1 (0), w t2 (0) does not necessarily have the same value.
- w t0 (0) 1.0001
- w t1 (0) 1.0
- w t2 (0) 1.0 as in
- w t0 (0) only for i 0, w t1 (0 )
- w t2 (0 ) May not satisfy the relationship of w t0 (i) ⁇ w t1 (i) ⁇ w t2 (i).
- the coefficient determination unit 24 selects the coefficient table tD corresponding to the index D as the coefficient table t.
- each coefficient table t0, t1, t2 is associated with the index D, but each coefficient table t0, t1, t2 is a value other than the value or index D that is positively correlated with the fundamental frequency. It may be associated with a value having a negative correlation with the fundamental frequency.
- the coefficient stored in any one of the plurality of coefficient tables is determined as the coefficient w O (i), but the modified example of the third embodiment additionally includes a plurality of coefficients. This includes the case where the coefficient w O (i) is determined by the arithmetic processing based on the coefficient stored in the table.
- the functional configuration of the linear prediction analysis apparatus 2 of the modification of the third embodiment is the same as that of the third embodiment in FIG.
- the linear prediction analysis apparatus 2 of the third embodiment is different from the linear prediction analysis apparatus 2 of the third embodiment except that the processing of the coefficient determination unit 24 is different and the coefficient table included in the coefficient table storage unit 25 is different. Is the same.
- each coefficient w t0 (i) of the coefficient table t0 is converted to the coefficient w O (i ) (2)
- th2 ′ ⁇ a value positively correlated with the fundamental frequency> th1 ′ that is, when it is determined that the fundamental frequency is medium
- ⁇ ′ is 0 ⁇ ⁇ ′ ⁇ 1, and when the fundamental frequency P takes a small value, the value of ⁇ ′ also becomes small, and when the fundamental frequency P takes a large value, the value of ⁇ ′ also becomes large.
- each coefficient w t2 (i) of the coefficient table t2 is set as a coefficient w O (i).
- each coefficient w t0 (i) of the coefficient table t0 and the coefficient table Using each coefficient w t2 (i) of t2, the coefficient w O (i) is determined by w O (i) (1- ⁇ ) ⁇ w t0 (i) + ⁇ ⁇ w t2 (i), (3) If th1> a value that is negatively correlated with the fundamental frequency, that is, if it is determined that the period is small, each coefficient w t0 (i) in the coefficient table t0 is set as a coefficient w O (i). select.
- T is a value obtained from the period T.
- the value close to w t0 (i) can be used as the coefficient w O (i), while the period is medium. since out when the period T is large can be w t2 coefficient value close to (i) w O (i) , only two tables, it is possible to obtain three or more coefficients w O (i) .
- the coefficient multiplier 22 is not included, and the coefficient w O (i) and the autocorrelation R O (i) are calculated in the prediction coefficient calculator 23. May be used to perform linear prediction analysis.
- 10 and 11 are configuration examples of the linear prediction analysis apparatus 2 corresponding to FIGS. 1 and 5, respectively.
- the prediction coefficient calculation unit 23 uses a modified autocorrelation R ′ O (i) obtained by multiplying the coefficient w O (i) and the autocorrelation R O (i). Instead, linear prediction analysis is performed by directly using the coefficient w O (i) and the autocorrelation R O (i) (step S5).
- a linear prediction analysis is performed on an input signal X O (n) using a conventional linear prediction analysis apparatus, and a fundamental frequency is obtained by a fundamental frequency calculation unit using a result of the linear prediction analysis.
- the coefficient w O (i) based on the obtained fundamental frequency is used to obtain a coefficient that can be converted into a linear prediction coefficient by the linear prediction analysis apparatus of the present invention.
- the linear prediction analysis apparatus 3 of the fourth embodiment includes a first linear prediction analysis unit 31, a linear prediction residual calculation unit 32, a fundamental frequency calculation unit 33, and a second linear prediction analysis unit 34, for example. I have.
- Linear prediction residual calculation unit 32 performs filtering equivalent to or similar to linear prediction based on coefficients that can be converted into linear prediction coefficients from the first order to the P max order with respect to the input signal X O (n). Processing is performed to obtain a linear prediction residual signal X R (n). Since the filtering process can also be called a weighting process, the linear prediction residual signal X R (n) can also be said to be a weighted input signal.
- the fundamental frequency calculator 33 obtains the fundamental frequency P of the linear prediction residual signal X R (n) and outputs information about the fundamental frequency.
- P s1 is a fundamental frequency of the M sub-frames constituting the current frame, ..., a maximum value max (P s1, ..., P sM) of the P sM can identify Is output as information about the fundamental frequency.
- the second linear prediction analysis unit 34 includes the linear prediction analysis device 2 according to the first embodiment to the third embodiment, the linear prediction analysis device 2 according to the second modification of the second embodiment, and the linearity of the modification according to the third embodiment.
- linear prediction analysis is performed on the input signal X O (n) using a conventional linear prediction analysis apparatus, and the period is obtained by the period calculation unit using the result of the linear prediction analysis.
- the coefficient w O (i) based on the obtained period is used to obtain a coefficient that can be converted into a linear prediction coefficient by the linear prediction analysis apparatus of the present invention.
- the linear prediction analysis apparatus 3 of the modification of the fourth embodiment includes a first linear prediction analysis unit 31, a linear prediction residual calculation unit 32, a period calculation unit 35, and a second linear prediction analysis unit 34.
- the first linear prediction analysis unit 31 and the linear prediction residual calculation unit 32 of the linear prediction analysis device 3 of the modification of the fourth embodiment are the same as the linear prediction analysis device 3 of the fourth embodiment, respectively.
- a description will be given centering on differences from the fourth embodiment.
- T s1 is the period of M sub-frames constituting the current frame, ..., the minimum value min (T s1 ..., T sM ) can identify the information in the T sM Output as information about the period.
- the second linear prediction analysis unit 34 of the modification of the fourth embodiment includes the linear prediction analysis apparatus 2 of the modification of the first embodiment, the linear prediction analysis apparatus 2 of the first modification of the second embodiment, and the second implementation.
- ⁇ Values that are positively correlated with the fundamental frequency> As described in the second specific example of the fundamental frequency calculation unit 930 in the first embodiment, a sample part that is pre-read and used as a look-ahead in the signal processing of the previous frame as a value having a positive correlation with the fundamental frequency. Of these, the fundamental frequency of the portion corresponding to the sample of the current frame may be used.
- an estimated value of the fundamental frequency may be used as a value that has a positive correlation with the fundamental frequency.
- the estimated value of the fundamental frequency for the current frame predicted from the fundamental frequency of the past multiple frames, and the average, minimum, or maximum value of the fundamental frequency for the past multiple frames are used as the estimated fundamental frequency. It may be used. Further, an average value, a minimum value, or a maximum value of the fundamental frequency for a plurality of subframes may be used as the estimated value of the fundamental frequency.
- the quantized value of the fundamental frequency may be used as a value that has a positive correlation with the fundamental frequency. That is, the fundamental frequency before quantization may be used, or the fundamental frequency after quantization may be used.
- the fundamental frequency for any analyzed channel may be used.
- ⁇ Values that are negatively correlated with the fundamental frequency As described as specific example 2 of the period calculation unit 940 in the first embodiment, as a value having a negative correlation with the fundamental frequency, a sample part that is pre-read and used in the signal processing of the previous frame is also used. Of these, the period of the portion corresponding to the sample of the current frame may be used.
- an estimated value of the period may be used as a value that is negatively correlated with the fundamental frequency.
- the estimated value of the period for the current frame predicted from the fundamental frequency of a plurality of past frames, or the average value, the minimum value, or the maximum value of the period for a plurality of past frames may be used as the estimated value of the period.
- an average value, minimum value, or maximum value of the periods for a plurality of subframes may be used as an estimated value of the fundamental frequency.
- an estimated value of the period of the current frame predicted by the portion corresponding to the sample of the current frame among the sample portions used by prefetching which is also referred to as look-ahead, may be used as the basic frequency of a plurality of frames in the past.
- the average value, minimum value, or maximum value for the portion corresponding to the sample of the current frame, among the sample portions that are used by pre-reading which is also called look-ahead, may be used as the estimated value. Good.
- the quantized value of the period may be used as a value that is negatively correlated with the fundamental frequency. That is, the period before quantization may be used, or the period after quantization may be used.
- the period for any analyzed channel may be used.
- the threshold value may be set to be divided into one of two cases adjacent to each other. That is, a case where the threshold value is greater than or equal to a certain threshold value may be a case where the threshold value is greater than the threshold value, and a case where the value is smaller than the threshold value may be the case where the threshold value is equal to or less than the threshold value.
- a case where the value is greater than a certain threshold value may be a case where the value is equal to or greater than the threshold value, and a case where the value is equal to or less than the threshold value may be defined as a case where the value is smaller than the threshold value.
- each step in the linear prediction analysis method is realized by a computer, the processing contents of the functions that the linear prediction analysis method should have are described by a program. And each step is implement
- the program describing the processing contents can be recorded on a computer-readable recording medium.
- a computer-readable recording medium any recording medium such as a magnetic recording device, an optical disk, a magneto-optical recording medium, and a semiconductor memory may be used.
- each processing means may be configured by executing a predetermined program on a computer, or at least a part of these processing contents may be realized by hardware.
Abstract
Description
線形予測分析装置1の自己相関計算部11は、入力信号XO(n)から自己相関RO(i) (i=0,1,…,Pmax)を式(11)により求める。Pmaxは、N未満の所定の正の整数である。
次に、係数乗算部12が、自己相関RO(i)に予め定めた係数wO(i) (i=0,1,…,Pmax)を同じiごとに乗じることにより、変形自己相関R'O(i) (i=0,1,…,Pmax)を求める。すなわち、変形自己相関R'O(i)は式(12)により求める。
そして、予測係数計算部13が、R'O(i)を用いて、例えばLevinson-Durbin法などにより、1次から予め定めた最大次数であるPmax次までの線形予測係数に変換可能な係数を求める。線形予測係数に変換可能な係数とは、PARCOR係数KO(1),KO(2),…,KO(Pmax)や線形予測係数aO(1),aO(2),…,aO(Pmax)等である。
この発明の一態様による線形予測分析方法は、入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i)(i=0, 1, …, Pmax)を計算する自己相関計算ステップと、係数wO(i) (i=0, 1, …, Pmax)と自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R'O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、少なくとも一部の各次数iに対して、各次数iに対応する係数wO(i)が、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値の増加とともに単調減少する関係にある場合が含まれている。
第一実施形態の線形予測分析装置2は、図1に示すように、自己相関計算部21、係数決定部24、係数乗算部22及び予測係数計算部23を例えば備えている。自己相関計算部21、係数乗算部22及び予測係数計算部23の動作は、従来の線形予測分析装置1の自己相関計算部11、係数乗算部12及び予測係数計算部13における動作とそれぞれ同じである。
基本周波数計算部930は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)および/または現フレームの近傍のフレームの入力信号の全部または一部から基本周波数Pを求める。基本周波数計算部930は、例えば、現フレームの入力信号XO(n) (n=0, 1, …, N-1)の全部または一部を含む信号区間のディジタル音声信号やディジタル音響信号の基本周波数Pを求め、基本周波数Pを特定可能な情報を基本周波数についての情報として出力する。基本周波数を求める方法としては、様々な公知の方法が存在するので、公知の何れの方法を用いてもよい。また、求めた基本周波数Pを符号化して基本周波数符号を得る構成とし、基本周波数符号を基本周波数についての情報として出力してもよい。さらに基本周波数符号に対応する基本周波数の量子化値^Pを得る構成とし、基本周波数の量子化値^Pを基本周波数についての情報として出力してもよい。以下、基本周波数計算部930の具体例について説明する。
基本周波数計算部930の具体例1は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)が複数個のサブフレームで構成されている場合、かつ、同一のフレームについては線形予測分析装置2よりも先に基本周波数計算部930が動作される場合、の例である。基本周波数計算部930は、まず、2以上の整数であるM個のサブフレームであるXOs1(n) (n=0, 1, …, N/M-1), …, XOsM(n)(n= (M-1)N/M, (M-1)N/M+1, …, N-1)のそれぞれの基本周波数であるPs1, …, PsMを求める。NはMで割り切れるとする。基本周波数計算部930は、現フレームを構成するM個のサブフレームの基本周波数であるPs1, …, PsMのうちの最大値max(Ps1, …, PsM)を特定可能な情報を基本周波数についての情報として出力する。
基本周波数計算部930の具体例2は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)と1つ後のフレームの一部の入力信号XO(n) (n=N, N+1, …, N+Nn-1)(ただし、Nnは、Nn<Nという関係を満たす所定の正の整数。)とで、先読み部分を含む信号区間が現フレームの信号区間として構成されている場合であり、かつ、同一のフレームについては線形予測分析装置2よりも後に基本周波数計算部930が動作される場合、の例である。基本周波数計算部930は、現フレームの信号区間について、現フレームの入力信号XO(n) (n=0, 1, …, N-1)と1つ後のフレームの一部の入力信号XO(n) (n=N, N+1, …, N+Nn-1)のそれぞれの基本周波数であるPnow, Pnextを求め、基本周波数Pnextを基本周波数計算部930に記憶する。基本周波数計算部930は、また、1つ前のフレームの信号区間について求めて基本周波数計算部930に記憶されていた基本周波数Pnext、すなわち、1つ前のフレームの信号区間のうちの現フレームの一部の入力信号XO(n) (n=0, 1, …, Nn-1)について求めた基本周波数、を特定可能な情報を基本周波数についての情報として出力する。なお、具体例1と同様に、現フレームについては複数のサブフレームごとの基本周波数を求めてもよい。
基本周波数計算部930の具体例3は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)そのものが現フレームの信号区間として構成されている場合であり、かつ、同一のフレームについては線形予測分析装置2よりも後に基本周波数計算部930が動作される場合、の例である。基本周波数計算部930は、現フレームの信号区間である現フレームの入力信号XO(n) (n=0, 1, …, N-1)の基本周波数Pを求め、基本周波数Pを基本周波数計算部930に記憶する。基本周波数計算部930は、また、1つ前のフレームの信号区間、すなわち、1つ前のフレームの入力信号XO(n) (n=-N, -N+1, …, -1)について求めて基本周波数計算部930に記憶されていた基本周波数Pを特定可能な情報を基本周波数についての情報として出力する。
自己相関計算部21は、入力されたNサンプルのフレーム毎の時間領域のディジタル音声信号やディジタル音響信号である入力信号XO(n)(n=0,1,…,N-1)から自己相関RO(i) (i=0,1,…,Pmax)を計算する(ステップS1)。Pmaxは、予測係数計算部23が求める線形予測係数に変換可能な係数の最大次数であり、N以下の所定の正の整数である。計算された自己相関RO(i) (i=0,1,…,Pmax)は、係数乗算部22に提供される。
係数決定部24は、入力された基本周波数についての情報を用いて、係数wO(i) (i=0,1,…,Pmax)を決定する(ステップS4)。係数wO(i)は、自己相関RO(i)を変形して変形自己相関R'O(i)を得るための係数である。係数wO(i)は、信号処理の分野においては、ラグ窓wO(i)又はラグ窓係数wO(i)とも呼ばれているものである。係数wO(i)は正の値であるので、係数wO(i)が所定の値よりも大きい/小さいことを、係数wO(i)の大きさが所定の値よりも大きい/小さいと表現することがある。また、ラグ窓wO(i)の大きさとは、そのラグ窓wO(i)の値を意味するものとする。
係数乗算部22は、係数決定部24で決定した係数wO(i) (i=0,1,…,Pmax)と、自己相関計算部21で求めた自己相関RO(i) (i=0,1,…,Pmax)とを同じiごとに乗じることにより、変形自己相関R'O(i) (i=0,1,…,Pmax)を求める(ステップS2)。すなわち、係数乗算部22は、以下の式(15)により自己相関R'O(i)を計算する。計算された自己相関R'O(i)は、予測係数計算部23に提供される。
予測係数計算部23は、変形自己相関R'O(i)を用いて線形予測係数に変換可能な係数を求める(ステップS3)。
第一実施形態の変形例は、係数決定部24が、基本周波数と正の相関関係にある値ではなく、基本周波数と負の相関関係にある値に基づいて係数wO(i)を決定するものである。基本周波数と負の相関関係にある値とは、例えば周期、周期の推定値又は周期の量子化値である。例えば、周期T、基本周波数P、サンプリング周波数fsとすると、T=fs/Pとなるため、周期は基本周波数と負の相関関係にあるものである。基本周波数と負の相関関係にある値に基づいて係数wO(i)を決定する例を第一実施形態の変形例として説明する。
周期計算部940は、現フレームの入力信号XOおよび/または現フレームの近傍のフレームの入力信号の全部または一部から周期Tを求める。周期計算部940は、例えば、現フレームの入力信号XO(n)の全部または一部を含む信号区間のディジタル音声信号やディジタル音響信号の周期Tを求め、周期Tを特定可能な情報を周期についての情報として出力する。周期を求める方法としては、様々な公知の方法が存在するので、公知の何れの方法を用いてもよい。また、求めた周期Tを符号化して周期符号を得る構成とし、周期符号を周期についての情報として出力してもよい。さらに周期符号に対応する周期の量子化値^Tを得る構成とし、周期の量子化値^Tを周期についての情報として出力してもよい。以下、周期計算部940の具体例について説明する。
周期計算部940の具体例1は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)が複数個のサブフレームで構成されている場合、かつ、同一のフレームについては線形予測分析装置2よりも先に周期計算部940が動作される場合、の例である。周期計算部940は、まず、2以上の整数であるM個のサブフレームであるXOs1(n) (n=0, 1, …, N/M-1), …, XOsM(n)(n= (M-1)N/M, (M-1)N/M+1, …, N-1)のそれぞれの周期であるTs1, …, TsMを求める。NはMで割り切れるとする。周期計算部940は、現フレームを構成するM個のサブフレームの周期であるTs1, …, TsMのうちの最小値min(Ts1, …, TsM)を特定可能な情報を周期についての情報として出力する。
周期計算部940の具体例2は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)と1つ後のフレームの一部の入力信号XO(n) (n=N, N+1, …, N+Nn-1) (ただし、Nnは、Nn<Nという関係を満たす所定の正の整数。)とで、先読み部分を含む信号区間が現フレームの信号区間として構成されている場合であり、かつ、同一のフレームについては線形予測分析装置2よりも後に周期計算部940が動作される場合、の例である。周期計算部940は、現フレームの信号区間について、現フレームの入力信号XO(n) (n=0, 1, …, N-1)と1つ後のフレームの一部の入力信号XO(n) (n=N, N+1, …, N+Nn-1)のそれぞれの周期であるTnow, Tnextを求め、周期Tnextを周期計算部940に記憶する。周期計算部940は、また、1つ前のフレームの信号区間について求めて周期計算部940に記憶されていた周期Tnext、すなわち、1つ前のフレームの信号区間のうちの現フレームの一部の入力信号XO(n) (n=0, 1, …, Nn-1)について求めた周期、を特定可能な情報を周期についての情報として出力する。なお、具体例1と同様に、現フレームについては複数のサブフレームごとの周期を求めてもよい。
周期計算部940の具体例3は、現フレームの入力信号XO(n) (n=0, 1, …, N-1)そのものが現フレームの信号区間として構成されている場合であり、かつ、同一のフレームについては線形予測分析装置2よりも後に周期計算部940が動作される場合、の例である。周期計算部940は、現フレームの信号区間である現フレームの入力信号XO(n) (n=0, 1, …, N-1)の周期Tを求め、周期Tを周期計算部940に記憶する。周期計算部940は、また、1つ前のフレームの信号区間、すなわち、1つ前のフレームの入力信号XO(n) (n=-N, -N+1, …, -1)について求めて周期計算部940に記憶されていた周期Tを特定可能な情報を周期についての情報として出力する。
第一実施形態の変形例の線形予測分析装置2の係数決定部24は、入力された周期についての情報を用いて、係数wO(i) (i=0,1,…,Pmax)を決定する(ステップS4)。
言い換えれば、次数iによっては、係数wO(i)の大きさが基本周波数と負の相関関係にある値の増加とともに単調増加しなくてもよい。
図9は、24個の音声音響信号ソースと24人の被験者によるMOS評価実験の実験結果である。図9の「従来法」「cutA」の6つのMOS値は、従来の線形予測分析装置を含む図9に記載した各ビットレートの符号化装置とそれらの符号化装置に対応する復号装置とを用いて、音声音響信号ソースを符号化復号して得られた復号音声信号や復号音響信号に対するMOS値である。図9の「提案手法」「cutB」の6つのMOS値は、第一実施形態の変形例の線形予測分析装置を含む図9に記載した各ビットレートの符号化装置とそれらの符号化装置に対応する復号装置とを用いて、音声音響信号ソースを符号化復号して得られた復号音声信号や復号音響信号に対するMOS値である。図9の実験結果からも、本発明の線形予測分析装置を含む符号化装置とその符号化装置に対応する復号装置とを用いることにより、従来の線形予測分析装置を含む場合よりも、高いMOS値すなわち良い音質を得られたことがわかる。
第二実施形態は、基本周波数と正の相関関係にある値又は基本周波数と負の相関関係にある値と所定の閾値とを比較し、その比較結果に応じて係数wO(i)を決定するものである。第二実施形態は、係数決定部24における係数wO(i)の決定方法のみが第一実施形態と異なり、他の点について第一実施形態と同様である。以下、第一実施形態と異なる部分を中心に説明し、第一実施形態と同様の部分については重複説明を省略する。
第二実施形態の第一変形例は、基本周波数と正の相関関係にある値ではなく、基本周波数と負の相関関係にある値と所定の閾値とを比較し、その比較結果に応じて係数wO(i)を決定するものである。第二実施形態の第一変形例における所定の閾値は、第二実施形態において基本周波数と正の相関関係にある値と比較される所定の閾値とは異なる。
第二実施形態では1個の閾値を用いて係数wO(i)を決定したが、第二実施形態の第二変形例は2個以上の閾値を用いて係数wO(i)を決定するものである。以下、2個の閾値th1', th2'を用いて係数を決定する方法を例に挙げて説明する。閾値th1', th2'は、0<th1'<th2'という関係を満たすとする。
第二実施形態の第一変形例では1個の閾値を用いて係数wO(i)を決定したが、第二実施形態の第三変形例は2個以上の閾値を用いて係数wO(i)を決定するものである。以下、2個の閾値th1, th2を用いて係数を決定する方法を例に挙げて説明する。閾値th1, th2は、0<th1<th2という関係を満たすとする。
第三実施形態は、複数個の係数テーブルを用いて係数wO(i)を決定するものである。第三実施形態は、係数決定部24における係数wO(i)の決定方法のみが第一実施形態と異なり、他の点について第一実施形態と同様である。以下、第一実施形態と異なる部分を中心に説明し、第一実施形態と同様の部分については重複説明を省略する。
(1) 基本周波数と正の相関関係にある値>th2'の場合、すなわち、基本周波数が高いと判断された場合には、係数テーブルt0を係数テーブルtとして選択し、
(2) th2'≧基本周波数と正の相関関係にある値>th1'の場合、すなわち、基本周波数が中程度である判断された場合には、係数テーブルt1を係数テーブルtとして選択し、
(3) th1'≧基本周波数と正の相関関係にある値の場合、すなわち、基本周波数が低い判断された場合には、係数テーブルt2を係数テーブルtとして選択する。
(1) 基本周波数と負の相関関係にある値≧th2の場合、すなわち、周期が長いと判断された場合には、係数テーブルt2を係数テーブルtとして選択し、
(2) th2>基本周波数と負の相関関係にある値≧th1の場合、すなわち、周期が中程度であると判断された場合には、係数テーブルt1を係数テーブルtとして選択し、
(3) th1>基本周波数と負の相関関係にある値の場合、すなわち、周期が短いと判断された場合には、係数テーブルt0を係数テーブルtとして選択する。
係数テーブル記憶部25に記憶されている2個以上の係数テーブルについて以下のことが言える。
以下、第三実施形態の具体例について説明する。この具体例では、基本周波数と負の相関関係にある値として周期の量子化値が用いられ、この周期の量子化値に応じて係数テーブルtが選択される。
D=int(T/110+0.5) (17)
wt0(i) =[1.0, 0.999566371, 0.998266613, 0.996104103, 0.993084457, 0.989215493, 0.984507263, 0.978971839, 0.972623467, 0.96547842, 0.957554817, 0.948872864, 0.939454317, 0.929322779, 0.918503404, 0.907022834, 0.894909143]
wt1(i) =[1.0, 0.999706, 0.998824, 0.997356, 0.995304, 0.992673 , 0.989466, 0.985689, 0.98135, 0.976455, 0.971012, 0.965032, 0.958525, 0.951502, 0.943975, 0.935956, 0.927460]
wt2(i)=[1.0, 0.999926, 0.999706, 0.999338, 0.998824, 0.998163, 0.997356, 0.996403, 0.995304, 0.99406, 0.992672, 0.99114, 0.989465, 0.987647, 0.985688, 0.983588, 0.981348]
第三実施形態では複数個の係数テーブルのうち何れか1つのテーブルに記憶された係数を係数wO(i)として決定したが、第三実施形態の変形例はこれに加えて複数個の係数テーブルに記憶された係数に基づく演算処理により係数wO(i)を決定する場合を含む。
(1) 基本周波数と正の相関関係にある値>th2'の場合、すなわち、基本周波数が高いと判断された場合には、係数テーブルt0の各係数wt0(i)を係数wO(i)として選択し、
(2) th2'≧基本周波数と正の相関関係にある値>th1'の場合、すなわち、基本周波数が中程度であると判断された場合には、係数テーブルt0の各係数wt0(i)と係数テーブルt2の各係数wt2(i)とを用いて、wO(i)=β'×wt0(i)+(1-β')×wt2(i)により係数wO(i)を決定し、
(3) th1'≧基本周波数と正の相関関係にある値の場合、すなわち、基本周波数が低いと判断された場合には、係数テーブルt2の各係数wt2(i)を係数wO(i)として選択する。ここでβ'は、0≦β'≦1であり、基本周波数Pが小さい値をとるときはβ'の値も小さくなり、基本周波数Pが大きい値をとるときにβ'の値も大きくなる関数β'=c(P)により、基本周波数Pから求める値である。この構成とすれば、基本周波数が中程度の場合のうちの基本周波数Pが小さい時にはwt2(i)に近い値を係数wO(i)とすることができ、逆に基本周波数が中程度の場合のうちの基本周波数Pが大きい時にはwt0(i)に近い値を係数wO(i)とすることができるので、2つのテーブルだけで、3個以上の係数wO(i)を得ることができる。
(1) 基本周波数と負の相関関係にある値≧th2の場合、すなわち、周期が長いと判断された場合には、係数テーブルt2の各係数wt2(i)を係数wO(i)として選択し、
(2) th2>基本周波数と負の相関関係にある値≧th1の場合、すなわち、周期が中程度であると判断された場合には、係数テーブルt0の各係数wt0(i)と係数テーブルt2の各係数wt2(i)とを用いて、wO(i)=(1-β)×wt0(i)+β×wt2(i)により係数wO(i)を決定し、
(3) th1>基本周波数と負の相関関係にある値の場合、すなわち、周期が小さいと判断された場合には、係数テーブルt0の各係数wt0(i)を係数wO(i)として選択する。ここでβは0≦β≦1であり、かつ、周期Tが小さい値をとるときはβの値も小さくなり、周期Tが大きい値をとるときにβの値も大きくなる関数β=b(T)により、周期Tから求める値である。この構成とすれば、周期が中程度の場合のうちの周期Tが小さい時にはwt0(i)に近い値を係数wO(i)とすることができ、逆に周期が中程度の場合のうちの周期Tが大きい時にはwt2(i)に近い値を係数wO(i)とすることができるので、2つのテーブルだけで、3個以上の係数wO(i)を得ることができる。
図10及び図11に示すように、上述の全ての実施形態及び変形例において、係数乗算部22を含まず、予測係数計算部23において係数wO(i)とと自己相関RO(i)を用いて線形予測分析を行ってもよい。図10と図11は、それぞれ図1と図5に対応する線形予測分析装置2の構成例である。この場合は、予測係数計算部23は、図12に示すように、係数wO(i)と自己相関RO(i)とが乗算されたものである変形自己相関R'O(i)ではなく、係数wO(i)と自己相関RO(i)とを直接用いて線形予測分析を行う(ステップS5)。
第四実施形態は、入力信号XO(n)に対して従来の線形予測分析装置を用いて線形予測分析を行い、その線形予測分析の結果を用いて基本周波数計算部で基本周波数を得て、得られた基本周波数に基づく係数wO(i)を用いて本発明の線形予測分析装置により線形予測係数に変換可能な係数を求めるものである。
第一線形予測分析部31は、従来の線形予測分析装置1と同じ動作をする。すなわち、第一線形予測分析部31は、入力信号XO(n)から自己相関RO(i) (i=0,1,…,Pmax)を求め、自己相関RO(i) (i=0,1,…,Pmax)と予め定めた係数wO(i) (i=0,1,…,Pmax)とを同じiごとに乗じることにより変形自己相関R' O(i) (i=0,1,…,Pmax)を求め、変形自己相関R' O(i) (i=0,1,…,Pmax)から1次から予め定めた最大次数であるPmax次までの線形予測係数に変換可能な係数を求める。
線形予測残差計算部32は、入力信号XO(n)に対して、1次からPmax次までの線形予測係数に変換可能な係数に基づく線形予測や線形予測と等価なまたは類似したフィルタリング処理を行って線形予測残差信号XR(n)を求める。フィルタリング処理は重み付け処理とも言えるので、線形予測残差信号XR(n)は重み付け入力信号であるともいえる。
基本周波数計算部33は、線形予測残差信号XR(n)の基本周波数Pを求め、基本周波数についての情報を出力する。基本周波数を求める方法としては、様々な公知の方法が存在するので、公知の何れの方法を用いてもよい。基本周波数計算部33は、例えば、現フレームの線形予測残差信号XR (n) (n=0, 1, …, N-1)を構成する複数個のサブフレームのそれぞれについて基本周波数を求める。すなわち、2以上の整数であるM個のサブフレームであるXRs1(n) (n=0, 1, …, N/M-1), …, XRsM(n)(n= (M-1)N/M, (M-1)N/M+1, …, N-1)のそれぞれの基本周波数であるPs1, …, PsMを求める。NはMで割り切れるとする。基本周波数計算部33は、次に、現フレームを構成するM個のサブフレームの基本周波数であるPs1, …, PsMのうちの最大値max(Ps1, …, PsM)を特定可能な情報を基本周波数についての情報として出力する。
第二線形予測分析部34は、第一実施形態から第三実施形態の線形予測分析装置2、第二実施形態の第二変形例の線形予測分析装置2、第三実施形態の変形例の線形予測分析装置2、第一実施形態から第三実施形態に共通の変形例の線形予測分析装置2の何れかと同じ動作をする。すなわち、第二線形予測分析部34は、入力信号XO(n)から自己相関RO(i) (i=0,1,…,Pmax)を求め、基本周波数計算部33が出力した基本周波数についての情報に基づいて係数wO(i) (i=0,1,…,Pmax)を決定し、自己相関RO(i) (i=0,1,…,Pmax)と決定した係数wO(i) (i=0,1,…,Pmax)とを用いて1次から予め定めた最大次数であるPmax次までの線形予測係数に変換可能な係数を求める。
第四実施形態の変形例は、入力信号XO(n)に対して従来の線形予測分析装置を用いて線形予測分析を行い、その線形予測分析の結果を用いて周期計算部で周期を得て、得られた周期に基づく係数wO(i)を用いて本発明の線形予測分析装置により線形予測係数に変換可能な係数を求めるものである。
周期計算部35は、線形予測残差信号XR(n)の周期Tを求め、周期についての情報を出力する。周期を求める方法としては、様々な公知の方法が存在するので、公知の何れの方法を用いてもよい。周期計算部35は、例えば、現フレームの線形予測残差信号XR (n) (n=0, 1, …, N-1)を構成する複数個のサブフレームのそれぞれについて周期を求める。すなわち、2以上の整数であるM個のサブフレームであるXRs1(n) (n=0, 1, …, N/M-1), …, XRsM(n)(n= (M-1)N/M, (M-1)N/M+1, …, N-1)のそれぞれの周期であるTs1, …, TsMを求める。NはMで割り切れるとする。周期計算部35は、次に、現フレームを構成するM個のサブフレームの周期であるTs1, …, TsMのうちの最小値min(Ts1 …, TsM)を特定可能な情報を周期についての情報として出力する。
第四実施形態の変形例の第二線形予測分析部34は、第一実施形態の変形例の線形予測分析装置2、第二実施形態の第一変形例の線形予測分析装置2、第二実施形態の第三変形例の線形予測分析装置2、第三実施形態の線形予測分析装置2、第三実施形態の変形例の線形予測分析装置2、第一実施形態から第三実施形態に共通の変形例の線形予測分析装置2の何れかと同じ動作をする。すなわち、第二線形予測分析部34は、入力信号XO(n)から自己相関RO(i) (i=0,1,…,Pmax)を求め、周期計算部35が出力した周期についての情報に基づいて係数wO(i) (i=0,1,…,Pmax)を決定し、自己相関RO(i) (i=0,1,…,Pmax)と決定した係数wO(i) (i=0,1,…,Pmax)とを用いて1次から予め定めた最大次数であるPmax次までの線形予測係数に変換可能な係数を求める。
第一実施形態において基本周波数計算部930の具体例2として説明した通り、基本周波数と正の相関関係にある値として、前のフレームの信号処理においてLook-aheadとも呼ばれる先読みして利用するサンプル部分のうち現フレームのサンプルに対応する部分の基本周波数を用いてもよい。
第一実施形態において周期計算部940の具体例2として説明した通り、基本周波数と負の相関関係にある値として、前のフレームの信号処理においてLook-aheadとも呼ばれる先読みして利用するサンプル部分のうち現フレームのサンプルに対応する部分の周期を用いてもよい。
Claims (14)
- 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i)を計算する自己相関計算ステップと、
係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R’O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、
少なくとも一部の各次数iに対して、前記各次数iに対応する係数wO(i)が、現在又は過去のフレームにおける入力時系列信号に基づく周期、周期の量子化値、または基本周波数と負の相関関係にある値の増加とともに単調増加する関係にある場合が含まれている、
線形予測分析方法。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算ステップと、
2個以上の係数テーブルのそれぞれにはi=0, 1, …, Pmaxの各次数iと前記各次数iに対応する係数wO(i)とが対応付けて記憶されているとして、現在又は過去のフレームにおける入力時系列信号に基づく周期、周期の量子化値、または基本周波数と負の相関関係にある値を用いて前記2個以上の係数テーブルの中の1個の係数テーブルから係数wO(i) (i=0, 1, …, Pmax)を取得する係数決定ステップと、
取得された前記係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、
前記2個以上の係数テーブルの中の、前記周期、周期の量子化値、または基本周波数と負の相関関係にある値が第一値である場合に前記係数決定ステップで係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第一係数テーブルとし、
前記2個以上の係数テーブルの中の、前記周期、周期の量子化値、または基本周波数と負の相関関係にある値が前記第一値よりも大きい第二値である場合に前記係数決定ステップで係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第二係数テーブルとして、
少なくとも一部の各次数iに対して、前記第二係数テーブルにおける前記各次数iに対応する係数は、前記第一係数テーブルにおける前記各次数iに対応する係数よりも大きい、
線形予測分析方法。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算ステップと、
係数テーブルt0には係数wt0(i) (i=0, 1,…, Pmax)が格納されており、係数テーブルt1には係数wt1(i) (i=0, 1,…, Pmax)、係数テーブルt2には係数wt2(i) (i=0, 1,…, Pmax)が格納されているとして、現在又は過去のフレームにおける入力時系列信号に基づく周期、周期の量子化値、または基本周波数と負の相関関係にある値を用いて前記係数テーブルt0,t1,t2の中の1個の係数テーブルから係数を取得する係数決定ステップと、
前記取得した係数と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、
前記周期、周期の量子化値、または基本周波数と負の相関関係にある値に応じて、周期が短い場合、周期が中程度の場合、周期が長い場合の何れかの場合に分類されるとし、周期が短い場合に前記係数決定ステップで係数が取得される係数テーブルを係数テーブルt0とし、周期が中程度の場合に前記係数決定ステップで係数が取得される係数テーブルを係数テーブルt1とし、周期が長い場合に前記係数決定ステップで係数が取得される係数テーブルを係数テーブルt2として、少なくとも一部のiについてwt0(i)<wt1(i)≦wt2(i)であり、それ以外のiのうちの少なくとも一部の各iについてwt0(i)≦wt1(i)<wt2(i)であり、残りの各iについてwt0(i)≦wt1(i)≦wt2(i)である、
線形予測分析方法。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i)(i=0, 1, …, Pmax)を計算する自己相関計算ステップと、
係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R'O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、
少なくとも一部の各次数iに対して、前記各次数iに対応する係数wO(i)が、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値の増加とともに単調減少する関係にある場合が含まれている、
線形予測分析方法。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算ステップと、
2個以上の係数テーブルのそれぞれにはi=0, 1, …, Pmaxの各次数iと前記各次数iに対応する係数wO(i)とが対応付けて記憶されているとして、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値を用いて前記2個以上の係数テーブルの中の1個の係数テーブルから係数wO(i) (i=0, 1, …, Pmax)を取得する係数決定ステップと、
取得された係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、
前記2個以上の係数テーブルの中の、前記基本周波数と正の相関関係にある値が第一値である場合に前記係数決定ステップで係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第一係数テーブルとし、
前記2個以上の係数テーブルの中の、前記基本周波数と正の相関関係にある値が前記第一値よりも小さい第二値である場合に前記係数決定ステップで係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第二係数テーブルとして、
少なくとも一部の各次数iに対して、前記第二係数テーブルにおける前記各次数iに対応する係数は、前記第一係数テーブルにおける前記各次数iに対応する係数よりも大きい、
線形予測分析方法。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析方法であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算ステップと、
係数テーブルt0には係数wt0(i) (i=0,1,…,Pmax)が格納されており、係数テーブルt1には係数wt1(i) (i=0,1,…,Pmax)、係数テーブルt2には係数wt2(i) (i=0,1,…,Pmax)が格納されているとして、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値を用いて前記係数テーブルt0,t1,t2の中の1個の係数テーブルから係数を取得する係数決定ステップと、
前記取得した係数と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算ステップと、を含み、
前記基本周波数と正の相関関係にある値に応じて、基本周波数が高い場合、基本周波数が中程度の場合、基本周波数が低い場合の何れかの場合に分類されるとし、基本周波数が高い場合に前記係数決定ステップで係数が取得される係数テーブルを係数テーブルt0とし、基本周波数が中程度の場合に前記係数決定ステップで係数が取得される係数テーブルを係数テーブルt1とし、基本周波数が低い場合に前記係数決定ステップで係数が取得される係数テーブルを係数テーブルt2として、少なくとも一部のiについてwt0(i)<wt1(i)≦wt2(i)であり、それ以外のiのうちの少なくとも一部の各iについてwt0(i)≦wt1(i)<wt2(i)であり、残りの各iについてwt0(i)≦wt1(i)≦wt2(i)である、
線形予測分析方法。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析装置であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i)を計算する自己相関計算部と、
係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R’O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算部と、を含み、
少なくとも一部の各次数iに対して、前記各次数iに対応する係数wO(i)が、現在又は過去のフレームにおける入力時系列信号に基づく周期、周期の量子化値、または基本周波数と負の相関関係にある値の増加とともに単調増加する関係にある場合が含まれている、
線形予測分析装置。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析装置であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算部と、
2個以上の係数テーブルのそれぞれにはi=0, 1, …, Pmaxの各次数iと前記各次数iに対応する係数wO(i)とが対応付けて記憶されているとして、現在又は過去のフレームにおける入力時系列信号に基づく周期、周期の量子化値、または基本周波数と負の相関関係にある値を用いて前記2個以上の係数テーブルの中の1個の係数テーブルから係数wO(i) (i=0, 1, …, Pmax)を取得する係数決定部と、
取得された前記係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算部と、を含み、
前記2個以上の係数テーブルの中の、前記周期、周期の量子化値、または基本周波数と負の相関関係にある値が第一値である場合に前記係数決定部で係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第一係数テーブルとし、
前記2個以上の係数テーブルの中の、前記周期、周期の量子化値、または基本周波数と負の相関関係にある値が前記第一値よりも大きい第二値である場合に前記係数決定部で係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第二係数テーブルとして、
少なくとも一部の各次数iに対して、前記第二係数テーブルにおける前記各次数iに対応する係数は、前記第一係数テーブルにおける前記各次数iに対応する係数よりも大きい、
線形予測分析装置。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析装置であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算部と、
係数テーブルt0には係数wt0(i) (i=0, 1,…, Pmax)が格納されており、係数テーブルt1には係数wt1(i) (i=0, 1,…, Pmax)、係数テーブルt2には係数wt2(i) (i=0, 1,…, Pmax)が格納されているとして、現在又は過去のフレームにおける入力時系列信号に基づく周期、周期の量子化値、または基本周波数と負の相関関係にある値を用いて前記係数テーブルt0,t1,t2の中の1個の係数テーブルから係数を取得する係数決定部と、
前記取得した係数と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算部と、を含み、
前記周期、周期の量子化値、または基本周波数と負の相関関係にある値に応じて、周期が短い場合、周期が中程度の場合、周期が長い場合の何れかの場合に分類されるとし、周期が短い場合に前記係数決定部で係数が取得される係数テーブルを係数テーブルt0とし、周期が中程度の場合に前記係数決定部で係数が取得される係数テーブルを係数テーブルt1とし、周期が長い場合に前記係数決定部で係数が取得される係数テーブルを係数テーブルt2として、少なくとも一部のiについてwt0(i)<wt1(i)≦wt2(i)であり、それ以外のiのうちの少なくとも一部の各iについてwt0(i)≦wt1(i)<wt2(i)であり、残りの各iについてwt0(i)≦wt1(i)≦wt2(i)である、
線形予測分析装置。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析装置であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i)(i=0, 1, …, Pmax)を計算する自己相関計算部と、
係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R'O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算部と、を含み、
少なくとも一部の各次数iに対して、前記各次数iに対応する係数wO(i)が、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値の増加とともに単調減少する関係にある場合が含まれている、
線形予測分析装置。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析装置であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算部と、
2個以上の係数テーブルのそれぞれにはi=0, 1, …, Pmaxの各次数iと前記各次数iに対応する係数wO(i)とが対応付けて記憶されているとして、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値を用いて前記2個以上の係数テーブルの中の1個の係数テーブルから係数wO(i) (i=0, 1, …, Pmax)を取得する係数決定部と、
取得された前記係数wO(i) (i=0, 1, …, Pmax)と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算部と、を含み、
前記2個以上の係数テーブルの中の、前記基本周波数と正の相関関係にある値が第一値である場合に前記係数決定部で係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第一係数テーブルとし、
前記2個以上の係数テーブルの中の、前記基本周波数と正の相関関係にある値が前記第一値よりも小さい第二値である場合に前記係数決定部で係数wO(i) (i=0, 1, …, Pmax)が取得される係数テーブルを第二係数テーブルとして、
少なくとも一部の各次数iに対して、前記第二係数テーブルにおける前記各次数iに対応する係数は、前記第一係数テーブルにおける前記各次数iに対応する係数よりも大きい、
線形予測分析装置。 - 入力時系列信号に対応する線形予測係数に変換可能な係数を、所定時間区間であるフレームごとに求める、線形予測分析装置であって、
少なくともi=0,1,…,Pmaxのそれぞれについて、現在のフレームの入力時系列信号XO(n)とiサンプルだけ過去の入力時系列信号XO(n-i)またはiサンプルだけ未来の入力時系列信号XO(n+i)との自己相関RO(i) (i=0, 1, …, Pmax)を計算する自己相関計算部と、
係数テーブルt0には係数wt0(i) (i=0, 1,…, Pmax)が格納されており、係数テーブルt1には係数wt1(i) (i=0, 1,…, Pmax)、係数テーブルt2には係数wt2(i) (i=0, 1,…, Pmax)が格納されているとして、現在又は過去のフレームにおける入力時系列信号に基づく基本周波数と正の相関関係にある値を用いて前記係数テーブルt0,t1,t2の中の1個の係数テーブルから係数を取得する係数決定部と、
前記取得した係数と前記自己相関RO(i) (i=0, 1, …, Pmax)とが対応するiごとに乗算されたものである変形自己相関R' O(i) (i=0, 1, …, Pmax)を用いて、1次からPmax次までの線形予測係数に変換可能な係数を求める予測係数計算部と、を含み、
前記基本周波数と正の相関関係にある値に応じて、基本周波数が高い場合、基本周波数が中程度の場合、基本周波数が低い場合の何れかの場合に分類されるとし、基本周波数が高い場合に前記係数決定部で係数が取得される係数テーブルを係数テーブルt0とし、基本周波数が中程度の場合に前記係数決定部で係数が取得される係数テーブルを係数テーブルt1とし、基本周波数が低い場合に前記係数決定部で係数が取得される係数テーブルを係数テーブルt2として、少なくとも一部のiについてwt0(i)<wt1(i)≦wt2(i)であり、それ以外のiのうちの少なくとも一部の各iについてwt0(i)≦wt1(i)<wt2(i)であり、残りの各iについてwt0(i)≦wt1(i)≦wt2(i)である、
線形予測分析装置。 - 請求項1から6の線形予測分析方法の各ステップをコンピュータに実行させるためのプログラム。
- 請求項1から6の線形予測分析方法の各ステップをコンピュータに実行させるためのプログラムが記録されたコンピュータ読み取り可能な記録媒体。
Priority Applications (20)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18173638.0A EP3399522B1 (en) | 2013-07-18 | 2014-07-16 | Linear prediction analysis device, method, program, and storage medium |
PL14826090T PL3012835T3 (pl) | 2013-07-18 | 2014-07-16 | Urządzenie, sposób i program do analizy predykcji liniowej, oraz nośnik zapisu |
PL18173638T PL3399522T3 (pl) | 2013-07-18 | 2014-07-16 | Urządzenie, sposób i program do analizy predykcji liniowej oraz nośnik zapisu |
KR1020177032374A KR101883789B1 (ko) | 2013-07-18 | 2014-07-16 | 선형 예측 분석 장치, 방법, 프로그램 및 기록 매체 |
JP2015527315A JP6117359B2 (ja) | 2013-07-18 | 2014-07-16 | 線形予測分析装置、方法、プログラム及び記録媒体 |
ES14826090T ES2699582T3 (es) | 2013-07-18 | 2014-07-16 | Dispositivo, método, programa y medio de almacenamiento de análisis de predicción lineal |
CN201811547968.0A CN109887520B (zh) | 2013-07-18 | 2014-07-16 | 线性预测分析装置、线性预测分析方法以及记录介质 |
US14/905,158 US10909996B2 (en) | 2013-07-18 | 2014-07-16 | Linear prediction analysis device, method, program, and storage medium |
PL18173641T PL3389047T3 (pl) | 2013-07-18 | 2014-07-16 | Urządzenie, sposób i program do analizy predykcji liniowej oraz nośnik zapisu |
CN201811547976.5A CN110070877B (zh) | 2013-07-18 | 2014-07-16 | 线性预测分析装置、线性预测分析方法以及记录介质 |
CN201480040536.4A CN105378836B (zh) | 2013-07-18 | 2014-07-16 | 线性预测分析装置、方法、以及记录介质 |
CN201811547577.9A CN109979471B (zh) | 2013-07-18 | 2014-07-16 | 线性预测分析装置、线性预测分析方法以及记录介质 |
KR1020167001218A KR101797679B1 (ko) | 2013-07-18 | 2014-07-16 | 선형 예측 분석 장치, 방법, 프로그램 및 기록 매체 |
CN201811547969.5A CN110085243B (zh) | 2013-07-18 | 2014-07-16 | 线性预测分析装置、线性预测分析方法以及记录介质 |
EP14826090.4A EP3012835B1 (en) | 2013-07-18 | 2014-07-16 | Linear-prediction analysis device, method, program, and storage medium |
CN201811547970.8A CN110070876B (zh) | 2013-07-18 | 2014-07-16 | 线性预测分析装置、线性预测分析方法以及记录介质 |
EP18173641.4A EP3389047B1 (en) | 2013-07-18 | 2014-07-16 | Linear prediction analysis device, method, program, and storage medium |
KR1020177032372A KR101883767B1 (ko) | 2013-07-18 | 2014-07-16 | 선형 예측 분석 장치, 방법, 프로그램 및 기록 매체 |
US17/120,462 US11532315B2 (en) | 2013-07-18 | 2020-12-14 | Linear prediction analysis device, method, program, and storage medium |
US17/970,879 US11972768B2 (en) | 2013-07-18 | 2022-10-21 | Linear prediction analysis device, method, program, and storage medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013149160 | 2013-07-18 | ||
JP2013-149160 | 2013-07-18 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/905,158 A-371-Of-International US10909996B2 (en) | 2013-07-18 | 2014-07-16 | Linear prediction analysis device, method, program, and storage medium |
US17/120,462 Continuation US11532315B2 (en) | 2013-07-18 | 2020-12-14 | Linear prediction analysis device, method, program, and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015008783A1 true WO2015008783A1 (ja) | 2015-01-22 |
Family
ID=52346231
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2014/068895 WO2015008783A1 (ja) | 2013-07-18 | 2014-07-16 | 線形予測分析装置、方法、プログラム及び記録媒体 |
Country Status (9)
Country | Link |
---|---|
US (3) | US10909996B2 (ja) |
EP (3) | EP3399522B1 (ja) |
JP (1) | JP6117359B2 (ja) |
KR (3) | KR101797679B1 (ja) |
CN (6) | CN109887520B (ja) |
ES (3) | ES2699582T3 (ja) |
PL (3) | PL3389047T3 (ja) |
TR (1) | TR201815212T4 (ja) |
WO (1) | WO2015008783A1 (ja) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3399522B1 (en) * | 2013-07-18 | 2019-09-11 | Nippon Telegraph and Telephone Corporation | Linear prediction analysis device, method, program, and storage medium |
PL3385948T3 (pl) | 2014-03-24 | 2020-01-31 | Nippon Telegraph And Telephone Corporation | Sposób kodowania, koder, program i nośnik zapisu |
US9721159B2 (en) * | 2015-10-05 | 2017-08-01 | Evan Donald Balster | Periodicity analysis system |
EP3684463A4 (en) | 2017-09-19 | 2021-06-23 | Neuroenhancement Lab, LLC | NEURO-ACTIVATION PROCESS AND APPARATUS |
JP6904198B2 (ja) * | 2017-09-25 | 2021-07-14 | 富士通株式会社 | 音声処理プログラム、音声処理方法および音声処理装置 |
US11717686B2 (en) | 2017-12-04 | 2023-08-08 | Neuroenhancement Lab, LLC | Method and apparatus for neuroenhancement to facilitate learning and performance |
WO2019133997A1 (en) | 2017-12-31 | 2019-07-04 | Neuroenhancement Lab, LLC | System and method for neuroenhancement to enhance emotional response |
US11364361B2 (en) | 2018-04-20 | 2022-06-21 | Neuroenhancement Lab, LLC | System and method for inducing sleep by transplanting mental states |
CN113382683A (zh) | 2018-09-14 | 2021-09-10 | 纽罗因恒思蒙特实验有限责任公司 | 改善睡眠的系统和方法 |
US11786694B2 (en) | 2019-05-24 | 2023-10-17 | NeuroLight, Inc. | Device, method, and app for facilitating sleep |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010170124A (ja) * | 2008-12-30 | 2010-08-05 | Huawei Technologies Co Ltd | 信号圧縮方法及び装置 |
WO2012046685A1 (ja) * | 2010-10-05 | 2012-04-12 | 日本電信電話株式会社 | 符号化方法、復号方法、符号化装置、復号装置、プログラム、記録媒体 |
Family Cites Families (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5550859A (en) * | 1994-04-29 | 1996-08-27 | Lucent Technologies Inc. | Recovering analog and digital signals from superimposed analog and digital signals using linear prediction |
JP3402748B2 (ja) * | 1994-05-23 | 2003-05-06 | 三洋電機株式会社 | 音声信号のピッチ周期抽出装置 |
US5774846A (en) * | 1994-12-19 | 1998-06-30 | Matsushita Electric Industrial Co., Ltd. | Speech coding apparatus, linear prediction coefficient analyzing apparatus and noise reducing apparatus |
US5648989A (en) * | 1994-12-21 | 1997-07-15 | Paradyne Corporation | Linear prediction filter coefficient quantizer and filter set |
JP3522012B2 (ja) * | 1995-08-23 | 2004-04-26 | 沖電気工業株式会社 | コード励振線形予測符号化装置 |
TW321810B (ja) * | 1995-10-26 | 1997-12-01 | Sony Co Ltd | |
EP0883107B9 (en) * | 1996-11-07 | 2005-01-26 | Matsushita Electric Industrial Co., Ltd | Sound source vector generator, voice encoder, and voice decoder |
FI113903B (fi) * | 1997-05-07 | 2004-06-30 | Nokia Corp | Puheen koodaus |
CA2722110C (en) * | 1999-08-23 | 2014-04-08 | Panasonic Corporation | Apparatus and method for speech coding |
US6959274B1 (en) * | 1999-09-22 | 2005-10-25 | Mindspeed Technologies, Inc. | Fixed rate speech compression system and method |
US20040002856A1 (en) * | 2002-03-08 | 2004-01-01 | Udaya Bhaskar | Multi-rate frequency domain interpolative speech CODEC system |
AU2003213439A1 (en) * | 2002-03-08 | 2003-09-22 | Nippon Telegraph And Telephone Corporation | Digital signal encoding method, decoding method, encoding device, decoding device, digital signal encoding program, and decoding program |
CN1677493A (zh) * | 2004-04-01 | 2005-10-05 | 北京宫羽数字技术有限责任公司 | 一种增强音频编解码装置及方法 |
WO2005112005A1 (ja) * | 2004-04-27 | 2005-11-24 | Matsushita Electric Industrial Co., Ltd. | スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 |
CN1977309B (zh) * | 2004-08-19 | 2010-11-10 | 日本电信电话株式会社 | 多信道信号编码方法、解码方法、用于该方法的装置和程序、以及其上存储程序的记录介质 |
WO2006025313A1 (ja) * | 2004-08-31 | 2006-03-09 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置、音声復号化装置、通信装置及び音声符号化方法 |
EP2273494A3 (en) * | 2004-09-17 | 2012-11-14 | Panasonic Corporation | Scalable encoding apparatus, scalable decoding apparatus |
DE602006020686D1 (de) * | 2005-01-12 | 2011-04-28 | Nippon Telegraph & Telephone | Kodierverfahren und dekodierverfahren mit langzeitvorhersage, vorrichtungen, programm und aufzeichnungsmedium dafür |
JP4675692B2 (ja) * | 2005-06-22 | 2011-04-27 | 富士通株式会社 | 話速変換装置 |
JP4736632B2 (ja) * | 2005-08-31 | 2011-07-27 | 株式会社国際電気通信基礎技術研究所 | ボーカル・フライ検出装置及びコンピュータプログラム |
CN1815552B (zh) * | 2006-02-28 | 2010-05-12 | 安徽中科大讯飞信息科技有限公司 | 基于线谱频率及其阶间差分参数的频谱建模与语音增强方法 |
DE602007003023D1 (de) * | 2006-05-30 | 2009-12-10 | Koninkl Philips Electronics Nv | Linear-prädiktive codierung eines audiosignals |
JP4757130B2 (ja) * | 2006-07-20 | 2011-08-24 | 富士通株式会社 | ピッチ変換方法及び装置 |
CN101154381B (zh) * | 2006-09-30 | 2011-03-30 | 华为技术有限公司 | 一种获取线性预测滤波器系数的装置 |
JP5618826B2 (ja) * | 2007-06-14 | 2014-11-05 | ヴォイスエイジ・コーポレーション | Itu.t勧告g.711と相互運用可能なpcmコーデックにおいてフレーム消失を補償する装置および方法 |
JP4825916B2 (ja) * | 2007-12-11 | 2011-11-30 | 日本電信電話株式会社 | 符号化方法、復号化方法、これらの方法を用いた装置、プログラム、記録媒体 |
JP4918074B2 (ja) * | 2008-08-18 | 2012-04-18 | 日本電信電話株式会社 | 符号化装置、符号化方法、符号化プログラム、及び記録媒体 |
CN101983402B (zh) * | 2008-09-16 | 2012-06-27 | 松下电器产业株式会社 | 声音分析装置、方法、系统、合成装置、及校正规则信息生成装置、方法 |
WO2010102446A1 (zh) * | 2009-03-11 | 2010-09-16 | 华为技术有限公司 | 一种线性预测分析方法、装置及系统 |
JP4932917B2 (ja) * | 2009-04-03 | 2012-05-16 | 株式会社エヌ・ティ・ティ・ドコモ | 音声復号装置、音声復号方法、及び音声復号プログラム |
CN102044250B (zh) * | 2009-10-23 | 2012-06-27 | 华为技术有限公司 | 频带扩展方法及装置 |
CN102884570B (zh) * | 2010-04-09 | 2015-06-17 | 杜比国际公司 | 基于mdct的复数预测立体声编码 |
JP5663461B2 (ja) * | 2011-12-06 | 2015-02-04 | 日本電信電話株式会社 | 符号化方法、符号化装置、プログラム、記録媒体 |
CN102693147B (zh) * | 2012-06-13 | 2015-10-28 | 上海第二工业大学 | 计算机汇编语言的辅助分析装置及分析方法 |
CN102867516B (zh) * | 2012-09-10 | 2014-08-27 | 大连理工大学 | 一种采用高阶线性预测系数分组矢量量化的语音编解方法 |
EP3399522B1 (en) * | 2013-07-18 | 2019-09-11 | Nippon Telegraph and Telephone Corporation | Linear prediction analysis device, method, program, and storage medium |
KR101850523B1 (ko) * | 2014-01-24 | 2018-04-19 | 니폰 덴신 덴와 가부시끼가이샤 | 선형 예측 분석 장치, 방법, 프로그램 및 기록 매체 |
US9928850B2 (en) * | 2014-01-24 | 2018-03-27 | Nippon Telegraph And Telephone Corporation | Linear predictive analysis apparatus, method, program and recording medium |
-
2014
- 2014-07-16 EP EP18173638.0A patent/EP3399522B1/en active Active
- 2014-07-16 JP JP2015527315A patent/JP6117359B2/ja active Active
- 2014-07-16 ES ES14826090T patent/ES2699582T3/es active Active
- 2014-07-16 TR TR2018/15212T patent/TR201815212T4/tr unknown
- 2014-07-16 US US14/905,158 patent/US10909996B2/en active Active
- 2014-07-16 ES ES18173638T patent/ES2760934T3/es active Active
- 2014-07-16 PL PL18173641T patent/PL3389047T3/pl unknown
- 2014-07-16 KR KR1020167001218A patent/KR101797679B1/ko active IP Right Grant
- 2014-07-16 CN CN201811547968.0A patent/CN109887520B/zh active Active
- 2014-07-16 CN CN201811547577.9A patent/CN109979471B/zh active Active
- 2014-07-16 ES ES18173641T patent/ES2749904T3/es active Active
- 2014-07-16 PL PL18173638T patent/PL3399522T3/pl unknown
- 2014-07-16 CN CN201811547976.5A patent/CN110070877B/zh active Active
- 2014-07-16 PL PL14826090T patent/PL3012835T3/pl unknown
- 2014-07-16 WO PCT/JP2014/068895 patent/WO2015008783A1/ja active Application Filing
- 2014-07-16 CN CN201811547970.8A patent/CN110070876B/zh active Active
- 2014-07-16 KR KR1020177032374A patent/KR101883789B1/ko active IP Right Grant
- 2014-07-16 EP EP18173641.4A patent/EP3389047B1/en active Active
- 2014-07-16 CN CN201480040536.4A patent/CN105378836B/zh active Active
- 2014-07-16 EP EP14826090.4A patent/EP3012835B1/en active Active
- 2014-07-16 KR KR1020177032372A patent/KR101883767B1/ko active IP Right Grant
- 2014-07-16 CN CN201811547969.5A patent/CN110085243B/zh active Active
-
2020
- 2020-12-14 US US17/120,462 patent/US11532315B2/en active Active
-
2022
- 2022-10-21 US US17/970,879 patent/US11972768B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010170124A (ja) * | 2008-12-30 | 2010-08-05 | Huawei Technologies Co Ltd | 信号圧縮方法及び装置 |
WO2012046685A1 (ja) * | 2010-10-05 | 2012-04-12 | 日本電信電話株式会社 | 符号化方法、復号方法、符号化装置、復号装置、プログラム、記録媒体 |
Non-Patent Citations (6)
Title |
---|
ITU-T RECOMMENDATION G.718, ITU, 2008 |
ITU-T RECOMMENDATION G.729, ITU, 1996 |
REDWAN SALAMI ET AL.: "Design and Description of CS -ACELP:A Toll Quality 8 kb/s Speech Coder", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, vol. 6, no. 2, 1 March 1998 (1998-03-01), pages 116 - 130, XP011054298 * |
See also references of EP3012835A4 |
YOH'ICHI TOHKURA; FUMITADA ITAKURA; SHIN'ICHIRO HASHIMOTO: "Spectral Smoothing Technique in PARCOR Speech Analysis-Synthesis", IEEE TRANS. ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, vol. ASSP-26, no. 6, 1978 |
YOICHI TOKURA ET AL.: "PARCOR Taiiki Asshuku Hoshiki ni Okeru Onsei Hinshitsu Kojo", THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS A, vol. J61-A, no. 3, 25 March 1978 (1978-03-25), pages 254 - 261, XP008182649 * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6117359B2 (ja) | 線形予測分析装置、方法、プログラム及び記録媒体 | |
JP6423065B2 (ja) | 線形予測分析装置、方法、プログラム及び記録媒体 | |
JP6416363B2 (ja) | 線形予測分析装置、方法、プログラム及び記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14826090 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14905158 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 2015527315 Country of ref document: JP Kind code of ref document: A Ref document number: 20167001218 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2014826090 Country of ref document: EP |