JPS58198095A

JPS58198095A - Line spectrum type voice analyzer/synthesizer

Info

Publication number: JPS58198095A
Application number: JP57080963A
Authority: JP
Inventors: 哲田口; 小林　雅徳; 孝行石川
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1982-05-14
Filing date: 1982-05-14
Publication date: 1983-11-17
Also published as: JPH0235991B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】本発明は線スペクトル型音声分析合成装置に関し、特に
線スペクトル周波数の量子化歪を低減して音声のスペク
トル包絡の近似度を改善することを特徴とする線スペク
トル型音声分析合成装置に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a line spectrum type speech analysis and synthesis device, and particularly to a line spectrum type voice analysis and synthesis device, which is characterized in that it reduces quantization distortion of line spectrum frequencies and improves the approximation of the spectrum envelope of the voice. Regarding analysis and synthesis equipment.

従来の線スペクトル型分析合成装置の一例のブロック図
を第１図に示す。図において明かなように、音声分析側
５１においては、線形予測分析器１およびＬＳＰパラメ
ータ計算器２を含むＬＳＰパラメータ抽出手段、ＬＳＰ
パラメータ符号化器３、音源パラメータ分析器４．音源
情報符号化器５とを備え、音声合成側５２においては、
ＬＳＰパラメータ復号化器７．にパラメータ計算器８゜
音源情報復号化器９．励撮信号発生器１０．音声合成フ
ィルタ１１を備えている。FIG. 1 shows a block diagram of an example of a conventional line spectrum analysis and synthesis apparatus. As is clear in the figure, the speech analysis side 51 includes LSP parameter extraction means including a linear prediction analyzer 1 and an LSP parameter calculator 2;
Parameter encoder 3, sound source parameter analyzer 4. The speech synthesis side 52 includes a sound source information encoder 5.
LSP parameter decoder7. Parameter calculator 8° Sound source information decoder 9. Excitation signal generator 10. A speech synthesis filter 11 is provided.

分析側５１において音声入力信号から線形予測係数を分
析抽出してＬ　Ｓ　’Ｐパラメータ計算器２に送出する
。前記線形予測係数としてはαパラメータα１（ｉ＝１
〜ｎ１　ｎは所定の正の整数）を用い、このαパラメー
タからＬＡＰパラメータ計算器２により線スペクトル周
波数ω１（ｉ＝１〜ｎ、ｎは所定の正の整数）を算出す
る。この（ωｉ）は一般にＬＳＰパラメータと呼ばれて
いる。この（ωｉ）はＬＳＰパラメータ符号化器３に送
られ、ここにおいて線形量子化された後、音声のスペク
トル包絡に拘わる分析データとしてデータ伝送手段６を
介して音声合成側５２に伝送される。また音源パラメー
タ分析器４は前記入力信号がらピッチ周波数、有声無声
判別信号、短時間平均電力を含む音源情報を抽出して音
源情報符号化器５に出力する。音源情報符号化器５は前
記音源情報を量子化してデータ伝送手段６を介して音声
合成側５２に伝送する。The analysis side 51 analyzes and extracts linear prediction coefficients from the audio input signal and sends them to the L S 'P parameter calculator 2 . The linear prediction coefficient is α parameter α1 (i=1
~n1 n is a predetermined positive integer), and the LAP parameter calculator 2 calculates the line spectrum frequency ω1 (i=1 to n, n is a predetermined positive integer) from this α parameter. This (ωi) is generally called an LSP parameter. This (ωi) is sent to the LSP parameter encoder 3, where it is linearly quantized and then transmitted to the speech synthesis side 52 via the data transmission means 6 as analysis data related to the spectral envelope of the speech. Further, the sound source parameter analyzer 4 extracts sound source information including the pitch frequency, voiced/unvoiced discrimination signal, and short-time average power from the input signal, and outputs the extracted sound source information to the sound source information encoder 5. The sound source information encoder 5 quantizes the sound source information and transmits it to the speech synthesis side 52 via the data transmission means 6.

音声合成側５２において、前記音声のスペクトル包絡に
拘わる分析データについては、音声分析側５１において
ＬＳＰパラメータ符号化器３により符号化された前記Ｌ
ＳＰパラメータをＬＳＰパラメータ復号化器７において
復号し、Ｋパラメータ計算器８において前記ＬＳＰパラ
メータ（ωｉ）から同一次数の線形予測係数αパラメー
タ（ωｉ）を算出し、このαパラメータから更に線形予
測係数の態形式であるにパラメータ（Ｋｉ）を抽出して
音声合成フィルタ１１の係数を制御する。一方、音源情
報符号化器５によシ量子化された前記音源情報は音源情
報復号化器９に入力されて復号され、励振信号発生器１
０に出力される。励振信号発生器１０は前記音源情報を
入力してフィルタ励振信号を発生し、音声合成フィルタ
１１に供給する。On the speech synthesis side 52, the analysis data related to the spectral envelope of the speech is analyzed using the LSP encoded by the LSP parameter encoder 3 on the speech analysis side 51.
The SP parameter is decoded in the LSP parameter decoder 7, and the linear prediction coefficient α parameter (ωi) of the same order is calculated from the LSP parameter (ωi) in the K parameter calculator 8. From this α parameter, the linear prediction coefficient is further calculated. The coefficients of the speech synthesis filter 11 are controlled by extracting a parameter (Ki) in the form of a speech synthesis filter. On the other hand, the sound source information quantized by the sound source information encoder 5 is input to the sound source information decoder 9 and decoded, and the excitation signal generator 1
Output to 0. The excitation signal generator 10 inputs the sound source information, generates a filter excitation signal, and supplies it to the speech synthesis filter 11.

音声合成フィルタ１１はにパラメータ計算器８よシ入力
された前記にパラメータと振振信号発生器１０から供給
されたフィルタ励振信号とにより音声を合成し端子９２
より出力する。The voice synthesis filter 11 synthesizes voice using the parameters inputted from the parameter calculator 8 and the filter excitation signal supplied from the vibration signal generator 10, and outputs the voice to the terminal 92.
Output from

この従来の線スペクトル型音声分析合成装置の一例にお
ける線スペクトル周波数の模様の一例を第５図に示す。FIG. 5 shows an example of a line spectrum frequency pattern in an example of this conventional line spectrum type speech analysis and synthesis apparatus.

図において縦軸は時間、横軸は線スペクトル周波数を表
わしている。点１０．１〜１０８は時刻ｔ１における線
スペクトル周波数の分布を示す。この例における線スペ
クトル周波数の数は８個である。同様にして、ｔ２〜ｔ
ｓの各時刻にお５− ける線スペクトル周波数の分布が２０１〜５０８の各点
により表わされている。In the figure, the vertical axis represents time and the horizontal axis represents line spectrum frequency. Points 10.1 to 108 show the distribution of line spectrum frequencies at time t1. The number of line spectral frequencies in this example is eight. Similarly, t2 to t
The distribution of line spectrum frequencies at each time s is represented by points 201 to 508.

図において、時刻ｔ３における線スペクトル周波数３０
１，３０２に着目すると、その値が極めて近接している
ことが分る。時刻ｔ４における線スペクトル周波数４０
１，４０２についても同様である。一方時刻１１におけ
る線スペクトル周波数１０３，１０４および１０４，１
０５等は、比においてＬＳＰパラメータ符号化器３によ
り線形量子化されるが、前記線スペクトル周波数３０１
゜３０２または４０１．４０２の場合のように隣接周波
数間隔が極めて狭い周波数領域においては、それらの線
スペクトル周波数に含まれている音声のスペクトル包絡
に拘わる情報量が高密度で分布されておシ、また一方に
おいては、対照的に前記線スペクトル周波数１０３．１
０４または１０４゜１０５の場合のように隣接周波数間
隔が大きい周波数領域においては、それらの線スペクト
ル周波＝６− 数に含まれている音声のスペクトル包絡に拘わる情報量
の密度は相対的に低い。従って、従来の線スペクトル周
波数領域における線形量子化手法によるデータ伝送にお
いては、前記線スペクトル周波数に付与されている分析
データの情報密度分布に関係なく一様に量子化されるた
め、結果として、量子化歪が増大し音声スペクトル包絡
の近似度に関連する重要な情報が一部損われ、且つ量子
化効率が低下するという障害を生じる。従って、最終的
に音声合成側５２において再生抽出されるにパラメータ
（Ｋｉ）により制御される音声合成フィルタ１１によシ
合成される音声の質も低下する。In the figure, line spectrum frequency 30 at time t3
Focusing on 1,302, it can be seen that the values are extremely close. Line spectrum frequency 40 at time t4
The same applies to 1,402. On the other hand, line spectrum frequencies 103, 104 and 104, 1 at time 11
05 etc. are linearly quantized by the LSP parameter encoder 3 in the ratio, but the line spectrum frequency 301
In frequency regions where adjacent frequency intervals are extremely narrow, such as in the case of ゜302 or 401.402, the amount of information related to the spectral envelope of the voice contained in those line spectral frequencies is distributed with high density. On the other hand, in contrast, the line spectral frequency 103.1
In frequency regions where adjacent frequency intervals are large, such as in the case of 04 or 104°105, the density of the amount of information related to the spectral envelope of the voice contained in those line spectral frequencies is relatively low. Therefore, in data transmission using the conventional linear quantization method in the line spectral frequency domain, quantization is uniformly performed regardless of the information density distribution of the analysis data assigned to the line spectral frequency, and as a result, the quantization This causes a problem in that the quantization distortion increases, important information related to the degree of approximation of the audio spectrum envelope is partially lost, and the quantization efficiency decreases. Therefore, when the voice is finally reproduced and extracted on the voice synthesis side 52, the quality of the voice synthesized by the voice synthesis filter 11 controlled by the parameter (Ki) also deteriorates.

即ち、従来の線スペクトル型音声分析合成装置において
は、分析側において線形予測係数よシ抽出されたＬＳＰ
パラメータ（ωｉ）を、線形量子化によシ符号化して合
成側にデータを伝送しているために、隣接する線スペク
トル周波数の周波数間隔の狭い領域における量子化歪が
犬きくなシ、音声のスペクトル包絡の近似度を劣化させ
、且つ量子化効率を低下させるという欠点がある。That is, in the conventional line spectrum type speech analysis and synthesis device, the LSP extracted from the linear prediction coefficients is
Since the parameter (ωi) is encoded by linear quantization and the data is transmitted to the synthesis side, quantization distortion in the narrow frequency interval region of adjacent line spectral frequencies is severe, and the voice This has the drawback of deteriorating the degree of approximation of the spectral envelope and reducing the quantization efficiency.

本発明の目的は上記の欠点を除去し、線スペクトル周波
数を線形量子化する代りに、一つの特定の線スペクトル
周波数と凡ての隣接する線スペクトル周波数間の周波数
差とを非線形量子化手法により符号化し、量子中が少な
く且つ量子化効率の良い線スペクトル型音声分析合成装
置を提供することにある。It is an object of the present invention to eliminate the above-mentioned drawbacks and instead of linearly quantizing line spectral frequencies, the frequency differences between one particular line spectral frequency and all adjacent line spectral frequencies are quantized using a non-linear quantization technique. It is an object of the present invention to provide a line spectrum type speech analysis and synthesis device that performs encoding, has a small amount of quantization, and has high quantization efficiency.

本発明の線スペクトル型音声分析合成装置は、入力信号
よ多線形予測分析によって得られる線形予測係数から線
スペクトル周波数を算出して音声を分析合成を行う線ス
ペクトル型音声分析合成装置において、前記入力信号か
ら前記線スペクトル周波数の中の特定の線スペクトル周
波数（ωにと〜ｎ−１）とを含む線スペクトルパラメー
タを分析抽出する線スペクトルパラメータ抽出手段と、
この線スペクトルパラメータに拘わる信号を非線　　　
　　・形量子化する符号化手段とを音声分析側に備え、
且つ前記符号化手段により非線形量子化された符号化信
号を復号する復号化手段と、この復号された前記線スペ
クトルパラメータに拘わる信号よ多線形予測係数を抽出
して音声合成フィルタを制御し合成波形を出力する音声
合成手段とを音声合成側に備えて構成される。The line spectrum type speech analysis and synthesis apparatus of the present invention analyzes and synthesizes speech by calculating a line spectrum frequency from an input signal and linear prediction coefficients obtained by multilinear prediction analysis. Line spectrum parameter extraction means for analyzing and extracting a line spectrum parameter including a specific line spectrum frequency (ω to n-1) among the line spectrum frequencies from the signal;
The signal related to this linear spectrum parameter is
・Equipped with an encoding means for shape quantization on the speech analysis side,
a decoding means for decoding the coded signal non-linearly quantized by the coding means; and a decoding means for extracting multilinear prediction coefficients from the decoded signal related to the line spectrum parameters to control a speech synthesis filter and generate a synthesized waveform. The speech synthesis device is equipped with a speech synthesis means for outputting the speech.

以下本発明について図面を参照して詳細に説明する。The present invention will be described in detail below with reference to the drawings.

第２図は、本発明の線スペクトル型音声分析合成装置の
第１の実施例を示すブロック図である。FIG. 2 is a block diagram showing a first embodiment of the line spectrum type speech analysis and synthesis apparatus of the present invention.

本実施例においては、音声分析側６１において線形予測
分析器１２および線スペクトルパラメータ（ωに、Δω
ｉ）計算器１３を含む純スペクトルパラメータ抽出手段
、線スペクトルパラメータ（ωｋ。In this embodiment, the speech analysis side 61 uses the linear prediction analyzer 12 and the line spectrum parameters (ω, Δω
i) Pure spectral parameter extraction means including a calculator 13, line spectral parameters (ωk.

Δωｉ）符号化器１４．音源パラメータ分析器１５゜音
源情報符号化器１６とを備え、音声合成側６２において
線スペクトルパラメータ（ωに、Δωｉ）復号化器１８
．にパラメータ計算器１９、音源情報符号化器２０．励
振信号発生器２１．音声合成フィルタ２２を備えている
。Δωi) encoder 14. a sound source parameter analyzer 15 and a sound source information encoder 16, and a line spectrum parameter (ω, Δωi) decoder 18 on the speech synthesis side 62.
．． a parameter calculator 19, a sound source information encoder 20. Excitation signal generator 21. A voice synthesis filter 22 is provided.

分析側６１において音声入力信号は端子９３か９− ら入力される。線形予測分析器１２は前記入力信号から
自己相関係数、共分散係数などの波形再現度に拘わる係
数を分析抽出し紳スペクトルパラメータ（ωに、ΔωＩ
）計算器１３に出力する。線スペクトルパラメータ（ω
に、Δωｉ）計算器１３は前記波形再現度に拘わる係数
から正規方程式を解くことによ多線形予測係数（αｉ）
即ちαパラメータを求め、更にこのαパラメータから所
定の線スペクトル周波数ωｋ（ｋは所定の正の整数）お
よび凡ての隣接する線スペクトル周波数の周波数差Δω
１（ｉ−１〜ｎ−１、ｎは所定の正の整数）を算出する
。算出された前記ωにおよびΔωｉは線スペクトルパラ
メータ（ωに、Δωｉ）符号化器１４に送出され、ここ
において非線形量子化手法により符号化される。第５図
を用いて説明したとおり、隣接する線スペクトル周波数
の周波数差Δω１（ｉ＝＝１〜ｎ−１）の小さい線スペ
クトル周波数領域においては音声のスペクトル包絡の分
析データが高密度で分布しており、またΔωｉの大きい
線スペクトル周波数領域においては対照的に前記分析デ
ータの１０− 分布密度は低い。従って、前記音声のスペクトル包絡の
分析データの近似度を保持するためには、Δωｉの小さ
い線スペクトル周波数領域と△ωｉの大きい線スペクト
ル周波数領域とを区分して、分析データの分布密度に対
応してこのΔωｉを忠実に音声合成側６２に伝送する必
要がある。音声スペクトル包絡の分析データについては
、実験的に隣接する線スペクトル周波数の周波数差Δω
ｉの線スペクトル周波数帯における統計的分布データが
分っているので、この分布データを参照し、て前記周波
数差△ωＩＫ対する量子化精度を制御する。即ち、周波
数差の小さい線スペクトル周波数領域における△ωｉに
ついては精度高く量子化し、一方円波数差の大きい線ス
ペクトル周波数領域におけるΔωｉについては相対的に
精度を下げて量子化するとい手法で符号化する。この線
スペクトルパラメータ（ωに、Δωｉ）符号化器の特徴
は、特定の線スペクトル周波数ωにと周波数差△ωｉと
を非線形量子化する点にあり、本発明の主要要件である
。線スペクトルパラメータ（ωに、Δωｉ）符号化器１
４によシ非線形量子化された前記ωにおよびΔωｉは音
声のスペクトル包絡に拘わる分析データとしてデータ伝
送線路１７を介して音声合成側６２に伝送される。また
音源パラメータ分析器１５は前記入力信号からピッチ周
波数、有声無声判別信号、短時間平均電力を含む音源情
報を抽出して音源情報符号化器１６に出力する。音源情
報符号化器１６は前記音源情報を量子化してデータ伝送
手段１７を介して音声合成側６２に伝送する。On the analysis side 61, an audio input signal is input from a terminal 93 or 9-. The linear prediction analyzer 12 analyzes and extracts coefficients related to waveform reproducibility such as autocorrelation coefficients and covariance coefficients from the input signal, and calculates the spectral parameters (ω, ΔωI
) is output to the calculator 13. Line spectrum parameter (ω
, Δωi) The calculator 13 calculates the multilinear prediction coefficient (αi) by solving a normal equation from the coefficients related to the waveform reproducibility.
That is, the α parameter is determined, and from this α parameter, the predetermined line spectrum frequency ωk (k is a predetermined positive integer) and the frequency difference Δω between all adjacent line spectrum frequencies are calculated.
1 (i-1 to n-1, n is a predetermined positive integer). The calculated ω and Δωi are sent to the line spectrum parameter (ω, Δωi) encoder 14, where they are encoded using a nonlinear quantization technique. As explained using Fig. 5, in the line spectral frequency region where the frequency difference Δω1 (i==1 to n-1) between adjacent line spectral frequencies is small, the analysis data of the speech spectral envelope is distributed with high density. In contrast, the 10-distribution density of the analytical data is low in the line spectral frequency region where Δωi is large. Therefore, in order to maintain the degree of approximation of the analysis data of the voice spectral envelope, it is necessary to divide the line spectrum frequency region with small Δωi and the line spectrum frequency region with large Δωi to correspond to the distribution density of the analysis data. It is necessary to faithfully transmit the lever Δωi to the speech synthesis side 62. For the analytical data of the audio spectrum envelope, we experimentally calculated the frequency difference Δω between adjacent line spectrum frequencies.
Since the statistical distribution data in the line spectrum frequency band of i is known, the quantization precision for the frequency difference ΔωIK is controlled with reference to this distribution data. In other words, Δωi in the line spectrum frequency domain where the frequency difference is small is quantized with high precision, while Δωi in the line spectrum frequency domain where the circular wave number difference is large is quantized with relatively lower precision. . The feature of this line spectrum parameter (ω, Δωi) encoder is that it nonlinearly quantizes a specific line spectrum frequency ω and the frequency difference Δωi, which is the main requirement of the present invention. Line spectrum parameters (ω, Δωi) encoder 1
The nonlinearly quantized ω and Δωi according to 4 are transmitted to the speech synthesis side 62 via the data transmission line 17 as analysis data related to the spectral envelope of the speech. Further, the sound source parameter analyzer 15 extracts sound source information including the pitch frequency, voiced/unvoiced discrimination signal, and short-time average power from the input signal, and outputs it to the sound source information encoder 16 . The sound source information encoder 16 quantizes the sound source information and transmits it to the speech synthesis side 62 via the data transmission means 17.

段１７を介して伝送されてきた前記所定の線スペクトル
周波数ωにおよび周波数差Δωｉの符号化信号を線スペ
クトルパラメータ（ωに、Δωｉ）復号化器１８におい
て復号し、Ｋパラメータ計算器１９に出力する。Ｋパラ
メータ計算器１９においては、前記ωにおよび△ωｉか
ら線形予測係数αパラメータを算出し、更にとのαパラ
メータよ、！７にパラメータを抽出して、このにパラメ
ータによシ音声合成フィルタ２２の係数を制御する。一
方、音源情報については、伝送線路１７を介して伝送さ
れてきた前記音源情報の符号化信号を音源情報復号化器
２０において復号し、励振信号発生器２１に出力する。The encoded signal of the predetermined line spectrum frequency ω and the frequency difference Δωi transmitted through the stage 17 is decoded by the line spectrum parameter (ω, Δωi) decoder 18 and output to the K parameter calculator 19. do. In the K parameter calculator 19, the linear prediction coefficient α parameter is calculated from the above-mentioned ω and Δωi, and the α parameter of ! 7, and the coefficients of the speech synthesis filter 22 are controlled by the parameters. On the other hand, regarding the sound source information, the coded signal of the sound source information transmitted via the transmission line 17 is decoded by the sound source information decoder 20 and output to the excitation signal generator 21 .

励振信号発生器２１は前記音源情報を入力してフィルタ
励振信号を発生し音声合成フィルタ２２に供給する。音
声合成フィルタ２２はにパラメータ計算器１９よ多入力
された前記にパラメータと励振信号発生器２１から供給
されたフィルタ励振信号とによシ音声を合成し端子９４
より出力する。The excitation signal generator 21 inputs the sound source information, generates a filter excitation signal, and supplies it to the speech synthesis filter 22 . The voice synthesis filter 22 synthesizes voice using the parameters inputted to the parameter calculator 19 and the filter excitation signal supplied from the excitation signal generator 21, and outputs the voice to the terminal 94.
Output from

次に本発明の第２の実施例について、第３図を参照して
説明する。図において分析側７１については前記第１の
実施例の場合と同様である。合成側については非線形量
子化手法により符号化された前記ωにおよびΔωｉの音
声のスペクトル包絡に拘わる分析データと前記音源情報
の符号化信号とがデータ伝送手段２８を介して合成側７
２に伝送され、その中のωにおよびΔωｉは線スペクト
ルパラメータ（ωに、Δωｉ）復号化器２９において復
号されαパラメータ計算器３０に出力される。αパー１
３− ラメータ計算器３０においては、前記ωにおよびΔωｉ
から線形予測係数αパラメータを算出し、このαパラメ
ータにより音声合成フィルタ３３ｔｖｉ４数を制御する
。一方、音源情報については、分析側７１より伝送され
てきた前記音源情報の符号化信号は音声情報復号化器３
１において復号され励振信号発生器３２に出力する。励
振信号発生器３２は前記音源情報を入力してンイルタ励
振信号を発生し音声合成フィルタ３３に供給する。音声
合成フィルタ３３はαパラメータ計算器３０よ多入力さ
れた前記αパラメータと励振信号発生器３２から供給さ
れたフィルタ励振信号とにより音声を合成し端子９６よ
り出力する。Next, a second embodiment of the present invention will be described with reference to FIG. In the figure, the analysis side 71 is the same as in the first embodiment. On the synthesis side, the analysis data related to the spectral envelope of the audio of ω and Δωi encoded by the nonlinear quantization method and the encoded signal of the sound source information are sent to the synthesis side 7 via the data transmission means 28.
2, among which ω and Δωi are decoded by a line spectrum parameter (ω, Δωi) decoder 29 and output to an α parameter calculator 30. α par 1
3- In the parameter calculator 30, for said ω and Δωi
A linear prediction coefficient α parameter is calculated from the α parameter, and the number of speech synthesis filters 33tvi4 is controlled by this α parameter. On the other hand, regarding the sound source information, the encoded signal of the sound source information transmitted from the analysis side 71 is transmitted to the sound information decoder 3.
1 and output to the excitation signal generator 32. The excitation signal generator 32 inputs the sound source information, generates an inverter excitation signal, and supplies it to the speech synthesis filter 33. The voice synthesis filter 33 synthesizes voice using the α parameters inputted from the α parameter calculator 30 and the filter excitation signal supplied from the excitation signal generator 32, and outputs the synthesized voice from the terminal 96.

次に本発明の第３の実施例について、第４図を参照して
説明する。図において分析側８１については前記第１お
よび第２の実施例の場合と同様である。合成側について
は非線形量子化手法によシ符号化された前記ωにおよび
Δωｌの音声のスペクトル包絡に拘わる分析データと前
記音源情報の符号化信号とがデータ伝送手段３９を介し
て合成側１４− ８２に伝送され、その中のωにおよびΔωｉは線スペク
トルパラメータ（ωに、Δωｌ）復号化器４０において
復号されＬＳＰパラメータ計算器４１に出力する。ＬＳ
Ｐパラメータ計算器４１においては、前記ωにおよびΔ
ωｉから線形予測係数αパラメータを算出し、必要に応
じて算出されたαパラメータの次数を変換して、しかる
後この次数変換されたαパラメータからＬＳＰパラメー
タを算出し、このＬＳＰパナメータにより音声合成フィ
ルタ４４の次数を制御する。一方、音源情報については
、分析側８１よシ伝送されてきた前記音源情報の符号化
信号は音源情報復号化器４２において復号されて励振信
号発生器４３に出力する。励振信号発生器４３は前記音
源情報を入力してフィルタ励振信号を発生し音声合成フ
ィルタ４４に供給する。Next, a third embodiment of the present invention will be described with reference to FIG. In the figure, the analysis side 81 is the same as in the first and second embodiments. On the synthesis side, the analysis data related to the spectral envelope of the voice of ω and Δωl encoded by the non-linear quantization method and the encoded signal of the sound source information are sent via the data transmission means 39 to the synthesis side 14- 82, among which ω and Δωi are decoded by the line spectrum parameter (ω, Δωl) decoder 40 and output to the LSP parameter calculator 41. L.S.
In the P parameter calculator 41, the above ω and Δ
A linear prediction coefficient α parameter is calculated from ωi, the order of the calculated α parameter is converted as necessary, and an LSP parameter is calculated from the order-converted α parameter. Controls the order of 44. On the other hand, regarding the sound source information, the encoded signal of the sound source information transmitted from the analysis side 81 is decoded by the sound source information decoder 42 and output to the excitation signal generator 43. The excitation signal generator 43 inputs the sound source information, generates a filter excitation signal, and supplies it to the speech synthesis filter 44 .

音声合成フィルタ４４はＬＡＰパラメータ計算器４１よ
り入力された前記ＬＳＰパラメータと励振信号発生器４
３から供給されたフィルタ励振信号とによシ音声を合成
し端子９８よシ出力する。The speech synthesis filter 44 uses the LSP parameters input from the LAP parameter calculator 41 and the excitation signal generator 4.
The synthesized voice is synthesized with the filter excitation signal supplied from 3, and outputted from a terminal 98.

以上詳細に説明したように、本発明の線スペクトル型音
声分析合成装置は、分析側から合成側に伝送する線形予
測係数に拘わるデータの様態として、従来の線形量子化
手法により符号化された線スペクトル周波数（ωｉ）即
ちＬＳＰパラメータによる代シに、非線形量子化手法に
よシ符号化された一つの所定の線スペクトル周波数ωに
と凡ての隣接する線スペクトル周波数間の周波数差Δω
ｉを用いることにより、量子化歪を低減し且つ量子化効
率を向上できるという効果がある。As explained in detail above, the line spectrum type speech analysis and synthesis device of the present invention uses lines encoded by the conventional linear quantization method as data related to linear prediction coefficients to be transmitted from the analysis side to the synthesis side. The spectral frequency (ωi), i.e. the frequency difference Δω between one given line spectral frequency ω and all adjacent line spectral frequencies encoded by a non-linear quantization technique in place of the LSP parameters.
The use of i has the effect of reducing quantization distortion and improving quantization efficiency.

[Brief explanation of the drawing]

第１図は従来の線スペクトル型音声分析合成装置のブロ
ック図、第２図、第３図、第４図はそれぞれ本発明の第
１．第２．第３の実施例のブロック図、第５図は線スペ
クトル周波数の時間分布説明図である。図において、１．１２，２３．３４・・・・・・線形予測分析器、２
・・・・・・ＬＳＰパラメータ計算器、３・・・・・・
ＬＡＰパラメータ符号化器、４，１５，２６．３７・・
・・・・音源パラメータ分析器、５，１６，２７．３８
・・・・・・音源情報符号化器、６．１７，２８．３９
・・・・・・データ伝送手段、７・・・・・・ＬＡＰパ
ラメータ復号化器、８．１９・・・・・・Ｋパラメータ
計算器、９，２０，３１゜４２・・・・・・音源情報復
号化器、１０，２１，３２゜４３・・・・・・励振信号
発生器、１１，２２，３３．４４・・・・・・音声合成
フィルタ、１３，２４．３５・・・・・・線スペクトル
パラメータ（ωに、Δωｉ）計算器、１４゜２５．３６
・・・・・・線スペクトルパラメータ（ωに、ΔωＤ符
号化器、１８，２９．４０・・・・・・線スペクトルパ
ラメータ（ωに、Δωｉ）復号化器、３０・・・・・・
αパラメータ計算器、４１・・・・・・ＬＡＰパラメー
タ計算器、５１，６１，７１．８１・・・・・・分析側
、５２゜６２．７２．８２・・・・・・合成側、９１〜
９８・・・・・・端　１７− 噂　　！　　　句　　　６　　　ゝ ←　　Ｎ　　匂　　０　９手続補正書−発）特許庁長官　殿 ■、事件の表示　　　昭和５７年特許　願第０８０９６
３号２、発明の名称　　　線スペクトル型音声分析合成
装置３、補正をする者事件との関係　　　　　　　出　願　人東京都港区芝五
丁目３３番１号（４２３）　　　Ｂ本電気株式会社代表者　関本忠弘４、代理人５、補正の対象明細書の「発明の詳細な説明」の欄６、補正の内容第１５頁、第１９行目と第２０行目の間に次の文章を挿
入する。［なお、線スペクトル周波数間の周波数差を次
々と量子化し、これを復号することにより生じる量子化
歪の蓄積は以下の方法で容易に除去し得る。即ち、初め
にΔω、を量子化する。 Δω１を量子化した結果発生した歪をδ、とする。次にΔω、を量子化する代りに（Δω２−δ１）を量子
化する。（Δω、−δ、）を量子化した結果発生した歪
をδ、とする。以下同様に（Δωｎ−１−δｎ−２）ま
で量子化する。」FIG. 1 is a block diagram of a conventional line spectrum type speech analysis and synthesis device, and FIGS. Second. FIG. 5, a block diagram of the third embodiment, is an explanatory diagram of the time distribution of line spectrum frequencies. In the figure, 1.12, 23.34...Linear prediction analyzer, 2
・・・・・・LSP parameter calculator, 3・・・・・・
LAP parameter encoder, 4, 15, 26. 37...
...Sound source parameter analyzer, 5, 16, 27.38
...Sound source information encoder, 6.17, 28.39
...... Data transmission means, 7... LAP parameter decoder, 8.19... K parameter calculator, 9, 20, 31° 42... Sound source information decoder, 10, 21, 32° 43...Excitation signal generator, 11, 22, 33.44...Speech synthesis filter, 13, 24.35... ...Line spectrum parameter (ω, Δωi) calculator, 14°25.36
...Line spectrum parameter (ω, ΔωD encoder, 18, 29.40...Line spectrum parameter (ω, Δωi) decoder, 30...
α parameter calculator, 41... LAP parameter calculator, 51, 61, 71.81... Analysis side, 52°62.72.82... Synthesis side, 91 ~
98... Edge 17- Rumor! Clause 6 ゝ← N Odor 0 9 Procedural Amendment - Issued by Mr. Commissioner of the Patent Office ■, Indication of the case 1982 Patent Application No. 08096
No. 3 No. 2, Title of the invention: Line spectrum type speech analysis and synthesis device 3, Relationship to the amended case Applicant: 5-33-1 Shiba, Minato-ku, Tokyo (423) B Hon Electric Co., Ltd. Representative: Tadahiro Sekimoto 4. Agent 5, insert the following sentence between lines 19 and 20 on page 15 of the contents of the amendment in column 6 of the "Detailed Description of the Invention" of the specification to be amended. [Note that the accumulation of quantization distortion caused by sequentially quantizing and decoding frequency differences between line spectral frequencies can be easily removed by the following method. That is, Δω is first quantized. Let δ be the distortion generated as a result of quantizing Δω1. Next, instead of quantizing Δω, (Δω2−δ1) is quantized. Let δ be the distortion generated as a result of quantizing (Δω, −δ,). Thereafter, quantization is similarly performed up to (Δωn-1−δn-2). ”

Claims

[Scope of Claims] (1) In a line spectrum type analysis and synthesis device for analyzing and synthesizing speech by extracting line spectrum frequencies from linear prediction coefficients obtained from an input signal by linear prediction analysis, Line spectrum parameter extraction means for analyzing and extracting line spectrum parameters including a specific line spectrum frequency and frequency differences between all adjacent line spectrum frequencies; and nonlinear quantization of signals related to the line spectrum parameters. a decoding means for decoding the encoded signal that has been nonlinearly quantized by the encoding means;
A line spectrum type voice characterized in that the voice synthesis side is equipped with a voice synthesis means for extracting a linear prediction coefficient from the decoded signal related to the line spectrum parameter, controlling a voice synthesis filter, and outputting a synthesized voice waveform. Analysis and synthesis equipment. (2. In the device described in claim (1),
A line spectrum type speech analysis supplementary device characterized in that a parameter is used as the linear prediction coefficient for controlling the speech synthesis filter included in the speech synthesis means. (3) Range of % allowable rust In the equipment described in paragraph (1),
A line spectrum type speech analysis and synthesis device, characterized in that an α parameter is used as the linear prediction coefficient for controlling the speech synthesis filter included in the speech synthesis means. (4) In the device according to claim (1),
A line spectrum type speech analysis and synthesis device, characterized in that an L8P parameter is used as the linear prediction coefficient for controlling the speech synthesis filter included in the speech synthesis means.