JPS63210899A

JPS63210899A - Voice synthesizer

Info

Publication number: JPS63210899A
Application number: JP4312087A
Authority: JP
Inventors: 利光蓑輪
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1987-02-27
Filing date: 1987-02-27
Publication date: 1988-09-01

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、入力した文字列を、恰も人間が読み上げたか
のように音声化する音声合成装置に関するもので１本発
明の音声合成装置は、ワードプロセッサ等に入力した文
字列の読合せ、盲人の読書用等に利用される。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a speech synthesis device that converts an input character string into a voice as if it were read out by a human. It is used for reading aloud character strings entered in computers, etc., and for reading by blind people.

（従来の技術）第４図は従来の音声合成装置の構成を示すもので、１は
、声道の特性を共振・反共振であられした声道パラメー
タがファイルされた声道パラメータファイルで、声道パ
ラメータは、約１０ｍ５毎に音声を分析して得たところ
のホルマント周波数及びバンド幅の情報や、スペクトル
を線スペクトル化したＬＳＰパラメータ等で構成されて
いる。２は、文字列が入力すると、それ等の文字列に含
まれる音節の声道パラメータを声道パラメータファイル
１から選択して、時間的に配列した上、補間計算によっ
て算出した声道パラメータを各音節間の補間区間に挿入
して、それ等の音節を結合する声道パラメータ結合部、
３は、アンプ制御情報が入力すると、パルス列及び白色
雑音の振幅を決定するアンプ計算部、４は、抑揚制御情
報が入力すると、パルス列のパルス間隔を決定する抑揚
計算部、５はアンプ計算部３で決定された振幅及び抑揚
計算部で決定されたパルス間隔に基づいてパルス列を出
力するパルス列発生部、６はアンプ計算部３で決定され
た振幅に基づいて白色雑音を出力する白色雑音発生部、
７はパルス列及び白色雑音が声道に入力したときの声道
中の透過波及び反射波を計算することにより、口唇から
の透過波として所望の音声信号を得る音響計算部で、こ
の音響計算部７はデジタル計算機で構成されている。８
は音響計算部７から出力されたデジタル音声信号をアナ
ログ音声信号に変換するＤ／Ａコンバータ、９はアナロ
グ音声信号によって駆動されるスピーカである。(Prior art) Fig. 4 shows the configuration of a conventional speech synthesizer. 1 is a vocal tract parameter file containing vocal tract parameters obtained by comparing vocal tract characteristics with resonance and anti-resonance; The road parameters are composed of information on formant frequency and bandwidth obtained by analyzing speech every 10 m5, LSP parameters obtained by converting the spectrum into a line spectrum, and the like. 2, when character strings are input, the vocal tract parameters of the syllables included in those character strings are selected from the vocal tract parameter file 1, arranged temporally, and the vocal tract parameters calculated by interpolation calculation are a vocal tract parameter combining unit that is inserted into an interpolation interval between syllables and combines those syllables;
3 is an amplifier calculation unit that determines the amplitude of the pulse train and white noise when the amplifier control information is input; 4 is an intonation calculation unit that determines the pulse interval of the pulse train when the intonation control information is input; 5 is the amplifier calculation unit 3 a pulse train generation section that outputs a pulse train based on the amplitude determined by the amplitude and intonation calculation section 3, and a white noise generation section 6 that outputs white noise based on the amplitude determined by the amplifier calculation section 3;
7 is an acoustic calculation unit which obtains a desired audio signal as a transmitted wave from the lips by calculating transmitted waves and reflected waves in the vocal tract when a pulse train and white noise are input to the vocal tract; 7 consists of a digital computer. 8
9 is a D/A converter that converts the digital audio signal output from the acoustic calculation unit 7 into an analog audio signal, and 9 is a speaker driven by the analog audio signal.

このように構成された従来例において、文字列、アンプ
制御情報及び抑揚制御情報が入力すると、アンプ計算部
３は、アンプ補間法で生成したアンプパラメータにより
、先行音節の母音終端部と後続音節の子音部先端部との
間の補間区間を一様に直線補間していた（第５図参照）
。In the conventional example configured in this way, when a character string, amplifier control information, and intonation control information are input, the amplifier calculation unit 3 calculates the vowel final part of the preceding syllable and the vowel final part of the following syllable using the amplifier parameters generated by the amplifier interpolation method. The interpolation interval between the tip of the consonant and the tip of the consonant was uniformly linearly interpolated (see Figure 5).
.

（発明が解決しようとする問題点）このため、各音節の結合時に、声道が早めに変化したり
、遅めに変化したりする音節では、音韻が劣化するとい
う問題があった。(Problems to be Solved by the Invention) Therefore, when syllables are combined, syllables in which the vocal tract changes early or changes late have a problem in that the phoneme deteriorates.

本発明は、このような問題に鑑みてなされたもので、補
間区間における各音節間で特徴的なアンプパラメータの
変化を模擬できる音声合成装置を提供することを目的と
している。The present invention has been made in view of these problems, and it is an object of the present invention to provide a speech synthesis device that can simulate characteristic changes in amplifier parameters between each syllable in an interpolation interval.

（問題点を解決するための手段）本発明は、前述の目的を達成するために、文字列を音声
化するときの各音節間の補間区間におけるアンプパラメ
ータを、先行音節終端部のアンプパラメータと後続音節
先頭部のアンプパラメータとによって決められた非線形
の補間関数によって補間演算して決めるようにしたもの
である。(Means for Solving the Problems) In order to achieve the above-mentioned object, the present invention sets the amplifier parameters in the interpolation interval between each syllable when converting a character string into sounds to be the same as the amplifier parameters at the end of the preceding syllable. This is determined by interpolation calculation using a non-linear interpolation function determined by the amplifier parameter of the beginning of the following syllable.

（作　用）本発明によれば、声道の早めの変化或いは遅めの変化を
非線形の補間関数で演算することにより、音節間の過渡
音を自然音声に近づけることができるようになる。(Function) According to the present invention, by calculating early or late changes in the vocal tract using a nonlinear interpolation function, it is possible to make transient sounds between syllables closer to natural speech.

（実施例）以下、図面を参照しながら、本発明の実施例を詳細に説
明する。(Example) Hereinafter, an example of the present invention will be described in detail with reference to the drawings.

第１図は本発明の一実施例の構成を示し、第２図は本発
明の一実施例におけるアンプ計算部の構成を示すもので
、第４図の符号と同一符号のものは同一部分を示してお
り、又、ＩＯは、先行音節終端部のアンプパラメータを
一時記憶するアンプパラメータ格納部１１と、音節毎に
決まったアンプパラメータ補間用の非線形の補間関数を
ファイルした補間関数ファイル１２と、音節毎のアンプ
パラメータをファイルしたアンプパラメータファイル１
３と、先行音節終端部のアンプパラメータをアンプパラ
メータ格納部１１から読み出すと共に、入力した後続の
音節に対応するアンプパラメータをアンプパラメータフ
ァイル１３から選択し、先行音節終端部のアンプパラメ
ータとその後続音節のアンプパラメータとに対応するア
ンプパラメータ補間用の非線形の補間関数を補間関数フ
ァイル１２から選択して演算し、補間区間のアンプパラ
メータを決定する補間計算部１４と、この補間計算部１
４で演算された補間区間のアンプパラメータを一時記憶
する補間アンプパラメータ格納部１５とからなるアンプ
パラメータ補間部である。Fig. 1 shows the configuration of an embodiment of the present invention, and Fig. 2 shows the structure of an amplifier calculation section in an embodiment of the invention. In addition, the IO includes an amplifier parameter storage unit 11 that temporarily stores amplifier parameters at the end of the preceding syllable, and an interpolation function file 12 that stores a nonlinear interpolation function for interpolating amplifier parameters determined for each syllable. Amplifier parameter file 1 containing amplifier parameters for each syllable
3, reads the amplifier parameter at the end of the preceding syllable from the amplifier parameter storage unit 11, selects the amplifier parameter corresponding to the input subsequent syllable from the amplifier parameter file 13, and reads the amplifier parameter at the end of the preceding syllable and the following syllable. an interpolation calculation unit 14 that selects and calculates a nonlinear interpolation function for amplifier parameter interpolation corresponding to the amplifier parameters from the interpolation function file 12 and determines the amplifier parameters of the interpolation interval;
This amplifier parameter interpolation section includes an interpolation amplifier parameter storage section 15 that temporarily stores the amplifier parameters of the interpolation interval calculated in step 4.

尚、補間関数ファイル１２にファイルされている非線形
の補間関数は次の通りである。The nonlinear interpolation functions stored in the interpolation function file 12 are as follows.

ａ　ｉ　（ｎ）＝　（１−７（ｎｔＬｃ２））　ａ　ｔ
　（ｎｅ＊ｃｔ）＋／（ｎｔＬｃｚ）　ａ　ｔ　（ｎｓ
＊ｃＪ・・・・・（１）但し、　ａ、（ｎ）　　：　補間部のアンプパラメータ
ａｉ（ｎ、、Ｃ工）：　先行音節Ｃ工の終端部アンプパ
ラメータａ　（（ｎ＠＠ｃ３）　　：　後続音節Ｃ１の
先頭部アンプパラメータｎ＠　：　Ｃ１の終端時点ｎ・二０２の先頭時点ｎ　：　サンプリング時点（ｎｅを起点とする）ｌ　：
　補間区間長／（ｎｔｌｔｃｂ）　：　音節Ｃ２毎に定められた非線
形の補間関数且つ、単調増加関数尚、　／（ｎ＋Ｌｅｂ）は自然音声中のアンプパラメー
タ変形データから統計的に抽出されるもので、（２）式
以外の制限はつけない。a i (n) = (1-7(ntLc2)) a t
(ne*ct)+/(ntLcz) a t (ns
*cJ・・・・・・(1) However, a, (n): Amplifier parameter ai(n,,C) of the interpolation part: Amplifier parameter a of the terminal part of the preceding syllable C ((n@@c3): Amplifier parameter at the beginning of the subsequent syllable C1 n@: End time n of C1, start time n of 202: Sampling time (starting from ne) l:
Interpolation interval length/(ntltcb): Nonlinear interpolation function and monotonically increasing function determined for each syllable C2. /(n+Leb) is statistically extracted from the amplifier parameter deformation data in natural speech, and ( 2) No restrictions other than the formula are added.

このように構成された本実施例では、文字が入力すると
、補間計算部１０は、１文字前に入力した先行音節終端
部のアンプパラメータをアンプパラメータ格納部１１か
ら読み出すと共に、金入力した音節に対応するアンプパ
ラメータをアンプパラメータファイル１３から選択し、
先行音節終端部のアンプパラメータと後続音節先頭部の
アンプパラメータとに対応するアンプパラメータ補間用
の非線形の補間関数を補間関数ファイル１２から選択し
て、（１）式を演算することにより、各サンプリング時
点における補間区間のアンプパラメータを決定し。In this embodiment configured as described above, when a character is input, the interpolation calculation section 10 reads out the amplifier parameter of the last part of the preceding syllable inputted one character ago from the amplifier parameter storage section 11, and also applies the amplifier parameter to the inputted syllable. Select the corresponding amplifier parameter from the amplifier parameter file 13,
By selecting a nonlinear interpolation function for amplifier parameter interpolation corresponding to the amplifier parameter at the end of the preceding syllable and the amplifier parameter at the beginning of the following syllable from the interpolation function file 12 and calculating equation (1), each sampling Determine the amplifier parameters of the interpolation interval at the time point.

その補間区間のアンプパラメータを補間アンプパラメー
タ格納部１５に一時記憶させる。そして、アンプパラメ
ータ補間部１０で決定された非線形なアンプパラメータ
を各音節間の補間区間に挿入して、それ等の音節を結合
する〔第３図（ａ）及び（ｂ）参照〕。The amplifier parameters of the interpolation section are temporarily stored in the interpolation amplifier parameter storage section 15. Then, the nonlinear amplifier parameters determined by the amplifier parameter interpolation section 10 are inserted into the interpolation interval between each syllable, and the syllables are combined (see FIGS. 3(a) and 3(b)).

尚、本実施例において、／（ｎ、ｊ！、ｃ＆）はｃｋ−
□（先行音節）と無関係であるとしているが、ｃｈ−ｉ
までも考慮に入れると、アンプパラメータ変化の近似精
度が更に向上する。In this example, /(n, j!, c&) is ck-
Although it is said that it is unrelated to □ (preceding syllable), ch-i
Taking this into account will further improve the approximation accuracy of amplifier parameter changes.

（発明の効果）以上説明したように、本発明によれば、音節間の補間区
間のアンプパラメータの変化が自然音声に近くなって、
従来の音声合成装置よりも音韻性が良好になるという効
果がある。(Effects of the Invention) As explained above, according to the present invention, the change in amplifier parameters in the interpolation interval between syllables becomes close to that of natural speech,
This has the effect of providing better phonology than conventional speech synthesizers.

[Brief explanation of the drawing]

第１図は本発明の一実施例の構成図、第２図は本発明の
一実施例におけるアンプ計算部の構成図、第３図（ａ）
及び（ｂ）は本発明の一実施例によるアンプパラメータ
の補間法の説明図、第４図は従来の音声合成装置の構成
図、第５図は従来の音声合成装置によるアンプパラメー
タの補間法の説明図である。１　・・・声道パラメータファイル、　２　・・・声道
パラメータ結合部、　４　・・抑揚計算部、　５　・・
・パルス列発生部、　６　・・・白色雑音発生部、　７
　・・・音響計算部、　８　・・Ｄ／Ａコンバータ、　
９　・・・　スピーカ、　１０・・・アンプ計算部、１
１・・・アンプパラメータ格納部、１２・・・補間関数
ファイル、１３・・・アンプパラメータファイル、　１
４・・・補間計算部、１５・・・補間アンプパラメータ
格納部。第１図第２図、１０第３図第４図第５図時閉（ｍｓｌFIG. 1 is a block diagram of an embodiment of the present invention, FIG. 2 is a block diagram of an amplifier calculation section in an embodiment of the present invention, and FIG. 3(a)
and (b) is an explanatory diagram of an amplifier parameter interpolation method according to an embodiment of the present invention, FIG. 4 is a block diagram of a conventional speech synthesis device, and FIG. 5 is an illustration of an amplifier parameter interpolation method by a conventional speech synthesis device. It is an explanatory diagram. 1...Vocal tract parameter file, 2...Vocal tract parameter combination unit, 4...Intonation calculation unit, 5...
・Pulse train generation section, 6...White noise generation section, 7
...Acoustic calculation section, 8 ...D/A converter,
9...Speaker, 10...Amplifier calculation section, 1
1... Amplifier parameter storage section, 12... Interpolation function file, 13... Amplifier parameter file, 1
4... Interpolation calculation unit, 15... Interpolation amplifier parameter storage unit. Figure 1 Figure 2, 10 Figure 3 Figure 4 Figure 5 Closed (msl)

Claims

[Claims]

In a speech synthesis device that vocalizes an input character string, the amplifier parameter in the interpolation interval between each syllable when vocalizing the character string is determined by the amplifier parameter at the end of the preceding syllable and the amplifier parameter at the beginning of the following syllable. A speech synthesis device characterized by comprising means for performing interpolation calculation and determination using a predetermined nonlinear interpolation function.