JPH0833747B2

JPH0833747B2 - Sound synthesis method

Info

Publication number: JPH0833747B2
Application number: JP62091705A
Authority: JP
Inventors: 典雄須田
Original assignee: Meidensha Corp
Current assignee: Meidensha Corp
Priority date: 1987-04-14
Filing date: 1987-04-14
Publication date: 1996-03-29
Anticipated expiration: 2011-03-29
Also published as: JPS63257000A

Description

【発明の詳細な説明】 A. 産業上の利用分野本発明は人間の音声又は楽器の音の合成に関する。DETAILED DESCRIPTION OF THE INVENTION A. INDUSTRIAL FIELD OF APPLICATION The present invention relates to the synthesis of human speech or musical instrument sounds.

B. 発明の概要本発明は、音響管の断面積変化における音響伝搬の現
象が、送電線等の電気回路における系統過渡現象に近似
していることに着目し、音響管の断面積変化をサージイ
ンピーダンス変化に対応させてサージインピーダンスの
可変により音の合成をするようにしたものである。B. Summary of the Invention The present invention focuses on the fact that the phenomenon of sound propagation in a change in the cross-sectional area of an acoustic tube is similar to a system transient phenomenon in an electric circuit such as a power transmission line, and the change in the cross-sectional area of the acoustic tube is surged. It is designed to synthesize sounds by varying surge impedance in response to impedance changes.

また管楽器等の音響管の長さの変化による音の合成
は、インピーダンス結合部に遅延回路を設けて、その長
さ変化に対応した遅延定数を変化させて合成できるよう
にした音の合成方法である。To synthesize sounds by changing the length of the acoustic tube of a wind instrument, etc., a delay circuit is provided in the impedance coupling section, and the delay constant corresponding to the change in length can be changed to synthesize the sound. is there.

C. 従来の技術音声合成やミュージックシンセサイザー（電子楽器）
等の所謂音を人工的に合成して出力する電子装置は、最
近になって１ないし数チップの音声認識や音声合成のLS
Iが音声情報処理と半導体の大規模集積回路技術により
低価格で実現されるようになり、その使用目的，制約条
件により種々の方式が提案されている。この音声合成に
は、人間の発生した生の音声を録音しておき、これを適
当に結合して文章に編集する録音編集方式と、人間の声
を直接的には利用せず、人間の音声のパラメータだけを
抽出し、音声合成過程で、そのパラメータを制御して人
工的に音声信号を作り出す方法がある。C. Conventional technology Speech synthesis and music synthesizer (electronic musical instrument)
Recently, electronic devices that artificially synthesize and output so-called sounds such as LS have been used for LS for voice recognition and voice synthesis of one or several chips.
I has come to be realized at a low price by means of voice information processing and semiconductor large-scale integrated circuit technology, and various methods have been proposed depending on the purpose of use and constraints. In this speech synthesis, a raw voice generated by a human being is recorded, and the recording and editing method in which the raw voice is appropriately combined and edited into a sentence, and the human voice is not directly used. There is a method of extracting only the parameter of, and controlling the parameter in the voice synthesis process to artificially generate a voice signal.

このパラメータ方式で良質な合成音が得られることで
広く利用されているパーコール（PARCOR）方がある。There is a PARCOR method that is widely used because it produces high quality synthesized sounds with this parameter method.

音声を電子計算機で扱う場合、音声波形をある周期毎
にサンプリングして各サンプリング点での音声信号の値
をアナログ／ディジタル変換し、その値を０と１の符号
で表示して行われるが、アナログ信号に忠実な記録をす
るには、ビット数を増やす必要があるが音声合成信号は
大変多くのメモリーを必要とする。When a voice is handled by an electronic computer, the voice waveform is sampled at a certain cycle, the value of the voice signal at each sampling point is converted from analog to digital, and the value is displayed with a code of 0 and 1. It is necessary to increase the number of bits in order to record faithfully to the analog signal, but the voice synthesis signal requires a very large amount of memory.

そこで、この情報量を極力少なくするために各種の高
能率な符号化法が研究開発されている。Therefore, various highly efficient coding methods have been researched and developed in order to reduce the amount of this information as much as possible.

その方法の１つとして、１つの音声信号の情報に対
し、最低限の１ビットとした方式で、デルタ変調方式が
ある。この方式は、１ビットの使い方として、次にくる
音声信号値が現在の値より高いか低いかを判定して、高
ければ符号“1"、低ければ符号“0"を与え音声信号の符
号化を行うもので、実際のシステム構成としては一定の
振幅ステップ量（デルタ）を定めておき、誤差が蓄積さ
れないように今までの符号化によって得られる音声の値
と、入力してくる音声信号との残差信号に対して、符号
化を行う。As one of the methods, there is a delta modulation method in which a minimum of 1 bit is used for information of one audio signal. This method uses one bit to judge whether the next audio signal value is higher or lower than the current value, and if the value is higher, the code "1" is given, and if the value is lower, the code "0" is given. In the actual system configuration, a fixed amplitude step amount (delta) is set, and the audio value obtained by the encoding up to now and the input audio signal are set so that errors are not accumulated. The residual signal of is encoded.

このような構成を予測コード化といわれ、線形予測法
（何個か前のサンプル値から予測する）およびパーコー
ル方式（線形予測法の予測係数の代わりにパーコール係
数ｋといわれる偏自己相関関数を用いる）がある。Such a configuration is called predictive coding, and a linear prediction method (predicting from several previous sample values) and a Percoll method (a partial autocorrelation function called a Percoll coefficient k instead of the prediction coefficient of the linear prediction method is used. ).

D. 発明が解決しようとする問題点前述のように予測コード化を用いたものは、音と音と
の継ぎ目に相当する調音結合が難しいという問題があ
る。即ち第10図は横軸に音声発生の時間ｔをとり、縦軸
にパーコール係数ｋをとったもので、例えば母音から子
音を経て母音に至る発声において、母音の定常から過渡
を経て子音に至りまた母音の過渡を経て母音の定常音に
至る過程で母音と母音の継ぎ目の音が跡切れ、人間が聞
いたときに自然な感じを与えない。D. Problems to be Solved by the Invention As described above, the method using predictive coding has a problem that articulatory coupling corresponding to a joint between sounds is difficult. That is, in FIG. 10, the horizontal axis represents the sound generation time t, and the vertical axis represents the Percoll coefficient k. Also, in the process of passing a vowel transition to a stationary vowel sound, the seam of the vowel and the vowel sound is cut off, which does not give a natural feeling to humans.

また楽器音合成の場合は、音階の継ぎ目が重要である
が合成手法が実際の楽器の音発生の原理と異なるため、
やはり自然な感じが無く、特に残響音において顕著にあ
らわれる。これら両者において自然な音に近付けるため
には、これを構成するメモリや、演算器等の電子部品を
多く必要とし装置が高価になる等の問題がある。Also, in the case of musical instrument sound synthesis, the seam of the scale is important, but the synthesis method is different from the principle of sound generation of the actual instrument,
After all, there is no natural feeling, and it appears remarkably especially in reverberation sound. In order to bring the sound closer to a natural sound in both cases, there is a problem that a large amount of memory and an electronic component such as an arithmetic unit are required and the apparatus becomes expensive.

E. 問題点を解決するための手段以上の点に鑑み、本発明は人間の音の発生や楽器の楽
音は人間の口腔や音響管の長さや断面積等の形状変化に
よって作り出される。そこで、これら音響管の音波の伝
達を表す進行波現象を音響管等価回路で解析し、音響管
の断面積がサージインピーダンスに反比例することに着
目し、サージインピーダンスを変化させることで断面積
を模擬的に変化させ、サージインピーダンスを連続的変
化することで調音結合をスムーズに行うことができるよ
うにして人間の発声と同様な音の合成を容易となし音声
の自然性の向上を図ったものである。E. Means for Solving the Problems In view of the above points, according to the present invention, the generation of human sounds and the musical sounds of musical instruments are created by shape changes such as the length and cross-sectional area of the human oral cavity and acoustic tube. Therefore, we analyze the traveling wave phenomenon that expresses the transmission of sound waves in these acoustic tubes with an acoustic tube equivalent circuit, and pay attention to the fact that the cross-sectional area of the acoustic tube is inversely proportional to the surge impedance, and simulate the cross-sectional area by changing the surge impedance. It is possible to smoothly synthesize articulation by changing the surge impedance continuously and changing the surge impedance continuously, making it easy to synthesize sounds similar to human speech and improving the naturalness of speech. is there.

また楽器の長さの変化は、遅延回路の段数で模擬し、
断面積変化と相俟ってより自然な楽器音を簡単に実現で
きるようにした。Also, the change in the length of the musical instrument is simulated by the number of stages of the delay circuit,
Together with the change in cross-sectional area, we made it easier to achieve more natural instrument sounds.

F. 作用人間は口腔を動かすことにより、音を発声し、管楽器
は音響管の長さや形状を変化させることによって楽音を
作る。本発明は音響管（人間の声帯から口唇までの声道
も１つの音響管とみなすことができる。）の断面積を等
価回路のサージインピーダンスに１対１に対応させてい
るので、このサージインピーダンスを変化させれば音響
管の断面積を変化させたと同じとなる。このサージイン
ピーダンスの変更は、電気技術的に極めて簡単にできる
ので、人間の音の発生と全く同様な音の合成ができ、特
に従来の問題点とされた音と音の継ぎ目にあたる調音結
合もサージインピーダンスを連続的に変化することで良
好に行われ、自然に近い音の発声ができる。F. Action Humans produce sounds by moving their mouths, and wind instruments make musical sounds by changing the length and shape of acoustic tubes. According to the present invention, the cross-sectional area of the acoustic tube (the vocal tract from the human vocal cord to the lip can be regarded as one acoustic tube) corresponds to the surge impedance of the equivalent circuit one to one. If is changed, it is the same as changing the cross-sectional area of the acoustic tube. Since this surge impedance can be changed very easily in terms of electrotechniques, it is possible to synthesize sounds that are exactly the same as the generation of human sounds. In particular, the articulation coupling that is a problem between sounds and sounds, which is a problem in the past, is also surged. Good performance is achieved by continuously changing the impedance, and it is possible to produce a sound that is close to natural.

また、音響管の長さを変えることは、音波の進行波を
遅らせることあるから電気回路的には遅延回路（メモ
リ）の段数を変えることに相当し遅延回路の定数を調整
することにより極めて簡単に模擬できる。従って断面積
変化と相俟ってより自然な楽器音も簡単に実現できる。Also, changing the length of the acoustic tube is equivalent to changing the number of stages of the delay circuit (memory) because it may delay the traveling wave of the sound wave, and it is extremely simple by adjusting the constant of the delay circuit. Can be simulated. Therefore, in combination with the change in cross-sectional area, a more natural instrument sound can be easily realized.

G. 実施例音声を口から外に放射されるには、音源が必要で、こ
の音源は声帯にによって作り出される。一方声帯は２枚
のヒダを開閉することによって呼気を断続的に止める働
きがあり、その断続によってパフと呼ばれる空気流が発
生し、声帯を緊張させるとこのヒダに張力が加わりヒダ
の開閉の周波数が高くなり、周波数の高いパフ音が発生
する。そして呼気流を大きくすると大きな音となる。G. Example A sound source is required to radiate sound out of the mouth, which is produced by the vocal cords. On the other hand, the vocal cords have the function of intermittently stopping the exhalation by opening and closing two folds, and the intermittent flow creates an air flow called a puff. Becomes higher and a high frequency puff sound is generated. And when the expiratory flow is increased, a loud sound is produced.

この音源波が声道のような円筒状の音響管を通過する
と、開放端から音波は共振現象によりある成分が強調さ
れ、ある成分が減弱し複雑な母音の波形が作り出され
る。音源が同じ波形をもっていても、口唇から放射され
るまでに通過する声道の形によって影響を受ける。即
ち、声道の形状が一定であれば音源のピッチや強度を変
えてもスペクトル包絡はあまり変化しない。声道は母音
によって極めて複雑な形状を示すが、声道があまり変化
しない部分と大きく変化する部分に分けて考えることが
できる。例えば第１図のように長さと断面積がＡ₁,A₂と
それぞれ異なるような２つの音響管が接続したものと仮
定することができる。When this sound source wave passes through a cylindrical acoustic tube such as the vocal tract, a certain component of the sound wave is emphasized by the resonance phenomenon from the open end, and a certain component is attenuated to form a complicated vowel waveform. Even if the sound source has the same waveform, it is affected by the shape of the vocal tract that passes through before it is emitted from the lips. That is, if the shape of the vocal tract is constant, the spectrum envelope does not change much even if the pitch or intensity of the sound source is changed. The vocal tract shows an extremely complicated shape due to vowels, but it can be divided into a part where the vocal tract does not change much and a part where the vocal tract changes greatly. For example, as shown in FIG. ₁ , it can be assumed that two acoustic tubes having different lengths and sectional areas A ₁ and A ₂ are connected.

第１図は音響管モデル図、第２図はその等価回路図
で、断面積がＡ₁,A₂とそれぞれ異なる２つの音響管を接
続した場合である。FIG. 1 is an acoustic tube model diagram, and FIG. 2 is an equivalent circuit diagram thereof, in which two acoustic tubes having different sectional areas A ₁ and A ₂ are connected.

この音響管の接続する面に着目すると、音波の流れは
断面積の異なる場合、その異なる面で音波の一部が反射
するという現象を生ずる。この現象は、電気回路でイン
ピーダンスの異なる線路にインパルス電流を流したとき
の過渡現象と同じである。Focusing on the surface to which the acoustic tube is connected, when the sound waves have different cross-sectional areas, a phenomenon occurs in which a part of the sound wave is reflected on the different surface. This phenomenon is the same as a transient phenomenon when an impulse current is passed through lines having different impedances in an electric circuit.

音声の発生は、前述したように声帯による音源の断続
によって行われるがこれは電気的には、インパルスが断
続的に印加すると等価となる。As described above, the sound is generated by the intermittent sound source by the vocal cords, which is electrically equivalent to the intermittent application of impulses.

音は気体，液体，固体のいずれでも伝わる一種の振動
であるが、電気回路的には、抵抗の無い無損失のLC分布
回路に対応させることができる。そしてこの等価回路の
電気的インピーダンス（V/I）は、となるので、音波の場合に置き換えると音波の速度，空
気密度ρと音速Ｃ′を掛けたρＣ′となり、音場におけ
るインピーダンス即ち音響インピーダンスは気体の質量
と音速だけに依存する。Sound is a kind of vibration that can be transmitted by gas, liquid, or solid, but in terms of electrical circuits, it can be applied to a lossless LC distribution circuit with no resistance. And the electrical impedance (V / I) of this equivalent circuit is Therefore, when the sound wave is replaced, the sound wave velocity, the air density ρ, and the sound velocity C ′ are multiplied by ρC ′, and the impedance in the sound field, that is, the acoustic impedance, depends only on the mass of the gas and the sound velocity.

断面積の異なる音響管が連設されていると、その境界
面で反射が起こる。これは電気的なサージインピーダン
スに模擬することができる。即ち、第１図のような音響
管の断面積の異なるブロックの接続された等価回路は第
２図に置き換えられる。When acoustic tubes with different cross-sectional areas are connected in series, reflection occurs at the boundary surface. This can be simulated as an electrical surge impedance. That is, an equivalent circuit in which blocks having different cross-sectional areas of the acoustic tube as shown in FIG. 1 are connected is replaced with FIG.

ここで、空気密度をρ，音速をＣ′とすれば、各音響
管の音響アドミッタンスY,Y₂は次のように与えられる。Here, if the air density is ρ and the sound velocity is C ′, the acoustic admittances Y and Y ₂ of the respective acoustic tubes are given as follows.

但しＺ₁,Z₂は音響インピーダンス次に隣接ブロックよりの伝搬電流源をＩ₁,I₂とし、こ
れにより決定される電流分布ａ₁,a₂及びｉ₁,i₂および接
合点イの電圧をｅとするとまたａ₁＝ｉ₁＋Ｉ₁,a₂＝ｉ₂＋Ｉ₂ ｉ₁＝ａ₁−Ｉ₁,i₂＝ａ₂−Ｉとなり、次のステップのための隣接ブロックの伝搬電流
源Ｉ₁′,I₂′は、Ｉ₁′＝ｉ₁＋ａ₁,I₂′＝ｉ₂＋ａ₂となる。 Where Z ₁ and Z ₂ are acoustic impedances, and I ₁ and I ₂ are propagation current sources from the adjacent blocks, and the current distributions a ₁ and a ₂ and i ₁ and i ₂ and the voltage at the junction point a are determined by these. Let be e Also, a ₁ = i ₁ + I ₁ , a ₂ = i ₂ + I ₂ i ₁ = a ₁ -I ₁ , i ₂ = a ₂ -I, and the propagation current source I ₁ ′, of the adjacent block for the next step is obtained. I ₂ ′ becomes I ₁ ′ = i ₁ + a ₁ and I ₂ ′ = i ₂ + a ₂ .

上式でｉ₁＝ａ₁−Ｉ₁およびｉ₂＝ａ₂−Ｉ₂を代入して
Ｉ₁′＝2a₁−Ｉ₁,I₂′＝2a₂−Ｉ₁としてもよい。It is also possible to substitute i ₁ = a ₁ -I ₁ and i ₂ = a ₂ -I ₂ in the above formula to obtain I ₁ ′ = 2a ₁ −I ₁ and I ₂ ′ = 2a ₂ −I ₁ .

上記の方式において断面積Ａの時間に対する補間状況
を第７図に示す。ここで最も演算の簡単な直線補間を示
している。FIG. 7 shows the interpolation situation with respect to time of the sectional area A in the above method. Here, the simplest linear interpolation is shown.

第３図は音響管の電気回路等価モデル図で、その
（ア）図は声帯から口唇までの声道を１つの音響管とみ
なした音響管モデル図、（イ）図はその電気回路モデル
図、（ウ）図は進行波等価モデル図を示している。Fig. 3 is an electrical circuit equivalent model diagram of the acoustic tube. Fig. 3 (a) is an acoustic tube model diagram in which the vocal tract from the vocal cords to the lips is regarded as one acoustic tube, and Fig. 3 (a) is its electrical circuit model diagram. , (C) shows a traveling wave equivalent model diagram.

第３図を説明するに先立ち、人間の母音はどのように
して作られるかを説明する。Before explaining FIG. 3, it will be explained how human vowels are created.

第４図は音声発生時の声道の断面積変化を模擬したも
ので、その（ア）図は、「ア」の発声の場合で喉の奥が
狭く口唇が開いた状態で肺から押し出される呼気で声帯
が呼気を断続的に開閉して声道（音響管）の中で反射を
繰り返して出てくる音波が「ア」の音声波形となって出
てくる。「イ」は（イ）図のように喉の方が広く口唇の
先が狭いと「イ」の音声波形が出力される。Fig. 4 simulates the cross-sectional area change of the vocal tract when a voice is generated. Fig. 4 (a) shows the case of "A" when it is pushed out of the lungs with a narrow throat and open lips. The vocal cords open and close during breathing, and the sound waves that are repeatedly reflected in the vocal tract (acoustic tube) emerge as a voice waveform of "A". As shown in (a), “i” has a wide throat and a narrow lip, and the audio waveform of “i” is output.

このように口の恰好で周波数が決まり、口の恰好を模
擬すれば「ア」なり「イ」が発声される。口の恰好は音
響管の断面積で模擬でき、また音響管の断面積の変化
は、サージアドミッタンスの変化で模擬でき、サージア
ドミッタンスの変化は、電気回路上極めて容易に可変で
きる。第３図（ア）は断面積Ａ₁,A₂…Ａ_nと異なる断面
積をもった音響管を接続して声道を模擬したものであ
る。同図（イ）はその音響インピーダンスを電気回路の
LC回路に置き換えたもので、各音響管を１個のLC線路と
し、全体を集中線路のｎ−１の電気回路としたものであ
る。また第３図（ウ）進行波等価モデル図で、各音響管
の音響インピーダンスＺ₁,Z₂…Ｚ_nは、音響管の断面積
に反比例（音響アドミッタンスは比例）し、音波の速度
に比例するのでとなる。なお、同図でＺ_Oは音源インピーダンス,Z_Lは放
射インピーダンスを示し、またブロック間の矢印は、進
行波と後進波を表している。In this way, the frequency is determined by the mouth preference, and if the mouth preference is simulated, "a" or "a" is uttered. The appearance of the mouth can be simulated by the cross-sectional area of the acoustic tube, and the change of the cross-sectional area of the acoustic tube can be simulated by the change of the surge admittance, and the change of the surge admittance can be changed very easily on the electric circuit. Figure 3 (A) is obtained by simulating the vocal tract by connecting the sound tube having different cross-sectional area as the cross-sectional area _{_{_{A 1, A 2 ... A n}}} . The same figure (a) shows the acoustic impedance of the electric circuit.
It is replaced with an LC circuit, and each acoustic tube is one LC line, and the whole is an n-1 electric circuit of a concentrated line. Further, in FIG. 3 (c) traveling wave equivalent model diagram, the acoustic impedances Z ₁ , Z ₂ ... Z _n of each acoustic tube are inversely proportional to the cross-sectional area of the acoustic tube (acoustic admittance is proportional) and proportional to the velocity of the sound wave. Because Becomes In the figure, Z _O represents the sound source impedance, Z _L represents the radiation impedance, and the arrows between the blocks represent the traveling wave and the backward wave.

今「ア」という音声を発声させる場合は、第４図の口
唇の先の断面積に相当する断面積Ａ₁のところで「ア」
の口の恰好を与えて、インパルスＰを断続的に印加する
ことで、「ア」の音が得られ、また「ア」から「イ」の
音を発声させる場合は、同図（イ）に示すように断面積
をＡ₁′に狭め「イ」の口の恰好を与えることで「イ」
が得られる。When uttering the voice "A", "A" is placed at the cross-sectional area A ₁ corresponding to the cross-sectional area of the tip of the lip in Fig. 4.
When the sound of "A" is obtained by applying the impulse P intermittently by giving the shape of the mouth of "A" and the sound of "A" is produced from "A", As shown in the figure, the cross-sectional area is narrowed to A ₁ ′ to give the mouth of “a” the appearance of “a”.
Is obtained.

インパルスＰが連続して断続的に与えられ、断面積全
体を「イ」の口の恰好に変化させる場合、声道は第３図
に示すｎ個の音響管によって模擬しているので、これら
の各断面積を「ア」から動かして口の恰好を「ア−イ」
と連続的に変えることになる。この音響管の断面積を変
えるということは、サージインピーダンスを徐々に変え
ることによって行われる。If impulses P are given continuously and intermittently and the entire cross-sectional area is changed to the shape of the mouth of "a", the vocal tract is simulated by n acoustic tubes shown in FIG. Move each cross section from "A" to change the mouth appearance to "A"
Will be changed continuously. Changing the cross-sectional area of this acoustic tube is performed by gradually changing the surge impedance.

従って、第５図に示すように断面積はＡ₁からＡ₁′に
連続的に変えられるので、定常状態の「ア」，「イ」の
音が得られることは勿論であるが、更にインピーダンス
は連続して可変できるので、その中間の音、即ち音と音
との間の音を得ることができる。従って第６図に示すよ
うに音の切れが無く人間の発音に近い調音結合がスムー
ズに行われる。Therefore, as shown in FIG. 5, since the cross-sectional area is continuously changed from A ₁ to A ₁ ′, steady-state “A” and “A” sounds can be obtained, but further impedance Can be varied continuously, so that an intermediate sound, that is, a sound between sounds can be obtained. Therefore, as shown in FIG. 6, there is no break in the sound, and the articulatory coupling similar to the human pronunciation is smoothly performed.

次に音波の伝搬速度を考えると、これは長さｌでLCを
持った電線路にインパルスを印加した時の過渡現象に似
ている。Next, considering the propagation velocity of sound waves, this is similar to a transient phenomenon when an impulse is applied to an electric line having a length of l and having an LC.

即ち第７図に示すようにLCを有する線路を等価的に表
すと第８図のようになる。ここで両端部からみたサージ
インピーダンスＺ₀₁,Z₀₂は、となる。That is, a line having LC as shown in FIG. 7 is equivalently expressed as shown in FIG. Here, the surge impedances Z ₀₁ and Z ₀₂ seen from both ends are Becomes

ここで相手から到達してきた進行波を等価的な電流源
と考えると、となり電流は中間にｎ個の遅延回路ブロックＺがあれ
ば、ｎ時間後に出力される。即ち左側の回路で発生した
ものがτ時間後右側に到達したということになる。Considering the traveling wave that arrives from the other party as an equivalent current source, If there are n delay circuit blocks Z in the middle, the next current is output after n hours. That is, what has occurred in the circuit on the left has arrived on the right after τ.

Ｉ₂は送り管側の電流となる。但し、ディジタル計算においては、電圧または
電流を細分割するのでＶ₁,V₂は計測時刻ｔにおける電
圧，τは経過時間を示している。I ₂ is the current on the feed tube side Becomes However, in the digital calculation, since the voltage or current is subdivided, V ₁ and V ₂ indicate the voltage at the measurement time t, and τ indicates the elapsed time.

第８図では、L,C回路にインパルスを印加すれば、τ
時間後に出力管側に出る。そしてτ時間前到達されたも
のは相手にも到達しているということを等価的に表して
いる。線路の長さｌを１にするということは、遅延ブロ
ックｎを正規化して１にすることで計算し易くなる。ｌ
を3cmに刻む場合は遅延ブロックのｎを３ブロックにす
ればよい。In Fig. 8, if impulses are applied to the L and C circuits, τ
It goes out to the output tube side after time. And, it is equivalently expressed that what has arrived τ time ago has also reached the other party. Setting the line length 1 to 1 facilitates calculation by normalizing the delay block n to 1. l
In case of dividing into 3 cm, the delay block n may be set to 3 blocks.

第３図（ア）を人間の声道は男性で約17cmなので、1c
m刻みで17本の音響管で模擬すれば、Ａ₁から入った波形
は、半周期の電流を10に分割しそのΔｔを10μsecとす
れば、170μsecかかってAn側から出てくる。楽器のトロ
ンボンを考えると、トロンボンは音響管の長短によって
楽音を変える。本発明によれば、トロンボンの「ア」の
音からトロンボンの「ニ」の音のパラメータを２つ持て
ば良いトロンボンの「ア」の音はトロンボンの「ド」の
音からトロンボンの上の音という２つのパラメータがあ
ればよい。その中間音は、遅延回路の遅延ブロックを変
えることによって自由に調音することができる。Figure 3 (a) shows that the human vocal tract is about 17 cm for men, so 1c
If you simulate with 17 acoustic tubes in m steps, the waveform input from A ₁ will come out from the An side in 170 μsec if you divide the half-cycle current into 10 and Δt is 10 μsec. Considering the musical instrument thrombone, the thrombone changes the musical sound depending on the length of the acoustic tube. According to the present invention, it suffices to have two parameters from the sound of "A" of thrombon to the sound of "D" of thrombon. The sound of "A" of thrombone is the sound of "Do" of thrombone There are two parameters. The intermediate tone can be freely articulated by changing the delay block of the delay circuit.

即ち第８図における遅延回路Ｚ₁→Ｚ_n,Z_n→Ｚ₁を可変
することで同じ面積をもつ音響管の長さを変化すること
に対応させることができる。That is, by varying the delay circuits Z ₁ → Z _n and Z _n → Z ₁ in FIG. 8, it is possible to deal with changing the length of the acoustic tube having the same area.

次に、第３図（ウ）の進行波等価モデルの演算処理を
第９図のフローチャートに基づいて説明する。Next, the calculation processing of the traveling wave equivalent model of FIG. 3C will be described based on the flowchart of FIG.

音響管Ａ₁にインパルスが入力されると、コンピュー
タよりなる演算処理装置は、ステップＳ₁にてメモリよ
りＡ₁のａ_OA,i_OA,I_OA,Eを取り出す。取り出した値をも
とに、ステップＳ₂では、ａ_OA′＝ｆ（E,I_aA）ｉ_OA′＝ａ_OA′−Ｉ_OA の演算を行う。この演算値ａ_OA′,i_OA′およびステップ
Ｓ₃でメモリより導入された管Ａ₂の値ａ_1B,a_1A,i_1B,
i_1A,I_1B,I_1Aを用いてステップＳ₄では、ａ_1B′＝Ｓ_1B（Ｉ_1B＋Ｉ_1A）ａ_1A′＝Ｓ_1A（Ｉ_1B＋Ｉ_1A）ｉ_1B′＝ａ_1B′−Ｉ_1B ｉ_1A′＝ａ_iA′−Ｉ_1B Ｉ_1B′＝ｉ_0A′−ａ_0A′ の演算を行う。ステップＳ₅ではＳ₄にて求められた
ｉ_1B′,a_1B′を用いてＩ_0A′＝ｉ_1B＋ａ_1B を演算する。また一方、Ｓ₅にて求められた値ｉ_1A′,a
_1A′と、ステップＳ₆においてメモリより導入された管
Ａ₃の値ａ_2B,a_2A,i_2B,i_2A,I_2B,I_2Aとを用いてステップ
Ｓ₇にて次の演算が行われる。When an impulse is input to the acoustic tube A ₁ , the arithmetic processing unit including a computer retrieves a _OA , i _OA , I _OA , E of A ₁ from the memory in step S ₁ . Based on the retrieved values, in step _{_{S 2, a OA '= f}} (E, I aA) i OA' calculation of = a _{_OA} '-I _OA performed. The calculated values a _OA ′, i _OA ′ and the values a _1B , a _1A , i _1B , of the pipe A ₂ introduced from the memory in step S ₃ .
i _1A, I _1B, in step S ₄ by using the _{_{I 1A, a 1B '= S}} 1B (I 1B + I 1A) a 1A' = S 1A (I 1B + I 1A) i _1B ′ = a _1B ′ −I _1B i _1A ′ = a _iA ′ −I _1B I _1B ′ = i _0A ′ −a _0A ′ is calculated. In step S ₅ , I _0A ′ = i _1B + a _1B is calculated using i _1B ′, a _1B ′ obtained in S ₄ . On the other hand, the value i _1A ′, a obtained in S ₅
And _1A ', the value a _2B tube A ₃ introduced from the _{_{memory, a 2A, i 2B, i}} 2A, I 2B, the next operation at step S ₇ by using the I _2A performed in step S ₆ .

ａ_2B′＝Ｓ_2B（Ｉ_2B＋Ｉ_2A）ａ_2A′＝Ｓ_2A（Ｉ_2B＋Ｉ_2A）ｉ_2B′＝ａ_2B′−Ｉ_2B ｉ_2A′＝ａ_2A′−Ｉ_2A Ｉ_2B′＝ｉ_1A＋ａ_iA′ ステップＳ₈ではＳ₇にて求められたｉ_2B′,i_2A′を用
いてＩ_1A′＝ｉ_2B′＋ａ_2A′ の演算が行われる。以下同様にして模擬された音響管の
断面積Ａ₁〜Ａ_nに夫々対応した演算が行われ、ステップ
Ｓ_n-1では、ａ_nB′＝ｆ（Ｉ_nB）ｉ_nB′＝ａ_nB′−Ｉ_nB Ｉ_nB＝ｉ_n-1A＋ａ_n-1A の演算を行う。その結果を用いてステップＳ_nでは、Ｉ_n-1A′＝ｉ_nB′＋ａ_nB′ の演算を行う。すなわち、音響管のＡ₁〜Ａ_nに対応した
等価回路の最終段（ｎ段）における演算結果の出力がD/
A変換されて図示省略されたスピーカに出力され、スピ
ーカより音声として出力される。a _2B ′ = S _2B (I _2B + I _2A ) a _2A ′ = S _2A (I _2B + I _2A ) i _2B ′ = a _2B ′ −I _2B i _2A ′ = a _2A ′ −I _2A I _2B ′ = i _1A + a _iA ′ In step S ₈ , i _2B ′, i _2A ′ obtained in S ₇ is used. The calculation of I _1A ′ = i _2B ′ + a _2A ′ is performed. Similarly, calculations corresponding to the cross-sectional areas A _{1 to} A _n of the simulated acoustic tube are performed, and in step S _n-1 , a _nB ′ = f (I _nB ) _inB ′ = a _nB ′ − The calculation of I _nB I _nB = i _n-1A + a _n-1A is performed. Using the result, in step S _n , I _n-1A ′ = i _nB ′ + a _nB ′ is calculated. That is, the output of the calculation result at the final stage (n stages) of the equivalent circuit corresponding to A _{1 to} A _n of the acoustic tube is D /
It is A-converted and output to a speaker (not shown), and is output as sound from the speaker.

したがって演算処理装置は音響管Ａ₁〜Ａ_nに対応した
演算を行うものであるから、この演算処理装置は音響管
のＡ₁〜Ａ_n個々の等価回路を流れる各部の電流値および
関数f,S_iB,S_iA（ｉ＝1,2…ｎ−１）をテーブルとして有
しているメモリと、当該等価回路の各部の電流値を演算
する第１の演算手段と、当該等価回路とは相隣接する等
価回路の電流値を用いて当該等価回路の電流値を演算す
る第２の演算手段とを備えている。Therefore, since the arithmetic processing unit performs the arithmetic operation corresponding to the acoustic tubes A _{1 to} A _n , this arithmetic processing unit uses the electric current value and the function f of each part flowing through each equivalent circuit of A _{1 to} A _n of the acoustic tube. A memory having S _iB , S _iA (i = 1, 2 ... N-1) as a table, a first calculating means for calculating the current value of each part of the equivalent circuit, and the equivalent circuit are Second arithmetic means for calculating the current value of the equivalent circuit by using the current value of the adjacent equivalent circuit.

H. 発明の効果以上のように本発明は、音響管の断面積が等価回路の
サージインピーダンスに１対１に対応していることに着
目し、この断面積の変化をサージインピーダンスの変化
で模擬するようにしたものであるから、パラメータとし
ては、断面積だけを持てばよいので、極めて簡単にで
き、従来のように音声合成のための多くのメモリを必要
としない。また、調音結合も、サージインピーダンスの
連続的な変化によってスムーズに行われ、自然音に近い
発声が得られる。H. Effect of the Invention As described above, the present invention focuses on the fact that the cross-sectional area of the acoustic tube corresponds to the surge impedance of the equivalent circuit in a one-to-one manner, and simulates the change in this cross-sectional area by the change in surge impedance. Since this is done, only the cross-sectional area needs to be given as a parameter, so that it can be made extremely simple and does not require many memories for speech synthesis as in the conventional case. Further, articulatory coupling is smoothly performed by the continuous change of surge impedance, and utterance close to natural sound can be obtained.

また、楽音も遅延回路のメモリーの段数を変えること
により容易に得ることができ、断面積変化と相俟ってよ
り自然な楽器音を簡単に実現出来る。Also, musical tones can be easily obtained by changing the number of stages of the memory of the delay circuit, and in combination with the change in cross-sectional area, a more natural musical instrument sound can be easily realized.

[Brief description of drawings]

第１図〜第９図は本発明を説明するための図で、第１図
は音響管モデル図、第２図は音響管等価回路図、第３図
は音響管の電気回路等価モデル図、第４図は声道の変化
説明図、第５図は音響管断面積の時間に対する補間説明
図、第６図は本発明の調音結合説明図、第７図は音声伝
搬を電気的に模擬した電気回路図、第８図は第７図の等
価回路図、第９図は本発明の音合成をコンピュータ処理
するプログラムの一例を示すフローチャート図、第10図
は従来のパーコール合成による調音結合説明図を示す。Ａ₁,A₂…Ａ_n……音響管の断面積、Ｙ₁,Y₂……音響管の
音響アドミッタンス、Ｃ₁,C₂…Ｃ_n……電気回路モデル
の静電容量、Ｌ₁,L₂…Ｌ_n……同上のリアクタンス、
Ｚ₁,Z₂…Ｚ_n……サージインピーダンス。1 to 9 are diagrams for explaining the present invention. FIG. 1 is an acoustic tube model diagram, FIG. 2 is an acoustic tube equivalent circuit diagram, and FIG. 3 is an electrical circuit equivalent model diagram of the acoustic tube. FIG. 4 is an explanatory view of changes in the vocal tract, FIG. 5 is an explanatory view of interpolation of the acoustic tube cross-section with respect to time, FIG. 6 is an explanatory view of articulatory coupling of the present invention, and FIG. 7 is an electrical simulation of voice propagation. FIG. 8 is an electric circuit diagram, FIG. 8 is an equivalent circuit diagram of FIG. 7, FIG. 9 is a flow chart showing an example of a program for computer-processing the sound synthesis of the present invention, and FIG. Indicates. A ₁ , A ₂ ... A _n ... cross-sectional area of the acoustic tube, Y ₁ , Y ₂ ... acoustic admittance of the acoustic tube, C ₁ , C ₂ ... C _n ... capacitance of the electric circuit model, L ₁ , L ₂ … L _n …… Same reactance,
_{_{_{Z 1, Z 2 ... Z n}}} ...... surge impedance.

Claims

[Claims]

1. In sound synthesis that simulates using an acoustic tube,
Two adjacent acoustic tubes are treated as a parallel circuit of a propagating current source and a surge impedance that is inversely proportional to the acoustic cross-sectional area. (However, Ai, A _{i + 1} is the cross section of two adjacent acoustic tubes, i is 1,2,3…
A method for synthesizing sounds, characterized by comprising the coefficient n).

2. In the sound synthesis simulated by using an acoustic tube,
Two adjacent acoustic tubes are treated as a parallel circuit of a propagating current source and a surge impedance that is inversely proportional to the acoustic cross-sectional area. (However, Ai, A _{i + 1} is the cross section of two adjacent acoustic tubes, i is 1,2,3…
The current source of the acoustic tube having the coefficient of n) and the sum of the current flowing into the surge impedance and the current flowing into the surge impedance of the current source to be newly propagated to the next, or twice the value of the current flowing into the surge impedance and the current impedance of the acoustic tube. A sound synthesis method characterized by using a difference between and.

3. In sound synthesis that simulates using an acoustic tube,
By treating two adjacent acoustic tubes as a parallel circuit of a propagation current source and a surge impedance inversely proportional to the cross-sectional area of the acoustic tube, and having a delay circuit for the propagation of the propagation current source, and making this delay circuit variable. A sound synthesis method characterized in that it is adapted to change the length of an acoustic tube having an area.