JPH07210184A - Voice editor/synthesizer - Google Patents

Voice editor/synthesizer

Info

Publication number
JPH07210184A
JPH07210184A JP6005586A JP558694A JPH07210184A JP H07210184 A JPH07210184 A JP H07210184A JP 6005586 A JP6005586 A JP 6005586A JP 558694 A JP558694 A JP 558694A JP H07210184 A JPH07210184 A JP H07210184A
Authority
JP
Japan
Prior art keywords
voice
voices
connection
synthetic
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP6005586A
Other languages
Japanese (ja)
Inventor
Hiroko Yoshida
田 博 子 吉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP6005586A priority Critical patent/JPH07210184A/en
Publication of JPH07210184A publication Critical patent/JPH07210184A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To make it possible to synthesize a natural and smooth voice from unrecorded synthetic units when each of the units has a certain degree of recording quantity and to synthesize a natural voice generating no discontinuousness even if the synthetic unit is small. CONSTITUTION:A recording voice retrieving part 2, a connection unit determining part 3 and a sound signal connecting part 5 are prepared, and when a sentence of contents expressed a voice to be synthetized is inputted from a synthetic sentence signal input part 1, the retrieving part 2 retrieves the contents of recorded voices and selects several recording units including the voice concerned. The determining part 3 selects voices to be connected so as to quite connection distortion out of those voices, the connecting part 5 connects the selected voices and a synthetic signal output part 6 synthesizes a required voice.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、駅の案内放送や電気製
品の操作説明等に用いる、デジタル録音した音声を編集
により合成する録音編集合成装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a recording / editing / synthesizing device for synthesizing digitally recorded voices by editing, which is used for station guide broadcasting and operation explanation of electric appliances.

【0002】[0002]

【従来の技術】従来、この種の音声編集合成装置は、あ
らかじめ人が発声した音声を、単語や文節、文等を単位
として録音しておき、必要に応じて読み出して編集し、
文章等の音声に合成して出力している。
2. Description of the Related Art Conventionally, this type of voice editing / synthesizing apparatus records a voice uttered by a person in advance in units of words, phrases, sentences, etc., and reads and edits the voice as necessary.
Synthesized into speech such as sentences and output.

【0003】すなわち、以下に示す文章1の例の様に
「まもなく3番線に急行東京行きがまいります」という
音声を出力するには「まもなく→A3→B2→C1→ま
いります」というように音声を選択して順に出力するこ
とによって、所望の文章を合成することができる。
That is, as in the example of the sentence 1 below, in order to output a voice saying "I will soon go to Line 3 for Tokyo", I will say "Soon → A3 → B2 → C1 → I will come" A desired sentence can be synthesized by selecting and outputting in sequence.

【0004】 文章1: まもなく ○番線に ○○○行きが まいります。 A:番線 B:電車種別 C:行き先 A1: 1番線に B1:各駅停車 C1: 東京行きが A2: 2番線に B2:急行 C2: 横浜行きが A3: 3番線に B3:快速 C3: 品川行きが B4:通勤快速 C4: 川崎行きが[0004] sentence 1: Mairi is soon to ○ Line ○○○ go. A: Line B: Train type C: Destination A1: Line 1 B1: Stop at each station C1: Tokyo bound A2: Line 2 B2: Express C2: Yokohama bound A3: Line 3 B3: Rapid C3: Shinagawa bound B4: Rapid commuting C4: To Kawasaki

【0005】このように、上記従来の方法でも、合成単
位を組み合わせて出力することにより、所望の音声を合
成することができる。
As described above, even in the above-mentioned conventional method, a desired voice can be synthesized by combining and outputting the synthesis units.

【0006】[0006]

【発明が解決しようとする課題】しかしながら、上記従
来の音声編集合成装置では、合成単位が文章や文節であ
り、その単位ごとに音声を出力しているため録音量が多
く、ある程度決まった文章でなければ合成できない。ま
た、合成単位として録音されていない音声は出力するこ
とができないという問題があった。すなわち、以下の文
章の例の様に文章のパターンが替わったり、電車の行き
先が変わった場合、文章1の「東京行きが」や、「横浜
行きが」という音声はあっても、「東京行きの」や、
「横浜方面」の音声がないため、それらを新たに録音し
なければならない。
However, in the above-described conventional voice editing / synthesizing apparatus, the synthesis unit is a sentence or a phrase, and since the voice is output for each unit, the recording amount is large and the sentence is fixed to a certain degree. Unless it can be synthesized. In addition, there is a problem that a voice that is not recorded as a synthesis unit cannot be output. That is, if the pattern of the sentence is changed or the destination of the train is changed as in the example of the following sentence, even if there are voices such as "To Tokyo" or "To Yokohama" in sentence 1, "To Tokyo" No '
Since there is no voice for "Yokohama", we have to record them newly.

【0007】 文章2: まもなく ○番線に ○○方面 ○○○行きの 電車が到着します。 A:番線 D:方面 E:行き先 A1: 1番線に B1:小田原方面 C1: 東京行きの A2: 2番線に B2:横浜方面 C2: 静岡行きの A3: 3番線に B3:品川方面 C3: 名古屋行きの B4:川崎方面 C4: 青森行きのSentence 2: Soon, a train bound for ○○ will arrive on the ○○ line . A: Line D: Direction E: Destination A1: Line 1 B1: Odawara C1: Tokyo A2: Line 2 B2: Yokohama C2: Shizuoka A3: Line 3 B3: Shinagawa C3: Nagoya B4: To Kawasaki C4: To Aomori

【0008】このような新たな録音を避けるため、合成
単位を「東京」、「横浜」、「行きの」や、「行きが」
等、文章や文節にせず、単語、接続語、助詞、音節、と
いうように小さくすると、録音量を増やさずに合成でき
る語彙や文章を増加することができるが、「横浜方面」
を再生する際には、「横浜」と「方面」という合成単位
を順に出力するだけであるため、「横浜、方面」という
様な不連続な音声になってしまい、音声品質が低下して
しまうという問題があった。
In order to avoid such a new recording, the composition units are "Tokyo", "Yokohama", "Gono" and "Goga".
For example, if you reduce the number of words, connectives, particles, syllables, etc. instead of sentences or phrases, you can increase the vocabulary and sentences that can be synthesized without increasing the recording amount.
When playing back, only the synthetic units of "Yokohama" and "direction" are output in order, resulting in discontinuous voice such as "Yokohama, direction", which deteriorates the voice quality. There was a problem.

【0009】本発明は、上記従来の問題を解決するもの
で、録音されていない合成単位でも、ある程度の音声量
があれば、それらの音声の中から自然で滑らかな音声を
合成したり、合成単位を小さくしても不連続が生じな
い、自然な音声を合成できる音声編集合成装置を提供す
ることを目的とする。
The present invention solves the above-mentioned conventional problems, and even if a synthesis unit that is not recorded has a certain amount of voice, it synthesizes a natural and smooth voice from those voices, or synthesizes them. An object of the present invention is to provide a voice editing / synthesizing device capable of synthesizing a natural voice in which discontinuity does not occur even if the unit is reduced.

【0010】[0010]

【課題を解決するための手段】本発明は、上記目的を達
成するために、音声の接続を音声が継続している途中の
無声破裂音の無音区間、無声摩擦音の摩擦区間、同じ有
声音でスペクトルが似通っている区間で接続する手段を
備えたものである。
SUMMARY OF THE INVENTION In order to achieve the above object, the present invention uses a silent segment of an unvoiced plosive sound, a frictional segment of an unvoiced fricative sound, and the same voiced sound in the middle of continuous voice connection. It is provided with a means for connecting in sections where the spectra are similar.

【0011】本発明はまた、音声を接続する場合、接続
される音声信号は、接続箇所からある一定区間パワーを
直線的に減衰させ、接続する音声信号は、同じ区間パワ
ーを直線的に増加させていき、その区間の信号を足し合
わせるようにしたものである。
According to the present invention, when a voice is connected, the connected voice signal linearly attenuates a certain section power from the connection point, and the connected voice signal linearly increases the same section power. Then, the signals of that section are added together.

【0012】本発明はまた、音声を接続する場合、接続
する箇所が音声のピッチが存在する箇所である場合、ピ
ッチの位相の差による音声の接続歪を避けるために、接
続箇所の2つの音声信号の相関をとり、相関の高い部分
を接続箇所に設定して、音声の位相による接続歪を軽減
するようにしたものである。
In the present invention, when connecting voices, when the connecting place is a place where the pitch of the voices exists, in order to avoid the connection distortion of the voices due to the phase difference of the pitches, two voices of the connecting place are connected. The signal is correlated, and a portion having a high correlation is set as a connection point to reduce connection distortion due to the phase of voice.

【0013】[0013]

【作用】本発明は、上記のような構成により次の様な作
用を有する。すなわち、音声が継続している途中の無声
破裂音の無音区間、無声摩擦音の摩擦区間、同じ有声音
でスペクトルが似通っている箇所等、接続による歪が生
じない箇所で接続することによって、録音量を増やさな
くても、不連続が生じない自然な音声を合成することが
できる。
The present invention has the following actions due to the above-mentioned structure. That is, the amount of recording can be increased by connecting at a place where no distortion occurs due to the connection, such as a silent section of an unvoiced plosive while the voice is continuing, a friction section of an unvoiced fricative, or a section where the spectrum is similar for the same voiced sound. It is possible to synthesize a natural voice without discontinuity without increasing the.

【0014】また、音声のつながりを良くするために、
接続される側の音声に、ある一定の区間内で1から0に
直線的に向かう重み付けを施し、接続する側の音声に
は、接続箇所からある一定の区間内でに0から1に直線
的に向かう重み付けを施し、その区間で2つの音声を足
し合わせることによって、接続箇所での音声のつながり
が良くなり、滑らかな音声を再生することができる。
In order to improve the voice connection,
The connected voice is weighted linearly from 1 to 0 in a certain section, and the connected voice is linearly changed from 0 to 1 in a certain section from the connection point. By weighting toward and adding two voices in the section, the voice connection at the connection point is improved, and a smooth voice can be reproduced.

【0015】さらに、ピッチが存在する箇所で接続する
場合、接続箇所近傍での2つの音声の相関を取り、一番
相関の大きな箇所を接続箇所に設定することによって、
位相による接続歪が少ない、自然な音声を再生できる接
続箇所を決定することができる。
Further, when connecting at a place where a pitch exists, by correlating two voices in the vicinity of the connecting place and setting a place having the largest correlation as a connecting place,
It is possible to determine a connection point that can reproduce natural sound with little phase connection distortion.

【0016】[0016]

【実施例】以下、図面を参照しながら、本発明の一実施
例について説明する。図1は本発明の音声編集合成装置
の一実施例の構成を示すブロック図である。図1におい
て、1は合成文章信号入力部、2は録音音声検索部、3
は接続単位決定部、4は音声信号読み込み部、5は音声
信号接続部、6は合成信号出力部、7は録音音声であ
る。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of an embodiment of a voice editing / synthesizing apparatus of the present invention. In FIG. 1, 1 is a synthetic text signal input unit, 2 is a recorded voice search unit, 3
Is a connection unit determining unit, 4 is a voice signal reading unit, 5 is a voice signal connecting unit, 6 is a synthesized signal output unit, and 7 is recorded voice.

【0017】音声は一般に音韻の種類によって、ある程
度の特徴を持っている。つまり、p,t,k等、無声破
裂音や無声破擦音のt∫は、音韻の始まりに無音の閉鎖
音があり、無声摩擦音s,f,hや無性破擦音のts
(ツ)は持続的な、鋭い摩擦的な音で始まる。また、音
声の母音区間はスペクトル変化がある程度一定である。
このような箇所では音声の性質が一定であるため、音声
の途中で接続しても、接続による歪はめだたない。その
ため、図1の合成文章信号入力部1で合成したい音声の
内容の文章を入力したら、録音音声検索部2で録音され
ている音声の内容を検索して、その音声が含まれている
合成単位をいくつか選択する。そして、接続単位決定部
3では、それらの音声の中から一番接続歪がめだたない
で接続できる音声を選び出し、音声信号読み込み部4で
選び出された音声に相当する信号を録音音声7の中から
読み込み、音声信号接続部5で音声を接続して、合成信
号出力部6で接続された音声信号を出力する。
Speech generally has some features depending on the type of phoneme. That is, t∫ of unvoiced plosives or unvoiced affrications such as p, t, k has a silent closing sound at the beginning of the phoneme, and unvoiced fricatives s, f, h and ts of asexual affricate.
(Tsu) begins with a persistent, sharp, frictional sound. In the vowel section of speech, the spectrum change is constant to some extent.
Since the nature of the voice is constant in such a part, even if the voice is connected in the middle of the voice, the distortion due to the connection is not significant. Therefore, when a sentence having a voice content to be synthesized is input in the synthetic sentence signal input unit 1 of FIG. 1, the recorded voice search unit 2 searches the content of the recorded voice, and a synthesis unit including the voice. Select some. Then, the connection unit determination unit 3 selects a voice that can be connected with the least connection distortion from those voices, and outputs a signal corresponding to the voice selected by the voice signal reading unit 4 in the recorded voice 7. The voice signal is connected to the voice signal connection unit 5, and the synthesized signal output unit 6 outputs the connected voice signal.

【0018】図2および図3に示す例によると、「下田
方面」という文節を合成する場合、まず録音音声検索部
2で、「下田方面」の「下田」を含む「下田行きが」が
選択され、また「下田方面」の「方面」を含む「東京方
面」、「国府津方面」、「平塚方面」、「熱海方面」が
選択される。次に、接続単位決定部3で、選択された音
声の中から、母音aで接続すれば所望の音声が合成され
る「下田行きが」と「平塚方面」が決定され、音声信号
接続部5でこれらの音声の母音aで接続して「下田方
面」を合成し、合成信号出力部6で合成された音声を出
力する。
According to the examples shown in FIGS. 2 and 3, when synthesizing the phrase "Shimoda area", first, the recording voice search unit 2 selects "Shimoda bound" including "Shimoda" of "Shimoda area". In addition, “Tokyo direction” including “Shimoda direction”, “Koufu direction”, “Hiratsuka direction”, and “Atami direction” are selected. Next, the connection unit determination unit 3 determines “Shimoda bound” and “Hiratsuka direction”, in which a desired voice is synthesized by connecting with the vowel a, from the selected voices, and the voice signal connection unit 5 Then, the vowels a of these voices are connected to synthesize "Shimoda area", and the synthesized voice is output by the synthetic signal output unit 6.

【0019】また、音声信号接続部5で音声を接続する
場合、図4に示すように、2つの音声のつながりを良く
するため、接続する箇所でのピッチ周期に相当する区間
等、ある一定の区間の音声データに接続される側の音声
Aに1から0に向かう重み付けを施し、接続する側の音
声Bには0から1に向かう重み付けを施し、重み付けを
行なった区間の音声を相互に加算することによって音声
の連続性を良くし、接続による歪を軽減することができ
る。
Further, in the case of connecting voices by the voice signal connection unit 5, as shown in FIG. 4, in order to improve the connection between the two voices, a certain interval such as a section corresponding to the pitch cycle at the connecting point is provided. The voice A on the side connected to the voice data of the section is weighted from 1 to 0, the voice B on the connecting side is weighted from 0 to 1, and the voices of the weighted sections are added to each other. By doing so, it is possible to improve the continuity of voice and reduce distortion due to connection.

【0020】また、ピッチが存在する箇所で接続する場
合、音声のピッチの位相のずれによって歪が生じてしま
う場合がある。そこで、接続する部分がきまったら、図
5に示すように、以下の式から2つの音声の相関を求
め、一番相関が高かった箇所の接続点を移動して、位相
のずれ(shiftで表される量)を補正することによ
って音声の位相を合わせ、音声の位相の相違による接続
歪を回避することができる。
Further, when the connection is made at a place where a pitch exists, distortion may occur due to the phase shift of the pitch of the voice. Therefore, when the connected portion is determined, as shown in FIG. 5, the correlation between the two voices is obtained from the following equation, and the connection point at the location having the highest correlation is moved to obtain the phase shift (shift expression). It is possible to match the phase of the voice by correcting the amount) and avoid the connection distortion due to the difference in the phase of the voice.

【0021】[0021]

【数1】 [Equation 1]

【0022】以上のように、本実施例によれば、音声の
途中であっても接続しやすい箇所で音声を接続すること
によって、録音されていない音声でも、ある程度録音さ
れた音声の中から自然で滑らかな音声を合成することが
できる。
As described above, according to this embodiment, even if the voice is not recorded, the voice is naturally recorded from the recorded voice to some extent by connecting the voice at a place where the voice can be easily connected even in the middle of the voice. You can synthesize smooth voices with.

【0023】また、録音時に先行母音を含む音声、無声
破裂音の無音部で区切った音声、または無声摩擦音の摩
擦部分で区切った音声など、組み合わせる際に、つなぎ
やすい部分で始まる音声を単位として録音しておき、再
生時にそれらの部分で接続することによって、単語、接
続語、助詞単位の編集合成並の録音量で、文章、文節単
位の編集合成並の品質の音声を合成することができる。
Further, when recording, a voice including a preceding vowel, a voice delimited by a silent part of an unvoiced plosive, or a voice delimited by a frictional part of an unvoiced fricative is recorded as a unit of a voice starting at a part which is easily connected. In addition, by connecting these parts at the time of reproduction, it is possible to synthesize a voice having a quality equivalent to that of editing / synthesis in sentences or clauses with a recording amount equivalent to that of editing / synthesis in units of words, connecting words, and particles.

【0024】[0024]

【発明の効果】本発明は、上記実施例から明らかなよう
に、音声が継続している途中の無声破裂音の無音区間、
無声摩擦音の摩擦区間、同じ有声音でスペクトルが似通
っている箇所等、接続による歪が生じない箇所で接続す
ることによって、録音量を増やさなくても、不連続が生
じない自然な音声を合成することができる。
As is apparent from the above embodiment, the present invention provides a silent section of a voiceless plosive in the middle of continuous voice.
By connecting at a place where distortion due to the connection does not occur, such as a friction section of unvoiced fricative, a place where the spectrum is similar for the same voiced sound, natural speech that does not cause discontinuity without increasing the recording amount is synthesized. be able to.

【0025】また、音声のつながりを良くするために、
接続される側の音声に、ある一定の区間内で1から0に
直線的に向かう重み付けを施し、接続する側の音声に
は、接続箇所からある一定の区間内でに0から1に直線
的に向かう重み付けを施し、その区間で2つの音声を足
し合わせることによって、接続箇所での音声のつながり
が良くなり、滑らかな音声を再生することができる。
In order to improve the connection of voice,
The connected voice is weighted linearly from 1 to 0 in a certain section, and the connected voice is linearly changed from 0 to 1 in a certain section from the connection point. By weighting toward and adding two voices in the section, the voice connection at the connection point is improved, and a smooth voice can be reproduced.

【0026】さらに、ピッチが存在する箇所で接続する
場合、接続箇所近傍での2つの音声の相関を取り、一番
相関の大きな箇所を接続箇所に設定することによって、
位相による接続歪が少ない、自然な音声を再生できる接
続箇所を決定することができる。
Further, when connecting at a place where a pitch exists, by correlating two voices in the vicinity of the connecting place and setting a place having the largest correlation as a connecting place,
It is possible to determine a connection point that can reproduce natural sound with little phase connection distortion.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における音声編集装置の構成
を示す概略ブロック図。
FIG. 1 is a schematic block diagram showing the configuration of a voice editing device according to an embodiment of the present invention.

【図2】同装置の動作を説明するための模式図。FIG. 2 is a schematic diagram for explaining the operation of the device.

【図3】同装置の動作を説明するための波形図。FIG. 3 is a waveform diagram for explaining the operation of the device.

【図4】同装置の動作を説明するための波形図。FIG. 4 is a waveform diagram for explaining the operation of the device.

【図5】同装置の動作を説明するための波形図。FIG. 5 is a waveform diagram for explaining the operation of the device.

【符号の説明】[Explanation of symbols]

1 合成文章信号入力部 2 録音音声検索部 3 接続単位決定部 4 音声信号読み込み部 5 音声信号接続部 6 合成信号出力部 7 録音音声 1 Synthetic text signal input unit 2 Recorded voice search unit 3 Connection unit determination unit 4 Voice signal reading unit 5 Voice signal connection unit 6 Synthetic signal output unit 7 Recorded voice

Claims (3)

【特許請求の範囲】[Claims] 【請求項1】 文章または文節単位で録音された音声を
組み合わせて出力する音声編集合成装置において、音声
の接続を、音声の持続している途中の無声破裂音の無音
区間、無声摩擦音の摩擦区間およびスペクトルが似てい
る区間で接続する手段を備えた音声編集合成装置。
1. A voice editing / synthesizing device for outputting voices recorded in units of sentences or phrases in combination, in which voices are connected by a silent segment of a silent plosive and a frictional segment of an unvoiced fricative in the middle of continuous voice. And a voice editing / synthesizing device having means for connecting in sections having similar spectra.
【請求項2】 音声を接続する場合、接続される音声信
号は、接続箇所からある一定区間パワーを直線的に減衰
させ、接続する音声信号は、同じ区間パワーを直線的に
増加させていき、その区間の信号を足し合わせることを
特徴とする請求項1記載の音声編集合成装置。
2. When connecting voice, the connected voice signal linearly attenuates power in a certain section from the connection point, and the connected voice signal linearly increases power in the same section, 2. The voice editing / synthesizing apparatus according to claim 1, wherein signals of the section are added together.
【請求項3】 音声を接続する場合、接続する箇所が音
声のピッチが存在する箇所である場合、ピッチの位相の
差による音声の接続歪を避けるために、接続箇所の2つ
の音声信号の相関をとり、相関の高い部分を接続箇所に
設定して、音声の位相による接続歪を軽減することを特
徴とする請求項1記載の音声編集合成装置。
3. When connecting voices, when the connecting place is a place where the pitch of the voice exists, in order to avoid the connection distortion of the voice due to the phase difference of the pitch, the correlation of two voice signals at the connecting place. The voice editing / synthesizing apparatus according to claim 1, wherein a portion having a high correlation is set as a connection point to reduce connection distortion due to a voice phase.
JP6005586A 1994-01-24 1994-01-24 Voice editor/synthesizer Pending JPH07210184A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP6005586A JPH07210184A (en) 1994-01-24 1994-01-24 Voice editor/synthesizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP6005586A JPH07210184A (en) 1994-01-24 1994-01-24 Voice editor/synthesizer

Publications (1)

Publication Number Publication Date
JPH07210184A true JPH07210184A (en) 1995-08-11

Family

ID=11615355

Family Applications (1)

Application Number Title Priority Date Filing Date
JP6005586A Pending JPH07210184A (en) 1994-01-24 1994-01-24 Voice editor/synthesizer

Country Status (1)

Country Link
JP (1) JPH07210184A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011221486A (en) * 2010-03-26 2011-11-04 Toshiba Corp Audio editing method and device, and audio synthesis method
JP2012173702A (en) * 2011-02-24 2012-09-10 Denso Corp Voice guidance system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011221486A (en) * 2010-03-26 2011-11-04 Toshiba Corp Audio editing method and device, and audio synthesis method
US8868422B2 (en) 2010-03-26 2014-10-21 Kabushiki Kaisha Toshiba Storing a representative speech unit waveform for speech synthesis based on searching for similar speech units
JP2012173702A (en) * 2011-02-24 2012-09-10 Denso Corp Voice guidance system

Similar Documents

Publication Publication Date Title
US20130041669A1 (en) Speech output with confidence indication
US8195464B2 (en) Speech processing apparatus and program
JPH08110789A (en) Voice synthesis method by link and partial overlap of waveforms
JPH10153998A (en) Auxiliary information utilizing type voice synthesizing method, recording medium recording procedure performing this method, and device performing this method
JPS62160495A (en) Voice synthesization system
WO2002054383A1 (en) Text voice synthesis device and program recording medium
KR100710600B1 (en) The method and apparatus that createdplayback auto synchronization of image, text, lip's shape using TTS
Mengko et al. Indonesian Text-To-Speech system using syllable concatenation: Speech optimization
JPH07210184A (en) Voice editor/synthesizer
JPH08335096A (en) Text voice synthesizer
Itoh et al. A new waveform speech synthesis approach based on the COC speech spectrum
JP2011090218A (en) Phoneme code-converting device, phoneme code database, and voice synthesizer
JP3089940B2 (en) Speech synthesizer
JP3060276B2 (en) Speech synthesizer
RU2298234C2 (en) Method for compilation phoneme synthesis of russian speech and device for realization of said method
JP2577372B2 (en) Speech synthesis apparatus and method
JP3124791B2 (en) Speech synthesizer
JP3394281B2 (en) Speech synthesis method and rule synthesizer
JP2002244693A (en) Device and method for voice synthesis
Bonada et al. Improvements to a sample-concatenation based singing voice synthesizer
Crystal et al. Segmental durations in connected speech signals
JPH0836397A (en) Voice synthesizer
JPH0572599B2 (en)
JPS5950079B2 (en) Speech synthesis method
JPH0756591A (en) Device and method for voice synthesis and recording medium