JPS63169700A - Unit voice editing type rule synthesizer - Google Patents

Unit voice editing type rule synthesizer

Info

Publication number
JPS63169700A
JPS63169700A JP62002247A JP224787A JPS63169700A JP S63169700 A JPS63169700 A JP S63169700A JP 62002247 A JP62002247 A JP 62002247A JP 224787 A JP224787 A JP 224787A JP S63169700 A JPS63169700 A JP S63169700A
Authority
JP
Japan
Prior art keywords
unit
speech
pitch
memory
subset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP62002247A
Other languages
Japanese (ja)
Inventor
松尾 則子
伏木田 勝信
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP62002247A priority Critical patent/JPS63169700A/en
Publication of JPS63169700A publication Critical patent/JPS63169700A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management Or Editing Of Information On Record Carriers (AREA)

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。
(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】 (産業上の利用分野) 本発明は規則型音声合成装置、特に入力される音素記号
列から音声の合成波形を生成する規則型音声合成装置に
関する。
DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a regular speech synthesizer, and more particularly to a regular speech synthesizer that generates a synthesized speech waveform from an input phoneme symbol string.

(従来の技術) 従来、規則型音声合成装置において、cv、vc等を単
位音声とする音声波形を編集、合成する方式がある。例
えば、音声研究会資料882−06(1982年4月)
”CV、VC波形のピッチ同期的補間による任意語合成
方式″に記載されているような技術が知られている。
(Prior Art) Conventionally, in a regular speech synthesizer, there is a method of editing and synthesizing speech waveforms using cv, vc, etc. as unit speech. For example, Voice Study Group Material 882-06 (April 1982)
A technique described in "Arbitrary word synthesis method using pitch-synchronous interpolation of CV and VC waveforms" is known.

(発明が解決しようとする問題点) 従来方式では、前後の単位音声の影響(調音結合)を取
り入れていなかったため、個々の音声の明瞭性は良いが
合成音が不自然に聞こえるという欠点があった。この欠
点を解決するためにそれぞれのcv、vcに対して種々
の調音結合条件に対応した複数個の単位音声波形をあら
かじめ自然音声から切り出し用意して用いる方式が当然
考えられる。しかしながら、この方式は、単位音声波形
の組合せ方によって、ピッチ、スペクトル包絡が不連続
になる恐れがある。
(Problem to be solved by the invention) The conventional method did not take into account the influence of the preceding and succeeding unit voices (articulatory combination), so although the clarity of individual voices was good, the synthesized voice sounded unnatural. Ta. In order to solve this drawback, it is natural to think of a method in which a plurality of unit speech waveforms corresponding to various articulatory coupling conditions are extracted and prepared in advance from natural speech for each cv and vc. However, in this method, the pitch and spectrum envelope may become discontinuous depending on how the unit speech waveforms are combined.

(問題点を解決するための手段) 本発明の規則型音声合成装置は、合成すべき音声を表す
入力音素列から、CV、VC等の単位音声名列に分解す
る手段と、該入力音素列からピッチルールに従ってピッ
チ周波数データを算出する手段と、該単位音声名列から
複数個の互いに異なるピッチバタン、ホルマントパタン
を有する単位音声の組(異音サブセットと呼称する)を
選択する手段と、ピッチ、ホルマントなど各単位音声の
境界データを記憶しておくメモリーと、各単位音声波形
を記憶してお(メモリーと、選択された前記異音サブセ
ットの中から該境界データを用いてある範囲内(例えば
、一単語)において前後の単位音声の距離の和が最小と
なるように単位音声名を選択する手段と前記選択された
単位音声名に従って該単位音声波形を編集し音声の合成
を行う手段とを有する。
(Means for Solving the Problems) The regular speech synthesis device of the present invention includes means for decomposing an input phoneme string representing speech to be synthesized into unit phonetic name strings such as CV and VC, and means for calculating pitch frequency data according to a pitch rule from the unit phonetic name sequence; , a memory for storing boundary data of each unit voice such as formant, and a memory for storing each unit voice waveform (memory, and a memory for storing boundary data of each unit voice such as formant, etc.). For example, means for selecting unit speech names such that the sum of the distances between the preceding and succeeding unit speeches in one word is minimum, and means for editing the unit speech waveform and synthesizing speech according to the selected unit speech names. has.

(作用) 本方式は入力音素列から、生成されたCV、VC等の単
位音声名列とピッチルールにより生成されたピッチ周波
数データとから異音サブセットを求め、前記異音サブセ
ットの中から前後の単位音声間の距離(ピッチ、ホルマ
ントなどの関数)の和がある一定の音声区間内(例えば
、一単語)で最小となる最適な単位音声系列を選ぶ。前
記最適な単位音声系列を選択するためには前記異音サブ
セットに含まれる単位音声に対して可能な全ての組合せ
に対する前記距離の和を求め、最小となるものを選択す
る。このように前後の音素の影響を取り入れ、且つ単位
音声間のピッチ、ホルマントの連続性を最適化して合成
することにより、明瞭性が高く滑らかな音質を持つ合成
音声が得られる。
(Operation) This method calculates allophone subsets from the input phoneme string, generated unit phonetic name strings such as CV and VC, and pitch frequency data generated by pitch rules, and selects the preceding and following allophone subsets from among the allophone subsets. An optimal unit speech sequence is selected in which the sum of the distances (functions of pitch, formant, etc.) between unit speeches is the minimum within a certain speech interval (for example, one word). In order to select the optimal unit speech sequence, the sum of the distances for all possible combinations of unit speech included in the allophone subset is determined, and the minimum one is selected. In this way, by incorporating the influence of the preceding and following phonemes and optimizing the continuity of pitch and formant between unit voices for synthesis, synthesized speech with high clarity and smooth sound quality can be obtained.

(実施例) 第1図は本発明の原理を実現するための一実施例を示す
ブロック図である。入力端子1から入力された音素列は
、単位音声名列分解回路2とピッチルール回路3に入力
され、2ではcv、vc等の単位音声名列に分解され、
3では前記音素列に従って前記各単位音声のピッチ周波
数値(ある程度幅をもたせる)が決定される。2と3の
結果をもとに複数個の単位音声の組である異音サブセッ
トを異音サブセット選択回路4で選ぶ。最適値選択回路
5において境界条件記憶メモリ6に記憶されている各単
位音声の境界におけるピッチ、ホルマントデータを用い
て前後の単位音声間の距離の和がある音声区間内(例え
ば、一単語)で最小となるように、4で選ばれた異音サ
ブセットの中から選択する。5で選ばれた最適な異音サ
ブセットと単位音声波形メモリー7に記憶されている単
位音声波形を用いて音声合成回路8で該単位音声波形を
編集合成し、出力端子9から出力する。単位音声波形の
編集合成方式については、前記文猷音声研究会費料88
2−06(1982年4月)”cv、vc濾波形ピッチ
同期的補間による任意語合成方式″に詳しいのでここで
は説明を省略する。
(Embodiment) FIG. 1 is a block diagram showing an embodiment for realizing the principle of the present invention. The phoneme string input from the input terminal 1 is input to the unit phonetic name string decomposition circuit 2 and the pitch rule circuit 3, where it is decomposed into unit phonetic name strings such as cv, vc, etc.
3, the pitch frequency value (with some width) of each unit voice is determined according to the phoneme sequence. Based on the results of steps 2 and 3, an allophone subset selection circuit 4 selects an allophone subset that is a set of a plurality of unit voices. The optimum value selection circuit 5 uses the pitch and formant data at the boundaries of each unit voice stored in the boundary condition storage memory 6 to determine the sum of the distances between the preceding and succeeding unit voices within a speech interval (for example, one word). Select from among the allophone subsets selected in step 4 so as to be the minimum. Using the optimal abnormal sound subset selected in step 5 and the unit speech waveform stored in the unit speech waveform memory 7, the speech synthesis circuit 8 edits and synthesizes the unit speech waveform and outputs it from the output terminal 9. Regarding the unit speech waveform editing and synthesis method, please refer to the above-mentioned Bunyu Speech Study Group fee 88.
2-06 (April 1982) "Arbitrary Word Synthesis Method Using CV, VC Filtered Pitch Synchronous Interpolation" is detailed, so the explanation will be omitted here.

(発明の効果) 以上、説明したように本願はcv、vc等の単位音声名
列とピッチルールと異音サブセットからある範囲内(例
えば、一単語)で最適化されたピッチ、ホルマントなど
を用いることにより前後の音素の影響を取り入れた合成
音を得ることができ、明瞭性が高くなめらかな音質が得
られる。
(Effects of the Invention) As explained above, the present application uses pitches, formants, etc. that are optimized within a certain range (for example, one word) from unit phonetic name sequences such as cv and vc, pitch rules, and allophone subsets. By doing this, it is possible to obtain a synthesized sound that takes into account the effects of the preceding and following phonemes, resulting in high clarity and smooth sound quality.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は、本発明°の一実施例を示すブロック図である
。 1・・・・・・入力端子 2・・・・・・単位音声名列分解回路 3・・・・・・ピッチルール回路 4・・・・・・異音サブセット選択回路5・・・・・・
異音サブセット最適値選択回路6・・・・・境界条件記
憶メモリー 7・・・・・単位音声波形メモリー 8・・・・・音声合成回路
FIG. 1 is a block diagram showing an embodiment of the present invention. 1... Input terminal 2... Unit phonetic name string decomposition circuit 3... Pitch rule circuit 4... Allophone subset selection circuit 5...・
Abnormal sound subset optimal value selection circuit 6...Boundary condition memory memory 7...Unit speech waveform memory 8...Speech synthesis circuit

Claims (1)

【特許請求の範囲】[Claims] 合成すべき音声を表す入力音素列から、CV、VC等の
単位音声名列に分解する手段と、該入力音素列からピッ
チルールに従ってピッチ周波数データを算出する手段と
、該単位音声名列および前記ピッチ周波数データに従っ
て異音サブセットを選択する手段と、ピッチ、ホルマン
トなどの各単位音声の境界データを記憶しておくメモリ
ーと、各単位音声波形を記憶しておくメモリーと、選択
された前記異音サブセットの中から該境界データを用い
てある音声区間内(例えば、一単語)において前後の単
位音声の距離の和が最小となるように単位音声名を選択
する手段と前記選択された単位音声名に従って該単位音
声波形を編集し音声の合成を行う手段とを含むことを特
徴とする規則型音声合成装置。
means for decomposing an input phoneme string representing speech to be synthesized into unit phonetic name strings such as CV and VC; means for calculating pitch frequency data from the input phoneme string according to a pitch rule; means for selecting an allophone subset according to pitch frequency data; a memory for storing boundary data of each unit voice such as pitch and formant; a memory for storing each unit voice waveform; Means for selecting a unit phonetic name from a subset using the boundary data so that the sum of the distances of preceding and following unit phonetics within a certain phonetic interval (for example, one word) is minimum; and the selected unit phonetic name. and means for editing the unit speech waveform and synthesizing speech according to the following.
JP62002247A 1987-01-07 1987-01-07 Unit voice editing type rule synthesizer Pending JPS63169700A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP62002247A JPS63169700A (en) 1987-01-07 1987-01-07 Unit voice editing type rule synthesizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP62002247A JPS63169700A (en) 1987-01-07 1987-01-07 Unit voice editing type rule synthesizer

Publications (1)

Publication Number Publication Date
JPS63169700A true JPS63169700A (en) 1988-07-13

Family

ID=11524024

Family Applications (1)

Application Number Title Priority Date Filing Date
JP62002247A Pending JPS63169700A (en) 1987-01-07 1987-01-07 Unit voice editing type rule synthesizer

Country Status (1)

Country Link
JP (1) JPS63169700A (en)

Similar Documents

Publication Publication Date Title
US10008193B1 (en) Method and system for speech-to-singing voice conversion
CA2296330C (en) Generation of voice messages
JPS63285598A (en) Phoneme connection type parameter rule synthesization system
JPS62160495A (en) Voice synthesization system
JPH06266390A (en) Waveform editing type speech synthesizing device
US6424937B1 (en) Fundamental frequency pattern generator, method and program
JP4225128B2 (en) Regular speech synthesis apparatus and regular speech synthesis method
JP2003345400A (en) Method, device, and program for pitch conversion
JP3109778B2 (en) Voice rule synthesizer
JP5175422B2 (en) Method for controlling time width in speech synthesis
JPS63169700A (en) Unit voice editing type rule synthesizer
JPH09319394A (en) Voice synthesis method
JP3081300B2 (en) Residual driven speech synthesizer
JP3059751B2 (en) Residual driven speech synthesizer
JPH09179576A (en) Voice synthesizing method
JPS58129500A (en) Singing voice synthesizer
JP2577372B2 (en) Speech synthesis apparatus and method
JP3437472B2 (en) Speech synthesis method and apparatus
JP2987089B2 (en) Speech unit creation method, speech synthesis method and apparatus therefor
JP2002244693A (en) Device and method for voice synthesis
JP2586040B2 (en) Voice editing and synthesis device
JP2573586B2 (en) Rule-based speech synthesizer
JPS60153099A (en) Rule type voice synthesizer
JP2573585B2 (en) Speech spectrum pattern generator
JPS62283398A (en) Rule voice synthesization