JPH05204390A

JPH05204390A - Accent giving device and voice synthesis device

Info

Publication number: JPH05204390A
Application number: JP4010359A
Authority: JP
Inventors: Kiyo Hara; 紀代原; Yuriko Suruga; 由里子駿河
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1992-01-23
Filing date: 1992-01-23
Publication date: 1993-08-13

Abstract

PURPOSE:To provide a very natural synthetic voice by adding, and emphasis using a voice synthetic speech process and an accent giving method. CONSTITUTION:The device has a text inputting means which inputs a text, a morpbens processing means 1a which separates the inputted text into morphems a dictionary means 1b which stores a dictionary information which is referred by the means 1a, an accent giving means 1d which gives accents and an intonation selecting means 1e which selects intonation. The dictionary information stored in the means 1b holds two accent informations, i.e., a general accent information and an accent information when an emphasis is given. The means 1d gives an accent using these accent informations and based on the intonation selected by the means 1e.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音声合成装置で利用さ
れる言語処理手法、特にアクセント付与の装置に関する
ものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a language processing method used in a speech synthesizer, and more particularly to an accent imparting device.

【０００２】[0002]

【従来の技術】従来の音声規則合成装置としては、例え
ば、古井：ディジタル音声処理 p.146（東海大学出版会
1985）に示されている装置が知られている。図６はこ
の従来の音声合成装置の構成を示すブロック図である。
文字列入力端０には漢字かな混じり文データが入力され
る。形態素処理部１ａでは、入力された漢字かな混じり
文が辞書１ｂを用いて形態素に分割され、各形態素の読
み・アクセント型・品詞等が付与される。構文解析部で
１ｃは、形態素処理部１ａで得られた各形態素の情報を
用いて文節の決定を行い、文節間の係受け解析を行う。
アクセント処理部１ｄでは、アクセント句の決定、アク
セント位置の決定、ポーズやイントネーション立て直し
位置の決定を行う。これら形態素処理部１ａ、辞書１
ｂ、構文解析部１ｃ、アクセント処理部１ｄは、言語処
理部１を構成している。音響処理部２では、言語処理部
１で得られた読みとアクセントの情報に基づいて合成パ
ラメータを作成する。この合成パラメータには、音声の
大きさを決める振幅、声道の状態を決める声道記述パラ
メータ（ＰＡＲＣＯＲ係数やホルマント周波数など）、
声帯の状態を決める有声／無声判定フラグ、声の高さを
決める基本周波数等がある。合成処理部３は、音響処理
部２で得られた合成パラメータ列を音声波形に変換し、
合成音出力端４に音声波形を得る。2. Description of the Related Art As a conventional speech rule synthesizer, for example, Furui: Digital Speech Processing p.146 (Tokai University Press)
The device shown in 1985) is known. FIG. 6 is a block diagram showing the configuration of this conventional speech synthesizer.
Kanji and kana mixed sentence data is input to the character string input terminal 0. In the morpheme processing unit 1a, the input kanji / kana mixed sentence is divided into morphemes using the dictionary 1b, and the reading / accent type / part of speech of each morpheme is added. The syntactic analysis unit 1c uses the information of each morpheme obtained by the morpheme processing unit 1a to determine a phrase and performs a dependency analysis between the phrases.
The accent processing unit 1d determines accent phrases, accent positions, and poses and intonation upright positions. These morpheme processing unit 1a and dictionary 1
b, the syntax analysis unit 1c, and the accent processing unit 1d constitute the language processing unit 1. The sound processing unit 2 creates a synthesis parameter based on the reading and accent information obtained by the language processing unit 1. This synthesis parameter includes an amplitude that determines the volume of the voice, a vocal tract description parameter that determines the state of the vocal tract (PARCOR coefficient, formant frequency, etc.),
There is a voiced / unvoiced determination flag that determines the state of the vocal cords, and a fundamental frequency that determines the pitch of the voice. The synthesis processing unit 3 converts the synthesis parameter sequence obtained by the acoustic processing unit 2 into a voice waveform,
A voice waveform is obtained at the synthetic sound output terminal 4.

【０００３】また図７は、辞書１ｂの構成および例を示
した図である。１単語毎に、表記・読み・品詞番号・前
接続番号・後接続番号・アクセント・結合アクセントの
情報を持つ。前接続番号・後接続番号は、形態素の接続
チェックを行うために与えられたもので、例えば「思い
ます」を形態素解析した場合、「思い」に対して名詞／
５段動詞連用形の２候補があるが、「ます」は名詞には
接続しないので５段動詞が選択される。アクセントは、
その形態素が単独で発声された場合のアクセント型を示
し、付属語では持たないものが多い。結合アクセント型
は、複合語や文節を構成する際の前接部のアクセントへ
の影響の仕方を示す。FIG. 7 is a diagram showing a configuration and an example of the dictionary 1b. For each word, it has information of notation, reading, part-of-speech number, pre-connection number, post-connection number, accent, and combined accent. The pre-connection number and post-connection number are given to check the connection of morphemes. For example, when "I think" is morphologically analyzed, the noun /
Although there are two candidates for the 5-dan verb continuous form, since "masu" is not connected to a noun, the 5-dan verb is selected. The accent is
It shows the accent type when the morpheme is uttered alone, and often does not have an adjunct. The combined accent type indicates how the front part influences the accent when composing a compound word or phrase.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、このよ
うな従来の音声規則合成装置は、ＷＰ文章の読み合わせ
や公共案内放送等いろいろな分野で利用されつつある。
合成された音声の個々の音節が理解できるという明瞭性
の観点からは、かなり実用レベルに迫ってきているが、
アクセントの位置と発声モーラ数が同じならば同じイン
トネーションが付与されたり、卓立や発声速度の揺らぎ
等が加味されておらず、自然性という観点からは、非常
に単調で機械的であるといわざるを得ない。However, such a conventional speech rule synthesizing device is being used in various fields such as reading WP sentences and public guide broadcasting.
From the viewpoint of clarity that individual syllables of synthesized speech can be understood, it is approaching a practical level.
If the position of the accent and the number of vocal morae are the same, the same intonation is not given, and the standup and fluctuation of the vocal speed are not added, and from the viewpoint of naturalness, it is said to be very monotonous and mechanical. I have no choice.

【０００５】本発明は、かかる従来の音声規則合成装置
の課題に鑑みてなされたもので、規則合成音の単調性を
なくし、高品質の規則合成音を出力できるアクセント付
与装置を提供することを目的としている。The present invention has been made in view of the problems of the conventional voice rule synthesizing device, and provides an accent imparting device capable of outputting a high quality rule synthesizing voice without monotonicity of the rule synthesizing voice. Has a purpose.

【０００６】[0006]

【課題を解決するための手段】本発明は、テキストを入
力するテキスト入力手段と、入力されたテキストを形態
素に分割する形態素処理手段と、形態素処理手段で参照
する辞書情報を格納した辞書手段と、アクセントを付与
するアクセント付与手段とを備えたアクセント付与装置
に於て、音調を選択する音調選択手段と、複数種類のア
クセント情報を保持する記憶手段とを有し、音調選択手
段で選択された音調にしたがって、アクセント情報を利
用してアクセント付与手段がアクセントを付与するアク
セント付与装置である。According to the present invention, there are provided text input means for inputting text, morpheme processing means for dividing the input text into morphemes, and dictionary means for storing dictionary information referred to by the morpheme processing means. , An accent imparting device having an accent imparting means for imparting an accent, having a tone selecting means for selecting a tone and a storing means for holding a plurality of types of accent information, and selecting by the tone selecting means. This is an accent imparting device in which the accent imparting means imparts an accent in accordance with a tone by using accent information.

【０００７】[0007]

【作用】本発明では、複数種類のアクセント情報、例え
ば、一般的なアクセントと強調発声した際の２通りのア
クセント型を保持し、選択された音調に従ってどちらの
アクセント型を使うかを決定し、一般的なアクセントも
しくは強調的なアクセントを付与することにより、規則
合成音の単調さを軽減し自然性の高い合成音を提供す
る。In the present invention, a plurality of types of accent information, for example, a general accent and two types of accent types when emphasized are held, are determined, and which accent type is used according to the selected tone, By adding a general accent or an emphasized accent, it is possible to reduce the monotonicity of the regular synthetic sound and provide a synthetic sound with high naturalness.

【０００８】また、乱数を発生させて、例えば、強調読
み、一般読みを選択することにより変化に富んだ合成音
を提供する。Further, a random number is generated and, for example, emphasized reading or general reading is selected to provide a synthetic voice rich in variation.

【０００９】また、形態素の品詞情報や辞書に記載され
た情報に従って、例えば、強調読み、一般読みを選択す
ることにより自然性の高い合成音を提供する。Further, according to the part-of-speech information of the morpheme or the information written in the dictionary, for example, the emphasized reading or the general reading is selected to provide a synthetic sound with high naturalness.

【００１０】また、複数の声質が存在する場合選択され
た声質に従って、例えば、強調読み、一般読みを選択す
ることにより声質間の差異をより明確にし、高品質な合
成音を提供する。Further, when a plurality of voice qualities are present, the difference between voice qualities is made clearer by selecting, for example, emphasized reading or general reading according to the selected voice qualities, and a high-quality synthesized voice is provided.

【００１１】[0011]

【実施例】以下、本発明の実施例について図面を参照し
て説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１２】図１は、請求項１の本発明のアクセント付
与装置の一実施例を利用した音声合成装置の構成を示す
ブロック図であって、以下にその構成をその作用ととも
に説明する。文字列入力端０には漢字かな混じり文が入
力される。形態素処理部１ａでは、入力された漢字かな
混じり文が辞書１ｂを用いて形態素に分割され、各形態
素の読み・アクセント型・品詞等が付与される。構文解
析部１ｃでは、形態素処理部１ａで得られた各形態素の
情報を用いて文節の決定を行い、文節間の係受け解析を
行う。アクセント処理部１ｄでは、アクセント句の決
定、アクセント位置の決定、ポーズやイントネーション
立て直し位置の決定を行う。音調選択部１ｅは、メリハ
リのきいた強調的な発声で合成するか一般的な発声にす
るかを選択するための手段であって、５はその選択情報
の入力端子である。その選択情報は、アクセント処理部
１ｄへ入力されている。これら、形態素処理部１ａ、辞
書１ｂ、構文解析部１ｃ、アクセント処理部１ｄ、音調
選択部１ｅで言語処理部１を構成している。FIG. 1 is a block diagram showing the construction of a speech synthesizer using an embodiment of the accent imparting apparatus of the present invention according to claim 1, and the construction will be described below together with its operation. A kanji-kana mixed sentence is input to the character string input terminal 0. In the morpheme processing unit 1a, the input kanji / kana mixed sentence is divided into morphemes using the dictionary 1b, and the reading / accent type / part of speech of each morpheme is added. The syntactic analysis unit 1c uses the information of each morpheme obtained by the morpheme processing unit 1a to determine a phrase and performs a dependency analysis between the phrases. The accent processing unit 1d determines accent phrases, accent positions, and poses and intonation upright positions. The tone selection section 1e is a means for selecting whether to synthesize by a emphasized voicing with sharpness or general voicing, and 5 is an input terminal of the selection information. The selection information is input to the accent processing unit 1d. The morpheme processing section 1a, the dictionary 1b, the syntax analysis section 1c, the accent processing section 1d, and the tone selection section 1e constitute the language processing section 1.

【００１３】音響処理部２では、このような言語処理部
１で得られた読みとアクセントの情報に基づいて合成パ
ラメータを作成する。この合成パラメータは、音声の大
きさを決める振幅、声道の状態を決める声道記述パラメ
ータ（ＰＡＲＣＯＲ係数やホルマント周波数など）、声
帯の状態を決める有声／無声判定フラグ、声の高さを決
める基本周波数等がある。合成処理部３は、音響処理部
２で得られた合成パラメータ列を音声波形に変換し、合
成音出力端子４に音声波形を得る。本実施例では音響処
理部２、合成処理部３の方式については、特に限定しな
い。The sound processing unit 2 creates a synthesis parameter based on the reading and accent information obtained by the language processing unit 1. This synthesis parameter is an amplitude that determines the volume of the voice, a vocal tract description parameter that determines the state of the vocal tract (PARCOR coefficient, formant frequency, etc.), a voiced / unvoiced determination flag that determines the state of the vocal cord, and a basic that determines the pitch of the voice. Frequency etc. The synthesis processing unit 3 converts the synthesis parameter sequence obtained by the acoustic processing unit 2 into a voice waveform, and obtains a voice waveform at the synthetic voice output terminal 4. In this embodiment, the methods of the sound processing unit 2 and the synthesis processing unit 3 are not particularly limited.

【００１４】図２は、本実施例における辞書１ｃの付属
語辞書の構成および例を示すブロック図である。従来例
の辞書項目に加えて強調アクセント型をもつ。強調アク
セント型とは、卓立や独自アクセント句形成の可能性に
関する情報で、１以上の場合にはその可能性があること
を示す。FIG. 2 is a block diagram showing the structure and an example of the auxiliary word dictionary of the dictionary 1c in this embodiment. In addition to the dictionary items of the conventional example, it has an accent accent type. The emphasized accent type is information about the possibility of excellence and the formation of unique accent phrases, and if the number is 1 or more, the possibility exists.

【００１５】各処理の詳細について具体例を用いて説明
する。「これは、音声合成装置です。」という文章が入
力された場合について考える。形態素処理部１ａにより
入力文章は以下のように形態素分割され、アクセントや
読みの情報を得る。ここで、「は」「です」に対して与
えられている結合アクセント型Ａやｂは、ＮＨＫアクセ
ント辞典・解説付録（日本放送出版会 1985年）に記載
されているもので、自立語と結合して文節を構成する際
の結合アクセント型を示したものである。また、各単語
のアクセント型は、アクセントのある音節位置を示した
ものである。自立語の結合アクセント型は、複合語の後
続単語になった時のアクセント型を示す。強調アクセン
ト型は、この例では、格助詞「は」だけに設定されてい
る。さらに構文解析部１ｃにより文節境界が決定され、
文節間の係受け（文節間距離）が決定される。本例で
は、文節「これは」は、直後の文節にかかるので文節間
距離は１となる。アクセント処理部１ｄでは、アクセン
ト句およびアクセント位置の決定を行う。本例では、
「これは」の部分は平板型、「音声合成装置です」の部
分は、９型で９音節めすなわち「装置」の「そ」にアク
セントがある（言語処理部・処理結果の「so」の後部に
付加された「1 」はアクセントのある音節を示す）。（入力文章）「これは、音声合成です。」（形態素分割）これ／は／、／音声／合成／装置です。（読み）コレワオンセーコ゛ーセーソーチテ゛ス（アクセント）０ − １００ − （結合アクセント）１Ａ１１１ｂ（強調アクセント） − １ − − − − （品詞）代名係助名名名助動（文節）これは、／音声合成装置です。（文節間距離）１（アクセント句）これは、／音声合成装置です。（文節アクセント）０９（言語処理部出力１） ko re wa poz o nn se e go o se e so1 o ti de su （言語処理部出力２） ko re wa' poz o nn se e go o se e so1 o ti de su ここで、poz はこの位置にポーズが挿入されイントネー
ションの立て直しが行れることを示す。言語処理部出力
１は、一般読みの場合で、出力２は強調読みの場合を示
す。強調読みの場合は、[wa]に付加された[']が卓立を
示し、音響処理部２でイントネーションが付与される際
に、卓立として[wa]の基本周波数が一般読みの場合より
高めに設定される。Details of each process will be described using a specific example. Consider the case where the sentence "This is a speech synthesizer." Is entered. The input sentence is morpheme-divided by the morpheme processing unit 1a as follows to obtain accent and reading information. Here, the combined accent types A and b given to "ha" and "desu" are those listed in the NHK Accent Dictionary / Explanatory Appendix (Japan Broadcasting Corporation 1985), and combined with independent words. It shows the combined accent type when constructing a phrase. The accent type of each word indicates a syllable position with an accent. The combined accent type of an independent word indicates an accent type when it becomes a subsequent word of a compound word. In this example, the emphasized accent type is set only to the case particle “ha”. Further, the syntactic analysis unit 1c determines the phrase boundary,
Dependence between phrases (distance between phrases) is determined. In this example, the bunsetsu “korewa” is applied to the bunsetsu immediately after, so that the bunsetsu distance is 1. The accent processing unit 1d determines the accent phrase and the accent position. In this example,
"This is" part is flat type, "Speech synthesizer" part is 9 type, 9 syllables, that is, "device" with "accent" (language processing unit / process result "so" The "1" added to the back indicates accented syllables). (Input sentence) "This is voice synthesis." (Morpheme division) This is /, /, voice / synthesis / device. (Reading) Korewa Onsei Kosei Sauce Dise (Accent) 0-1 10 0- (Joined Accent) 1 A 1 1 1 1 1 b 1 1 1 1 1 b (Emphasis Accent) -1 ------ (Part of speech) (Phrase) This is a / voice synthesizer. (Phrase distance) 1 (Accent phrase) This is a / voice synthesizer. (Phrase accent) 09 (Language processing unit output 1) ko re wa poz o nn se e go o se e so1 o ti de su (Language processing unit output 2) ko re wa ' poz o nn se e go o se e so1 o ti de su Here, poz indicates that a pose can be inserted at this position to restore the intonation. The language processing unit output 1 indicates the case of general reading, and the output 2 indicates the case of emphasized reading. In the case of emphasized reading, ['] added to [wa] indicates superiority, and when the intonation is given by the sound processing unit 2, the fundamental frequency of [wa] is increased as excellence as compared with the case of general reading. It is set higher.

【００１６】強調の対象となる形態素が１音節の場合は
卓立として処理されるが、２音節以上の場合には、独自
のアクセント句を形成する事になる。例えば「僕ではわ
からない。」という文章を処理した場合、一般読みで
は、「bo1 ku de wa poz wa kara1 na i 」となり、強
調読みでは「bo1 ku / de1 wa poz wa ka ra1 na i」と
なる。ここで「／」は、アクセント句の切れ目を意味す
る。When the morpheme to be emphasized is one syllable, it is processed as outstanding, but when it is more than two syllables, an original accent phrase is formed. For example, if you process the sentence "I don't understand," it will be "bo1 ku de wa poz wa kara1 na i" in general reading, and "bo1 ku / de1 wa poz wa ka ra1 na i" in emphasized reading. Here, "/" means a break in the accent phrase.

【００１７】このように本実施例によれば、一般的なア
クセント（イントネーション）と卓立や強調を加味した
アクセントを選択することが出来、規則合成音の機械的
な単調さを軽減し、自然性の高い合成音を提供すること
ができる。As described above, according to the present embodiment, it is possible to select a general accent (intonation) and an accent in which standup and emphasis are added, and to reduce the mechanical monotonicity of the rule-synthesized sound and to make it natural. It is possible to provide a synthetic sound with high property.

【００１８】実施例２図３は、請求項２記載の本発明のアクセント付与装置を
利用した音声合成装置に関する実施例の構成を示したブ
ロック図である。なお図１の実施例と共通する要素には
同一番号をつけ、その説明を省略する。音調選択端子５
に代えて乱数発生手段１ｆが存在する。乱数発生手段１
ｆで発生された乱数にしたがって、音調選択部１ｅで音
調を選択するようになっている。乱数を発生させて、強
調読み、一般読みを選択することにより変化にとんだ合
成音を提供できる。Embodiment 2 FIG. 3 is a block diagram showing the configuration of an embodiment of a voice synthesizing apparatus using the accent imparting apparatus of the present invention as defined in claim 2. The same elements as those in the embodiment of FIG. 1 are designated by the same reference numerals and the description thereof will be omitted. Tone selection terminal 5
There is a random number generating means 1f instead. Random number generator 1
The tone selection section 1e selects a tone according to the random number generated in f. By generating a random number and selecting emphasized reading or general reading, it is possible to provide a synthetic sound that changes.

【００１９】実施例３図４は、請求項３記載の本発明のアクセント付与装置を
利用した音声合成装置に関する実施例の構成を示したブ
ロック図である。なお図１の実施例と共通する要素には
同一番号をつけ、その説明を省略する。音調選択端子５
に代えて、辞書１ｂから得られる情報にしたがって音調
を決定するようになっている。たとえば、品詞情報を用
いて、名詞・固有名詞に続く場合は強調発声とするが、
その他の品詞では強調しないとする。「これは音声合成
装置です。」の場合の「は」は一般的な発声となるが、
「音声合成装置はこれです。」という文章では、「は」
は卓立発声される。本実施例では、品詞情報を用いて音
調を決定したが、これは本発明を何等拘束するものでは
ない。たとえば、ユーザ辞書の単語をすべて強調発声し
たり、辞書項目毎に強調発声されるという情報を新たに
持つことも可能である。このように、形態素の品詞情
報や辞書に記載された情報に従って強調読み、一般読み
を選択することにより自然性の高い合成音を提供でき
る。Embodiment 3 FIG. 4 is a block diagram showing the construction of an embodiment of a voice synthesizing apparatus using the accent imparting apparatus of the present invention as defined in claim 3. The same elements as those in the embodiment of FIG. 1 are designated by the same reference numerals and the description thereof will be omitted. Tone selection terminal 5
Instead of this, the tones are determined according to the information obtained from the dictionary 1b. For example, using part-of-speech information, when following a noun / proper noun, it is emphasized,
No emphasis is placed on other parts of speech. In the case of "This is a speech synthesizer", "ha" is a general utterance,
In the sentence "This is the voice synthesizer.", "Ha"
Is uttered outstandingly. In the present embodiment, the tone is determined using the part-of-speech information, but this does not restrict the present invention. For example, it is possible to emphasize all the words in the user dictionary, or to newly have information that each word is emphasized. In this way, a synthetic sound with high naturalness can be provided by highlighting reading according to morpheme part-of-speech information or information written in a dictionary and selecting general reading.

【００２０】実施例４図５は、請求項４記載の本発明の音声合成装置の実施例
の構成を示したブロック図である。図１の場合の音調選
択端子５に代えて、声質選択端子６およびその端子６に
接続された声質選択部７を有している。声質選択部７
は、音調選択部１ｅ及び音響処理部２に接続されてい
る。合成音の声質が複数用意されている場合、声質選択
部７によって声質が選択されると、その選択にしたがっ
て、自動的に音調選択部１ｅで音調を選択することがで
きる。アクセント処理部１ｄは、それを利用してアクセ
ント処理を行う。このように、強調読み、一般読みを選
択することにより声質間の差異をより明確にし、高品質
な合成音を提供できる。Embodiment 4 FIG. 5 is a block diagram showing the configuration of an embodiment of the speech synthesizer of the present invention according to claim 4. Instead of the tone selection terminal 5 in the case of FIG. 1, it has a voice quality selection terminal 6 and a voice quality selection unit 7 connected to the terminal 6. Voice quality selector 7
Is connected to the tone selection unit 1e and the sound processing unit 2. When a plurality of voice qualities of the synthetic sound are prepared, when the voice quality is selected by the voice quality selection unit 7, the tones selection unit 1e can automatically select the tone according to the selection. The accent processing unit 1d uses it to perform accent processing. As described above, by selecting the emphasized reading and the general reading, the difference between voice qualities can be made clearer and a high-quality synthesized speech can be provided.

【００２１】なお、本発明の各手段は、コンピュータを
用いてソフトウェア的に実現し、あるいはそれら各機能
を有する専用のハード回路を用いて実現してもかまわな
い。Each means of the present invention may be realized by software using a computer, or may be realized by using a dedicated hardware circuit having each of these functions.

【００２２】また、本発明のアクセント情報としては、
上記した強調読みなどに限らず、複数種類のアクセント
情報があればよい。As the accent information of the present invention,
The accent reading is not limited to the above-described emphasized reading, and any type of accent information may be used.

【００２３】また、それらアクセント情報は、必ずしも
辞書に存在する必要はなく、別の記憶手段に格納されて
いてもよい。The accent information need not always exist in the dictionary and may be stored in another storage means.

【００２４】また、本発明のアクセント付与装置、音声
合成装置は、構文解析を必ず経なければならないという
わけではない。Further, the accent imparting device and the voice synthesizing device of the present invention do not necessarily have to undergo the syntax analysis.

【００２５】[0025]

【発明の効果】以上のように本発明によれば、イントネ
ーションの基本となるアクセントを卓立や強調を加味し
て付与することが出来、規則合成音の機械的な単調さを
軽減し、了解性・自然性の高い効果的な合成音を提供す
ることが出来る。As described above, according to the present invention, it is possible to add an accent, which is the basis of intonation, in consideration of prominence and emphasis, and reduce the mechanical monotony of the rule-synthesized sound. It is possible to provide an effective synthetic sound that is highly natural and natural.

[Brief description of drawings]

【図１】請求項１の本発明の実施例にかかるアクセント
付与装置の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of an accent imparting device according to an embodiment of the present invention according to claim 1.

【図２】同実施例の付属語辞書の構成を示すブロック図
である。FIG. 2 is a block diagram showing a configuration of an auxiliary word dictionary of the embodiment.

【図３】請求項２の本発明の実施例にかかるアクセント
付与装置の構成を示すブロック図である。FIG. 3 is a block diagram showing a configuration of an accent imparting device according to an embodiment of the present invention according to claim 2.

【図４】請求項３の本発明の実施例にかかるアクセント
付与装置の構成を示すブロック図である。FIG. 4 is a block diagram showing a configuration of an accent imparting device according to an embodiment of the present invention according to claim 3;

【図５】請求項４の本発明の実施例にかかる音声合成装
置の構成を示すブロック図である。FIG. 5 is a block diagram showing a configuration of a voice synthesizing apparatus according to an embodiment of the present invention according to claim 4.

【図６】従来例の音声合成装置の構成を示すブロック図
である。FIG. 6 is a block diagram showing a configuration of a conventional speech synthesizer.

【図７】従来例の付属語辞書の構成を示すブロック図で
ある。FIG. 7 is a block diagram showing the structure of a conventional auxiliary word dictionary.

[Explanation of symbols]

０テキスト入力端子１言語処理部（手段）１ａ形態素処理部（手段）１ｂ辞書１ｃ構文解析部（手段）１ｄアクセント処理部（手段）１ｅ音調選択部（手段）１ｆ乱数発声部（手段）２音響処理部（手段）３合成処理部（手段）４合成音出力端子５音調入力端子６声質入力端子７声質選択部（手段） 0 text input terminal 1 language processing unit (means) 1a morphological processing unit (means) 1b dictionary 1c syntactic analysis unit (means) 1d accent processing unit (means) 1e tone selection unit (means) 1f random number vocalization unit (means) 2 sound Processing unit (means) 3 Synthesis processing unit (means) 4 Synthetic sound output terminal 5 Tonal input terminal 6 Voice quality input terminal 7 Voice quality selection section (means)

Claims

[Claims]

1. A text input unit for inputting text, a morpheme processing unit for dividing the input text into morphemes, a dictionary unit for storing dictionary information referred to by the morpheme processing unit, and an accent imparting for imparting an accent. An accent imparting device including means, comprising: a tone selection means for selecting a tone and a storage means for holding a plurality of types of accent information, and the accent according to the tone selected by the tone selection means. An accent imparting device, wherein the accent imparting means imparts an accent using information.

2. A text input means for inputting text, a morpheme processing means for dividing the input text into morphemes, a dictionary means for storing dictionary information referred to by the morpheme processing means, and an accent attachment for giving an accent. An accent applying device including means, a random number generating means for generating a random number, a tone selecting means for selecting a tone,
Storage means for storing a plurality of types of accent information, selecting a tone by the tone selection means according to a random number generated by the random number generation means, and the accent according to the tone selected by the tone selection means. An accent imparting device, wherein the accent imparting means imparts an accent using information.

3. Text input means for inputting text, morpheme processing means for dividing the input text into morphemes, dictionary means for storing dictionary information referred to by the morpheme processing means, and accent addition for giving accent In the accent imparting device including means, there is provided a tone selection means for selecting a tone and a storage means for holding accent information relating to the dictionary information, and the tone selection is performed by the tone selection means from the accent information. An accent imparting device, characterized in that the accent imparting means imparts an accent in accordance with the tone selected by the tone selecting means.

4. The accent imparting device according to claim 1, wherein the dictionary means also serves as the storage means.

5. A text input means for inputting text, a morpheme processing means for dividing the input text into morphemes, a dictionary means for storing dictionary information referred to by the morpheme processing means, and an accent attachment for giving an accent. Means and a voice synthesizing means for synthesizing a voice, in an accent imparting device, a tone selecting means for selecting a tone, a voice quality selecting means for selecting a voice quality of a synthesized voice, and a storage means for holding a plurality of types of accent information. The tone quality selecting means selects a tone according to the voice quality selected by the tone quality selecting means, and the accent assigning means using the accent information according to the tone selected by the tone quality selecting means. A voice synthesizer characterized by adding an accent.

6. The speech synthesizer according to claim 5, wherein the dictionary means also serves as the storage means.