JPH0335296A

JPH0335296A - Text voice synthesizing device

Info

Publication number: JPH0335296A
Application number: JP1170230A
Authority: JP
Inventors: Jiyungo Kitou; 鬼頭　淳悟; Nobuyoshi Umiki; 延佳海木
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1989-06-30
Filing date: 1989-06-30
Publication date: 1991-02-15

Abstract

PURPOSE:To prevent difficult words or homonyms from being caught in wrong meanings by changing the difficult words or homonyms by using plain words in accordance with the results of the analysis by a character and symbol string analyzing section and outputting the synthetic voices of input sentences. CONSTITUTION:A difficult word analyzing and changing section 33 which selects the plain words having the same meanings as the meanings of the difficult words or homonyms extracted in accordance with the results of the analysis by the character and symbol string analyzing section 32, changes the input character symbol strings by using the selected plain words and outputs the same to a synthetic voice parameter forming section 34 is provided to output the synthetic voices of the input sentences changed to the styles consisting of the plain words. The input sentences of the styles of writing words are changed to the synthetic voices by the styles of the easily understandable natural speaking words and such voices are outputted in this way and, therefore, the difficult words or homonyms are conveyed in the correct meanings.

Description

【発明の詳細な説明】〈産業上の利用分野〉この発明は、任意の文字記号列から成る入力文章を音声
に変換するテキスト音声合成装置に関する。DETAILED DESCRIPTION OF THE INVENTION <Industrial Application Field> The present invention relates to a text-to-speech synthesis device that converts an input sentence consisting of an arbitrary string of characters and symbols into speech.

〈従来の技術〉従来、テキスト音声合成装置によって文字記号列から成
る文章を合成音声に変換する際には、入力された任意の
文字記号列に正しい読み、アクセントおよびイント不一
ノヨンを付加して音声合成用パラメータを生成する。そ
して、この生成された音声合成用パラメータに基ついて
ｎ声を合成して出力するようにしている。<Prior art> Conventionally, when a text-to-speech synthesizer converts a sentence consisting of a character string into synthesized speech, it adds the correct pronunciation, accent, and intonation to any input character string. Generate parameters for speech synthesis. Then, based on the generated voice synthesis parameters, n voices are synthesized and output.

すなわち、第３図に才５いて、文字記号列入力部１に文
字記号列（例えば日本語漢字仮名混じり文）が入力され
て文字記号列解析部２に送られる。そうすると、文字記
号列解析部２（よ、後に詳述するようにして入力文字記
号９１１の形態素（単語）解析構文解析および色味解Ｉ
ｆ〒等を行う。合成、キルパラメータ生成部３は、韻律
を制御するｌ二めに、文字記号列解析部２における形態
素解析によって同定された各単語のアクセントや構文構
造から単語が連鎖した際の文節や呼気段落のアクセント
やポーズの設定を行う。また′、さらに発声音声に対応
した合成単位に対する継続時間、ピンチパターン、パワ
ーパターンおよび音韻特徴パラメータ（偏自己相関係数
、線スペクトル対、ホルマント等）のパラメータ時系列
を得る。そうすると、音声合成部４は上記音声合成用の
パラメータ時系列に基づいて実際の合成音声波形を生成
して合成音声出力部５から出力する。That is, as shown in FIG. 3, a character and symbol string (for example, a sentence containing Japanese, kanji, and kana) is input to the character and symbol string input section 1 and sent to the character and symbol string analysis section 2. Then, the character symbol string analysis unit 2 (as will be described in detail later) performs morpheme (word) analysis and syntactic analysis of the input character symbol 911 and color solution I.
Perform f〒 etc. The synthesis and kill parameter generation unit 3 controls the prosody.Secondly, the character-symbol string analysis unit 2 analyzes the accent and syntactic structure of each word and determines the structure of the phrase or exhalation paragraph when the words are chained. Set accents and poses. Furthermore, parameter time series of duration, pinch pattern, power pattern, and phonetic feature parameters (partial autocorrelation coefficient, line spectrum pair, formant, etc.) for the synthesis unit corresponding to the uttered speech are obtained. Then, the speech synthesis section 4 generates an actual synthesized speech waveform based on the parameter time series for speech synthesis, and outputs it from the synthesized speech output section 5.

第４図は文字記号列解析部２の更に詳細なブロック図で
ある。文字記号列解析部２は形態素解析部２１、構文解
析部２２．意味解析部２３および辞書（単語辞書、意味
辞方、漢字辞書、記号辞書１品詞接続行列、接続禁止辞
書等）２４によって構成されている。Ｌ足形＠素解析部
２１は、文字記号列入力部１から入力された文字記号列
を辞書（１語静３！）漢字辞書、記号辞舛等）２４を用
いて形態素解析を行い単語を同定すると１（に、同定し
た単語の品詞等の文法情報やアクセントを得ろ。Ｌ記構
文解析部２２は形態素解析部２１によって同定された単
語の構文を辞書（品詞接続行列および接続禁止辞書等）
２４を用いて決定する。また、上記意味解析部２３は入
力された文字記号列の色味を辞書（意味辞書等）２４を
用いて決定する。ただし、文字記号列解析部２は必ずし
も形態素解析部２１．構文解析部２２．意味解析部２３
および辞書２４によって構成する必要はない。すなわち
、必要に応じて形態素解析部２１．構文解析部２２およ
び辞書２４、あるいは、形態素解析部２１および辞書２
４のみによって構成してもよい。FIG. 4 is a more detailed block diagram of the character symbol string analysis section 2. The character symbol string analysis section 2 includes a morphological analysis section 21, a syntactic analysis section 22. It is composed of a semantic analysis section 23 and dictionaries (word dictionary, semantic dictionary, kanji dictionary, symbol dictionary 1 part-of-speech connection matrix, connection inhibition dictionary, etc.) 24. The L-footprint@element analysis unit 21 performs morphological analysis on the character-symbol string input from the character-symbol string input unit 1 using a dictionary (1 word static 3!, kanji dictionary, symbol dictionary, etc.) 24 to identify words. Then, in step 1, obtain grammatical information such as part of speech and accent of the identified word.
24. Further, the semantic analysis unit 23 determines the color of the input character symbol string using a dictionary (semantic dictionary, etc.) 24. However, the character symbol string analysis section 2 is not necessarily the morphological analysis section 21. Syntax analysis unit 22. Semantic analysis section 23
and dictionary 24. That is, the morphological analysis unit 21. Syntactic analysis unit 22 and dictionary 24, or morphological analysis unit 21 and dictionary 2
It may be configured by only 4.

すなわち、上記テキスト音声合成装置は、入力文章をそ
のまま合成音声によって読み上げるものである。That is, the text-to-speech synthesis device reads an input sentence as it is using synthesized speech.

〈発明が解決しようとする課題〉新聞や各種データベース等に見られる文章は目で読むこ
とを前提にして作成されており、その殆どが書き言葉の
文体によって表現されている。すなわら、通常耳にする
話し言葉の文体とは大きく異なり、難易な単語や同音異
義語が数多く含まれている。<Problem to be solved by the invention> Sentences found in newspapers, various databases, etc. are created on the premise that they are to be read with the naked eye, and most of them are expressed in the style of written words. In other words, the writing style is very different from the spoken language we normally hear, and it contains many difficult words and homonyms.

しかしながら、上記テキスト音声合成装置においては、
入力文章をそのまま合成音声によって読み上げるように
なっているので、新聞や各種データベースを入力した場
合には、書き言葉の文体での合成音声が出力される。し
たがって、その場合には通常聞き慣れている話し言葉の
文体とはかなり異なり、出力される合成音声を聞く人に
は堅く感じられ、難易な単語や同音異義語が間違った意
味に聞き取られる恐れがある。したがって、情報の出力
手段としての機能を十分果たしているとはいえないとい
う問題がある。However, in the above text-to-speech synthesizer,
Since the input text is read aloud as it is by synthesized speech, when a newspaper or various databases are input, synthesized speech in the style of written language is output. Therefore, in such a case, the writing style is quite different from the spoken language that one is accustomed to hearing, and the synthesized speech that is output may feel stiff to the listener, and difficult words or homophones may be misinterpreted as having the wrong meaning. . Therefore, there is a problem in that it cannot be said to function satisfactorily as an information output means.

そこで、この発明の目的は、入力された文章を分かり易
い自然な合成音声によって出力することができ、難易な
単語や同音異義語が間違った意味に聞き取られろ恐れの
ないテキスト音声合成装置を提供することにある。SUMMARY OF THE INVENTION Therefore, an object of the present invention is to provide a text-to-speech synthesizer that can output an input sentence as an easy-to-understand and natural synthesized voice, and that eliminates the fear that difficult words or homophones may be interpreted as having the wrong meaning. There is a particular thing.

く課題を解決するための手段〉上記目的を達成するため、この発明は、文章を構成する
文字記号列の形態素、構文、意味等を文字記号列解析部
で解析し、この解析結果に従って合成音声パラメータ生
成部で音声合成用パラメータを生成し、この音声合成用
パラメータに基づいて音声合成部で合成音声を生成して
出力するテキスト音声合成装置において、上記文字記号
列解析部による解析結果に基づいて入力文章の中から難
易語あるいは同音異義語を抽出し、この抽出した難易語
あるいは同音異義語と同じ意味の平易な単語を選出し、
選出された平易な単語を用いて入力文字記号列を変更し
、この変更された文字記号列を上記合成音声パラメータ
生成部に出力ずろ錐易語解析変更部を備えて、平５　ｔ
、ｉ　Ｑｉ語から成る文体に変更された入力文章の合成
音声を出力することを特徴としている。Means for Solving the Problems To achieve the above object, the present invention analyzes the morphemes, syntax, meanings, etc. of character strings constituting a sentence in a character symbol string analysis section, and generates synthesized speech according to the analysis results. In a text-to-speech synthesis device in which a parameter generation section generates speech synthesis parameters, and a speech synthesis section generates and outputs synthesized speech based on the speech synthesis parameters, Extract difficult words or homonyms from the input text, select simple words with the same meaning as the extracted difficult words or homonyms,
The input character symbol string is changed using the selected plain words, and the modified character symbol string is outputted to the synthesized speech parameter generation section.
, i Qi words.

また、上記テキストご声合成装置は、入力文章の文字記
号列を平易な文章の文字記号列に変更する際におけろ変
更のレベルを指定する指定信号を、上記難鴇語解析変更
部に対して出力する難易語変更制御部を備えろ一方、上
記難易語解析変更部を、上記難易語変更制御部からの上
記断定信号が入力された場合に、上記指定信号で指定さ
れた変更のレベルにある平易な単語を選出して入力文字
記号列を変更するように成すことが望ましい。In addition, the text speech synthesis device sends a specification signal to the difficult word analysis and modification unit to specify the level of change when changing the character symbol string of the input sentence to the character symbol string of a plain sentence. and a difficult word change control section that outputs the difficult word analysis and change section; It is desirable to select a certain plain word and change the input character string.

〈作用〉この発明のテキスト音声合成装置において、文章を構成
する文字記号列が文字記号列解析部に入力されて形態素
解析、構文解析、色味解析等が実行される。そし等、上
記文字記号列解析部による解析結果が難易語解析変更部
に入力されろ。そうすると、難易語解析変更部は、上記
文字記号列解析部による解析結果にｊ＋”；ついて入力
文冶の中から堆易語あるいは同キシε義語を抽出し、こ
の抽出さ！また難易、悟あるいは同音異義語と同じ色味
の平易なＩｎ語を選出する。そして、選出した）１Ｌ易
な単語を用いて入力文字記号列が変更されろ。上記・稚
易語解析変更部によって変更された文字記号列は合成音
声パラメータ生成部に入力され、この変更後の文字記号
列に従って音声合成用パラメータか生成される。そして
、音声合成部によって、」−、記音重合成用パラメータ
に基づいて合成音声か生成されて出力される。したかっ
て、平易な！Ｉｉ語を用いて平易な文体に変更された入
力文章の合成音声か出力される。<Operation> In the text-to-speech synthesis device of the present invention, a character-symbol string constituting a sentence is input to the character-symbol string analysis section, and morphological analysis, syntactic analysis, color analysis, etc. are performed. Then, input the analysis results from the character symbol string analysis section to the difficult word analysis and modification section. Then, the difficult word analysis and modification section extracts the easy words or synonyms from the input grammar based on the analysis result by the character symbol string analysis section, and this extraction! Alternatively, select a simple In word with the same color as the homophone.Then, change the input character symbol string using the selected) 1L easy word.Changed by the above-mentioned simple word analysis and modification section. The character-symbol string is input to the synthesized speech parameter generator, and parameters for speech synthesis are generated according to the changed character-symbol string.Then, the speech synthesizer generates "-," synthesized based on the parameters for phonetic overlapping synthesis. Sound is generated and output. It's simple if you want to! A synthesized speech of the input sentence changed into a plain writing style using Ii words is output.

また、難易語変更制御部から、難易語変更のレベルを指
定する指定信号が上記難易語解析変更部に対して出力さ
れるようにする。そして、−ヒ記難易語変更制御部から
上記指定信号が出力された場合に、この指定信号で指定
されたレベルにある平易な単語が上記難易語解析変更部
によって選出されて入力文字記号列が変更されるように
すれば、上記指定信号で指定されたレベルにある平易な
！ｐ語を用いて変更された入力文章の合成音声を出力て
きる。Further, the difficult word change control section outputs a designation signal specifying the level of difficult word change to the difficult word analysis and change section. Then, when the specified signal is output from the difficult word change control section, the simple word at the level specified by this specified signal is selected by the difficult word analysis and change section and the input character symbol string is changed. If you allow it to be changed, it will be at the level specified by the specified signal above! A synthesized speech of the input sentence modified using the p-word is output.

〈実施例〉以下、この発明を図示の実施例により詳細に説明する。<Example> Hereinafter, the present invention will be explained in detail with reference to illustrated embodiments.

第１図はこの発明のテキスト音声合成装置の一実施例を
示すブロック図である。FIG. 1 is a block diagram showing an embodiment of the text-to-speech synthesis apparatus of the present invention.

第１図において、文字記号列入力部３！に文字記号列（
例えば日本語漢字仮名混じり文）か入力されて文字記号
列解析部３２に送出される。そうすると、文字記号列解
析部３２は、入力された文字記号列の形態素解析、構文
解析および意味解析等を、上記従来例の場合と同様にし
て辞書を引いて行う。そして、解析された形態素（単語
）および各単語の品詞を出力する。その際に、解析した
結果が活用のある単語である場合には活用形等の文法情
報も併せて出力する。ここで、第１図においては省略し
ているが、文字記号列解析部＆２は、上記従来例と同様
に、第４図に示す形態素解析部２Ｉ、構文解析部２２．
意味解析部２３および辞占２４によって構成されている
。In FIG. 1, character symbol string input section 3! character string (
For example, a sentence containing Japanese, kanji, and kana is input and sent to the character/symbol string analysis section 32. Then, the character-symbol string analysis unit 32 performs morphological analysis, syntactic analysis, semantic analysis, etc. of the input character-symbol string by referring to a dictionary in the same manner as in the conventional example. Then, the analyzed morphemes (words) and the part of speech of each word are output. At that time, if the result of the analysis is a word with a conjugation, grammatical information such as the conjugation is also output. Here, although omitted in FIG. 1, the character symbol string analysis section &2 includes the morphological analysis section 2I, the syntactic analysis section 22.
It is composed of a meaning analysis section 23 and a dictionary 24.

一方、難易語から平易な単語への変更を指示するための
制御信号が制御信号入力部３７に入力されて難易語変更
制御部３８に送出される。そう４−ると、難易語変更制
御部３８は入力された制御信号を解析して種々の難易語
変更指令を難易語解析変更部３３に入力する。ここで、
上記種々の錐易語変更指令とは、難易語から平易な単語
への変更を行う／行わないの制御指令および難易語変更
を行う場合における難易語変更のレベルを指定する制御
指令等である。On the other hand, a control signal for instructing a change from a difficult word to a simple word is input to the control signal input section 37 and sent to the difficult word change control section 38. Then, the difficult word change control unit 38 analyzes the input control signal and inputs various difficult word change commands to the difficult word analysis and change unit 33. here,
The various easy-to-understand word change commands include a control command to perform/not perform a change from a difficult word to a simple word, and a control command to specify the level of a difficult word change when changing a difficult word.

上記難易語解析変更部３３は、難易語変更制御部３８か
らの難易語変更指令が入力されると、この難易語変更指
令に基づいて入力文章中の難易語あるいは同音異義語を
抽出し、抽出した単語を上記難易語変更指令に従って変
更することによって、入力文章を平易な文章に変更する
。第４図（よ上記錐易語解析変更部３３の更に詳細なブ
ロック図であり、難易語抽出部４１，１ｉｆｔ易語変更
部４２および類語辞書４３等によって構成されている。When a difficult word change command is input from the difficult word change control unit 38, the difficult word analysis and change unit 33 extracts a difficult word or a homophone from the input sentence based on the difficult word change command. The input sentence is changed to a simple sentence by changing the word in accordance with the difficult word change command. FIG. 4 is a more detailed block diagram of the easy-to-understand word analysis and change unit 33, which is composed of a difficult-to-understand word extraction unit 41, an easy-to-understand word change unit 42, a thesaurus dictionary 43, and the like.

］二記難難易抽出部４１は、類語辞書４３を検索して、
入力文章中に存在する雉易語あるいは同音異ａ１Ｍｉを
抽出する。上記難易語変更部４２は、難易語変更制御部
３８からの錐８語変更指令に基づいて預語辞書４３を検
索して、難易語抽出部４■こよ−・て抽出された難易語
や同音異義語を平易な単語に変更するのである。] The second difficulty/difficulty extraction unit 41 searches the thesaurus dictionary 43 and
Extract the pheasant word or homophone a1Mi that exists in the input sentence. The difficult word change section 42 searches the prophecy dictionary 43 based on the 8-word change command from the difficult word change control section 38, and extracts the difficult words and homophones from the difficult word extraction section 4. Change synonyms to simpler words.

ここで、上記類語辞書４３には難易語あるいは同音異義
語と同じ色味の平易な単語群か難易度に応じて分類され
て格納されている。そして、難易語あるいは同音異義語
の変更に際しては、難易語変更制御部３８から出力され
る難易語変更のレヘルを階定する制御指令に基づいて、
指定された難易語変更のレベルに合った難易度の単語が
平易な単語群の中から選択されるのである。Here, the synonym dictionary 43 stores simple words having the same color as difficult words or homophones, classified according to the difficulty level. When changing a difficult word or a homophone, based on a control command that determines the level of difficult word change output from the difficult word change control section 38,
Words with a difficulty level that matches the specified level of difficulty word change are selected from the group of easy words.

こうして、難易語や同音異義語が平易な単語に変更され
た文字記号列解析結果が合成音声パラメータ生成部３４
に送出される。一方、難易語変更を行わない場合（すな
わち、難易給変更制御部３８から難易語変更を行わない
制御指令から成る難易語変更指令が入力された場合）に
は、難易詔解析変更部３３は何も実行しない。したがっ
て、文字記号列解析部３２におけろ文字記号列解析結果
かそのまま合成音声パラメータ生成部３４に送出される
のである。In this way, the character symbol string analysis results in which difficult words and homophones are changed to easy words are sent to the synthesized speech parameter generation unit 34.
sent to. On the other hand, when a difficult word is not changed (that is, when a difficult word change command consisting of a control command that does not change a difficult word is input from the difficult edict change control section 38), what does the difficult edict analysis change section 33 do? is not executed either. Therefore, the character and symbol string analysis result in the character and symbol string analysis section 32 is directly sent to the synthesized speech parameter generation section .

上記合成音声パラメータ生成部３４は、韻律を制御する
ために、上記文字記号列解析部３２によって同定され、
さらに難易語変更制御部３８からＱ）難易語変更指令に
よって難易語や同音異義語が平易な単語に変更された各
単語のアクセントや構文構造により、単語が連鎖した際
の文節や呼気段落のアクセントやポーズの設定を行う。The synthesized speech parameter generation unit 34 is identified by the character symbol string analysis unit 32 in order to control prosody,
Furthermore, the accent and syntactic structure of each word in which a difficult word or homophone is changed to a simple word by Q) a difficult word change command from the difficult word change control unit 38, the accent of a clause or exhalation paragraph when words are chained. and pose settings.

また、さらに発声音声に対応した合成単位に対する継続
時間。Furthermore, the duration for the synthesis unit corresponding to the uttered voice.

ピッチパターン、パワーパターンおよび音韻特徴パラメ
ータ（偏自己相関係数、線スペクトル対、ホルマント等
）のパラメータ時系列を得る。そうすると、音声合成部
３５は上記音声合成用のパラメータ時系列に基づいて実
際の合成音声波形を生成して合成音声出力部３６から出
力するのである。Parameter time series of pitch patterns, power patterns, and phonetic feature parameters (partial autocorrelation coefficients, line spectrum pairs, formants, etc.) are obtained. Then, the speech synthesis section 35 generates an actual synthesized speech waveform based on the parameter time series for speech synthesis, and outputs it from the synthesized speech output section 36.

次に、上記難易語弊Ｆ′ｒ変更部３３において、難易語
や同音異義語が平易な単語に変更された場合における、
入力文章と出力音声内容との具体例を小す。Next, when a difficult word or homophone is changed into a simple word by the difficult word change unit 33,
A specific example of an input sentence and an output audio content is shown below.

（例文１）５１語を平易な単語に置き換える。(Example sentence 1) Replace 51 words with simple words.

入力文章科学技術の表肌モ進歩は、産業経済の発達を促進すると
共に、社会構造、制度にし艷外空影響を及ぼした。とり
わけ今日の社会は、情報及び、通信によって索革され、
情報化社会と１ｌｌＦ８ａ４゜■ 出力音声内容科学技術の３ｙｓ進歩は、産業経済の発達を」４進める
と共に、社会構造、制度にもさまざまな影響を及ぼした
。とりわけ今日の社会は、情報及び、通信によって１ｔ
）４１）、情報化社会とよばれる。The surface advances in input text science and technology have not only promoted the development of the industrial economy, but also had an enormous impact on social structures and institutions. In particular, today's society has been revolutionized by information and communication,
Information Society and 1llF8a4゜■ Output Speech Content Advances in science and technology have not only advanced the development of the industrial economy but also had various effects on social structures and institutions. In particular, in today's society, information and communication
)41) is called the information society.

（例文２）同音異義語を平易な単語で説明する。(Example sentence 2) Explain homophones using simple words.

入力文章著作者は、著作権を新色しており、他人が利用する場合
には事前に著作者の了解を得なければならない。The author of the input text has a new copyright, and when someone else uses the text, the author's consent must be obtained in advance.

出力音声内容著作者は、著作権を専有、独りで所有しており、他人が
利用する場合には事前に著作者の了解を得なければなら
ない。The author of the output audio content exclusively owns the copyright, and if another person uses the content, the author's consent must be obtained in advance.

（例文３）同音異義語を平易な単語で説明する。(Example Sentence 3) Explain homophones using simple words.

入力文章今回の提案は試論をまとめたものである。input text This proposal is a compilation of trial theory.

出力音声内容今回の提案は試論、試みに述べたものをまとめたもので
ある。Output audio content This proposal is a compilation of what was discussed in the preliminary discussion and trial.

上述の３つの例文に見られるように、本実施例において
は、難易語抽出部４１で抽出された難易詔や同義語が長足→速い促進する一′押し進める種々の−さまざまな変革され一変わり呼称される→よばれる専有−独りで所有試論−・試みに述べたしののように、指定された難易語変更のレベルに合った難易
度における平易な単語に変更されて合成音声によって出
力されるので、聞く人には柔らかく感しられて非常に聞
き易いのである。また、間違った意味に取られる恐れが
ないので、十分に情報の内容が聞き手に伝達されるので
ある。As can be seen in the above three example sentences, in this embodiment, the difficult edicts and synonyms extracted by the difficult word extraction unit 41 are changed to various changes such as long-leg → fast-promoting 1' pushing forward. → Proprietary to be called - Possessed alone - As in the case of the experiment, the words are changed to simple words at a difficulty level that matches the level of the specified difficult word change and are output by synthesized speech. Therefore, it feels soft to the listener and is very easy to listen to. Also, the content of the information is sufficiently conveyed to the listener, as there is no risk of it being misinterpreted.

ここで、上記（例文２）および（例文３）における同音
異義語の場合には、単に同音異義語を平易な単語に置き
換えるのではなく、同音異義語の合成音声を発声した後
に同じ意味の他の平易な単語の合成音声で言い換えて説
明するようにしている。Here, in the case of homonyms in (Example Sentence 2) and (Example Sentence 3) above, instead of simply replacing the homonym with a plain word, instead of uttering the synthesized speech of the homonym and then replacing it with the same meaning. I try to paraphrase and explain using synthesized speech using simple words.

こうすることによって、聞き手は元の同音異義語を知る
ことができるので、入力文章の微妙なニュアンスをくみ
取ることができるのである。This allows the listener to know the original homonym and pick up on the subtle nuances of the input text.

このように、本実施例においては、制御信号入力部３７
からの制御信号に基づいて難易語変更制御部３８から種
々の難易語変更指令か難易語変更制御部３３に入力され
る。そうすると、難易語変更制御部３３は、入力された
文字記号列から難易語あるいは同音異義語を抽出し、上
記抽出された難易語あるいは同音＋Ａ義語と同し意味の
平易な単語群の中から、難易語変更制御部３８からの制
御指令に従って目的とするレベルの難易度の単語を選択
し、上記難易語をこの平易な単語に変更するか、あるい
は、同音異義語の後に平易な単語を挿入する。そして、
この難易給が平易な単語に変更された、あるいは、同音
異義語の後にその同音５ε義語に対応する平易な単語が
挿入された入力文字記号列に基づいて、音声を合成して
出力する。したがって、音声たけでは聞き取り取りにく
いような意味の難しい書き言葉の単語が入力文章に含ま
れていて乙、通常の会話で用いられるような平易な単語
から成る話し言葉の合成音声によって出力することがで
きる。In this way, in this embodiment, the control signal input section 37
Various difficult word change commands are input from the difficult word change control section 38 to the difficult word change control section 33 based on control signals from the difficult word change control section 38 . Then, the difficult word change control unit 33 extracts difficult words or homophones from the input character symbol string, and selects the difficult words or homophones from among the simple words having the same meaning as the extracted difficult words or homophones + A synonyms. , select a word with the target level of difficulty according to the control command from the difficult word change control unit 38, and change the difficult word to this easy word, or insert the easy word after the homophone. do. and,
Speech is synthesized and outputted based on the input character symbol string in which the difficult word is changed to a plain word or a plain word corresponding to the homophone 5ε synonym is inserted after a homophone. Therefore, even if the input text contains written words with difficult meanings that are difficult to understand with voice alone, it is possible to output a synthesized speech of spoken words consisting of simple words that are used in normal conversation.

すなわち、この発明のテキスト音声合成装置から発声さ
れる合成音声は、聞く人には柔らかく感じられて非常に
聞き易く、間違った意味に取られる恐れかなく、情報の
内容を十分に聞き手に伝達することができるのである。In other words, the synthesized speech uttered by the text-to-speech synthesizer of the present invention feels soft to the listener, is very easy to hear, and sufficiently conveys the content of the information to the listener without the risk of being misunderstood. It is possible.

上記実施例においては、入力文字記号列における難易語
を平易な単語に変更した文字記号列に基づいて音声を合
成するようにしている。しかしながら、この発明はこれ
幌限定されるものでははい。In the embodiment described above, speech is synthesized based on a character-symbol string in which difficult words in the input character-symbol string are changed to simple words. However, this invention is not limited to this.

すなわち、同音異義語の場合と同様に、難易語の後に上
記選択された平易な単語を挿入した文字記号列に基づい
て平易な単語で言い換えるように発声させてもよい。That is, as in the case of homophones, the user may be uttered so as to paraphrase the difficult word with a simple word based on a character symbol string in which the selected simple word is inserted after the difficult word.

〈発明の効果〉以上より明らかなように、この発明のテキスト音声合成
装置は、難易語解析変更部によって、文字記号列解析部
の解析結果に基づいて入力文章の中から難易語あるいは
同音異義語を抽出し、この抽出した難易語あるいは同音
異義語と同じ意味の平易な単語を選出し、この選出され
た平易な単語を用いて文字記号列を変更して合成音声パ
ラメータ生成部に出力するようにしたので、書き言葉の
文体の入力文章を分かり易い自然な話し言葉の文体によ
る合成音声に変更して出力できる。したかって、難易語
や同音異義語が間違った意味に聞き取られる恐れがない
。<Effects of the Invention> As is clear from the above, the text-to-speech synthesis device of the present invention uses the difficult word analysis and modification section to select difficult words or homophones from the input text based on the analysis results of the character symbol string analysis section. , select a simple word with the same meaning as the extracted difficult word or homophone, change the character symbol string using the selected simple word, and output it to the synthesized speech parameter generator. , it is possible to change an input sentence in a written style to synthesized speech in an easy-to-understand, natural spoken style and output it. Therefore, there is no risk that difficult words or homonyms will be interpreted as having the wrong meaning.

また、この発明のテキスト音声合成装置は、難易語変更
制御部によって、入力文章の文字記号列を平易な文章の
文字記号列に変更する際におけろ変更のレベルを指定す
る指定信号を上記難易語解析変更部に出力し、この難易
語解析変更部は、Ｌ記難易語変更制御部から上記指定信
号が入力された場合に、上記指定信号で指定された変更
レベルにある平易な単語を選出して入力文字記号列を変
更するようにしたので、難易語変更の実施／不実施を指
定でき、かつ、難易語変更のレベルを指定できる。Further, the text-to-speech synthesis device of the present invention uses the difficult word change control unit to send a designation signal specifying the level of change when changing a character symbol string of an input sentence to a character symbol string of a simple sentence. When the above specified signal is input from the L difficult word change control section, the difficult word analysis and change section selects a simple word at the change level specified by the above specified signal. Since the input character symbol string is changed by changing the input character symbol string, it is possible to specify whether or not to change the difficult word, and also to specify the level of the difficult word change.

[Brief explanation of drawings]

第１図はこの発明のテキスト音声合成装置における一実
施例のブロック図、第２図は第１図における難易語解析
変更部のさらに詳細なブロック図、第３図は従来のテキ
スト音声合成装置のブロック図、第４図は第１図および
第３図における文字記号列解析部のさらに詳細むブロッ
ク図である。３１・・文字記号列入力部、３２　・文字記号列解析部、３３・・難易語解析変更部、３４・・合成音声パラメータ生成部、３５・・音声合成部、　　　３６・・・合成音声出力部
、３７・・・制御信号入力部、３８・・・Ｎ異語変更制御部、４１・・・錐易語抽出部、　　　４２・・難易語変更部
、４３　類語辞書。FIG. 1 is a block diagram of an embodiment of the text-to-speech synthesis device of the present invention, FIG. 2 is a more detailed block diagram of the difficult word analysis and modification section in FIG. 1, and FIG. 3 is a block diagram of a conventional text-to-speech synthesis device. Block Diagram FIG. 4 is a more detailed block diagram of the character string analysis section in FIGS. 1 and 3. 31...Character symbol string input section, 32.Character symbol string analysis section, 33..Difficult word analysis change section, 34..Synthesized speech parameter generation section, 35..Speech synthesis section, 36..Synthesized speech output section , 37... Control signal input unit, 38... N foreign word change control unit, 41... Easy word extraction unit, 42... Difficult word change unit, 43 Thesaurus dictionary.

Claims

[Claims]

(1) The character symbol string analysis section analyzes the morphemes, syntax, meanings, etc. of the character symbol strings that make up the sentence, and the synthesized speech parameter generation section generates speech synthesis parameters according to the analysis results. In a text-to-speech synthesizer that generates and outputs synthesized speech in a speech synthesis section based on A simple word with the same meaning as the difficult word or homophone is selected, the input character string is changed using the selected simple word, and the changed character string is sent to the synthesized speech parameter generator. What is claimed is: 1. A text-to-speech synthesis device, comprising: a difficult word analysis/change unit that outputs a synthesized speech of an input sentence whose writing style is changed to include simple words.

(2) In the text-to-speech synthesis device according to claim 1, the designation signal specifying the level of change when changing the character symbol string of the input sentence to the character symbol string of a plain sentence is changed to the difficult word analysis change. and a difficult word change control section that outputs an output to the difficult word analysis and change section, when the difficult word analysis and change section receives the specified signal from the difficult word change control section, the difficult word change control section outputs the change specified by the specified signal to the difficult word analysis and change section. A text-to-speech synthesis device characterized in that a plain word at a certain level is selected and an input character symbol string is changed.