JPH01186998A

JPH01186998A - Sentence intonation processing method for voice synthesizing device

Info

Publication number: JPH01186998A
Application number: JP63011250A
Authority: JP
Inventors: Yoshimasa Sawada; 沢田　喜正; Norio Suda; 典雄須田
Original assignee: Meidensha Corp; Meidensha Electric Manufacturing Co Ltd
Current assignee: Meidensha Corp; Meidensha Electric Manufacturing Co Ltd
Priority date: 1988-01-21
Filing date: 1988-01-21
Publication date: 1989-07-26

Abstract

PURPOSE:To obtain a synthesized voice having an intonation suitable for a Japanese language by distributing a text to subjects and predicates the divide the text into clause units and determining a fundamental intonation and a word accent width in accordance with the number of clauses and positions of clauses with respect to each of subjects and predicates. CONSTITUTION:A sentence analyzer 5b refers to a grammar dictionary 6 and a word dictionary 7 to divide a Japanese-language input text into clause units and distributes respective clauses to subjects and predicates and determines a fundamental intonation set in a table 8 in accordance with number of clauses and clause positions with respect to each of subjects and predicates and determines a word accent width set in a table 9 in accordance with the number of clauses and clause positions with respect to each of subjects and predicates. These fundamental intonations and word accents to which word accent widths are given are superposed to obtain a sentence intonation, thereby obtaining a sentence intonation in accordance with fundamental intonation patterns and accent widths given to word accent positions. Thus, metrical information suitable for the Japanese language is obtained.

Description

【発明の詳細な説明】Ａ、産業上の利用分野本発明は、規則合成方式による音声合成装置に係り、特
に文イントネーションの処理方式に関する。DETAILED DESCRIPTION OF THE INVENTION A. Field of Industrial Application The present invention relates to a speech synthesis device using a rule synthesis method, and particularly to a processing method for sentence intonation.

Ｂ１発明の概要本発明は、テキストの韻律情報を文章解析から得るにお
いて、テキストを主部と述部に振り分けて文節単位に区切り、
主部と述部別に文節数と文節位置から基本イントネーシ
ョンと単語アクセント幅を決定することにより、辞書容量の低減と文イントネーション処理の簡単化を図
りながら適切なイントネーションの合成音声が得られる
ようにしたものである。B1 Summary of the Invention The present invention, in obtaining prosodic information of a text from sentence analysis, divides the text into a main part and a predicate part and divides it into clause units,
By determining the basic intonation and word accent width based on the number of clauses and clause positions for each subject and predicate, it is possible to obtain synthesized speech with appropriate intonation while reducing dictionary capacity and simplifying sentence intonation processing. It is something.

Ｃ１従来の技術規則合成方法による音声合成装置は、例えば第３図に示
す構成にされる。文章解析部ｌは日本語入力テキストの
文字列に対して辞書１ａと文章解析装置１ｂによる文章
解析を行う。辞書１＆には単語の読みがな変換のための
辞書のほかに単語の文節区切１句区切等のための日本語
文法辞書を有し、さらには単語のアクセントや基本イン
トネーションの規則辞書を有する。文章解析装置１ｂは
辞書１ａを参照して入力テキストを音素あるいは音節の
音韻記号列に変換すると共に、単語アクセントや基本イ
ントネーション等の韻律情報を発生する。C1 A speech synthesis apparatus based on the conventional technical rule synthesis method has, for example, the configuration shown in FIG. The text analysis unit 1 performs text analysis on a character string of Japanese input text using a dictionary 1a and a text analysis device 1b. In addition to a dictionary for converting the pronunciation of words, the dictionary 1& has a Japanese grammar dictionary for dividing words into clauses and phrases, and also has a dictionary of rules for word accents and basic intonation. The text analysis device 1b converts the input text into a phoneme symbol string of phonemes or syllables by referring to the dictionary 1a, and also generates prosodic information such as word accents and basic intonation.

音声合成規則部２は、ファイル２ａとパラメータ生成装
置２ｂによって構成される。ファイル２ａは音韻単位の
特徴パラメータとそれらの接続規則及び韻律情報の制御
規則を蓄積しておく。パラメータ生成装置２ｂは音韻情
報に対する特徴パラメータをその接続時間等の情報と共
に連結した制御パラメータ列を生成すると共に、韻律情
報による音源のピッチ、エネルギー、イントネーション
処理を施した音源パターン列を生成する。The speech synthesis rule section 2 is composed of a file 2a and a parameter generation device 2b. The file 2a stores feature parameters of phoneme units, their connection rules, and prosodic information control rules. The parameter generation device 2b generates a control parameter sequence in which feature parameters for phoneme information are linked together with information such as connection time, and also generates a sound source pattern sequence in which pitch, energy, and intonation processing of the sound source is processed based on prosody information.

音声生成部３は、音源生成装置３ａと音声合成ディジタ
ルファイル３ｂと音声変換器３ｃとによつて構成される
。音源生成装置３ａは、音源パターン列に従ったピッチ
、エネルギー等の音源信号を発生する。ディジタルフィ
ルタ３ｂは制御パラメータ列に従ってパーコール係数や
伝達関数又はフォルマント周波数のパラメータか変えら
れ、このパラメータでの音源信号に対する応答出力に合
成音声データ列を得る。音声変換器３ｃはフィルタ３ｂ
の出力をアナログ信号に変換して音声波形を得、スピー
カ等の電気−前変換手段による合成音声を出力する。The speech generation section 3 is composed of a sound source generation device 3a, a speech synthesis digital file 3b, and a speech converter 3c. The sound source generation device 3a generates sound source signals such as pitch and energy according to the sound source pattern sequence. The digital filter 3b changes the parameters of the Percoll coefficient, transfer function, or formant frequency according to the control parameter string, and obtains a synthesized speech data string as a response output to the sound source signal with these parameters. The audio converter 3c is a filter 3b
The output of the converter is converted into an analog signal to obtain an audio waveform, and synthesized audio is output by an electrical pre-conversion means such as a speaker.

上述のような音声合成装置において、韻律情報は、文章
データに対して基本イントネーション、単語アクセント
、ストレス、ポーズ、継続時間等の組み合わせで１つの
文イントネーションとして作成される。この文イントネ
ーションの作成処理には、日本語入力テキストを辞書１
ａを参照した構文解析による文節区切９句区切１文区切
、形態素（言語的に意味を持つ最小の単位）分類等によ
って単語の系列区分化と単語境界を付した表音列に変換
する。この表音列情報に対して、句読点単位の文節（モ
ーラ）数から求められる基本イントネーション、単語単
位のアクセント、句単位のポーズ、接頭語や接尾辞等か
ら求められる単語中のストレス、音素単位の継続時間等
が決定される。In the speech synthesis apparatus as described above, prosody information is created as one sentence intonation by combining basic intonation, word accent, stress, pause, duration, etc. for sentence data. To create this sentence intonation, input the Japanese input text into the dictionary 1.
It is converted into a phonetic sequence with word series segmentation and word boundaries added by segmentation into nine phrases and one sentence, morpheme (the smallest unit that has linguistic meaning) classification, etc. by syntactic analysis with reference to a. For this phonetic string information, basic intonation determined from the number of clauses (mora) in punctuation mark units, accents in word units, pauses in phrase units, stress in words determined from prefixes and suffixes, and phoneme unit The duration time etc. are determined.

Ｄ１発明が解決しようとする問題点従来の文イントネーション処理方法では、構文解析と単
語又は音節さらには音素単位を使った基本イントネーシ
ョンやアクセント決定がなされるため、解析項目数が膨
大になるし、入力テキストの長短やモー才数による多く
のアクセント型など処理項目数も膨大になり、複雑な処
理を必要とする問題があった。D1 Problems to be Solved by the Invention In conventional sentence intonation processing methods, basic intonation and accent are determined using syntactic analysis and words, syllables, and even phoneme units, resulting in a huge number of analysis items and input The number of items to be processed, such as the length of the text and the number of accent types depending on the number of characters, is enormous, creating the problem of requiring complex processing.

また、従来の処理方法を細密に実行しようとするには単
語アクセントのみに限らずアクセント幅（音節数）も決
定しなければならず、日本語の複雑多岐になる形態の複
雑性から処理が一層複雑になる問題があった。In addition, in order to perform the conventional processing method in detail, it is necessary to determine not only the word accent but also the accent width (number of syllables). There were complications.

本発明の目的は、処理を複雑にすることなく、基本イン
トネーション及び単語アクセント幅まで規定できる処理
方法を提供するにある。An object of the present invention is to provide a processing method that can define basic intonation and word accent width without complicating the processing.

６１課題を解決するための手段と作用本発明は上記目的を達成するために、日本語入力テキス
トを文節単位に区切り、各文節を主部と述部に振り分け
、この主部と述部別に夫々文節数と文節位置からテーブ
ルに定める基本イントネーションを決定し、かつ主部と
述部別に夫々文節数と文節位置からテーブルに定める単
語アクセント幅を決定し、これら基本イントネーション
と単語アクセント幅を与えた単語アクセントを重量して
文イントネーションを求めることにより、文節数と文節
位置という少ない組み合わせの基本イントネーションパ
ターン及び単語アクセント位置に加えるアクセント幅か
ら文イントネーションを求め、主部と述部の区切りによ
る日本語特有の韻律との適合を図る。61 Means and Effects for Solving the Problems In order to achieve the above object, the present invention divides the Japanese input text into clauses, divides each clause into a main part and a predicate, and divides each clause into a main part and a predicate. Determine the basic intonation set in the table from the number of clauses and clause position, and determine the word accent width set in the table from the number of clauses and clause position for the subject and predicate, respectively, and words given these basic intonation and word accent width. By calculating the sentence intonation by weighting the accent, we can calculate the sentence intonation from the basic intonation pattern of small combinations of the number of clauses and the position of the clause, and the accent width added to the word accent position. Try to match the prosody.

Ｆ、実施例第１図は本発明の一実施例を示す文章解析部のブロック
図である。辞書５ａは日本語文法辞書６や単語辞書７（
アクセント、持続時間）のほかに、文章の主部と述部の
イントネーションテーブル８及び単語アクセント幅設定
テーブル９を備える。F. Embodiment FIG. 1 is a block diagram of a text analysis section showing an embodiment of the present invention. The dictionary 5a is a Japanese grammar dictionary 6 and a word dictionary 7 (
(accent, duration), an intonation table 8 for the main part and predicate of a sentence, and a word accent width setting table 9.

文章解析装置５ｂは、日本語入力テキストに対して、従
来と同様に文法辞書６及び単語辞書７を参照して単語の
読みがな変換、文節区切１句区切。The sentence analysis device 5b refers to the grammar dictionary 6 and the word dictionary 7 to convert the readings of the words and divides the Japanese text into phrases and phrases, as in the past.

単語アクセント位置等の文章解析を行う。この文節区切
及び句区切において、文章解析装置５ｂは文節境界を求
め、さらに文節を助詞の違いにより主部と述部に区切っ
た主部・述部境界を求める。Analyzes sentences such as word accent positions. At this clause break and phrase break, the sentence analysis device 5b finds a clause boundary, and further finds a subject/predicate boundary where the clause is divided into a subject and a predicate based on the difference in particles.

このようにして求められた文章の主部と述部に対して、
文章解析装置５ｂは主部及び述部単位でその文節数と文
節位置からイントネーションテーブル８を参照して基本
イントネーションを決定する。For the subject and predicate of the sentence obtained in this way,
The sentence analysis device 5b determines the basic intonation for each subject and predicate by referring to the intonation table 8 based on the number of clauses and the clause position.

また、文章解析装置５ｂは、主部及び述部に含まれる単
語のアクセント幅を主部及び述部の文節数と文節位置か
らアクセント幅設定テーブル９を参照して決定する。Furthermore, the sentence analysis device 5b determines the accent width of the word included in the subject and predicate based on the number of clauses and the clause position of the subject and predicate by referring to the accent width setting table 9.

このような文章解析装置５ｂにより、文イントネーショ
ンが決定され、即ち文章の主部と述部及び夫々に含まれ
る文節数と文節位置から決定され、この文イントネーシ
ジンにポーズ等を付加して韻律情報が作成される。なお
、韻律情報の作成は従来と同様に行われる。Sentence intonation is determined by such a sentence analysis device 5b, that is, it is determined from the main part and predicate of the sentence, the number of phrases included in each, and the phrase position, and the sentence intonation is determined by adding pauses etc. Information is created. Note that the creation of prosody information is performed in the same manner as before.

以下、本実施例を具体的に説明する。This example will be explained in detail below.

イントネーションテーブル８及び単語アクセント幅設定
テーブル９は以下の表に示すように主部及び述部別にそ
の文節数と文節位置の組み合わせに応じたパターンの基
本イントネーション及び単語アクセント幅が設定される
。In the intonation table 8 and the word accent width setting table 9, the basic intonation and word accent width of the pattern are set according to the combination of the number of phrases and the position of phrases for each subject and predicate, as shown in the table below.

イントネーションテーブル単語アクセント設定テーブル例えば、基本イントネーションは、主部の文節数ｎで該
主部の文節位置がｎにあれば抑揚パターンにＰ、設定さ
れて、該パターンＰ−による基本イントネーションと決
定される。また、単語アクセント幅は当該単語が述部に
属し、文節数ｍになる文節に含まれて文節位置１にある
ときにアクセント幅Ｂｍｌに設定されてアクセント位置
における該アクセント幅Ｂ、による単語アクセントと決
定される。Intonation Table Word Accent Setting Table For example, if the basic intonation is the number of clauses in the main part n and the clause position of the main part is in n, the intonation pattern is set to P, and the basic intonation is determined by the pattern P-. . In addition, the word accent width is set to the accent width Bml when the word belongs to a predicate, is included in a clause with the number of clauses m, and is in clause position 1, and the word accent is set to the accent width B at the accent position. It is determined.

第２図は日本語入力テキストが「学校の桜がきれいにさ
いた。」にあるときに基本イントネーションと単語アク
セント幅から文イントネーションを求めた例を示す。基
本イントネーションには主部の文節数２で文節位置１に
なるパターンｐａｔと、述部の文節数２で文節位置２に
なるパターンＱ□をテーブルから読出して両パターンを
結合した基本イントネーションを得る。単語アクセント
には主部の単語「かっこう」が主部の文節数２で文節位
置１になるアクセント幅Ａ！ｌを、単語「さくら」が文
節数２で文節位置２のアクセント幅Ａ、を、述部の単語
「きれいに」が文節数２で文節位置１のアクセント幅Ｂ
□を、単語「さく」が文節数２で文節位置２のアクセン
ト幅Ｂ、を得る。これらパターンと単語アクセント位置
でのアクセント幅が重畳され、さらに滑らかさを与える
フィルタ処理によって図示のような文イントネーション
が決定される。Figure 2 shows an example in which the sentence intonation is calculated from the basic intonation and word accent width when the Japanese input text is ``The cherry blossoms at school bloomed beautifully.'' For the basic intonation, a pattern pat in which the number of clauses in the main part is 2 and the clause position is 1, and a pattern Q□ in which the number of clauses in the predicate is 2 and the clause position is 2 are read out from the table, and the basic intonation is obtained by combining both patterns. The word accent has an accent width of A where the main part of the word ``kakkou'' has 2 clauses in the main part and 1 clause position! l, the word ``Sakura'' has 2 clauses and the accent width is A in clause position 2, and the predicate word ``Kirei ni'' has 2 clauses and the accent width is B in clause position 1.
□, the word "saku" has the number of clauses 2 and the accent width B at the clause position 2 is obtained. These patterns and the accent width at the word accent position are superimposed, and the sentence intonation as shown in the figure is determined by filter processing that provides further smoothness.

従って、文イントネーション決定には日本語入力テキス
トの主部と述部への分解と主部及び述部毎の文節数と文
節位置からテーブルを使って基本イントネーションが決
められ、同様に単語アクセントもアクセント位置に対し
て主部及び述部毎の単語の文節数と文節位置からテーブ
ルを使ってアクセント幅が決められる。このため、文イ
ントネーション処理に従来の音節及び音素単位の単語ア
クセント決定など解析項目数を多くする処理を不要にし
、少ない基本イントネーション及び単語アクセント幅に
よる簡単な処理になる。また、複雑多岐になる日本語の
文イントネーション処理にも主部と述部の区別と文節数
と文節位置の区別に基づく基本イントネーションとアク
セント位置に加える単語アクセント幅により作成する文
イントネーションは日本語の韻律に適合するもので、自
然な合成音声を得ることができる。Therefore, to determine sentence intonation, the basic intonation is determined using a table based on the decomposition of the Japanese input text into the subject and predicate, and the number and position of clauses for each subject and predicate. Similarly, word accents are also accented. The accent width is determined using a table based on the number of clauses of the word for each subject and predicate and the clause position. Therefore, the conventional process of increasing the number of analysis items such as word accent determination in syllable and phoneme units is unnecessary for sentence intonation processing, and the process becomes simple with a small number of basic intonations and word accent widths. In addition, for Japanese sentence intonation processing, which is complex and diverse, we can create sentence intonation based on the basic intonation based on the distinction between subject and predicate, the number of clauses, and the clause position, and the word accent width added to the accent position. It is suitable for prosody and can produce natural synthesized speech.

Ｇ１発明の効果以上のとおり、本発明によれば、日本語入力テキストを
文節単位で主部と述部に振分け、夫々文節数と文節位置
により定める基本イントネーションパターン及び単語ア
クセント幅を決定した文イントネーションを得るように
したため、文イントネーション処理を簡単にしながら日
本語に適合した韻律情報を得ることができる効果がある
。G1 Effects of the Invention As described above, according to the present invention, a Japanese input text is divided into a subject part and a predicate part in units of clauses, and the basic intonation pattern and word accent width are determined based on the number of clauses and the clause position, respectively. This has the effect of making it possible to obtain prosodic information suitable for Japanese while simplifying sentence intonation processing.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示すブロック図、第２図は
実施例のイントネーション波形を例示する図、第３図は
規則合成方式による音声合成装置のブロック図である。５ａ・・・辞書、５ｂ・・・文章解析装置、６・・・文
法辞書、７・・・単語辞書、８・・・イントネーション
テーブル、９・・・単語アクセント設定テーブル。第１図実施例のブロック図５ｉ・・・辞書５ｂ・・・文章解析装置６・・・文法辞書７・・・単語辞書８・・・イントネーションテーブル９・・・単語アクセント設定テーブル第２図実施例のイントネーション波形図ｒ−一一一主部一一一一一入　　　　　　　！“−＼−
−−時　間FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is a diagram illustrating intonation waveforms of the embodiment, and FIG. 3 is a block diagram of a speech synthesis apparatus using a rule synthesis method. 5a...Dictionary, 5b...Sentence analysis device, 6...Grammar dictionary, 7...Word dictionary, 8...Intonation table, 9...Word accent setting table. Fig. 1 Block diagram of the embodiment 5i Dictionary 5b Sentence analysis device 6 Grammar dictionary 7 Word dictionary 8 Intonation table 9 Word accent setting table Fig. 2 Implementation Example intonation waveform diagram r-111 main part 11111 entered! “−＼−
--time

Claims

[Claims]

(1) Obtain phonological information and prosody information from the Japanese input text, obtain the control parameters of the digital filter corresponding to the phonological information and the sound source pattern of the sound source generator corresponding to the prosody information, and generate the audio signal corresponding to the text. In a speech synthesis device using a rule synthesis method, the Japanese input text is divided into clauses, each clause is divided into a main part and a predicate, and the basics are defined in a table based on the number of clauses and the clause position for each main part and predicate. Determine the intonation, determine the word accent width set in the table from the number of clauses and clause position for each subject and predicate, and superimpose these basic intonations and word accents given the word accent width to obtain the sentence intonation. A sentence intonation processing method for a speech synthesizer, characterized by: