JP2001034285A

JP2001034285A - Model sentence table database for insert voice synthesis and method for generating model sentence table

Info

Publication number: JP2001034285A
Application number: JP11203238A
Authority: JP
Inventors: Shu Ozaki; 周尾崎; Shoji Fujimoto; 昇治藤本; Shinichi Nakaishi; 信一中石; Yoshitaka Hirashima; 善隆平嶋
Original assignee: Denso Ten Ltd
Current assignee: Denso Ten Ltd
Priority date: 1999-07-16
Filing date: 1999-07-16
Publication date: 2001-02-09

Abstract

PROBLEM TO BE SOLVED: To make retrieval easy at the time of insert synthesis by generating a model sentence table database where meter data of a generated model sentence table are discriminatingly stored by uses or parts of speech. SOLUTION: The voice that, for example, an announcer speaks is recorded, the recorded voice is converted by an A/D converter 11, and the voice waveform converted by an intermediate data generation part 12 is viewed and sectioned by phonemes, labeled, and marked with synchronizing points by cycles of the voice waveform. A meter data extraction part 13 extracts meter data from the data marked with the synchronizing points by cycles of the voice waveform to generate a model sentence table. The generated model sentence table is adjusted by a model sentence table adjustment part 14 to obtain a model sentence table. Then the model sentence table which is generated and adjusted is stored to structure a model sentence database 15. The meter data are stored in the database 15 discriminatingly by usages and parts of speech, so retrieval is made easy at the time of insert synthesis.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明ははめ込み音声合成に
用いる文例テーブルデータベース、及び該文例テーブル
データベースを用いて新しく文例テーブルを生成する方
法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a sentence example table database used for in-speech synthesis and a method for generating a new example table using the sentence example table database.

【０００２】[0002]

【従来の技術】図１は文例テーブルを用いたテキスト音
声合成システムのフローチャートの一例を示した図であ
る。図において、漢字仮名混じり文の文字列である日本
語テキストを言語処理する（Ｓ１）。ここではテキスト
を解析して単語に分割し、単語の読み、及びアクセント
を付与し、アクセント付読み仮名の文字列に変換する。
例えば、テキストが「本日は晴天なり」であったとする
と、「ホ′ンシ〃ツハセ′イテンナリ」に変換する。そ
の際は言語辞書Ａを用いる。次に、上記文字列に対応す
る韻律データが文例テーブルデータベースＢにあるかど
うか判断する（Ｓ２）。韻律データは文字列にイントネ
ーションを付与するためのデータである。文字列に対応
する韻律データが文例テーブルデータベースＢにあれば
（Ｙｅｓ）、この韻律データに上記アクセント付読み仮
名の文字列をはめ込み、はめ込み合成が行われる（Ｓ
３）。一方、韻律データがなければ（Ｎｏ）、規則合成
が行われる（Ｓ４）。即ち、前記アクセント付読み仮名
に基づいて、韻律規則による継続時間長、アクセント、
イントネーション、ポーズなどを決定する。そして、い
ずれの場合も波形辞書Ｃを用いて波形合成が行われ（Ｓ
５）、合成された音声が出力される。2. Description of the Related Art FIG. 1 shows an example of a flowchart of a text-to-speech synthesis system using a sentence example table. In the figure, a Japanese text which is a character string of a sentence mixed with kanji and kana is subjected to language processing (S1). Here, the text is analyzed and divided into words, the pronunciation of the words and accents are given, and the words are converted into character strings of accented pronunciations.
For example, if the text is “Today is fine weather”, the text is converted to “Honshutsuhase” Itinari. At that time, the language dictionary A is used. Next, it is determined whether the prosody data corresponding to the character string exists in the sentence example table database B (S2). Prosody data is data for adding intonation to a character string. If the prosody data corresponding to the character string is in the sentence example table database B (Yes), the character string of the accented reading kana is inserted into this prosody data, and the inset synthesis is performed (S).
3). On the other hand, if there is no prosody data (No), rule synthesis is performed (S4). That is, based on the accented reading kana, the duration of the prosodic rule, the accent,
Determine intonation, poses, etc. In each case, waveform synthesis is performed using the waveform dictionary C (S
5) The synthesized voice is output.

【０００３】[0003]

【発明が解決しようとする課題】上記システムにおい
て、はめ込み合成のために文例テーブルデータベースＢ
が用いられている。文例テーブルを生成する場合、例え
ばアナウンサーの発声した音声を収録し、収録した音声
をＡ／Ｄ変換し、変換したデータから音声波形を見て音
素毎に区切り、ラベル付けして音声波形１周期毎に同期
点をマークする。そして、これらラベル付けし同期点を
マークしたデータから韻律データ、即ち、文字列にイン
トネーションを付与するデータを抽出して文例テーブル
を生成し、これを格納して文例テーブルデータベースと
している。In the above system, the sentence example table database B is used for the inset synthesis.
Is used. When generating a sentence example table, for example, a voice uttered by an announcer is recorded, A / D conversion is performed on the recorded voice, a voice waveform is divided from the converted data into phonemes, labeled, and labeled for each cycle of the voice waveform. Mark a sync point. Then, prosody data, that is, data for giving the intonation to the character string is extracted from the data labeled and marked with the synchronization point to generate a sentence example table, which is stored as a sentence example table database.

【０００４】しかし、上記のように文例テーブルを生成
するには非常に手間がかかる。また、文例テーブルは一
文あるいは複数文の文単位、または文を構成する要素の
韻律データで構成されており、はめ込み合成するときに
文例テーブルから韻律データを検索するため時間がかか
る。また、新しく文章を作成し、追加し、または変更す
るときには文例テーブルを生成し直す必要がある。[0004] However, as described above, it takes a great deal of time to generate a sentence example table. The sentence example table is composed of one sentence or a plurality of sentence units, or prosody data of elements constituting the sentence, and it takes time to search for the prosody data from the sentence example table when performing inlay synthesis. When a new text is created, added, or changed, it is necessary to regenerate the text example table.

【０００５】[0005]

【課題を解決するための手段】上記課題を解決するた
め、本発明では生成した文例テーブルの韻律データを用
途別あるいは品詞別に区分して格納した文例テーブルデ
ータベースを作成し、はめ込み音声合成するときは文例
テーブルデータベースに区分して格納された韻律データ
から検索できるようにする。In order to solve the above problems, according to the present invention, a sentence example table database in which prosody data of a created example example table is stored for each use or part of speech is created and embedded speech synthesis is performed. A search can be performed from prosodic data stored separately in a sentence example table database.

【０００６】また、新しく文例テーブルを生成する場合
には、すでに区分して格納されている韻律データを用
い、これらを組み合わせて生成できるようにする。Further, when a new sentence example table is generated, prosody data that has already been divided and stored is used, and these can be generated in combination.

【０００７】[0007]

【発明の実施の形態】図２は文例テーブルを生成するた
めの構成を示した図である。まず、例えばアナウンサー
の発声した音声を収録し、収録した音声をＡ／Ｄ変換器
１１でＡ／Ｄ変換し、中間データ生成部１２で変換した
音声波形を見て音素毎に区切りを入れ、ラベル付けして
音声波形１周期毎に同期点をマークする。そして韻律デ
ータ抽出部１３においてこれらラベル付けして音声波形
１周期毎に同期点をマークしたデータから韻律データを
抽出し、文例テーブルを生成する。次に生成された文例
テーブルを文例テーブル調整部１４で調整し文例テーブ
ルとする。そして、上記生成し調整した文例テーブルを
格納して文例テーブルデータベース１５とする。FIG. 2 is a diagram showing a configuration for generating a sentence example table. First, for example, a voice uttered by an announcer is recorded, the recorded voice is A / D-converted by an A / D converter 11, a voice waveform converted by the intermediate data generation unit 12 is viewed, and a break is inserted for each phoneme. And a synchronization point is marked for each period of the audio waveform. Then, the prosody data extracting unit 13 extracts prosody data from the data with the labels and the synchronization point marked for each period of the audio waveform, and generates a sentence example table. Next, the generated sentence example table is adjusted by the sentence example table adjustment unit 14 to obtain a sentence example table. The generated and adjusted sentence example table is stored as a sentence example table database 15.

【０００８】図３は、本発明による文例テーブルデータ
ベースの実施形態を示す図である。例えば、交通情報に
おいては以下のような文章がよく使われる。「東名高速道路の名古屋方面は伊勢原を先頭に２０ｋｍ
の渋滞」上記文章において、「東名高速道路」は道路名であり、
「名古屋方面」は方向であり、「伊勢原」は地名であ
り、「２０ｋｍ」は距離であり、「渋滞」は状況であ
る。FIG. 3 is a diagram showing an embodiment of a sentence example table database according to the present invention. For example, the following sentences are often used in traffic information. "The Tomei Expressway bound for Nagoya is 20km ahead of Isehara
In a traffic jam, "the sentence," Tomei Expressway "is a street name,
“Nagoya” is a direction, “Isehara” is a place name, “20 km” is a distance, and “traffic jam” is a situation.

【０００９】また、報道においては以下のような文章が
よく使われる。「○○氏は１１時にロンドンに到着した。」上記文章において、「○○氏」は人名であり、「１１
時」は日付であり、「ロンドン」は地名であり、「到着
した」は状況である。このように文章には「人名」、
「地名」、「時間」、「距離」、「状況」など、用途に
応じた単語が含まれ、それぞれに韻律データが生成され
ている。The following sentences are often used in news reports. "○○ Mr. arrived in London to 11 o'clock." In the above sentence, "○○ Mr." is a person's name, "11
"Time" is a date, "London" is a place name, and "Arrival" is a situation. In this way, the sentence "person name",
Words such as "place name", "time", "distance", and "situation" are included according to the purpose, and prosody data is generated for each.

【００１０】そこで、実施形態では図３に示すように
「人名」、「地名」、「時間」、「距離」、「状況」な
ど、用途別に韻律データを格納した文例テーブルデータ
ベースを構成した。図４は、本発明による文例テーブル
データベースの別の実施形態を示す図である。例えば、
交通情報においては以下のような文章がよく使われる。Therefore, in the embodiment, as shown in FIG. 3, a sentence example table database storing prosody data such as "person name", "place name", "time", "distance", and "situation" is constructed. FIG. 4 is a diagram showing another embodiment of the sentence example table database according to the present invention. For example,
The following sentences are often used in traffic information.

【００１１】「湾岸線は５ｋｍ渋滞です。」上記文章において、「は」は助詞であり、「です」は助
動詞である。これらは「湾岸線は」あるいは「渋滞で
す」に付与された韻律データの末尾に現れる品詞であ
る。そこで、本実施形態では図４に示すようにこれら韻
律データの末尾に現れる品詞の韻律データ毎にまとめて
格納し、文例テーブルデータベースを構成する。[0011] "Gulf line is 5km traffic jam." In the above sentence, "is" is a particle, "it is" is an auxiliary verb. These are "Gulf line" or the "traffic jam
This is the part of speech that appears at the end of the prosody data assigned to "". Therefore, in the present embodiment, as shown in FIG. 4, the prosody data of the parts of speech appearing at the end of these prosody data are collectively stored and constitute a sentence example table database.

【００１２】図５は、本発明による文例テーブルデータ
ベースの別の実施形態を示す図である。図４に示した実
施形態では、韻律データの末尾に現れる品詞の韻律デー
タ毎にまとめて格納したが、本実施形態では「助詞」を
さらにアクセントが同じパターンの助詞毎に区分して格
納するものである。例えば、図５に示されているよう
に、「が」、「と」、「に」等は区分Ａに、「の」は区
分Ｂに、「ね」、「かな」は区分Ｃにそれぞれ格納す
る。FIG. 5 is a diagram showing another embodiment of the sentence example table database according to the present invention. In the embodiment shown in FIG. 4, the prosodic data of the parts of speech appearing at the end of the prosodic data are stored collectively, but in the present embodiment, the "particles" are further stored separately for each particle having the same accent pattern. It is. For example, as shown in FIG. 5, “ga”, “to”, “ni”, etc. are stored in section A, “no” is stored in section B, “ne” and “kana” are stored in section C, respectively. I do.

【００１３】図６は、本発明による文例テーブルデータ
ベースの別の実施形態を示す図である。図５に示した実
施形態では、「助詞」をアクセントが同じパターンの助
詞に区分して韻律データを格納したが、本実施形態では
「助動詞」をアクセントが同じパターンの助動詞に区分
して格納するものである。例えば、図６に示されている
ように、「だ」は区分ａに、「です」、「みたい」は区
分ｂに、「う」、「よう」等は区分ｃにそれぞれ格納す
る。FIG. 6 is a diagram showing another embodiment of the sentence example table database according to the present invention. In the embodiment illustrated in FIG. 5, the prosody data is stored by dividing “particles” into particles having the same accent pattern. In the present embodiment, “auxiliary particles” are stored by dividing into auxiliary particles having the same accent pattern. Things. For example, as shown in FIG. 6, "da" is stored in section a, "is" and "look" are stored in section b, and "u" and "you" are stored in section c.

【００１４】図７は、本発明による文例テーブルデータ
ベースの別の実施形態を示す図である。図５及び図６に
示した実施形態では、「助詞」または「助動詞」をアク
セントが同じパターンに区分して韻律データを格納して
いるが、本実施形態では図７に示すように「助詞」及び
「助動詞」をアクセントが同じパターンに区分して韻律
データを格納するものである。FIG. 7 is a diagram showing another embodiment of the sentence example table database according to the present invention. In the embodiment shown in FIG. 5 and FIG. 6, the prosody data is stored by dividing the “particles” or “auxiliary verbs” into patterns having the same accent, but in the present embodiment, as shown in FIG. And "auxiliary verbs" are classified into patterns having the same accent and store prosody data.

【００１５】図８は、本発明による文例テーブルデータ
ベースの別の実施形態を示す図である。本実施形態は、
韻律データの用途別区分と品詞別区分を組み合わせたも
のである。図８に示すように、「人名」、「地名」、
「時間」、「距離」、「状況」など、用途別に区分した
ものを、「助詞」及び「助動詞」に区分したものと組み
合わせた韻律データを格納する。例えば、「人名」と
「助詞」を組み合わせて「○○氏は」に対応する韻律デ
ータを格納する。「状況」と「助動詞」を組み合わせて
「渋滞です」対応する韻律データを格納する。FIG. 8 is a diagram showing another embodiment of the sentence example table database according to the present invention. In this embodiment,
This is a combination of the classification of prosody data by use and the classification by part of speech. As shown in FIG. 8, "person name", "place name",
The prosody data in which data classified by use such as “time”, “distance”, and “situation” are combined with data classified into “particles” and “auxiliary verbs” are stored. For example, the prosody data corresponding to “Mr. XX” is stored by combining “person name” and “particles”. The prosody data corresponding to "congestion" is stored by combining "situation" and "auxiliary verb".

【００１６】図９は、本発明による文例テーブルデータ
ベースの別の実施形態を示す図である。図８に示した実
施形態では、韻律データの用途別区分と品詞別区分を組
み合わせて格納しているが、本実施形態では図９に示す
ように、「人名」、「地名」、「時間」、「距離」な
ど、用途別に区分したものに、「助詞」をアクセントが
同じパターンに区分したものと組み合わせた韻律データ
を格納する。FIG. 9 is a diagram showing another embodiment of the sentence example table database according to the present invention. In the embodiment shown in FIG. 8, the classification of the prosody data by use and the classification by part of speech are stored in combination, but in this embodiment, as shown in FIG. 9, "person name", "place name", "time" , Prosodic data in which “particles” are classified into patterns having the same accents are stored in data classified according to uses such as “distance”.

【００１７】図１０は、本発明による文例テーブルデー
タベースの別の実施形態を示す図である。本実施形態で
は図９に示すように、「人名」、「地名」、「時間」、
「距離」など、用途別に区分したものに、「助動詞」を
アクセントが同じパターンに区分したものと組み合わせ
た韻律データを格納する。図１１は、本発明による文例
テーブルデータベースの別の実施形態を示す図である。
本実施形態では図９及び１０に示すように、「人名」、
「地名」、「時間」、「距離」、「状況」など、用途別
に区分したものに、「助詞」及び「助動詞」をアクセン
トが同じパターンに区分したものと組み合わせた韻律デ
ータを格納する。FIG. 10 is a diagram showing another embodiment of the sentence example table database according to the present invention. In the present embodiment, as shown in FIG. 9, "person name", "place name", "time",
The prosody data in which "auxiliary verbs" are combined with those classified into the same accent pattern are stored in those classified by use such as "distance". FIG. 11 is a diagram showing another embodiment of the sentence example table database according to the present invention.
In the present embodiment, as shown in FIGS.
The prosody data in which "particles" and "auxiliary verbs" are combined with the ones in which the accents are divided into the same pattern is stored in the ones classified by use such as "place name", "time", "distance", and "situation".

【００１８】例えば、「東京は／晴天です」のように
「地名」＋「助詞」、または「状況」＋「助動詞」のよ
うに組み合わせた韻律データで格納する。次に、文例テ
ーブルデータベースを用いて新しく文例テーブルを生成
する方法について説明する。図１２は文例テーブルデー
タベースを用いて新しく文例テーブルを生成する方法に
用いる文例テーブル生成装置の構成を示す図である。入
力情報であるアクセント付読み仮名が入力されると、情
報解析部２１は文字列を呼気段落で区切る等の処理を行
う。次に韻律データ検索部２２は、呼気段落で切られた
文字列に基づいて文例テーブルデータベース２３から韻
律データを検索する。次に、文例テーブル生成部２４で
検索された韻律データを組み合わせて文例テーブルを生
成する。そして、生成された文例テーブルを文例テーブ
ル調整部２５で調整して文例テーブルとする。For example, the prosody data is stored as a combination of "place name" + "particle", such as "Tokyo is a fine weather" or "situation" + "auxiliary verb". Next, a method of generating a new sentence example table using the sentence example table database will be described. FIG. 12 is a diagram showing a configuration of a sentence example table generating apparatus used in a method of newly generating a sentence example table using the sentence example table database. When the accented reading kana, which is input information, is input, the information analysis unit 21 performs processing such as dividing a character string into exhalation paragraphs. Next, the prosody data search unit 22 searches the prosody data from the sentence example table database 23 based on the character string cut in the exhalation paragraph. Next, a sentence example table is generated by combining the prosody data retrieved by the sentence example table generation unit 24. Then, the generated sentence example table is adjusted by the sentence example table adjustment unit 25 to be a sentence example table.

【００１９】図１３は本発明により文例テーブルを生成
する方法を説明するためのフローチャートである。文例
テーブルの生成が開始されると、韻律データが区分して
格納された文例テーブルデータベースを検索する（Ｓ
１）。例えば、「○○は／◇◇です。」（「／」は呼気
段落）という文の文例テーブルを作成する場合、この文
例テーブルに該当する韻律データが有るかどうか判断さ
れる（Ｓ２）。Ｙｅｓの場合、即ち「○○は」と「◇◇
です」の韻律データがある場合には、これらの韻律デー
タを組み合わせて、「○○は／◇◇です。」の文例テー
ブルを生成する。FIG. 13 is a flowchart for explaining a method of generating a text example table according to the present invention. When the generation of the sentence example table is started, the sentence example table database in which the prosody data is divided and stored is searched (S
1). For example, when creating a sentence example table of a sentence "○ is / ◇◇"("/" is an exhalation paragraph), it is determined whether or not there is a corresponding prosody data in this sentence example table (S2). In the case of Yes, that is, “○○” and “◇◇
If there is prosody data of "is," these prosody data are combined to generate a sentence example table of "XX is / ◇◇."

【００２０】次に、韻律データのパラメータを調整する
（Ｓ９）。図１４は上記呼気段落間に与える周波数のパ
ターンを表したもので、横軸が時間（ｔ）で、縦軸が周
波数（ｆ）である。例えば、新しく生成した文例テーブ
ルが、「現在地は／横浜駅の／東側です」の文に対応す
る文例テーブルであった場合、図１４に示すようなパタ
ーンに沿った周波数を上記文例テーブルの各呼気段落間
の文字列の韻律データとして与え、韻律データのパラメ
ータを調整する。例えば、最初の「現在地は」のパラメ
ータを周波数の高い（図１４で高い山型）ものとし、
「横浜駅の」を次に周波数の高いものとし、「東側で
す」を一番低い周波数のものとする。Next, the parameters of the prosody data are adjusted (S9). FIG. 14 shows a frequency pattern given between the above-mentioned exhalation paragraphs. The horizontal axis is time (t) and the vertical axis is frequency (f). For example, if the newly generated sentence example table is a sentence example table corresponding to the sentence “Your current location is / East side of Yokohama Station”, the frequency according to the pattern shown in FIG. It is given as prosody data of a character string between paragraphs, and the parameters of the prosody data are adjusted. For example, assume that the parameter of the first “current location” has a high frequency (high mountain shape in FIG. 14),
“Yokohama Station” has the next highest frequency, and “East” is the lowest frequency.

【００２１】次に、必要な韻律データが揃ったか、即ち
新しく生成される文例テーブルの韻律データがすべて揃
ったかどうか判断される（Ｓ５）。Ｎｏの場合Ｓ１にも
どるが、Ｙｅｓの場合、上記検索した韻律データ、また
は後述の規則合成した韻律データを組み合わせて文例テ
ーブルを生成する（Ｓ６）。最後に試聴により文例テー
ブルのアクセント等を調整し修正する（Ｓ７）。Next, it is determined whether necessary prosody data has been prepared, that is, whether all the prosody data of the newly generated sentence example table have been prepared (S5). In the case of No, the process returns to S1, but in the case of Yes, a sentence example table is generated by combining the retrieved prosody data or the prosody data synthesized by rules described later (S6). Finally, the accent and the like of the sentence example table are adjusted and corrected by trial listening (S7).

【００２２】一方、Ｓ２で該当する韻律データがない場
合（Ｎｏ）、同じパターンの助詞、助動詞の韻律データ
があるかどうか判断される（Ｓ３）。Ｙｅｓの場合、助
詞、助動詞の韻律データを用いる（Ｓ８）。例えば、
「○○は／◇◇です。」という文の文例テーブルを生成
する場合、文例テーブルデータベースには、「○○が／◇◇です。」「○○の／◇◇です。」の韻律データしかなく、「○○は」の韻律データがない
場合を想定する。この場合、文例テーブルデータベース
には「○○は」の韻律データはないが、「○○が」と
「○○の」の韻律データはある。そして、このうち
「が」は図５に示すように、「は」と同じパターンＡに
区分されている。そこで「○○は」の韻律データとして
「○○が」の韻律データを用いる。次に、先に説明した
ように、韻律データのパラメータを調整する（Ｓ９）。On the other hand, if there is no corresponding prosody data in S2 (No), it is determined whether there is prosody data of particles and auxiliary verbs of the same pattern (S3). In the case of Yes, prosody data of particles and auxiliary verbs is used (S8). For example,
When generating a sentence example table for the sentence "XX is / ◇◇.", Only the prosodic data of "XX is / ◇◇." It is assumed that there is no prosody data of "○○○". In this case, the sentence example table database does not have the prosody data of "XX", but has the prosody data of "XX" and "XX". As shown in FIG. 5, "ga" is classified into the same pattern A as "ha". Therefore, the prosody data of “XX” is used as the prosody data of “XX”. Next, as described above, the parameters of the prosody data are adjusted (S9).

【００２３】なお、上記説明において、文例テーブルに
ない韻律データは後述の規則合成により生成し、組み合
わせて用いる。Ｓ３でＮｏの場合、即ち同じパターンの
助詞、助動詞の韻律データがない場合、規則合成で生成
した音声から韻律データを取り出す（Ｓ４）。即ち、ア
クセント付読み仮名を入力し、様々な日本語の文章から
抽出した平均的な日本語の規則に従って音素長情報、ピ
ッチ情報を基に韻律データを生成する。次に、必要な韻
律データが揃ったか、即ち新しく生成される文例テーブ
ルの韻律データがすべて揃ったかどうか判断される（Ｓ
５）。Ｎｏの場合Ｓ１にもどるが、Ｙｅｓの場合、韻律
データを組み合わせて文例テーブルを生成する（Ｓ
６）。最後に試聴により文例テーブルを調整し修正する
（Ｓ７）。In the above description, prosody data that is not in the sentence example table is generated by rule synthesis described later and used in combination. In the case of No in S3, that is, when there is no prosody data of particles and auxiliary verbs of the same pattern, prosody data is extracted from the speech generated by the rule synthesis (S4). That is, prosody data is generated based on phoneme length information and pitch information in accordance with an average Japanese rule extracted from various Japanese sentences with accented reading kana. Next, it is determined whether necessary prosody data has been prepared, that is, whether all the prosody data of the newly generated sentence example table have been prepared (S
5). If No, the process returns to S1, but if Yes, the prosody data is combined to generate a sentence example table (S1).
6). Finally, the sentence example table is adjusted and corrected by trial listening (S7).

【００２４】[0024]

【発明の効果】本発明によると、韻律データが用途別、
品詞別に区分されてデータベースに格納されているの
で、はめ込み合成するときに検索し易くなる。また、新
しく文例テーブルを生成する場合、データベースにすで
に格納してある韻律データを用いて文例テーブルを生成
できるので、生成に手間がかからず、また生成するため
の時間を減ずることができる。According to the present invention, the prosody data is classified by application.
Since it is stored in the database by being classified according to the part of speech, it becomes easy to search when performing inlay synthesis. In addition, when a new sentence example table is generated, the sentence example table can be generated using the prosody data already stored in the database, so that the generation is not troublesome and the time for generation can be reduced.

[Brief description of the drawings]

【図１】文例テーブルを用いたテキスト音声合成システ
ムのフローチャートを示した図である。FIG. 1 is a diagram showing a flowchart of a text-to-speech synthesis system using a sentence example table.

【図２】文例テーブルを生成するための構成を示した図
である。FIG. 2 is a diagram illustrating a configuration for generating a sentence example table;

【図３】本発明による文例テーブルデータベースの実施
形態を示した図である。FIG. 3 is a diagram showing an embodiment of a sentence example table database according to the present invention.

【図４】本発明による文例テーブルデータベースの別の
実施形態を示した図である。FIG. 4 is a diagram showing another embodiment of a sentence example table database according to the present invention.

【図５】本発明による文例テーブルデータベースの別の
実施形態を示した図である。FIG. 5 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図６】本発明による文例テーブルデータベースの別の
実施形態を示した図である。FIG. 6 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図７】本発明による文例テーブルデータベースの別の
実施形態を示した図である。FIG. 7 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図８】本発明による文例テーブルデータベースの別の
実施形態を示した図である。FIG. 8 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図９】本発明による文例テーブルデータベースの別の
実施形態を示した図である。FIG. 9 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図１０】本発明による文例テーブルデータベースの別
の実施形態を示した図である。FIG. 10 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図１１】本発明による文例テーブルデータベースの別
の実施形態を示した図である。FIG. 11 is a diagram showing another embodiment of the sentence example table database according to the present invention.

【図１２】本発明文例テーブルデータベースを用いて新
しく文例テーブルを生成する方法に用いる文例テーブル
生成装置の構成を示す図である。FIG. 12 is a diagram illustrating a configuration of a sentence example table generating apparatus used in a method of newly generating a sentence example table using the sentence example table database of the present invention.

【図１３】本発明による文例テーブルデータベースを用
いて新しく文例テーブルを生成する方法を説明するため
のフローチャートである。FIG. 13 is a flowchart illustrating a method of generating a new sentence example table using the sentence example table database according to the present invention.

【図１４】韻律データに与える周波数パターンの一例を
表した図である。FIG. 14 is a diagram illustrating an example of a frequency pattern given to prosody data.

[Explanation of symbols]

１１…Ａ／Ｄ変換器１２…中間データ生成部１３…韻律データ抽出部１４…文例テーブル調整部１５…文例テーブルデータベース２１…情報解析部２２…韻律データ検索部２３…文例テーブルデータベース２４…文例テーブル生成部２５…文例テーブル調整部 DESCRIPTION OF SYMBOLS 11 ... A / D converter 12 ... Intermediate data generation part 13 ... Prosodic data extraction part 14 ... Sentence example table adjustment part 15 ... Sentence example table database 21 ... Information analysis part 22 ... Prosody data search part 23 ... Sentence example table database 24 ... Sentence example table Generation unit 25: sentence example table adjustment unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者中石信一兵庫県神戸市兵庫区御所通１丁目２番28号富士通テン株式会社内 (72)発明者平嶋善隆兵庫県神戸市兵庫区御所通１丁目２番28号富士通テン株式会社内Ｆターム(参考） 5D045 AA07 9A001 EE05 FF03 HH11 HH18 JJ01 JJ77 KK02 ──────────────────────────────────────────────────続き Continuing from the front page (72) Inventor Shinichi Nakaishi 1-2-28 Goshodori, Hyogo-ku, Kobe, Hyogo Prefecture Inside Fujitsu Ten Limited (72) Inventor Yoshitaka Hirashima 1 Goshodori, Hyogo-ku, Kobe-shi, Hyogo F-term (reference) in Fujitsu Ten Co., Ltd. 2-28 Chome 5D045 AA07 9A001 EE05 FF03 HH11 HH18 JJ01 JJ77 KK02

Claims

[Claims]

1. A speech synthesis sentence example table database composed of sentence units or prosodic data of elements constituting a sentence used for generating a speech waveform in in-speech speech synthesis, wherein the database is divided into parts of speech at the end. A sentence example table database having stored prosody data.

2. The sentence example table database according to claim 1, wherein the part of speech is a particle and an auxiliary verb, and the prosody data includes a particle and an auxiliary verb in which accents are further classified and stored for the same pattern.

3. The sentence example table database according to claim 1, wherein the part-of-speech is a particle or an auxiliary verb, and has prosody data in which the particle or the auxiliary verb is further stored by classifying accents by the same pattern.

4. A speech synthesis sentence example database comprising sentence units or prosodic data of elements constituting a sentence used for generating a speech waveform in in-speech speech synthesis, wherein the sentence example table is classified according to the use at the time of use. A sentence example table database having stored prosody data.

5. A speech synthesis sentence example table database composed of prosody data of sentence units or elements constituting a sentence used for generating a speech waveform in in-speech synthesis, wherein the prosody is classified by use at the time of use. A sentence example table database having prosody data stored in combination with data and prosody data classified according to part of speech at the end.

6. A prosody which is a combination of a part-of-speech which is a particle and an auxiliary verb, and which is a combination of prosody data in which particles and auxiliary verbs are further divided into patterns having the same accent, and prosody data which is divided in accordance with the intended use. The sentence example table database according to claim 5 having data.

7. The prosody which is a combination of a prosody data in which the part of speech is a particle or an auxiliary verb, and the prosody data in which the particle or the auxiliary verb is further classified by the same accent and the prosody data which is classified by the use at the time of use. The sentence example table database according to claim 5 having data.

8. A method for generating a speech synthesis sentence example table composed of prosody data used for generating a speech waveform in in-speech speech synthesis, wherein the prosody data constituting the sentence example table is stored in a table. 7. A method of generating a sentence example table by searching from the sentence example table database described in any one of 7 and combining the searched prosody data.

9. A method for generating a speech synthesis sentence example table composed of prosody data used for generating a speech waveform in in-speech synthesis, wherein the prosody data constituting the sentence example table is stored. A method of generating a sentence example table by combining the searched prosodic data from the sentence example table database described in any one of 3, 6, and 7, wherein the prosody data of the searched particle or auxiliary verb is stored in the sentence example table database. If not, a sentence example table is generated using the prosody data of the particle or auxiliary verb in the sentence example table database in the same pattern as the particle or auxiliary verb in the same pattern.

10. The method according to claim 8, wherein a parameter of the retrieved prosody data is adjusted.

11. The method of generating a sentence example table according to claim 8, wherein as a result of the search, when prosody data is not in the database, the prosody data generated by rule synthesis is used.