JP2002156990A

JP2002156990A - Method and apparatus for pause duration processing in chinese voice synthesis

Info

Publication number: JP2002156990A
Application number: JP2000355758A
Authority: JP
Inventors: Sesho Son; 世章孫; Kinin Sha; 琴韻謝
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2000-11-22
Filing date: 2000-11-22
Publication date: 2002-05-31

Abstract

PROBLEM TO BE SOLVED: To synthesize a voice of Chinese having a natural pause by reproducing a pause for breathing in a human speech. SOLUTION: A word inspection part 12 and a part-of-speech/extension syntax inspection part 13 inspect semantic units in an object sentence of voice synthesis by referring to a dictionary memory 11 stored with information for extending words into a phrase. The positions before and after a semantic unit are provided as pause position candidates. A pause position calculation part 18 calculates weighted values of pauses as respective pause position candidates through a sequence based upon respective semantic units in the sentence and their positions. When a pause position candidate has a weighted value of a pause larger than a predetermined value, it is determined as a pause position and pause duration is specified. Through this calculation of the weighted value of the pause, the pause position in the sentence and its pause duration can accurately be determined.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、合成された音声の
ポーズ位置及びポーズ継続時間を決定し、自然音声らし
さを得るための、中国語音声合成における音声のポーズ
継続時間を処理するポーズ継続時間処理方法及び装置に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a pause duration for processing a pause duration of a speech in Chinese speech synthesis for determining a pause position and a pause duration of a synthesized speech and obtaining a natural voice. The present invention relates to a processing method and apparatus.

【０００２】[0002]

【従来の技術】自然発話音声において、話者の呼吸の動
作により、文の適当な位置にポーズが存在する。一般的
な法則として、人間の発話における周期的な息継ぎは、
中国語の文字で６文字から８文字ごと（すなわち中国語
の音節）に起き、フレーズ又は単語を構成する意味的単
位（semantic unit）の前又は後に発生する。息継ぎの
間は、音声の持続はややポーズされる。長い文の音声合
成に対して、ポーズする複数の位置とポーズする複数の
継続時間とを適切に決定することは、合成された音声の
自然さの度合いにおいて大きな影響力を持つ。2. Description of the Related Art In spontaneously uttered speech, a pause exists at an appropriate position in a sentence due to a breathing motion of a speaker. As a general rule, periodic breathing in human speech is
Occurs every six to eight Chinese characters (ie, Chinese syllables) and occurs before or after the semantic units that make up the phrase or word. During the breath, the duration of the voice is slightly paused. For speech synthesis of long sentences, properly determining the positions to pause and the durations to pause has a great influence on the naturalness of the synthesized speech.

【０００３】中国語のテキストを音声合成するための従
来の装置は、文中における各文字の音節の、音声の継続
時間を処理することのみを考慮している。それらの装置
は、ポーズ位置及びポーズ継続時間の処理については研
究していない。例えば、「合成された音声のシステムの
ための音声の継続時間の処理方法及び装置（Speech Dur
ation Processing Method and Apparatus for Synthesi
zed Speech System）」と題された中華民国特許出願第
８２１０３２２２号に関連して、本明細書の図１３は、
音声継続時間を処理する単位としてリズム（rhyme）を
用いることによって、任意の長さのピンイン文の各音節
の継続時間を決定するための処理装置を図示するブロッ
ク図である。図１３において１１０は、キーボードから
構成されるようなデバイスを利用してピンイン文を入力
するためのピンイン文入力部を表す。１１１は複数のフ
レーズマーカーに従ってフレーズ中の複数の音節の位置
を検査するためのフレーズ検査部を表す。１１２は複数
のトーンマーカーを用いてピンイン文の中の複数の音節
を検査するためのトーン／音節検査部を表す。１１３
は、各単音節のピンインマーカーと、それについて構成
するリズムの指定された数とを記憶するための音節／リ
ズム照合部を表す。１１４は、音節／リズム照合部を用
いて、検査された音節を構成する複数のリズムを検査
し、リズムのデータを記憶するためのリズム検査部を表
す。１１５は各リズムの基本音声の継続時間を記憶する
ための基本音声継続時間記憶部を表す。１１６は、検査
されたリズムに従って基本音声継続時間記憶部から複数
のリズムを検査するための、基本音声継続時間決定部を
表す。１１７は、各リズムのトーンと、文中のリズムの
位置と、接続された複数のリズムのクラスと、同じくリ
ズムが属するフレーズ中の韻の位置とを用いて確立され
た、音声継続時間パラメータ記憶部をあらわす。１１８
は、リズムのトーンと、文中のリズムの位置と、接続さ
れた複数のリズムのクラスと、同じくリズムが属するフ
レーズ中の韻の位置とを、複数のリズムの音声継続時間
パラメータを音声継続時間パラメータ記憶部から検索す
るために複数のインデックスキー（indexing key）とし
て用い、複数のリズムの全音声継続時間を記録すること
によって音節音声の継続時間を計算するための、音節音
声継続時間計算部を表す。１１９は、装置が継続時間の
処理において用いなければならない互いに異なる複数の
レジスタを表す。[0003] Conventional devices for speech synthesis of Chinese text only consider processing the duration of speech of the syllable of each character in the sentence. These devices do not study the processing of pause position and pause duration. See, for example, "Speech Duration Processing Method and Apparatus (Speech Dur
ation Processing Method and Apparatus for Synthesi
FIG. 13 herein relates to Republic of China Patent Application No. 82103222 entitled "Zed Speech System".
FIG. 2 is a block diagram illustrating a processing device for determining the duration of each syllable of a Pinyin sentence of an arbitrary length by using a rhythm (rhyme) as a unit for processing a voice duration. In FIG. 13, reference numeral 110 denotes a Pinyin sentence input unit for inputting a Pinyin sentence using a device such as a keyboard. Reference numeral 111 denotes a phrase inspection unit for inspecting the positions of a plurality of syllables in a phrase according to a plurality of phrase markers. Reference numeral 112 denotes a tone / syllable inspection unit for inspecting a plurality of syllables in a Pinyin sentence using a plurality of tone markers. 113
Represents a syllable / rhythm matching unit for storing the pinyin marker of each single syllable and the specified number of rhythms constituting the syllable. Reference numeral 114 denotes a rhythm inspection unit for inspecting a plurality of rhythms constituting the inspected syllable by using a syllable / rhythm matching unit and storing rhythm data. Reference numeral 115 denotes a basic voice duration storage unit for storing the duration of the basic voice of each rhythm. Reference numeral 116 denotes a basic voice duration determining unit for testing a plurality of rhythms from the basic voice duration storage unit according to the inspected rhythm. A voice duration parameter storage unit 117 is established using the tone of each rhythm, the position of the rhythm in the sentence, the plurality of connected rhythm classes, and the position of the rhyme in the phrase to which the rhythm belongs. To represent. 118
Is the rhythm tone, the rhythm position in the sentence, the connected rhythm classes, and the rhyme position in the phrase to which the rhythm belongs. Represents a syllable voice duration calculator for calculating the duration of a syllable voice by recording the total voice duration of a plurality of rhythms and using it as a plurality of indexing keys to retrieve from storage. . 119 represents a plurality of different registers that the device must use in processing the duration.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上で言
及された従来の音声継続時間処理装置は、文中の複数の
文字の複数の音節の音声継続時間の処理のみを考慮し、
文中のポーズ位置及びポーズ継続時間のより徹底的な処
理は実行しない。例として、合成された中国語の音声
「堅強的求生意志是能However, the above-mentioned conventional speech duration processing apparatus described above considers only processing of speech duration of a plurality of syllables of a plurality of characters in a sentence,
No more thorough processing of pause positions and pause durations in sentences is performed. As an example, synthesized Chinese speech "

【外１】戰勝病魔最有效的方法」（日本語訳：堅強に生きる道を
求める意志は病魔を克服することに一番効率的な方法で
ある）を取り上げる。図７に図示された、上述の従来型
の音声継続時間処理装置を用いて合成されたテキストの
波形では、文は一息で発話されたように聞こえ、それは
自然な音声ではない。ポーズ位置が「堅強的求生意志」
と、「是能[Outside 1] The most effective way to overcome the disease is "The most effective way to overcome the disease." In the text waveform illustrated in FIG. 7 synthesized using the conventional speech duration processing device described above, the sentence sounds as if it were spoken in a single breath, which is not a natural sound. Pause position is "strong recruitment will"
And "I know

【外２】戰勝病魔」と、「最有效的方法」との間、すなわち例文
を上記のように３つに分割したときの第１の部分と第２
の部分の間と、第２の部分と第３の部分の間とに挿入さ
れ、ポーズ継続時間がそれぞれ１２０ｍｓと８０ｍｓに
決定されると、合成された音声は図８に図示されたよう
な波形を有する。息の変化を模倣するように複数のポー
ズが存在するので、合成された音声のリズムの自然さと
意味のわかりやすさとは明らかに向上する。ゆえに、音
声合成される文に対して、各音節の音声の継続時間だけ
が考慮されるべきではない。より重要なこととして、非
常に自然で非常にわかりやすい合成された言語を得るた
めに、ポーズ継続時間がテキスト中の適当な位置に追加
されなければならない。文中で複数のポーズの適当な位
置と継続時間とを決定する方法に関することが、本発明
が解決しようとする課題である。[Outside 2] The first part and the second part when the example sentence is divided into three parts as described above, between the "victory sickness" and the "most effective method".
And between the second part and the third part, and when the pause duration is determined to be 120 ms and 80 ms, respectively, the synthesized voice has a waveform as shown in FIG. Having. Since there are a plurality of poses that mimic a change in breath, the naturalness of the rhythm of the synthesized voice and the clarity of the meaning are clearly improved. Therefore, only the duration of the speech of each syllable should not be considered for a sentence to be synthesized. More importantly, in order to obtain a very natural and very understandable synthesized language, the pause duration must be added at the appropriate place in the text. It is an object of the present invention to solve a method of determining appropriate positions and durations of a plurality of poses in a sentence.

【０００５】ゆえに本発明の主な目的は、中国語のテキ
ストを音声に変換するシステムに上述の問題を克服させ
るための、中国語音声合成におけるポーズ継続時間処理
方法及び装置を提供することである。SUMMARY OF THE INVENTION It is therefore a primary object of the present invention to provide a method and apparatus for processing pause duration in Chinese speech synthesis so that a system for converting Chinese text to speech can overcome the aforementioned problems. .

【０００６】[0006]

【課題を解決するための手段】本発明の第１の態様に係
る中国語音声合成におけるポーズ継続時間処理方法は、
入力部及びメモリバッファ部を有するコンピュータシス
テムにおける中国語のテキストを音声に変換するシステ
ムでの使用に適応され、中国語の語彙と、品詞及び拡張
シンタックスを含む対応する情報とを記憶するための辞
書メモリを構成し、複数の前でポーズする文字と、複数
の前ポーズパラメータを含む対応する情報とを記憶する
ための前でポーズする文字のテーブルメモリを構成し、
複数の後でポーズする文字と、複数の後ポーズパラメー
タを含む対応する情報とを記憶するための後でポーズす
る文字のテーブルメモリを構成し、単語及びフレーズの
うちのいずれか一方を含む意味的単位の長さに従って複
数の一般ポーズパラメータを記憶するためのポーズパラ
メータ記憶部を構成するステップと、上記辞書メモリに
記憶された中国語の語彙の複数の単語と入力文を比較す
ることにより、上記入力部を用いて入力された上記入力
文中の各単語の境界の位置を決定するステップと、上記
辞書メモリを参照し、境界部が決定された各単語の品詞
及び拡張シンタックスを検索するステップと、上記拡張
シンタックスと、上記文中の複数の単語のうちの互いに
隣接するものの品詞の関係とに従って、文中の複数の単
語を複数のフレーズに結合するステップと、複数の単語
の複数のフレーズへの結合の後で、これらの意味的単位
の前及び後の位置がポーズ位置候補となるように、上記
入力文から分割された複数の各意味的単位のポーズの重
みづけられた値を計算するステップとを備え、上記ステ
ップは複数の前でポーズする文字を検索し、それらにつ
いて重みづけられた値を計算するステップと、複数の後
でポーズする文字を検索し、それらについて重みづけら
れた値を計算するステップとを含み、上記複数のポーズ
位置候補のポーズの重みづけられた値を計算することに
よって、複数のポーズ位置及び複数のポーズ継続時間を
決定するステップを備え、予め決められた値よりも大き
いポーズの重みづけられた値を有するポーズ位置候補の
各々はポーズ位置であると決定され、かつポーズレベル
を与えられ、上記ポーズ継続時間はそのポーズレベルに
従って決定されることを特徴とする。According to a first aspect of the present invention, there is provided a pause duration processing method in Chinese speech synthesis.
Adapted for use in a system for converting Chinese text to speech in a computer system having an input portion and a memory buffer portion, for storing a Chinese vocabulary and corresponding information including part of speech and extended syntax. Configuring a dictionary memory and configuring a table memory of front-paused characters for storing a plurality of previously-paused characters and corresponding information including a plurality of previous-pause parameters;
Forming a table memory of later-paused characters for storing a plurality of later-paused characters and corresponding information including a plurality of later-paused parameters, wherein the semantic includes one of a word and a phrase. Configuring a pose parameter storage unit for storing a plurality of general pose parameters according to the length of the unit, and comparing the input sentence with a plurality of words of a Chinese vocabulary stored in the dictionary memory. Determining the position of the boundary of each word in the input sentence input using the input unit; and referencing the dictionary memory and searching for the part of speech and the extended syntax of each word whose boundary has been determined. , A plurality of words in a sentence in a plurality of frames according to the extended syntax and a part-of-speech relationship of adjacent ones of the plurality of words in the sentence. And combining the plurality of words into the plurality of phrases, so that the positions before and after these semantic units are candidate pose positions. Calculating a weighted value of the pose of the semantic unit, wherein the step retrieves a plurality of previously paused characters and calculates a weighted value for them; Retrieving the characters to be paused and calculating weighted values for them; calculating the weighted values of the poses of the plurality of pose position candidates to obtain a plurality of pose positions and a plurality of poses. Determining a duration, wherein each of the pose position candidates having a pose weighted value greater than a predetermined value is determined to be a pose position. And given a pause level, the pause duration is being determined in accordance with the pause level.

【０００７】本発明の第２の態様に係る中国語音声合成
におけるポーズ継続時間処理装置は、文入力部及びメモ
リバッファ部であるシステムを有する中国語のテキスト
を音声に変換するシステムでの使用に適応され、中国語
の語彙と、品詞及び拡張シンタックスを含む対応する情
報とを記憶するための辞書メモリと、複数の前でポーズ
する文字と、前ポーズパラメータを含む対応する情報と
を記憶するための前でポーズする文字のテーブルメモリ
と、複数の後でポーズする文字と、後ポーズパラメータ
を含む対応する情報とを記憶するための後でポーズする
文字のテーブルメモリと、単語及びフレーズのうちのい
ずれか一方を含む意味的単位の長さに従って一般ポーズ
パラメータを記憶するためのポーズパラメータ記憶部
と、上記辞書メモリに記憶された中国語の語彙の複数の
単語と入力文を比較することにより、上記入力部を用い
て入力された上記入力文中の各単語の境界の位置を決定
するための境界位置決定手段と、上記辞書メモリを参照
し、境界の位置が決定された各単語の品詞及び拡張シン
タックスを検索するための検索手段と、上記拡張シンタ
ックスと、上記文中の複数の単語のうちの互いに隣接す
るものの品詞の関係とに従って、文中の複数の単語を複
数のフレーズに結合する結合手段と、複数の単語の複数
のフレーズへの結合の後で、これらの意味的単位の前及
び後の位置がポーズ位置候補となるように、上記入力文
から分割された複数の各意味的単位のポーズの重みづけ
られた値を計算する計算手段とを備え、上記手段は複数
の前でポーズする文字を検索し、それらについて重みづ
けられた値を計算する手段と、複数の後でポーズする文
字を検索し、それらについて重みづけられた値を計算す
る手段とを含み、上記複数のポーズ位置候補のポーズの
重みづけられた値を計算することによって、複数のポー
ズ位置及び複数のポーズ継続時間を決定するポーズ決定
手段を備え、予め決められた値よりも大きいポーズの重
みづけられた値を有するポーズ位置候補の各々はポーズ
位置であると決定され、かつポーズレベルを与えられ、
上記ポーズ継続時間はそのポーズレベルに従って決定さ
れることを特徴とする。A pause duration processing apparatus in Chinese speech synthesis according to a second aspect of the present invention is used in a system for converting Chinese text to speech having a system which is a sentence input section and a memory buffer section. An adapted dictionary memory for storing Chinese vocabulary and corresponding information including part-of-speech and extended syntax, storing a plurality of previously paused characters, and corresponding information including a previous pause parameter. A table memory of characters to be paused before, a table memory of characters to be paused to store a plurality of characters to be paused later, and corresponding information including a post-pause parameter; A pose parameter storage unit for storing a general pose parameter in accordance with the length of a semantic unit including any one of the above, and the dictionary memory Boundary position determining means for determining a position of a boundary of each word in the input sentence input by using the input unit by comparing a plurality of words in the stored Chinese vocabulary with the input sentence, A search unit for referring to the dictionary memory and searching for the part of speech and the extended syntax of each word whose boundary position has been determined; and the extended syntax and a plurality of words in the sentence that are adjacent to each other. A combining means for combining a plurality of words in a sentence into a plurality of phrases according to the part of speech, and a position before and after these semantic units after a combination of the plurality of words into a plurality of phrases. Calculating means for calculating a weighted value of the pose of each of the plurality of semantic units divided from the input sentence so as to be a candidate, wherein the means searches for a plurality of characters to be paused before, It Means for calculating a weighted value for the plurality of pose position candidates, and means for searching for a plurality of characters to be paused and calculating a weighted value for them. A plurality of pose positions and a plurality of pause durations by calculating the calculated pose values, and each of the pose position candidates having a pose weight value greater than a predetermined value is provided. Is determined to be a pause position and given a pause level,
The pause duration is determined according to the pause level.

【０００８】本発明の第１の態様に係る中国語音声合成
におけるポーズ継続時間処理方法のデータ構成と処理ス
テップによれば、音声合成されることを待機している任
意の長さの中国語の文は、文中の各単語の音節の位置を
予め構成された辞書メモリに記憶された単語と比較する
ことにより検査する単語検査ステップを最初に受ける。
次いで、品詞／拡張シンタックス検査ステップにおい
て、各単語の品詞及び拡張シンタックスは辞書メモリを
参照して検査される。さらにフレーズ拡張ステップにお
いて、文中の複数の単語のうちの互いに隣接するものは
拡張シンタックスと複数の品詞の関係とに従い複数のフ
レーズに結合される。このとき、文中のすべての単語又
はフレーズは基本意味的単位とみなされる。一般に、意
味的単位の中の文字数が多くなるほど、それによってよ
り深い意味が含意される。これらの意味的単位の前及び
後の位置はポーズ位置のための候補である。その後、ポ
ーズの重みづけられた値を計算するステップにおいて、
検査された又は拡張された複数の意味的単位のポーズの
重みづけられた値は、文中の意味的単位の順序に従って
計算される。重みづけられた値は３つの方法、すなわち
前のポーズ、後のポーズ、及び一般のポーズで計算され
る。前のポーズの重みづけられた値の計算は、予め構成
された前でポーズする文字のテーブルメモリにおける前
でポーズする文字との比較を介して実行される。前でポ
ーズする文字が検査されると、それについてポーズの重
みづけられた値が計算される。後のポーズの重みづけら
れた値の計算は、予め構成された後でポーズする文字の
テーブルメモリにおける後でポーズする文字との比較を
介して実行される。後でポーズする文字が検査される
と、それについてポーズの重みづけられた値が計算され
る。一般のポーズの重みづけられた値の計算は、意味的
単位の中の文字数に基づく。予め構成されたポーズパラ
メータ記憶部からポーズパラメータを見出だすことによ
り、それについて重みづけられた値が計算される。最後
に、ポーズ位置及びポーズ継続時間を決定するステップ
において、複数の意味的単位のポーズの重みづけられた
値に従って、複数の意味的単位の前及び後にポーズが必
要であるか否かと、ポーズ継続時間とが決定される。上
記の結果から、音声合成されることを待機している中国
語の文に対して、自然な音声に対応した複数のポーズ位
置と複数のポーズ継続時間が取得できる。According to the data configuration and processing steps of the pause duration processing method in Chinese speech synthesis according to the first aspect of the present invention, a Chinese language of an arbitrary length waiting for speech synthesis is provided. The sentence first undergoes a word checking step in which the syllable position of each word in the sentence is checked by comparing it with words stored in a pre-configured dictionary memory.
Next, in the part of speech / extended syntax checking step, the part of speech and the extended syntax of each word are checked with reference to the dictionary memory. Further, in the phrase expansion step, adjacent ones of the plurality of words in the sentence are combined into a plurality of phrases according to the expanded syntax and the relationship between the plurality of parts of speech. At this time, all words or phrases in the sentence are regarded as basic semantic units. In general, the more characters in a semantic unit, the deeper the meaning is implied thereby. Positions before and after these semantic units are candidates for pause positions. Then, in the step of calculating the weighted value of the pose,
The weighted values of the examined or expanded semantic unit poses are calculated according to the order of the semantic units in the sentence. The weighted values are calculated in three ways: before pose, after pose, and general pose. The calculation of the weight value of the previous pose is performed via a comparison of the pre-configured previously paused character with the previously paused character in the table memory. Once the previously paused character has been examined, a weighted value of the pause is calculated for it. The calculation of the weight value of the later pose is performed via a comparison of the pre-configured later-paused character with the later-paused character in the table memory. If the character to be paused later is examined, a weighted value of the pause is calculated for it. The calculation of the weight value of the general pose is based on the number of characters in the semantic unit. By finding a pose parameter from a pre-configured pose parameter storage, a weighted value is calculated for it. Finally, in the step of determining the pause position and the pause duration, whether a pause is required before and after the plurality of semantic units according to the weighted value of the pose of the plurality of semantic units, The time is determined. From the above results, a plurality of pause positions and a plurality of pause durations corresponding to natural speech can be acquired for a Chinese sentence waiting to be synthesized.

【０００９】本発明の第２の態様に係る中国語音声合成
におけるポーズ継続時間処理装置の構成によると、最初
に単語検査部が、予め構成された辞書メモリに記憶され
ている単語との比較によって、音声合成されることを待
機している任意の長さの中国語の文における各単語の音
節の位置を検査する。次いで品詞／拡張シンタックス検
査部は、各単語の品詞及び拡張シンタックスを辞書メモ
リを参照して検査する。さらにフレーズ拡張部が利用さ
れ、拡張シンタックスと複数の品詞の関係とに従い文中
の複数の単語のうちの互いに隣接するものを複数のフレ
ーズに結合する。このとき文中の複数の単語又はフレー
ズは基本意味的単位とみなされる。意味的単位の中の文
字数が多くなるほど、それによってより深い意味が含意
される。これらの意味的単位の前及び後の位置はポーズ
位置のための候補となる。その後、ポーズの重みづけら
れた値計算部を介して、検査された又は拡張された複数
の意味的単位のポーズの重みづけられた値は、文中の複
数の意味的単位の順序に従って計算される。重みづけら
れた値は３つの方法、すなわち前のポーズ、後のポー
ズ、及び一般のポーズで計算される。前のポーズの重み
づけられた値の計算は、予め構成された前でポーズする
文字のテーブルメモリにおける前でポーズする文字との
比較を介して実行される。前でポーズする文字が検査さ
れると、それについてポーズの重みづけられた値が計算
される。後のポーズの重みづけられた値の計算は、予め
構成された後でポーズする文字のテーブルメモリにおけ
る後でポーズする文字との比較を介して実行される。後
でポーズする文字が検査されると、それについてポーズ
の重みづけられた値が計算される。一般のポーズの重み
づけられた値の計算は、意味的単位の中の文字数に基づ
く。予め構成されたポーズパラメータ記憶部からポーズ
パラメータを見出だすことにより、それについて重みづ
けられた値が計算される。最後に、ポーズ位置及びポー
ズ継続時間決定部を介して、複数の意味的単位のポーズ
の重みづけられた値に従って、意味的単位の前及び後に
ポーズが必要であるか否かと、ポーズ継続時間とが決定
される。上記の結果から、音声合成され、使用のために
出力されることを待機している中国語の文に対して、自
然音声に対応した複数のポーズ位置と複数のポーズ継続
時間を取得できる。According to the configuration of the pause duration processing apparatus for Chinese speech synthesis according to the second aspect of the present invention, first, the word checker compares the word with a word stored in a preconfigured dictionary memory. Check the location of the syllables of each word in a Chinese sentence of any length waiting to be synthesized. Next, the part-of-speech / extended syntax checking unit checks the part of speech and the extended syntax of each word with reference to the dictionary memory. Further, a phrase expansion unit is used to combine adjacent ones of a plurality of words in a sentence into a plurality of phrases in accordance with the expanded syntax and the relationship between the plurality of parts of speech. At this time, a plurality of words or phrases in the sentence are regarded as basic semantic units. The greater the number of characters in a semantic unit, the deeper the meaning it implies. Positions before and after these semantic units are candidates for pause positions. Then, via the pose weighted value calculator, the weighted values of the poses of the inspected or expanded semantic units are calculated according to the order of the semantic units in the sentence. . The weighted values are calculated in three ways: before pose, after pose, and general pose. The calculation of the weight value of the previous pose is performed via a comparison of the pre-configured previously paused character with the previously paused character in the table memory. Once the previously paused character has been examined, a weighted value of the pause is calculated for it. The calculation of the weight value of the later pose is performed via a comparison of the pre-configured later-paused character with the later-paused character in the table memory. If the character to be paused later is examined, a weighted value of the pause is calculated for it. The calculation of the weight value of the general pose is based on the number of characters in the semantic unit. By finding a pose parameter from a pre-configured pose parameter storage, a weighted value is calculated for it. Finally, via a pause position and pause duration determination unit, according to the weighted values of the poses of the plurality of semantic units, whether or not a pause is required before and after the semantic unit, Is determined. From the above results, a plurality of pause positions and a plurality of pause durations corresponding to natural speech can be obtained for a Chinese sentence that is synthesized and is waiting to be output for use.

【００１０】本発明の他の特徴と利点は、添付の図を参
照して、以下の好ましい実施形態の詳細な説明において
明らかになる。[0010] Other features and advantages of the present invention will become apparent in the following detailed description of the preferred embodiments, taken in conjunction with the accompanying drawings.

【００１１】[0011]

【発明の実施の形態】以下図面を参照して本発明に係る
実施形態について説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１２】図１は、本発明に係る中国語のテキストを
音声に変換するシステムのためのポーズ継続時間の処理
装置の好ましい実施形態を説明するシステムのブロック
図である。図１に図示されているように、上記システム
は以下の構成要素を含む。FIG. 1 is a system block diagram illustrating a preferred embodiment of a pause duration processing apparatus for a system for converting Chinese text to speech according to the present invention. As illustrated in FIG. 1, the system includes the following components:

【００１３】１０は、音声合成されるべき文のテキスト
を入力するための、キーボードから構成されるような文
入力部を示す。Reference numeral 10 denotes a sentence input unit, such as a keyboard, for inputting text of a sentence to be synthesized.

【００１４】１１は、複数の中国語の単語と、品詞及び
拡張シンタックスなどのような対応する情報とを記憶す
るための辞書メモリを示す。辞書メモリ１１の構成を説
明する概略図が図９に図示されている。上記品詞は名詞
（Ｎ）、動詞（Ｖ）、形容詞（Ｊ）、副詞（Ａ）などを
含む。単語からフレーズに拡張するか否かを決定するた
めの上記拡張シンタックスは、拡張が無いことを表すφ
と、後に名詞が接続されることを表すＡＮと、前に名詞
が接続されることを表すＢＮと、後に動詞が接続される
ことを表すＡＶと、前に動詞が接続されることを表すＢ
Ｖとを含む。また、辞書メモリ１１は音声マーカー等の
他の情報を含んでもよい。音声マーカーは読み記号（発
音記号）と呼ぶこともでき、例えば図９に示されている
“qiu2 sheng1 yi4 zhi4”（中国語の「求生意志」を表
す）は読み記号から組合わされた読み記号列である。こ
こでの数字「２，１，４，４」はそれぞれ中国語に固有
のアクセントである声調を表すものである。Reference numeral 11 denotes a dictionary memory for storing a plurality of Chinese words and corresponding information such as part of speech and extended syntax. A schematic diagram illustrating the configuration of the dictionary memory 11 is shown in FIG. The parts of speech include nouns (N), verbs (V), adjectives (J), adverbs (A), and the like. The expansion syntax for determining whether to expand from a word to a phrase is φ φ indicating that there is no expansion.
And AN indicating that a noun is connected later, BN indicating that a noun is connected before, AV indicating that a verb is connected later, and B indicating that a verb is connected before.
V. Further, the dictionary memory 11 may include other information such as a voice marker. The phonetic marker can also be called a phonetic symbol (pronunciation symbol). For example, “qiu2 sheng1 yi4 zhi4” (representing “Chinese willingness” in Chinese) shown in FIG. 9 is a phonetic symbol string combined from the phonetic symbol It is. The numbers “2, 1, 4, 4” here represent tones that are accents unique to Chinese.

【００１５】１２は、上記辞書メモリ１１に記憶された
単語と比較することにより、入力文の中の各単語の音節
の位置を決定する単語検査部を表す。Reference numeral 12 denotes a word inspection unit that determines the position of a syllable of each word in the input sentence by comparing the word with the word stored in the dictionary memory 11.

【００１６】１３は、各検査された単語に対応する品詞
及び拡張シンタックスを上記辞書メモリ１１から検索す
るための品詞／拡張シンタックス検査部を表す。Reference numeral 13 denotes a part-of-speech / extended syntax check unit for searching the dictionary memory 11 for a part-of-speech and an extended syntax corresponding to each inspected word.

【００１７】１４は、各単語の品詞及び拡張シンタック
スを用いて、互いに隣接する単語をより完全な意味を有
する複数の単語からなる単位（又は複数のフレーズ）に
拡張するためのフレーズ拡張部をあらわす。The phrase expansion unit 14 expands a word adjacent to each other into a unit (or a plurality of phrases) having a more complete meaning by using the part of speech and the expansion syntax of each word. It represents.

【００１８】１５は、「前でポーズする文字」（以下、
前ポーズ文字と呼ぶ）及び対応する情報を記憶するため
の前でポーズする文字のテーブルメモリを表す。上記前
ポーズ文字は、中国語を発話する際に、その字の前にポ
ーズが必要になる文字である。上記対応する情報は、例
えば前ポーズ文字の前ポーズの重みづけられた値を計算
するための、前ポーズの重みづけられた値のパラメータ
（ｆｐ）である。前ポーズ文字のテーブルメモリ１５の
構造の概要図が図１０に図示されている。Reference numeral 15 denotes a “character to be paused before” (hereinafter, “character to be paused”).
(Referred to as the previous pause character) and a table memory of characters to pause before for storing corresponding information. The preceding pause character is a character that requires a pause before the character when speaking Chinese. The corresponding information is, for example, a parameter (fp) of the weight value of the previous pose for calculating the weight value of the previous pose of the previous pose character. A schematic diagram of the structure of the table memory 15 for the previous pause character is shown in FIG.

【００１９】１６は、「後でポーズする文字」（以下、
後ポーズ文字と呼ぶ）及び対応する情報を記憶するため
の後でポーズする文字のテーブルメモリを表す。上記後
ポーズ文字は、中国語を発話する際に、その字の後にポ
ーズが必要になる文字である。上記対応する情報は、例
えば後ポーズ文字の後ポーズの重みづけられた値を計算
するための、後ポーズの重みづけられた値のパラメータ
（ｂｐ）である。後ポーズ文字のテーブルメモリ１６の
構造の概要図が図１１に図示されている。16 is a "character to be paused later" (hereinafter referred to as "character").
Character table memory for storing later-paused characters) and corresponding information. The post-pause character is a character that requires a pause after the character when speaking Chinese. The corresponding information is, for example, a parameter (bp) of the weight value of the post-pose for calculating the weight value of the post-pose of the post-pose character. A schematic diagram of the structure of the post-pause character table memory 16 is shown in FIG.

【００２０】１７は、意味的単語のフレーズを形成する
複数の文字の数に対応し、かつそれに従う一般ポーズの
重みづけられた値のパラメータ（ｐｐ）の情報を記憶す
るためのポーズパラメータ記憶部を表す。上記意味的単
語のフレーズを形成する文字の数によって、意味的単語
の一般ポーズの重みづけられた値が計算される。上記一
般ポーズの重みづけられた値のパラメータは、ある字の
前にポーズが必要になる状況と、ある字の後にポーズが
必要になる状況とのどちらとも異なる他の状況に関連し
たパラメータである。ポーズパラメータ記憶部１７の構
造の概要図が図１２に図示されている。Reference numeral 17 denotes a pose parameter storage unit for storing information of a parameter (pp) of a weight value of a general pose corresponding to the number of a plurality of characters forming the phrase of the semantic word and according to the number. Represents A weighted value of the general pose of the semantic word is calculated according to the number of characters forming the phrase of the semantic word. The parameter of the weight value of the general pose is a parameter related to a situation that requires a pause before a certain character and a situation that requires a pause after a certain character. . A schematic diagram of the structure of the pause parameter storage unit 17 is shown in FIG.

【００２１】１８は、前ポーズ文字のテーブルメモリ１
５と、後ポーズ文字のテーブルメモリ１６と、ポーズパ
ラメータ記憶部１７から得られたポーズの重みづけられ
た値のパラメータとを参照して、文中の意味的単語の位
置に従って、文中の各意味的単語のポーズの重みづけら
れた値を計算するためのポーズの重みづけられた値計算
部を表す。上記計算は、上記意味的単語の位置と上記パ
ラメータとの掛け算によって実行され、そのため、入力
された文においてより後の方に位置する意味的単語のポ
ーズの重みづけられた値はより大きな値を有する。Reference numeral 18 denotes a table memory 1 for the previous pause character.
5, the post-pose character table memory 16 and the parameter of the weight value of the pose obtained from the pose parameter storage unit 17, and according to the position of the semantic word in the sentence, 5 illustrates a pose weighted value calculator for calculating weighted values of word poses. The calculation is performed by multiplying the position of the semantic word by the parameter, so that the weighted value of the pose of the semantic word located later in the input sentence has a larger value. Have.

【００２２】１９は、意味的単語の前又は後にポーズが
挿入されるべきか否かを決定し、計算された意味的単語
のポーズの重みづけられた値の大きさに基づいてポーズ
継続時間を決定するためのポーズ位置及びポーズ継続時
間決定部を表す。次いで、上記ポーズ位置及びポーズ継
続時間決定部１９から出力された、ポーズ位置及びポー
ズ継続時間を挿入された文のテキストデータは、公知の
音声合成手段によって音声合成される。19 determines whether a pause should be inserted before or after the semantic word, and determines the pause duration based on the magnitude of the calculated weighted value of the semantic word pose. This represents a pause position and pause duration determination unit for determination. Next, the text data of the sentence into which the pause position and the pause duration are inserted, which is output from the pause position and pause duration determination unit 19, is subjected to voice synthesis by a known voice synthesis unit.

【００２３】２０は、複数のポーズ位置と複数のポーズ
継続時間とを処理する際に、装置が用いなければならな
い互いに異なる複数のレジスタ及び複数のメモリバッフ
ァ領域からなるレジスタ及びメモリバッファ部を表す。
文入力部１０に入力された文は、続いて単語検査部１
２、品詞／拡張シンタックス検査部１３、フレーズ拡張
部１４、ポーズの重みづけられた値計算部１８、ポーズ
位置及びポーズ継続時間計算部１９によって次々に処理
され、それぞれのブロックはレジスタ及びメモリバッフ
ァ部２０内の異なる部分を利用してデータを書き込み、
そして読み出す。Reference numeral 20 denotes a register and a memory buffer unit which are used by the apparatus when processing a plurality of pause positions and a plurality of pause durations, the register including a plurality of different registers and a plurality of memory buffer areas.
The sentence input to the sentence input unit 10 is subsequently sent to the word inspection unit 1
2. The part-of-speech / extended syntax checking unit 13, the phrase expanding unit 14, the pause weighted value calculating unit 18, the pause position and the pause duration calculating unit 19 are sequentially processed, and each block is a register and a memory buffer. Write data using different parts in the unit 20,
Then read it.

【００２４】本装置が複数のポーズ位置及び複数のポー
ズ継続時間を処理するとき、互いに異なる複数のレジス
タ及びメモリメモリバッファ領域が用いられなければな
らない。図１においてそれらは省略され、図示されてい
ないが、実際の実施においては必要である。上記レジス
タ及びメモリバッファ領域は構成要素として、入力文の
テキストデータを記憶するための、TextBufferメモリバ
ッファ領域と、文中の単語を指定する数を記憶するため
のWd_iレジスタ（数字１、２、３…などを用い、例えば
１は文中の最初の単語を指定する）と、入力文中の各検
査された単語の値（単語の開始位置、単語の長さ）を記
憶するためのWdアレーレジスタとを含み、上記Wdアレー
レジスタにおいて、例えば次式は、文中の第４の単語が
第５の音節から開始し、２音節の単語の長さを有するこ
とを示している。When the apparatus processes multiple pause positions and multiple pause durations, different registers and memory buffer areas must be used. In FIG. 1, they are omitted and not shown, but are necessary in actual implementation. The register and the memory buffer area are, as constituent elements, a TextBuffer memory buffer area for storing text data of an input sentence, and a Wd_i register (numbers 1, 2, 3,...) For storing a number specifying a word in the sentence. For example, 1 designates the first word in a sentence) and a Wd array register for storing the value of each inspected word (word start position, word length) in the input sentence. In the Wd array register, for example, the following expression indicates that the fourth word in the sentence starts from the fifth syllable and has a word length of two syllables.

【００２５】[0025]

【数１】Wd［４］＝（５，２）## EQU1 ## Wd [4] = (5,2)

【００２６】上記レジスタ及びメモリバッファ領域はさ
らに、入力文中の各検査された単語の品詞を記憶するた
めのWd_typeアレーレジスタを含む。例えば、次式は文
中の第２の単語の品詞が名詞であることを示している。The register and memory buffer area further includes a Wd_type array register for storing the part of speech of each checked word in the input sentence. For example, the following expression indicates that the part of speech of the second word in the sentence is a noun.

【００２７】[0027]

【数２】Wd_type［２］＝Ｎ[Equation 2] Wd_type [2] = N

【００２８】上記レジスタ及びメモリバッファ領域はさ
らに、入力文中の各検査された単語の拡張シンタックス
を記憶するためのWd_expandアレーレジスタを含む。例
えば、次式は文中の第１の単語の拡張シンタックスであ
り、上記第１の単語の後に名詞が接続されることを示し
ている。The register and memory buffer area further includes a Wd_expand array register for storing expanded syntax of each examined word in the input sentence. For example, the following expression is an extended syntax of the first word in the sentence, and indicates that a noun is connected after the first word.

【００２９】[0029]

【数３】Wd_expand［１］＝ＡＮ## EQU3 ## Wd_expand [1] = AN

【００３０】上記レジスタ及びメモリバッファ領域はさ
らに、入力文中の各フレーズ（文節）を形成する音節の
値（フレーズの長さ、フレーズの位置）を記憶するため
のWd_phrアレーレジスタを含む。例えば、次式は文中の
第４の音節が３音節からなるフレーズの第１の音節を形
成することを示している。The register and memory buffer area further include a Wd_phr array register for storing syllable values (phrase length, phrase position) forming each phrase (phrase) in the input sentence. For example, the following expression indicates that the fourth syllable in the sentence forms the first syllable of a three-syllable phrase.

【００３１】[0031]

【数４】Wd_phr［４］＝（３，１）[Equation 4] Wd_phr [4] = (3,1)

【００３２】上記レジスタ及びメモリバッファ領域はさ
らに、文中のフレーズの開始位置を記憶するためのPhr_
startレジスタと、文中のフレーズの終了位置を記憶す
るためのPhr_endレジスタと、音節の区間を単位とし
て、フレーズの長さを記憶するためのPhr_lengthレジス
タと、入力文中の各フレーズの値（フレーズ開始音節、
フレーズの長さ）を記憶するためのPhraseアレーレジス
タとを含み、上記Phraseアレーレジスタにおいて、例え
ば次式は、文中の第４のフレーズが第９の音節から開始
し、２音節のフレーズ長さを有することを示す。[0032] The register and the memory buffer area further include Phr_ for storing a start position of a phrase in a sentence.
A start register, a Phr_end register for storing the end position of the phrase in the sentence, a Phr_length register for storing the length of the phrase in units of syllables, and a value of each phrase in the input sentence (phrase start syllable) ,
And a Phrase array register for storing the length of the phrase. In the Phrase array register, for example, the following expression indicates that the fourth phrase in the sentence starts from the ninth syllable and the phrase length of two syllables is To have

【００３３】[0033]

【数５】Phrase［４］＝（９，２）[Equation 5] Phrase [4] = (9, 2)

【００３４】上記レジスタ及びメモリバッファ領域はさ
らに、文中のフレーズを指定する数を記憶するためのP_
iレジスタ、P_beginレジスタ及びP_endレジスタ（数
１，２，３…などを用い、例えば１は文中の最初のフレ
ーズを指定する）を含む。ここで、P_beginレジスタ及
びP_endレジスタは、ポーズ継続時間を挿入する必要が
あるか否かを判断するための処理セクションの開始ポイ
ンタと終了ポインタにそれぞれ対応する。さらに上記判
断を実行するために、上記P_beginレジスタ及びP_endレ
ジスタの値を用いて計算された値を記憶するS_rangeレ
ジスタを含む。The register and the memory buffer area further include P_ for storing a number designating a phrase in a sentence.
An i register, a P_begin register, and a P_end register (using Equations 1, 2, 3,..., for example, 1 designates the first phrase in a sentence). Here, the P_begin register and the P_end register correspond to a start pointer and an end pointer of a processing section for determining whether or not it is necessary to insert a pause duration. Further, in order to execute the above determination, an S_range register for storing a value calculated using the values of the P_begin register and the P_end register is included.

【００３５】上記レジスタ及びメモリバッファ領域はさ
らに、フレーズのポーズパラメータを記憶するためのｆ
ｐレジスタ、ｂｐレジスタ及びｐｐレジスタとを含む。
ここで、ｆｐレジスタの値は前ポーズパラメータを表
し、ｂｐレジスタの値は後ポーズパラメータを表し、ｐ
ｐレジスタの値は一般ポーズパラメータを表す。The register and the memory buffer area further include f for storing the pause parameter of the phrase.
It includes a p register, a bp register, and a pp register.
Here, the value of the fp register represents the before pause parameter, the value of the bp register represents the after pause parameter, and p
The value of the p register represents a general pause parameter.

【００３６】上記レジスタ及びメモリバッファ領域はさ
らに、ポーズを決定する重みづけられた値を記憶するた
めのW_minorレジスタ及びW_majorレジスタを含む。ポー
ズの重みづけられた値がW_minorレジスタの値よりも大
きいがW_majorレジスタの値よりも小さいとき、８０ｍ
ｓのより短いポーズが決定される。ポーズの重みづけら
れた値がW_major以上であるとき、１２０ｍｓのより長
いポーズが決定される。この実施形態において、「より
短いポーズの重みづけられた値」を示すW_minorレジス
タの値は２０に設定され、「より長いポーズの重みづけ
られた値」を示すW_majorレジスタの値は４０に設定さ
れている。The register and memory buffer area further includes a W_minor register and a W_major register for storing weighted values for determining a pause. 80 m when the weighted value of the pose is greater than the value of the W_minor register but less than the value of the W_major register
A shorter pose of s is determined. When the weighted value of the pose is greater than or equal to W_major, a longer pause of 120 ms is determined. In this embodiment, the value of the W_minor register indicating “weighted value of shorter pause” is set to 20, and the value of the W_major register indicating “weighted value of longer pause” is set to 40. ing.

【００３７】上記レジスタ及びメモリバッファ領域はさ
らに、フレーズのポーズの重みづけられた値を記憶する
ためのＷｆｐレジスタ、Ｗｂｐレジスタ及びＷｐｐレジ
スタを含む。Ｗｆｐレジスタの値は前ポーズの重みづけ
られた値を示す。Ｗｂｐレジスタの値は後ポーズの重み
づけられた値を示す。Ｗｐｐレジスタの値は一般ポーズ
の重みづけられた値を示す。The register and memory buffer areas further include a Wfp register, a Wbp register, and a Wpp register for storing weighted values of the pause of the phrase. The value of the Wfp register indicates the weight value of the previous pause. The value of the Wbp register indicates the weight value of the post-pause. The value of the Wpp register indicates the weight value of the general pose.

【００３８】上記レジスタ及びメモリバッファ領域はさ
らに、ポーズフレーズを指定する数を記憶するためのP_
pauseレジスタを含む。例えば、P_pauseレジスタの値が
２に等しいとき、第２のフレーズの後でポーズが必要で
あることを示している。The register and the memory buffer area further include P_ for storing a number specifying a pause phrase.
Includes pause register. For example, when the value of the P_pause register is equal to 2, it indicates that a pause is required after the second phrase.

【００３９】上記レジスタ及びメモリバッファ領域はさ
らに、ポーズ位置の音節を指定する数を記憶するための
S_pauseレジスタを含む。例えば、S_pauseレジスタの値
が７に等しいとき、第７の音節の後でポーズが必要であ
ることを示している。The register and the memory buffer area further store a number specifying a syllable at a pause position.
Includes S_pause register. For example, when the value of the S_pause register is equal to 7, it indicates that a pause is required after the seventh syllable.

【００４０】上記レジスタ及びメモリバッファ領域はさ
らに、ミリ秒の区間を単位としてポーズ継続時間を記憶
するためのD_pauseレジスタを含む。この実施形態にお
いて、より長いポーズは１２０ｍｓで、より短いポーズ
は８０ｍｓである。The register and the memory buffer area further include a D_pause register for storing a pause duration in units of milliseconds. In this embodiment, the longer pose is 120 ms and the shorter pose is 80 ms.

【００４１】上記レジスタ及びメモリバッファ領域はさ
らに、入力文のポーズ位置及びポーズ継続時間（S_paus
e，D_pause）を記憶するためのList_pauseアレーレジス
タを含む。例えば、次式は第１のポーズ位置が第７の音
節の後で、１２０ｍｓの継続時間を有することを示して
いる。The register and memory buffer area further include a pause position and a pause duration (S_paus
e, D_pause) for storing a List_pause array register. For example, the following expression indicates that the first pause position has a duration of 120 ms after the seventh syllable.

【００４２】[0042]

【数６】List_pause［１］＝（７，１２０ｍｓ）## EQU6 ## List_pause [1] = (7,120 ms)

【００４３】図２から図６は、図１の実施形態に係る中
国語のテキストを音声に変換するシステムのためのポー
ズ継続時間の処理装置の動作のフローチャートを図示
し、上記システムは基本的処理単位として意味的単語を
用いている。図２から図６において以下のステップが説
明されている。FIGS. 2 to 6 show flowcharts of the operation of the pause duration processing apparatus for the system for converting Chinese text to speech according to the embodiment of FIG. A semantic word is used as a unit. The following steps are described in FIGS.

【００４４】図２のステップＳ１において、文のテキス
トは、TextBufferメモリバッファ領域に入力される。In step S1 of FIG. 2, the text of the sentence is input to the TextBuffer memory buffer area.

【００４５】次いでステップＳ２において、現在入力さ
れたテキストのキーがテキストにおいて終了キーである
か否かが検査される。終了キーであるときは、フローは
ステップＳ３に進む。そうでなければ、フローはステッ
プＳ１に戻る。Then, in step S2, it is checked whether the key of the currently input text is an end key in the text. If it is the end key, the flow proceeds to step S3. Otherwise, the flow returns to step S1.

【００４６】ステップＳ３において、文におけるテキス
トを検査して、辞書メモリ１１中の複数の単語と比較す
ることにより文中の各単語を見出だす。文中の単語の位
置及び単語の長さがWdアレーレジスタに記憶される。In step S3, the text in the sentence is examined, and each word in the sentence is found by comparing the text with a plurality of words in the dictionary memory 11. The position of the word in the sentence and the length of the word are stored in the Wd array register.

【００４７】次いで、ステップＳ４において、Wdアレー
レジスタ中の各検査された単語に従って、各単語に対応
する品詞及び拡張シンタックスが辞書メモリ１１から検
索され、それらはWd_typeアレーレジスタとWd_expandア
レーレジスタにそれぞれ記憶される。Next, in step S4, the part of speech and the extended syntax corresponding to each word are searched from the dictionary memory 11 according to each inspected word in the Wd array register, and these are stored in the Wd_type array register and the Wd_expand array register, respectively. It is memorized.

【００４８】次いでステップＳ５において、Wdアレーレ
ジスタ中の各検査された単語に従って、単語に対応する
各音節の構成データは、Wd_phrアレーレジスタ内に記憶
される。Next, in step S5, according to each examined word in the Wd array register, the constituent data of each syllable corresponding to the word is stored in the Wd_phr array register.

【００４９】次いで図３のステップＳ６において、Wd_i
レジスタ内における値は１に設定され、第１の単語から
フレーズ拡張処理が開始される。Next, in step S6 of FIG. 3, Wd_i
The value in the register is set to 1, and the phrase expansion process starts from the first word.

【００５０】次いでステップＳ７において、（Wd_i）番
目の単語が拡張シンタックスを有するか否かが決定され
る。（値がφであるとき、これは単語が拡張シンタック
スを有さないことを示している）。（Wd_i）番目の単語
が拡張シンタックスを有するときは同時に上記単語がど
の拡張シンタックスを有するかを判断し、フローはステ
ップＳ８に進む。そうでないときはフローはステップＳ
１１に進む。Next, in step S7, it is determined whether or not the (Wd_i) -th word has the extended syntax. (When the value is φ, this indicates that the word has no extended syntax). When the (Wd_i) -th word has the extended syntax, it is determined at the same time which extended syntax the word has, and the flow proceeds to step S8. Otherwise, the flow is step S
Proceed to 11.

【００５１】ステップＳ８において、ステップＳ７で決
定された拡張シンタックスに従って、（Wd_i）番目の単
語の前又は後の隣接する単語の品詞が上記拡張シンタッ
クスに対応するか否かが決定される。上記対応があると
きは、フローはステップＳ９に進む。そうでないとき
は、フローはステップＳ１１に進む。In step S8, according to the extended syntax determined in step S7, it is determined whether or not the part of speech of an adjacent word before or after the (Wd_i) -th word corresponds to the extended syntax. If there is such a response, the flow proceeds to step S9. Otherwise, the flow goes to step S11.

【００５２】ステップＳ９において、フレーズ拡張処理
が開始される。拡張が前に進むとき、Wd_i−１レジスタ
の値によって指定される単語が、拡張されるべき単語と
して選択される。拡張が後に進むとき、Wd_i＋１レジス
タの値によって指定される単語が、拡張されるべき単語
として選択される。複数の単語がフレーズに拡張された
とき、このフレーズは次いで拡張処理を実行するための
新しい拡張されるべき単語とみなされる。隣接する拡張
している単語と、拡張されるべき単語とは結合され、拡
張されたフレーズを形成する。拡張フレーズのための開
始位置Phr_start及び終了位置Phr_endが見出だされ、拡
張されたフレーズの長さを次式より計算する。In step S9, the phrase expansion process is started. As the expansion proceeds, the word specified by the value of the Wd_i-1 register is selected as the word to be expanded. As the expansion proceeds, the word specified by the value of the Wd_i + 1 register is selected as the word to be expanded. When multiple words are expanded into a phrase, the phrase is then considered a new word to be expanded to perform the expansion process. The adjacent expanding word and the word to be expanded are combined to form an expanded phrase. The start position Phr_start and the end position Phr_end for the expanded phrase are found, and the length of the expanded phrase is calculated by the following equation.

【００５３】[0053]

【数７】Phr_length＝Phr_end−Phr_start＋１[Equation 7] Phr_length = Phr_end−Phr_start + 1

【００５４】続いて、開始位置Phr_startと、終了位置P
hr_endと、拡張されたフレーズの長さPhr_lengthとは、
Phr_startレジスタ、Phr_endレジスタ及びPhr_lengthレ
ジスタにそれぞれ記憶される。Subsequently, the start position Phr_start and the end position P
hr_end and extended phrase length Phr_length are
These are stored in the Phr_start register, Phr_end register, and Phr_length register, respectively.

【００５５】次いでステップＳ１０において、Wd_phrア
レーレジスタにおける対応する複数の音節の値は、拡張
されたフレーズに従って更新される。個別には、更新は
次のように続く。Next, in step S10, the values of the corresponding syllables in the Wd_phr array register are updated according to the expanded phrase. Individually, the update proceeds as follows.

【００５６】[0056]

【数８】Wd_phr［Phr_start］＝（Phr_length，１）[Equation 8] Wd_phr [Phr_start] = (Phr_length, 1)

【数９】 Wd_phr［Phr_start＋１］＝（Phr_length，２）## EQU9 ## Wd_phr [Phr_start + 1] = (Phr_length, 2)

【００５７】以下同様に続き、最後は次式のようにな
る。The following is continued in the same manner, and finally the following equation is obtained.

【００５８】[0058]

【数１０】 Wd_phr［Phr_end］＝（Phr_length，Phr_length）[Expression 10] Wd_phr [Phr_end] = (Phr_length, Phr_length)

【００５９】次いでステップＳ１１において、Wd_iレジ
スタの値が最後の単語を指定する値に到達したか否かを
決定する。最後の単語に到達したときは、フローはステ
ップＳ１３に進む。そうでないときは、フローはステッ
プＳ１２に進む。Next, in step S11, it is determined whether or not the value of the Wd_i register has reached a value designating the last word. When the last word has been reached, the flow proceeds to step S13. Otherwise, the flow goes to step S12.

【００６０】ステップＳ１２において、Wd_iレジスタ内
の値が１だけインクリメントされ、続いてフローはステ
ップＳ７に戻り、フレーズ拡張処理を続行する。In step S12, the value in the Wd_i register is incremented by one, and then the flow returns to step S7 to continue the phrase expansion process.

【００６１】ステップＳ１３において、Wd_phrアレーレ
ジスタにおけるフレーズを形成する複数の音節を指定す
る値は、フレーズが基本単位であるPhraseアレーレジス
タにおける値に変換される。この点において、文は意味
的単位の複数のフレーズに分解された。続くステップに
おいて、これらの値はポーズの重みづけられた値の計算
において単位とされ、それらの前又は後にポーズが必要
であるか否かと、適当なポーズ継続時間とが決定され
る。In step S13, the values specifying a plurality of syllables forming a phrase in the Wd_phr array register are converted into values in the Phrase array register in which the phrase is a basic unit. At this point, the sentence was broken down into multiple phrases in semantic units. In a following step, these values are taken into account in the calculation of the weighted values of the poses, and whether a pause is required before or after them and an appropriate pause duration are determined.

【００６２】次いで図４のステップＳ１４において、フ
レーズポインタであるP_begin及びP_endレジスタの値は
１に初期化される。Next, in step S14 of FIG. 4, the values of the P_begin and P_end registers, which are phrase pointers, are initialized to 1.

【００６３】次いでステップＳ１５において、P_begin
レジスタからP_endレジスタが指定するフレーズまでの
音節数が数えられ、S_rangeレジスタに記憶される。計
算ステップは以下のように実行される。Next, in step S15, P_begin
The number of syllables from the register to the phrase specified by the P_end register is counted and stored in the S_range register. The calculation step is performed as follows.

【００６４】[0064]

【数１１】S_range＝Phrase［P_end］．syllable＋Phra
se［P_end］．length−Phrase［P_begin］．syllableS_range = Phrase [P_end]. syllable ＋ Phra
se [P_end]. length-Phrase [P_begin]. syllable

【００６５】次いでステップＳ１６において、S_range
レジスタの値はまだ１３音節よりも小さいか否かが決定
される。人間が発話するときのポーズ（息継ぎ）の動作
では、１周期がほぼ６から８個の中国語の文字（すなわ
ち中国語の音節）に対応し、ポーズはフレーズ又は単語
を構成する意味的単位の前又は後に発生する。一般の人
は、約１３個の音節からなる区間内に少なくとも１回の
ポーズが必要となる。従って本実施形態では「１３」の
しきい値を採用し、１３音節よりも小さいときは、フロ
ーはステップＳ１７に進む。そうでないときは、フロー
はステップＳ１９に進み、P_beginレジスタからP_endレ
ジスタが指定するフレーズまでのポーズ位置とポーズ継
続時間とを決定する。Next, in step S16, S_range
It is determined whether the value of the register is still less than 13 syllables. In a pause (breathing) operation when a human speaks, one cycle corresponds to approximately 6 to 8 Chinese characters (ie, Chinese syllables), and a pause is a semantic unit of a phrase or word. Occurs before or after. The average person needs at least one pause within a section consisting of about 13 syllables. Therefore, in the present embodiment, the threshold value of “13” is adopted, and when the threshold value is smaller than 13 syllables, the flow proceeds to step S17. Otherwise, the flow proceeds to step S19, and determines a pause position and a pause duration from the P_begin register to the phrase specified by the P_end register.

【００６６】ステップＳ１７において、P_endレジスタ
が指定するフレーズは文の最後のフレーズであるか否か
が決定される。P_endレジスタが指定するフレーズが最
後のフレーズであるときは、これは文中のすべてのフレ
ーズが検査されたことを示す。続いてフローはステップ
Ｓ２２に進み、すべてのポーズ位置及びポーズ継続時間
を出力する。そうでないときはステップＳ１８に進む。In step S17, it is determined whether the phrase specified by the P_end register is the last phrase of the sentence. When the phrase specified by the P_end register is the last phrase, this indicates that all the phrases in the sentence have been checked. Subsequently, the flow proceeds to step S22, and outputs all pause positions and pause durations. Otherwise, the process proceeds to step S18.

【００６７】ステップＳ１８において、P_endレジスタ
内の値が１だけインクリメントされ、フローはステップ
Ｓ１５に戻る。In step S18, the value in the P_end register is incremented by 1, and the flow returns to step S15.

【００６８】ステップＳ１９において、P_beginレジス
タからP_endレジスタが指定するフレーズまでの、すべ
てのフレーズのポーズの重みづけられた値が計算され、
P_pauseレジスタの値が指定するあるフレーズに基づい
て設定されたポーズ位置S_pause及びポーズ継続時間D_p
auseを決定し、上記のS_pause及びD_pauseレジスタの値
はポーズアレーList_pauseに記憶される。ここでは経験
値に基づいて設定された２つのポーズレベル、すなわち
W_major及びW_minorレジスタの値に従って、ポーズ継続
時間が決定される。このステップＳ１９はさらにサブス
テップＳ１９−１からＳ１９−１６までをサブルーチン
として含む。図５のサブステップＳ１９−１において、
ポインタP_iはP_beginレジスタの値（開始ポインタ）を
指定する。次いでサブステップＳ１９−２において、
「前ポーズ文字のテーブルメモリ」１５（図１０）が検
索され、P_iレジスタの値で表されたフレーズが前ポー
ズ文字であるか否かが調べられる。前ポーズ文字である
ときは、フローはサブステップＳ１９−３に進む。そう
でないときは、フローはサブステップＳ１９−５に進
む。サブステップＳ１９−３において、P_iレジスタに
よって指定されたフレーズの文字が、前でポーズする文
字であるか否かを決定するために、前ポーズ文字のテー
ブルメモリ１５から上記文字の前ポーズパラメータｆｐ
が見出だされ、上記文字に対する前ポーズの重みづけら
れた値Ｗｆｐが次式のように計算される。In step S19, the weighted values of the poses of all the phrases from the P_begin register to the phrase specified by the P_end register are calculated,
Pause position S_pause and pause duration D_p set based on a phrase specified by the value of the P_pause register
ause is determined, and the values of the S_pause and D_pause registers are stored in a pause array List_pause. Here, two pose levels, which are set based on experience,
The pause duration is determined according to the values of the W_major and W_minor registers. This step S19 further includes sub steps S19-1 to S19-16 as a subroutine. In sub-step S19-1 of FIG.
The pointer P_i specifies the value (start pointer) of the P_begin register. Next, in sub-step S19-2,
The "previous pause character table memory" 15 (FIG. 10) is searched to determine whether the phrase represented by the value of the P_i register is the previous pause character. If the character is the previous pause character, the flow proceeds to substep S19-3. Otherwise, the flow goes to sub-step S19-5. In sub-step S19-3, in order to determine whether or not the character of the phrase specified by the P_i register is a character to be paused before, the previous pause parameter fp of the character is stored in the table memory 15 of the previous pause character.
Is found, and the weight value Wfp of the previous pose for the character is calculated as follows:

【００６９】[0069]

【数１２】Ｗｆｐ＝（Phrase［P_i］．syllable−Phras
e［P_begin］．syllable＋Phrase［P_i］．length）×
ｆｐWfp = (Phrase [P_i] .syllable-Phras
e [P_begin]. syllable + Phrase [P_i]. length) ×
fp

【００７０】次いでサブステップＳ１９−４において、
Ｗｆｐは「より短いポーズの重みづけられた値」を示す
W_minorレジスタの値以上であるか否かが決定される
（この実施形態におけるW_minorレジスタの値は２０に
設定されている）。ＷｆｐがW_minorレジスタの値以上
であるときは、フレーズの前にポーズが必要であると決
定され、次いでフローは図６のサブステップＳ１９−１
６に進む。そうでないときは、フローはサブステップＳ
１９−５に進む。サブステップＳ１９−５において、
「後ポーズ文字のテーブルメモリ１６」（図１１）が検
索され、P_iレジスタの値で表されたフレーズが後ポー
ズ文字であるか否かが調べられる。後ポーズ文字である
ときは、フローはサブステップＳ１９−６に進む。そう
でないときは、フローはサブステップＳ１９−８に進
む。サブステップＳ１９−６において、P_iレジスタに
よって指定されたフレーズの文字の後ポーズパラメータ
ｂｐを調べるために「後ポーズ文字のテーブルメモリ」
１６が検索され、上記文字に対する後ポーズの重みづけ
られた値Ｗｂｐが次式のように計算される。Next, in substep S19-4,
Wfp indicates “weighted value of shorter pause”
It is determined whether the value is equal to or more than the value of the W_minor register (the value of the W_minor register in this embodiment is set to 20). If Wfp is greater than or equal to the value of the W_minor register, it is determined that a pause is required before the phrase, and then the flow proceeds to sub-step S19-1 in FIG.
Proceed to 6. Otherwise, the flow proceeds to substep S
Proceed to 19-5. In sub-step S19-5,
The "post-pause character table memory 16" (FIG. 11) is searched to determine whether the phrase represented by the value of the P_i register is a post-pause character. If it is a post-pause character, the flow proceeds to sub-step S19-6. Otherwise, the flow goes to sub-step S19-8. In sub-step S19-6, the "post-pause character table memory" is used to check the post-pause parameter bp of the character of the phrase specified by the P_i register.
16 is retrieved, and a weight value Wbp of the post-pose for the character is calculated as follows.

【００７１】[0071]

【数１３】Ｗｂｐ＝（Phrase［P_i］．syllable−Phras
e［P_begin］．syllable＋Phrase［P_i］．length）×
ｂｐWbp = (Phrase [P_i] .syllable-Phras
e [P_begin]. syllable + Phrase [P_i]. length) ×
bp

【００７２】次いでサブステップＳ１９−７において、
Ｗｂｐは「より短いポーズの重みづけられた値」を示す
W_minorレジスタの値以上であるか否かが決定される。
ＷｂｐがW_minorレジスタの値以上であるときは、フレ
ーズの後にポーズが必要であると決定され、次いでフロ
ーはサブステップＳ１９−１６に進む。そうでないとき
は、フローはサブステップＳ１９−８に進む。サブステ
ップＳ１９−８において、P_iレジスタの値がP_endレジ
スタを指定する値に到達したか否かが決定される。P_i
レジスタの値がP_endレジスタを指定する値に到達した
ときは、フローは図６のサブステップＳ１９−１０に進
む。そうでないときは、フローはサブステップＳ１９−
９に進む。サブステップＳ１９−９において、P_iレジ
スタの値は１だけインクリメントされる。次いでフロー
はサブステップＳ１９−２に戻る。サブステップＳ１９
−１０において、P_iレジスタの値はもう一度P_beginレ
ジスタが指定するフレーズを指定し、一般ポーズの決定
が開始される。次いでサブステップＳ１９−１１におい
て、P_iレジスタの値によって表されるフレーズ内の文
字数に従って、図１２より一般ポーズの重みづけられた
値のパラメータｐｐが見出だされ、一般ポーズの重みづ
けられた値Ｗｐｐが次式のように計算される。Next, in substep S19-7,
Wbp indicates “shorter pose weighted value”
It is determined whether the value is equal to or more than the value of the W_minor register.
If Wbp is greater than or equal to the value of the W_minor register, it is determined that a pause is required after the phrase, and then the flow proceeds to substep S19-16. Otherwise, the flow goes to sub-step S19-8. In sub-step S19-8, it is determined whether the value of the P_i register has reached a value designating the P_end register. P_i
When the value of the register reaches the value designating the P_end register, the flow proceeds to substep S19-10 in FIG. Otherwise, the flow proceeds to sub-step S19-
Go to 9. In sub-step S19-9, the value of the P_i register is incremented by one. Next, the flow returns to sub-step S19-2. Sub-step S19
At -10, the value of the P_i register again specifies the phrase specified by the P_begin register, and the determination of the general pose is started. Next, in sub-step S19-11, the parameter pp of the general pose weighted value is found from FIG. 12 according to the number of characters in the phrase represented by the value of the P_i register, and the general pose weighted value is found. Wpp is calculated as follows:

【００７３】[0073]

【数１４】Ｗｐｐ＝（Phrase［P_i］．syllable−Phras
e［P_begin］．syllable＋Phrase［P_i］．length）×
ｐｐWpp = (Phrase [P_i] .syllable-Phras
e [P_begin]. syllable + Phrase [P_i]. length) ×
pp

【００７４】次いでサブステップＳ１９−１２におい
て、Ｗｐｐは「より短いポーズの重みづけられた値」を
示すW_minorレジスタの値以上であるか否かが決定され
る。ＷｐｐがW_minorレジスタの値以上であるときは、
フレーズの後にポーズが必要であると決定され、次いで
フローはサブステップＳ１９−１６に進む。そうでない
ときは、フローはサブステップＳ１９−１３に進む。サ
ブステップＳ１９−１３において、P_iレジスタの値はP
_endレジスタの指定する値に到達したか否かが決定され
る。P_iレジスタの値がP_endレジスタを指定する値に到
達したときは、フローはサブステップＳ１９−１５に進
む。そうでないときは、フローはサブステップＳ１９−
１４に進む。サブステップＳ１９−１４において、P_i
レジスタの値は１だけインクリメントされ、フローはサ
ブステップＳ１９−１１に進む。サブステップＳ１９−
１５において、この時点で、ポーズを設定されていない
P_beginレジスタが指定するフレーズからP_endレジスタ
が指定するフレーズまでの少なくとも１３文字からなる
音節が存在する。ゆえに、強制されたポーズが設定さ
れ、Ｗｐｐの値は「より長いポーズの重みづけられた
値」を示すW_majorレジスタの示す値に等しくされる
（この実施形態におけるW_majorレジスタの値は４０に
設定されている）。次いでフローはサブステップＳ１９
−１６に進む。サブステップＳ１９−１６において、ポ
ーズフレーズP_pauseと、ポーズ位置S_pauseと、ポーズ
継続時間D_pauseとがこのステップで決定される。上記
のS_pauseとD_pauseレジスタの値はポーズアレーList_p
auseに付加される。Next, in sub-step S19-12, it is determined whether or not Wpp is equal to or greater than a value of a W_minor register indicating a "weight value of a shorter pause". When Wpp is equal to or greater than the value of the W_minor register,
It is determined that a pause is required after the phrase, and then the flow proceeds to sub-step S19-16. Otherwise, the flow goes to sub-step S19-13. In sub-step S19-13, the value of the P_i register is P
It is determined whether or not the value specified by the _end register has been reached. When the value of the P_i register has reached the value designating the P_end register, the flow proceeds to substep S19-15. Otherwise, the flow proceeds to sub-step S19-
Proceed to 14. In sub-step S19-14, P_i
The value of the register is incremented by 1, and the flow proceeds to sub-step S19-11. Sub-step S19-
At 15, no pause has been set at this time
There is a syllable consisting of at least 13 characters from the phrase specified by the P_begin register to the phrase specified by the P_end register. Therefore, a forced pose is set and the value of Wpp is made equal to the value of the W_major register, which indicates "weighted value of the longer pause" (the value of the W_major register in this embodiment is set to 40). ing). Next, the flow proceeds to substep S19.
Proceed to -16. In sub-step S19-16, a pause phrase P_pause, a pause position S_pause, and a pause duration D_pause are determined in this step. The values of the S_pause and D_pause registers above are the pause array List_p
Added to ause.

【００７５】（１）ポーズフレーズP_pauseは次のよう
に決定される。前ポーズであるときは、(1) The pause phrase P_pause is determined as follows. When in the front pose,

【数１５】P_pause＝P_i−１そうでないときは、P_pause = P_i-1 Otherwise,

【数１６】P_pause＝P_i[Equation 16] P_pause = P_i

【００７６】（２）ポーズ位置S_pauseは次のように決
定される。(2) The pause position S_pause is determined as follows.

【００７７】[0077]

【数１７】S_pause＝Phrase［P_pause］．syllable＋Ph
rase［P_pause］．length−１S_pause = Phrase [P_pause]. syllable + Ph
rase [P_pause]. length-1

【００７８】（３）ポーズ継続時間D_pauseは次のよう
に決定される。Ｗｆｐ，Ｗｂｐ又はＷｐｐがW_major以
上であるときは(3) The pause duration D_pause is determined as follows. When Wfp, Wbp or Wpp is greater than W_major

【数１８】D_pause＝１２０ｍｓそうでないときは、D_pause = 120 ms Otherwise,

【数１９】D_pause＝８０ｍｓ[Equation 19] D_pause = 80 ms

【００７９】次いで図４のステップＳ２０において、P_
pauseレジスタの値が指定するフレーズは文の最後のフ
レーズに到達したか否かが決定される。最後のフレーズ
に到達したときは、フローはステップＳ２２に進む。そ
うでないときは、フローはステップＳ２１に進む。Next, in step S20 of FIG.
It is determined whether the phrase specified by the value of the pause register has reached the last phrase of the sentence. When the last phrase has been reached, the flow proceeds to step S22. Otherwise, the flow goes to step S21.

【００８０】ステップＳ２１において、P_beginレジス
タの値（開始ポインタ）及びP_endレジスタの値（終了
ポインタ）はポーズフレーズP_pauseにすぐ続くフレー
ズを指定し、フローはステップＳ１５に進む。In step S21, the value of the P_begin register (start pointer) and the value of the P_end register (end pointer) specify a phrase immediately following the pause phrase P_pause, and the flow proceeds to step S15.

【００８１】ステップＳ２２において、音声合成システ
ムで用いるために、全文のポーズ位置S_pause及びポー
ズ継続時間D_pauseのすべての値がポーズアレーList_pa
useから出力され、本発明の装置の動作は終了する。次
いで、上記ポーズ位置及びポーズ継続時間決定部１９か
ら出力された、ポーズ位置及びポーズ継続時間を挿入さ
れた文のテキストデータは、公知の音声合成手段によっ
て音声合成される。In step S22, all values of the pause position S_pause and the pause duration D_pause of the whole sentence are set in the pause array List_pa for use in the speech synthesis system.
Output from use, the operation of the device of the present invention ends. Next, the text data of the sentence into which the pause position and the pause duration are inserted, which is output from the pause position and pause duration determination unit 19, is subjected to voice synthesis by a known voice synthesis unit.

【００８２】[0082]

【実施例】主に図２から図６のフローチャートを参照し
て、上で構成された好ましい実施形態の、中国語のテキ
ストを音声に変換するシステムのためのポーズ継続時間
の処理装置の動作を説明するために、以下の例におい
て、文「堅強的求生意志是能DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Referring mainly to the flowcharts of FIGS. 2 to 6, the operation of the pause duration processing apparatus for a system for converting Chinese text to speech of the preferred embodiment constructed above will be described. To illustrate, in the example below, the sentence “Robust

【外３】戰勝病魔最有效的方法」を入力する。[Outside 3] Enter the most effective method of war sickness.

【００８３】実施例のプロセスフローは以下のように進
む。図２のステップＳ１において、キーボードのような
文入力部１０を用いて文を入力する。ステップＳ２にお
いて、テキスト中の終了キーを検出して入力は終了す
る。このとき文のテキストデータ「堅強的求生意志是能The process flow of the embodiment proceeds as follows. In step S1 of FIG. 2, a sentence is input using a sentence input unit 10 such as a keyboard. In step S2, the end is detected by detecting the end key in the text. At this time, the text data of the sentence "

【外４】戰勝病魔最有效的方法」がTextBuffer［］メモリバッ
ファ領域に記憶される。[Outside 4] The most effective method of the war is stored in the TextBuffer [] memory buffer area.

【００８４】その後ステップＳ３において、辞書メモリ
１１内の複数の単語と比較することにより、単語検査部
１２は、文中の各単語「堅強」「的」「求生意志」
「是」「能Then, in step S 3, by comparing the words with a plurality of words in the dictionary memory 11, the word checking unit 12 determines whether each word in the sentence is “strong”, “target”, or “willing to seek”
`` Yes '' `` Noh

【外５】」「戰勝」「病魔」「最」「有效」「的」及び「方法」
を検査し、各単語の文中における開始位置と単語の文字
数とを、アレーレジスタにWd［］内の数の対（単語の
開始位置，単語の長さ）の列として記憶する。すなわち
以下に図示される通りである。[Outside 5] "" Sengoku """Sick""" Most """Effective""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" V e"""""
Is checked, and the start position of each word in the sentence and the number of characters of the word are stored in the array register as a sequence of pairs of numbers (word start position, word length) in Wd []. That is, it is as illustrated below.

【００８５】[0085]

【表１】 ―――――――――――――――――――――――――――――――――――― Wd［１］＝（１，２）， … 「堅強」 Wd［２］＝（３，１）， … 「的」 Wd［３］＝（４，４）， … 「求生意志」 Wd［４］＝（８，１）， … 「是」 Wd［５］＝（９，２）， … 「能[Table 1] ―――――――――――――――――――――――――――――――――― Wd [1] = (1,2) ,… “Strength” Wd [2] = (3,1),… “Target” Wd [3] = (4,4),… “Want to volunteer” Wd [4] = (8,1),… Wd [5] = (9,2),… “Noh

【外６】」 Wd［６］＝（１１，２）， … 「戰勝」 Wd［７］＝（１３，２）， … 「病魔」 Wd［８］＝（１５，１）， … 「最」 Wd［９］＝（１６，２）， … 「有效」 Wd［１０］＝（１８，１）， … 「的」 Wd［１１］＝（１９，２）， … 「方法」 ――――――――――――――――――――――――――――――――――――[Outside 6] Wd [6] = (11, 2),… “War Victory” Wd [7] = (13, 2),… “Sick” Wd [8] = (15, 1),… “Most” Wd [9] = (16, 2), ... "Effective" Wd [10] = (18, 1), ... "target" Wd [11] = (19, 2), ... "Method" ----------- ―――――――――――――――――――――――――――

【００８６】次いでステップＳ５において、Wd［］に
記憶された各単語に従って、品詞／拡張シンタックス検
査部１３は、（その内容が図９に図示されているよう
な）辞書メモリ１１より各単語について品詞及び拡張シ
ンタックスを見出だし、それをWd_typeアレーレジスタ
及びWd_expandアレーレジスタにそれぞれ記憶する。す
なわち以下に図示される通りである。Next, in step S5, in accordance with each word stored in Wd [], the part-of-speech / extended syntax checking unit 13 reads the word from the dictionary memory 11 (the contents of which are shown in FIG. 9). The part-of-speech and the extended syntax are found and stored in the Wd_type array register and the Wd_expand array register, respectively. That is, it is as illustrated below.

【００８７】[0087]

【表２】 ―――――――――――――――――――――――――――――――――――― Wd_type［１］＝Ｎ， Wd_expand［１］＝φ …「堅強」 Wd_type［２］＝Ｊ， Wd_expand［２］＝ＢＮ …「的」 Wd_type［３］＝Ｎ， Wd_expand［３］＝φ …「求生意志」 Wd_type［４］＝Ｖ， Wd_expand［４］＝φ …「是」 Wd_type［５］＝Ａ， Wd_expand［５］＝φ …「能[Table 2] ――――――――――――――――――----- ] = Φ… “robust” Wd_type [2] = J, Wd_expand [2] = BN… “target” Wd_type [3] = N, Wd_expand [3] = φ… “Want to volunteer” Wd_type [4] = V, Wd_expand [4] = φ… “Yes” Wd_type [5] = A, Wd_expand [5] = φ…

【外７】」 Wd_type［６］＝Ｖ， Wd_expand［６］＝ＡＮ …「戰勝」 Wd_type［７］＝Ｎ， Wd_expand［７］＝φ …「病魔」 Wd_type［８］＝Ａ， Wd_expand［８］＝φ …「最」 Wd_type［９］＝Ｎ，Ｊ，Wd_expand［９］＝φ …「有效」 Wd_type［１０］＝Ｊ， Wd_expand［１０］＝ＢＮ …「的」 Wd_type［１１］＝Ｎ， Wd_expand［１１］＝φ …「方法」 ――――――――――――――――――――――――――――――――――――[Outside 7] Wd_type [6] = V, Wd_expand [6] = AN… “War Victory” Wd_type [7] = N, Wd_expand [7] = φ… “Sick” Wd_type [8] = A, Wd_expand [8] = φ ... “ Wd_type [9] = N, J, Wd_expand [9] = φ ... "effective" Wd_type [10] = J, Wd_expand [10] = BN ... "target" Wd_type [11] = N, Wd_expand [11] = φ… “Method” ――――――――――――――――――――――――――――――――――――

【００８８】次にフレーズ拡張部１４がフレーズ拡張処
理を開始するために用いられる。最初にステップＳ５に
おいて、Wdアレーレジスタ内の各検査された単語に従っ
て、共に単語を形成する各音節の構成情報がWd_phrアレ
ーレジスタにNext, the phrase expansion unit 14 is used to start the phrase expansion processing. First, in step S5, according to each examined word in the Wd array register, the configuration information of each syllable that together forms a word is stored in the Wd_phr array register.

【数２０】Wd_phr［音節の位置］＝（フレーズの長さ，
フレーズ内の位置）の形式で記憶される。すなわち以下に図示される通りで
ある。[Mathematical formula-see original document] Wd_phr [syllable position] = (length of phrase,
(Position in the phrase). That is, it is as illustrated below.

【００８９】[0089]

【表３】 ―――――――――――――――――――――――――――――――――――― Wd［１］＝（１，２） →Wd_phr［１］＝（２，１） …「堅強」 →Wd_phr［２］＝（２，２） Wd［２］＝（３，１） →Wd_phr［３］＝（１，１） …「的」 Wd［３］＝（４，４） →Wd_phr［４］＝（４，１） …「求生意志」 →Wd_phr［５］＝（４，２） →Wd_phr［６］＝（４，３） →Wd_phr［７］＝（４，４） Wd［４］＝（８，１） →Wd_phr［８］＝（１，１） …「是」 Wd［５］＝（９，２） →Wd_phr［９］＝（２，１） …「能[Table 3] ―――――――――――――――――――――――――――――――――― Wd [1] = (1,2) → Wd_phr [1] = (2,1)… “strong” → Wd_phr [2] = (2,2) Wd [2] = (3,1) → Wd_phr [3] = (1,1)… Wd [3] = (4, 4) → Wd_phr [4] = (4, 1)… “willing to seek” → Wd_phr [5] = (4, 2) → Wd_phr [6] = (4, 3) → Wd_phr [7] = (4,4) Wd [4] = (8,1) → Wd_phr [8] = (1,1)… “Zero” Wd [5] = (9,2) → Wd_phr [9] = (2,1)… "Noh

【外８】」 →Wd_phr［１０］＝（２，２） Wd［６］＝（１１，２） →Wd_phr［１１］＝（２，１） …「戰勝」 →Wd_phr［１２］＝（２，２） Wd［７］＝（１３，２） →Wd_phr［１３］＝（２，１） …「病魔」 →Wd_phr［１４］＝（２，２） Wd［８］＝（１５，１） →Wd_phr［１５］＝（１，１） …「最」 Wd［９］＝（１６，２） →Wd_phr［１６］＝（２，１） …「有效」 →Wd_phr［１７］＝（２，２） Wd［１０］＝（１８，１）→Wd_phr［１８］＝（１，１） …「的」 Wd［１１］＝（１９，２）→Wd_phr［１９］＝（２，１） …「方法」 →Wd_phr［２０］＝（２，２） ――――――――――――――――――――――――――――――――――――[Outside 8] → Wd_phr [10] = (2,2) Wd [6] = (11,2) → Wd_phr [11] = (2,1)… "War" → Wd_phr [12] = (2,2) Wd [ 7] = (13, 2) → Wd_phr [13] = (2, 1)… “Sick” → Wd_phr [14] = (2, 2) Wd [8] = (15, 1) → Wd_phr [15] = (1,1)… “most” Wd [9] = (16,2) → Wd_phr [16] = (2,1)… “valid” → Wd_phr [17] = (2,2) Wd [10] = (18, 1) → Wd_phr [18] = (1, 1)… “target” Wd [11] = (19, 2) → Wd_phr [19] = (2, 1)… “Method” → Wd_phr [20] = (2,2) ――――――――――――――――――――――――――――――――――

【００９０】堅強／的／求生意志／是／能Strong / target / recruitment will / requirement / noh

【外９】／戰勝／病魔／最／有效／的／方法[Outside 9] / War / sickness / most / effective / effective / method

【００９１】その後図３のステップＳ６において、Wd_i
レジスタ内の値が１に設定され、第１の単語「堅強」の
拡張処理が開始される。ステップＳ７において、Thereafter, in step S6 of FIG. 3, Wd_i
The value in the register is set to 1, and the expansion processing of the first word “robust” is started. In step S7,

【数２１】Wd_expand［１］≠φ であるか否かが決定される。Wd_expand［１］はφなの
で、拡張処理は必要なく、次の単語の拡張処理に進む。
ステップＳ１１においてWd_iレジスタの値が検査され、
最後の単語を指定する値に到達したか否かが決定され
る。上記結果は「ＮＯ」であり、ステップＳ１２におい
てWd_iレジスタの値が２に変化させられる。次いで再び
ステップＳ７において、第２の単語の拡張処理が続けら
れ、## EQU21 ## It is determined whether Wd_expand [1] １φ. Since Wd_expand [1] is φ, no expansion processing is necessary, and the process proceeds to the expansion processing of the next word.
In step S11, the value of the Wd_i register is checked,
It is determined whether the value specifying the last word has been reached. The result is "NO", and the value of the Wd_i register is changed to 2 in step S12. Next, in step S7 again, the expansion processing of the second word is continued,

【数２２】Wd_expand［２］≠φ であるか否かが決定される。ここでは## EQU22 ## It is determined whether or not Wd_expand [2] ≠ φ. here

【数２３】Wd_expand［２］＝ＢＮであって、これは前に名詞が接続され、Ｎ，ＢＮの順序
のフレーズを生成する拡張シンタックスを表すので、ス
テップＳ８において前の単語の品詞が検査される。この
とき、## EQU23 ## Since Wd_expand [2] = BN, which represents an extended syntax for generating a phrase in the order of N and BN with nouns connected before, the part of speech of the previous word is checked in step S8. Is done. At this time,

【数２４】Wd_type［Wd_i−１］＝Ｎは、拡張シンタックスＢＮに対応した名詞を表す。この
ように、２番目（Wd_i＝２）の単語「的」と、１番目
（Wd_i−１＝１）の単語「堅強」とは拡張されて、フレ
ーズ「堅強的」を形成する。他の単語において、Wd_phr
［１］と、Wd_phr［２］と、Wd_phr［３］とは拡張され
て、開始位置Phr_start＝１と、終了位置Phr_end＝３
と、フレーズ長さ[Expression 24] Wd_type [Wd_i-1] = N represents a noun corresponding to the extended syntax BN. In this way, the second (Wd_i = 2) word “target” and the first (Wd_i−1 = 1) word “robust” are expanded to form the phrase “robust”. In other words, Wd_phr
[1], Wd_phr [2], and Wd_phr [3] are expanded to include a start position Phr_start = 1 and an end position Phr_end = 3.
And the phrase length

【数２５】Phr_length＝３−１＋１＝３とを有する新しいフレーズを形成する。ステップＳ９に
おいて、これらの値はPhr_startレジスタ、Phr_endレジ
スタ及びPhr_lengthレジスタにそれぞれ記憶される。続
いてステップＳ１０において、Wd_phrアレーレジスタ内
の３つの音節を含むこのフレーズに関連付けられた値は
以下のように更新される。Form a new phrase with Phr_length = 3-1 + 1 = 3. In step S9, these values are stored in the Phr_start register, Phr_end register, and Phr_length register, respectively. Subsequently, in step S10, the value associated with this phrase containing three syllables in the Wd_phr array register is updated as follows.

【００９２】[0092]

【表４】 ―――――――――――――――――――――――――――――――――――― Wd_phr［１］＝（２，１） → Wd_phr［１］＝（３，１） Wd_phr［２］＝（２，２） → Wd_phr［２］＝（３，２） Wd_phr［３］＝（１，１） → Wd_phr［３］＝（３，３） ――――――――――――――――――――――――――――――――――――[Table 4] ―――――――――――――――――――――――――――――――――― Wd_phr [1] = (2,1) → Wd_phr [1] = (3,1) Wd_phr [2] = (2,2) → Wd_phr [2] = (3,2) Wd_phr [3] = (1,1) → Wd_phr [3] = (3) , 3) ――――――――――――――――――――――――――――――――――――

【００９３】次いで、ステップＳ１１においてWd_iレジ
スタの値がまだ最後の単語を指定する値に到達していな
いと決定されたので、ステップＳ１２においてWd_iレジ
スタの値は１だけインクリメントされて３になり、次の
単語「求生意志」の拡張処理を続ける。上記拡張された
フレーズ「堅強的」は、後の拡張処理では拡張を実行す
るべき単語とみなされる。ステップＳ７においてNext, in step S11, it is determined that the value of the Wd_i register has not yet reached the value designating the last word. Therefore, in step S12, the value of the Wd_i register is incremented by 1 to 3, and Continue the expansion processing of the word "Shouki will". The expanded phrase “robust” is regarded as a word to be expanded in a later expansion process. In step S7

【数２６】Wd_expand［３］＝φ と決定された後で、ステップＳ１１においてWd_iレジス
タの値はまだ最後の単語を指定する値に到達していない
ので、ステップＳ１２においてWd_iレジスタの値は１だ
けインクリメントされ、ステップＳ７が再び実行され
る。このように、ステップＳ７、Ｓ８、Ｓ９、Ｓ１０、
Ｓ１１、及びＳ１２は繰り返され、第４の単語、第５の
単語、…第１１の単語「方法」までを処理する。フレー
ズ拡張処理は、ステップＳ１１において文中の最後の単
語に到達したことを検出することで終了される。このと
き、Wd_phrアレーレジスタ内の値は以下のようになる。After determining that Wd_expand [3] = φ, since the value of the Wd_i register has not yet reached the value specifying the last word in step S11, the value of the Wd_i register is only 1 in step S12. The value is incremented, and step S7 is executed again. Thus, steps S7, S8, S9, S10,
S11 and S12 are repeated to process the fourth word, the fifth word,... Up to the eleventh word “method”. The phrase expansion process is completed by detecting that the last word in the sentence has been reached in step S11. At this time, the values in the Wd_phr array register are as follows.

【００９４】[0094]

【表５】 ―――――――――――――――――――――――――――――――――――― Wd_phr［１］＝（２，１） →Wd_phr［１］＝（３，１）…「堅強的」 Wd_phr［２］＝（２，２） →Wd_phr［２］＝（３，２） Wd_phr［３］＝（１，１） →Wd_phr［３］＝（３，３） Wd_phr［４］＝（４，１） →Wd_phr［４］＝（４，１）…「求生意志」 Wd_phr［５］＝（４，２） →Wd_phr［５］＝（４，２） Wd_phr［６］＝（４，３） →Wd_phr［６］＝（４，３） Wd_phr［７］＝（４，４） →Wd_phr［７］＝（４，４） Wd_phr［８］＝（１，１） →Wd_phr［８］＝（１，１）…「是」 Wd_phr［９］＝（２，１） →Wd_phr［９］＝（２，１）…「能[Table 5] ―――――――――――――――――――――――――――――――――― Wd_phr [1] = (2,1) → Wd_phr [1] = (3,1)… “Robust” Wd_phr [2] = (2,2) → Wd_phr [2] = (3,2) Wd_phr [3] = (1,1) → Wd_phr [ 3] = (3,3) Wd_phr [4] = (4,1) → Wd_phr [4] = (4,1)… “willingness to seek” Wd_phr [5] = (4,2) → Wd_phr [5] = (4,2) Wd_phr [6] = (4,3) → Wd_phr [6] = (4,3) Wd_phr [7] = (4,4) → Wd_phr [7] = (4,4) Wd_phr [8 ] = (1,1) → Wd_phr [8] = (1,1)… “Yes” Wd_phr [9] = (2,1) → Wd_phr [9] = (2,1)… ”Noh

【外１０】」 Wd_phr［１０］＝（２，２）→Wd_phr［１０］＝（２，２） Wd_phr［１１］＝（２，１）→Wd_phr［１１］＝（４，１）…「戰勝病魔」 Wd_phr［１２］＝（２，２）→Wd_phr［１２］＝（４，２） Wd_phr［１３］＝（２，１）→Wd_phr［１３］＝（４，３） Wd_phr［１４］＝（２，２）→Wd_phr［１４］＝（４，４） Wd_phr［１５］＝（１，１）→Wd_phr［１５］＝（１，１）…「最」 Wd_phr［１６］＝（２，１）→Wd_phr［１６］＝（３，１）…「有效的」 Wd_phr［１７］＝（２，２）→Wd_phr［１７］＝（３，２） Wd_phr［１８］＝（１，１）→Wd_phr［１８］＝（３，３） Wd_phr［１９］＝（２，１）→Wd_phr［１９］＝（２，１）…「方法」 Wd_phr［２０］＝（２，２）→Wd_phr［２０］＝（２，２） ――――――――――――――――――――――――――――――――――――[Outside 10] Wd_phr [10] = (2,2) → Wd_phr [10] = (2,2) Wd_phr [11] = (2,1) → Wd_phr [11] = (4,1)... 12] = (2,2) → Wd_phr [12] = (4,2) Wd_phr [13] = (2,1) → Wd_phr [13] = (4,3) Wd_phr [14] = (2,2) → Wd_phr [14] = (4,4) Wd_phr [15] = (1,1) → Wd_phr [15] = (1,1) ... “most” Wd_phr [16] = (2,1) → Wd_phr [16 ] = (3,1) "effective" Wd_phr [17] = (2,2) → Wd_phr [17] = (3,2) Wd_phr [18] = (1,1) → Wd_phr [18] = ( 3,3) Wd_phr [19] = (2,1) → Wd_phr [19] = (2,1) ... “Method” Wd_phr [20] = (2,2) → Wd_phr [20] = (2,2) ――――――――――――――――――――――――――――――― ----

【００９５】前述のことから、文の１１個の単語「堅強
的求生意志是能From the above, it can be seen that the 11 words in the sentence “

【外１１】戰勝病魔最有效的方法」がフレーズ拡張処理
をされた後で、８個のフレーズ「堅強的」「求生意志」
「是」「能[Outside 11] Eight phrases are "strong" and "willing will"
`` Yes '' `` Noh

【外１２】」「戰勝病魔」「最」「有效的」及び「方法」を得るこ
とができる。[Outside 12] ”, Victory sickness,“ most ”,“ effective ”and“ method ”.

【００９６】次に、各フレーズのポーズの重みづけられ
た値の計算を容易化するために、ステップ１３におい
て、インデックスキーとして音節を用いるWd_phr［］
アレーレジスタ内の値は、インデックスキーとしてフレ
ーズ（「Phrase_id」によって指定される）を用いるPhr
ase［］アレーレジスタ内の値に変換される。Phrase
［］アレーレジスタのフォーマットは、（フレーズを
開始する位置、フレーズの長さ）を表す数の対（syllab
le，length）を有するNext, in step 13, Wd_phr [] using syllables as index keys to facilitate the calculation of the weighted value of the pose for each phrase.
The value in the array register is Phr using the phrase (specified by "Phrase_id") as the index key
ase [] Converted to the value in the array register. Phrase
[] The format of the array register is a pair of numbers (syllab
le, length)

【数２７】 Phrase［Phrase_id］＝（syllable，length）である。ステップＳ１３における変換の後で、Phrase
［１］アレーレジスタからPhrase［８］アレーレジスタ
までに対して以下の値が得られる。[Mathematical formula-see original document] Phrase [Phrase_id] = (syllable, length). After the conversion in step S13, the Phrase
[1] The following values are obtained from the array register to the Phrase [8] array register.

【００９７】[0097]

【表６】 ―――――――――――――――――――――――――――――――――――― Phrase［１］＝（１，３），Phrase［２］＝（４，４）， Phrase［３］＝（８，１），Phrase［４］＝（９，２）， Phrase［５］＝（１１，４），Phrase［６］＝（１５，１）， Phrase［７］＝（１６，３），Phrase［８］＝（１９，２）。 ――――――――――――――――――――――――――――――――――――[Table 6] ―――――――――――――――――――――――――――――――――― Phrase [1] = (1,3) , Phrase [2] = (4,4), Phrase [3] = (8,1), Phrase [4] = (9,2), Phrase [5] = (11,4), Phrase [6] = (15,1), Phrase [7] = (16,3), Phrase [8] = (19,2). ――――――――――――――――――――――――――――――――――――

【００９８】図４のステップＳ１４から開始されると、
文中のフレーズは意味的処理ユニットであると考えられ
る。各意味的ユニットのポーズの重みづけられた値を計
算することによって、意味的ユニットの前及び後でポー
ズ継続時間が必要であるか否かが決定される。最初に、
処理セクションとして、少なくとも１３音節（両端の音
節も含めて）を含むフレーズ数が、文から続いて検索さ
れる。ステップＳ１４において、上記セクションの開始
ポインタP_beginは最初に１に設定され、上記セクショ
ンの終了ポインタP_endは開始ポインタと同じに設定さ
れる。続いてステップＳ１４において、P_beginとP_end
の間の音節数が計算され、S_rangeレジスタに記憶され
る。従って次式が成立する。When starting from step S14 in FIG.
The phrases in the sentence are considered to be semantic processing units. By calculating the weighted value of the pose of each semantic unit, it is determined whether a pause duration is required before and after the semantic unit. At first,
As the processing section, the number of phrases including at least 13 syllables (including both syllables) is subsequently searched from the sentence. In step S14, the section start pointer P_begin is initially set to 1, and the section end pointer P_end is set to be the same as the start pointer. Subsequently, in step S14, P_begin and P_end
Are calculated and stored in the S_range register. Therefore, the following equation is established.

【００９９】[0099]

【数２８】S_range＝Phrase［P_end］．syllable＋Phra
se［P_end］．length−Phrase［P_begin］．syllable＝
１＋３−１＝３S_range = Phrase [P_end]. syllable ＋ Phra
se [P_end]. length-Phrase [P_begin]. syllable =
1 + 3-1 = 3

【０１００】その後ステップＳ１６において、セクショ
ンが１３音節以下であることを示すS_range＜１３が成
立するか否かが決定される。次いでP_endレジスタの値
は次のフレーズを指定する。ステップＳ１７において、
P_endレジスタの値がまだ最後のフレーズを指定する値
に到達していないと決定されたときは、ステップＳ１８
においてP_endレジスタの値は１だけインクリメントさ
れ、P_beginレジスタ（開始ポインタに対応する）とP_e
ndレジスタ（終了ポインタに対応する）とがそれぞれ指
定するフレーズの間の音節数の計算が続けられ、ステッ
プＳ１５において上記音節数が１３音節以上であるか否
かが決定される。巡回するステップＳ１５、Ｓ１６、Ｓ
１７及びＳ１８は反復される。P_endレジスタの値が５
であるとき、ステップＳ１５においてS_rangeレジスタ
の値が１４であると決定され、ステップＳ１６において
上記S_rangeレジスタの値が１３音節以上であると決定
される。次いでステップＳ１９に進み、P_begin＝１か
らP_end＝５までが指定する拡張された５つのフレーズ
によって構成されるセクションの、ポーズ位置及びポー
ズ継続時間が以下に説明されるように決定される。Thereafter, in step S16, it is determined whether or not S_range <13 indicating that the section is 13 syllables or less is satisfied. The value of the P_end register then specifies the next phrase. In step S17,
If it is determined that the value of the P_end register has not yet reached the value designating the last phrase, step S18
In, the value of the P_end register is incremented by 1, and the P_begin register (corresponding to the start pointer) and P_e
The calculation of the number of syllables between the phrases respectively designated by the nd register (corresponding to the end pointer) is continued, and it is determined in step S15 whether or not the number of syllables is 13 or more syllables. Steps S15, S16, S to go around
17 and S18 are repeated. P_end register value is 5
In step S15, the value of the S_range register is determined to be 14, and in step S16, the value of the S_range register is determined to be 13 or more syllables. Next, proceeding to step S19, the pause position and the pause duration of the section constituted by the expanded five phrases designated by P_begin = 1 to P_end = 5 are determined as described below.

【０１０１】[0101]

【表７】 [Table 7]

【０１０２】シーケンスにおいてセクション内のフレー
ズを検査し、前ポーズ文字又は後ポーズ文字が存在する
か否かを調べるために、最初に図５のサブステップＳ１
９−１においてP_iレジスタの値がP_beginレジスタ（開
始ポインタ）の値に設定される。このとき、P_iレジス
タの値は１であり、フレーズ「堅強的」を指定してい
る。次いでサブステップＳ１９−２において「前ポーズ
文字のテーブルメモリ」１５が検索され、「堅強的」が
前ポーズ文字でないことを見出だす。次にサブステップ
Ｓ１９−５において「後ポーズ文字のテーブルメモリ」
１６が検索され、「堅強的」が後ポーズ文字でないこと
を見出だす。次のフレーズの検査を続けるために、サブ
ステップＳ１９−８においてP_iレジスタの値がまだセ
クションの最後のフレーズを指定する値に到達していな
いことを確認した後で、サブステップＳ１９−９におい
てP_iレジスタの値は１だけインクリメントされ、フレ
ーズ「求生意志」の検査が続けられ、これらの文字（外
２０のフレーズ）がサブステップＳ１９−２又はサブス
テップＳ１９−５において、前ポーズ文字でも後ポーズ
文字でないことが見出だされる。サブステップＳ１９−
８においてP_iレジスタがまだセクションの最後のフレ
ーズに到達していないことを確認する。次いでP_iレジ
スタの値は１だけインクリメントされ、フレーズ「是」
の検査が続けられ、サブステップＳ１９−２において上
記フレーズを構成する文字「是」が前ポーズ文字である
ことが見出だされる。「是」のポーズの重みづけられた
値のパラメータｆｐは、「前ポーズ文字のテーブルメモ
リ１５」から得られるように５．４であり、それについ
てサブステップＳ１９−３において前ポーズの重みづけ
られた値Ｗｆｐが以下のように計算される。In order to examine the phrases in the section in the sequence and to see if there is a pre-pause character or a post-pause character, first, in sub-step S1 of FIG.
At 9-1, the value of the P_i register is set to the value of the P_begin register (start pointer). At this time, the value of the P_i register is 1, which specifies the phrase “robust”. Next, in sub-step S19-2, "table memory of previous pause character" 15 is searched to find that "robust" is not the previous pause character. Next, in sub-step S19-5, "table memory for post-pause character"
16 is searched to find that "robust" is not a post-pause character. In order to continue checking the next phrase, after confirming in step S19-8 that the value of the P_i register has not yet reached the value specifying the last phrase of the section, P_i is determined in substep S19-9. The value of the register is incremented by one, and the examination of the phrase "request for will" is continued, and these characters (the other 20 phrases) are recognized in the sub-step S19-2 or the sub-step S19-5 by the pre-pause character or the post-pause character. Is not found. Sub-step S19-
At 8, check that the P_i register has not yet reached the last phrase of the section. Next, the value of the P_i register is incremented by 1, and the phrase “ze”
Is continued, and in sub-step S19-2, it is found that the character "ZE" constituting the phrase is a pre-pause character. The parameter fp of the weight value of the pose of “ze” is 5.4, as obtained from the “table memory 15 of the previous pose character”, which is weighted in the sub-step S19-3. The calculated value Wfp is calculated as follows.

【０１０３】[0103]

【数２９】Ｗｆｐ＝（Phrase［P_i］．syllable−Phras
e［P_begin］．syllable＋Phrase［P_i］．length）×
ｆｐ＝（８−１＋１）×５．４＝４３．２[Mathematical formula-see original document] Wfp = (Phrase [P_i] .syllable-Phras]
e [P_begin]. syllable + Phrase [P_i]. length) ×
fp = (8-1 + 1) × 5.4 = 43.2

【０１０４】その後、サブステップＳ１９−４におい
て、ＷｆｐはW_minorレジスタの値以上なので（この実
施形態においてW_minorレジスタの値は２０に設定され
ている)、図６のサブステップＳ１９−１６が実行され
る。それは前ポーズなので、ポーズフレーズP_pauseはThereafter, in sub-step S19-4, since Wfp is equal to or greater than the value of the W_minor register (the value of the W_minor register is set to 20 in this embodiment), sub-step S19-16 of FIG. 6 is executed. . Because it is the previous pose, the pose phrase P_pause is

【数３０】P_pause＝P_i＝３−１＝２であり、ポーズ位置S_pauseは次式で表される。## EQU30 ## P_pause = P_i = 3-1 = 2, and the pause position S_pause is represented by the following equation.

【数３１】 S_pause＝Phrase［P_pause］．syllable＋Phrase［P_pause］．length−１＝Phrase［２］．syllable＋Phrase［２］．length−１＝４＋４−１＝７S_pause = Phrase [P_pause]. syllable + Phrase [P_pause]. length-1 = Phrase [2]. syllable + Phrase [2]. length-1 = 4 + 4-1 = 7

【０１０５】Ｗｆｐの値が４３．２であるため、W_majo
rよりも大きく（W_majorレジスタの値はこの実施形態で
は４０に設定されている）、ポーズ継続時間は次式に従
う。Since the value of Wfp is 43.2, W_majo
It is larger than r (the value of the W_major register is set to 40 in this embodiment), and the pause duration follows the following equation.

【０１０６】[0106]

【数３２】D_pause＝１２０ｍｓ[Formula 32] D_pause = 120 ms

【０１０７】次に、ポーズ位置S_pause（７の値をと
る）及びポーズ継続時間D_pause（１２０ｍｓの値をと
る）の値は、それぞれポーズアレーList_pauseに記憶さ
れ、P_beginレジスタとP_endレジスタの指定するフレー
ズの間のポーズ位置及びポーズ継続時間の決定は終了さ
れる。次いでフローは図５のステップＳ２０に戻り、P_
pauseレジスタの指定するフレーズが文の最後のフレー
ズであるか否かが検査される。このとき、P_pause＝２
で、最後のフレーズではない。従って、ステップＳ２１
が実行され、P_begin及びP_endレジスタが指定するフレ
ーズを、P_pauseレジスタが指定するフレーズの後の、
次のフレーズに指定する。すなわち、Next, the values of the pause position S_pause (having a value of 7) and the pause duration D_pause (having a value of 120 ms) are stored in the pause array List_pause, respectively, and the values of the phrases specified by the P_begin register and the P_end register are stored. The determination of the pause position and the pause duration is completed. Next, the flow returns to step S20 in FIG.
It is checked whether the phrase specified by the pause register is the last phrase of the sentence. At this time, P_pause = 2
And not the last phrase. Therefore, step S21
Is executed, and the phrase specified by the P_begin and P_end registers is replaced with the phrase specified by the P_pause register.
Specify the next phrase. That is,

【数３３】P_begin＝P_end＝P_pause＋１＝２＋１＝３である。巡回するステップＳ１５、Ｓ１６、Ｓ１７及び
Ｓ１８は、ステップＳ１６においてS_rangeレジスタの
値が１３になるP_end＝８まで反復される。ステップＳ
１９が実行され、P_begin＝３からP_end＝８までが指定
する拡張された６つのフレーズによって構成されるセク
ションの、ポーズ位置及びポーズ継続時間の決定の準備
を以下のようにする。P_begin = P_end = P_pause + 1 = 2 + 1 = 3 Steps S15, S16, S17, and S18 to be repeated are repeated until P_end = 8 at which the value of the S_range register becomes 13 in step S16. Step S
Step 19 is executed to prepare for the determination of the pause position and the pause duration of the section composed of the expanded six phrases specified by P_begin = 3 to P_end = 8 as follows.

【０１０８】[0108]

【表８】 [Table 8]

【０１０９】文中のフレーズが前ポーズ文字又は後ポー
ズ文字を有するか否かをシーケンス中で検査するため
に、サブステップＳ１９−１においてP_iレジスタの値
はP_beginレジスタの値に設定される。巡回するサブス
テップＳ１９−２、Ｓ１９−５、Ｓ１９−８及び１９−
９が反復される。最後に、P_iレジスタの値が６である
フレーズ「最」が検査され、サブステップＳ１９−２に
おいて上記フレーズを構成する文字「最」が前ポーズ文
字であることが見出だされる。「最」の前ポーズの重み
づけられた値のパラメータｆｐは「前ポーズ文字のテー
ブルメモリ１５」から２．７であることが見出だされ
る。サブステップＳ１９−３において前ポーズの重みづ
けられた値Ｗｆｐは以下のように計算される。In order to check in the sequence whether the phrase in the sentence has a pre-pause character or a post-pause character, the value of the P_i register is set to the value of the P_begin register in sub-step S19-1. Circulating sub-steps S19-2, S19-5, S19-8 and 19-
9 is repeated. Finally, the phrase "most" whose value of the P_i register is 6 is examined, and in sub-step S19-2, it is found that the character "most" constituting the phrase is the preceding pause character. The parameter fp of the weighted value of the “most” previous pose is found to be 2.7 from the “previous pose character table memory 15”. In sub-step S19-3, the weight value Wfp of the previous pose is calculated as follows.

【０１１０】[0110]

【数３４】Ｗｆｐ＝（Phrase［P_i］．syllable−Phras
e［P_begin］．syllable＋Phrase［P_i］．length）×
ｆｐ＝（１５−８＋１）×２．７＝２１．６Wfp = (Phrase [P_i] .syllable-Phras
e [P_begin]. syllable + Phrase [P_i]. length) ×
fp = (15−8 + 1) × 2.7 = 21.6

【０１１１】次にサブステップＳ１９−４において、Ｗ
ｆｐはW_minorレジスタの値以上であると決定されたの
で（この実施形態においてW_minorレジスタの値は２０
に設定されている）、サブステップＳ１９−１６が実行
される。前ポーズなので、ポーズフレーズP_pauseは、Next, in sub-step S19-4, W
Since fp is determined to be equal to or greater than the value of the W_minor register (the value of the W_minor register is 20 in this embodiment).
), The sub-step S19-16 is executed. Because the pose is before, the pose phrase P_pause is

【数３５】P_pause＝P_i−１＝６−１＝５また、ポーズ位置S_pauseは、P_pause = P_i-1 = 6-1 = 5 Also, the pause position S_pause is

【数３６】 S_pause＝Phrase［P_pause］．syllable＋Phrase［P_pause］．length−１＝Phrase［５］．syllable＋Phrase［５］．length−１＝１１＋４−１＝１４である。Ｗｆｐ＝２１．６であり、W_majorよりも小さ
いので（この実施形態においてW_majorレジスタの値は
４０に設定されている）、ポーズ継続時間D_pauseはS_pause = Phrase [P_pause]. syllable + Phrase [P_pause]. length-1 = Phrase [5]. syllable + Phrase [5]. length-1 = 11 + 4-1 = 14. Since Wfp = 21.6 and smaller than W_major (the value of the W_major register is set to 40 in this embodiment), the pause duration D_pause is

【数３７】D_pause＝８０ｍｓ[Formula 37] D_pause = 80ms

【０１１２】その後、S_pauseレジスタの値（１４）及
びD_pauseレジスタの値（８０ｍｓ）がポーズアレーLis
t_pauseに記憶され、P_begin及びP_endレジスタが指定
するフレーズの間のポーズ位置及びポーズ継続時間の決
定は終了される。フローはステップＳ２０に戻り、再び
P_pauseレジスタの指定するフレーズが文中の最後のフ
レーズであるか否かを検査する。このとき、P_pause＝
５であり、最後のフレーズではない。ゆえに、ステップ
Ｓ２１に進み、P_begin及びP_endレジスタが指定するフ
レーズを、P_pauseレジスタが指定するフレーズの後の
次のフレーズに指定する。すなわちThereafter, the value (14) of the S_pause register and the value (80 ms) of the D_pause register are set to the pause array Lis.
The determination of the pause position and the pause duration between the phrases stored in t_pause and specified by the P_begin and P_end registers is completed. The flow returns to step S20, and again
Checks whether the phrase specified in the P_pause register is the last phrase in the sentence. At this time, P_pause =
5 is not the last phrase. Therefore, the process proceeds to step S21, where the phrase specified by the P_begin and P_end registers is specified as the next phrase after the phrase specified by the P_pause register. Ie

【数３８】P_begin＝P_end＝P_pause＋１＝５＋１＝６[Equation 38] P_begin = P_end = P_pause + 1 = 5 + 1 = 6

【０１１３】巡回するステップＳ１５、Ｓ１６、Ｓ１７
及びＳ１８はP_end＝９になるまで反復される。ステッ
プＳ１７において、P_endレジスタの値は文の最後のフ
レーズを指定する値に到達した。ついでステップＳ２２
が実行され、シーケンスにポーズアレーList_pauseの値
が出力され、本発明の装置の処理動作が終了される。次
いで、ポーズを挿入された上記テキストデータに対し
て、公知の音声合成手段によって音声合成をすることが
できる。Steps S15, S16, S17 to go around
And S18 are repeated until P_end = 9. In step S17, the value of the P_end register has reached a value designating the last phrase of the sentence. Then step S22
Is executed, the value of the pause array List_pause is output to the sequence, and the processing operation of the apparatus of the present invention ends. Next, speech synthesis can be performed on the text data into which the pause has been inserted by a known speech synthesis unit.

【０１１４】例文の「堅強的求生意志是能[0114] The example sentence "

【外１３】戰勝病魔最有效的方法」に対して得られたポーズ
位置とポーズ継続時間は（７，１２０ｍｓ）と（１４，
８０ｍｓ）である。言いかえれば、第７の音節「志」の
後に１２０ｍｓのポーズが存在し、第１４の音節「魔」
の後に８０ｍｓのポーズが存在する。図７は、従来技術
の音声合成装置に係る、本来の合成された音声の波形を
図示し、上記波形は本発明のポーズ継続時間処理装置を
用いてさらに処理され、図８に図示されるような波形を
持つ自然なポーズ継続時間を有する合成された音声を得
る。[Outside 13] The pause position and pause duration obtained for the “Strictly effective method” are (7,120 ms) and (14,
80 ms). In other words, there is a pause of 120 ms after the seventh syllable “Shi”, and the 14th syllable “Ma”
Is followed by a pause of 80 ms. FIG. 7 illustrates the waveform of the original synthesized speech according to the prior art speech synthesizer, which waveform is further processed using the pause duration processing device of the present invention, as shown in FIG. Obtain synthesized speech with natural pause durations with complex waveforms.

【０１１５】本発明は上述の実施形態に制限されるべき
ではない。例えば、本発明のポーズ継続時間処理装置の
フレーズ拡張部１４において、互いに隣接する単語をフ
レーズに拡張するために、フレーズ拡張シンタックスを
用いる変わりに、入力の際にフレーズマーカーを付加す
ることができる。入力文にフレーズマーカーを標記し、
入力文に対して辞書メモリ１１を参照して単語を検出
し、次いで単語から自動拡張してフレーズを生成する。
また、前もって人手で判断処理を行うか、フレーズ辞書
を利用して自動的にフレーズの判断処理を実行するか、
又は統計したコーパスを利用するかのいずれかの方法に
よってフレーズの位置を発見することができ、それによ
ってフレーズとフレーズとの間のポーズ位置及びポーズ
継続時間を決定できる。それにかわって、フレーズキャ
ッシュを構成し、入力文中のフレーズを類似の方法で検
査することができる。The present invention should not be limited to the embodiments described above. For example, in the phrase expansion unit 14 of the pause duration processing apparatus of the present invention, a phrase marker can be added at the time of input instead of using phrase expansion syntax in order to expand words adjacent to each other into a phrase. . Put a phrase marker on the input sentence,
A word is detected by referring to the dictionary memory 11 for the input sentence, and then the phrase is automatically extended from the word to generate a phrase.
In addition, whether to perform the judgment process manually in advance, or to automatically execute the judgment process of the phrase using a phrase dictionary,
Alternatively, the position of the phrase can be found either by using a statistical corpus or the like, whereby the pause position and pause duration between the phrases can be determined. Instead, a phrase cache can be constructed to check for phrases in the input sentence in a similar manner.

【０１１６】上述のことから、本発明は、入力文の音節
の音声の継続時間を考慮することに加えて、さらに、巧
妙な方法で意味的に適当な位置にポーズ継続時間を挿入
する。従って先行技術における、息継ぎによるポーズが
欠落することで発話が不自然になる問題は克服され、そ
れによって非常に自然でとてもわかりやすく自然なポー
ズ継続時間を有する合成された音声を提供する。From the above, in addition to taking into account the duration of the syllable speech of the input sentence, the present invention further inserts the pause duration at semantically appropriate locations in a subtle way. Thus, the problem of prior art speech artifacts due to lack of pause due to breathing is overcome, thereby providing a synthesized speech having a very natural and very understandable natural pause duration.

【０１１７】本発明は最も実用的で好ましい実施形態と
考えられるものにおいて記述されたが、本発明が開示さ
れた実施形態に制限されず、すべてのそのような変形と
等価な配置を包含するような、最も広い解釈の意図と範
囲に含まれるさまざまな配置にあてはまるように意図す
ることが理解される。Although the present invention has been described in what is considered to be the most practical and preferred embodiments, the present invention is not limited to the disclosed embodiments, but encompasses all equivalent arrangements of such modifications. It is to be understood that the intent is to apply to various arrangements within the intent and scope of the broadest interpretation.

【０１１８】[0118]

【発明の効果】以上説明したように、本発明に係る中国
語音声合成におけるポーズ継続時間処理装置及び方法に
よれば、文入力部及びメモリバッファ部であるシステム
を有する装置であって、中国語の語彙と、品詞及び拡張
シンタックスを含む対応する情報とを記憶するための辞
書メモリと、複数の前でポーズする文字と、前ポーズパ
ラメータを含む対応する情報とを記憶するための前でポ
ーズする文字のテーブルメモリと、複数の後でポーズす
る文字と、後ポーズパラメータを含む対応する情報とを
記憶するための後でポーズする文字のテーブルメモリ
と、単語及びフレーズのうちのいずれか一方を含む意味
的単位の長さに従って一般ポーズパラメータを記憶する
ためのポーズパラメータ記憶部と、上記辞書メモリに記
憶された中国語の語彙の複数の単語と入力文を比較する
ことにより、上記入力部を用いて入力された上記入力文
中の各単語の境界の位置を決定するための境界位置決定
手段と、上記辞書メモリを参照して境界の位置が決定さ
れた各単語の品詞及び拡張シンタックスを検索するため
の検索手段と、上記拡張シンタックスと、上記文中の複
数の単語のうちの隣接するものの品詞の関係とに従っ
て、文中の複数の単語を複数のフレーズに結合する結合
手段と、複数の単語の複数のフレーズへの結合の後で、
これらの意味的単位の前及び後の位置がポーズ位置候補
となるように、上記入力文から分割された複数の各意味
的単位のポーズの重みづけられた値を計算する計算手段
とを備え、上記計算手段は複数の前でポーズする文字を
検索し、それらについて重みづけられた値を計算する手
段と、複数の後でポーズする文字を検索し、それらにつ
いて重みづけられた値を計算する手段とを含み、さらに
上記装置は、上記複数のポーズ位置候補のポーズの重み
づけられた値を計算することによって、複数のポーズ位
置及び複数のポーズ継続時間を決定するポーズ決定手段
を備え、予め決められた値よりも大きいポーズの重みづ
けられた値を有するポーズ位置候補の各々はポーズ位置
であると決定され、かつポーズレベルを与えられ、上記
ポーズ継続時間はそのポーズレベルに従って決定され
る。As described above, according to the apparatus and method for processing a pause duration in Chinese speech synthesis according to the present invention, an apparatus having a system that is a sentence input unit and a memory buffer unit is provided. Memory for storing the vocabulary and corresponding information including part-of-speech and extended syntax, a plurality of characters to be paused before, and a pause before storing corresponding information including a previous pause parameter. A table memory of characters to be paused, a table memory of characters to be paused to store a plurality of characters to be paused later, and corresponding information including a post-pause parameter, and one of words and phrases to be stored. A pose parameter storage unit for storing general pose parameters according to the length of a semantic unit including the word, and a Chinese word stored in the dictionary memory. By comparing the plurality of words with the input sentence, the boundary position determining means for determining the position of the boundary of each word in the input sentence input using the input unit, and referring to the dictionary memory A search unit for searching for the part of speech and the extended syntax of each word whose boundary position has been determined, the extended syntax, and the relationship between the parts of speech of adjacent ones of the plurality of words in the sentence. After combining multiple words into multiple phrases and combining multiple words into multiple phrases,
Calculating means for calculating a weighted value of the pose of each of the plurality of semantic units divided from the input sentence so that positions before and after these semantic units are pose position candidates. The calculating means searches for a plurality of previously paused characters and calculates a weighted value for them, and the means for searching for a plurality of subsequently paused characters and calculates a weighted value for them. The apparatus further comprises a pose determining means for determining a plurality of pose positions and a plurality of pause durations by calculating weighted values of the poses of the plurality of pose position candidates. Each of the candidate pose positions having a weighted value of the pose greater than the given value is determined to be a pose position and given a pose level, wherein the pause duration is It is determined according to the pause level.

【０１１９】従って、本発明に係るポーズ継続時間処理
装置を用いて合成された音声には息の変化を模倣するよ
うに複数のポーズが存在するので、合成された音声のリ
ズムの自然さと意味のわかりやすさとを向上させること
ができる。Therefore, a voice synthesized using the pause duration processing apparatus according to the present invention has a plurality of poses so as to imitate a change in breath, so that the naturalness and meaning of the rhythm of the synthesized voice are significant. The clarity can be improved.

[Brief description of the drawings]

【図１】本発明に係る一実施形態である、中国語のテ
キストから音声に変換するためのポーズ継続時間処理装
置の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a pause duration processing apparatus for converting a Chinese text into a voice according to an embodiment of the present invention.

【図２】図１のポーズ継続時間処理装置によって実行
される、ポーズ位置及びポーズ継続時間計算処理の第１
の部分を示すフローチャートである。FIG. 2 is a first diagram illustrating a pause position and pause duration calculation process executed by the pause duration processing apparatus of FIG. 1;
It is a flowchart which shows the part.

【図３】図１のポーズ継続時間処理装置によって実行
される、ポーズ位置及びポーズ継続時間計算処理の第２
の部分を示すフローチャートである。FIG. 3 is a second diagram illustrating a pause position and pause duration calculation process performed by the pause duration processing apparatus of FIG. 1;
It is a flowchart which shows the part.

【図４】図１のポーズ継続時間処理装置によって実行
される、ポーズ位置及びポーズ継続時間計算処理の第３
の部分を示すフローチャートである。FIG. 4 is a diagram illustrating a pause position and pause duration calculation process performed by the pause duration processing apparatus of FIG. 1;
It is a flowchart which shows the part.

【図５】図４のサブルーチンである、ポーズ位置及び
ポーズ継続時間計算サブルーチン処理の第１の部分を示
すフローチャートである。FIG. 5 is a flowchart showing a first part of a pause position and pause duration calculation subroutine process, which is a subroutine of FIG. 4;

【図６】図４のサブルーチンである、ポーズ位置及び
ポーズ継続時間計算サブルーチン処理の第２の部分を示
すフローチャートである。6 is a flowchart showing a second part of a pause position and pause duration calculation subroutine process, which is a subroutine of FIG.

【図７】従来技術に係る音声合成のための音声継続時
間処理装置によって音声合成された中国語テキストの波
形であって、ポーズが挿入されていないときの信号波形
図である。FIG. 7 is a signal waveform diagram of a Chinese text that is speech-synthesized by the speech duration processing apparatus for speech synthesis according to the related art, when no pause is inserted.

【図８】図７の波形においてポーズが挿入されたとき
の信号波形図である。FIG. 8 is a signal waveform diagram when a pause is inserted in the waveform of FIG. 7;

【図９】図１の辞書メモリ１１の構造を説明する図で
ある。9 is a diagram illustrating the structure of the dictionary memory 11 of FIG.

【図１０】図１の前ポーズ文字のテーブルメモリ１５
の構成を説明する図である。FIG. 10 is a table memory 15 of the previous pause character in FIG. 1;
FIG. 3 is a diagram illustrating the configuration of FIG.

【図１１】図１の後ポーズ文字のテーブルメモリ１６
の構成を説明する図である。11 is a table memory 16 for a post-pause character in FIG. 1;
FIG. 3 is a diagram illustrating the configuration of FIG.

【図１２】図１のポーズパラメータ記憶部１７の構成
を説明する図である。FIG. 12 is a diagram illustrating a configuration of a pause parameter storage unit 17 of FIG.

【図１３】従来技術に係る音声合成のための音声継続
時間処理装置のブロック図である。FIG. 13 is a block diagram of a speech duration processing apparatus for speech synthesis according to the related art.

[Explanation of symbols]

１０…文入力部、１１…辞書メモリ、１２…単語検査部、１３…品詞／拡張シンタックス検査部、１４…フレーズ拡張部、１５…前ポーズ文字のテーブルメモリ、１６…後ポーズ文字のテーブルメモリ、１７…ポーズパラメータ記憶部、１８…ポーズの重みづけられた値計算部、１９…ポーズ位置及びポーズ継続時間決定部、２０…レジスタ及びメモリバッファ部、１１０…ピンイン文入力部、１１１…フレーズ検査部、１１２…トーン／音節検査部、１１３…音節／リズム照合部、１１４…リズム検査部、１１５…基本音声継続時間記憶部、１１６…基本音声継続時間決定部、１１７…音声継続時間パラメータ記憶部、１１８…音節音声継続時間計算部、１１９…バッファメモリ領域。 10: sentence input unit, 11: dictionary memory, 12: word inspection unit, 13: part-of-speech / extended syntax inspection unit, 14: phrase expansion unit, 15: table memory for front pause character, 16: table memory for post pause character 17: Pause parameter storage unit 18: Pause weighted value calculation unit 19: Pause position and pause duration determination unit 20: Register and memory buffer unit 110: Pinyin sentence input unit 111: Phrase inspection 112, Tone / Syllable Inspection Unit, 113, Syllable / Rhythm Matching Unit, 114, Rhythm Inspection Unit, 115, Basic Voice Duration Storage Unit, 116, Basic Voice Duration Determination Unit, 117, Voice Duration Parameter Storage Unit 118: syllable voice duration calculating unit 119: buffer memory area

Claims

[Claims]

1. A pause duration processing method for a system for converting Chinese text to speech in a computer system having an input unit and a memory buffer unit, comprising: a Chinese vocabulary, a part of speech, and an extended syntax. A dictionary memory for storing corresponding information including a plurality of previously paused characters and a table memory of previously paused characters for storing corresponding information including a plurality of previous pause parameters. A table memory of later-paused characters for storing a plurality of later-paused characters and corresponding information including a plurality of later-paused parameters, wherein one of the words and phrases is stored. Configuring a pose parameter storage for storing a plurality of general pose parameters according to the length of the semantic unit including Determining a position of a boundary of each word in the input sentence input using the input unit by comparing a plurality of words of the Chinese vocabulary stored in the dictionary memory with the input sentence; Referring to the dictionary memory and searching for the part of speech and the extended syntax of each word whose boundary has been determined; and Combining the words in the sentence into the phrases, and after combining the words into the phrases, the positions before and after these semantic units are candidate pose positions, Calculating a weighted value of the pose of each of the plurality of semantic units divided from the input sentence, wherein the step of searching for a plurality of previously paused characters comprises Calculating a weighted value for each of the plurality of pose positions, and searching for a plurality of characters to be paused later, and calculating a weighted value for them. Determining a plurality of pose positions and a plurality of pause durations by calculating a pause position, wherein each of the pose position candidates having a pose weighted value greater than a predetermined value is a pause position. Is determined to be
And a pause level, wherein the pause duration is determined according to the pause level.

2. A pause duration processing device for a system for converting Chinese text to speech having a system which is a sentence input unit and a memory buffer unit, comprising a Chinese vocabulary, a part of speech and an extended syntax. A dictionary memory for storing corresponding information including: a plurality of previously paused characters; a table memory of previously paused characters for storing corresponding information including previous pause parameters; A table memory of later-paused characters for storing later-paused characters and corresponding information including later-pause parameters; and general according to the length of semantic units containing either words or phrases. A pose parameter storage unit for storing a pause parameter; and a plurality of words of a Chinese vocabulary stored in the dictionary memory and an input sentence. By doing so, boundary position determining means for determining the position of the boundary of each word in the input sentence input using the input unit, and each word whose boundary position is determined by referring to the dictionary memory Search means for searching for the part of speech and the extended syntax of the sentence, the plurality of words in the sentence are divided into a plurality of phrases according to the extended syntax, Combining means for combining, after combining a plurality of words into a plurality of phrases, a plurality of each of the plurality of words divided from the input sentence so that positions before and after these semantic units are pose position candidates. Calculating means for calculating a weighted value of the pose of the semantic unit, said means for searching for a plurality of previously paused characters and calculating a weighted value for them; Means for searching for characters to be paused after, and calculating weighted values for them. Pose determining means for determining a plurality of pause durations, wherein each of the pose position candidates having a weight value of the pose greater than a predetermined value is determined to be a pose position, and a pause level is given. Wherein the pause duration is determined according to the pause level.