JPH0337700A - Pause insertion position determining system - Google Patents
Pause insertion position determining system
- Publication number
- JPH0337700A
- Authority
- JP
- Japan
- Prior art keywords
- word
- pause
- speech
- boundary
- character string
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Description
DETAILED DESCRIPTION OF THE INVENTION
(Field of Industrial Application)
The present invention relates to a pause insertion position determining system that determines where to insert pauses in rule-based speech synthesis and similar processes that convert character strings into speech.
(Prior Art)
In rule-based speech synthesis, which converts arbitrary text into speech, pauses of appropriate length must be inserted at appropriate positions in a sentence; this is important for improving the naturalness of the synthesized speech. There is a limit to how much a person can utter in one breath, so a speaker places pauses at boundaries between phrases (bunsetsu) or words that are only weakly connected in meaning, both to take a breath and to signal a break in meaning to the listener.
Synthesized speech therefore also sounds unnatural to the listener unless pauses are inserted appropriately.
Conventionally, pause insertion positions have been determined from the strength of the connection between adjacent bunsetsu.
That is, the conventional method exploits the property that the weaker the semantic connection between the preceding and following bunsetsu, the more likely a pause is to be inserted.
First, the strength of the connection at a boundary between bunsetsu is expressed as the number of bunsetsu from the preceding (modifying) bunsetsu to the bunsetsu that receives it; this measure is called the degree of separation. A large degree of separation means that the bunsetsu is linked to a more distant bunsetsu and is only weakly linked to the adjacent one. A pause is therefore considered likely at a bunsetsu boundary with a large degree of separation. In addition, there is a limit to how much a person can utter in one breath, and a speaker pauses at a suitable position to breathe. A pause is therefore also considered likely when the total number of moras in the bunsetsu before and after the boundary is large. For these reasons, the product of the degree of separation and the total number of moras is used as the evaluation measure for deciding whether to insert a pause, and a pause is inserted when this product exceeds a predetermined threshold.
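The conventional rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Boundary` record, the function name, and the example numbers are hypothetical and do not come from Reference 1, and the degree of separation is assumed to be supplied by a separate dependency analysis.

```python
from dataclasses import dataclass


@dataclass
class Boundary:
    """One bunsetsu boundary (hypothetical record for illustration)."""
    separation: int   # bunsetsu count from the modifying bunsetsu to the one that receives it
    total_moras: int  # moras in the bunsetsu before and after the boundary


def conventional_pauses(boundaries, threshold):
    """Return indices of boundaries where separation * total_moras exceeds the threshold."""
    return [
        i for i, b in enumerate(boundaries)
        if b.separation * b.total_moras > threshold
    ]


# Illustrative values only; the products are 8, 30 and 12.
example = [Boundary(1, 8), Boundary(3, 10), Boundary(2, 6)]
selected = conventional_pauses(example, threshold=15)
```

Because the separation value depends on a correct dependency parse, any parse error feeds directly into the product, which is the weakness the invention addresses.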
This method of determining pause insertion positions is described in detail in Acoustical Society of Japan, Speech Research Group material 878-07 (1978-4), "Examination of intonation rules for sentence speech" (Reference 1).
(Problems to be Solved by the Invention)
The conventional method requires accurate analysis of the dependency relations between the bunsetsu that make up a sentence. However, these dependency relations are difficult to analyze correctly in every case, so the selected pause insertion positions are sometimes unnatural.
The object of the present invention is to provide a pause insertion position determining system that can determine natural pause insertion positions without performing such dependency analysis.
(Means for Solving the Problems)
The first invention is a pause insertion position determining system that, when converting an input character string into speech, decomposes the sentence represented by the input character string into its constituent words and determines whether to insert a pause at each boundary between the decomposed words, characterized in that a numerical value indicating how likely a pause is to be inserted immediately after a word is stored in advance for each part of speech, and whether to insert a pause at each word boundary of the sentence is determined based on the stored value, for the part of speech of the word immediately before that boundary, indicating how likely a pause is to be inserted immediately after a word.
The second invention is a pause insertion position determining system that, when converting an input character string into speech, decomposes the sentence represented by the input character string into its constituent words and determines whether to insert a pause at each boundary between the decomposed words, characterized in that a numerical value indicating how likely a pause is to be inserted immediately before a word is stored in advance for each part of speech, and whether to insert a pause at each word boundary of the sentence is determined based on the stored value, for the part of speech of the word immediately after that boundary, indicating how likely a pause is to be inserted immediately before a word.
The third invention is a pause insertion position determining system that, when converting an input character string into speech, decomposes the sentence represented by the input character string into its constituent words and determines whether to insert a pause at each boundary between the decomposed words, characterized in that a numerical value indicating how likely a pause is to be inserted immediately after a word and a numerical value indicating how likely a pause is to be inserted immediately before a word are each stored in advance for each part of speech, and whether to insert a pause at each word boundary of the sentence is determined based on both the stored value, for the part of speech of the word immediately before that boundary, indicating how likely a pause is to be inserted immediately after a word, and the stored value, for the part of speech of the word immediately after that boundary, indicating how likely a pause is to be inserted immediately before a word.
(Operation)
A pause tends to be inserted at a boundary between bunsetsu or words (hereinafter, "boundary" alone means a bunsetsu or word boundary) where the words on either side of the boundary are weakly connected in meaning. The conventional method computed the strength of this connection from the dependency structure of the sentence. On the other hand, examining the parts of speech of the words before and after boundaries where pauses are inserted shows that pauses are easily inserted next to some parts of speech and rarely next to others. Exploiting this property of parts of speech should therefore make it possible to determine natural pause insertion positions without using the dependency structure of the sentence.
Accordingly, using recordings of a number of sentences read aloud by people (hereinafter, the sentence speech database), the parts of speech of the words immediately after or immediately before the boundaries where pauses occur are examined in advance, and for each part of speech it is determined how likely a pause is to be inserted immediately before or immediately after a word of that part of speech. As this likelihood one may use, for example, the proportion of occurrences of a part of speech in the sentence speech database that are accompanied by a pause, relative to all occurrences of that part of speech. That is, the probability P_p(PS) that a pause is inserted immediately before a word whose part of speech is PS, and the probability P_f(PS) that a pause is inserted immediately after such a word, are defined as follows.

$$P_p(PS) = \frac{\text{number of occurrences of part of speech } PS \text{ immediately preceded by a pause}}{\text{total number of occurrences of part of speech } PS} \qquad (1)$$

$$P_f(PS) = \frac{\text{number of occurrences of part of speech } PS \text{ immediately followed by a pause}}{\text{total number of occurrences of part of speech } PS} \qquad (2)$$

In the following, the description takes as an example the case where the pause insertion probabilities P_p(PS) and P_f(PS) are used as the numerical values indicating how likely a pause is to be inserted immediately before or after a word.
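As a rough illustration of equations (1) and (2), the following Python sketch estimates both probabilities from an annotated corpus. The corpus layout (a list of sentences, each a list of `(pos, pause_before, pause_after)` tuples) and the function name are assumptions made for this example; the patent does not specify how the sentence speech database is stored.

```python
from collections import Counter


def estimate_pause_probabilities(corpus):
    """Estimate P_p (pause before a word) and P_f (pause after a word) per part of speech.

    corpus: list of sentences; each sentence is a list of
            (pos, pause_before, pause_after) tuples (hypothetical layout).
    """
    total = Counter()   # all occurrences of each part of speech
    before = Counter()  # occurrences immediately preceded by a pause
    after = Counter()   # occurrences immediately followed by a pause
    for sentence in corpus:
        for pos, pause_before, pause_after in sentence:
            total[pos] += 1
            before[pos] += int(pause_before)
            after[pos] += int(pause_after)
    p_p = {pos: before[pos] / n for pos, n in total.items()}  # equation (1)
    p_f = {pos: after[pos] / n for pos, n in total.items()}   # equation (2)
    return p_p, p_f


# Tiny made-up corpus: one pause, after the first particle and before the second noun.
toy_corpus = [
    [("noun", False, False), ("particle", False, True), ("noun", True, False),
     ("particle", False, False), ("verb", False, False)],
    [("noun", False, False), ("particle", False, False), ("verb", False, False)],
]
p_p, p_f = estimate_pause_probabilities(toy_corpus)
```

In practice the estimates are only as reliable as the corpus is large; rare parts of speech may need smoothing, which this sketch leaves out.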
These pause insertion probabilities P_p(PS) and P_f(PS), that is, the likelihoods of pause insertion, are obtained for each part of speech.
Parts of speech may be classified into categories such as noun, verb, and adverb. For inflecting parts of speech such as verbs, adjectives, and auxiliary verbs, a finer classification by inflected form, such as the irrealis (mizenkei), continuative (ren'yokei), and attributive (rentaikei) forms, is also possible.
In the first invention, the measure used to decide at which boundaries of the sentence to be converted into speech a pause should be inserted is the pause insertion probability P_f(PS_p) given by equation (2), where PS_p is the part of speech of the word immediately before the boundary. One way to actually determine pause insertion positions with this measure is the method shown in (Reference 1): a value calculated from this measure and the lengths (for example, the numbers of moras) of the bunsetsu before and after the boundary is used as the evaluation measure, a threshold is set for it, and a pause is or is not inserted at each boundary depending on whether the evaluation measure there exceeds the threshold. Alternatively, the number of pauses to insert can be fixed first from the length of the sentence to be converted into speech, and pauses can then be inserted at boundaries in descending order of the measure.
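The two decision strategies just described might look as follows in Python. Several details are assumptions made for this sketch and are not fixed by the patent: the evaluation value is taken to be the product of the probability and the combined mora count (by analogy with Reference 1), word lengths stand in for bunsetsu lengths, and all function names are hypothetical.

```python
def score_boundaries(words, p_f):
    """Score each boundary between words[i] and words[i + 1].

    words: list of (pos, moras) pairs.
    p_f:   dict mapping a part of speech to the probability of a pause after it.
    Assumed evaluation value: probability * (moras before + moras after).
    """
    scores = []
    for (pos_prev, moras_prev), (_, moras_next) in zip(words, words[1:]):
        scores.append(p_f.get(pos_prev, 0.0) * (moras_prev + moras_next))
    return scores


def pauses_by_threshold(scores, threshold):
    """Strategy 1: insert a pause wherever the score exceeds the threshold."""
    return [i for i, s in enumerate(scores) if s > threshold]


def pauses_top_n(values, n):
    """Strategy 2: fix the number of pauses first, then take the n largest values."""
    ranked = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    return sorted(ranked[:n])


# Illustrative probabilities only.
toy_words = [("noun", 3), ("particle", 1), ("noun", 4), ("particle", 1), ("verb", 3)]
toy_p_f = {"noun": 0.0, "particle": 0.5, "verb": 0.1}
toy_scores = score_boundaries(toy_words, toy_p_f)
```

The second strategy can be applied either to these scores or directly to the raw probabilities; the text describes ranking by the measure itself.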
In the second invention, the measure is the pause insertion probability P_p(PS_f) given by equation (1), where PS_f is the part of speech of the word immediately after the boundary.
In the third invention, the measure uses both the pause insertion probability P_f(PS_p) given by equation (2) for the part of speech PS_p of the word immediately before the boundary and the pause insertion probability P_p(PS_f) given by equation (1) for the part of speech PS_f of the word immediately after the boundary. A measure combining the two could be, for example, the sum of P_f(PS_p) and P_p(PS_f).
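A minimal sketch of the combined measure of the third invention, using the sum that the text suggests as one possibility. The function name and the probability tables are hypothetical; unknown parts of speech are assumed to contribute zero.

```python
def combined_measure(pos_sequence, p_f, p_p):
    """For each boundary, add the probability of a pause after the preceding word's
    part of speech (p_f) and of a pause before the following word's part of speech (p_p)."""
    return [
        p_f.get(prev_pos, 0.0) + p_p.get(next_pos, 0.0)
        for prev_pos, next_pos in zip(pos_sequence, pos_sequence[1:])
    ]


# Illustrative values only.
toy_pos = ["noun", "particle", "adverb", "verb"]
toy_after = {"particle": 0.5}
toy_before = {"adverb": 0.25, "verb": 0.125}
toy_measure = combined_measure(toy_pos, toy_after, toy_before)
```

A sum treats the two cues as independent evidence; other combinations, such as a product or a weighted sum, would fit the same interface.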
With the techniques described above, natural pause insertion positions can be determined using only the parts of speech of the words obtained by morphological analysis, without using the dependency structure of the sentence to be converted into speech.
(Embodiment)
FIG. 1 is a block diagram showing an embodiment for realizing the pause insertion position determining system according to the first invention.
First, the character string to be converted into speech is input from the character string input terminal 11. The input character string is sent to the morphological analysis unit 12, which divides the sentence represented by the input character string into a word sequence and determines the part of speech and reading of each word. This word sequence, together with the parts of speech and readings, is sent to the word length calculation unit 13 and the pause insertion position determination unit 15. Given the word sequence, the word length calculation unit 13 uses the readings to calculate the length (for example, the number of moras) of each word in the sentence and sends the lengths to the pause insertion position determination unit 15.
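The word length calculation could, for instance, count moras directly from a kana reading. The sketch below assumes the reading is written purely in kana and uses the common rule that every kana counts as one mora except the small vowels and small ya/yu/yo, which merge with the preceding kana; the patent itself does not say how unit 13 computes the length.

```python
# Small kana that combine with the preceding kana instead of forming their own mora.
SMALL_KANA = set("ァィゥェォャュョヮぁぃぅぇぉゃゅょゎ")


def count_moras(reading):
    """Count moras in a kana reading (the geminate mark, the moraic nasal and
    the long-vowel mark each count as one mora)."""
    return sum(1 for ch in reading if ch not in SMALL_KANA)


moras_kyou = count_moras("キョウ")        # kyo-u
moras_gakkou = count_moras("ガッコウ")    # ga-k-ko-u
```

Characters other than kana would each be counted as one mora by this function, so readings should be normalized to kana first.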
The pause insertion probability storage unit 14 stores in advance, for each part of speech, the probability P_f(PS) given by equation (2) that a pause is inserted after a word of that part of speech.
The pause insertion position determination unit 15 first uses the word sequence with its parts of speech to read out, from the pause insertion probability storage unit 14, the pause insertion probability P_f(PS_p) corresponding to the part of speech PS_p of the word immediately before each boundary. Using these probabilities P_f(PS_p) and the word lengths calculated by the word length calculation unit 13, it determines the positions at which pauses are inserted by a method such as those described in the (Operation) section, and outputs the result from the pause insertion position output terminal 16.
To realize the second invention, the pause insertion probabilities stored in the pause insertion probability storage unit 14 of FIG. 1 are the values P_p(PS) given by equation (1). The pause insertion position determination unit 15 then reads out from the pause insertion probability storage unit 14 the probability P_p(PS_f) corresponding to the part of speech PS_f of the word immediately after each boundary, and determines the pause insertion positions in the same way as in the first invention.
To realize the third invention, the pause insertion probability storage unit 14 of FIG. 1 stores both P_p(PS) given by equation (1) and P_f(PS) given by equation (2). The pause insertion position determination unit 15 reads out from the pause insertion probability storage unit 14 the probability P_f(PS_p) corresponding to the part of speech PS_p of the word immediately before each boundary and the probability P_p(PS_f) corresponding to the part of speech PS_f of the word immediately after each boundary, and determines the pause insertion positions by a method such as those described in the (Operation) section.
(Effects of the Invention)
As described above, the present invention determines pause insertion positions without using the results of dependency analysis, which is difficult to perform correctly. It can therefore determine pause insertion positions that are more accurate and more natural than those of the conventional method. The present invention is thus effective as a pause insertion position determining system in speech synthesizers and similar devices that convert arbitrary text given as a character string into speech.
FIG. 1 is a block diagram showing an embodiment for realizing the pause insertion position determining system according to the first invention. In the figure, 11 is the character string input terminal, 12 is the morphological analysis unit, 13 is the word length calculation unit, 14 is the pause insertion probability storage unit, 15 is the pause insertion position determination unit, and 16 is the pause insertion position output terminal.
Claims (3)
(1) A pause insertion position determining system that, when converting an input character string into speech, decomposes the sentence represented by the input character string into its constituent words and determines whether to insert a pause at each boundary between the decomposed words, characterized in that a numerical value indicating how likely a pause is to be inserted immediately after a word is stored in advance for each part of speech, and whether to insert a pause at each word boundary of the sentence is determined based on the stored value, for the part of speech of the word immediately before that boundary, indicating how likely a pause is to be inserted immediately after a word.
(2) A pause insertion position determining system that, when converting an input character string into speech, decomposes the sentence represented by the input character string into its constituent words and determines whether to insert a pause at each boundary between the decomposed words, characterized in that a numerical value indicating how likely a pause is to be inserted immediately before a word is stored in advance for each part of speech, and whether to insert a pause at each word boundary of the sentence is determined based on the stored value, for the part of speech of the word immediately after that boundary, indicating how likely a pause is to be inserted immediately before a word.
(3) A pause insertion position determining system that, when converting an input character string into speech, decomposes the sentence represented by the input character string into its constituent words and determines whether to insert a pause at each boundary between the decomposed words, characterized in that a numerical value indicating how likely a pause is to be inserted immediately after a word and a numerical value indicating how likely a pause is to be inserted immediately before a word are each stored in advance for each part of speech, and whether to insert a pause at each word boundary of the sentence is determined based on both the stored value, for the part of speech of the word immediately before that boundary, indicating how likely a pause is to be inserted immediately after a word, and the stored value, for the part of speech of the word immediately after that boundary, indicating how likely a pause is to be inserted immediately before a word.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1173445A JP3001210B2 (en) | 1989-07-04 | 1989-07-04 | Pause insertion position determination device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1173445A JP3001210B2 (en) | 1989-07-04 | 1989-07-04 | Pause insertion position determination device |
Publications (2)
Publication Number | Publication Date |
---|---|
JPH0337700A (en) | 1991-02-19 |
JP3001210B2 (en) | 2000-01-24 |
Family
ID=15960609
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP1173445A Expired - Lifetime JP3001210B2 (en) | Pause insertion position determination device | 1989-07-04 | 1989-07-04 |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP3001210B2 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS59123889A (en) * | 1982-12-29 | 1984-07-17 | 富士通株式会社 | Voice editing/synthesization processing system |
Also Published As
Publication number | Publication date |
---|---|
JP3001210B2 (en) | 2000-01-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6751592B1 (en) | Speech synthesizing apparatus, and recording medium that stores text-to-speech conversion program and can be read mechanically | |
Dutoit | High-quality text-to-speech synthesis: An overview | |
Chu et al. | Locating boundaries for prosodic constituents in unrestricted Mandarin texts | |
JP4745036B2 (en) | Speech translation apparatus and speech translation method | |
JPH09500223A (en) | Multilingual speech recognition system | |
Dutoit | A short introduction to text-to-speech synthesis | |
Ashby | An acoustic profile of right-dislocations in French | |
Proença et al. | The LetsRead corpus of Portuguese children reading aloud for performance evaluation | |
KR100499116B1 (en) | Method and apparatus for prosodic phrasing for speech synthesis | |
JPS6318457A (en) | Method and apparatus for extracting feeling information | |
JPH0337700A (en) | Pause insertion position determining system | |
JP3076047B2 (en) | Pause insertion position determination device | |
JP3518340B2 (en) | Reading prosody information setting method and apparatus, and storage medium storing reading prosody information setting program | |
KR100304654B1 (en) | Method and apparatus for analyzing korean document | |
Defina et al. | Scaling processes of clause chains in Pitjantjatjara | |
JP2001092482A (en) | Speech synthesis system and speech synthesis method | |
JP2006330060A (en) | Speech synthesizer, speech processor, and program | |
JP3142160B2 (en) | Phonetic symbol generator | |
JPH0962286A (en) | Voice synthesizer and the method thereof | |
JPH05134691A (en) | Method and apparatus for speech synthesis | |
JPH03225400A (en) | Pause length determining system | |
Geissler | Tonal and laryngeal contrasts in Diaspora Tibetan | |
Adeyemo et al. | Development and integration of Text to Speech Usability Interface for Visually Impaired Users in Yoruba language. | |
KR0180650B1 (en) | Sentence analysis method for korean language in voice synthesis device | |
KR100959494B1 (en) | Voice Synthesizer and Its Method using Processing Not registered Word |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20071112 Year of fee payment: 8 |
| FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20081112 Year of fee payment: 9 |
| FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20091112 Year of fee payment: 10 |
| EXPY | Cancellation because of completion of term | |