JPH0337700A - Pause insertion position determining system - Google Patents

Pause insertion position determining system

Info

Publication number
JPH0337700A
JPH0337700A JP1173445A JP17344589A JPH0337700A JP H0337700 A JPH0337700 A JP H0337700A JP 1173445 A JP1173445 A JP 1173445A JP 17344589 A JP17344589 A JP 17344589A JP H0337700 A JPH0337700 A JP H0337700A
Authority
JP
Japan
Prior art keywords
word
pause
speech
boundary
character string
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP1173445A
Other languages
Japanese (ja)
Other versions
JP3001210B2 (en
Inventor
Kazuhiko Iwata
和彦 岩田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP1173445A priority Critical patent/JP3001210B2/en
Publication of JPH0337700A publication Critical patent/JPH0337700A/en
Application granted granted Critical
Publication of JP3001210B2 publication Critical patent/JP3001210B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Abstract

PURPOSE:To determine a natural pause insertion position by discriminating whether a pause should be inserted to a word boundary or not based on the numerical value which corresponds to the part of speech of the word of the boundary of each word constituting a sentence expressed with an inputted character string and indicates the preliminarily stored degree of easiness of pause insertion just after each word. CONSTITUTION:The character string to be converted to a voice is inputted from a character string input terminal 11. The inputted character string is sent to a morpheme analyzing part 12, and the sentence expressed with the input character string is divided to word strings, and parts of speech and readings of respective words are determined. A pause insertion position determining part 15 reads out the pause insertion probability which corresponds to the part of speech of the word just before the boundary of each word string having a part of speech and is stored in a pause insertion settlement storage part 14. The read-out pause insertion probability and the length of each word calculated by a word length calculating part 13 are used to determine the pause insertion position, and the result is outputted from a pause insertion position output terminal 16. Thus, the natural pause insertion position is determined.

Description

【発明の詳細な説明】 (産業上の利用分野) 本発明は、文字列を音声に変換する規則音声合成等にお
いて、ポーズを押入する位置を決定するポーズ挿入位置
決定方式に関する。
DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a pause insertion position determining method for determining a position to insert a pause in regular speech synthesis for converting character strings into speech.

(従来の技術) 任意の文章を音声に変換する規則音声合成においては、
文中の適切な位置に適切な長さのポーズを挿入すること
が必要であり、合成される音声の自然性を向上させる上
で重要である。すなわち、人間には一息で発声できる長
さに限界があり、発声者は、意味の上で結びつきの弱い
適当な文節あるいは単語の境界において、息継ぎのため
や聞き手に意味の切れ目を伝えるためにポーズを置く。
(Prior art) In regular speech synthesis that converts arbitrary sentences into speech,
It is necessary to insert a pause of an appropriate length at an appropriate position in a sentence, and this is important in improving the naturalness of synthesized speech. In other words, there is a limit to how long a human voice can be uttered in one breath, and the speaker pauses at the boundaries of phrases or words that have weak connections in terms of meaning, either to take a breather or to convey a break in meaning to the listener. put

したがって、合成音声を生成する際においても、適度に
ポーズが挿入されていないと聞き手は不自然さを感じる
Therefore, when generating synthesized speech, the listener will feel unnatural if appropriate pauses are not inserted.

従来、ポーズの挿入位置の決定には、隣接する文節と文
節の結びつきの強さが用いられていた。
Conventionally, the strength of the connection between adjacent bunsetsu has been used to determine the pose insertion position.

すなわち、前後の文節間の意味の上での結び付きが弱い
ほどポーズは挿入され易いという性質を利用する。
In other words, it utilizes the property that the weaker the semantic connection between the preceding and following phrases, the easier it is for a pause to be inserted.

まず、文節と文節の境界における結びつきの強さを、先
行文節から受けの文節に至るまで゛の文節数で表現し、
この尺度を分離度と呼ぶ。分離度の値が大きいというこ
とは、ある文節がより遠くにある文節と結びついており
、隣接する文節との結びつきは弱いということを表して
いる。したがって、分離度の大きい文節境界では、ポー
ズが挿入される可能性が高いと考える。また、人間には
一息で発声できる長さに限界があり、発声者は適当な位
置で息継ぎのためにポーズを置く。したがって、文節境
界の前後の文節の総モーラ数が多し)場合にポーズが挿
入され易いと考える。以上のことから、ポーズを挿入す
るかどうかの判断をするための評価尺度として、分離度
と総モーラ数との積の値を用い、これが、予め定めるあ
る閾値を越える場合にポーズが挿入されるものとする。
First, the strength of the bond at the boundary between bunsetsu and bunsetsu is expressed by the number of clauses from the preceding clause to the receiving clause,
This measure is called degree of separation. A large value of the degree of separation indicates that a certain phrase is connected to a phrase that is further away, and that the connection with adjacent phrases is weak. Therefore, we believe that there is a high possibility that a pause will be inserted at a bunsetsu boundary with a high degree of separation. Furthermore, humans have a limit to how long they can vocalize in one breath, so the speaker pauses at an appropriate position to catch their breath. Therefore, pauses are likely to be inserted when the total number of moras in the clauses before and after the clause boundary is large. Based on the above, the product of the degree of separation and the total number of moras is used as an evaluation measure to determine whether or not to insert a pause, and if this exceeds a certain predetermined threshold, a pause is inserted. shall be taken as a thing.

このようなポーズ挿入位置の決定方法については、日本
音響学会音声研究会試料878−07(1978−4)
1文音声の音調規則の検討](文献1)に詳述されてい
る。
Regarding the method of determining such a pause insertion position, please refer to the Acoustical Society of Japan Speech Research Group Sample 878-07 (1978-4).
Examination of intonation rules for one-sentence speech] (Reference 1).

(発明が解決しようとする問題点) 従来の方法では、文章を構成する文節同士の係り受は関
係を正確に解析する必要があった。しかしながら、この
係り受は関係を常に正確に解析することは難しく、この
ため、選択されたポーズ挿入位置が不自然になることが
あるという問題点があった。
(Problems to be Solved by the Invention) In the conventional method, it was necessary to accurately analyze the dependencies between clauses that make up a sentence. However, it is difficult to always accurately analyze the relationship of this modification, and as a result, there is a problem that the selected pose insertion position may become unnatural.

これに対して本発明は、このような係り受は関係の解析
を行うことなしに、自然なポーズの挿入位置を決定する
ことが可能なポーズ挿入位置決定方式を提供することを
目的としている。
In contrast, an object of the present invention is to provide a pose insertion position determination method that can determine a natural pose insertion position without analyzing such dependency relationships.

(問題を解決するための手段) 第1の本発明は、入力された文字列を音声に変換する際
に、前記入力された文字列で表される文章をそれを構成
する単語に分解し、分解された各単語の境界にポーズを
挿入するかどうかを判定するポーズ挿入位置決定方式に
おいて、予め単語の直後にどの程度ポーズが挿入され易
いかを表す数値を単語の品詞ごとに記憶しておき、入力
された文字列で表される文章を構成する各単語の境界の
直前の単語の品詞に応じた前記記憶された単語の直後の
ポーズの挿入され易さを表す数値に基づいて当該単語境
界にポーズを挿入するかどうかを判定することを特徴と
する。
(Means for Solving the Problem) The first aspect of the present invention provides, when converting an input character string into speech, a sentence represented by the input character string is decomposed into its constituent words, In the pause insertion position determination method that determines whether or not to insert a pause at the boundary of each decomposed word, a numerical value indicating how likely a pause is to be inserted immediately after a word is stored in advance for each part of speech of the word. , the word boundary is determined based on a numerical value representing the ease of inserting a pause immediately after the memorized word according to the part of speech of the word immediately before the boundary of each word constituting the sentence represented by the input character string. It is characterized by determining whether or not to insert a pose into the image.

第2の本発明は、入力された文字列を音声に変換する際
に、前記入力された文字列で表される文章をそれを構成
する単語に分解し、分解された各単語の境界にポーズを
挿入するかどうかを判定するポーズ挿入位置決定方式に
おいて、予め単語の直前にどの程度ポーズが挿入され易
いかを表す数値を単語の品詞ごとに記憶しておき、入力
された文字列で表される文章を構成する各単語の境界の
直後の単語の品詞に応じた前記記憶された単語の直前の
ポーズの挿入され易さを表す数値に基づいて当該単語境
界にポーズを挿入するかどうかを判定することを特徴と
する。
A second aspect of the present invention, when converting an input character string into speech, decomposes a sentence represented by the input character string into its constituent words, and pauses at the boundary of each decomposed word. In the pause insertion position determination method that determines whether or not to insert a pause, a numerical value indicating how likely a pause is to be inserted immediately before a word is memorized for each part of speech of the word, and the number expressed by the input character string is Determine whether or not to insert a pause at the word boundary based on a numerical value representing the ease with which a pause immediately before the memorized word is inserted according to the part of speech of the word immediately after the boundary of each word constituting the sentence. It is characterized by

第3の本発明は、入力された文字列を音声に変換する際
に、前記入力された文字列で表される文章をそれを構成
する単語に分解し、分解された各単語の境界にポーズを
押入するかどうかを判定するポーズ挿入位置決定方式に
おいて、予め単語の直後にどの程度ポーズが挿入され易
いかを表す数値と、単語の直前にどの程度ポーズが挿入
され易いかを表す数値とを、それぞれ単語の品詞ごとに
記憶しておき、入力された文字列で表される文章を構成
する各単語の境界の直前の単語の品詞に応した前記記憶
された単語の直後のポーズの押入され易さを表す数値と
、当該単語境界の直後の単語の品詞に応じた前記記憶さ
れた単語の直前のポーズの挿入され易さを表す数値とに
基づいて当該単語境界にポーズを挿入するかどうかを判
定することを特徴とする。
A third aspect of the present invention is that when converting an input character string into speech, the sentence represented by the input character string is decomposed into its constituent words, and pauses are made at the boundaries of each decomposed word. In the pause insertion position determination method that determines whether to insert a pause, a numerical value indicating how likely a pause is to be inserted immediately after a word and a numerical value indicating how likely a pause is to be inserted immediately before a word are determined in advance. , each part of speech of each word is memorized, and a pause immediately after the memorized word is inserted corresponding to the part of speech of the word immediately before the boundary of each word constituting the sentence represented by the input character string. Whether to insert a pause at the word boundary based on a numerical value representing the ease of insertion and a numerical value representing the ease with which a pause immediately before the memorized word can be inserted according to the part of speech of the word immediately after the word boundary. It is characterized by determining.

(作用) ポーズは、文節あるいは単語の境界(以下、単に境界と
あるのは、文節あるいは単語の境界を示すものとする)
において、その境界の前後にある単語の、意味の上での
結び付きが弱い境界に挿入され易い。従来方式は、この
結び付きの強弱を文章の係り受は構造から算出していた
。一方で、ポーズが挿入される境界の前後にある単語の
品詞を調べてみると、ポーズが挿入され易い品詞と、挿
入されにくい品詞とがあることがわかる。そこで、品詞
の持つこのような性質を利用すれば、文章の係り受は構
造を用いなくても、自然なポーズ挿入位置を決定するこ
とが可能であると考えられる。
(Action) A pause is a boundary between a phrase or a word (hereinafter, the term "boundary" simply refers to a boundary between a phrase or a word)
, words are likely to be inserted into a boundary where the words before and after the boundary have a weak connection in terms of meaning. In the conventional method, the strength of this connection was calculated from the structure of the text's dependencies. On the other hand, when examining the parts of speech of words before and after the boundary where a pause is inserted, it is found that there are parts of speech in which pauses are easily inserted and parts of speech in which it is difficult to insert. Therefore, it is thought that by utilizing such properties of parts of speech, it is possible to determine a natural pause insertion position without using the structure of sentence dependencies.

そこで、予め、いくつかの文章を人間が読み上げた音声
(以下、文章音声データベースと称する)から、ポーズ
が挿入されている境界の直後、あるいは直前にある単語
の品詞を調べ、品詞ごとに、その単語の直前あるいは直
後にどの程度ポーズが挿入され易いかを求める。ポーズ
の挿入され易さとしては、例えば、文章音声データベー
スにおけるある品詞の全出現に対する、その品詞がポー
ズを伴っている出現の割合等を用いることが考えられる
。すなわち、品詞がPSである単語の直前にポーズが挿
入される確率P、(PS)、直後にポーズが挿入される
確率P〆PS)をそれぞれ以下のように定義する。
Therefore, in advance, we check the parts of speech of the words immediately after or immediately before the boundary where the pause is inserted from the audio of several sentences read aloud by humans (hereinafter referred to as the sentence audio database), and then analyze the parts of speech for each part of speech. Find out how likely a pause is to be inserted immediately before or after a word. As the ease with which a pause is inserted, for example, the ratio of occurrences of a certain part of speech accompanied by a pause to all occurrences of that part of speech in the sentence speech database may be used. That is, the probability P,(PS) that a pause will be inserted immediately before a word whose part of speech is PS, and the probability P<PS) that a pause will be inserted immediately after, are defined as follows.

P、(PS)= (直前にポーズを伴っている品詞PS
の出現数)/(品詞psの全出現数) 、、、、 (1
)P〆PS)=(直後にポーズを伴っている品詞PSの
出現数)/(品詞psの全出現数) 、、、、 (2)
以下では、単語の直前あるいは直後にどの程度ポーズが
挿入され易いかを表す数値として、前記ポーズ挿入確率
Pp(PS)、ppps)を用いる場合を例にとって説
明を行うことにする。
P, (PS) = (Part of speech PS accompanied by a pause immediately before
(number of occurrences of part of speech ps) / (total number of occurrences of part of speech ps) ,,,, (1
)P〆PS)=(Number of occurrences of part-of-speech PS that immediately follows a pause)/(Total number of occurrences of part-of-speech ps) ,,,, (2)
In the following, an example will be explained in which the pause insertion probability Pp (PS), ppps) is used as a numerical value representing how likely a pause is to be inserted immediately before or after a word.

これらのポーズ挿入確率Pp(PS)、P〆PS)、す
なわちポーズの挿入され易さを、品詞ごとに求める。
These pause insertion probabilities Pp(PS), P〆PS), that is, the ease with which a pause is inserted, are determined for each part of speech.

品詞の種類としては、名詞、動詞、副詞といった分類の
方法がある。また、動詞や形容詞、助動詞などのように
活用する品詞については、未然形、連用形、連体形など
の活用形ごとに分類することも考えられる。
There are different classification methods for parts of speech, such as nouns, verbs, and adverbs. Furthermore, for parts of speech that are inflected such as verbs, adjectives, and auxiliary verbs, it is also possible to classify them into conjugated forms such as unnatural form, conjunctive form, and adjunctive form.

第1の本発明では、音声に変換する文章においてポーズ
を挿入するべき境界を決定する際の判定の尺度として、
(2)式で示される。各境界における当該境界の直前に
ある単語の品詞PS、に応じたポーズ挿入確率P〆PS
、)を用いる。このような判定尺度を用いて、実際にポ
ーズ挿入位置を決定する方法としては、例えば、(文献
1)で示されるような方法が考えられる。すなわち、前
記判定尺1度と境界の前後の文節の長さ(例えば、モー
ラ数)とから算出される値を評価尺度とする。この評価
尺度に閾値を設け、各境界における評価尺度が閾値を越
えているかどうかに応じて、当該境界にポーズを挿入す
るかどうかの判断を行う。あるいは、音声に変換すべき
文章の長さから、はじめにポーズをいくつ挿入するかを
定めておき、前記判定尺度の大きい境界から順にポーズ
を挿入していくこともできる。
In the first aspect of the present invention, as a criterion for determining the boundary where a pause should be inserted in a sentence to be converted into speech,
It is shown by equation (2). Pause insertion probability P〆PS according to the part of speech PS of the word immediately before the boundary at each boundary
, ) is used. As a method of actually determining the pose insertion position using such a determination criterion, for example, a method as shown in (Reference 1) can be considered. That is, the evaluation scale is a value calculated from the judgment scale 1 degree and the lengths of the clauses before and after the boundary (for example, the number of moras). A threshold is set for this evaluation scale, and depending on whether the evaluation scale at each boundary exceeds the threshold, it is determined whether or not to insert a pause at the boundary. Alternatively, the number of pauses to be inserted may be determined first based on the length of the sentence to be converted into speech, and the pauses may be inserted in order from the boundary with the largest criterion.

第2の本発明では、前記判定の尺度として、(1)式で
示される、各境界における当該境界の直後にある単語の
品詞PSfに応じたポーズ挿入確率P、(PSr)を用
いる。
In the second aspect of the present invention, the pause insertion probability P, (PSr) corresponding to the part of speech PSf of the word immediately following the boundary in each boundary is used as the criterion for the determination.

また、第3の本発明では、前記判定の尺度として、(2
)式で示される、各境界における当該境界の直前にある
単語の品詞PS、に応じたポーズ挿入確率P1(PS、
)と、(1)式で示される、当該境界の直後にある単語
の品詞PSrに応じたポーズ挿入確率Pp(PSf)と
を用いる。両者を用いた判定の尺度としては、例えば、
PdPS、)とP、(PSF)との和などが考えられる
Further, in the third aspect of the present invention, (2
), the pause insertion probability P1(PS,
) and the pause insertion probability Pp (PSf) according to the part of speech PSr of the word immediately after the boundary, which is shown by equation (1). As a judgment scale using both, for example,
A possible example is the sum of PdPS, ) and P, (PSF).

以上のような手法を用いることにより、音声に変換する
文章の係り受は構造は用いずに、形態素解析で得られる
単語の品詞を用いるだけで、自然なポーズ挿入位置を決
定することが可能となる。
By using the above method, it is possible to determine a natural pause insertion position by simply using the part of speech of the word obtained through morphological analysis, without using the structure of the dependency of the sentence to be converted into speech. Become.

(実施例) 第1図は、第1の本発明によるポーズ挿入位置決定方式
を実現するための一実施例を示すブロック図である。
(Embodiment) FIG. 1 is a block diagram showing an embodiment for realizing the pose insertion position determination method according to the first invention.

まず、音声に変換すべき文字列を、文字列入力端子11
から入力する。入力された前記文字列は形態素解析部工
2に送られ、入力文字列で表される文章を単語列に分割
し、各単語の品詞や読みを決定する。この品詞や読みを
伴った単語列は、単語長算出部13及びポーズ挿入位置
決走部15に送られる。単語長算出部13では、前記単
語列が与えられると、単語の読みを用いて文章を構成し
ている各単語の長さ(例えばモーラ数)を算出し、ポー
ズ挿入位置決走部15に送る。
First, the character string to be converted into speech is input to the character string input terminal 11.
Enter from. The input character string is sent to the morphological analysis section 2, which divides the sentence represented by the input character string into word strings and determines the part of speech and pronunciation of each word. This word string with part of speech and pronunciation is sent to the word length calculation section 13 and the pause insertion position determination section 15. When the word string is given, the word length calculation section 13 calculates the length of each word (for example, the number of moras) constituting the sentence using the pronunciation of the word, and sends it to the pause insertion position determination section 15. .

ポーズ挿入確率記憶部14には、(2)式で表される、
品詞の後ろにポーズが挿入される確率pKps)が、品
詞ごとに予め蓄えられている。
The pause insertion probability storage unit 14 stores, as expressed by equation (2),
The probability that a pause will be inserted after a part of speech (pKps) is stored in advance for each part of speech.

ポーズ仲人位置決定部15は、まず、前記品詞を伴った
単語列によって各境界の直前にある単語の品詞PS、に
対応した、前記ポーズ挿入確率記憶部14に蓄えられて
いるポーズ抑大確率PrCPS、)を読み出す。読み出
された前記ポーズ挿入確率P[PSp)と、前記単語長
算出部13で算出された各単語の長さとを用いて、(作
用)の項で説明したような手法によってポーズを挿入す
る位置を決定し、結果をポーズ挿入位置出力端子16か
ら出力する。
The pose matchmaker position determination unit 15 first determines the pose suppression probability PrCPS stored in the pause insertion probability storage unit 14, which corresponds to the part of speech PS of the word immediately preceding each boundary using the word string with the part of speech. , ). Using the read pause insertion probability P [PSp) and the length of each word calculated by the word length calculation unit 13, the position at which a pause is inserted is determined by the method described in the (effect) section. is determined, and the result is output from the pause insertion position output terminal 16.

第2の本発明を実現するためには、第1図におけるポー
ズ挿入確率記憶部14に蓄えられているポーズ挿入確率
を、(1)式で表されるPp(PS)とする。そして、
ポーズ挿入位置決走部15では、各境界の直後にある単
語の品詞PSfに応じて、ポーズ挿入確率記憶部14に
蓄えられているポーズ挿入確率Pp(PSr)を読み出
し、第1の本発明と同様にしてポーズを挿入する位置を
決定すればよい。
In order to realize the second aspect of the present invention, the pause insertion probability stored in the pause insertion probability storage section 14 in FIG. 1 is set to Pp (PS) expressed by equation (1). and,
The pause insertion position determination unit 15 reads out the pause insertion probability Pp (PSr) stored in the pause insertion probability storage unit 14 according to the part of speech PSf of the word immediately after each boundary, and reads out the pause insertion probability Pp (PSr) stored in the pause insertion probability storage unit 14. The position to insert the pose can be determined in the same way.

また、第3の本発明を実現するためには、第1図におけ
るポーズ挿入確率記憶部14に、(1)式で表されるP
、(PS)と、(2)式で表されるPdPS〉とを蓄え
ておく。そして、ポーズ挿入位置決走部15では、各境
界の直前にある単語の品詞PSpに応じたポーズ挿入確
率PdPS、)、及び各境界の直後にある単語の品詞P
S「に応じたポーズ挿入確率P、、(PSr)とを、ポ
ーズ挿入確率記憶部14から読み出し、(作用)の項で
説明したような手法によってポーズを挿入する位置を決
定すればよい。
In addition, in order to realize the third aspect of the present invention, it is necessary to store P expressed by equation (1) in the pause insertion probability storage unit 14 in FIG.
, (PS) and PdPS> expressed by equation (2) are stored. Then, the pause insertion position determining unit 15 calculates the pause insertion probability PdPS, ) corresponding to the part of speech PSp of the word immediately before each boundary, and the part of speech P of the word immediately after each boundary.
The pose insertion probabilities P, .

(発明の効果) 以上述べてきたように、本発明は、正しく解析すること
が困難な係り受は解析の結果を用いることなしに、ポー
ズの挿入位置を決定する。このため、従来方式よりも正
しく、自然なポーズの挿入位置を決定することが可能と
なる。したがって、本発明は、文字列で与えられた任意
の文章を音声に変換する音声合成装置等におけるポーズ
挿入位置決定方式として有効である。
(Effects of the Invention) As described above, the present invention determines the insertion position of a pose without using the results of analysis of dependencies that are difficult to analyze correctly. Therefore, it is possible to determine a more accurate and natural pose insertion position than the conventional method. Therefore, the present invention is effective as a pause insertion position determination method in a speech synthesis device or the like that converts an arbitrary sentence given as a character string into speech.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は、第1の本発明によるポーズ挿入位置決定方式
を実現するための一実施例を示すブロック図である。 図において、11は文字列入力端子、12は形態素解析
部、13は単語長算出部、14はポーズ挿入確率記憶部
、15はポーズ挿入位置決定部、16はポーズ挿入位置
出力端子である。
FIG. 1 is a block diagram showing an embodiment for realizing the pose insertion position determination method according to the first invention. In the figure, 11 is a character string input terminal, 12 is a morphological analysis section, 13 is a word length calculation section, 14 is a pause insertion probability storage section, 15 is a pause insertion position determination section, and 16 is a pause insertion position output terminal.

Claims (3)

【特許請求の範囲】[Claims] (1)入力された文字列を音声に変換する際に、前記入
力された文字列で表される文章をそれを構成する単語に
分解し、分解された各単語の境界にポーズを挿入するか
どうかを判定するポーズ挿入位置決定方式において、予
め単語の直後にどの程度ポーズが挿入され易いかを表す
数値を単語の品詞ごとに記憶しておき、入力された文字
列で表される文章を構成する各単語の境界の直前の単語
の品詞に応じた前記記憶された単語の直後のポーズの挿
入され易さを表す数値に基づいて当該単語境界にポーズ
を挿入するかどうかを判定することを特徴とするポーズ
挿入位置決定方式。
(1) When converting an input character string into speech, do you decompose the sentence represented by the input character string into its constituent words and insert a pause at the boundary of each decomposed word? In the pause insertion position determination method, a numerical value indicating how likely a pause is to be inserted immediately after a word is stored in advance for each part of speech of the word, and a sentence represented by the input character string is constructed. It is characterized in that it is determined whether or not to insert a pause at the word boundary based on a numerical value representing the ease with which a pause is inserted immediately after the memorized word according to the part of speech of the word immediately before the word boundary. A pose insertion position determination method.
(2)入力された文字列を音声に変換する際に、前記入
力された文字列で表される文章をそれを構成する単語に
分解し、分解された各単語の境界にポーズを挿入するか
どうかを判定するポーズ挿入位置決定方式において、予
め単語の直前にどの程度ポーズが挿入され易いかを表す
数値を単語の品詞ごとに記憶しておき、入力された文字
列で表される文章を構成する各単語の境界の直後の単語
の品詞に応じた前記記憶された単語の直前のポーズの挿
入され易さを表す数値に基づいて当該単語境界にポーズ
を挿入するかどうかを判定することを特徴とするポーズ
挿入位置決定方式。
(2) When converting an input character string into speech, do you decompose the sentence represented by the input character string into its constituent words and insert a pause at the boundary of each decomposed word? In the pause insertion position determination method, a numerical value indicating how likely a pause is to be inserted immediately before a word is stored in advance for each part of speech of the word, and a sentence represented by the input character string is constructed. It is characterized in that it is determined whether or not to insert a pause at the word boundary based on a numerical value representing the ease with which a pause immediately before the memorized word is inserted according to the part of speech of the word immediately after the word boundary. A pose insertion position determination method.
(3)入力された文字列を音声に変換する際に、前記入
力された文字列で表される文章をそれを構成する単語に
分解し、分解された各単語の境界にポーズを挿入するか
どうかを判定するポーズ挿入位置決定方式において、予
め単語の直後にどの程度ポーズが挿入され易いかを表す
数値と、単語の直前にどの程度ポーズが挿入され易いか
を表す数値とを、それぞれ単語の品詞ごとに記憶してお
き、入力された文字列で表される文章を構成する各単語
の境界の直前の単語の品詞に応じた前記記憶された単語
の直後のポーズの挿入され易さを表す数値と、当該単語
境界の直後の単語の品詞に応じた前記記憶された単語の
直前のポーズの挿入され易さを表す数値とに基づいて当
該単語境界にポーズを挿入するかどうかを判定すること
を特徴とするポーズ挿入位置決定方式。
(3) When converting an input character string into speech, do you decompose the sentence represented by the input character string into its constituent words and insert a pause at the boundary of each decomposed word? In the pause insertion position determination method that determines whether or not the word It is memorized for each part of speech and represents the ease with which a pause can be inserted immediately after the memorized word according to the part of speech of the word immediately before the boundary of each word constituting the sentence represented by the input character string. Determining whether or not to insert a pause at the word boundary based on a numerical value and a numerical value representing ease of insertion of a pause immediately before the memorized word according to the part of speech of the word immediately after the word boundary. A pose insertion position determination method featuring:
JP1173445A 1989-07-04 1989-07-04 Pose insertion position determination device Expired - Lifetime JP3001210B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1173445A JP3001210B2 (en) 1989-07-04 1989-07-04 Pose insertion position determination device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1173445A JP3001210B2 (en) 1989-07-04 1989-07-04 Pose insertion position determination device

Publications (2)

Publication Number Publication Date
JPH0337700A true JPH0337700A (en) 1991-02-19
JP3001210B2 JP3001210B2 (en) 2000-01-24

Family

ID=15960609

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1173445A Expired - Lifetime JP3001210B2 (en) 1989-07-04 1989-07-04 Pose insertion position determination device

Country Status (1)

Country Link
JP (1) JP3001210B2 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59123889A (en) * 1982-12-29 1984-07-17 富士通株式会社 Voice editing/synthesization processing system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59123889A (en) * 1982-12-29 1984-07-17 富士通株式会社 Voice editing/synthesization processing system

Also Published As

Publication number Publication date
JP3001210B2 (en) 2000-01-24

Similar Documents

Publication Publication Date Title
US6751592B1 (en) Speech synthesizing apparatus, and recording medium that stores text-to-speech conversion program and can be read mechanically
Dutoit High-quality text-to-speech synthesis: An overview
Chu et al. Locating boundaries for prosodic constituents in unrestricted Mandarin texts
JP4745036B2 (en) Speech translation apparatus and speech translation method
JPH09500223A (en) Multilingual speech recognition system
Dutoit A short introduction to text-to-speech synthesis
Ashby An acoustic profile of right-dislocations in French
Proença et al. The LetsRead corpus of Portuguese children reading aloud for performance evaluation
KR100499116B1 (en) Method and apparatus for prosodic phrasing for speech synthesis
JPS6318457A (en) Method and apparatus for extracting feeling information
JPH0337700A (en) Pause insertion position determining system
JP3076047B2 (en) Pose insertion position determination device
JP3518340B2 (en) Reading prosody information setting method and apparatus, and storage medium storing reading prosody information setting program
KR100304654B1 (en) Method and apparatus for analyzing korean document
Defina et al. Scaling processes of clause chains in Pitjantjatjara
JP2001092482A (en) Speech synthesis system and speech synthesis method
JP2006330060A (en) Speech synthesizer, speech processor, and program
JP3142160B2 (en) Phonetic symbol generator
JPH0962286A (en) Voice synthesizer and the method thereof
JPH05134691A (en) Method and apparatus for speech synthesis
JPH03225400A (en) Pause length determining system
Geissler Tonal and laryngeal contrasts in Diaspora Tibetan
Adeyemo et al. Development and integration of Text to Speech Usability Interface for Visually Impaired Users in Yoruba language.
KR0180650B1 (en) Sentence analysis method for korean language in voice synthesis device
KR100959494B1 (en) Voice Synthesizer and Its Method using Processing Not registered Word

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20071112

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081112

Year of fee payment: 9

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081112

Year of fee payment: 9

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20091112

Year of fee payment: 10

EXPY Cancellation because of completion of term
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20091112

Year of fee payment: 10