JPH0358097A - Determination system for pause insertion position - Google Patents

Determination system for pause insertion position

Info

Publication number
JPH0358097A
JPH0358097A JP1194970A JP19497089A JPH0358097A JP H0358097 A JPH0358097 A JP H0358097A JP 1194970 A JP1194970 A JP 1194970A JP 19497089 A JP19497089 A JP 19497089A JP H0358097 A JPH0358097 A JP H0358097A
Authority
JP
Japan
Prior art keywords
word
pause
speech
insertion position
parts
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP1194970A
Other languages
Japanese (ja)
Other versions
JP3076047B2 (en
Inventor
Kazuhiko Iwata
和彦 岩田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP01194970A priority Critical patent/JP3076047B2/en
Publication of JPH0358097A publication Critical patent/JPH0358097A/en
Application granted granted Critical
Publication of JP3076047B2 publication Critical patent/JP3076047B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Abstract

PURPOSE:To determine a pause insertion position by using a numeral corresponding to a combination of parts of speech of a word and the length of the word by storing numerals indicating the easiness of pause insertion by combinations of parts of speech of words before and after word borders. CONSTITUTION:A character string to be converted into a voice is inputted from a character string input terminal 11 and a morpheme analysis part decomposes a sentence into word strings and determines the part of speech and the rendering of each word. The word strings having the parts of speech and the rendering are sent out to a word length calculation part 13 and a pause insertion position determination part 15. A word length calculation part 13 calculates the length of each word constituting the sentence by using the rendering of the word and sends it to the pause insertion position determination part 15. A pause insertion probability storage part 14 is stored with pause insertion probability previously by combinations of parts of speech of words before and after borders. The pause insertion position determination part 15 uses the pause insertion probability and the length of the word to determine the natural insertion position of a pause.

Description

【発明の詳細な説明】 (産業上の利用分野) 本発明は、文字列を音声に変換する規則音声合戒等にお
いて、ポーズを挿入する位置を決定するポーズ挿入位置
決定方式に関する。
DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a pause insertion position determination method for determining the position at which a pause is to be inserted in a regular voice gathering for converting a character string into speech.

(従来の技術) 任意の文章を音声に変換する規則音声合或においては、
文中の適切な位置に適切な長さのポーズを挿入すること
が必要であり、合威される音声の自然性を向上させる上
で重要である。すなわち、人間には一息で発声できる長
さに限界があり、発声者は、意味の上で結びつきの弱い
適当な文節あるいは単語の境界において、息継ぎのため
や聞き手に意味の切れ目を伝えるためにポーズを置く。
(Prior art) In regular speech synthesis that converts arbitrary sentences into speech,
It is necessary to insert a pause of an appropriate length at an appropriate position in a sentence, and is important in improving the naturalness of the synthesized speech. In other words, there is a limit to how long a human voice can be uttered in one breath, and the speaker pauses at the boundaries of phrases or words that have weak connections in terms of meaning, either to take a breather or to convey a break in meaning to the listener. put

したがって、合或音声を生或する際においても、適度に
ポーズが挿入されていないと聞き手は不自然さを感じる
Therefore, even when producing audible speech, if appropriate pauses are not inserted, the listener will feel that it is unnatural.

従来、ポーズの挿入位置の決定には、隣接する文節と文
節の結びつきの強さが用いられていた。
Conventionally, the strength of the connection between adjacent bunsetsu has been used to determine the pose insertion position.

すなわち、前後の文節間の意味の上で結びつきが弱いほ
どポーズは挿入され易いという性質を利用する。
In other words, it utilizes the property that the weaker the connection between the preceding and following clauses in terms of meaning, the easier it is to insert a pause.

まず、文節と文節の境界における結びつきの強さを、先
行文節から受けの文節に至るまでの文節数で表現し、こ
の尺度を分離度と呼ぶ。分離度の値が大きいということ
は、ある文節がより遠くにある文節と結びついており、
隣接する文節との結びつきは弱いということを表してい
る。したがって、分離度の大きい文節境界では、ポーズ
が挿入される可能性が高いと考える。また、人間には一
息で発声できる長さに限界があり、発声者は適当な位置
で息継ぎのためにポーズを置く。したがって、文節境界
の前後の文節の総モーラ数が多い場合にポーズが挿入さ
れ易いと考える。以上のことから、ポーズを挿入するか
どうかの判断をするための評価尺度として、分離度と総
モーラ数との積の値を用い、これが、予め定めるある閾
値を越える場合にポーズが挿入されるものとする。
First, the strength of the connection between bunsetsu and bunsetsu boundaries is expressed by the number of bunsetsu from the preceding clause to the receiving clause, and this measure is called the degree of separation. A large value of separation means that a certain clause is connected to a clause that is further away.
This indicates that the connection with adjacent clauses is weak. Therefore, we believe that there is a high possibility that a pause will be inserted at a bunsetsu boundary with a high degree of separation. Furthermore, humans have a limit to how long they can vocalize in one breath, so the speaker pauses at an appropriate position to catch their breath. Therefore, it is considered that pauses are likely to be inserted when the total number of moras of clauses before and after a clause boundary is large. Based on the above, the product of the degree of separation and the total number of moras is used as an evaluation measure for determining whether to insert a pause, and if this exceeds a certain predetermined threshold, a pause is inserted. shall be taken as a thing.

このようなポーズ挿入位置の決定方法については、日本
音響学会音声研究会資料878−07(1978−4)
F文音声の音調規則の検討](文献1)に詳述されてい
る。
Regarding the method of determining such a pause insertion position, please refer to Acoustical Society of Japan Speech Research Group Material 878-07 (1978-4).
Examination of intonation rules for F-sentence speech] (Reference 1).

(発明が解決しようとする課題) 従来の方法では、文章を構戒する文節同士の係り受け関
係を正確に解析する必要があった。1−かじながら、こ
の係り受け関係を常に正確に解析することは難しく、こ
のため、選択されたポーズ挿入位置が不自然になること
があるという問題点があった。
(Problem to be Solved by the Invention) In the conventional method, it was necessary to accurately analyze the dependency relationships between the clauses that make up the sentence. 1- However, it is difficult to always accurately analyze this dependency relationship, and as a result, there is a problem that the selected pose insertion position may become unnatural.

これに対して本発明は、このような係り受け関係の解析
を行うことなしに、自然なポースの挿入位置を決定する
ことが可能なポーズ挿入位置決定方式を提供することを
目的としている。
In contrast, an object of the present invention is to provide a pose insertion position determining method that can determine a natural pose insertion position without analyzing such dependency relationships.

(課題を解決するための手段) 本発明は、入力された文字列で表される文章をそれを構
戒する単語に分解し、分解された各単語の境界にポーズ
を挿入するかどうかを判定するポーズ挿入位置決定方式
において、単語境界にどの程度ポーズが挿入され易いか
を表す数値を単語境界の前後にある単語の品詞の組み合
せごとに記憶しておき、入力された各単語の境界の前後
にある単語の品詞の組み合せに対応する数値と当該境界
の前後の単語の長さとを用いて当該単語境界にポーズを
挿入するか否かを定めることを特徴とする。
(Means for Solving the Problems) The present invention decomposes a sentence represented by an input character string into words that disturb it, and determines whether to insert a pause at the boundary of each decomposed word. In the pause insertion position determination method, a numerical value indicating how easily a pause is inserted at a word boundary is memorized for each combination of parts of speech of words before and after the word boundary. The method is characterized in that it is determined whether or not to insert a pause at the word boundary using a numerical value corresponding to a combination of parts of speech of the word in the word boundary and the lengths of the words before and after the boundary.

(作用) ポーズは、単語の境界において、その境界の前後にある
単語の意味の上での結び付きが弱い境界に挿入され易い
。従来方式は、この結び付きの強弱を文章の係り受け構
造から算出していた。一方で、ポーズが挿入される境界
の前後にある単語の品詞を調べてみると、ポーズが挿入
され易い品詞と、挿入されにくい品詞とがあることがわ
かる。
(Operation) A pause is likely to be inserted at a word boundary where the meanings of the words before and after the boundary are weakly connected. In the conventional method, the strength of this connection was calculated from the dependency structure of the sentence. On the other hand, when examining the parts of speech of words before and after the boundary where a pause is inserted, it is found that there are parts of speech in which pauses are easily inserted and parts of speech in which it is difficult to insert.

そこで、品詞の持つこのような性質を利用すれば、文章
の係り受け構造を用いなくても、自然なポーズ挿入位置
を決定することが可能になる。
Therefore, by utilizing such properties of parts of speech, it becomes possible to determine a natural pause insertion position without using the dependency structure of sentences.

そこで、予め、いくつかの文章を人間が読み上げた音声
(以下、文章音声データベースと称する)から、ポーズ
が挿入されている境界の前後にある単語の品詞を調べ、
品詞の組み合せごとにどの程度ポーズが挿入され易いか
を求める。ポーズの挿入され易さとしては、例えば、文
章音声データベースにおける、ある品詞の組み合せの全
出現に対するその品詞の組み合せがポーズを伴っている
出現の割合などを用いることが考えられる。以下では、
この割合をポーズ挿入確率と呼ぶことにする。品詞の組
み合せを考慮する際の品詞の種類としては、名詞、動詞
、副詞といった分類の方法がある。この他、動詞や形容
詞、助動詞などのように活用する品詞については、未然
形、連用形、連体形などの活用形ごとに分類することも
有益である。
Therefore, we first checked the parts of speech of the words before and after the boundary where the pause was inserted from the audio of several sentences read aloud by humans (hereinafter referred to as the sentence audio database).
Find out how easily a pause is inserted for each combination of parts of speech. As the ease with which a pause is inserted, for example, the ratio of occurrences in which a certain combination of parts of speech is accompanied by a pause to all occurrences of a combination of parts of speech in the sentence speech database may be used. Below,
This ratio will be called the pause insertion probability. When considering combinations of parts of speech, there are classification methods such as nouns, verbs, and adverbs. In addition, it is also useful to classify parts of speech that are inflected, such as verbs, adjectives, and auxiliary verbs, into conjugated forms such as natural form, conjunctive form, and adjunctive form.

また、実際にポーズ挿入位置を決定する方法としては、
例えば(文献1)で示されるような方法が考えられる。
Also, as a method to actually determine the pose insertion position,
For example, a method as shown in (Reference 1) can be considered.

すなわち、ある単語境界におけるポーズ挿入確率と当該
境界の前後の単語の長さ(例えば、モーラ数)とから算
出される値、例えばポーズ挿入確率とモーラ数との積な
どを評価尺度とする。この評価尺度に閾値を設け、各境
界における評価尺度が閾値を越えているかどうかに応じ
て、当該境界にポーズを挿入するかどうかの判断を行う
That is, a value calculated from the pause insertion probability at a certain word boundary and the length of words before and after the boundary (for example, the number of moras), such as the product of the pause insertion probability and the number of moras, is used as an evaluation measure. A threshold is set for this evaluation scale, and depending on whether the evaluation scale at each boundary exceeds the threshold, it is determined whether or not to insert a pause at the boundary.

あるいは、音声に変換すべき文章の長さから、はじめに
ポーズをいくつ挿入するがを定めておき、前記ポーズ挿
入確率の大きい境界がら順にポーズを挿入していくこと
もできる。
Alternatively, the number of pauses to be inserted may be determined first based on the length of the sentence to be converted into speech, and the pauses may be inserted in descending order of the probability of pause insertion.

以上のような手法を用いることにより、音声に変換する
文章の係り受け構造は用いずに、形態素解析で得られる
単語の品詞を用いるだけで、自然なポーズ挿入位置を決
定することが可能となる。
By using the above method, it becomes possible to determine a natural pause insertion position by simply using the part of speech of the word obtained through morphological analysis, without using the dependency structure of the sentence to be converted into speech. .

(実施例) 第1図は、本発明によるポーズ挿入位置決定方式を実現
するための一実施例を示すブロック図である。
(Embodiment) FIG. 1 is a block diagram showing an embodiment for realizing a pose insertion position determination method according to the present invention.

まず、音声に変換すべき文字列を、文字列入力端子1l
から入力する。入力された前記文字列は形態素解析部l
2に送られ、入力文字列で表される文章を単語列に分解
し、各単語の品詞や読みを決定する。この品詞や読みを
伴った単語列は、単語長算出部13及びポーズ挿入位置
決定部15に送られる。単語長算出部13では、前記単
語列が与えられると、単語の読みを用いて文章を構戊し
ている各単語の長さ(例えばモーラ数)を算出し、ポー
ズ挿入位置決定部15に送る。
First, input the character string to be converted into speech into the character string input terminal 1l.
Enter from. The input character string is sent to the morphological analysis unit l
2, the sentence represented by the input character string is broken down into word strings, and the part of speech and pronunciation of each word are determined. This word string with part of speech and pronunciation is sent to the word length calculation section 13 and the pause insertion position determination section 15. When the word string is given, the word length calculation section 13 calculates the length of each word (for example, the number of moras) that makes up the sentence using the pronunciation of the words, and sends it to the pause insertion position determination section 15. .

ポーズ挿入確率記憶部14には、境界前後の単語の品詞
の組み合せごとのポーズ挿入確率が予め蓄えられている
。ポーズ挿入位置決定部15は、前記品詞を伴った単語
列にしたがって、各境界の前後にある単語の品詞の組み
合せに対応した、前記ポーズ挿入確率記憶部14に蓄え
られているポーズ挿入確率を読み出す。読み出された前
記ポーズ挿入確率と、前記単語長算出部13で算出され
た各単語の長さとを用いて、(作用)の項で説明したよ
うな手法によってポーズを挿入する位置を決定し、結果
をポーズ挿入位置出力端子16から出力する。
The pause insertion probability storage unit 14 stores in advance the pause insertion probability for each combination of parts of speech of words before and after the boundary. The pause insertion position determination unit 15 reads out the pause insertion probability stored in the pause insertion probability storage unit 14, which corresponds to the combination of parts of speech of words before and after each boundary, according to the word string with the part of speech. . Using the read pause insertion probability and the length of each word calculated by the word length calculation unit 13, determine the position where the pause is inserted by the method described in the (effect) section, The result is output from the pause insertion position output terminal 16.

(発明の効果) 以上述べてきたように、本発明は、正しく解析すること
が困難な係り受け解析の結果を用いることなしに、ポー
ズの挿入位置を決定する。このため、従来方式よりも正
しく、自然なポーズの挿入位置を決定することが可能と
なる。したがって、本発明は、文字列で与えられた任意
の文章を音声に変換する音声合戒装置等におけるポーズ
挿入位置決定方式として有効である。
(Effects of the Invention) As described above, the present invention determines the pose insertion position without using the results of dependency analysis, which is difficult to analyze correctly. Therefore, it is possible to determine a more accurate and natural pose insertion position than the conventional method. Therefore, the present invention is effective as a pause insertion position determination method in a voice command device or the like that converts an arbitrary sentence given as a character string into voice.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は、本発明の一実施例を示すブロック図である。 図において、12・・・形態素解析部、13・・・単語
長算出部、14・・・ポーズ挿入確率記憶部、15・・
・ポーズ挿入位置決定部である。
FIG. 1 is a block diagram showing one embodiment of the present invention. In the figure, 12... Morphological analysis unit, 13... Word length calculation unit, 14... Pause insertion probability storage unit, 15...
・This is a pose insertion position determination unit.

Claims (1)

【特許請求の範囲】[Claims] 入力された文字列で表される文章をそれを構成する単語
に分解し、分解された各単語の境界にポーズを挿入する
かどうかを判定するポーズ挿入位置決定方式において、
単語境界にどの程度ポーズが挿入され易いかを表す数値
を単語境界の前後にある単語の品詞の組み合せごとに記
憶しておき、各単語の境界の前後にある単語の品詞の組
合せに対応する数値と当該境界の前後の単語の長さとを
用いて当該単語境界にポーズを挿入するか否かを定める
ことを特徴とするポーズ挿入位置決定方式。
In a pause insertion position determination method that decomposes a sentence represented by an input character string into its constituent words and determines whether to insert a pause at the boundary of each decomposed word,
A numerical value indicating how easily a pause is inserted at a word boundary is memorized for each combination of parts of speech of words before and after the word boundary, and a numerical value corresponding to the combination of parts of speech of words before and after each word boundary is stored. and the lengths of words before and after the boundary to determine whether or not to insert a pause at the word boundary.
JP01194970A 1989-07-26 1989-07-26 Pose insertion position determination device Expired - Fee Related JP3076047B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP01194970A JP3076047B2 (en) 1989-07-26 1989-07-26 Pose insertion position determination device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP01194970A JP3076047B2 (en) 1989-07-26 1989-07-26 Pose insertion position determination device

Publications (2)

Publication Number Publication Date
JPH0358097A true JPH0358097A (en) 1991-03-13
JP3076047B2 JP3076047B2 (en) 2000-08-14

Family

ID=16333374

Family Applications (1)

Application Number Title Priority Date Filing Date
JP01194970A Expired - Fee Related JP3076047B2 (en) 1989-07-26 1989-07-26 Pose insertion position determination device

Country Status (1)

Country Link
JP (1) JP3076047B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6523998B2 (en) 2016-03-14 2019-06-05 株式会社東芝 Reading information editing apparatus, reading information editing method and program

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5847719A (en) * 1981-09-11 1983-03-19 Hitachi Ltd Assortment controller
JPS59123889A (en) * 1982-12-29 1984-07-17 富士通株式会社 Voice editing/synthesization processing system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5847719A (en) * 1981-09-11 1983-03-19 Hitachi Ltd Assortment controller
JPS59123889A (en) * 1982-12-29 1984-07-17 富士通株式会社 Voice editing/synthesization processing system

Also Published As

Publication number Publication date
JP3076047B2 (en) 2000-08-14

Similar Documents

Publication Publication Date Title
US6751592B1 (en) Speech synthesizing apparatus, and recording medium that stores text-to-speech conversion program and can be read mechanically
US5475796A (en) Pitch pattern generation apparatus
Chu et al. Locating boundaries for prosodic constituents in unrestricted Mandarin texts
JP4745036B2 (en) Speech translation apparatus and speech translation method
JPH086591A (en) Voice output device
Allen Some suprasegmental contours in French two-year-old children’s speech
JPH0358097A (en) Determination system for pause insertion position
KR100499116B1 (en) Method and apparatus for prosodic phrasing for speech synthesis
JP3001210B2 (en) Pose insertion position determination device
Defina et al. Scaling processes of clause chains in Pitjantjatjara
Stoel The intonation of Banyumas Javanese
JP3142160B2 (en) Phonetic symbol generator
JPH0962286A (en) Voice synthesizer and the method thereof
JPH05134691A (en) Method and apparatus for speech synthesis
O'Shaughnessy Fundamental frequency by rule for a text-to-speech system
JP2748445B2 (en) Pause insertion position determination method
JPH03225400A (en) Pause length determining system
Fennell Searching for Phonological Amelioration
Zemirli et al. An effective model of stressing in an Arabic Text To Speech System
Gwóźdź Acoustic Cues in the Disambiguation of Polysemous Strings
Kawa et al. Development of a text-to-speech system for Japanese based on waveform splicing
Çöltekin Units in segmentation: A computational investigation
KR0180650B1 (en) Sentence analysis method for korean language in voice synthesis device
KR100959494B1 (en) Voice Synthesizer and Its Method using Processing Not registered Word
Perry et al. Syllable timing and pausing: evidence from Cantonese

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080609

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090609

Year of fee payment: 9

LAPS Cancellation because of no payment of annual fees