JPS63182699A

JPS63182699A - Word reading information storage dictionary

Info

Publication number: JPS63182699A
Application number: JP62014591A
Authority: JP
Inventors: 坂井　信輔
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1987-01-23
Filing date: 1987-01-23
Publication date: 1988-07-27

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、音声合成装置等に用いられ、単語の読み情報
を格納する辞書に関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a dictionary that is used in a speech synthesis device and the like and stores word pronunciation information.

（従来の技術）従来、音声の規則合成のためのテキスト解析処理におい
て、入力テキストを解析するために各単語の正書法表記
９品詞、読み、アクセントなどの情報を格納した単語辞
書が用いられている。たとえば、福島等による情報処理
学会日本語文書処理研究会資料２−４　（１９８５）　
、　ｒ盲人用読書器における文音声変換のための文章解
析」においては、漢字などを見出しとし、読みやアクセ
ントあるいは品詞等の情報を有する辞書を用い、品詞の
接続表を参照しながら入力テキストから語句の列に変換
し、その読みやアクセント等の情報と構文情報から構成
される音声合成情報を出力している。(Prior art) Conventionally, in text analysis processing for rule-based synthesis of speech, a word dictionary that stores information such as the orthographic notation of each word, 9 parts of speech, pronunciation, accent, etc. is used to analyze input text. . For example, Information Processing Society of Japan Japanese Document Processing Study Group Material 2-4 (1985) by Fukushima et al.
, ``Sentence Analysis for Sentence to Speech Conversion in Readers for the Blind'' uses kanji as headings, uses a dictionary with information such as pronunciations, accents, and parts of speech, and analyzes input text while referring to connection tables of parts of speech. It converts it into a string of words and outputs speech synthesis information consisting of information such as pronunciation and accent, and syntax information.

（発明が解決しようとする問題点）しかし、音声の規則合成によって明瞭な合成音声を生成
しようとすると、各音素、特に子音を強くはっきりと合
成しなければならない。とくに子音が重要な理由は、母
音はもともとその継続時間長も長くパワーも大きく、さ
らにその種類も少ないから、相互に紛れることが少ない
のに対し、子音はパワーが小さい上に種類も多いので、
相互に紛れやすいからである。ところが、従来の例に見
られるような音声の規則合成では、明瞭性を向上させる
ために各音素、特に子音の強さ、パワーを大きくしたり
時間長を長くしたりして強調すると、−音一音が強くな
るので滑らかさに欠け、耳障りで不自然な合成音になり
、長時間聞くと疲れるという問題点があった。(Problems to be Solved by the Invention) However, in order to generate clear synthesized speech by regular speech synthesis, each phoneme, especially consonants, must be synthesized strongly and clearly. The reason why consonants are especially important is that vowels have long durations and high power, and there are few types of them, so they are less likely to be confused with each other, whereas consonants have low power and many types.
This is because they are easily confused with each other. However, in the conventional method of speech synthesis, when each phoneme, especially a consonant, is emphasized by increasing its strength or power or lengthening its duration in order to improve clarity, the - sound The problem was that because each note became stronger, it lacked smoothness, resulting in a harsh and unnatural synthesized sound, and it became tiring to listen to for a long time.

本発明の目的は、この問題点を解決した、相互に紛れや
すい場合のみ子音を強調することを指示する情報を与え
る辞書を提供することにある。An object of the present invention is to provide a dictionary that solves this problem and provides information for instructing to emphasize consonants only when they are likely to be confused with each other.

（問題点を解決するための手段）前述の問題点を解決するために本発明が提供する手段は
、少なくとも音声記号の系列からなる音声表記を有する
辞書項目からなる単語読み情報格納辞書であって、ある
辞書項目の音声表記が、他のある辞書項目の音声表記と
比較して、音声記号系列中の多くの対応する位置では同
一の音声記号をもち、対応するいくつかの位置では、異
なるが類似する音声記号をもつ場合、前記ある辞書項目
に、異なるが類似する音声記号の音声表記中での位置の
情報が付加してあることを特徴としている。(Means for Solving the Problems) Means provided by the present invention in order to solve the above-mentioned problems is a word reading information storage dictionary consisting of dictionary entries having phonetic transcriptions consisting of at least a series of phonetic symbols. , the phonetic transcription of a certain dictionary entry has the same phonetic symbol in many corresponding positions in the phonetic symbol sequence, and differs in some corresponding positions, compared to the phonetic transcription of some other dictionary item. In the case of having similar phonetic symbols, the certain dictionary entry is characterized in that information about the position of the different but similar phonetic symbol in the phonetic notation is added.

（作用）本発明は、テキスト解析部においては、辞書を引さなが
ら入力文字列を解析した結果、音声合成部に対し、従来
のように語句の読みやアクセント等の情報の他に、合成
しようとする語句に対して紛れ易い語句が存在するか否
か、かつ存在するならば、どの音素が紛れやすいかとい
う情報を送り、その情報に基づいて、音声合成部におい
て、子音などの強調の度合いを制御しながら音声を合成
することを可能にする辞書を提供するものである。(Function) In the present invention, as a result of analyzing an input character string while referring to a dictionary, the text analysis section sends information such as pronunciation and accent of the word to the speech synthesis section. It sends information on whether there are any words or phrases that are easily confused with each other, and if so, which phonemes are likely to be confused with each other.Based on that information, the speech synthesis section adjusts the degree of emphasis on consonants, etc. The present invention provides a dictionary that makes it possible to synthesize speech under control.

たとえば、辞書に登録される薄型に「歓待」／　ｋａ　
Ｎ　ｔａ　ｉ／と「乾杯Ｊ　／ｋａ　Ｎ　ｐａ　ｉ／が
存在する場合を考える。斜線で囲まれた表現は、音声記
号の列により表された、各単語の発音である。子音ｈ／
と／ｐ／は類似する音声記号であると予め定義されてい
るとすると、この２つの単語は、それぞれ第３音節の子
音部に類似する音声記号を持ち、他の位置では、同一の
音声記号を持つので、類似音声記号の存在を示すマーカ
を“＊゛′とするとこれらの単語の音声表記は、それぞ
れ／１ｃａＮｔ＊ａｉ／、／ｋａ　Ｎ　ｐ＊ａ　ｉ／と
表される。合成音声により出力されるべき入力文字列に
これらのいずれかの単語、例えば「歓待」が含まれてい
る場合、音声合成部は、マーカ“＊“′の存在により、
子音／Ｌ／のみを選択的に強調することが可能となる。For example, the word "hospitality" / ka is added to the dictionary.
Consider the case where N ta i/ and ``Cheers J /ka N pa i/'' exist.The expression surrounded by diagonal lines is the pronunciation of each word represented by a string of phonetic symbols.The consonant h/
Given that and /p/ are predefined as similar phonetic symbols, these two words each have similar phonetic symbols in the consonant part of the third syllable, and identical phonetic symbols in other positions. Therefore, if the marker indicating the existence of similar phonetic symbols is "*゛', the phonetic notation of these words will be expressed as /1caNt*ai/ and /ka N p*a i/, respectively.By synthesized speech, If the input character string to be output contains any of these words, for example "hospitality", the speech synthesis unit will
It becomes possible to selectively emphasize only the consonant /L/.

（実施例）以下、図面を参照しつつ、本発明を一層詳しく説明する
。第１図は本発明の一実施例の作成に用いられる。ａ語
１歓待」、′乾杯」に対する入力辞書項目を示す図であ
る。第２図〜第４図は、その入力辞書項目に対して発音
の似通った辞書項目の類似音声記し・の位置を示す情報
を付加する処理の流れをあられずフローチャートである
。(Example) Hereinafter, the present invention will be described in more detail with reference to the drawings. FIG. 1 is used to create one embodiment of the present invention. FIG. 3 is a diagram showing input dictionary entries for "a word 1 hospitality" and "cheers"; 2 to 4 are flowcharts showing the flow of processing for adding information indicating the position of a similar phonetic notation of a dictionary item with similar pronunciation to the input dictionary item.

まず、類似音声記号のセットを定義する（２０１．　）
例えば、／ｌ／と／ｐ／と／に／、また、／ｍ／と／ｎ
／、および、／ｂ／と／ｄ／と／ｇ／と／１−／が、類
似する音声記号であるとする場合、すべての類似音声記
号のセットの集合Ｓは、ｓ＝　（（／Ｐ／　　／ｌ／　　／に／）（／ｍ／　　
／ｎ／）（／ｂ／　　／ｄ／　　／ｇ／　　／ｒ／））
となる。First, define a set of similar phonetic symbols (201).
For example, /l/ and /p/ and /, also /m/ and /n
/, and /b/, /d/, /g/, and /1-/ are similar phonetic symbols, then the set S of all similar phonetic symbols is s= ((/P / /l/ /ni/) (/m/
/n/) (/b/ /d/ /g/ /r/))
becomes.

続いて、すべての入力辞書項目を音声表記によりソート
する（２０２）。このソーティングは、例えば、音声表
記をなす音声記号のアルファベット類に行なえば良い。Next, all input dictionary items are sorted by phonetic notation (202). This sorting may be performed, for example, in alphabetical order of phonetic symbols forming the phonetic notation.

ただし、この際、類似する音声記号は同一とみなしてソ
ートする。よって、例えば、第１図の「歓待」１０１お
よび「乾杯」１０２は、発音の似通った辞書項目が連続
した並びの中に現れる。ここで、発音の似通った辞書項
目とは、音声表記をなす音声記号系列中の対応するいく
つかの位置に類似する音声記号をもち、かつ音声記号系
列中の他の対応する位置では同一の音声記号をもつよう
な複数の辞書項目のことである。However, in this case, similar phonetic symbols are considered to be the same and sorted. Therefore, for example, "hospitality" 101 and "cheers" 102 in FIG. 1 appear in a series of dictionary items with similar pronunciations. Here, dictionary entries with similar pronunciations are those that have phonetic symbols that are similar at some corresponding positions in the phonetic symbol sequence that constitutes the phonetic notation, and that have the same pronunciation at other corresponding positions in the phonetic symbol sequence. It refers to multiple dictionary entries that have symbols.

２０２までの処理により、入力辞書項目は、発音の似通
った辞書項目が連続して現れるようにならべかえられて
いる。そこで、発音の似通った辞書項目の並びごとに１
つのグループとして、各グループに対して、第３図、第
４図の処理を行なう（２０４）。Through the processing up to step 202, the input dictionary items are rearranged so that dictionary items with similar pronunciations appear consecutively. Therefore, for each sequence of dictionary items with similar pronunciation,
The processing shown in FIGS. 3 and 4 is performed for each group as one group (204).

第３図は、１つのグループに対する処理の流れを表して
いる。発音の似通った辞書項目の定義により、１つのグ
ループ内の各辞書項目は、音声表記に同一数の音声記号
を持つ。この数をＭＡＸ−ＰＨＯＮＥとする（３０１　
）。ｉを１からＭＡＸ−Ｐ）ＩＯＮＥまでの各整数値に
順に設定して、第４図の処理を行なう（３０２）。FIG. 3 shows the flow of processing for one group. By the definition of phonetically similar dictionary entries, each dictionary entry within a group has the same number of phonetic symbols in its phonetic transcription. Let this number be MAX-PHONE (301
). The process shown in FIG. 4 is performed by sequentially setting i to each integer value from 1 to MAX-P)ION (302).

第４５！Ｊは、１つのグループのｉ番目の音声記号につ
いて行なわれる処理を表わす。グループに含まれる各単
語の音声表記のｉ番目の音声記号がすべて同一ならば、
ｉ番目の音声記号はそのままであるが、そうでなげれば
、マーカ“＊”が直後に挿入される。45th! J represents the processing performed on the i-th phonetic symbol of one group. If the i-th phonetic symbol of the phonetic transcription of each word included in the group is all the same, then
The i-th phonetic symbol remains as it is, but if it is not, a marker "*" is inserted immediately after it.

以上の処理により、例えば、第１図の入力辞書項目は、
第５図の出力辞書項目に見られるように、類似する音声
記号が存在する音声記号の後に、マーカ“＊°′を挿入
される。As a result of the above processing, for example, the input dictionary entry in FIG.
As seen in the output dictionary entry of FIG. 5, a marker "*°" is inserted after a phonetic symbol for which a similar phonetic symbol exists.

（発明の効果）以上に述べたように、本発明によれば、相互に紛れやす
い発音の似通った辞書項目に対して、類似音声記号の位
置を示す情報が付加され、この情報に基づいて、音声合
成部において、子音などの強調の度合いを制御しながら
音声を合成することが可能となる。また、本発明により
構成された辞書は、音声認識において、相互に紛れやす
い発音の似通った単語候補間の識別にも用いることが可
能である。(Effects of the Invention) As described above, according to the present invention, information indicating the position of similar phonetic symbols is added to dictionary items with similar pronunciations that are easily confused with each other, and based on this information, In the speech synthesis section, it becomes possible to synthesize speech while controlling the degree of emphasis of consonants and the like. Furthermore, the dictionary configured according to the present invention can be used in speech recognition to identify word candidates with similar pronunciations that are likely to be confused with each other.

[Brief explanation of the drawing]

第１図は、本発明の一実施例の作成に用いられる単語「
歓待」、「乾杯」に対する入力辞書項目を示す図である
。第２図〜第４図はその入力辞書項目に対して発音の似
通った辞書項目の類似音声記号の位置を示す情報を付加
する処理の流れを表わすフローチャートである。第５図
は、本発明の一実施例である辞書の項目を示し、第２図
〜第４図の処理により類似音声記号の位置を示す情報を
付加された、単語「歓待」、「乾杯」に対する出力辞書
項目の図である。１０１・・・単語「歓待」に対する入力辞書項目、１０
２・・・単語「乾杯」に対する入力辞書項目、５０１・
・・単語「歓待」に対する出力辞書項目、５０２・・・
単語「乾杯」に対する出力辞書項目。FIG. 1 illustrates the words “
FIG. 4 is a diagram showing input dictionary entries for "Hospitality" and "Cheers." FIGS. 2 to 4 are flowcharts showing the flow of processing for adding information indicating the position of similar phonetic symbols of dictionary items with similar pronunciation to the input dictionary item. FIG. 5 shows the entries of a dictionary that is an embodiment of the present invention, and the words "hospitality" and "cheers" are added with information indicating the positions of similar phonetic symbols through the processing of FIGS. 2 to 4. FIG. 3 is a diagram of output dictionary entries for . 101... Input dictionary entry for the word "hospitality", 10
2... Input dictionary entry for the word "cheers", 501.
...Output dictionary entry for the word "hospitality", 502...
Output dictionary entry for the word "cheers".

Claims

[Claims]

In a word reading information storage dictionary consisting of dictionary items each having a phonetic notation consisting of at least a series of phonetic symbols, the phonetic notation of a certain dictionary item has a higher number of phonetic notations in the phonetic symbol series than the phonetic notation of another dictionary item. When corresponding positions have the same phonetic symbol and several corresponding positions have different but similar phonetic symbols, the certain dictionary entry contains information about the position in the phonetic transcription of the different but similar phonetic symbols. A dictionary for storing word reading information, characterized in that: is added.