JP3201329B2

JP3201329B2 - Speech synthesizer

Info

Publication number: JP3201329B2
Application number: JP02512798A
Authority: JP
Inventors: 敬子稲垣; 幸夫三留
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1998-01-22
Filing date: 1998-01-22
Publication date: 2001-08-20
Anticipated expiration: 2018-01-22
Also published as: JPH11212586A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、任意のテキストに
基づいて合成音声を生成する音声合成装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech synthesizer for generating a synthesized speech based on an arbitrary text.

【０００２】[0002]

【従来の技術】任意の漢字かな混じりのテキストを音声
に変換するテキスト音声合成装置では、入力された漢字
かな混じりの日本語の文に読みやアクセント、ポーズ、
入力文を読み上げる際のリズムやイントネーションを決
定し、それを基に予め蓄えておいた単位音声を編集して
音声信号を生成する。2. Description of the Related Art In a text-to-speech synthesizer for converting an arbitrary kanji-kana mixed text into voice, a reading, accent, pause,
The rhythm and intonation for reading out the input sentence are determined, and based on the rhythm and intonation, the unit voice stored in advance is edited to generate a voice signal.

【０００３】入力された文は、まず、形態素解析により
入力された文の単語を１語１語分割し、辞書に登録され
ている読みと品詞等の文法情報をつける。続いて、分割
された単語を意味的、文法的なまとまりで結合し、複合
語を作成する。この辞書は、漢字かな混じりの単語辞
書、漢字１文字ずつの単漢字辞書、漢字以外の未登録語
に対し読みを付与するための未登録語辞書などから構成
されている。In an input sentence, words of the input sentence are first divided into words one by one by morphological analysis, and grammatical information such as reading and part of speech registered in a dictionary is added. Subsequently, the divided words are combined in a semantic and grammatical unit to create a compound word. This dictionary is composed of a word dictionary containing kanji and kana, a single kanji dictionary for each kanji, an unregistered word dictionary for giving a reading to unregistered words other than kanji, and the like.

【０００４】形態素解析では、まず単語辞書を用いて検
索が行なわれ、それぞれに対し、辞書中の読みと品詞情
報が付与される。また、検索に失敗した文字列は未登録
語として処理され、１文字１文字に対し単漢字辞書と未
登録語用辞書を用いて読みが付与される。未登録語の品
詞情報は、前後の品詞から確定できる時には付与され
る。In the morphological analysis, a search is first performed using a word dictionary, and reading and part-of-speech information in the dictionary are given to each word. A character string that has failed to be searched is processed as an unregistered word, and a reading is given to each character by using a single kanji dictionary and a dictionary for unregistered words. The part-of-speech information of an unregistered word is added when it can be determined from the preceding and following parts of speech.

【０００５】ところで、音声合成装置は高性能化と共に
小型化が進み、辞書や音声データの容量はできるだけ小
さくすることが望まれている。辞書の小型化のためにエ
ントリを削除してしまうのは、安易ではあるが、読み上
げ精度が低下してしまうため現実的ではない。そこで、
読み上げ精度を下げることなく、辞書の容量をコンパク
トにした合成音声装置の開発が進んでいる。このような
音声合成装置の一例が、特開平３−２０４６９８号公報
に開示されている。この従来例では、漢字表記の読みの
検索情報を有する単語辞書と、漢字１文字毎のかな表記
の情報を有する単漢字辞書とを備え、単語辞書はかな表
記を持たず、単漢字辞書内の対応するかな表記のアドレ
スのみを記すことにより、単語辞書の容量を減らしてい
る。[0005] By the way, the speech synthesizer has been downsized along with its high performance, and it is desired that the capacity of the dictionary and the speech data be as small as possible. Deleting an entry to reduce the size of the dictionary is easy, but is not realistic because the reading accuracy is reduced. Therefore,
Development of a synthesized speech device in which the dictionary capacity is made compact without lowering the reading accuracy is progressing. One example of such a speech synthesizer is disclosed in Japanese Patent Laid-Open Publication No. Hei 3-204698. In this conventional example, a word dictionary having search information for reading a kanji notation and a single kanji dictionary having information on kana notation for each kanji character are provided. By writing only the corresponding kana address, the size of the word dictionary is reduced.

【０００６】[0006]

【発明が解決しようとする課題】上述のように、従来の
音声合成装置では、単語辞書と単漢字辞書を用いて入力
された文に対して形態素解析を行ない、各単語の候補を
同定すると共に、同定した単語候補の品詞等の文法情報
やアクセントを求めている。この単漢字辞書には、漢字
１文字で読む場合の読みだけでなく、複合語の時のみに
使われる読みも登録されているため辞書の容量が大き
い。また、形態素解析で未登録語と判定された文字列に
対し読みを付与する時には、この単漢字辞書を利用する
が、読みが複数あるため選択が難しい。As described above, the conventional speech synthesizer performs a morphological analysis on a sentence using a word dictionary and a single kanji dictionary to identify each word candidate, and And grammatical information such as the part of speech of the identified word candidate and accents. In this single kanji dictionary, not only the reading when reading with one kanji but also the reading that is used only for a compound word is registered, so the dictionary capacity is large. In addition, when adding a reading to a character string determined to be an unregistered word by morphological analysis, this single kanji dictionary is used, but selection is difficult because there are a plurality of readings.

【０００７】そこで、本発明の目的は、単漢字が複合語
の時のみに使われる読みをルール化することで単漢字辞
書の容量を小型化し、かつ、読み付けルールを用いて、
未登録語として判定された単漢字の読みを推定すること
ができる音声合成装置を提供することにある。Therefore, an object of the present invention is to reduce the capacity of a single-kanji dictionary by making rules for reading that are used only when the single-kanji is a compound word, and by using reading rules.
An object of the present invention is to provide a speech synthesizer that can estimate the reading of a single kanji character determined as an unregistered word.

【０００８】[0008]

【課題を解決するための手段】前述の課題を解決するた
め本発明による音声合成装置は、対のような特徴的構成
を有する。（１）入力された漢字仮名混じりの日本語のテキストに
対し、辞書を用いて形態素解析を行ない発音情報を生成
し、前記発音情報に基づいて音声を合成するテキスト音
声合成装置であり、前記辞書が読みの代わりに読みの検
索情報と読み付けルール検索情報とを有する単語辞書
と、漢字１文字毎の読み情報を有する単漢字辞書と、複
合語の読みとアクセントを決定する読み付けルールと、
記号、アルファベット、仮名など、前記単漢字辞書に登
録されていない１文字が未登録語として検出された時に
読みを付与する未登録語辞書とを備え、前記単漢字辞書
に個々の漢字の読みとともに音訓情報を保持し、前記形
態素解析部において未登録語と判断された部分の読み付
けを行なう際に、隣接する漢字の音訓情報に未登録語単
漢字の音訓情報を合わせることで未登録語の読みを推定
する音声合成装置。In order to solve the above-mentioned problems, a speech synthesizing apparatus according to the present invention has a pair of characteristic configurations. (1) A text-to-speech synthesizer that performs morphological analysis on an input Japanese text mixed with kanji and kana to generate pronunciation information by using a dictionary and synthesizes speech based on the pronunciation information. A word dictionary having search information of reading and reading rule search information instead of reading, a single kanji dictionary having reading information for each kanji character, a reading rule for determining reading and accent of compound words,
An unregistered word dictionary that gives a reading when one character that is not registered in the single kanji dictionary, such as a symbol, an alphabet, or a kana, is detected as an unregistered word. When holding the phonological information and reading the part determined as an unregistered word in the morphological analysis unit, the phonological information of the unregistered word single kanji is matched with the phonological information of the adjacent kanji to obtain the unregistered word. A speech synthesizer that estimates readings.

【０００９】（２）上記（１）の音声合成装置におい
て、前記単漢字辞書に個々の漢字の読みとともに日本語
テキスト中の出現頻度情報を保持し、前記形態素解析部
において未登録語と判断された部分の読み付けを行なう
際に、頻度の高い読みを選択することで未登録語の読み
を推定する音声合成装置。(2) In the speech synthesizer of (1), the single kanji dictionary holds the reading frequency of each kanji and the appearance frequency information in the Japanese text, and the morphological analysis unit determines that the word is an unregistered word. A speech synthesizer that estimates a reading of an unregistered word by selecting a frequent reading when reading a part that has been set.

【００１０】（３）入力されたテキストを形態検素解析
により単語に分割し、単語毎に読み検索情報とルール、
品詞情報に基づいて分割された単語の読みとアクセント
位置を決定し、未登録語と判定された文字列には単漢字
辞書と未登録語辞書を用いて読みを付与し、前記分割さ
れ、読みと品詞情報が付与された単語は、意味的、文法
的なまとまりで結合し、複合語を作成し、前記複合語の
アクセントをアクセント結合規則を用いて推定し、これ
らすべての情報から発音記号を生成し、前記入力テキス
トを読み上げる際のリズムやイントネーションなどの韻
律を決定し、この決定された韻律パタンに従って予め蓄
えておいた音声を編集して音声信号を生成する際に、単
漢字辞書中の読みを決定し、単語辞書から該当する単漢
字を含むエントリを全て検索し、その読みに対する読み
付けルールと、前後音素環境を調べ、統計を取り、出現
頻度の多いルールから順に、一致条件を調べ、一致して
いるものを採用することを特徴とする音声合成装置。(3) The input text is divided into words by morphological analysis, and the read search information and rules are read for each word.
The pronunciation and accent position of the words divided based on the part of speech information are determined, and a character string determined as an unregistered word is given a reading using a single kanji dictionary and an unregistered word dictionary. And words with part-of-speech information are combined in a semantic and grammatical unit to form a compound word, the accent of the compound word is estimated using accent combining rules, and phonetic symbols are extracted from all the information. Generate and determine the prosody such as rhythm and intonation when reading the input text, and edit the voice stored in advance according to the determined prosody pattern to generate a voice signal. Decide the reading, search all entries containing the corresponding single kanji from the word dictionary, check the reading rules for that reading and the surrounding phoneme environment, collect statistics, and find rules with high frequency of appearance In order from examines the match condition, the speech synthesis apparatus characterized by employing for a match.

【００１１】（４）前記アクセント結合規則は、アクセ
ントの変化の規則性に基づいてルール化したものを用い
る上記（３）の音声合成装置。(4) The speech synthesizing apparatus according to (3), wherein the accent combination rule is a rule based on the regularity of change in accent.

【００１２】（５）入力文字列が未登録語と判定された
場合には、単漢字辞書の音訓情報を用いて未登録語の読
みを推定する上記（３）の音声合成装置。(5) The speech synthesizing apparatus according to (3), wherein when the input character string is determined to be an unregistered word, the reading of the unregistered word is estimated by using the phonetic information of the single kanji dictionary.

【００１３】（６）入力文字列が未登録語と判定された
場合には、予め単漢字辞書の個々の読みに出現頻度情報
を持たせておき、前記単漢字辞書を検索して、当該漢字
の読みが複数あった場合には、最も出現頻度の高い読み
を付与する上記（３）の音声合成装置。(6) If the input character string is determined to be an unregistered word, the individual kanji dictionary is provided with appearance frequency information in advance, and the single kanji dictionary is searched. The speech synthesizer according to (3), wherein when there are a plurality of readings, the reading with the highest appearance frequency is given.

【００１４】（７）入力文字列が未登録語と判定された
場合には、予め単漢字辞書の個々の読みに音訓情報と出
現頻度情報を持たせておき、単漢字の読みが複数あった
場合には、隣接する単語の音訓情報を用いて候補を絞
り、その中から頻度の最も高いものを選ぶ上記（３）の
音声合成装置。(7) If the input character string is determined to be an unregistered word, individual readings of the single kanji dictionary are given in advance with onnology information and appearance frequency information, and there are a plurality of single kanji readings. In such a case, the speech synthesis apparatus according to (3), wherein candidates are narrowed down using the sound training information of an adjacent word, and a candidate having the highest frequency is selected from the candidates.

【００１５】（８）前記一致条件は、単語辞書の統計を
取る際に、有声無声の一致、音素の一致、もしくは調音
の一致などで分類し、ルール毎に決めておく上記（３）
の音声合成装置。(8) The above-mentioned matching conditions are classified by voiced / unvoiced match, phoneme match, articulatory match or the like when statistics of a word dictionary are obtained, and are determined for each rule.
Voice synthesizer.

【００１６】[0016]

【００１７】[0017]

【００１８】[0018]

【発明の実施の形態】以下、本発明の実施の形態につい
て添付図面を参照しながら説明する。図１は、本発明に
よる音声合成装置の実施形態の基本構成図である。図１
において、辞書１は、入力テキストに対しテキスト解析
を行なう時に用いられる。テキスト解析部２は、テキス
ト解析を行なって単語の同定、読みとアクセントの付
与、ポーズ位置等の発音情報を生成する。音声合成部３
は、発音情報を基に音声信号を生成する。辞書１は、単
語の表記と読みのかわりの読み検索情報と品詞情報が登
録されている単語辞書、個々の漢字の読みが登録されて
いる単漢字辞書、単漢字以外の文字が未登録語として検
出された時に読みを付与する未登録語辞書、アクセント
変化や読みの変化をルール化した読み付けルール等から
構成されている。Embodiments of the present invention will be described below with reference to the accompanying drawings. FIG. 1 is a basic configuration diagram of an embodiment of a speech synthesis device according to the present invention. FIG.
The dictionary 1 is used when performing text analysis on an input text. The text analysis unit 2 performs text analysis to generate pronunciation information such as identification of words, addition of reading and accent, and pause positions. Voice synthesis unit 3
Generates an audio signal based on pronunciation information. Dictionary 1 is a word dictionary in which reading search information and part-of-speech information in place of word notation and reading are registered, a single kanji dictionary in which readings of individual kanji are registered, and characters other than single kanji are unregistered words. It is composed of an unregistered word dictionary that gives a reading when detected, a reading rule that rules accent changes and reading changes, and the like.

【００１９】図２は、本発明の第２の実施形態における
単語辞書の構成を示している。単語辞書には、各エント
リに対しその表記と、読みの検索情報、読み付けルー
ル、品詞が登録されている。また、単漢字辞書には、個
々の漢字の読みとそれぞれのアドレスが登録されてお
り、単語辞書の読みの検索情報にはこのアドレスが記さ
れている。例えば、単語辞書には「法則」という表記
と、「法」に対する単漢字辞書の読み「ホウ」のアドレ
スと「則」に対する単漢字辞書の読み「ソク」のアドレ
スが記されている。単漢字辞書には、「法」の読みとし
て「ホウ」、「のり」等が記されている。FIG. 2 shows the configuration of a word dictionary according to the second embodiment of the present invention. The notation, reading search information, reading rules, and parts of speech are registered for each entry in the word dictionary. In addition, in the single kanji dictionary, readings of individual kanji and respective addresses are registered, and this address is described in search information of reading in the word dictionary. For example, in the word dictionary, the notation "law", the address of a single kanji dictionary reading "ho" for "law", and the address of a single kanji dictionary reading "soku" for "rule" are written. In the single kanji dictionary, "ho", "nori", and the like are described as readings of "law".

【００２０】読み付けルールには、単語のアクセント
型、複合語用に単漢字辞書の読みを変化させるための音
韻ルール、複合語のアクセントをシフトさせるためのア
クセントルールなどが記述されている。音韻ルールには
例えば、子音／ｈ／を／ｐ／や／ｂ／に置換するルール
を記述しておく。これら子音／ｈ／が／ｐ／に置換され
るのは、「日（ひ）」が「生年月日（せいねんがっ
ぴ）」では「ぴ」になったり「比（ひ）」が「反比例
（はんぴれい）」では「ぴ」になるのに対応することが
できる。その他にも音韻の削除や「っ」や「ん」の挿入
をルール化することで、単漢字辞書の読みが大幅に削減
できる。The reading rules describe an accent type of a word, a phonological rule for changing the reading of a single kanji dictionary for a compound word, an accent rule for shifting an accent of a compound word, and the like. For example, rules for replacing consonants / h / with / p / and / b / are described in the phoneme rules. The reason why these consonants / h / are replaced with / p / is that “day (hi)” becomes “ぴ” in “date of birth (seinengappu)” or “ratio (hi)” becomes “ Inverse proportion can correspond to "対応". In addition, it is possible to drastically reduce the reading of a single kanji dictionary by defining the deletion of phonemes and the insertion of "tsu" and "n".

【００２１】これらのルールは、例えば、図２の読み付
けルールテーブルように記述することができる。次に、
図２の単語辞書中にある「未来（みらい）」と「分布
（ぶんぷ）」を用いて、単漢字の読みに読み付けルール
を適応し、単語に正しい読みとアクセントを付与する方
法について述べる。These rules can be described, for example, as a reading rule table shown in FIG. next,
A method of applying reading rules to single kanji readings and giving correct readings and accents to words using “future (mirai)” and “distribution (bumpo)” in the word dictionary of FIG. 2 will be described.

【００２２】図２の単語辞書中の「Ａ１」は、アクセン
ト型（１型）を表し、「Ｐ」は、子音／ｈ／を／ｐ／に
置換することを表している。単語辞書中の「未来」に
は、それぞれの漢字の読み検索情報と、ルールが記述さ
れている。ルールは、単語のアクセントが１型であるこ
とを示しているため「みらい」の「み」にアクセントが
つくことがわかる。また、「分布」には読み付けルール
が「Ｐ」となっているため、「分布」を「ぶんふ」では
なく「ぶんぷ」と正しく読むことができる。"A1" in the word dictionary of FIG. 2 represents an accent type (type 1), and "P" represents replacing a consonant / h / with / p /. "Future" in the word dictionary describes the reading search information of each kanji and rules. Since the rule indicates that the accent of the word is type 1, it can be seen that "mi" of "mirai" has an accent. In addition, since the reading rule is “P” for “distribution”, “distribution” can be correctly read as “bump” instead of “bump”.

【００２３】次に、図１を用いて、本発明の音声合成装
置の動作を説明する。入力されたテキストは、テキスト
解析部２において、単語辞書１を参照しながら形態素解
析により単語の判定が行なわれる。単語辞書１には、単
語毎に読み検索情報とルール、品詞情報が付与されてお
り、これらを元に読みとアクセント位置が決定する。こ
こで、未登録語と判定された文字列には、単漢字辞書と
未登録語辞書を用いて読みが付与される。Next, the operation of the speech synthesizer of the present invention will be described with reference to FIG. The input text is subjected to morphological analysis to determine a word in the text analysis unit 2 while referring to the word dictionary 1. The word dictionary 1 is provided with reading search information, rules, and part of speech information for each word, and the reading and accent position are determined based on these. Here, a character string determined as an unregistered word is given a reading using a single kanji dictionary and an unregistered word dictionary.

【００２４】分割され、読みと品詞情報が付与された単
語は、意味的、文法的なまとまりで結合し、複合語を作
成する。複合語のアクセントは、例えば個々の単語のア
クセントとは異なるため、アクセント結合規則を用いて
推定する。このアクセント結合規則は、アクセントの変
化には規則性があることに着目し、それらを整理してル
ール化したものを用いることができる。また、文中のポ
ーズを挿入する位置を決定する方法としては、品詞ごと
のポーズの入り易さを利用する方法などを用いることが
できる。The words to which the reading and the part-of-speech information are added are combined in a semantic and grammatical unit to form a compound word. Since the accent of a compound word is different from, for example, the accent of an individual word, it is estimated using an accent combination rule. This accent combination rule focuses on the fact that accent changes have regularity, and can be used by organizing them into rules. In addition, as a method of determining a position at which a pause in a sentence is to be inserted, a method that utilizes the ease of entering a pause for each part of speech can be used.

【００２５】これらすべての情報から発音記号を生成
し、音声合成部３へ出力する。音声合成部３では、ま
ず、テキスト解析部２で生成した発音記号を基に入力テ
キストを読み上げる際のリズムやイントネーションなど
の韻律を決定する。次に韻律パタンに従って予め蓄えて
おいた音声を編集して音声信号を生成する。A phonetic symbol is generated from all the information and output to the speech synthesizer 3. The speech synthesis unit 3 first determines prosody such as rhythm and intonation when reading out the input text based on the phonetic symbols generated by the text analysis unit 2. Next, the voice stored in advance is edited according to the prosody pattern to generate a voice signal.

【００２６】テキスト解析部２で、未登録語と判定され
た文字列中に漢字が含まれている場合には、単漢字辞書
を用いて読みを付与する。しかし、図２に示すように、
単漢字辞書には、各漢字に対し複数の読みが登録されて
いるため、これらのうちどれかを選択する必要がある。
そこで、第２の発明では、予め単漢字辞書に音訓情報を
持たせておき、それらを用いて未登録語単漢字の読みを
推定する。If the text analysis unit 2 includes a kanji in a character string determined as an unregistered word, a reading is given using a single kanji dictionary. However, as shown in FIG.
In the single kanji dictionary, a plurality of readings are registered for each kanji, and it is necessary to select one of them.
Therefore, in the second invention, the on-kanji information is previously stored in the single-kanji dictionary, and the reading of the unregistered word single-kanji is estimated using them.

【００２７】図３は、本発明の第３の実施形態における
単漢字辞書の実施形態例である。入力文字列が未登録語
と判定された時には、単漢字辞書の音訓情報を用いて未
登録語の読みを推定する。例えば、「音韻継続時間長
（おんいんけいぞくじかんちょう）」という文字列が入
力され、「音韻継続時間」までは、形態素解析によりそ
れぞれ「音韻」、「継続」、「時間」と単語の判定が行
なわれ、「長」が未登録語と判定されたとする。図３の
辞書には「長」の読みとして「チヨウ、なが・い、お
さ、た・ける、はじめ、たけ」が登録されている。直前
の漢字「間」の読みには、「カン、ケン、あいだ、ま、
しず・か、ひそ・かに、うあい、けん」が登録されてい
る。また、単語辞書から「時間」の読み検索情報から
「じかん」であることがわかり、直前の漢字「間（か
ん）」が音読みであることがわかる。そこで、「長」の
読みを選択する時にも、音読みの中から選択する。この
場合「長」の音読みは１つしか登録されていないため、
「ちょう」が選択され正しく読み上げることができる。
ここで、もし、音読みが複数あった場合には、あらかじ
めどれか一つをデフォルトにしておき、それを選ぶ。FIG. 3 shows an embodiment of a single kanji dictionary according to the third embodiment of the present invention. When the input character string is determined to be an unregistered word, the reading of the unregistered word is estimated using the on-kanji information of the single kanji dictionary. For example, a character string “phoneme duration” is input, and the words “phoneme”, “continuation”, and “time” are determined by morphological analysis until “phoneme duration”. Is performed, and it is determined that “length” is an unregistered word. In the dictionary of FIG. 3, "Chiyo, Nagai, Osa, Ta / Karu, Hajime, Take" are registered as readings of "Length". The reading of the immediately preceding kanji, “ma”, includes “Kan, Ken, Ama, Ma,
"Shizu / Kana, Hidden / Kani, Ui, Ken" are registered. In addition, it can be seen from the word dictionary that the reading search information for “time” is “jikan”, and that the immediately preceding kanji “ma” is a phonetic reading. Therefore, when selecting the reading of “long”, the selection is made from sound readings. In this case, since only one reading of "Long" is registered,
"Butterfly" is selected and can be read out correctly.
Here, if, in the case where Readings there was more than one, leave the one any advance in the default, select it.

【００２８】第３の実施形態では、予め単漢字辞書の個
々の読みに出現頻度情報を持たせておき、それらを用い
て未登録語単漢字の読みを推定する。図４は、第４の本
発明の単漢字辞書の実施形態例である。第３の実施形態
では、入力文字列が未登録語と判定された時に、単漢字
辞書を検索して、当該漢字の読みが複数あった場合に
は、最も出現頻度の高い読みを付与する。これにより、
例えば、「谷間」が単語辞書になかった場合、「こくか
ん」や「たにあいだ」を選択することなく、「たにま」
を選択することができる。個々の漢字の読みの出現頻度
は、予め大規模なコーパスと単語辞書を用いて、当該漢
字の読みの出現頻度を調べて単漢字辞書に登録してお
く。In the third embodiment, individual readings of the single kanji dictionary are provided with appearance frequency information in advance, and the reading of the unregistered word single kanji is estimated using them. FIG. 4 shows a fourth embodiment of the single kanji dictionary of the present invention. In the third embodiment, when an input character string is determined to be an unregistered word, a single kanji dictionary is searched, and when there are a plurality of readings of the kanji, a reading having the highest appearance frequency is given. This allows
For example, if "valley" is not in the word dictionary, you can select "Tokutama" without selecting "Kokukan" or "Taniyama".
Can be selected. The appearance frequency of the reading of each kanji is checked in advance using a large-scale corpus and a word dictionary, and registered in a single kanji dictionary.

【００２９】また、本発明の第４の実施形態では、予め
単漢字辞書の個々の読みに音訓情報と出現頻度情報を持
たせておき、それらを用いて未登録語単漢字の読みを推
定する。図５は、第４の実施形態の単漢字辞書の実施形
態例である。これら、２つの情報を用いることにより未
登録語の読み推定の精度がさらに向上する。この実施形
態例としては、単漢字の読みが複数あった場合には、隣
接する単語の音訓情報を用いて候補を絞り、その中から
頻度の最も高いものを選ぶことで実現できる。Further, in the fourth embodiment of the present invention, individual pronunciations of the single kanji dictionary are provided with pronunciation information and appearance frequency information in advance, and the reading of the unregistered word single kanji is estimated using them. . FIG. 5 shows an example of a single kanji dictionary according to the fourth embodiment. By using these two pieces of information, the accuracy of the estimation of the reading of unregistered words is further improved. As an example of this embodiment, when there are a plurality of single kanji readings, it can be realized by narrowing down the candidates using the sound training information of an adjacent word and selecting the candidate having the highest frequency.

【００３０】さらに、第５の実施形態では、単語辞書に
読みを付与する時に用いる、読み付けルールを用いて未
登録の読みの推定を向上させる。まず、単漢字辞書中の
読みを決定し、次に単語辞書から該当する単漢字を含む
エントリを全て検索し、その読みに対する読み付けルー
ルと、前後音素環境を調べ、統計を取る。出現頻度の多
いルールから順に、一致条件を調べ、一致している場合
にはそれを採用する。一致条件は、単語辞書の統計を取
る際に、例えば、有声無声の一致、音素の一致、もしく
は調音の一致などで分類し、ルール毎に決めておく。Further, in the fifth embodiment, the estimation of unregistered readings is improved by using a reading rule used when giving a reading to a word dictionary. First, the pronunciation in the single kanji dictionary is determined, and then all entries including the corresponding single kanji are searched from the word dictionary, the reading rules for the reading, the surrounding phoneme environment, and the statistics are obtained. The matching conditions are checked in order from the rule having the highest frequency of occurrence, and if the rules match, the rules are adopted. The matching conditions are classified based on, for example, voiced / unvoiced matches, phoneme matches, or articulatory matches when statistics of the word dictionary are obtained, and are determined for each rule.

【００３１】図６は、単漢字辞書の「本」のよみ「ホ
ン」に関する読み付けルールの例である。「本」の読み
として「ホン」選択されると、まず、出現頻度の高い子
音の有声音化のルールの一致条件について調べ、次に、
ｈ／，／ｂ／を／ｐ／に置換するルールの一致条件が調
べられる。例えば、未登録語「単行本」の「本」に「ホ
ン」を選択した場合には、子音の有声化ルールが適用さ
れ、次の／ｐ／への置換は条件が一致しないため適用さ
れない。よって「本」の読み「ボン」に決まる。以上の
手段により、未登録語と判定された単語の読みを推定す
る。FIG. 6 shows an example of a reading rule for reading "hon" of "hon" in the single kanji dictionary. When "hon" is selected as a "book" reading, first, the matching condition of the rule of voiced consonants with high appearance frequency is checked,
The matching condition of the rule for replacing h /, / b / with / p / is examined. For example, when “hon” is selected as the “book” of the unregistered word “book”, the consonant voicing rule is applied, and the next replacement with / p / is not applied because the conditions do not match. Therefore, the reading of "book" is determined as "bon". By the above means, the reading of the word determined as an unregistered word is estimated.

【００３２】[0032]

【発明の効果】以上説明したように、本発明による音声
合成装置は、読み上げ精度を下げることなく辞書の容量
を減らすことができ、かつ、単漢字辞書の音訓情報や出
現頻度情報を用いて未登録語の読みを推定することがで
きる。As described above, the speech synthesizing apparatus according to the present invention can reduce the capacity of the dictionary without lowering the reading accuracy, and uses the on-kanji information and the appearance frequency information of the single kanji dictionary. The reading of registered words can be estimated.

[Brief description of the drawings]

【図１】本発明による音声合成装置の実施形態の構成ブ
ロック図である。FIG. 1 is a configuration block diagram of an embodiment of a speech synthesis device according to the present invention.

【図２】本発明の実施形態における辞書の単語辞書と単
漢字辞書、読み付けルールの一例を示す図である。FIG. 2 is a diagram showing an example of a word dictionary, a single kanji dictionary, and reading rules of a dictionary according to the embodiment of the present invention.

【図３】本発明の第２の実施形態における単漢字辞書の
一例を示す図である。FIG. 3 is a diagram illustrating an example of a single kanji dictionary according to a second embodiment of the present invention.

【図４】本発明の第３の実施形態における単漢字辞書の
一例を示す図である。FIG. 4 is a diagram illustrating an example of a single kanji dictionary according to a third embodiment of the present invention.

【図５】本発明の第４の実施形態における単漢字辞書の
一例を示す図である。FIG. 5 is a diagram illustrating an example of a single kanji dictionary according to a fourth embodiment of the present invention.

【図６】本発明の第５の実施形態における出現頻度、読
み付けルール、一致条件の一例を示す図である。FIG. 6 is a diagram illustrating an example of an appearance frequency, a reading rule, and a matching condition according to the fifth embodiment of the present invention.

[Explanation of symbols]

１辞書２テキスト解析部３音声合成部 1 dictionary 2 text analyzer 3 voice synthesizer

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平７−295584（ＪＰ，Ａ) 特開平９−245023（ＪＰ，Ａ) 特開平８−185197（ＪＰ，Ａ) 特開平６−282290（ＪＰ，Ａ) 特開平２−184897（ＪＰ，Ａ) 特開平１−257997（ＪＰ，Ａ) 特開平９−244679（ＪＰ，Ａ) 特公平６−79230（ＪＰ，Ｂ２) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 13/08 ────────────────────────────────────────────────── ─── Continuation of front page (56) References JP-A-7-295584 (JP, A) JP-A-9-245023 (JP, A) JP-A 8-185197 (JP, A) JP-A-6-295197 282290 (JP, A) JP-A-2-184897 (JP, A) JP-A-1-257997 (JP, A) JP-A-9-244679 (JP, A) JP-B-6-79230 (JP, B2) (58) Field surveyed (Int. Cl. ⁷ , DB name) G10L 13/08

Claims

(57) [Claims]

1. A text-to-speech synthesizer that performs morphological analysis using a dictionary on input Japanese text mixed with kanji and kana to generate pronunciation information and synthesizes speech based on the pronunciation information. A word dictionary in which the dictionary has reading information of reading and reading rule search information instead of reading, a single kanji dictionary having reading information of each kanji character, and a reading rule for determining reading and accent of compound words when the, symbol, alphabet, such as a pseudonym, and an unregistered word dictionary that the one character that is not registered in the single Chinese character dictionary to grant the reading when it is detected as an unregistered word, the single-Han
Holds onon information along with readings of individual kanji in the character dictionary,
Of the part determined as an unregistered word in the morphological analysis unit
When reading, the kanji information of the adjacent kanji has not been registered.
Unregistered words are read by matching the recorded word information of single kanji
A speech synthesizer characterized by estimating the following .

2. The speech synthesizer according to claim 1, wherein the unit is
In the Japanese text with reading of each kanji in the kanji dictionary
Is stored in the morphological analysis unit.
When reading the part judged to be a registered word, the frequency
Of unregistered words by selecting the highest reading
A speech synthesizer characterized by the following.

3. An input text is analyzed by morphological analysis.
Divide into words, read and search information, rules, part of speech information for each word
Word reading and accent position
The character string determined as an unregistered word is determined by a single kanji dictionary.
A reading is given using an unregistered word dictionary, and the divided
Words to which the word and part of speech information are added are semantically and grammatically
Joins to form a compound word and access the compound word
Are estimated using accent combination rules, and
Generate phonetic symbols from all the information and read the input text.
Determine the prosody of the rhythm, intonation, etc.
And store it in advance according to this determined prosodic pattern.
When editing the audio that was output to generate the audio signal,
Determine the reading in the book and include the corresponding single kanji from the word dictionary.
Search for all entries and read
And Lumpur, before and after examining the phoneme environment, take the statistics, frequency of occurrence
Examine the matching conditions in order from the rule with the most
A speech synthesizing apparatus characterized by employing the following.

4. The method according to claim 1, wherein the accent combination rule comprises :
Claims that use rules based on the regularity of change
Item 4. A speech synthesizer according to item 3.

5. When an input character string is determined as an unregistered word
Uses the on-kanji information of the single kanji dictionary to predict the reading of unregistered words.
The speech synthesizer according to claim 3, wherein

6. When an input character string is determined as an unregistered word
Has the appearance frequency information in advance for each reading of the single kanji dictionary.
First, search the single kanji dictionary and read the kanji.
If there is more than one, add the most frequently occurring reading
The speech synthesizer according to claim 3.

7. When an input character string is determined as an unregistered word
Is the pronunciation of the pronunciation and the frequency of appearance
Keep information, and if there is more than one single kanji reading
Narrows down candidates using the on-word information of adjacent words,
The voice according to claim 3, wherein the most frequent one is selected from among them.
Synthesizer.

8. The method according to claim 1, wherein the matching condition is determined when statistics of a word dictionary are obtained.
And voiced and unvoiced matches, phoneme matches, or articulatory matches
4. The method according to claim 3, wherein the information is classified according to a rule and the like.
Speech synthesizer.