JPH05165486A

JPH05165486A - Text voice transforming device

Info

Publication number: JPH05165486A
Application number: JP3334873A
Authority: JP
Inventors: Hiroki Kamanaka; 博樹釜中
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-12-18
Filing date: 1991-12-18
Publication date: 1993-07-02

Abstract

PURPOSE:To prevent a broadcast forbidden word itself from being sounded at the text voice transforming device to transform an input sentence to audio signals and to sound them by tabling read forbidden words and judging whether it is the read forbidden word or not while successively segmenting input sentences for the unit of a word. CONSTITUTION:A read forbidden word table 22 is provided and this read forbidden word table 22 is composed of a storing means for a system user to freely add or erase contents. Thus, at a Japanese text voice transforming device 20, the system user can freely set the read forbidden words and replaced expressions. In this case, a text analysis part 24 judges whether it is the read forbidden word or not by successively segmenting the input sentences for the unit of the word, the read forbidden word is stopped sounding based on the judged result, and it is sounded while replacing the expression of the read forbidden word.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、テキストデータでなる
入力文章を音声信号に変換して発音出力するテキスト音
声変換装置に関し、例えば、日本語文章を対象とする日
本語テキスト音声変換装置に適用し得る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a text-to-speech converter for converting an input sentence composed of text data into a voice signal and outputting the pronunciation, for example, applied to a Japanese text-to-speech converter for a Japanese sentence. You can

【０００２】[0002]

【従来の技術】日本語テキスト音声変換装置は、ワード
プロセツサで作成したりＯＣＲから読み取って得たりし
た漢字かな混じり文（テキストデータ）を音声信号に変
換して発音出力するものである。2. Description of the Related Art A Japanese text-to-speech converter converts a kanji-kana mixed sentence (text data) created by a word processor or obtained by reading from an OCR into a voice signal and outputs the voice signal.

【０００３】図２は、従来の日本語テキスト音声変換装
置の構成を示すものである。図２において、日本語テキ
スト音声変換装置１にはＪＩＳコード等のテキストデー
タで表された漢字かな混じり文Ｄ１が入力され、テキス
ト解析部２が、この漢字かな混じり文Ｄ１を、発音辞書
４の格納内容を利用して音韻韻律記号列Ｄ２に変換す
る。FIG. 2 shows the configuration of a conventional Japanese text-to-speech converter. In FIG. 2, a Japanese text-to-speech converter 1 is input with a kanji / kana mixed sentence D1 represented by text data such as JIS code, and the text analysis unit 2 converts the kanji / kana mixed sentence D1 into a pronunciation dictionary 4. The stored contents are used to convert to a phonological prosodic symbol string D2.

【０００４】ここで音韻韻律記号列とは、入力文の読
み、アクセント、イントネーション等を文字列として記
述した中間言語である。発音辞書４は、図３に示すよう
に、この音韻韻律記号列Ｄ２を生成するための表記、読
み、アクセント等の情報を単語単位で格納している。上
述したテキスト解析部２は、この発音辞書４の表記と入
力文とのマツチングを順次行なうことにより、入力文を
解析して入力文の読み、アクセント、イントネーション
を決定し、これにより音韻韻律記号列Ｄ２を生成する。
この音韻韻律記号列Ｄ２が合成パラメータ生成部６に与
えられる。Here, the phonological prosody symbol string is an intermediate language in which the reading, accent, intonation, etc. of the input sentence are described as a character string. As shown in FIG. 3, the pronunciation dictionary 4 stores information such as notations, readings, and accents for generating the phonological prosodic symbol string D2 in units of words. The above-described text analysis unit 2 analyzes the input sentence by sequentially performing the matching between the notation of the pronunciation dictionary 4 and the input sentence, and determines the reading, accent, and intonation of the input sentence, thereby phonological prosodic symbol string. Generate D2.
This phonological prosody symbol string D2 is given to the synthesis parameter generation unit 6.

【０００５】合成パラメータ生成部６は、音声素片デー
タ部８の格納内容を利用して、音韻韻律記号列Ｄ２を、
音声信号を合成するための合成パラメータＤＰに変換す
る。音声合成においては、発生すべき音の種類を持つ音
声素片（音声の基本単位）、音の継続時間を表す音韻継
続時間（音の長さ）及び音の基本周波数パターン（音の
高さ）を選定することが必要になる。このため合成パラ
メータ生成部６は、音韻韻律記号列Ｄ２に含まれるアク
セント、イントネーションの情報に基づいて音韻継続時
間及び基本周波数パターンを選定する。さらに音韻韻律
記号列Ｄ２に含まれる読みの情報に基づいて音声素片デ
ータ部８を検索し、これにより音声素片を選定する。こ
のようにして形成された合成パラメータＤＰが音声合成
部１０に与えられる。The synthesis parameter generation unit 6 uses the stored contents of the speech unit data unit 8 to generate a phonological prosodic symbol string D2,
It is converted into a synthesis parameter DP for synthesizing an audio signal. In speech synthesis, a speech unit (basic unit of speech) having a type of sound to be generated, a phoneme duration (sound length) that represents the duration of the sound, and a fundamental frequency pattern (pitch) of the sound. Will need to be selected. Therefore, the synthesis parameter generation unit 6 selects the phoneme duration and the fundamental frequency pattern based on the accent and intonation information included in the phoneme prosodic symbol string D2. Further, the speech unit data portion 8 is searched based on the reading information included in the phonological prosody symbol string D2, and the speech unit is selected by this. The synthesis parameter DP formed in this way is given to the speech synthesis unit 10.

【０００６】音声合成部１０は、合成パラメータＤＰに
基づいて音声信号を合成し、スピーカ１２に与えて発音
出力させる。このとき、音声合成部１０は、音韻継続時
間及び基本周波数パターンを基準にしながら、合成音が
自然音に近づくように音声素片を接続し音声信号を合成
する。The voice synthesizing section 10 synthesizes a voice signal based on the synthesis parameter DP and gives it to the speaker 12 to output a sound. At this time, the speech synthesis unit 10 connects the speech units and synthesizes the speech signal so that the synthesized sound approaches the natural sound while using the phoneme duration and the fundamental frequency pattern as a reference.

【０００７】これにより、日本語テキスト音声変換装置
１においては、分かち書きされていない日本語文章（漢
字かな混じり文）Ｄ１を順次読み上げ得るようになされ
ている。As a result, the Japanese text-to-speech conversion apparatus 1 can sequentially read aloud the Japanese sentences (kanji / kana mixed sentences) D1 that have not been separated.

【０００８】[0008]

【発明が解決しようとする課題】しかしながら、従来の
日本語テキスト音声変換装置１においては、入力された
漢字かな混じり文Ｄ１に俗語、卑語、差別用語、忌み語
等（いわゆる放送禁止用語に該当する）が含まれている
場合でも、これらの用語を例外なく読み上げる問題があ
る。このような用語は、普段は口にするのを慎むべき用
語であり、聞き手に不快感を与えるものである。However, in the conventional Japanese text-to-speech conversion device 1, the input kanji-kana mixed sentence D1 includes slang words, slang words, discriminative words, abusive words, etc. ) Is included, there is a problem of reading these terms without exception. Such a term is a term that should normally not be spoken, and is unpleasant to the listener.

【０００９】この問題を解決する一つの方法として、こ
の種の用語を発音辞書４に登録しない方法が考えられ
る。しかし、未知語処理を行なうシステムにおいては、
登録された他の表記から推察して結局この種の用語を読
み上げてしまう問題がある。As a method of solving this problem, a method of not registering this type of term in the pronunciation dictionary 4 can be considered. However, in a system that processes unknown words,
There is a problem that this term is eventually read aloud by guessing from other registered notations.

【００１０】これに対してオペレータが予め入力文をチ
エツクしたり、又は、未知語として処理をシステムに求
められたときにオペレータがこれに応じたりして、この
種の用語を改める方法も考えられるが、作業に時間を要
すると共に完全を期し得ない問題もあり、又文章量が増
大すると作業自体も煩雑になる。On the other hand, a method is also conceivable in which the operator checks the input sentence in advance, or the operator responds to it when the system is requested to process it as an unknown word. However, there is a problem that the work takes time and cannot be completed completely, and the work itself becomes complicated when the amount of texts increases.

【００１１】本発明は、以上の点を考慮してなされたも
のであり、入力文に発音出力が妥当でない用語が含まれ
ている場合に、この種の用語の発音を禁止することがで
きるテキスト音声変換装置を提供しようとするものであ
る。The present invention has been made in consideration of the above points, and when an input sentence contains a term whose pronunciation output is not appropriate, a text that can prohibit the pronunciation of this type of term. It is intended to provide a voice conversion device.

【００１２】[0012]

【課題を解決するための手段】かかる課題を解決するた
め、第１の本発明においては、テキストデータでなる入
力文章を音声信号に変換して発音出力するするテキスト
音声変換装置において、以下の各手段を設けた。すなわ
ち、読み上げ禁止用語を格納する読み上げ禁止用語テー
ブルと、入力文章を単語単位で切り出し、読み上げ禁止
テーブルを検索して入力文章に含まれている単語が読み
上げ禁止用語か否かを判断する読み上げ禁止用語判断手
段と、この読み上げ禁止用語判断手段の判断結果に基づ
いて、読み上げ禁止用語に該当する単語の発音を禁止す
る発音禁止手段とを設けた。In order to solve such a problem, according to the first aspect of the present invention, a text-to-speech conversion device for converting an input sentence composed of text data into a sound signal and outputting the sound is output. Means were provided. That is, a reading-prohibited term table that stores reading-prohibited terms and an input sentence is cut out in word units, and a reading-prohibition table is searched to determine whether or not a word included in the input sentence is a reading-prohibited term. The judgment means and the pronunciation prohibition means for prohibiting the pronunciation of the word corresponding to the reading-prohibited term based on the judgment result of the reading-prohibited term judgment means are provided.

【００１３】また、第２の本発明においては、テキスト
データでなる入力文章を音声信号に変換して発音出力す
るするテキスト音声変換装置において、以下の各手段を
設けた。すなわち、読み上げ禁止用語とこの読み上げ禁
止用語を表現する置換表現との対を格納する読み上げ禁
止用語テーブルと、入力文章を単語単位で切り出し、読
み上げ禁止テーブルを検索して入力文章に含まれている
単語が読み上げ禁止用語か否か判断する読み上げ禁止用
語判断手段と、この読み上げ禁止用語判断手段の判断結
果に基づいて、読み上げ禁止用語を置換表現に変換して
発音させる読み上げ禁止用語置換手段とを設けた。Further, in the second aspect of the present invention, the following means are provided in the text-to-speech conversion apparatus for converting an input sentence composed of text data into a sound signal and outputting the sound. That is, a reading prohibited term table that stores a pair of the reading prohibited term and a replacement expression that represents this reading prohibited term, the input sentence is cut out in word units, and the reading prohibited table is searched to search for the word included in the input sentence. A reading prohibition term judging means for judging whether or not is a reading prohibition term, and a reading prohibition term replacing means for converting the reading prohibition term into a substitution expression and pronounced based on the judgment result of the reading prohibition term judging means are provided. ..

【００１４】[0014]

【作用】第１の本発明において、読み上げ禁止用語テー
ブルに発音出力を禁止する読み上げ禁止用語を予め格納
しておく。テキストデータでなる文章が入力されると、
読み上げ禁止用語判断手段は、入力文章を単語単位で切
り出し、読み上げ禁止テーブルを検索して入力文章に含
まれている単語が読み上げ禁止用語か否かを判断する。
そして、発音禁止手段が、この読み上げ禁止用語判断手
段の判断結果に基づいて、読み上げ禁止用語に該当する
単語の発音を禁止する。例えば、読み上げ禁止用語に代
えてビープ音を発音させたり、無音にさせたりすること
で読み上げ禁止用語の発音を禁止する。In the first aspect of the present invention, the reading-prohibited term for prohibiting the pronunciation output is stored in advance in the reading-prohibited term table. When a sentence consisting of text data is entered,
The reading-prohibited term determination means cuts out the input sentence on a word-by-word basis, searches the reading-prohibition table, and determines whether or not the word included in the input sentence is a reading-prohibited term.
Then, the pronunciation prohibiting means prohibits the pronunciation of the word corresponding to the reading-prohibited term based on the judgment result of the reading-prohibited term judging means. For example, the pronunciation of a reading-prohibited term is prohibited by making a beep sound be pronounced or silenced instead of the reading-prohibited term.

【００１５】また、第２の本発明において、読み上げ禁
止テーブルに読み上げ禁止用語とこの読み上げ禁止用語
を表現する置換表現との対を予め格納しておく。テキス
トデータでなる文章が入力されると、読み上げ禁止用語
判断手段は、入力文章を単語単位で切り出し、読み上げ
禁止テーブルを検索して入力文章に含まれている単語が
読み上げ禁止用語か否か判断する。そして、読み上げ禁
止用語置換手段が、この読み上げ禁止用語判断手段の判
断結果に基づいて、読み上げ禁止用語を置換表現に変換
して発音させる。Further, in the second aspect of the present invention, a pair of a reading prohibited term and a replacement expression expressing the reading prohibited term is stored in advance in the reading prohibited table. When a sentence consisting of text data is input, the reading-prohibited term determination means cuts the input sentence into words and searches the reading-prohibition table to determine whether the word included in the input sentence is a reading-prohibited term. .. Then, the reading-prohibited term replacing means converts the reading-prohibited term into a replacement expression based on the judgment result of the reading-prohibited term judging means and causes the pronunciation.

【００１６】[0016]

【実施例】以下、図面を参照しながら、本発明を日本語
テキスト音声変換装置に適用した一実施例を詳述する。
ここで、図１がこの実施例の構成を示すブロック図であ
り、上述した図２との対応部分には同一符号を付して示
している。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment in which the present invention is applied to a Japanese text-to-speech converter will be described in detail below with reference to the drawings.
Here, FIG. 1 is a block diagram showing the configuration of this embodiment, and the portions corresponding to those in FIG. 2 described above are designated by the same reference numerals.

【００１７】図１において、この実施例の日本語テキス
ト音声変換装置２０は、読み上げ禁止用語テーブル２２
を備えている。In FIG. 1, the Japanese text-to-speech conversion apparatus 20 of this embodiment has a reading-inhibited term table 22.
Is equipped with.

【００１８】図４は、この読み上げ禁止用語テーブル２
２の構成を示すものである。読み上げ禁止用語テーブル
２２は、この図４に示すように、読み上げ（発音出力）
を禁止する用語（以下、読み上げ禁止用語と呼ぶ）とそ
の置き換えの表現（以下、置換表現と呼ぶ）との対を格
納するようになされている。この実施例の場合、「つん
ぼ」、「めくら」等の読み上げ禁止用語に対して「目の
不自由な人」、「耳の不自由な人」等の置換表現が格納
されている。この読み上げ禁止用語テーブル２２は、シ
ステム利用者が内容を自由に追加、削除し得るようにな
された記憶手段で形成され、これにより当該日本語テキ
スト音声変換装置２０においては、システム利用者が読
み上げ禁止用語及び置換表現を自由に設定し得るように
なされている。FIG. 4 shows the reading-prohibited term table 2
2 shows the configuration of No. 2. As shown in FIG. 4, the read-aloud term table 22 is read out (sound output).
Is stored with a pair of a term forbidden (hereinafter, referred to as a reading prohibited term) and a replacement expression (hereinafter, referred to as a replacement expression). In this embodiment, replacement expressions such as "blind person" and "early person" are stored for reading-prohibited terms such as "tsubo" and "blind". The read-aloud term table 22 is formed by a storage means that allows the system user to freely add and delete the contents, whereby the system user of the Japanese text-to-speech converter 20 is prohibited to read the content. The terms and replacement expressions can be freely set.

【００１９】テキスト解析部２４は、基本的には従来と
同様に、入力された漢字かな混じり文Ｄ１を音韻韻律記
号列Ｄ３に変換するものである。しかし、この実施例の
テキスト解析部２４は、さらに図５に示す読み上げ禁止
用語に係る処理をも実行する。この図５に示す処理が実
行されることにより、読み上げ禁止用語テーブル２２に
登録されている読み上げ禁止用語の発音を未然に防止で
きる。The text analysis unit 24 basically converts the inputted Kanji / Kana mixed sentence D1 into a phonological prosodic symbol string D3, as in the conventional case. However, the text analysis unit 24 of this embodiment also executes the process related to the reading-prohibited term shown in FIG. By executing the processing shown in FIG. 5, it is possible to prevent the pronunciation of the reading-prohibited terms registered in the reading-prohibited term table 22 from occurring.

【００２０】テキスト解析部２４は、漢字かな混じり文
Ｄ１が入力されてくると、図５の処理を開始してまず発
音辞書４の読みを利用して１単語を切り出し、さらに発
音辞書４の格納内容を利用して音韻韻律記号Ｄ３に変換
する（ステップ１０１、１０２）。そして、読み上げ禁
止用語テーブル２２を検索して切り出された単語が読み
上げ禁止用語に該当するか否かを判断する（ステップ１
０３）。When the kana-kana mixed sentence D1 is input, the text analysis unit 24 starts the process of FIG. 5 to cut out one word by using the reading of the pronunciation dictionary 4, and further stores the pronunciation dictionary 4. The contents are used to convert into a phonological prosodic symbol D3 (steps 101 and 102). Then, the reading prohibited term table 22 is searched to determine whether or not the cut out word corresponds to the reading prohibited term (step 1).
03).

【００２１】切り出された単語が読み上げ禁止用語でな
いと、テキスト解析部２４は、さらに文末か否か判断す
る（ステップ１０４）。文末であると、すなわち、最後
の単語に対する読み上げ禁止用語に係る処理を実行し終
えた場合には、図５に示す処理を終了する（ステップ１
０５）。他方、切り出された単語が読み上げ禁止用語で
なくかつ文末でもないと、テキスト解析部２４は、上述
したステツプ１０２に戻つて次の単語を切り出す。If the cut-out word is not a reading-prohibited term, the text analysis unit 24 further judges whether it is the end of sentence (step 104). If it is the end of the sentence, that is, if the processing relating to the reading-prohibited term for the last word has been executed, the processing shown in FIG. 5 is ended (step 1).
05). On the other hand, if the cut-out word is neither a reading-prohibited term nor a sentence end, the text analysis unit 24 returns to step 102 described above and cuts out the next word.

【００２２】すなわち、テキスト解析部２４は、入力文
から切り出した単語が読み上げ禁止用語でない間は、ス
テツプ１０２−１０３−１０４−１０２のループを繰返
し行なうことで順次音韻韻律記号Ｄ３に変換する。That is, while the word cut out from the input sentence is not a reading-prohibited term, the text analysis unit 24 repeats the loop of steps 102-103-104-102 to sequentially convert it into the phonological prosodic symbol D3.

【００２３】入力文に読み上げ禁止用語が含まれてお
り、その単語がステップ１０２で切り出されると、上述
したステップ１０３で肯定結果が得られる。このときに
は、テキスト解析部２４は、読み上げ禁止用語に対する
処理モードを判断する（ステップ１０６）。If the input sentence contains a reading-prohibited term and the word is cut out in step 102, a positive result is obtained in step 103. At this time, the text analysis unit 24 determines the processing mode for the reading-prohibited term (step 106).

【００２４】この実施例の場合、該日本語テキスト音声
変換装置２０は、システム利用者が予め読み上げ禁止用
語の処理モードを選定し得るようになされている。処理
モードとしては、読み上げ禁止用語部分をビープ音で発
音させるビープ音モードと、読み上げ禁止用語部分をそ
の置換表現で発音させる言い換えモードとがある。この
ような処理モードの判断は、例えばディップスイッチの
モードに応じた位置で行なっても良く、またキーボード
等を介して入力されたモード情報を保持しているフラグ
の内容を判断することで行なっても良い。In the case of this embodiment, the Japanese text-to-speech converter 20 is designed so that the system user can select the processing mode of the reading-prohibited term in advance. As the processing modes, there are a beep sound mode in which the reading prohibited term portion is pronounced with a beep sound, and a paraphrasing mode in which the reading prohibited term portion is pronounced with the replacement expression. Such processing mode determination may be performed, for example, at a position corresponding to the mode of the DIP switch, or by determining the content of the flag holding the mode information input via the keyboard or the like. Is also good.

【００２５】システム利用者が予めビープ音モードを選
択していると、テキスト解析部２４は、読み上げ禁止用
語に該当する処理中の単語について、当該単語に対する
音韻韻律記号列Ｄ３をビープ音発音コマンドに変換した
後、上述したステツプ１０４に移る（ステップ１０
７）。When the system user has selected the beep sound mode in advance, the text analysis unit 24 uses the phonological prosodic symbol string D3 for the word as a beep sound generation command for the word being processed which corresponds to the reading prohibited term. After the conversion, the process moves to the above-mentioned step 104 (step 10).
7).

【００２６】これに対してシステム利用者が予め言い換
えモードを選択していると、テキスト解析部２４は、読
み上げ禁止用語に対する音韻韻律記号列Ｄ３を破棄し、
入力文における読み上げ禁止用語に該当する単語を置換
表現で置き換えて上述したステップ１０２に戻る（ステ
ップ１０８）。このようにしてステップ１０２に戻った
場合には、置換表現から１単語の切り出しが行われる。On the other hand, if the system user has previously selected the paraphrase mode, the text analysis unit 24 discards the phonological prosodic symbol string D3 for the reading-inhibited term,
The word corresponding to the read-aloud term in the input sentence is replaced with the replacement expression, and the process returns to step 102 (step 108). When the process returns to step 102 in this way, one word is cut out from the replacement expression.

【００２７】以上のように、この実施例のテキスト解析
部２４は、入力文から１単語を切り出してはそれを音韻
韻律記号Ｄ３に変換すると共に、切り出された単語が読
み上げ禁止用語の場合には、音韻韻律記号Ｄ３に代えて
ビープ音発音コマンドを出力するか、又は、読み上げ禁
止用語の置換表現についての音韻韻律記号Ｄ３を出力す
る。As described above, the text analysis unit 24 of this embodiment cuts out one word from the input sentence and converts it into the phonological prosodic symbol D3, and when the cut out word is a reading-prohibited term. , A beep sound pronunciation command is output instead of the phonological prosodic symbol D3, or the phonological prosodic symbol D3 for the replacement expression of the reading-prohibited term is output.

【００２８】図６は、「彼女はめくらだ。」という入力
文が入力された場合におけるテキスト解析部２４の出力
を示すものである。ビープ音モードが選択されている場
合には、読み、アクセントが交互に連続し、かつ読み上
げ禁止用語「めくら」の部分にビープ音発音コマンド
〔ＢＥＥＰ〕が挿入された「カノジョワ１，〔ＢＥＥ
Ｐ〕ダ」という音韻韻律記号列Ｄ３が生成される。ま
た、言い換えモードが選択されている場合には、「彼女
はめくらだ。」の入力文を「彼女は目の不自由な人
だ。」と置き換えたと同様な文章に対する音韻韻律記号
列Ｄ３、すなわち「カノジョワ１，メノ１，フジユーナ
１，ヒトダ２」を生成する。FIG. 6 shows the output of the text analysis unit 24 when the input sentence "She is blind." Is input. When the beep sound mode is selected, reading and accent are alternately continued, and a beep sound pronunciation command [BEEP] is inserted in the reading prohibited term "blank".
A phonological prosodic symbol string D3 "P] Da" is generated. When the paraphrase mode is selected, the phonological prosodic symbol string D3 for a sentence similar to the sentence in which the input sentence of “She is blind” is replaced with “She is a blind person”, that is, "Kanojowa 1, Meno 1, Fuji Yuna 1, Hitoda 2" is generated.

【００２９】なお、従来の日本語テキスト音声変換装置
１においては、この場合、読み、アクセントが交互に連
続する「カノジョワ１，メクラダ３」の音韻韻律記号列
Ｄ２が生成される。In the conventional Japanese text-to-speech conversion apparatus 1, in this case, the phoneme prosodic symbol string D2 of "Kanojowa 1, Mekurada 3" in which reading and accent are alternately continuous is generated.

【００３０】この実施例において、読み上げ禁止用語に
対しては置換表現に置き換えて処理するようにしたの
は、システム利用者の使い勝手を向上することができる
ためである。以下、この理由について詳述する。In this embodiment, the reading-prohibited term is replaced with the replacement expression for processing because the usability for the system user can be improved. Hereinafter, the reason for this will be described in detail.

【００３１】音韻韻律記号列Ｄ３を生成する際、読み上
げ禁止用語の読み及びアクセントを置き換えても、読み
上げ禁止用語の発音を未然に防止することができる。し
かし、この方法において、システム利用者が自由に読み
上げ禁止用語を設定、削除し得るようにすると、結局シ
ステム利用者自身が読み上げ禁止用語の読み及びアクセ
ントを登録しなければならない。すなわちこの場合、読
み上げ禁止用語テーブル２２は利用者専用の発音辞書と
同じ扱いとなり、ユーザ自身が自己のユーザ辞書に読み
上げ禁止用語の読み及びアクセントを登録することにな
る。ところが、実際上アクセントの登録は熟練を要し、
使い慣れていないシステム利用者においては登録が困難
な問題がある。特に置換表現が複数の単語からなる句の
場合、置換表現が１つの単語の場合に比してさらに一段
とアクセントの登録が困難になる。そこで、この実施例
においては、読み上げ禁止用語の表記を置き換えること
により、簡易に登録できるようにし、置き換えた表記を
テキスト解析部２４で解析処理し、読み及びアクセント
を付与するようにした。When the phonological prosodic symbol string D3 is generated, even if the reading and accent of the reading prohibited term are replaced, the pronunciation of the reading prohibited term can be prevented in advance. However, in this method, if the system user is allowed to freely set and delete the read-aloud words, the system user must eventually register the reading and accent of the read-aloud words. That is, in this case, the reading prohibited term table 22 is treated in the same manner as the pronunciation dictionary dedicated to the user, and the user himself / herself registers the reading and accent of the reading prohibited term in his / her own user dictionary. However, registering accents actually requires skill,
There is a problem that registration is difficult for system users who are not familiar with it. In particular, when the replacement expression is a phrase composed of a plurality of words, it becomes more difficult to register the accent as compared with the case where the replacement expression is one word. Therefore, in this embodiment, by replacing the notation of the reading-prohibited term, it is possible to easily register, and the replaced notation is analyzed by the text analysis unit 24, and the reading and accent are added.

【００３２】なお、テキスト解析部２４は、文中の全て
の単語について読み及びアクセントを得た後、文全体と
して体裁と整えるように総合的に判断して入力文の読み
及びアクセントを決定し、これにより音韻韻律記号列Ｄ
３を生成するようになされている。The text analysis unit 24, after obtaining readings and accents for all the words in the sentence, makes a comprehensive judgment so that the sentence as a whole is formatted and determines the readings and accents of the input sentence. Phonological prosodic symbol string D
It is designed to generate 3.

【００３３】合成パラメータ生成部２６は、合成パラメ
ータ生成部６（図２）と同様に音韻韻律記号列Ｄ３に基
づいて、合成パラメータＤＰ１を生成するものである。
但し、合成パラメータ生成部２６は、音韻韻律記号列Ｄ
３に挿入されているビープ音発音コマンド〔ＢＥＥＰ〕
を検出すると、ビープ音発音のための合成パラメータ
（すなわち位置、長さ等のデータでなる）を生成して出
力する。The synthesis parameter generation unit 26, like the synthesis parameter generation unit 6 (FIG. 2), generates the synthesis parameter DP1 based on the phonological prosodic symbol string D3.
However, the synthesis parameter generation unit 26 uses the phonological prosodic symbol string D.
Beep sound command [BEEP] inserted in 3
When it detects, a synthetic parameter (that is, data such as position and length) for beep sound generation is generated and output.

【００３４】以上の構成において、入力文Ｄ１はテキス
ト解析部２４で音韻韻律記号列Ｄ３に変換され、この音
韻韻律記号列Ｄ３は合成パラメータ生成部２６で音声合
成用の合成パラメータＤＰ１に変換される。In the above configuration, the input sentence D1 is converted into the phonological prosody symbol string D3 by the text analysis unit 24, and this phonological prosody symbol string D3 is converted into the synthesis parameter DP1 for speech synthesis by the synthesis parameter generation unit 26. ..

【００３５】このとき入力文Ｄ１はテキスト解析部２４
で単語単位に切り出され、読み上げ禁止用語テーブル２
２に基づいて各単語が読み上げ禁止用語に該当するか否
か判断される。ここで、禁止用語に該当すると、ビープ
音モードにおいては、ビープ音発音コマンドが置き換え
られて音韻韻律記号列Ｄ３が生成されるのに対し、言い
換えモードにおいては読み上げ禁止用語テーブル２２の
置換表現に置き換えられて音韻韻律記号列Ｄ３が生成さ
れる。At this time, the input sentence D1 is the text analysis unit 24.
Table 2
Based on 2, it is determined whether or not each word corresponds to a read-aloud term. Here, when the prohibited term is applied, in the beep sound mode, the beep sound pronunciation command is replaced to generate the phonological prosodic symbol string D3, whereas in the paraphrase mode, the replacement expression of the reading prohibited term table 22 is replaced. As a result, a phonological prosodic symbol string D3 is generated.

【００３６】これによりビープ音モードにおいては、読
み上げ禁止用語に代えてビープ音が発音出力されるのに
対し、言い換えモードにおいては読み上げ禁止用語が所
定の表現に置き換えられて読み上げられる。As a result, in the beep sound mode, the beep sound is output in place of the read-aloud words, whereas in the paraphrase mode, the read-aloud words are replaced with a predetermined expression and read.

【００３７】従って、上記実施例によれば、入力文を単
語単位で切り出し、読み上げ禁止用語テーブル２２を検
索して読み上げ禁止用語をビープ音又は置換表現と置き
換えることにより、煩雑な操作を省略して読み上げ禁止
用語の発音を未然に防止することができる。Therefore, according to the above embodiment, the input sentence is cut out word by word, the reading prohibited term table 22 is searched, and the reading prohibited term is replaced with a beep sound or a replacement expression, thereby omitting a complicated operation. It is possible to prevent the pronunciation of the reading-prohibited terms from occurring.

【００３８】なお、上述の実施例においては、読み上げ
禁止用語をビープ音又は置換表現と置き換える場合につ
いて述べたが、ビープ音以外の特有音を発音させたり、
無音にして発音自体を停止したりしても良い。In the above embodiment, the case where the reading-prohibited term is replaced with the beep sound or the replacement expression has been described. However, a peculiar sound other than the beep sound is generated,
It is also possible to silence and stop the pronunciation itself.

【００３９】また、上記実施例においては、読み上げ禁
止用語に対する処理モードとして２個の処理モードを設
けてユーザに選択させるものを示したが、いずれか一方
の処理モードだけを実現できる装置であっても良い。In the above embodiment, two processing modes are provided as the processing mode for the reading-prohibited term and the user is allowed to select the processing mode. However, the apparatus can realize only one of the processing modes. Is also good.

【００４０】さらに、上記実施例においては、日本語文
章を対象としたテキスト音声変換装置を示したが、他の
言語を対象としたテキスト音声変換装置にも本発明を適
用することができる。Further, in the above embodiment, the text-to-speech converter for Japanese sentences is shown, but the present invention can be applied to the text-speech converter for other languages.

【００４１】[0041]

【発明の効果】以上のように、本発明によれば、読み上
げ禁止用語をテーブル化し、入力文を順次単語単位に切
り出して読み上げ禁止用語か否か判断することにより、
簡易に読み上げ禁止用語か否か判断することができ、こ
の判断結果に基づいて読み上げ禁止用語の発音を禁止
し、又は読み上げ禁止用語の表現を置き換えて発音する
ようにしたので、読み上げ禁止用語自体の発音を防止で
きるテキスト音声変換装置を実現できる。As described above, according to the present invention, the reading-prohibited terms are made into a table, and the input sentence is sequentially cut out in word units to judge whether or not the reading-prohibited terms are used.
It is possible to easily judge whether it is a reading prohibited term, and based on this judgment result, the pronunciation of the reading prohibited term is prohibited, or the expression of the reading prohibited term is replaced and pronounced. A text-to-speech converter that can prevent pronunciation can be realized.

[Brief description of drawings]

【図１】実施例のテキスト音声変換装置を示すブロツク
図である。FIG. 1 is a block diagram showing a text-to-speech conversion apparatus according to an embodiment.

【図２】従来の日本語テキスト音声変換装置を示すブロ
ツク図である。FIG. 2 is a block diagram showing a conventional Japanese text-to-speech conversion device.

【図３】発音辞書の内容を示す略線図である。FIG. 3 is a schematic diagram showing the contents of a pronunciation dictionary.

【図４】実施例の読み上げ禁止用語テーブルの内容を示
す説明図である。FIG. 4 is an explanatory diagram showing the contents of a reading-prohibited term table according to the embodiment.

【図５】実施例のテキスト解析部の動作を示すフローチ
ャートである。FIG. 5 is a flowchart showing the operation of the text analysis unit of the embodiment.

【図６】実施例の音韻韻律記号列を示す説明図である。FIG. 6 is an explanatory diagram showing a phonological prosodic symbol string according to the embodiment.

[Explanation of symbols]

４…発音辞書、８…音声素片データ部、１０…音声合成
部、１２…スピーカ、２０…日本語テキスト音声変換装
置、２２…読み上げ禁止用語テーブル、２４…テキスト
解析部、２６…合成パラメータ生成部。4 ... Pronunciation dictionary, 8 ... Speech element data part, 10 ... Speech synthesis part, 12 ... Speaker, 20 ... Japanese text-to-speech conversion device, 22 ... Read-aloud term table, 24 ... Text analysis part, 26 ... Synthesis parameter generation Department.

Claims

[Claims]

1. A text-to-speech conversion device for converting an input sentence consisting of text data into a voice signal and outputting the pronunciation, and a reading-prohibited term table for storing reading-prohibited terms; Based on the reading-prohibited term determination means that searches the reading-prohibited table and determines whether the word included in the input sentence is a reading-prohibited term, and the determination result of the reading-prohibited term determination means,
A text-to-speech conversion device comprising: a pronunciation prohibition unit that prohibits pronunciation of a word corresponding to the above-mentioned reading-prohibited term.

2. In a text-to-speech conversion device for converting an input sentence composed of text data into a voice signal and outputting the pronunciation, a read-aloud term table storing a pair of a read-prohibited term and a replacement expression expressing the read-prohibited term. And a reading-prohibited term judging means for cutting out the input sentence in units of words, searching the reading-prohibition table and judging whether or not the word included in the input sentence is the reading-prohibited term, and the reading-prohibited term judging means. Based on the judgment result of
A text-to-speech conversion device, comprising: a reading-prohibited term replacement unit that converts the reading-prohibited term into the replacement expression and pronounces it.