JP2010145873A

JP2010145873A - Text replacement device, text voice synthesizer, text replacement method, and text replacement program

Info

Publication number: JP2010145873A
Application number: JP2008324784A
Authority: JP
Inventors: Katsuhiko Sato; 勝彦佐藤
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2008-12-19
Filing date: 2008-12-19
Publication date: 2010-07-01

Abstract

<P>PROBLEM TO BE SOLVED: To replace a word included in an input text with a word fit to a vocabulary ability of a hearing person. <P>SOLUTION: This text replacement device or the like sets a vocabulary ability level, analyzes the input text to input the word, determines whether the degree of word difficulty set preliminarily in each word exceeds the set vocabulary ability level or not (step S23), replaces the word when the degree of word difficulty exceeds the vocabulary ability level (step S24), and outputs the word as it is, without replacing the word, when the degree of word difficulty does not exceed the vocabulary ability level. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、聞き手にとって理解し易い言葉に置き換えて合成音声を出力することができるテキスト置換装置、テキスト音声合成装置、テキスト置換方法、及び、テキスト置換プログラムに関する。 The present invention relates to a text replacement device, a text-to-speech synthesizer, a text replacement method, and a text replacement program that can output synthesized speech by replacing words that are easy for a listener to understand.

一般に、入力テキストから合成音声を生成するテキスト音声合成においては、入力テキストに書かれた言葉がそのまま音声に変えられて出力される。ところが、入力テキストは、書かれた文字が読者の目で直接読まれることを前提とした書き言葉で書かれていることが多い。そのため、比較的難解な語彙や表現が使われているなどの場合は、そのまま音声に変えられると、文字を直接目で見ないで耳だけで聞いている聞き手にとっては理解しにくくなる。そこで、書き言葉で書かれたテキストを、平易な表現による話し言葉に変換してから合成音声を生成するテキスト音声合成方法が開発されている。 In general, in text-to-speech synthesis that generates synthesized speech from input text, words written in the input text are directly converted into speech and output. However, the input text is often written in written language assuming that the written characters are read directly by the reader's eyes. For this reason, when relatively difficult vocabulary and expressions are used, it is difficult for a listener who is listening to the ears without directly looking at the characters if the voice is changed as it is. In view of this, a text-to-speech synthesis method has been developed in which text written in written language is converted into spoken language expressed in plain expression and then synthesized speech is generated.

特開平３−０３５２９６号公報（第３〜５頁、図１〜２）Japanese Patent Laid-Open No. 3-035296 (pages 3-5, FIGS. 1-2) 特開２００２−１４９１８０号公報（第４〜８頁、図１〜７）JP 2002-149180 A (pages 4-8, FIGS. 1-7)

しかしながら、従来の方法では、聞き手の語彙力の如何にかかわらず、一律に書き言葉を話し言葉に置き換えていたので、聞き手の語彙力が高い場合には、言葉の置き換えによって却って意味がわかりにくくなってしまう場合があった。例えば、書き言葉であれば簡潔に内容が伝えられるのに、回りくどい表現により、わかりにくくなるなどの場合である。 However, in the conventional method, regardless of the listener's vocabulary, the written words are uniformly replaced with spoken words, so if the listener's vocabulary is high, the meaning becomes difficult to understand by replacing the words. There was a case. For example, the written contents can be communicated in a concise manner, but they can be difficult to understand due to rounded expressions.

また、置き換えられた言葉がわかりやすいかどうかも聞き手の語彙力によって異なる。そのため、一義的な基準によって、一般的にわかりやすいとされる語彙及び表現に変えたとしても、聞き手によっては依然としてわからない場合もあった。 Also, whether the replaced word is easy to understand depends on the vocabulary of the listener. Therefore, even if it is changed to a vocabulary and expression that are generally easy to understand according to the unambiguous criteria, there are cases where it is still not understood by the listener.

本発明は、上記の問題を解決するためになされたものであり、単語を、語彙力に応じた単語に置換できるテキスト置換装置、テキスト音声合成装置、テキスト置換方法、及び、テキスト置換プログラムを提供することを目的とする。 The present invention has been made to solve the above problem, and provides a text replacement device, a text-to-speech synthesis device, a text replacement method, and a text replacement program that can replace a word with a word according to vocabulary. The purpose is to do.

上記目的を達成するために、本発明の第１の観点に係るテキスト置換装置は、
入力テキストから取得される単語を、所定の単語に置換するテキスト置換装置であって、
語彙力を設定して入力するユーザインタフェースと、
単語と、その単語と同等な単語と、前記語彙力と、を記憶する記憶部と、
前記入力テキストから単語を取得し、当該取得された単語を前記記憶部で検索し、検索された単語をその単語と同等な所定の単語に置換するか否かを、前記語彙力に応じて決定する処理手段と、
を備えたことを特徴とする。 In order to achieve the above object, a text replacement device according to a first aspect of the present invention provides:
A text replacement device that replaces a word acquired from input text with a predetermined word,
A user interface to set and input vocabulary,
A storage unit for storing a word, a word equivalent to the word, and the vocabulary
A word is acquired from the input text, the acquired word is searched in the storage unit, and it is determined according to the vocabulary whether to replace the searched word with a predetermined word equivalent to the word. Processing means to
It is provided with.

前記記憶部は、
単語ごとに、当該単語とともに、その単語の難易を指標する単語難易度を記憶し、
前記処理手段は、
前記検索された単語の単語難易度が、前記語彙力を上回るか否かを判定し、
前記単語難易度が前記語彙力を上回る場合に、前記入力テキストから取得され前記記憶部で検索された単語を、所定の単語に置換することが望ましい。 The storage unit
For each word, along with the word, store the word difficulty index that indicates the difficulty of the word,
The processing means includes
Determining whether the word difficulty of the searched word exceeds the vocabulary;
When the word difficulty level exceeds the vocabulary, it is desirable to replace a word acquired from the input text and searched in the storage unit with a predetermined word.

前記記憶部は、
単語ごとに、当該単語とともに、当該単語と置換するその単語と同等な単語の候補を、語彙力の各レベルごとに記憶し、
前記処理手段は、
前記記憶部で、前記入力テキストから取得された単語を検索するとともに、語彙力のレベルを検索し、置換する候補単語を決定するようにしてもよい。 The storage unit
For each word, a word candidate equivalent to the word to be replaced with the word is stored for each level of vocabulary,
The processing means includes
The storage unit may search for a word acquired from the input text and search for a vocabulary level to determine a candidate word to be replaced.

また、前記ユーザインタフェースは、
世代を設定して入力し、
前記記憶部は、
単語ごとに、当該単語とともに、その単語の各世代における常識度を記憶し、
前記処理手段は、
前記入力した世代における前記検索された単語の常識度が所定の閾値を下回るか否かを判定し、
前記検索された単語の常識度が前記所定の閾値を下回る場合に、前記入力テキストから取得され前記記憶部で検索された単語を、所定の単語に置換するようにしてもよい。 The user interface is
Set and enter the generation,
The storage unit
For each word, along with the word, remember the common sense of each word generation,
The processing means includes
Determining whether the common sense of the searched word in the input generation is below a predetermined threshold;
When the common sense of the searched word is lower than the predetermined threshold, the word acquired from the input text and searched in the storage unit may be replaced with a predetermined word.

上記目的を達成するために、本発明の第２の観点に係るテキスト音声合成装置は、
前記テキスト置換装置によって単語が置換されたテキストに基づいて音声を合成する、
ことを特徴とする。 In order to achieve the above object, a text-to-speech synthesizer according to a second aspect of the present invention provides:
Synthesizing speech based on text with words replaced by the text replacement device;
It is characterized by that.

上記目的を達成するために、本発明の第３の観点に係るテキスト置換方法は、
入力テキストから取得される単語を、所定の単語に置換するテキスト置換方法であって、
語彙力を設定して入力し、
単語と、その単語と同等な単語と、前記語彙力と、を記憶し、
前記入力テキストから単語を取得するステップと、
当該取得された単語を検索するステップと、
検索された単語をその単語と同等な所定の単語に置換するか否かを、前記語彙力に応じて決定するステップと、
を備えたことを特徴とする。 In order to achieve the above object, a text replacement method according to a third aspect of the present invention includes:
A text replacement method for replacing a word obtained from input text with a predetermined word,
Set and input vocabulary,
Storing a word, a word equivalent to the word, and the vocabulary
Obtaining a word from the input text;
Searching for the acquired word;
Determining whether to replace a searched word with a predetermined word equivalent to the word, according to the vocabulary; and
It is provided with.

上記目的を達成するために、本発明の第４の観点に係るテキスト置換プログラムは、
入力テキストから取得される単語を、所定の単語に置換するテキスト置換プログラムであって、
コンピュータに、
語彙力を設定して入力し、
単語と、その単語と同等な単語と、前記語彙力と、を記憶し、
前記入力テキストから単語を取得するステップと、
当該取得された単語を検索するステップと、
検索された単語をその単語と同等な所定の単語に置換するか否かを、前記語彙力に応じて決定するステップと、
を実行させることを特徴とする。 In order to achieve the above object, a text replacement program according to the fourth aspect of the present invention provides:
A text replacement program for replacing a word acquired from input text with a predetermined word,
On the computer,
Set and input vocabulary,
Storing a word, a word equivalent to the word, and the vocabulary
Obtaining a word from the input text;
Searching for the acquired word;
Determining whether to replace a searched word with a predetermined word equivalent to the word, according to the vocabulary; and
Is executed.

本発明によれば、単語を、語彙力に応じた単語に置換できるテキスト置換装置、テキスト音声合成装置、テキスト置換方法、及び、テキスト置換プログラムを提供することができる。 According to the present invention, it is possible to provide a text replacement device, a text-to-speech synthesizer, a text replacement method, and a text replacement program that can replace a word with a word according to vocabulary.

以下、図面を参照して本発明の実施の形態について説明する。
まず、本実施形態に係るテキスト置換装置の物理的構成について図１を参照して説明する。 Embodiments of the present invention will be described below with reference to the drawings.
First, the physical configuration of the text replacement device according to the present embodiment will be described with reference to FIG.

《実施形態１》
図１に示すテキスト置換装置１００は、ユーザインタフェース１０１と、記憶部１０２と、ＣＰＵ（Central Processing Unit）１０３と、を備えている。 Embodiment 1
A text replacement device 100 shown in FIG. 1 includes a user interface 101, a storage unit 102, and a CPU (Central Processing Unit) 103.

ユーザインターフェース１０１は、例えば、キーボードやモニタから構成され、本装置のユーザが各種の情報を直接入力し得る。これは、インタフェース回路１０４を介して内部バス１０５に接続されている。 The user interface 101 is composed of, for example, a keyboard and a monitor, and the user of this apparatus can directly input various kinds of information. This is connected to the internal bus 105 via the interface circuit 104.

記憶部１０２は、ハードディスク１０６と、ＲＡＭ（Random Access Memory）１０７とを備えている。 The storage unit 102 includes a hard disk 106 and a RAM (Random Access Memory) 107.

ハードディスク１０６は、磁気ディスクにより構成されており、大容量であり、電源の切断後も記憶内容を保持できる。 The hard disk 106 is composed of a magnetic disk, has a large capacity, and can retain stored contents even after the power is turned off.

ＲＡＭ１０７は、半導体メモリから成り、揮発性である。電源の投入中のみ記憶内容を保持するが、ハードディスク１０６より高速にアクセス可能であり、ハードディスク１０６の記憶内容を電源の投入中に一時的に読み出して処理するためのワークエリアとして使用される。 The RAM 107 is composed of a semiconductor memory and is volatile. The stored contents are retained only while the power is turned on, but can be accessed at a higher speed than the hard disk 106 and used as a work area for temporarily reading and processing the stored contents of the hard disk 106 while the power is turned on.

ＣＰＵ１０３は、内部バス１０５を介して記憶部１０２と相互に接続されており、記憶部１０２に記憶されたプログラムを逐次読み出してその命令に従ってデータを処理し、その結果を記憶部１０２に書き込む。 The CPU 103 is mutually connected to the storage unit 102 via the internal bus 105, sequentially reads out the program stored in the storage unit 102, processes the data according to the command, and writes the result in the storage unit 102.

また、インタフェース回路１０４は、ネットワーク３００と接続されており、ネットワーク３００を介して、ネットワーク３００に接続された図示しないデータ処理装置から記憶部１０２にデータを転送し、テキスト置換を行った結果、得られたデータを該データ処理装置に転送し得る。 Further, the interface circuit 104 is connected to the network 300. As a result of transferring the data from the data processing device (not shown) connected to the network 300 to the storage unit 102 via the network 300 and performing text substitution, the interface circuit 104 is obtained. Transferred data may be transferred to the data processing device.

次に、本実施形態に係るテキスト音声合成装置の物理的構成について図１を参照して説明する。 Next, the physical configuration of the text-to-speech synthesizer according to this embodiment will be described with reference to FIG.

テキスト置換装置１００に、スピーカ２０１と、サウンドボード２０２と、を追加することによって、テキスト音声合成装置２００が構成される。 A text-to-speech synthesizer 200 is configured by adding a speaker 201 and a sound board 202 to the text replacement device 100.

スピーカ２０１は、合成音声を出力できるもので足り、高性能であることを要しない。サウンドボード２０２は、スピーカ２０１と内部バス１０５との間に設けられ、ＣＰＵ１０３が出力する合成音声データに対応してスピーカ２０１を駆動する信号を出力する。 The speaker 201 need only be capable of outputting synthesized speech, and does not need to have high performance. The sound board 202 is provided between the speaker 201 and the internal bus 105, and outputs a signal for driving the speaker 201 corresponding to the synthesized voice data output from the CPU 103.

次に、実施形態１に係るテキスト置換装置１００の動作について説明する。
図１の装置において、ＣＰＵ１０３は、まず、記憶部１０２へテキスト置換辞書を記憶し、次に、記憶部１０２へ語彙力を記憶し、そして、記憶部１０２へ入力テキストを記憶する。そうして、記憶部１０２上でテキスト置換を行う。 Next, the operation of the text replacement device 100 according to the first embodiment will be described.
In the apparatus of FIG. 1, the CPU 103 first stores the text replacement dictionary in the storage unit 102, then stores the vocabulary in the storage unit 102, and stores the input text in the storage unit 102. Then, text replacement is performed on the storage unit 102.

（記憶部１０２へのテキスト置換辞書の記憶）
ユーザがユーザインタフェース１０１により所定の指示を出すことにより、ＣＰＵ１０３は、ネットワーク３００を経由して外部からテキスト置換辞書を入力し、記憶部１０２に記憶する。テキスト置換辞書は、単語と、その単語と同等な単語と、を記憶している。例えば、図４に示す内容であるが、図４の具体的な内容については後述のテキスト置換において説明する。 (Storage of text replacement dictionary in storage unit 102)
When the user issues a predetermined instruction through the user interface 101, the CPU 103 inputs a text replacement dictionary from the outside via the network 300 and stores it in the storage unit 102. The text replacement dictionary stores a word and a word equivalent to the word. For example, the contents shown in FIG. 4 will be described in the text replacement described later.

（記憶部１０２への語彙力の記憶）
ユーザがユーザインタフェース１０１により語彙力を入力することにより、ＣＰＵ１０３は、これを記憶部１０２に記憶する。ここで、語彙力とは、ユーザがどれだけ難解な語彙まで、その意味や使い方を理解できているかを指標するパラメータである。このパラメータは、その数値が高いほど語彙力が高いとする。 (Storing vocabulary in the storage unit 102)
When the user inputs vocabulary through the user interface 101, the CPU 103 stores this in the storage unit 102. Here, the vocabulary is a parameter that indicates how difficult the vocabulary the user can understand the meaning and usage. It is assumed that this parameter has higher vocabulary as the value is higher.

（記憶部１０２への入力テキストの記憶）
ネットワーク３００を経由して図示しない他のデータ処理装置から転送データを受信することにより、ＣＰＵ１０３は、その受信した転送データを記憶部１０２に記憶させる。この受信により、ＣＰＵ１０３は、テキスト置換を行う。 (Storage of input text to storage unit 102)
By receiving transfer data from another data processing apparatus (not shown) via the network 300, the CPU 103 stores the received transfer data in the storage unit 102. Upon receiving this, the CPU 103 performs text replacement.

（記憶部１０２上でのテキスト置換）
ＣＰＵ１０３は、図２に示すテキスト置換（ステップＳ１）において、入力テキストから単語を取得して形態素解析（ステップＳ３）を行い、当該取得された単語を記憶部１０２で検索し、検索された単語をその単語と同等な所定の単語に置換する単語置換（ステップＳ４）を行うか否かを、語彙力レベルに応じて決定する。 (Text replacement on the storage unit 102)
In the text replacement shown in FIG. 2 (step S1), the CPU 103 acquires a word from the input text, performs morphological analysis (step S3), searches the storage unit 102 for the acquired word, and selects the searched word. It is determined according to the vocabulary level whether or not to perform word replacement (step S4) for replacing with a predetermined word equivalent to the word.

図３は、図２のステップＳ３の処理の詳細を示したものである。ＣＰＵ１０３は、カウンタｊを設定し、ｊ＝１に初期化設定する（ステップＳ１４）。そして、記憶部１０２に記憶されている形態素解析辞書３を用いて、任意の既知の手法により、入力テキストに対し形態素解析を行う（ステップＳ１５）。次に、形態素解析を行った結果からｊ番目の形態素情報morpheme（ｊ）を抽出する（ステップＳ１６）。 FIG. 3 shows details of the processing in step S3 of FIG. The CPU 103 sets a counter j and initializes j = 1 (step S14). Then, using the morphological analysis dictionary 3 stored in the storage unit 102, morphological analysis is performed on the input text by any known method (step S15). Next, j-th morpheme information morpheme (j) is extracted from the result of the morpheme analysis (step S16).

続いて、入力テキストの文末まで解析が終わったか否かを判別する（ステップＳ１７）。終わっていないと判別された場合（ステップＳ１７；Ｎｏ）、次の形態素情報を抽出するため、カウンタｊを１増加してから（ステップＳ１８）、ステップＳ１６に戻る。
一方、ステップＳ１７で、上記解析が終わったと判別された場合（ステップＳ１７；Ｙｅｓ）、カウンタｊ値を数値Ｎmorphemeに移し（ステップＳ１９）、形態素解析を終了する。 Subsequently, it is determined whether or not the analysis has been completed up to the end of the input text (step S17). If it is determined that it has not been completed (step S17; No), the counter j is incremented by 1 to extract the next morpheme information (step S18), and the process returns to step S16.
On the other hand, if it is determined in step S17 that the analysis has been completed (step S17; Yes), the counter j value is moved to the numerical value Nmorpheme (step S19), and the morphological analysis is terminated.

この結果、入力テキストが形態素情報列morpheme（ｋ）(１≦ｋ≦Ｎmorpheme)に変換される。そして、Ｎmorphemeが、入力テキストから抽出された形態素の数として取得される。 As a result, the input text is converted into a morpheme information string morpheme (k) (1 ≦ k ≦ Nmorpheme). Nmorpheme is acquired as the number of morphemes extracted from the input text.

次に、形態素情報列morpheme（ｋ）(１≦ｋ≦Ｎmorpheme)に対する単語置換（ステップＳ４）の詳細について図５を参照して説明する。 Next, details of word replacement (step S4) for the morpheme information string morpheme (k) (1 ≦ k ≦ Nmorpheme) will be described with reference to FIG.

ＣＰＵ１０３は、カウンタｋを設定し、ｋ＝１に初期化設定する（ステップＳ２１）。そして、記憶部１０２に記憶されているテキスト置換辞書４で形態素情報morpheme（ｋ）を検出する（ステップＳ２２）。 The CPU 103 sets a counter k and initializes k = 1 (step S21). Then, the morpheme information morpheme (k) is detected from the text replacement dictionary 4 stored in the storage unit 102 (step S22).

続いて、ＣＰＵ１０３は、形態素情報morpheme（ｋ）に対応する単語難易度VocabularyDifficulty（ｋ）を語彙力レベルVocabularyLevelと比較する（ステップＳ２３）。そして、次式（１） Subsequently, the CPU 103 compares the word difficulty level VocabularyDifficulty (k) corresponding to the morpheme information morpheme (k) with the vocabulary level VocabularyLevel (step S23). And the following formula (1)

単語難易度VocabularyDifficulty（ｋ）＞語彙力レベルVocabularyLevel （１）
を満たす場合（ステップＳ２３；Ｙｅｓ）、ＣＰＵ１０３は、図４に示す置換前単語情報４１である形態素情報morpheme（ｋ）を、置換後単語情報４３である形態素情報Morpheme（ｋ）に置換する（ステップＳ２４）。 Word difficulty VocabularyDifficulty (k)> VocabularyLevel (1)
When satisfy | filling (step S23; Yes), CPU103 substitutes the morpheme information Morpheme (k) which is the word information 43 after substitution for the morpheme information morpheme (k) which is the word information 41 before substitution shown in FIG. S24).

前記式（１）を満たさない場合（ステップＳ２３；Ｎｏ）、ＣＰＵ１０３は、置換前単語情報４１である形態素情報morpheme（ｋ）を、置換後単語情報４３である形態素情報Morpheme（ｋ）に置換しない。 When the expression (1) is not satisfied (step S23; No), the CPU 103 does not replace the morpheme information morpheme (k) that is the pre-substitution word information 41 with the morpheme information Morpheme (k) that is the post-substitution word information 43. .

続いて、カウンタｋがＮmorphemeに達し、形態素解析で抽出されたすべての形態素情報について置換を終えたか否かが判別される（ステップＳ２５）。すべての形態素情報について置換を終えていないと判別された場合（ステップＳ２５；Ｎｏ）、カウンタｋを１増加してから（ステップＳ２６）、ステップＳ２２に戻り、ステップＳ２２以降の処理を再度行う。
一方、すべての形態素情報について置換を終えたと判別された場合（ステップＳ２５；Ｙｅｓ）、単語置換の処理を終了する。 Subsequently, it is determined whether or not the counter k has reached Nmorpheme and all the morpheme information extracted by the morpheme analysis has been replaced (step S25). When it is determined that the replacement has not been completed for all morpheme information (step S25; No), the counter k is incremented by 1 (step S26), the process returns to step S22, and the processes after step S22 are performed again.
On the other hand, when it is determined that the replacement has been completed for all the morpheme information (step S25; Yes), the word replacement process is terminated.

以上の処理を行うことで、例えば、図６に示す「事実を歪曲して報告する。」という入力テキストの形態素情報列morpheme（ｋ）(１≦ｋ≦Ｎmorpheme)についてテキスト置換が施されたことになる。ここで、テキスト置換が施された形態素情報列をMorpheme（ｋ）(１≦ｋ≦Ｎmorpheme)とする。 By performing the above processing, for example, the text substitution has been performed on the morpheme information sequence morpheme (k) (1 ≦ k ≦ Nmorpheme) of the input text “distorted and reported fact” shown in FIG. become. Here, Morpheme (k) (1 ≦ k ≦ Nmorpheme) is assumed to be a morpheme information sequence subjected to text replacement.

次に、本実施形態に係るテキスト音声合成装置の動作について説明する。
テキスト音声合成装置の場合、図２において、テキスト置換（ステップＳ１）を行った後、音声合成（ステップＳ２）を行う。 Next, the operation of the text-to-speech synthesizer according to this embodiment will be described.
In the case of a text-to-speech synthesizer, in FIG. 2, after text substitution (step S1), speech synthesis (step S2) is performed.

ＣＰＵ１０３は、音声合成（ステップＳ２）において、形態素情報列に対し、韻律設定を行う（ステップＳ５）。すなわち、テキスト置換が施された形態素情報列Morpheme（ｋ）(１≦ｋ≦Ｎmorpheme)に含まれる各Morpheme（ｋ）に任意の既知の手法により韻律情報を付加して韻律情報付ラベル列Label（ｍ）(１≦ｍ≦ＮLabel)を生成する。 In the speech synthesis (step S2), the CPU 103 performs prosody setting for the morpheme information string (step S5). In other words, prosody information is added to each Morpheme (k) included in the morpheme information sequence Morpheme (k) (1 ≦ k ≦ Nmorpheme) subjected to text substitution by any known method to label the label sequence with prosodic information Label ( m) (1 ≦ m ≦ N Label) is generated.

そして、韻律情報付ラベル列Label（ｍ）(１≦ｍ≦ＮLabel)について音声合成辞書５を用いて、音声合成を行い、合成音声SynthesizeSpeechを出力する（ステップＳ６）。 Then, the speech synthesis dictionary 5 is used to synthesize speech for the label string with prosodic information Label (m) (1 ≦ m ≦ N Label), and a synthesized speech SynthesizeSpeech is output (step S6).

以上の処理により、図７に示すように、例えば、語彙力レベルが「１」の聞き手に対しては、次のように動作する。この聞き手に対し、「歪曲」と「報告」の単語難易度４２が語彙力レベルを上回る。そして、「歪曲」は「ゆがめて」に、「報告」は「しらせる」に置換される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、「じじつをゆがめてしらせる。」という合成音声で出力されて語彙レベルが「１」の聞き手にとってわかりやすい音声が得られる。 With the above processing, as shown in FIG. 7, for example, the following operation is performed for a listener whose vocabulary level is “1”. For this listener, the word difficulty level 42 of “distortion” and “report” exceeds the vocabulary level. Then, “distortion” is replaced with “distortion”, and “report” is replaced with “shrimp”. As a result, the input text “Factual fact is distorted and reported” shown in FIG. 6 is output as a synthesized speech “Make the word distorted”, and a speech that is easy to understand for a listener whose vocabulary level is “1”. can get.

また、図７に示すように、例えば、語彙力レベルが「２」から「５」までの聞き手に対しては、「歪曲」の単語難易度４２のみが語彙力レベルを上回り、「歪曲」のみが「ゆがめて」に置換される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、「じじつをゆがめてほうこくする。」という合成音声で出力される。よって、「報告」のような、語彙力レベルが「２」以上の者が話し言葉としても使っている書き言葉は、あえて言い換える必要がないので、そのまま残される。これにより、語彙力レベルが「２」から「５」までの聞き手にとってわかりやすい音声が得られる。 Also, as shown in FIG. 7, for example, for listeners with vocabulary level “2” to “5”, only the word difficulty level 42 of “distortion” exceeds the vocabulary level, and only “distortion”. Is replaced with “distorted”. As a result, the input text “Report the fact is distorted” shown in FIG. Therefore, written words such as “report” that are used as spoken words by people with a vocabulary level of “2” or higher do not need to be rephrased and are left as they are. As a result, a voice that is easy to understand for listeners with vocabulary level "2" to "5" can be obtained.

さらに、図７に示すように、語彙力レベルが「６」の聞き手に対しては、すべての単語の単語難易度４２が語彙力レベルを上回ることがなく、すべての単語がそのまま出力される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、そのまま読まれて「じじつをわいきょくしてほうこくする。」という合成音声で出力される。これにより、あえて言葉を言い換える必要のない語彙レベルが「６」の聞き手にとってわかりやすい音声が得られる。 Furthermore, as shown in FIG. 7, for a listener with a vocabulary level of “6”, the word difficulty level 42 of all words does not exceed the vocabulary level, and all words are output as they are. As a result, the input text “Report the fact is distorted” shown in FIG. 6 is read as it is, and is output as a synthesized speech “I am going to make a mistake. As a result, it is possible to obtain a voice that is easy to understand for a listener whose vocabulary level is “6” and does not need to be reworded.

このように、各語彙力レベルの聞き手に対してそれぞれ適合した合成音声が出力される。 In this way, synthesized speech that is suitable for each vocabulary level listener is output.

《実施形態２》
実施形態２は、図２に示す実施形態１におけるテキスト置換辞書４に代えて、図８に示すテキスト置換辞書６を用い、テキスト置換（図２のステップＳ１）において実施形態１とは異なる処理手順でテキスト置換を行う。 << Embodiment 2 >>
In the second embodiment, a text replacement dictionary 6 shown in FIG. 8 is used instead of the text replacement dictionary 4 in the first embodiment shown in FIG. 2, and the text processing (step S1 in FIG. 2) is different from the first embodiment. Replace text with.

実施形態２に係るテキスト音声合成装置の物理的構成については図１に示すものと同様である。
テキスト置換辞書６は、置換前単語情報６１と、置換後単語情報６２を格納している。 The physical configuration of the text-to-speech synthesizer according to the second embodiment is the same as that shown in FIG.
The text replacement dictionary 6 stores pre-substitution word information 61 and post-substitution word information 62.

置換前単語情報６１は、形態素解析により得られた置換前の単語情報であり、図示の例では、「事実」、「歪曲」、「報告」など、入力テキスト、例えば、「事実を歪曲して報告する。」に含まれる単語である。 Pre-substitution word information 61 is pre-substitution word information obtained by morphological analysis. In the illustrated example, input text such as “facts”, “distortion”, “report”, etc. It is a word that is included in "Report."

置換後単語情報６２は、置換前単語情報４１に対応する置換後の単語である。この単語は、合成音声を受ける聞き手の語彙レベルに応じて複数又は単数の候補が決められている。 The post-substitution word information 62 is a post-substitution word corresponding to the pre-substitution word information 41. A plurality or a single candidate of this word is determined according to the vocabulary level of the listener who receives the synthesized speech.

次に、実施形態２に係る装置の動作について図２及び図９を参照して説明する。なお、本実施形態の装置の構成については図１を参照する。 Next, the operation of the apparatus according to the second embodiment will be described with reference to FIGS. Note that FIG. 1 is referred to for the configuration of the apparatus of the present embodiment.

まず、上述のテキスト置換装置１００は、記憶部１０２に記憶されている入力テキストに形態素解析を施す（図２のステップＳ３）。これは、図３に示す実施形態１に係る手順と同様の手順で行われる。 First, the text replacement device 100 performs morphological analysis on the input text stored in the storage unit 102 (step S3 in FIG. 2). This is performed in the same procedure as the procedure according to the first embodiment shown in FIG.

次に、該テキスト音声合成装置２００は、図９に従って形態素情報列morpheme（ｋ）(１≦ｋ≦Ｎmorpheme)にテキスト置換を施す。まず、ＣＰＵ１０３は、カウンタｋを設定し、ｋ＝１に初期化設定する（ステップＳ３１）。そして、記憶部１０２に記憶されているテキスト置換辞書６で形態素情報morpheme（ｋ）を検出する（ステップＳ３２）。 Next, the text-to-speech synthesizer 200 performs text substitution on the morpheme information string morpheme (k) (1 ≦ k ≦ Nmorpheme) according to FIG. First, the CPU 103 sets a counter k and initializes k = 1 (step S31). Then, the morpheme information morpheme (k) is detected from the text replacement dictionary 6 stored in the storage unit 102 (step S32).

続いて、ＣＰＵ１０３は、形態素情報morpheme（ｋ）を、語彙力レベルVocabularyLevelに対応する候補の形態素情報Morpheme（ｋ）に置換する（ステップＳ３３）。 Subsequently, the CPU 103 replaces the morpheme information morpheme (k) with candidate morpheme information Morpheme (k) corresponding to the vocabulary level VocabularyLevel (step S33).

続いて、カウンタｋがＮmorphemeに達したか否かが判別される（ステップＳ３４）。達していないと判別された場合（ステップＳ３４；Ｎｏ）、カウンタｋを１増加してから（ステップＳ３５）、ステップＳ３２に戻り、達したと判別された場合（ステップＳ３４；Ｙｅｓ）、終了する。 Subsequently, it is determined whether or not the counter k has reached Nmorpheme (step S34). If it is determined that it has not been reached (step S34; No), the counter k is incremented by 1 (step S35), then the process returns to step S32, and if it is determined that it has been reached (step S34; Yes), the process ends.

これにより、図７に示すように、例えば、語彙力レベルが「１」の聞き手に対しては、「歪曲」は「ゆがめて」に、「報告」は「しらせる」に置換される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、「じじつをゆがめてしらせる。」という合成音声で出力されて語彙力レベルが「１」の聞き手にとってわかりやすい音声が得られる。 As a result, as shown in FIG. 7, for a listener whose vocabulary level is “1”, for example, “distortion” is replaced with “distorted” and “report” is replaced with “she let”. As a result, the input text “Factual fact is distorted and reported” shown in FIG. 6 is output as a synthesized speech that “distorts the truth” and is easy to understand for a listener whose vocabulary level is “1”. Is obtained.

また、図７に示すように、例えば、語彙力レベルが「２」から「５」までの聞き手に対しては、「歪曲」のみが「ゆがめて」に置換される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、「じじつをゆがめてほうこくする。」という合成音声で出力される。よって、「報告」という、語彙力レベルが「２」以上の者が話し言葉としても使っているようなあえて言い換える必要のない書き言葉は、そのまま残される。これにより、語彙力レベルが「２」から「５」までの聞き手にとってわかりやすい音声が得られる。 Also, as shown in FIG. 7, for example, only “distortion” is replaced with “distorted” for listeners with vocabulary level “2” to “5”. As a result, the input text “Report the fact is distorted” shown in FIG. Therefore, a written word that does not need to be reworded, such as “report”, which is used as a spoken word by a person whose vocabulary level is “2” or higher, is left as it is. As a result, a voice that is easy to understand for listeners with vocabulary level "2" to "5" can be obtained.

さらに、図７に示すように、語彙力レベルが「６」の聞き手に対しては、すべての単語がそのまま出力される。その結果、図５に示す「事実を歪曲して報告する。」という入力テキストは、そのまま読まれて「じじつをわいきょくしてほうこくする。」という合成音声で出力され、あえて言葉を言い換える必要のない語彙力レベルが「６」の聞き手にとってわかりやすい音声が得られる。 Further, as shown in FIG. 7, all the words are output as they are to the listener whose vocabulary level is “6”. As a result, the input text “Report the fact is distorted and reported” shown in FIG. 5 is read as it is and is output with the synthesized speech “I am going to make a mess.” Voices that are easy to understand for listeners with a vocabulary level of "6" without speech.

《実施形態３》
実施形態３は、図４に示す実施形態１におけるテキスト置換辞書４における単語難易度４２に代えて、世代毎の単語に対する常識度という概念を用い、第１の実施形態と酷似した処理手順でテキスト置換（図２のステップＳ１）を行う。ここで、常識度とは、単語の語彙及び表現、意味及び用法が、同一世代の中で知られている割合をいう。 << Embodiment 3 >>
In the third embodiment, instead of the word difficulty level 42 in the text replacement dictionary 4 in the first embodiment shown in FIG. 4, the concept of common sense with respect to words for each generation is used, and text is processed in a processing procedure very similar to that of the first embodiment. Replacement (step S1 in FIG. 2) is performed. Here, common sense refers to the rate at which the vocabulary and expression of words, meanings, and usage are known within the same generation.

つまり、実施形態１では、世代にかかわらず、すべての人々の中で知られている割合で単語の難易度を指標して言葉の置き換えを行った。これに対し、本実施形態では、話す言葉を共有する世代毎に各単語についてその世代の人々の中でその単語を知っている人の数が異なることから、その世代毎に各単語を知っている人の割合を求める。そして、その割合をその世代におけるいわば常識度の指標として採用する。 That is, in the first embodiment, the word replacement is performed by indicating the difficulty level of the word at a rate known among all people regardless of the generation. On the other hand, in this embodiment, since the number of people who know the word among the people of that generation for each generation sharing the spoken word is different, it is necessary to know each word for each generation. Find the percentage of people who are. The ratio is adopted as an index of common sense in the generation.

図１０に示すテキスト置換辞書７は、置換前単語情報７１と、世代毎の常識度７２と、置換後単語情報７３と、を格納している。 The text replacement dictionary 7 shown in FIG. 10 stores pre-substitution word information 71, common sense 72 for each generation, and post-substitution word information 73.

置換前単語情報７１は、形態素解析により得られた置換前の単語情報である。図１０に示す例では、「事実」、「歪曲」、「報告」などであり、入力テキスト、例えば、「事実を歪曲して報告する。」に含まれる単語である。 Pre-substitution word information 71 is pre-substitution word information obtained by morphological analysis. In the example illustrated in FIG. 10, “facts”, “distortion”, “report”, and the like are words included in the input text, for example, “report the fact by distortion”.

世代毎の常識度７２は、置換前単語情報７１としてテキスト置換辞書７に格納されている各単語ごとに設けられる。この世代毎の常識度７２は、単語の意味及び用法が、同一世代の中で知られている割合を、例えば、％値で示す。 The common sense 72 for each generation is provided for each word stored in the text replacement dictionary 7 as the pre-substitution word information 71. The common sense 72 for each generation indicates, for example, a percentage value in which the meaning and usage of the word are known in the same generation.

置換後単語情報７３は、置換前単語情報７１に対応する、世代毎の常識度の高い単語である。図示の例では、「歪曲」という言葉に対し、いずれの世代においても、「ゆがめる」を置換後の単語としている。また、「報告」という言葉に対し、世代「Ａ」と「Ｂ」で「しらせる」を置換後の単語とし、世代「Ｃ」で「ほうこくする」を置換後の単語としている。また、「事実」という言葉に対し、いずれの世代においても、「じじつ」を置換後の単語としており、言葉を置き換えないこととしている。 The post-substitution word information 73 is a word having a high level of common sense for each generation corresponding to the pre-substitution word information 71. In the example shown in the figure, the word “distorted” is replaced with “distorted” in any generation. In addition, for the word “report”, the words “replace” in the generations “A” and “B” are the replacement words, and the word “report” is the replacement word in the generation “C”. In addition, for the word “facts”, “jijitsu” is used as the replacement word in all generations, and the word is not replaced.

次に、実施形態３に係るテキスト置換装置の動作について図１１を参照して説明する。なお、本実施形態の装置の構成については図１を参照する。 Next, the operation of the text replacement device according to the third embodiment will be described with reference to FIG. Note that FIG. 1 is referred to for the configuration of the apparatus of the present embodiment.

まず、ＣＰＵ１０３は、カウンタｋを設定し、ｋ＝１に初期化設定する（ステップＳ４１）。そして、記憶部１０２に記憶されているテキスト置換辞書７で形態素情報morpheme（ｋ）を検出する（ステップＳ４２）。 First, the CPU 103 sets a counter k and initializes k = 1 (step S41). Then, the morpheme information morpheme (k) is detected in the text replacement dictionary 7 stored in the storage unit 102 (step S42).

続いて、ＣＰＵ１０３は、形態素情報morpheme（ｋ）に対応する世代毎における常識度CommonSenseRate[Generation](ｋ)を所定の閾値ＴＨCommonSenseと比較する（ステップＳ４３）。そして、次式（２） Subsequently, the CPU 103 compares the common sense CommonSenseRate [Generation] (k) for each generation corresponding to the morpheme information morpheme (k) with a predetermined threshold value THCommonSense (step S43). And the following formula (2)

世代毎における常識度CommonSenseRate[Generation](ｋ)＜所定の閾値ＴＨCommonSense（２）
を満たすか否かを判定する。これにより、常識度が所定の度合いよりも低いかどうかを判定する。例えば、図１０の例で、ＴＨCommonSense＝７０％とすると、「歪曲」は、世代ＡとＢで常識度が低いと判定され、世代Ｃで常識度が高いと判定される。また、「報告」は、世代Ａで常識度が低いと判定され、世代ＢとＣで常識度が高いと判定される。 Common sense rate for each generation CommonSenseRate [Generation] (k) <predetermined threshold TH CommonSense (2)
It is determined whether or not the above is satisfied. Thereby, it is determined whether the common sense level is lower than a predetermined degree. For example, in the example of FIG. 10, if THCommonSense = 70%, “distortion” is determined to be low in common sense in generations A and B, and is determined as high in common sense in generation C. In addition, “report” is determined to be low in common sense in generation A, and is determined to be high in common sense in generations B and C.

前記式（２）を満たす場合（ステップＳ４３；Ｙｅｓ）、ＣＰＵ１０３は、図１０に示す置換前単語情報７１である形態素情報morpheme（ｋ）を、置換後単語情報７３である形態素情報Morpheme（ｋ）に置換する（ステップＳ４４）。 When the expression (2) is satisfied (step S43; Yes), the CPU 103 replaces the morpheme information morpheme (k), which is the pre-substitution word information 71 shown in FIG. 10, with the morpheme information Morpheme (k), the post-substitution word information 73. (Step S44).

前記式（２）を満たさない場合（ステップＳ４３；Ｎｏ）、ＣＰＵ１０３は、置換前単語情報７１である形態素情報morpheme（ｋ）を、置換後単語情報７３である形態素情報Morpheme（ｋ）に置換しない。 When the expression (2) is not satisfied (step S43; No), the CPU 103 does not replace the morpheme information morpheme (k) that is the pre-substitution word information 71 with the morpheme information Morpheme (k) that is the post-substitution word information 73. .

続いて、カウンタｋがＮmorphemeに達したか否かが判別される（ステップＳ４５）。達していないと判別された場合（ステップＳ４５；Ｎｏ）、カウンタｋを１増加してから（ステップＳ４６）、ステップＳ４２に戻り、達したと判別された場合（ステップＳ４５；Ｙｅｓ）、終了する。 Subsequently, it is determined whether or not the counter k has reached Nmorpheme (step S45). If it is determined that it has not been reached (step S45; No), the counter k is incremented by 1 (step S46), then the process returns to step S42, and if it is determined that it has been reached (step S45; Yes), the process ends.

テキスト置換辞書７において、例えば、「歪曲」という置換前単語情報７１に関しては、世代毎における常識度７２が世代Ａで１０％、世代Ｂで５０％、世代Ｃで８０％に設定されている。「報告」という置換前単語情報７１に関しては、世代Ａで２０％、世代Ｂで８０％、世代Ｃで８０％に設定されている。また、「事実」という置換前単語情報７１に関しては、世代Ａ，Ｂ，Ｃで９０％に設定されている。 In the text replacement dictionary 7, for example, regarding the pre-substitution word information 71 of “distortion”, the common sense 72 for each generation is set to 10% for generation A, 50% for generation B, and 80% for generation C. The pre-substitution word information 71 “report” is set to 20% for generation A, 80% for generation B, and 80% for generation C. The pre-substitution word information 71 “facts” is set to 90% for generations A, B, and C.

これにより、図１２に示すように、例えば、世代Ａの聞き手に対しては、「歪曲」と「報告」の常識度７２が所定の閾値７０％を下回り、「歪曲」は「ゆがめて」に、「報告」は「しらせる」に置換される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、「じじつをゆがめてしらせる。」という合成音声で出力されて世代Ａの聞き手にとってわかりやすい音声が得られる。 Accordingly, as shown in FIG. 12, for example, for a generation A listener, the common sense 72 of “distortion” and “report” falls below a predetermined threshold of 70%, and “distortion” becomes “distorted”. , "Report" is replaced with "she let". As a result, the input text “Report the fact is distorted and reported” shown in FIG. 6 is output as a synthesized speech “Make the word distorted”, and a speech easy to understand for the generation A listener is obtained.

また、図１２に示すように、例えば、世代Ｂの聞き手に対しては、「歪曲」の常識度７２が所定の閾値７０％を下回り、「歪曲」のみが「ゆがめて」に置換される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、「じじつをゆがめてほうこくする。」という合成音声で出力される。よって、「報告」という、世代Ｂの間で話し言葉としても使っている書き言葉は、あえて言い換える必要がなく、そのまま残される。これにより、世代Ｂの聞き手にとってわかりやすい音声が得られる。 As shown in FIG. 12, for example, for the listener of generation B, the commonness 72 of “distortion” falls below a predetermined threshold of 70%, and only “distortion” is replaced with “distortion”. As a result, the input text “Report the fact is distorted” shown in FIG. Therefore, the written word “report” that is also used as a spoken word among generations B does not need to be reworded and is left as it is. As a result, a voice that is easy for a generation B listener to understand is obtained.

さらに、図１２に示すように、世代Ｃの聞き手に対しては、すべての単語の常識度７２が所定の閾値を下回ることがなく、すべての単語がそのまま出力される。その結果、図６に示す「事実を歪曲して報告する。」という入力テキストは、そのまま読まれて「じじつをわいきょくしてほうこくする。」という合成音声で出力され、あえて言葉を言い換える必要のない世代Ｃの聞き手にとってわかりやすい音声が得られる。 Furthermore, as shown in FIG. 12, for the listener of generation C, the common sense level 72 of all words does not fall below a predetermined threshold value, and all the words are output as they are. As a result, the input text “Report the fact is distorted and reported” shown in FIG. 6 is read as it is and is output with a synthesized speech of “I am going to make a mistake.” It is necessary to rephrase the word. Sounds that are easy to understand for generation C listeners with no sound.

このように、各世代Ａ，Ｂ，Ｃの聞き手に対してそれぞれ適合した合成音声が出力される。 In this way, synthesized speech adapted to each generation A, B, C listener is output.

以上、本発明の実施の形態について説明したが、本発明を実施するにあたっては、種々の変形及び応用が可能であり、上記実施の形態に限られるものではない。 While the embodiments of the present invention have been described above, various modifications and applications can be made in implementing the present invention, and the present invention is not limited to the above embodiments.

例えば、上記実施の形態では、形態素解析と、テキスト置換とを別々に行ったが、同時に行うようにしてもよい。これは、テキスト置換辞書４，６，７にそれぞれ格納されている置換前単語情報４１，６１，７１に対応する単語難易度４２、置換後単語情報４３，６２，７３、世代ごとの常識度７２を、形態素解析辞書３に格納されている登録言語に対応する情報として当該形態素解析辞書３に格納することにより、行える。 For example, in the above embodiment, morphological analysis and text replacement are performed separately, but they may be performed simultaneously. This is because the word difficulty level 42 corresponding to the pre-substitution word information 41, 61, 71 stored in the text substitution dictionaries 4, 6, 7 and 7, the post-substitution word information 43, 62, 73, and the common sense 72 for each generation, respectively. Is stored in the morpheme analysis dictionary 3 as information corresponding to the registered language stored in the morpheme analysis dictionary 3.

また、上記各実施形態では、プログラムをＣＤ−ＲＯＭなどの記録媒体から汎用のコンピュータにインストールして本発明装置を構成する場合について説明したが、プログラムを予め記憶したＲＯＭ（Read Only Memory）等を内部バスに接続して本発明装置を構成するようにしてもよい。 In each of the above embodiments, the case where the apparatus of the present invention is configured by installing a program from a recording medium such as a CD-ROM into a general-purpose computer has been described. However, a ROM (Read Only Memory) that stores the program in advance is used. The device of the present invention may be configured by connecting to an internal bus.

本発明のテキスト置換装置及びテキスト音声合成装置の構成の一例を示す図である。It is a figure which shows an example of a structure of the text substitution apparatus and text-to-speech synthesizer of this invention. 図１の装置におけるＣＰＵで実行されるプログラム構成の一例を示す図である。It is a figure which shows an example of the program structure performed with CPU in the apparatus of FIG. 図２における形態素解析の手順を示すフローチャートである。It is a flowchart which shows the procedure of the morphological analysis in FIG. 図２におけるテキスト置換辞書の構成の一例を示す図である。It is a figure which shows an example of a structure of the text replacement dictionary in FIG. 実施形態１に係るテキスト置換の処理手順を示すフローチャートである。6 is a flowchart illustrating a text replacement processing procedure according to the first embodiment. 入力テキストの一例を示す図である。It is a figure which shows an example of an input text. 合成音声の一例を示す図である。It is a figure which shows an example of a synthetic | combination voice. 図２におけるテキスト置換辞書の構成の他の例を示す図である。It is a figure which shows the other example of a structure of the text replacement dictionary in FIG. 実施形態２に係るテキスト置換の処理手順を示すフローチャートである。10 is a flowchart illustrating a text replacement processing procedure according to the second embodiment. 図２におけるテキスト置換辞書の構成の更に他の例を示す図である。It is a figure which shows the further another example of a structure of the text replacement dictionary in FIG. 実施形態３に係るテキスト置換の処理手順を示すフローチャートである。14 is a flowchart illustrating a text replacement processing procedure according to the third embodiment. 合成音声の他の例を示す図である。It is a figure which shows the other example of a synthetic voice.

Explanation of symbols

３・・・形態素解析辞書、４、６、７・・・テキスト置換辞書、５・・・音声合成辞書、４１，６１，７１・・・置換前単語情報、４２・・・単語難易度、４３，６２，７３・・・置換後単語情報、７２・・・常識度、１００・・・テキスト置換装置、２００・・・テキスト音声合成装置、１０１・・・ユーザインターフェース、１０２・・・記憶部、１０３・・・ＣＰＵ、１０４・・・インターフェース回路、１０５・・・内部バス、１０６・・・ハードディスク、１０７・・・ＲＡＭ、２００・・・テキスト音声合成装置、２０１・・・スピーカ、２０２・・・サウンドボード、３００・・・ネットワーク 3 ... Morphological analysis dictionary, 4, 6, 7 ... Text substitution dictionary, 5 ... Speech synthesis dictionary, 41, 61, 71 ... Word information before substitution, 42 ... Word difficulty, 43 , 62, 73 ... Word information after substitution, 72 ... Common sense, 100 ... Text substitution device, 200 ... Text-to-speech synthesizer, 101 ... User interface, 102 ... Storage unit, DESCRIPTION OF SYMBOLS 103 ... CPU, 104 ... Interface circuit, 105 ... Internal bus, 106 ... Hard disk, 107 ... RAM, 200 ... Text-to-speech synthesizer, 201 ... Speaker, 202 ...・ Sound board, 300 ... Network

Claims

A text replacement device that replaces a word acquired from input text with a predetermined word,
A user interface to set and input vocabulary,
A storage unit for storing a word, a word equivalent to the word, and the vocabulary
A word is acquired from the input text, the acquired word is searched in the storage unit, and it is determined according to the vocabulary whether to replace the searched word with a predetermined word equivalent to the word. And a processing means for performing the text replacement.

The storage unit
For each word, along with the word, store the word difficulty index that indicates the difficulty of the word,
The processing means includes
Determining whether the word difficulty of the searched word exceeds the vocabulary;
When the word difficulty level exceeds the vocabulary, the word retrieved from the input text and searched in the storage unit is replaced with a predetermined word.
The text replacement device according to claim 1, wherein:

The storage unit
For each word, a word candidate equivalent to the word to be replaced with the word is stored for each level of vocabulary,
The processing means includes
In the storage unit, searching for a word acquired from the input text, searching for a level of vocabulary, and determining a candidate word to replace,
The text replacement device according to claim 1, wherein:

The user interface is
Set and enter the generation,
The storage unit
For each word, along with the word, remember the common sense of each word generation,
The processing means includes
Determining whether the common sense of the searched word in the input generation is below a predetermined threshold;
When the common sense of the searched word is lower than the predetermined threshold, replace the word acquired from the input text and searched in the storage unit with a predetermined word,
The text replacement device according to claim 1, wherein:

Synthesizing speech based on text in which words are replaced by the text replacement device according to any one of claims 1 to 4;
A text-to-speech synthesizer characterized by the above.

A text replacement method for replacing a word obtained from input text with a predetermined word,
Set and input vocabulary,
Storing a word, a word equivalent to the word, and the vocabulary
Obtaining a word from the input text;
Searching for the acquired word;
Determining whether or not to replace the searched word with a predetermined word equivalent to the word, according to the vocabulary ability.

A text replacement program for replacing a word acquired from input text with a predetermined word,
On the computer,
Set and input vocabulary,
Storing a word, a word equivalent to the word, and the vocabulary
Obtaining a word from the input text;
Searching for the acquired word;
And determining whether to replace the searched word with a predetermined word equivalent to the word according to the vocabulary.