JP3838193B2

JP3838193B2 - Text-to-speech device, program for the device, and recording medium

Info

Publication number: JP3838193B2
Application number: JP2002343275A
Authority: JP
Inventors: 慈明小松
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 2002-11-27
Filing date: 2002-11-27
Publication date: 2006-10-25
Anticipated expiration: 2022-11-27
Also published as: JP2004177635A

Description

【０００１】
【発明の属する技術分野】
本発明は文章読み上げ装置、文章を読み上げるためのプログラム及び同プログラムを記録した記録媒体に関するものである。
【０００２】
【従来の技術】
電子書籍などのテキストデータを基に音声信号を合成して音声出力を行う文章読み上げ装置において、読み上げ文章の読み上げ中に音量や速度を調整できるようにしたものは既に知られている。
【０００３】
例えば、特許文献１には、透明なタッチパネルがディスプレイの画面上に一体に構成された表示入力デバイスを用い、操作者が指などでタッチパネルをなぞるトレース動作を行うことで、発音速度、音量を表すパラメータを反映して音声合成を行い、それによって操作者の意図に沿った了解性の高い合成音声を得ることができるテキスト読み上げ装置が記載されている。
【０００４】
また、読み上げ中の読み上げ条件の変更は上記以外にも例えば、トーンの変更、声質の変更、男女の変更などが挙げられる。
【０００５】
【特許文献１】
特開平９−２６５２９９号公報（要約、段落（０００９）、段落（００１０）、図１，図２）
【０００６】
ところで、文章読み上げ中に読み上げの条件を変更する場合、その変更操作がリアルタイムで読み上げに反映できる場合と、変更してもそれが出力に反映するのに時間がかかる、即ち入出力に時間差が生じる場合とがある。
【０００７】
例えば、音量制御の場合は、音声出力用のアンプのゲインを調整するだけで済むため音声変換処理後に調整することができるが、速度調整を行う場合、それを出力に反映させるためにはその変更速度に基づく音声変換処理を要するから、音声変換処理以前に行わないと出力される音声に反映させることはできない。
【０００８】
このように、文章読み上げ装置における読み上げ条件の変更は、音声変換処理後に調整できるものとできないものがある。例えば、音量の変更、エフェクト（トーン（高低）、エコー、周波数等）の変更は音声変換処理後でも調整できるが、速度の変更、声質（男女（性別）、話手）の変更はその変更に基づく音声変換処理を要するから、音声変換処理前でないと出力される音声には反映されない。
【０００９】
図４は読み上げ装置において、音声変換処理と実際に音声が出力されるタイミングを示したタイミングチャートである。図示のように、音声変換処理と実際に音声が出力されるタイミングに時間的な差が生じている。これは文章読み上げ中に例えば読み上げ速度の変更を行うと、その変更がされた速度での読み上げ出力までに時間差がでるため、読み上げが不自然になることを表している。
【００１０】
この点について図５、図６を参照して更に説明する。
【００１１】
図５は、先行技術文献によるものではないが、本発明の文章読み上げ装置の前提技術となる読み上げ装置の１例を示す正面図である。文章読み上げ装置１の画面には、読上用（読み上げの開始）及び停止用のボタン３と共に速度、速度変更用及び音量変更用のスライドバー４、及びスピーカ５等が表示されている。その画面の全面には透明なタッチパネル２が配置されており、そのタッチパネル２と画面の表示内容とは対応がとられている。
【００１２】
図６は、図５に示す文章読み上げ装置１の構成を概略的に示したブロック図である。
【００１３】
文章読み上げ装置１のＣＰＵ１０には、タッチパネル２、例えば液晶ディスプレイ６のような表示手段５０と、読み上げを行う電子書籍データや、音量値、速度値等を記録したＲＡＭ３０と、音声合成用のプログラム、音声合成のための文書の解析や音声合成に使用する単語辞書データ、合成音素データ等を格納したＲＯＭ４０とが接続されている。またＣＰＵ１０は、タッチパネル２からの入力を受けその操作内容をチェックして特定するタップ部１１と、そのタップ部１１からの信号を受けそれぞれＲＡＭ３０に記録された速度値を読み出して変更し、変更した速度値を音声変換処理部１４に渡すと共に、ＲＡＭ３０の速度値を変更した速度値に更新する速度制御部１２と，同様にタップ部１１からの信号を受けそれぞれＲＡＭ３０に記録された音量値を読み出して変更し、変更した音量値でアンプ６０を制御しスピーカ５から出力すると共にＲＡＭ３０の音量値を変更された音量値で更新する音量制御部１３と、ＲＡＭ３０に記録された電子書籍データを読み出し、ＲＯＭ４０に格納された音声合成プログラムや辞書データを用いて、音声合成を行い、速度制御部１２からの出力信号に基づく音声速度でアンプ６０に出力する音声変換処理部１４と、並びに、ＲＡＭ３０に格納されている書籍データから読み上げる文章データを液晶ディスプレイ６等の表示手段５０に表示するよう制御をする表示制御部１５とからなっている。
【００１４】
ここで、音声変換処理部１４における音声合成処理手順について図７に示すフロー図に従って説明する。
【００１５】
ステップＳ１０１において、ＣＰＵ１０は、ＲＡＭ３０に格納された電子書籍データから読み上げ用に抽出された抽出文に対して逆かな漢字変換を行う。つまり、抽出文に対して読みを付与する。例えば、抽出文が“昔々、・・・・”であれば、逆かな漢字変換によって“ムカシムカシ、・・・・”が得られる。続いて、ステップＳ１０２において、逆かな漢字変換されたものに対してアクセント型を付与する（アクセント処理）。例えば、“ムカシムカシ”に対してはアクセント型として０（ゼロ）型が付与される。
【００１６】
ステップＳ１０３において、ステップＳ１０２でアクセント型が付与された後の夫々の音節の継続時間長Ｔを発声速度係数αとＲＯＭ４０に格納されているその音節の継続長Ｌとを乗算することによって算出する（Ｔ＝α×Ｌ）。
【００１７】
ステップＳ１０４において、夫々の音節の基本周波数を算出し、続いてステップＳ１０５において、夫々の音節の音量を算出する。例えば、アクセントが高くなる音節に対して、その音節の基本周波数が高くなるように基本周波数を制御する（ステップＳ１０４）とともに、音量が大きくなるように音量を制御する（ステップＳ１０５）。これらの処理はアクセントに対応して文章の読み上げに抑揚をつけるために行う処理である。
【００１８】
ステップＳ１０６において、ステップＳ１０２で抽出された抽出文（液晶ディスプレイ６の表示画面に表示されている文章の中の最後の文）の先頭の文字から液晶ディスプレイ６に表示されている最後の文字（改ページタグの直前の文字）までのステップＳ１０３で算出された夫々の継続時間長を加算することによって想定時間（頁切り換えまでに要する時間）を算出する。
【００１９】
ステップＳ１０７において、ＲＯＭ４０に記憶されている言語処理用の辞書や音声合成用の音声データ、ステップＳ１０３で算出された夫々の音節の継続時間長、ステップＳ１０４で算出された夫々の音節の基本周波数、ステップＳ１０５で算出された夫々の音量を利用して、ステップＳ１０２で抽出された抽出文の、所望の速度及び音量の音声合成データを作成して、アンプ６０を介してスピーカ５等の音声出力装置に出力する。
【００２０】
このようにして、液晶ディスプレイ６の表示画面に表示されている１又は複数の文章の中から表示画面に表示されている最後の文が抽出された場合に、その最後の文の先頭から液晶ディスプレイ６の表示画面に表示されている最後の文字までを読み上げるのに要する時間を想定し、その最後の文の読み上げが開始されてから想定された時間が経過したときに、液晶ディスプレイ６の表示画面の表示内容を切り換え、液晶ディスプレイ６の表示画面の表示内容を切り換えるタイミングをその表示内容の読み上げの終了のタイミングに合わせる制御を行っている。
【００２１】
以上で示した文章読み上げ装置１において、図５の例で下方のスライドバー４を操作して音量調整を行った場合は、そのタッチパネル２の操作からタップ部１１がその操作が音量調整であると特定して音量制御部１３に伝え、音量制御部１３は指示に従いＲＡＭ３０から音量値を読み出して変更し、変更後の音量値に基づき直接アンプ６０を制御して音量を調整し、同時にＲＡＭ３０の音量値の領域に変更した音量値を書き込む。このように、音量変更操作は音声変換処理部１４を介在させずに行うことができるから、出力にリアルタイムで反映することができる。
【００２２】
これに対し、読み上げ速度の変更の場合は、タッチパネル２の上方のスライドバー４の操作からタップ部１１がその操作が速度調整であると特定して速度制御部１２に伝え、速度制御部１２はＲＡＭ３０から読み出した速度値を変更して音声変換処理部１４に送り、同時に変更された速度をＲＡＭ３０に記録する。音声変換処理部１４は変更した信号に基づき音声変換処理を行ってアンプ６０を制御し、スピーカ５から変更した速度で音声を出力する。
【００２３】
以上の処理動作において、読み上げ速度を変更するときは既に説明したように、音声変換入力と出力に時間差があって、速度変更がリアルタイムで出力に反映されず読み上げたとき違和感が残る。そのため一旦、停止用ボタン３を操作して音声変換処理を停止させた上で変更を行うということが行われている。
【００２４】
そのため、ユーザは文章読み上げ中に読み上げ条件を変更したい場合（機能設定も含む）には、その変更を音声変換処理を止めて行うべきか否かその都度判断し、かつその判断に従って読上用又は停止用ボタン３を操作しなければならない。具体的には、ユーザにとって現在の読み上げの速度が適切ではない（例えば、聞きづらい）ために、ユーザがその読み上げの速度を変更しようと思った場合には、読み上げを停止するため停止用ボタン３を操作し、次に、速度変更用のスライドバー４を操作し、更に、読み上げを開始するための読上用ボタン３を操作する。あるいは、読み上げを行っていない状態の時に音量を変更しようとユーザが思った場合には、音量変更用のスライドバー４を操作し、更に読み上げを開始するための読上用ボタン３を操作する。このような多数の操作は煩雑で不便なため問題であった。
【００２５】
【発明が解決しようとする課題】
上述したようなユーザの操作は煩雑で不便という問題点があった。
【００２６】
本発明は、以上の問題（煩雑で不便な操作）を解決するためになされたもので、その目的は、ユーザが読み上げ条件を容易に変更できるようにすることである。
【００２７】
【課題を解決するための手段】
請求項１の発明は、文章読み上げ条件が変更可能な文章読み上げ装置において、文章読み上げ条件を変更するための変更情報を入力するための手段と、前記変更情報による文章読み上げ条件の変更が音声出力に遅延して反映されるものであるときは読み上げ停止制御して変更処理し、前記変更情報による文章読み上げ条件の変更が音声出力に遅延せずに反映されるものであるときは読み上げ停止制御しないで変更処理する文章読み上げ条件変更手段と、変更された文章読み上げ条件に基づき読み上げを制御する手段とを備えたことを特徴とする文章読み上げ装置である。即ち、文章読み上げ条件の変更が音声出力に遅延して反映されるか否かということと、読み上げ停止制御とが関連付けられている。
【００２８】
請求項２の発明は、請求項１に記載された文章読み上げ装置において、前記文章読み上げ条件変更手段は、前記変更情報による文章読み上げ条件の変更が音声出力に遅延せずに反映されるものであり、且つ文章読み上げが停止中であるときは、読み上げ開始制御して変更処理することを特徴とするものであり、文章読み上げ条件の変更が音声出力に遅延して反映されるか否かということ、及び文章読み上げ中か否かということと、読み上げ開始又は停止制御とが関連付けられている。
【００２９】
請求項３の発明は、請求項１又は２に記載された文章読み上げ装置において、前記文章読み上げ条件変更手段は、前記変更情報が読み上げ音量又はトーンの変更情報であるとき、文章読み上げ装置が読み上げ中であるか否かを判断し、読み上げ中でなければ読み上げ開始制御して変更処理することを特徴とするものであり、音量又はトーンといった特定の変更と読み上げ開始制御とが関連付けられている。
【００３０】
請求項４の発明は、請求項１〜３のいずれかに記載された文章読み上げ装置において、前記文章読み上げ条件変更手段により停止制御したとき、読み上げ文章の先頭から読み上げるよう制御することを特徴とするものであり、文章読み上げ条件の変更があれば、文章の先頭から読み上げられる。
【００３１】
請求項５の発明は、文章読み上げ条件が変更可能に文章を読み上げるためにコンピュータに、入力された変更情報による文章読み上げ条件の変更が音声出力に遅延して反映されるものであるときは読み上げ停止制御して変更処理し、前記変更情報による文章読み上げ条件の変更が音声出力に遅延せずに反映されるものであるときは読み上げ停止制御しないで変更処理する手順と、変更された文章読み上げ条件に基づき読み上げを制御する手順と、を実行させることを特徴とするプログラムである。即ち、文章読み上げ条件の変更が音声出力に遅延して反映されるか否かということと、読み上げ停止制御とが関連付けられている。
【００３２】
請求項６の発明は、請求項５に記載されたプログラムを記録したことを特徴とするコンピュータ読み取り可能な記録媒体である。この請求項６の発明によれば、請求項５と同様の作用を奏する。
【００３３】
【００３４】
【発明の実施の形態】
本発明の実施の形態について添付図面を参考に説明する。
【００３５】
図１は本発明に係る文章読み上げ装置１の実施の形態を示している。
【００３６】
この実施の形態は、図６に示した文章読み上げ装置１のＣＰＵ１０に読み上げ開始制御部１６と読み上げ停止制御部１７とを付加した構成である。
【００３７】
読み上げ開始制御部１６及び読み上げ停止制御部１７は共にタップ部１１からの信号を受けて、読み上げ開始制御部１６は文章読み上げ装置１が読み上げ中でないときに音声変換処理部１４を開始制御し、また読み上げ停止制御部１７は、文章読み上げ中に音声変換処理部１４を停止制御する。更に、ＲＡＭ３０には、文章読み上げ中か否かを示すための読み上げ中フラグのための記憶領域が設けられている。その他の構成機能は図７について説明したものと同様である。
【００３８】
次に、以上で説明した文章読み上げ装置１による読み上げ中における文章読み上げ条件の変更（設定をも含む）について、読み上げの音量と速度を例に採って説明する。
【００３９】
図２は前記文章読み上げ装置における処理の手順（第１の実施の形態）を説明するためのフロー図である。
【００４０】
文章読み上げ装置１の音量、速度を変更する場合、まず、ユーザによってタッチパネル２の一部がタップされたことを検出する（Ｓ２０１、ＹＥＳ）。タップ部１１はタッチパネル２からの信号を受けてその内容をチェックし、それが読上用のボタン３による読み上げ（再生）の開始のための操作であると判断（特定）したときは（Ｓ２０２、ＹＥＳ）、ＲＯＭ３０に記憶された文章の先頭から読み上げ（再生）を開始し（Ｓ２０３）そのまま読み上げを行う。
【００４１】
タップ部１１が読上（再生）の開始のための操作でなく（Ｓ２０２、ＮＯ）、音量変更スライドバー４による音量変更制御のための操作であると判断（特定）したときは（Ｓ２０４、ＹＥＳ）、読み上げ開始制御部１６はＲＡＭ３０に記録された読み上げ中（読み上げモード）であるか否かを示す「読み上げ中フラグ」をチェックし（Ｓ２０５）、読み上げ停止中（フラグが０）であれば（Ｓ２０６、ＹＥＳ）ＲＡＭ３０に記憶された文章の先頭から読み上げを開始し（Ｓ２０７）、また、読み上げ中であれば（Ｓ２０６、ＮＯ）そのまま読み上げ中の状態で、それぞれ音量制御部１３はＲＡＭ３０から音量値を読み出して変更した音量値で音量調節を行い（Ｓ２０８）、変更した音量値をＲＡＭ３０に書き込む（Ｓ２０９）と共にアンプ６０を介してスピーカ５から変更された音量で音声出力する。ここで、ユーザは読み上げ音量を聞きながら音量調節を音量変更用のスライドバー４によって行い、必要な設定が終了するまでステップＳ２０８、Ｓ２０９の処理を繰り返し、設定が終われば（Ｓ２１０、ＹＥＳ）処理を終了する。
【００４２】
ステップＳ２０４において、タッチパネル２へのタップ操作が音量変更のための操作でなく（Ｓ２０４、ＮＯ）、速度変更用のスライドバー４による速度変更のための操作であると、タップ部１１が特定（判断）したとき（Ｓ２１１、ＹＥＳ）、読み上げ停止制御部１７はＲＡＭ３０に記録された「読み上げ中フラグ」をチェックし（Ｓ２１２）、読み上げ中（読み上げモード）であれば（Ｓ２１３、ＹＥＳ）読み上げを停止すると共にＲＡＭ３０に記録された「読み上げ中フラグ」をリセット（例えば「１」から「０」に変更）する（Ｓ２１４）。読み上げ中（読み上げモード）でなければ（Ｓ２１３、ＮＯ）そのまま、速度制御部１２はＲＡＭ３０から速度値を読み出し、入力された速度値で速度調節を行い（Ｓ２１５）、変更した速度値を持った合成音声をアンプ６０を介してスピーカ５から出力し、かつ変更（調節）した速度値をＲＡＭ３０に書き込む（Ｓ２１６）。
【００４３】
ステップＳ２１５〜Ｓ２１６の手順は速度設定が終了するまで行われ、設定が終了すれば（Ｓ２１８、ＹＥＳ）、読み上げ位置を文章の先頭（つまり、読み上げ途中であればその文章の頭）に戻して（Ｓ２１８）、新しく設定された読み上げ速度で読み上げを開始し、同時にＲＡＭ３０に保存されている「読み上げ中フラグ」をセット（例えば「０」から「１」に変更）する（Ｓ２１９）。これによって速度制御の処理手順を終了して、読み上げる文章がまだあれば、読み上げを変更された条件で継続する。
【００４４】
なお、ステップＳ２１１において速度制御でない場合（Ｓ２１１，ＮＯ）については説明を省略するが、例えば速度変更用のスライドバー４以外の読み上げ音声の変更、男女声の変更等、一旦読み上げを停止した後に変更を実施した方がよい場合における処理手順は速度制御と同様であり、また、読み上げ中でないと適切に調節できないおそれがあるトーンの変更等は音量の変更と同様の手順で処理が実行される。
【００４５】
文章読み上げ中に読み上げ条件を変更するための操作がなされたとき、その変更又は設定を行うのに、例えば音量やトーンの変更や設定のように読み上げを止める必要のないものについては読み上げを止めず、止める必要のあるものは自動的に読み上げを止めるようにし、また、音量やトーンの変更や設定のように、読み上げ中でないと設定できない、つまり実際の音量やトーンを聞きいてみなければ設定ができないものについては、読み上げ停止中であっても自動的に読み上げを開始できるようにすることで、それによってユーザが読み上げ条件を変更する際の操作の負担軽減を図ることができる。
【００４６】
図３は、上記文章読み上げ装置１における別の処理手順（第２の実施の形態）を説明するためのフロー図である。
【００４７】
文章読み上げ装置１の音量、速度を変更する場合、まず、ユーザによってタッチパネル２の一部がタップされたことを検出し（Ｓ３０１、ＹＥＳ）、タップ部１１はそのタップの内容をチェックし、それが読上用のボタン３による読み上げ（再生）の開始のための操作であると判断したときは（Ｓ３０２、ＹＥＳ）、読み上げを開始し（Ｓ３０３）ＲＡＭ３０に記憶された文章の先頭から読み上げを行う。
【００４８】
読み上げの開始のための操作でなく（Ｓ３０２、ＮＯ）、音量変更用のスライドバー４による速度変更のための操作であると判断（特定）したときは（Ｓ３０４、ＹＥＳ）、読み上げ開始制御部１６はＲＡＭ３０に記録された「読み上げ中フラグ」をチェックし（Ｓ３０５）、その結果、読み上げ停止中であれば（Ｓ３０６、ＹＥＳ）、読み上げないと音量調節は不可能あるいは、適切にできないおそれがあるのでＲＡＭ３０に記憶された文章の先頭から読み上げを開始し（Ｓ３０７）、ＲＡＭ３０中の「読み上げ中フラグ」をセットする（Ｓ３０８）。ステップＳ３０６において、読み上げ中であれば（Ｓ３０６、ＮＯ）そのまま読み上げ中の状態で、それぞれ音量制御部１３はＲＡＭ３０から音量値を読み出して入力された音量値に変更することで音量調節を行い（Ｓ３０９）アンプ６０を介してスピーカ５から音声出力し、かつ変更（調節）後の音量値をＲＡＭ３０に書き込む（Ｓ３１０）。
【００４９】
ここで、ユーザが読み上げ音量を聞きながら音変更用のスライドバー４により音量調節を行い、必要な設定が終了するまでステップＳ３０９、Ｓ３１０の処理を繰り返し、調節（設定）が終われば（Ｓ３１１、ＹＥＳ）処理を終了する。
【００５０】
ステップＳ３０４において、タップ部１１により、タッチパネル２のタップ操作が音量変更のための操作でなく（Ｓ３０４、ＮＯ）、速度変更用のスライドバー４の操作による速度変更のためであるとタップ部１１が判断されたときは（Ｓ３１２、ＹＥＳ）、ユーザが速度調節操作を行うと（Ｓ３１３）、タッチパネル２の入力はタップ部１１で特定され、速度制御部１２はＲＡＭ３０から速度値を読み出して変更された速度を音声変換処理部１４に渡すと共に変更した速度値をＲＡＭ３０に書き込む（Ｓ３１４）。この段階で、読み上げ停止制御部１７はＲＡＭ３０に記録された読み上げ中フラグをチェックし（Ｓ３１５）、チェックの結果、「読み上げフラグ」がセット状態（例えばフラグが１）で読み上げ中（読み上げモード）であると判断されたときは（Ｓ３１６，ＹＥＳ）、読み上げを一旦停止し（Ｓ３１７）「読み上げフラグ」をリセットする。その後読み上げを開始し、変更された速度の合成音声をアンプ６０を介してスピーカ５から出力すると共に「読み上げフラグ」をセットする（Ｓ３１８）。以下ステップＳ３１３〜Ｓ３１８の処理を設定が終了するまで実行し、設定が終われば（Ｓ３１９、ＹＥＳ）処理を終了して、読み上げる文章がまだあれば、読み上げを変更された条件で継続する。
【００５１】
ステップＳ３１６において、読み上げ中でなければ（Ｓ３１６、ＮＯ）、設定終了後（Ｓ３１９、ＹＥＳ）に処理を終了する。
【００５２】
なお、ステップ３１２において速度変更用のスライドバー４でないとき（Ｓ３１２、ＮＯ）の処理は第１実施の形態と同様である。即ち、例えば速度変更用のスライドバー４以外の読み上げ音声の変更について、男女、相手などの声質の変更等、一旦読み上げを停止した後に変更を実施した方がよい場合における処理手順は速度制御と同様であり、また、読み上げ中でないと調節できないトーン、エコー、周波数などのエフェクトの変更等は音量の変更と同様の手順で処理が実行される。
【００５３】
上述した実施の形態では、いずれも合成音声を発生するためのスピーカ５を備える１つの装置において、図２、図３に示す処理全て行っているが、各処理や処理の一部を別々の装置で処理して、最終的にスピーカ５から変更された条件に沿った合成音声を生じさせ、文章の読み上げを行うようにしても良い。例えば、第１のコンピュータはユーザーの入力を受けるのみで、その他の実質的な処理は別の第２のコンピュータが行う。更に、装置は文章読み上げのための専用の装置に限らず、読み上げ以外の機能を有するＰＤＡ、パソコン、携帯電話、カーナビゲーションの端末、ＴＶ等であっても良い。
【００５４】
尚、読み上げられる文章は書籍に限らず手紙（電子メールを含む）、道案内、宣伝並びに歌詞などであっても良い。また、ＲＡＭ３０に記憶されたデータは装置の電源が落されると消失するが、装置の電源が落されても継続して記憶されても良い。
【００５５】
上述した実施の形態では、いずれも読み上げ条件の変更に伴って、読み上げの自動的な開始と停止との両方を行うが、自動的な開始か、自動的な停止かの一方を行う構成であっても良い。また、読み上げの開始位置は、文章の先頭としているが、文章の途中でも良い。更に、条件変更が完了するまでの間に読み上げられる対象は、その条件変更時に用いられる専用の文章であっても良い。
【００５６】
以上で説明した処理は、該処理の手順を記述したプログラムにより文章読み取り装置１のＣＰＵ１０で実行させることができる。また、本プログラムは、ＦＤ（フレキシブルディスク）、ＣＤ-ＲＯＭ、ＭＯ、ＤＶＤ-ＲＯＭ等のプログラムを記録する周知の記録媒体に記録されて提供される他、インターネット等のネットワーク網を介して提供することができる。
【００５７】
【発明の効果】
本願の請求項１に記載の発明によれば、文章読み上げ中に読み上げ条件の変更を行う場合、ユーザは読み上げ開始又は停止のための操作を従来のように行うことなく、自動で読み上げを止める必要のないものについては読み上げを止めず、あるいは止める必要のあるものは読み上げを止めるようにすることができ、文章読み上げ条件の変更が読み上げ中に行うべきものであるときは、装置が読み上げ中でないときは自動で読み上げ開始を行うことができる。そのため、従来のようにユーザがタッチパネル等の操作を行う煩雑さがなく、読み上げ条件の変更を容易に行うことができる。また、ユーザは読み上げ速度等の変更情報の入力から出力までに時間差のある読み上げ条件変更を行っても、その時間差を意識することなく合成音声による読み上げを自然に聞くことができる。
【００５８】
本願の請求項２に記載の発明によれば、請求項１に記載の発明の効果を奏し、文章を読み上げ中か否かということと、読み上げ開始又は停止制御とが関連付けられており、良好な読み上げが可能である。
【００５９】
本願の請求項３に記載の発明によれば、請求項１又は２に記載の発明の効果を奏し、音量又はトーンといった特定の変更と読み上げ開始制御とが関連付けられており、迅速な読み上げが可能である。
【００６０】
本願の請求項４に記載の発明によれば、請求項１〜３のいずれかに記載の発明の効果を奏し、文章読み上げ条件の変更があれば、文章が頭書から読み上げられるため、良好な読み上げが可能である。
【００６１】
本願の請求項５に記載の発明によれば、従来のようにユーザがタッチパネル等の操作を行う煩雑さがなく、読み上げ条件の変更を容易に行うことができる。また、読み上げ速度等の変更情報の入力から出力までに時間差のある読み上げ条件変更を行っても、その時間差を意識することなく合成音声による読み上げを自然に聞くことができる。
【００６２】
本願の請求項６に記載の発明によれば、請求項５に記載の発明と同様の効果を奏し、そのプログラムを携帯端末その他の情報機器のコンピュータに読み取らせることにより、任意の情報機器において上記効果を実現することができる。
【００６３】
【図面の簡単な説明】
【図１】本発明の文章読み上げ装置の第１実施の形態に係る構成を示したブロック図である。
【図２】読み取り条件変更手順を説明するためのフロー図である。
【図３】他の読み取り条件変更手順を説明するためのフロー図である。
【図４】音声変換処理と音声出力の時間差を示すタイムチャートである。
【図５】従来の文章読み上げ装置の１例を示す正面図である。
【図６】図５に示す文章読み上げ装置の概略構成を示すブロック図である。
【図７】図５に示す文章読み上げ装置の音声変換処理を説明するフロー図である。
【符号の説明】
１・・・文章読み上げ装置、２・・・タッチパネル、３・・・読み上げ及び停止用のボタン、４・・・速度及び音量変更用のスライドバー、５・・・スピーカ、６・・・液晶ディスプレイ、１０・・・ＣＰＵ、３０・・・ＲＡＭ、４０・・・ＲＯＭ、５０・・・表示手段、６０・・・アンプ。[0001]
BACKGROUND OF THE INVENTION
  The present invention relates to a text reading device, a program for reading a text, and a recording medium on which the program is recorded.
[0002]
[Prior art]
  We have already known a text-to-speech device that synthesizes speech signals based on text data such as e-books, and that can adjust the volume and speed while reading aloud text.Ising.
[0003]
  For example, in Patent Document 1, a display input device in which a transparent touch panel is integrally formed on a display screen is used, and the operator performs a tracing operation of tracing the touch panel with a finger or the like, thereby expressing the sound production speed and volume. A text-to-speech device is described that can synthesize speech by reflecting parameters and thereby obtain a highly understandable synthesized speech in accordance with the operator's intention.
[0004]
  In addition to the above, changes in reading conditions during reading include, for example, tone changes, voice quality changes, and gender changes.
[0005]
[Patent Document 1]
  JP-A-9-265299 (abstract, paragraph (0009), paragraph (0010), FIGS. 1 and 2)
[0006]
  By the way, when changing the reading condition during reading aloud, it takes time to reflect the change operation on the reading in real time and even if it is changed, that is, there is a time difference between input and output. There are cases.
[0007]
  For example, in the case of volume control, it is only necessary to adjust the gain of the amplifier for audio output, so it can be adjusted after the audio conversion process, but when speed adjustment is performed, the change is made to reflect it in the output Since voice conversion processing based on speed is required, it cannot be reflected in the output voice unless it is performed before the voice conversion processing.
[0008]
  As described above, there are some changes in the reading conditions in the text-to-speech device that can be adjusted after the speech conversion process. For example, changes in volume and effects (tone (high / low), echo, frequency, etc.) can be adjusted even after the voice conversion process, but changes in speed, voice quality (gender (sex), speaker) Since the voice conversion processing based on this is required, it is not reflected in the output voice unless it is before the voice conversion processing.
[0009]
  FIG. 4 is a timing chart showing the voice conversion process and the actual voice output timing in the reading apparatus. As shown in the figure, there is a time difference between the voice conversion process and the actual voice output timing. This means that if, for example, the reading speed is changed during the text reading, a time difference occurs until the reading is output at the changed speed, so that the reading becomes unnatural.
[0010]
  This point will be further described with reference to FIGS.
[0011]
  FIG. 5 is a front view showing an example of a reading device which is not based on the prior art document but is a prerequisite technology of the text reading device of the present invention. On the screen of the text-to-speech reading apparatus 1, a slide bar 4 for speed, speed change and volume change, a speaker 5 and the like are displayed together with a button 3 for reading (start of reading) and a stop. A transparent touch panel 2 is disposed on the entire surface of the screen, and the touch panel 2 and the display content of the screen are associated with each other.
[0012]
  FIG. 6 is a block diagram schematically showing the configuration of the text-to-speech device 1 shown in FIG.
[0013]
  The CPU 10 of the text-to-speech reading apparatus 1 includes a display means 50 such as a touch panel 2, for example, a liquid crystal display 6, an electronic book data to be read out, a RAM 30 in which volume values, speed values, and the like are recorded, a speech synthesis program, A ROM 40 storing word dictionary data, synthesized phoneme data, and the like used for document analysis and speech synthesis for speech synthesis is connected. Further, the CPU 10 receives an input from the touch panel 2 and checks and identifies the operation content, and receives a signal from the tap unit 11 and reads and changes the speed value recorded in the RAM 30. The speed value is transferred to the voice conversion processing unit 14, and the speed control unit 12 that updates the speed value of the RAM 30 to the changed speed value, and similarly receives the signal from the tap unit 11 and reads the volume value recorded in the RAM 30. The volume control unit 13 that controls the amplifier 60 with the changed volume value and outputs it from the speaker 5 and updates the volume value of the RAM 30 with the changed volume value, and reads the electronic book data recorded in the RAM 30, Speech synthesis is performed using a speech synthesis program and dictionary data stored in the ROM 40, and an output signal from the speed control unit 12 The voice conversion processing unit 14 that outputs to the amplifier 60 at a voice speed based on it, and the display control unit 15 that controls to display the text data read from the book data stored in the RAM 30 on the display means 50 such as the liquid crystal display 6. It is made up of.
[0014]
  Here, the speech synthesis processing procedure in the speech conversion processing unit 14 will be described with reference to the flowchart shown in FIG.
[0015]
  In step S <b> 101, the CPU 10 performs reverse kanji conversion on the extracted sentence extracted for reading from the electronic book data stored in the RAM 30. That is, reading is given to the extracted sentence. For example, if the extracted sentence is “Long Time,...”, “Kumashi Mashi,...” Can be obtained by reverse kanji conversion. Subsequently, in step S102, an accent type is assigned to the reversely kana-kanji converted character (accent processing). For example, “0” (zero) type is given as an accent type for “Mukashimakashi”.
[0016]
  In step S103, the duration time T of each syllable after the accent type is given in step S102 is calculated by multiplying the utterance speed coefficient α by the syllable duration length L stored in the ROM 40 ( T = α × L).
[0017]
  In step S104, the fundamental frequency of each syllable is calculated, and then the step.TheIn S105, the volume of each syllable is calculated. For example, for a syllable with higher accent, the fundamental frequency is controlled so that the fundamental frequency of the syllable becomes higher (step S104), and the volume is controlled so that the volume becomes larger (step S105). These processes are used to accentuate text reading in response to accents.DoIt is processing.
[0018]
  In step S106, the last character (modified) displayed on the liquid crystal display 6 from the first character of the extracted sentence extracted in step S102 (the last sentence in the sentence displayed on the display screen of the liquid crystal display 6). The expected time (time required for page switching) is calculated by adding the respective duration times calculated in step S103 (characters immediately before the page tag).
[0019]
  In step S107, the language processing dictionary and speech synthesis speech data stored in the ROM 40, the duration of each syllable calculated in step S103, the fundamental frequency of each syllable calculated in step S104, Using each volume calculated in step S105, voice synthesis data of a desired speed and volume of the extracted sentence extracted in step S102 is created, and a voice output device such as the speaker 5 is provided via the amplifier 60. Output to.
[0020]
  In this way, when the last sentence displayed on the display screen is extracted from one or a plurality of sentences displayed on the display screen of the liquid crystal display 6, the liquid crystal display is displayed from the beginning of the last sentence. Assuming the time required to read up to the last character displayed on the display screen 6, the display screen of the liquid crystal display 6 is displayed when the estimated time has elapsed since the start of reading the last sentence. The display content is switched, and the timing for switching the display content on the display screen of the liquid crystal display 6 is controlled to match the timing when the display content is read out.
[0021]
  In the text-to-speech reading device 1 shown above, when the volume adjustment is performed by operating the lower slide bar 4 in the example of FIG. 5, the operation of the touch panel 2 causes the tap unit 11 to adjust the volume. According to the instruction, the volume control unit 13 reads out and changes the volume value from the RAM 30 and adjusts the volume by directly controlling the amplifier 60 based on the changed volume value. Write the changed volume value in the value area. As described above, the sound volume changing operation can be performed without the voice conversion processing unit 14, and can be reflected in the output in real time.
[0022]
  On the other hand, in the case of changing the reading speed, the tap unit 11 specifies that the operation is speed adjustment from the operation of the slide bar 4 above the touch panel 2 and notifies the speed control unit 12, and the speed control unit 12 The speed value read from the RAM 30 is changed and sent to the voice conversion processing unit 14, and at the same time, the changed speed is recorded in the RAM 30. The voice conversion processing unit 14 performs voice conversion processing based on the changed signal, controls the amplifier 60, and outputs the voice from the speaker 5 at the changed speed.
[0023]
  In the above processing operation, when the reading speed is changed, as described above, there is a time difference between the voice conversion input and the output, and when the speed change is read out without being reflected in the output in real time, a sense of incongruity remains. Therefore, once the stop button 3 is operated to stop the voice conversion process, the change is performed.
[0024]
  Therefore, when the user wants to change the reading conditions during the text reading (including function setting), the user determines whether or not the change should be made by stopping the voice conversion process, and for the reading or according to the determination. The stop button 3 must be operated. Specifically, youTo theThe current reading speed is not appropriate (for example, it is difficult to hear).TheWhen it is desired to change the reading speed, the stop button 3 is operated to stop reading, then the speed change slide bar 4 is operated, and further, reading is started to start reading. The up button 3 is operated. Alternatively, when the user wants to change the volume when reading is not performed, the user operates the slide bar 4 for changing the volume and further operates the reading button 3 for starting reading. Many such operations are problematic because they are cumbersome and inconvenient.
[0025]
[Problems to be solved by the invention]
The above-described user operations are complicated and inconvenient.
[0026]
  The present invention solves the above problems (complex and inconvenient operations).InThe purpose is to allow the user to easily change the reading conditions.
[0027]
[Means for Solving the Problems]
  The invention of claim 1 is a sentence reading apparatus capable of changing a sentence reading condition, and means for inputting change information for changing the sentence reading condition;When the change in the text-to-speech reading condition due to the change information is reflected in the voice output with a delay, the change processing is performed by controlling the reading stop so that the change in the text-to-speech reading condition based on the change information is not delayed in the voice output. Change the text-to-speech condition to be changed without controlling the reading stop when it is reflectedA sentence reading apparatus comprising: means; and means for controlling reading based on the changed sentence reading condition. That is,Whether changes in text-to-speech reading conditions are delayed and reflected in the audio outputRead aloudStop systemAre associated with each other.
[0028]
  The invention of claim 2 is a text reading apparatus according to claim 1,The text-to-speech condition changing means reflects the change of the text-to-speech reading condition based on the change information without delay in the voice output, and when the text-to-speech reading is stopped, the reading process is controlled to perform a change process. It is characterized byTheChanges in text-to-speech conditions are delayed to voice output Whether or notWhether the text is being read out or not, and start or stop readingcontrolAnd associateEtIt is.
[0029]
  The invention of claim 3 is the text-to-speech device according to claim 1 or 2,The sentence reading condition changing means is:Change informationButRead aloudWhen it is volume or tone change information, it is determined whether or not the text-to-speech device is reading aloud, and if it is not being read out, it is controlled to start reading, and the volume or tone is changed. Specific changes such asStart readingcontrolAnd relatedWithTheEtIt is.
[0030]
  The invention of claim 4 is claimed in claimAny one of 1-3In the text-to-speech device described in 1.When stop control is performed by the text-to-speech condition changing means, control is performed so that the text is read from the head of the text to be read. If there is a change in the text-to-speech condition, the text is read from the head.
[0031]
  The invention of claim 5In order to read the text so that the text-to-speech reading condition can be changed, if the change in the text-to-speech condition due to the input change information is delayed and reflected in the voice output, it is controlled to stop reading and change processing, When the change of the text-to-speech reading condition by the change information is reflected without delay in the voice output, a procedure for performing the change processing without controlling the reading stop, and a procedure for controlling the reading based on the changed text-to-speech condition , Is a program characterized by being executed. That is, whether or not the change in the text-to-speech condition is delayed and reflected in the voice output is associated with the reading-out stop control.
[0032]
  The invention of claim 6A computer-readable recording medium on which the program according to claim 5 is recorded. According to the sixth aspect of the invention, the same effect as that of the fifth aspect is obtained.
[0033]
[0034]
DETAILED DESCRIPTION OF THE INVENTION
  Embodiments of the present invention will be described with reference to the accompanying drawings.
[0035]
  FIG. 1 shows an embodiment of a text-to-speech device 1 according to the present invention.
[0036]
  In this embodiment, a reading start control unit 16 and a reading stop control unit 17 are added to the CPU 10 of the text reading device 1 shown in FIG.
[0037]
  Both the reading start control unit 16 and the reading stop control unit 17 receive a signal from the tap unit 11, and the reading start control unit 16 starts and controls the speech conversion processing unit 14 when the text reading device 1 is not reading, The reading stop control unit 17 controls to stop the voice conversion processing unit 14 while reading a sentence. Further, the RAM 30 is provided with a storage area for a reading flag to indicate whether or not the text is being read out. Other structural functions are the same as those described with reference to FIG.
[0038]
  Next, the change (including the setting) of the text reading condition during reading by the text reading apparatus 1 described above will be described taking the reading volume and speed as an example.
[0039]
  FIG. 2 is a flowchart for explaining a processing procedure (first embodiment) in the text-to-speech reading apparatus.
[0040]
  When changing the volume and speed of the text-to-speech reading device 1, first, it is detected that a part of the touch panel 2 has been tapped by the user (S201, YES). When the tap unit 11 receives a signal from the touch panel 2 and checks its contents, and determines (specifies) that the operation is for starting reading (reproduction) by the reading button 3 (S202, (YES), reading (reproduction) is started from the head of the sentence stored in the ROM 30 (S203) and reading is performed as it is.
[0041]
  When it is determined (specified) that the tap unit 11 is not an operation for starting reading (reproduction) (S202, NO) but an operation for volume change control by the volume change slide bar 4 (S204, YES) The reading start control unit 16 checks the “reading flag” indicating whether or not reading is being performed (reading mode) recorded in the RAM 30 (S205), and if reading is stopped (flag is 0) ( (S206, YES) The reading starts from the beginning of the sentence stored in the RAM 30 (S207). If the reading is in progress (S206, NO), the volume control unit 13 reads the volume value from the RAM 30 in the reading state. The volume is adjusted with the changed volume value (S208), and the changed volume value is written in the RAM 30 (S209) and the amplifier 6 Voice output at the volume is changed from the speaker 5 via a. Here, the user adjusts the volume while listening to the reading volume by using the slide bar 4 for changing the volume, and repeats the processes of steps S208 and S209 until the necessary setting is completed, and if the setting is completed (S210, YES), the process is performed. finish.
[0042]
  In step S204, the tap unit 11 specifies (determines) that the tap operation on the touch panel 2 is not an operation for changing the volume (S204, NO) but an operation for changing the speed by the speed change slide bar 4. ) (S211, YES), the reading stop control unit 17 checks the “reading flag” recorded in the RAM 30 (S212), and if reading is in progress (reading mode) (S213, YES), stops reading. At the same time, the “reading flag” recorded in the RAM 30 is reset (for example, changed from “1” to “0”) (S214). If it is not being read out (reading mode) (S213, NO), the speed controller 12 reads the speed value from the RAM 30, adjusts the speed with the input speed value (S215), and combines with the changed speed value. Audio is output from the speaker 5 via the amplifier 60, and the changed (adjusted) speed value is written in the RAM 30 (S216).
[0043]
  Steps S215 to S216 are performed until the speed setting is completed, and when the setting is completed (S218, YES), the reading position is returned to the head of the sentence (that is, the head of the sentence if reading is in progress) ( In step S218, reading is started at the newly set reading speed, and at the same time, the “reading flag” stored in the RAM 30 is set (for example, changed from “0” to “1”) (S219). Thus, the speed control processing procedure is ended, and if there is a sentence to be read out, the reading is continued under the changed condition.
[0044]
  In addition, although explanation is omitted about the case where it is not speed control in Step S211 (S211, NO), for example, change of reading voice other than the slide bar 4 for speed change, change of male and female voices, etc. are changed after reading is stopped once. The processing procedure in the case where it is better to perform is similar to the speed control, and the tone change or the like that may not be adjusted properly unless it is being read out is executed in the same procedure as the volume change.
[0045]
  When an operation to change the reading condition is made during reading aloud, it is necessary to change or set the reading condition. If there is a need to stop it, it will automatically stop reading, and it can only be set during reading, like changing or setting the volume or tone. For those that cannot be read out, even when reading is stopped, reading can be started automatically, thereby reducing the burden on the user when changing the reading conditions.
[0046]
  FIG. 3 is a flowchart for explaining another processing procedure (second embodiment) in the text-to-speech reading apparatus 1.
[0047]
  When changing the volume and speed of the text-to-speech reading device 1, first, it is detected that a part of the touch panel 2 has been tapped by the user (S301, YES), and the tap unit 11 checks the content of the tap. When it is determined that the operation is to start reading (reproduction) by the button 3 for reading (S302, YES), reading is started (S303) and reading is performed from the head of the sentence stored in the RAM 30.
[0048]
  When it is determined (specified) that the operation is not the operation for starting the reading (S302, NO) but the speed changing by the slide bar 4 for changing the volume (S304, YES), the reading start control unit 16 Checks the “reading flag” recorded in the RAM 30 (S305), and as a result, if reading is stopped (S306, YES), there is a possibility that the volume cannot be adjusted or cannot be appropriately adjusted without reading.ThereTherefore, reading is started from the head of the text stored in the RAM 30 (S307), and the “reading flag” in the RAM 30 is set (S308). In step S306, if it is being read out (S306, NO), the volume control unit 13 performs volume adjustment by reading the volume value from the RAM 30 and changing it to the input volume value in the state of being read out as it is (S309). The sound is output from the speaker 5 through the amplifier 60, and the changed (adjusted) volume value is written in the RAM 30 (S310).
[0049]
  Here, the user adjusts the volume with the sound-changing slide bar 4 while listening to the reading volume, repeats the processing of steps S309 and S310 until the necessary setting is completed, and if the adjustment (setting) is completed (S311, YES) ) End the process.
[0050]
  In step S304, the tap unit 11 determines that the tap operation on the touch panel 2 is not an operation for changing the volume (NO in S304) but is for changing the speed by operating the slide bar 4 for changing the speed. When the determination is made (YES at S312), when the user performs a speed adjustment operation (S313), the input of the touch panel 2 is specified by the tap unit 11, and the speed control unit 12 is read and changed from the RAM 30. The speed is transferred to the voice conversion processing unit 14 and the changed speed value is written in the RAM 30 (S314). At this stage, the reading stop control unit 17 checks the reading flag recorded in the RAM 30 (S315). As a result of the check, the “reading flag” is being read in the set state (for example, the flag is 1) (reading mode). When it is determined that there is (S316, YES), reading is temporarily stopped (S317) and the “reading flag” is reset. Thereafter, reading is started, and the synthesized voice of the changed speed is output from the speaker 5 through the amplifier 60 and the “reading flag” is set (S318). Thereafter, the processing of steps S313 to S318 is executed until the setting is completed, and if the setting is completed (S319, YES), the processing is terminated, and if there is a sentence to be read out, the reading is continued under the changed condition.
[0051]
  In step S316, if reading is not in progress (S316, NO), the process is terminated after the setting is completed (S319, YES).
[0052]
  Note that the processing in step 312 when it is not the speed-changing slide bar 4 (S312: NO) is the same as in the first embodiment. That is, for example, regarding the change of the reading voice other than the slide bar 4 for changing the speed, the processing procedure in the case where it is better to stop the reading once, such as the change of the voice quality of men and women, the other party, etc. is the same as the speed control. In addition, changes in effects such as tone, echo, frequency, etc. that cannot be adjusted without reading are performed in the same procedure as the change in volume.
[0053]
  In the above-described embodiments, all the processes shown in FIGS. 2 and 3 are performed in one apparatus including the speaker 5 for generating synthesized speech. However, each process and a part of the processes are performed separately. Then, the synthesized speech may be finally generated from the speaker 5 in accordance with the changed condition, and the text may be read out. For example, the first computer only receives user input, and other substantial processing is performed by another second computer. Furthermore, the device is not limited to a dedicated device for reading a text, but may be a PDA, a personal computer, a mobile phone, a car navigation terminal, a TV, or the like having a function other than reading.
[0054]
  The text to be read out is not limited to books, but may be letters (including e-mails), directions, advertisements, and lyrics. The data stored in the RAM 30 is lost when the apparatus is turned off, but may be stored continuously even when the apparatus is turned off.
[0055]
  In the above-described embodiments, both of the automatic start and stop of the reading are performed in accordance with the change of the reading conditions. However, either of the automatic starting or the automatic stopping is performed. May be. The starting position of reading is at the beginning of the sentence, but it may be in the middle of the sentence. Furthermore, the object read out until the condition change is completed may be a dedicated sentence used when the condition is changed.
[0056]
  The processing described above can be executed by the CPU 10 of the text reading device 1 by a program describing the procedure of the processing. In addition, this program is FD (flexible disk), CD-ROM, MO, DVD-In addition to being provided by being recorded on a known recording medium for recording a program such as a ROM, the program can be provided via a network such as the Internet.
[0057]
【The invention's effect】
  According to the invention described in claim 1 of the present application, when changing the reading condition while reading a sentence, the user needs to automatically stop reading without performing the operation for starting or stopping the reading as usual. If you do not want to stop reading, you can stop reading what you need to stop, and if the change in text-to-speech conditions should be done while reading, when the device is not reading Can automatically start reading. Therefore, the user does not have the trouble of operating the touch panel and the like, and the reading conditions can be easily changed.Further, even when the user changes the reading condition with a time difference from the input to the output of the change information such as the reading speed, the user can naturally hear the reading by the synthesized voice without being aware of the time difference.
[0058]
  According to the invention described in claim 2 of the present application, the effect of the invention described in claim 1 is achieved, whether or not the text is being read out and whether or not reading is started or stopped.controlCan be read out satisfactorily.
[0059]
  According to invention of Claim 3 of this application, there exists an effect of the invention of Claim 1 or 2,A specific change such as a volume or a tone is associated with a reading start control, so that quick reading can be performed.
[0060]
  According to the invention of claim 4 of the present application,Any one of 1-3The effects of the invention described inIf the text-to-speech conditions change, the text will be read from the headline, so it ’s goodCan be read aloud.
[0061]
  According to the invention described in claim 5 of the present application,There is no need for the user to operate the touch panel or the like as in the prior art, and the reading conditions can be easily changed. Also, reading speed Even if the reading condition is changed with a time difference from the input to the output of the change information such as the change information, it is possible to naturally hear the reading by the synthesized speech without being aware of the time difference.
[0062]
  According to the invention described in claim 6 of the present application,The same effect as that of the invention described in claim 5 can be obtained, and the above effect can be realized in any information device by causing the computer of the portable device or other information device to read the program.
[0063]
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration according to a first embodiment of a text-to-speech device of the present invention.
FIG. 2 is a flowchart for explaining a reading condition changing procedure;
FIG. 3 is a flowchart for explaining another reading condition changing procedure;
FIG. 4 is a time chart showing a time difference between audio conversion processing and audio output.
FIG. 5 is a front view showing an example of a conventional text-to-speech device.
6 is a block diagram showing a schematic configuration of the text-to-speech device shown in FIG. 5. FIG.
FIG. 7 is a flowchart for explaining speech conversion processing of the text-to-speech device shown in FIG. 5;
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 ... Text-to-speech device, 2 ... Touch panel, 3 ... Button for reading and stopping, 4 ... Slide bar for speed and volume change, 5 ... Speaker, 6 ... Liquid crystal display DESCRIPTION OF SYMBOLS 10 ... CPU, 30 ... RAM, 40 ... ROM, 50 ... Display means, 60 ... Amplifier.

Claims

In a text-to-speech device that can change text-to-speech conditions,
Means for inputting change information for changing the text-to-speech condition;
When the change in the text-to-speech reading condition due to the change information is reflected in the voice output with a delay, the change processing is performed by controlling the reading stop so that the change in the text-to-speech reading condition based on the change information is not delayed in the voice output. A text-to-speech reading condition changing means for performing a change process without reading stop control when it is reflected ,
Means for controlling reading based on the changed text reading conditions;
A text-to-speech device characterized by comprising:

In the text-to-speech device according to claim 1,
The text-to-speech condition changing means reflects the change of the text-to-speech reading condition based on the change information without delay in the voice output, and when the text-to-speech reading is stopped, the reading process is controlled to perform a change process. A text-to-speech device characterized by:

In the text-to-speech device according to claim 1 or 2,
The text-to-speech reading condition changing means determines whether or not the text-to-speech device is reading aloud when the change information is a reading volume or tone change information. A text-to-speech device characterized by:

In the text-to-speech device according to any one of claims 1 to 3 ,
A text-to-speech device that controls to read out from the head of the text to be read when the text-to-speech condition changing means controls the stop .

In order to read out the text, the text-to-speech conditions can be changed.
If the change in the text-to-speech reading condition due to the input change information is reflected in the voice output with a delay, it is controlled to stop reading, and the change in the text-to-speech reading condition based on the change information is delayed in the voice output. If it is reflected without change, the procedure for change processing without reading stop control ,
A procedure to control reading based on the changed text-to-speech conditions;
A program characterized by having executed .

A computer-readable recording medium having recorded thereon the program according to claim 5.