JP3433532B2

JP3433532B2 - Electronic book device

Info

Publication number: JP3433532B2
Application number: JP26150794A
Authority: JP
Inventors: 恒夫古木
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1994-09-30
Filing date: 1994-09-30
Publication date: 2003-08-04
Anticipated expiration: 2018-08-04
Also published as: JPH08101697A

Description

【発明の詳細な説明】【０００１】【産業上の利用分野】この発明は、例えばＣＤ−ＲＯＭ
などの情報記録媒体に格納されたテキスト情報を液晶な
どのディスプレイに表示する電子ブック装置にかかり、
更に具体的には、その音声情報再生の改良に関する。【０００２】【背景技術と発明が解決しようとする課題】辞典や小説
などの各種書籍の内容（以下単に「テキスト」という）
をディスプレイに表示するとともに、その朗読を行う電
子ブック装置としては、例えば特開平６−２０２６８６
号公報に開示されたものがある。これによれば、既存の
テキストは、元の活字のまま表示され、その朗読音を聴
くことができるようになっている。また、電子ペンから
の自由な手書きを可能にしつつ、その保存が可能となっ
ている。【０００３】ところで、ＣＤ−ＲＯＭなどの記録媒体に
は、文字や画像などのデータの他に音声データが記録さ
れている場合がある。例えば、鳥の泣き声，外国語の発
音，波の音のＢＧＭなどである。これらの音を、テキス
トの朗読音とともに再生する場合には、両者が良好に聴
取できるような工夫が必要となる。【０００４】また、テキストの朗読をすることなくディ
スプレイ上のテキストを黙読するような場合でも、使用
者の注意を喚起したり、読書中の集中力をの低下を補う
ような機能があると、好都合である。【０００５】この発明は、以上の点に着目したもので、
各種の音声を効率的に再生して良好な朗読を行うととも
に、使用者の注意を喚起したり、読書中の集中力の低下
を補うことができる使い勝手のよい便利な電子ブック装
置を提供することを、その目的とするものである。【０００６】【課題を解決するための手段】前記目的を達成するた
め、この発明では、文字データと前記文字データに付随
した音声データとが夫々異なるフレーム単位でデータ識
別フラグと共に格納されている情報記録媒体から前記文
字データ及び音声データを再生する電子ブック装置であ
り、前記文字データ及び音声データと共に前記情報記録
媒体から読み出された前記データ識別フラグに基づき、
前記音声データと前記文字データとを識別する識別手段
と、前記識別手段により識別された前記文字データを表
示するための表示手段と、前記識別手段により識別され
た前記音声データから付加音声を得るための音声再生手
段と、前記識別手段により識別された前記文字データを
音声変換して変換音声を得るための音声変換手段とを備
え、前記表示手段による表示文字に対応して、前記付加
音声と前記変換音声との何れか一方を選択的に出力する
第１のモードと、前記表示手段による表示文字に対応し
て、前記付加音声と前記変換音声とを加算して出力する
第２のモードと、前記表示手段による表示文字に対応し
て、前記付加音声と前記変換音声との何れかの音声も出
力しない第３のモードとを有し、前記第３のモードにお
ける前記表示手段による文字表示後、ページ送りが行わ
れることなく一定時間が経過した場合に、前記第１又は
第２のモードへと移行させるようにしたことを特徴とす
る。【０００７】この発明の前記及び他の目的，特徴，利点
は、次の詳細な説明及び添付図面から明瞭になろう。【０００８】【好ましい実施例の説明】この発明には数多くの実施例
が有り得るが、ここでは適切な数の実施例を示し、詳細
に説明する。＜実施例の構成＞最初に、図１を参照して実施例の主要
部の構成を説明する。同図において、ＣＰＵ１０には、電子ブック装置の使用者が動作指示を行うための操作
部１２，ＣＤ−ＲＯＭカートリッジ１３を駆動するためのＲＯ
Ｍドライバ１４，圧縮情報を伸長するための伸長部１６，文字表示や音声変換のためのデータを蓄積する辞書・
素片データ部１８，表示処理を行うためのディスプレイプロセッサ２０，音声合成のためのテキスト解析と合成パラメータを得
るためのテキスト解析・合成パラメータ生成部２２，音声合成を行うための音声合成部２４，が接続されている。【０００９】また、伸長部１６にはＤ／Ａ変換器２６が
接続されており、音声合成部２４及びＤ／Ａ変換器２６
の出力側は加算器２８の入力側に接続されている。この
加算器２８の出力側が、図示しないスピーカなどの音声
出力手段に接続されている。他方、ディスプレイプロセ
ッサ２０にはディスプレイ３０が接続されている。【００１０】次に、ＲＯＭドライバ１４にセットされる
ＲＯＭカートリッジ１３に格納されたデータのフォーマ
ットについて説明する。基本的なフォーマットは、よく
知られているＣＤ−ＲＯＭの規格に沿っているが、その
フレームに含まれる主情報が文字情報のみである場合
は、ＳＹＮＣ（同期信号），ヘッダ（アドレス情報）に
続くサブヘッダ（コーディング情報）が例えば論理値の
「0」に設定される（同図（A）参照）。例えば、小説な
どの場合は通常テキストは文字のみであり、各フレーム
の主情報は文字情報のみである。この場合、主情報はコ
ードデータのみとなるので、２３１２バイトの固定長と
なっている。【００１１】主情報に音声データが付随する場合は、サ
ブヘッダが例えば論理値の「1」に設定される（同図
（B）参照）。例えば、鳥の泣き声などのデータの場合
は、その泣き声の音声データが付随している。この場
合、泣き声データは圧縮処理されるため、主情報は可変
長となっている。図示の例では、２２９２バイトとなっ
ている。なお、ＲＯＭカートリッジ１３全体のデータ構
造は、同図（C）に示すようになっている。【００１２】後述するように、サブヘッダが「0」の場
合は文字データのみであるから、主情報は後述するよう
にテキスト音声変換される。他方、フサブヘッダが
「1」の場合は、付属のフレーム中の主情報の圧縮を解
除しながら音声信号が取り出される。【００１３】このように、圧縮音声データを文字データ
と区分してＲＯＭカートリッジ１３に格納することで、
ＲＯＭカートリッジ１３，つまり記録媒体の規模を縮小
できるという利点がある。【００１４】次に、他の構成部分を説明すると、操作部
１２は例えば複数の入力キーなどを含んでおり、これに
よってページ送り（改ページ）の指示，動作モード指示
など各種の入力が行われる。伸長部１６は、ＲＯＭカー
トリッジ１３からＲＯＭドライバ１４によって読み出さ
れた圧縮音声情報を伸長するためのものである。伸長さ
れた音声データは、Ｄ／Ａ変換器２６でアナログの音声
信号に変換される。ディスプレイプロセッサ２０は、Ｒ
ＯＭカートリッジ１３から読み出されたデータに基づい
て対応する表示をディスプレイ３０に行う表示処理を行
うためのものである。【００１５】図３には、ディスプレイ３０の表示エリア
の一例が示されている。この例は、液晶表示装置の例
で、縦が４８０ドット，横が６００ドットとなってい
る。そして、上下にはそれぞれ２４ドットのボーダーエ
リア３０Ａ，３０Ｂがあり、これらを除く中央部分がテ
キストエリア３０Ｃとなっている。このテキストエリア
３０Ｃには、文字の大きさにもよるが、縦１８文字，横
２５文字が表示可能となっている。【００１６】辞書・素片データ部１８は、テキスト解析
・合成パラメータ生成部２２におけるテキスト解析や音
声合成パラメータ生成の際に参照されるもので、音声合
成に必要なデータが格納されている。テキスト解析・合
成パラメータ生成部２２は、辞書・素片データ部１８を
参照して、音声合成部２４で音声合成を行うために必要
なパラメータを得るためのものである。これら辞書・素
片データ部１８，テキスト解析・合成パラメータ生成部
２２，音声合成部２４によって、テキスト音声変換装置
が構成されている。【００１７】加算器２８は、Ｄ／Ａ変換器２６から供給
されるＲＯＭカートリッジ１３に付加されていた音声信
号（以下単に「付加音声信号」という）と、音声合成部
２４から供給されるテキスト変換された音声信号（以下
単に「変換音声信号」という）とを加算するためのもの
である。【００１８】次に、ＣＰＵ１０は、操作部１２によって
指示された動作モードに対応して各部の動作を制御する
ためのものである。図４には、ＣＰＵ１０の基本的な動
作のフローチャートが示されている。まず、装置起動時
は、同図（A）に示すように、システムのイニシャライ
ズ（ステップＳ１０）を行い、その後メインルーチン
（ステップＳ１２）に移行する。このメインルーチンで
は、ＣＰＵ１０は、いずれのモードの動作，つまりタス
クが指示されるかが監視されている。そして、使用者が
操作部１２を操作していずれかのタスクが指示される
と、同図（B）に示す割り込みルーチン（ステップＳ２
０）として実行される。実行後は、再び、ステップＳ１
２に戻る（ステップＳ２２）。【００１９】＜実施例の動作＞次に、以上のように構成
された実施例の動作を説明する。なお、主要な３つの動
作モードがあるので、各モード毎に説明する。【００２０】（1）合成再生モードこのモードを使用者が選択すると、付加音声と変換音声
とが合成して再生される。例えば、「鶯は『ホーホケキ
ョ』と鳴きます。」というテキストの『ホーホケキョ』
の部分に実際の鳥の泣き声が付加されているとする。こ
の場合、図５（A）に示すように、「鶯は」の部分は、
文字データからの変換音声による朗読が行われる。しか
し『ホーホケキョ』の部分は付加音声が再生され、文字
データからの音声変換は行われない。次の「と鳴きま
す」の部分は、文字データからの変換音声が朗読され
る。【００２１】次に、図６のフローチャートを参照しなが
ら、この合成再生モードにおける割り込みルーチンの動
作を説明する。使用者がこのモードを操作部１２で指示
すると、図６に示すこのモードの割り込みルーチンが実
行される。まず、ＲＯＭドライバ１４によってＲＯＭカ
ートリッジ１３からデータが読み込まれる（ステップＳ
３０）。そして、ディスプレイプロセッサ２０により、
ディスプレイ３０のテキストエリア３０Ｃ（図３参照）
に、読みまれたデータが表示される（ステップＳ３
２）。ページ送りが指示されれば、次のページが表示さ
れる。【００２２】次に、ＣＰＵ１０では、表示データのフレ
ームのサブヘッダ（図２参照）が参照される。そして、
サブヘッダが「0」の場合は（ステップＳ３４のＹ）、
そのフレームのデータが、辞書・素片データ部１８，テ
キスト解析・合成パラメータ生成部２２，音声合成部２
４によってテキスト音声変換され、変換音声信号が出力
される（ステップＳ３６）。【００２３】他方、サブヘッダが「1」の場合は（ステ
ップＳ３４のＮ）、圧縮された音声データが付加されて
いるので、伸長部１６でその音声データが伸長され、付
加音声信号が出力される（ステップＳ３８）。次のフレ
ームがあれば、以上の動作が繰り返される（ステップＳ
４０）。このように、合成再生モードでは、ステップＳ
３６，Ｓ３８の動作が選択的に行われ、図５（A）に示
したように付加音声と変換音声とが選択的に朗読される
ようになる。【００２４】（2）加算再生モードこのモードを使用者が選択すると、付加音声と変換音声
とが加算されて再生される。例えば、「鶯は『ホーホケ
キョ』と鳴きます。」というテキストに森のイメージを
表わすような音楽（ＢＧＭ）が付加されているとする。
この場合、図５（B）に示すように、「鶯は『ホーホケ
キョ』と鳴きます」の全体について、文字データからの
変換音声による朗読が行われる。他方、それらととも
に、ＢＧＭの付加音声が再生され、文字データからの変
換音声とともに再生される。つまり、このモードでは、
ＢＧＭに乗って文章が朗読される。【００２５】次に、図７のフローチャートを参照しなが
ら、この加算再生モードにおける割り込みルーチンの動
作を説明する。使用者がこのモードを操作部１２で指示
すると、図７に示すこのモードの割り込みルーチンが実
行される。前記動作モードと同様にして、まず、ＲＯＭ
ドライバ１４によってＲＯＭカートリッジ１３からデー
タが読み込まれる（ステップＳ３０）。そして、ディス
プレイプロセッサ２０により、ディスプレイ３０のテキ
ストエリア３０Ｃ（図３参照）に、読みまれたデータが
表示される（ステップＳ３２）。ページ送りが指示され
れば、次のページが表示される。【００２６】次に、ＣＰＵ１０の指示により、表示デー
タの最初のフレームの文字に対して、辞書・素片データ
部１８，テキスト解析・合成パラメータ生成部２２，音
声合成部２４によるテキスト音声変換され、変換音声信
号が出力される（ステップＳ５０）。同時に、付加され
ている圧縮されたＢＧＭの音声データが伸長部１６で伸
長され、付加音声信号が出力される（ステップＳ５
０）。これらの各音声は、同時に重畳して再生される。
以上の動作が、以降のフレームについて繰り返される
（ステップＳ４０）。このように、加算再生モードで
は、図５（B）に示したように、付加音声と変換音声と
が重畳して朗読再生されるようになる。【００２７】（3）黙読再生モードこの動作モードは、上述した朗読を行わない，つまり使
用者がディスプレイ３０の表示を黙読するような場合に
好適な動作モードである。使用者が黙読開始後一定時間
たってもページ送りしないときは、上述したいずれかの
朗読モードに切り替えられ、朗読が行われるようにな
る。従って、例えば黙読の最中に集中力が低下したよう
な場合、朗読が自動的に開始されるので、使用者に注意
を喚起したり、読書中の集中力をの低下を補うことが可
能となる。【００２８】図８を参照しながら、この動作モードにお
ける割込みルーチンの動作を説明する。使用者がこのモ
ードを操作部１２で指示すると、図８に示すこのモード
の割り込みルーチンが実行される。まず、ＲＯＭドライ
バ１４によってＲＯＭカートリッジ１３からデータが読
み込まれる（ステップＳ３０）。そして、ディスプレイ
プロセッサ２０により、ディスプレイ３０のテキストエ
リア３０Ｃ（図３参照）に、読みまれたデータが表示さ
れる（ステップＳ３２）。ページ送りが指示されれば、
次のページが表示される。これらの動作は、上述したモ
ードと同様である。【００２９】次に、ＣＰＵ１０では内部のカウンタがリ
セットされるとともに、時間のカウントが開始される
（ステップＳ６０）。このカウントは、ページ送りされ
たときは中止され、その後リセットされて再びカウント
が開始される（ステップＳ６４）。しかし、ページ送り
されなければ一定値までカウントされる（ステップＳ６
６，Ｓ６２）。すると、ＣＰＵ１０の指示によりタスク
が変更され、そのタスクが実行される（ステップＳ６
８）。例えば、上述した合成再生モードにタスクが変更
され、そのモードの朗読動作が実行される。【００３０】このように、あるページが表示されてから
ページ送りされることなく一定カウントが行われたと
き、つまり、テキストのあるページが表示されてからペ
ージ送りされることなく所定時間が経過すると、テキス
トの朗読が自動的に行われる。【００３１】＜他の実施例＞この発明は、以上の開示に
基づいて多様に改変することが可能であり、例えば次の
ようなものがある。（1）前記実施例では、記録媒体としてＣＤ−ＲＯＭを
用いたが、ＦＤなど各種のものを用いてよい。表示装置
も同様であり、液晶ディスプレイに限定されるものでは
ない。（2）上述した音声再生の動作モードは、必ずしも全部
備える必要はなく、いずれか１つのみを備えるようにし
てもよい。（3）前記実施例は、表示内容が主として文字のみの場
合であるが、他の情報、例えば絵や写真などの画像が含
まれるような場合であっても適用可能である。【００３２】【発明の効果】以上説明したように、この発明によれ
ば、次のような効果がある。（1）文字データを音声変換することとしたので、記録
媒体に格納されている全ての文字データについて音声デ
ータを付加する必要がなく、効率よく記録媒体を利用し
て、良好な朗読が可能となる。また、テキスト変換の音
声を男女いずれか選択したり、あるいは好みの俳優，歌
手などの声に変換したりすることもでき、趣向性のある
朗読が可能となる。（2）付加音声と変換音声の合成再生や、両音声の加算
再生など、各種のモードによる朗読を楽しむことができ
る。（3）表示後一定時間経過してもページ送りがないとき
は、自動的に朗読モードに切り替えられることとしたの
で、使用者の注意を喚起したり、読書中の集中力の低下
を補うことができる。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention
E-book device that displays text information stored on an information recording medium such as a liquid crystal display on a display,
More specifically, the present invention relates to an improvement in the reproduction of audio information. BACKGROUND ART [0002] The contents of various books such as dictionaries and novels (hereinafter simply referred to as "text")
Is displayed on a display and an e-book reader for reading the information is disclosed in, for example, Japanese Patent Application Laid-Open No. 6-202686.
Is disclosed in Japanese Patent Application Laid-Open Publication No. HEI 9-203 (1995). According to this, the existing text is displayed as it is in its original type, and the reading sound can be heard. In addition, the handwriting can be freely saved from the electronic pen, and can be saved. [0003] In some cases, audio data is recorded on a recording medium such as a CD-ROM in addition to data such as characters and images. For example, crying of a bird, pronunciation of a foreign language, BGM of a sound of a wave, and the like. In the case where these sounds are reproduced together with the reading sound of the text, it is necessary to devise such that both can be heard well. [0004] In addition, even when reading a text on a display silently without reading the text, if there is a function that alerts the user or compensates for a decrease in concentration during reading, It is convenient. The present invention focuses on the above points,
To provide an easy-to-use and convenient electronic book device that can efficiently reproduce various voices to perform good reading, draw attention of a user, and compensate for a decrease in concentration during reading. Is the purpose. In order to achieve the above-mentioned object, according to the present invention, character data and additional data are added to the character data.
Data in different frame units
From the information recording medium stored with the separate flag
An electronic book device for reproducing character data and voice data.
Recording the information together with the character data and voice data.
Based on the data identification flag read from the medium,
Identification means for identifying the voice data and the character data
And the character data identified by the identification means.
Display means for indicating, and the identification means
Audio reproducing means for obtaining additional audio from the audio data
And the character data identified by the identification means.
Voice conversion means for converting the voice to obtain a converted voice.
In addition, according to the characters displayed by the display means,
Selectively outputting either the voice or the converted voice
Corresponding to the first mode and the characters displayed by the display means.
To add and output the additional audio and the converted audio
Corresponding to the second mode and the characters displayed by the display means.
Output any of the additional sound and the converted sound.
And a third mode in which no force is applied.
Page display after character display by the display means
If a certain period of time has passed without being
The mode is shifted to the second mode.
You. [0007] The above and other objects of this invention, features, and advantages will become apparent from the following detailed description and accompanying drawings. DESCRIPTION OF THE PREFERRED EMBODIMENTS While there are many possible embodiments of the present invention, a suitable number of embodiments are shown and described in detail. <Structure of Embodiment> First, the structure of the main part of the embodiment will be described with reference to FIG. In FIG. 1, a CPU 10 includes an operation unit 12 for a user of the electronic book device to give an operation instruction and an RO for driving a CD-ROM cartridge 13.
M driver 14, decompression unit 16 for decompressing compressed information, dictionary for storing data for character display and voice conversion
A segment data section 18, a display processor 20 for performing display processing, a text analysis / synthesis parameter generation section 22 for obtaining text analysis and synthesis parameters for speech synthesis, a speech synthesis section 24 for performing speech synthesis, Is connected. A D / A converter 26 is connected to the decompression unit 16, and the speech synthesis unit 24 and the D / A converter 26
Is connected to the input side of the adder 28. The output side of the adder 28 is connected to audio output means such as a speaker (not shown). On the other hand, a display 30 is connected to the display processor 20. Next, the format of data stored in the ROM cartridge 13 set in the ROM driver 14 will be described. The basic format conforms to the well-known CD-ROM standard. However, if the main information included in the frame is only character information, the frame includes SYNC (synchronization signal) and header (address information). The subsequent subheader (coding information) is set to, for example, a logical value “0” (see FIG. 3A). For example, in the case of a novel or the like, the text is usually only characters, and the main information of each frame is only character information. In this case, since the main information is only the code data, it has a fixed length of 2312 bytes. When the main information is accompanied by audio data, the subheader is set to, for example, a logical value "1" (see FIG. 1B). For example, in the case of data such as a bird's cry, voice data of the cry is attached. In this case, since the crying data is compressed, the main information has a variable length. In the illustrated example, the size is 2292 bytes. The data structure of the entire ROM cartridge 13 is as shown in FIG. As will be described later, when the subheader is "0", it is only character data, so that the main information is converted to text-to-speech as described later. On the other hand, when the header is "1", the audio signal is extracted while decompressing the main information in the attached frame. As described above, by storing the compressed voice data in the ROM cartridge 13 while separating it from the character data,
There is an advantage that the scale of the ROM cartridge 13, that is, the recording medium can be reduced. Next, the other components will be described. The operation unit 12 includes, for example, a plurality of input keys, which are used to perform various inputs such as a page feed (page break) instruction and an operation mode instruction. . The expansion section 16 expands the compressed audio information read from the ROM cartridge 13 by the ROM driver 14. The expanded audio data is converted by a D / A converter 26 into an analog audio signal. The display processor 20 uses R
This is for performing display processing for displaying a corresponding display on the display 30 based on the data read from the OM cartridge 13. FIG. 3 shows an example of the display area of the display 30. This example is an example of a liquid crystal display device, and has 480 dots vertically and 600 dots horizontally. The upper and lower border areas 30A and 30B of 24 dots are provided, respectively, and the central part excluding these is a text area 30C. This text area 30C can display 18 characters vertically and 25 characters horizontally, depending on the size of the characters. The dictionary / segment data section 18 is referred to at the time of text analysis and speech synthesis parameter generation at the text analysis / synthesis parameter generation section 22, and stores data necessary for speech synthesis. The text analysis / synthesis parameter generation section 22 refers to the dictionary / segment data section 18 to obtain parameters necessary for the speech synthesis section 24 to perform speech synthesis. The dictionary / speech data unit 18, the text analysis / synthesis parameter generation unit 22, and the speech synthesis unit 24 constitute a text-to-speech converter. The adder 28 converts the audio signal (hereinafter simply referred to as “additional audio signal”) added to the ROM cartridge 13 supplied from the D / A converter 26 with the text conversion supplied from the audio synthesizer 24. And a converted audio signal (hereinafter simply referred to as “converted audio signal”). Next, the CPU 10 controls the operation of each unit according to the operation mode specified by the operation unit 12. FIG. 4 shows a flowchart of the basic operation of the CPU 10. First, when the apparatus is started, the system is initialized (step S10) as shown in FIG. 9A, and then the process proceeds to the main routine (step S12). In this main routine, the CPU 10 monitors which mode of operation, that is, which task is instructed. Then, when the user operates the operation unit 12 to specify one of the tasks, an interrupt routine (step S2) shown in FIG.
0). After execution, step S1 is performed again.
Return to step 2 (step S22). <Operation of Embodiment> The operation of the embodiment configured as described above will now be described. Since there are three main operation modes, each mode will be described. (1) Synthetic reproduction mode When the user selects this mode, the additional voice and the converted voice are synthesized and reproduced. For example, the text “Hoogokekyo” with the text “Uguissels sing“ Hohokyo ”
It is assumed that an actual bird's cry is added to the part. In this case, as shown in FIG.
The reading by the converted voice from the character data is performed. However, the additional sound is reproduced in the part of "Hohokekyo", and sound conversion from character data is not performed. In the next part, "sound," the converted voice from the character data is read aloud. Next, the operation of the interrupt routine in the composite reproduction mode will be described with reference to the flowchart of FIG. When the user instructs this mode with the operation unit 12, an interruption routine of this mode shown in FIG. 6 is executed. First, data is read from the ROM cartridge 13 by the ROM driver 14 (Step S).
30). Then, by the display processor 20,
Text area 30C of display 30 (see FIG. 3)
The read data is displayed (step S3).
2). If the page feed is instructed, the next page is displayed. Next, the CPU 10 refers to the subheader (see FIG. 2) of the frame of the display data. And
If the subheader is “0” (Y in step S34),
The data of the frame includes a dictionary / segment data unit 18, a text analysis / synthesis parameter generation unit 22, and a speech synthesis unit 2.
4, the text-to-speech conversion is performed, and a converted sound signal is output (step S36). On the other hand, if the subheader is "1" (N in step S34), since the compressed audio data has been added, the audio data is expanded by the expansion unit 16 and an additional audio signal is output. (Step S38). If there is a next frame, the above operation is repeated (step S
40). As described above, in the composite playback mode, step S
The operations of S36 and S38 are selectively performed, and the additional voice and the converted voice are selectively read aloud as shown in FIG. (2) Addition and reproduction mode When this mode is selected by the user, the additional sound and the converted sound are added and reproduced. For example, it is assumed that music (BGM) that represents the image of a forest is added to the text "Uguissuru squeaks."
In this case, as shown in FIG. 5 (B), the reading of the whole "Uguisuru squeaks" is converted from the character data by the converted voice. On the other hand, the additional sound of BGM is reproduced together with them, and reproduced together with the converted sound from the character data. In other words, in this mode,
The text is read aloud on the BGM. Next, the operation of the interrupt routine in the addition reproduction mode will be described with reference to the flowchart of FIG. When the user instructs this mode with the operation unit 12, an interrupt routine of this mode shown in FIG. 7 is executed. First, as in the operation mode,
Data is read from the ROM cartridge 13 by the driver 14 (step S30). Then, the read data is displayed in the text area 30C (see FIG. 3) of the display 30 by the display processor 20 (step S32). If the page feed is instructed, the next page is displayed. Next, in accordance with an instruction from the CPU 10, text / speech conversion is performed on the characters of the first frame of the display data by the dictionary / segment data unit 18, the text analysis / synthesis parameter generation unit 22, and the speech synthesis unit 24. A converted audio signal is output (Step S50). At the same time, the added compressed BGM audio data is expanded by the expansion unit 16 and an additional audio signal is output (step S5).
0). These sounds are simultaneously superimposed and reproduced.
The above operation is repeated for the subsequent frames (step S40). As described above, in the addition reproduction mode, as shown in FIG. 5B, the additional voice and the converted voice are read and reproduced in a superimposed manner. (3) Silent Reading Reproduction Mode This operation mode is a suitable operation mode in which the above-described reading is not performed, that is, the user reads the display on the display 30 silently. If the user does not turn the page after a certain period of time from the start of the silent reading, the mode is switched to one of the above-mentioned reading modes, and the reading is performed. Therefore, for example, when the concentration is reduced during the silent reading, the reading is automatically started, so that it is possible to alert the user or compensate for the decrease in the concentration during reading. Become. The operation of the interrupt routine in this operation mode will be described with reference to FIG. When the user instructs this mode with the operation unit 12, an interruption routine of this mode shown in FIG. 8 is executed. First, data is read from the ROM cartridge 13 by the ROM driver 14 (step S30). Then, the read data is displayed in the text area 30C (see FIG. 3) of the display 30 by the display processor 20 (step S32). If page turn is instructed,
The next page is displayed. These operations are the same as in the above-described mode. Next, the CPU 10 resets the internal counter and starts counting time (step S60). This count is stopped when the page is turned, then reset and restarted (step S64). However, if the page is not turned, it is counted up to a certain value (step S6).
6, S62). Then, the task is changed by the instruction of the CPU 10, and the task is executed (step S6).
8). For example, the task is changed to the above-described composite reproduction mode, and the reading operation in that mode is executed. As described above, when a certain count is performed without turning a page after a certain page is displayed, that is, when a predetermined time elapses without a page being turned after a certain page of text is displayed. , The reading of the text is done automatically. <Other Embodiments> The present invention can be variously modified based on the above disclosure, and includes, for example, the following. (1) In the above embodiment, a CD-ROM is used as a recording medium, but various types such as an FD may be used. The display device is similar, and is not limited to the liquid crystal display. (2) It is not always necessary to provide all of the above-described operation modes for sound reproduction, and only one of them may be provided. (3) The above-described embodiment is mainly for a case where the display content is mainly characters, but can be applied to a case where other information, for example, an image such as a picture or a photograph is included. As described above, according to the present invention, the following effects can be obtained. (1) Since the character data is converted to voice, there is no need to add voice data to all the character data stored in the recording medium. Become. In addition, the voice of the text conversion can be selected for either male or female, or can be converted to the voice of a favorite actor, singer, or the like. (2) You can enjoy reading aloud in various modes, such as synthesis playback of additional voice and converted voice, and addition playback of both voices. (3) If there is no page turning after a certain period of time after the display, the mode is automatically switched to the reading mode. Can be.

【図面の簡単な説明】【図１】この発明の実施例の構成の主要部を示すブロッ
ク図である。【図２】実施例のデータフォーマットの概略を示す図で
ある。【図３】実施例のディスプレイの表示エリアを示す図で
ある。【図４】実施例のＣＰＵの基本的な動作を示すフローチ
ャートである。【図５】実施例における朗読の様子を示す図である。【図６】実施例の動作を示すフローチャートである。【図７】実施例の動作を示すフローチャートである。【図８】実施例の動作を示すフローチャートである。【符号の説明】１０…ＣＰＵ１２…操作部１３…ＲＯＭカートリッジ１４…ＲＯＭドライバ１６…伸長部１８…辞書・素片データ部２０…ディスプレイプロセッサ２２…テキスト解析・合成パラメータ生成部２４…音声合成部２６…Ｄ／Ａ変換器２８…加算器３０…ディスプレイBRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing a main part of the configuration of an embodiment of the present invention. FIG. 2 is a diagram showing an outline of a data format of an embodiment. FIG. 3 is a diagram illustrating a display area of a display according to the embodiment. FIG. 4 is a flowchart illustrating a basic operation of a CPU according to the embodiment. FIG. 5 is a diagram showing a state of reading aloud in the embodiment. FIG. 6 is a flowchart showing the operation of the embodiment. FIG. 7 is a flowchart showing the operation of the embodiment. FIG. 8 is a flowchart showing the operation of the embodiment. [Description of Signs] 10 CPU 12 Operating unit 13 ROM cartridge 14 ROM driver 16 Decompression unit 18 Dictionary / unit data unit 20 Display processor 22 Text analysis / synthesis parameter generation unit 24 Voice synthesis unit 26 D / A converter 28 Adder 30 Display

Claims

(57) [Claims] [Claim 1] Character data and the character data
The data identification file is in frame units that are different from the audio data.
From the information recording medium stored with the lag
An electronic book device for reproducing data and audio data, wherein the information recording medium is used together with the character data and audio data.
Based on the data identification flag read from
An identification unit for identifying voice data and the character data, and displaying the character data identified by the identification unit.
Display means for adding the voice data identified by the identification means.
Voice reproducing means for obtaining voice, and converting the character data identified by the identifying means into voice
Voice converting means for obtaining a converted voice by converting the additional voice corresponding to the characters displayed by the display means.
And selectively outputting one of the converted voice and the converted voice.
And the additional voice corresponding to the characters displayed by the display means.
And a second mode for adding and outputting the converted sound and the additional sound corresponding to the character displayed by the display means.
And a third mode which does not output any of the converted voices.
And a character display by the display means in the third mode.
After that, a certain period of time has passed without page turning
In such a case, the mode is shifted to the first or second mode.
An electronic book device characterized by the following .