JP6421167B2

JP6421167B2 - Digital content playback and recording device

Info

Publication number: JP6421167B2
Application number: JP2016238872A
Authority: JP
Inventors: 西澤　達夫; 達夫西澤; 佑介田代; 真史神林
Original assignee: Shinano Kenshi Co Ltd
Current assignee: Shinano Kenshi Co Ltd
Priority date: 2016-12-08
Filing date: 2016-12-08
Publication date: 2018-11-07
Anticipated expiration: 2036-12-08
Also published as: JP2018097043A

Description

本発明はデジタルコンテンツ再生録音装置に関する。 The present invention relates to a digital content playback / recording apparatus.

本体またはネットワーク上（いわゆるクラウド上）に配設された記憶部に記憶されているデジタルコンテンツを表示部や音声出力部に出力するデジタルコンテンツ再生録音装置としては、使用者の使用目的に合わせた構成のものが多数提供されている。 A digital content playback / recording device that outputs digital content stored in a storage unit arranged on the main unit or on a network (on the so-called cloud) to a display unit or an audio output unit, is configured in accordance with the user's purpose of use. Many things are offered.

このようなデジタルコンテンツ再生装置としては、表示部に表示させるテキストデータを適宜の位置で句切り、区切られた部分のテキストデータの表示とこれに対応する音声データを同期させて表示および再生させることが可能な構成が知られている（特許文献１参照）。 As such a digital content playback apparatus, text data to be displayed on the display unit is punctuated at an appropriate position, and the display of the text data in the divided portion and the corresponding audio data are displayed and played back in synchronization with each other. The structure which can do is known (refer patent document 1).

特開２０１６−０１２０９９号公報JP 2006-012099 A

特許文献１に開示されているデジタルコンテンツ再生装置を音読や復唱等の学習に用いることで、現在再生されている音声データとテキストデータの同期をとることができるため、使用者の音読学習がしやすくなるという点において好都合である。このような音読や復唱の学習を行う際においては、自分の音読内容や復唱内容を手本と比較することにより、どの程度手本に忠実に音読や復唱ができているかについて確認することが重要である。しかしながら従来のデジタルコンテンツ再生装置においては、このような音読内容や復唱内容と手本とを比較することができるような機能を有する構成が提案されておらず、音読内容や復唱内容と手本との比較を行うことが困難であるといった課題を有している。 By using the digital content playback apparatus disclosed in Patent Document 1 for learning such as reading aloud and reading back, it is possible to synchronize the voice data currently being played back with the text data. This is advantageous in that it is easier. When learning reading aloud or reading aloud like this, it is important to check how well you read aloud and repeat aloud by comparing your reading aloud and reading content with the example. It is. However, in the conventional digital content playback apparatus, a configuration having a function capable of comparing the content of reading aloud or the content of recitation with a model has not been proposed. It is difficult to compare the two.

そこで本発明は、使用者がデジタルコンテンツを用いた音読や復唱等の学習をする際において、自分の音読内容や復唱内容と手本との比較を行いやすくすることで、自分の音読内容や復唱内容がどの程度手本に忠実であるかを確認することが可能なデジタルコンテンツ再生録音装置の提供を目的としている。 Therefore, the present invention makes it easy for the user to compare his / her reading contents / repeated contents with a model when learning to read / return using digital contents. It is an object of the present invention to provide a digital content playback / recording apparatus that can confirm how faithful the content is.

上記課題を解決するため発明者が鋭意研究した結果、以下の構成に想到した。 As a result of intensive studies by the inventor in order to solve the above-mentioned problems, the following configuration has been conceived.

すなわち、本発明は、テキストデータを含むデジタルコンテンツが記憶されている記憶部と、前記テキストデータの全部または一部を表示する表示部と、前記テキストデータから単位フレーズを生成する単位フレーズ生成部と、前記単位フレーズに対応する前記テキストデータに基づいて音声合成により音声データを生成する音声データ生成部と、音声出力部と、使用者が前記テキストデータを読み上げた音声を音声録音データとして記録する音声記録部と、前記単位フレーズに対応する前記音声データの長さに基づいて前記単位フレーズに対応する前記テキストデータを前記表示部に表示するフレーズ表示時間を規定するフレーズ表示時間規定部と、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させるフレーズ出力制御部と、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させた後、前記音声記録部により前記音声録音データを収集させると共に、収集した前記音声録音データを前記記憶部に記憶させる音声録音データ収集制御部と、前記デジタルコンテンツを、前記単位フレーズごとに前記単位フレーズに対応する前記テキストデータおよび前記音声データを前記表示部および前記音声出力部に出力させ、さらに前記単位フレーズに対応する前記音声録音データを前記音声出力部に出力させる音声録音データ確認制御部と、を具備していることを特徴とするデジタルコンテンツ再生録音装置である。 That is, the present invention includes a storage unit that stores digital content including text data, a display unit that displays all or part of the text data, and a unit phrase generation unit that generates a unit phrase from the text data. A voice data generation unit that generates voice data by voice synthesis based on the text data corresponding to the unit phrase, a voice output unit, and a voice that records a voice read by the user as the voice recording data A recording unit; a phrase display time defining unit that defines a phrase display time for displaying the text data corresponding to the unit phrase on the display unit based on a length of the audio data corresponding to the unit phrase; The text corresponding to the unit phrase without outputting the audio data to the output unit A phrase output control unit that highlights data on the display unit over the phrase display time, and the text data corresponding to the unit phrase over the phrase display time without outputting the audio data to the audio output unit. After highlighting on the display unit, the voice recording data is collected by the voice recording unit, and the voice recording data collection control unit for storing the collected voice recording data in the storage unit, and the digital content, For each unit phrase, the text data and the voice data corresponding to the unit phrase are output to the display unit and the voice output unit, and the voice recording data corresponding to the unit phrase is output to the voice output unit. And a recording data confirmation control unit. It is a digital content playback recording device.

また、テキストデータおよび前記テキストデータに対応した音声データを含むデジタルコンテンツが記憶されている記憶部と、前記テキストデータの全部または一部を表示する表示部と、前記テキストデータおよび前記音声データから単位フレーズを生成する単位フレーズ生成部と、音声出力部と、使用者が前記テキストデータを読み上げた音声を音声録音データとして記録する音声記録部と、前記単位フレーズに対応する前記音声データの長さに基づいて前記単位フレーズに対応する前記テキストデータを前記表示部に表示するフレーズ表示時間を規定するフレーズ表示時間規定部と、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させるフレーズ出力制御部と、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させた後、前記音声記録部により前記音声録音データを収集させると共に、収集した前記音声録音データを前記記憶部に記憶させる音声録音データ収集制御部と、前記デジタルコンテンツを、前記単位フレーズごとに前記単位フレーズに対応する前記テキストデータおよび前記音声データを前記表示部および前記音声出力部に出力させ、さらに前記単位フレーズに対応する前記音声録音データを前記音声出力部に出力させる音声録音データ確認制御部と、を具備していることを特徴とするデジタルコンテンツ再生録音装置とすることもできる。 A storage unit for storing digital data including text data and audio data corresponding to the text data; a display unit for displaying all or part of the text data; and units from the text data and the audio data. A unit phrase generation unit that generates a phrase, an audio output unit, an audio recording unit that records the audio read out by the user as the audio recording data, and the length of the audio data corresponding to the unit phrase Based on the phrase display time defining unit for defining the phrase display time for displaying the text data corresponding to the unit phrase on the display unit, and corresponding to the unit phrase without outputting the audio data to the audio output unit The text data to be highlighted on the display section over the phrase display time A phrase output control unit, and the voice recording unit after highlighting the text data corresponding to the unit phrase over the phrase display time without outputting the voice data to the voice output unit. The voice recording data is collected by the voice recording data collection control unit that stores the collected voice recording data in the storage unit, and the digital content is the text data corresponding to the unit phrase for each unit phrase. And a voice recording data confirmation control unit that outputs the voice data to the display unit and the voice output unit, and further outputs the voice recording data corresponding to the unit phrase to the voice output unit. It is possible to provide a digital content reproduction / recording apparatus characterized by the above.

また、テキストデータまたはテキストデータおよび前記テキストデータに対応した音声データを含むデジタルコンテンツと、前記デジタルコンテンツの構成内容が記述されたコンテンツ構成内容データと、が記憶されている記憶部と、前記テキストデータの全部または一部を表示する表示部と、前記テキストデータまたは前記テキストデータおよび前記音声データから単位フレーズを生成する単位フレーズ生成部と、前記単位フレーズに対応する前記テキストデータに基づいて音声合成により前記音声データを生成する音声データ生成部と、音声出力部と、使用者が前記テキストデータを読み上げた音声を音声録音データとして記録する音声記録部と、前記単位フレーズに対応する前記音声データの長さに基づいて、前記単位フレーズに対応する前記テキストデータを前記表示部に表示するフレーズ表示時間を規定するフレーズ表示時間規定部と、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させるフレーズ出力制御部と、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させた後、前記音声記録部により前記音声録音データを収集させると共に、収集した前記音声録音データを前記記憶部に記憶させる音声録音データ収集制御部と、前記デジタルコンテンツを、前記単位フレーズごとに前記単位フレーズに対応する前記テキストデータおよび前記音声データを前記表示部および前記音声出力部に出力させ、さらに前記単位フレーズに対応する前記音声録音データを前記音声出力部に出力させる音声録音データ確認制御部と、前記コンテンツ構成内容データの内容に基づいて、前記単位フレーズ生成部と前記フレーズ表示時間規定部との動作内容を切り替える動作内容切替制御部と、を具備し、前記動作内容切替制御部は、前記コンテンツ構成内容データを参照し、前記デジタルコンテンツが前記テキストデータに対応した音声データを含んでいないと判断した場合には、前記単位フレーズ生成部に前記テキストデータから前記単位フレーズを生成させる処理と、前記音声データ生成部に前記単位フレーズに対応する前記テキストデータに基づいて音声合成により前記音声データを生成させる処理を実行し、前記デジタルコンテンツが前記テキストデータに対応した音声データを含んでいると判断した場合には、前記単位フレーズ生成部に、前記テキストデータおよび前記音声データから前記単位フレーズを生成させ、前記音声データ生成部の動作をスキップさせる処理を実行することを特徴とするデジタルコンテンツ再生録音装置とすることもできる。 In addition, a storage unit storing digital data including text data or text data and audio data corresponding to the text data, content configuration content data describing the configuration content of the digital content, and the text data A display unit that displays all or part of the text, a unit phrase generation unit that generates a unit phrase from the text data or the text data and the voice data, and voice synthesis based on the text data corresponding to the unit phrase. A voice data generation unit that generates the voice data; a voice output unit; a voice recording unit that records voice that the user reads out the text data as voice recording data; and a length of the voice data corresponding to the unit phrase. Based on the above, it corresponds to the unit phrase A phrase display time defining unit that defines a phrase display time for displaying the text data on the display unit, and the phrase data is output to the speech output unit without outputting the audio data to the phrase. A phrase output control unit that highlights on the display unit over a display time, and the text data corresponding to the unit phrase is emphasized on the display unit over the phrase display time without outputting the audio data to the audio output unit After the display, the voice recording data is collected by the voice recording unit, and the voice recording data collection control unit that stores the collected voice recording data in the storage unit, and the digital content for each unit phrase Before the text data and the voice data corresponding to the unit phrase Based on the content of the audio recording data confirmation control unit that outputs the audio recording data corresponding to the unit phrase to the audio output unit and the audio recording data confirmation control unit that outputs to the display unit and the audio output unit, An operation content switching control unit that switches operation content between a unit phrase generation unit and the phrase display time defining unit, the operation content switching control unit refers to the content configuration content data, and the digital content is If it is determined that the voice data corresponding to the text data is not included, the unit phrase generating unit generates the unit phrase from the text data, and the voice data generating unit corresponds to the unit phrase. A process for generating the voice data by voice synthesis based on the text data is executed. When the digital content is determined to include audio data corresponding to the text data, the unit phrase generation unit generates the unit phrase from the text data and the audio data, and generates the audio data. It is also possible to provide a digital content playback / recording apparatus characterized by executing a process of skipping the operation of the unit.

以上の構成を採用することにより、使用者が音読対象となるフレーズの手本となる単位フレーズ対応音声データを聞いた直後に自らの音読音声を録音した音声録音データを作成すると共に、手本となる単位フレーズ対応音声データと音声録音データをそれぞれ確認することができるので、使用者が音読等の学習をする際における使い勝手を向上させることが可能になる。 By adopting the above configuration, immediately after listening to the unit phrase-corresponding voice data that is a model of the phrase that is to be read aloud, voice recording data that records its own voice reading voice is created, and Since the unit phrase-corresponding voice data and voice recording data can be respectively confirmed, it is possible to improve usability when the user learns reading aloud.

また、前記フレーズ出力制御部は、前記フレーズ表示時間にわたって前記単位フレーズに対応する前記音声データを前記音声出力部に出力させると共に、前記単位フレーズに対応する前記テキストデータを前記表示部に強調表示させた後に、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを前記フレーズ表示時間にわたって前記表示部に強調表示させることが好ましい。 The phrase output control unit causes the voice output unit to output the voice data corresponding to the unit phrase over the phrase display time, and causes the display unit to highlight the text data corresponding to the unit phrase. It is preferable that the text data corresponding to the unit phrase is highlighted on the display unit for the phrase display time without outputting the audio data to the audio output unit.

この構成によれば、使用者は単位フレーズに対応する音声データを単位フレーズに対応するテキストデータを見ながら聞いた後に、単位フレーズに対応するテキストデータを見ながら音声録音データの作成をすることができるため、音声録音データの作成時における音読の誤りを防ぐことができる。 According to this configuration, the user can listen to the voice data corresponding to the unit phrase while viewing the text data corresponding to the unit phrase, and then create the voice recording data while viewing the text data corresponding to the unit phrase. Therefore, it is possible to prevent reading errors when creating voice recording data.

また、前記フレーズ出力制御部は、前記フレーズ表示時間を超えてもなお、前記音声記録部による前記音声録音データの収集が継続されている場合、前記音声記録部による前記音声録音データの収集が終了するまでの間、前記表示部に前記単位フレーズに対応する前記テキストデータの表示を継続させることが好ましい。 In addition, the phrase output control unit ends the collection of the voice recording data by the voice recording unit when the voice recording data is continuously collected by the voice recording unit even if the phrase display time is exceeded. In the meantime, it is preferable to continue displaying the text data corresponding to the unit phrase on the display unit.

この構成によれば、さらに音読に時間がかかる使用者であっても単位フレーズに対応するテキストデータを見ながら音声録音データの作成をすることができる。 According to this configuration, even a user who takes time to read aloud can create voice recording data while viewing the text data corresponding to the unit phrase.

また、前記音声録音データ収集制御部は、前記音声出力部に前記音声データの出力をせずに前記単位フレーズに対応する前記テキストデータを強調表示させた後において、前記音声記録部に入力された最初の音声をトリガーとして前記音声記録部に前記音声録音データを収集させる処理を実行することが好ましい。 In addition, the voice recording data collection control unit is input to the voice recording unit after highlighting the text data corresponding to the unit phrase without outputting the voice data to the voice output unit. It is preferable to execute a process of causing the voice recording unit to collect the voice recording data using a first voice as a trigger.

これにより、音声録音データ収集制御部をいわゆる自動トリガーによって作動させることができるため、使用者は自身の音読状態に合わせてデジタルコンテンツ再生録音装置の操作をする必要がなく、使い勝手を向上させることができる。 As a result, since the voice recording data collection control unit can be operated by a so-called automatic trigger, the user does not need to operate the digital content playback / recording apparatus in accordance with his / her reading state, and the usability can be improved. it can.

また、前記音声録音データ収集制御部は、前記音声記録部に前記音声録音データを収集させる処理を実行した後、前記音声記録部への音声の無入力状態が所要時間継続したことをトリガーとして、前記音声記録部に前記音声録音データの収集を停止させる処理を実行することが好ましい。 In addition, the voice recording data collection control unit, after executing the process of collecting the voice recording data to the voice recording unit, as a trigger that the voice non-input state to the voice recording unit has continued for a required time, It is preferable to execute processing for causing the voice recording unit to stop collecting the voice recording data.

これにより、音声録音データ収集制御部をいわゆる自動トリガーによって停止させることができるため、使用者は自身の音読状態に合わせてデジタルコンテンツ再生録音装置の操作をする必要がなく、使い勝手を向上させることができる。 As a result, since the voice recording data collection control unit can be stopped by a so-called automatic trigger, the user does not need to operate the digital content playback / recording device in accordance with his / her reading state, which improves usability. it can.

また、前記音声録音データ収集制御部は、前記音声記録部が収集した前記音声録音データに対して、前記音声録音データの終端から前記音声記録部への音声の無入力状態が継続した時間の範囲を削除する処理を実行することが好ましい。 Further, the voice recording data collection control unit is a range of time during which no voice input state from the end of the voice recording data to the voice recording unit has continued for the voice recording data collected by the voice recording unit It is preferable to execute processing for deleting.

これにより、自動トリガーで収集した音声録音データの不要な部分をトリミングすることができ、音声録音データのデータ容量を削減することができると共に、音声録音データの再生時における余分な無音部分を無くすことができ、学習効率を向上させることができる。 This makes it possible to trim unnecessary portions of the voice recording data collected by the automatic trigger, reduce the data volume of the voice recording data, and eliminate the unnecessary silence when playing the voice recording data. And learning efficiency can be improved.

また、前記デジタルコンテンツ再生録音装置には音声録音データ収集操作部が配設されていて、前記音声録音データ収集制御部は、使用者による前記音声録音データ収集操作部の操作状態をトリガーとして、前記音声記録部に前記音声録音データを記録させる処理を実行することが好ましい。 The digital content playback / recording apparatus is provided with a voice recording data collection operation unit, and the voice recording data collection control unit is triggered by an operation state of the voice recording data collection operation unit by a user. It is preferable to execute processing for recording the voice recording data in a voice recording unit.

これにより、音声録音データ収集制御部をいわゆる手動トリガーによって作動させることができるため、使用者は自身の音読状態に合わせてデジタルコンテンツ再生録音装置の操作をすることができる。 As a result, the voice recording data collection control unit can be operated by a so-called manual trigger, so that the user can operate the digital content reproduction / recording apparatus in accordance with his / her reading state.

また、前記音声録音データ確認制御部は、前記単位フレーズごとに前記単位フレーズに対応する前記テキストデータおよび前記音声データを前記表示部および前記音声出力部に出力させた後に、前記表示部への前記単位フレーズに対応する前記テキストデータの出力状態を維持した状態で前記単位フレーズに対応する前記音声録音データを前記音声出力部に出力させることが好ましい。 Further, the voice recording data confirmation control unit outputs the text data and the voice data corresponding to the unit phrase to the display unit and the voice output unit for each unit phrase, and then outputs the text data and the voice data to the display unit. It is preferable that the voice recording unit corresponding to the unit phrase is output to the voice output unit while maintaining the output state of the text data corresponding to the unit phrase.

これにより、使用者が音読学習をする際において、模範音声データである単位フレーズ対応音声データと自分の音読データである音声録音データとの比較がしやすく、音読学習の効率を高めることができる。 Thus, when the user learns reading aloud, it is easy to compare the unit phrase-corresponding voice data that is model voice data and the voice recording data that is her own reading data, and the efficiency of reading aloud can be improved.

また、前記音声録音データ確認制御部は、前記単位フレーズごとに前記単位フレーズに対応する前記テキストデータを前記表示部に出力させた後に、前記単位フレーズに対応する前記音声録音データを前記音声出力部に出力させることが好ましい。 The voice recording data check control unit outputs the voice recording data corresponding to the unit phrase to the voice output unit after outputting the text data corresponding to the unit phrase to the display unit for each unit phrase. Is preferably output.

これにより、使用者が音読学習をする際において、テキストデータと自分の音読データである音声録音データとの比較のみを行うことができ、音読学習の習熟度が所定レベルに達した場合における音読学習の効率を高めることができる。 As a result, when the user learns to read aloud, the user can only compare the text data with the voice recording data that is his / her own aloud reading data. Can increase the efficiency.

また、前記記憶部には、前記単位フレーズに対応する前記音声録音データが複数記憶されていて、前記音声録音データ確認制御部は、特定の前記単位フレーズに対応する複数の前記音声録音データの中から使用者によって任意に選択された前記音声録音データを前記記憶部から読み出させると共に、前記音声出力部に出力させる処理を実行することが好ましい。 The storage unit stores a plurality of the voice recording data corresponding to the unit phrase, and the voice recording data confirmation control unit includes a plurality of the voice recording data corresponding to the specific unit phrase. It is preferable that the voice recording data arbitrarily selected by the user is read from the storage unit and is output to the voice output unit.

これにより、使用者が時系列に沿って自身の音声録音データを聞くことにより、過去の音読状態と現在の音読状態との比較を容易に行うことができ、音読学習の効果を実感することができ、学習意欲の向上に寄与することができる。 As a result, the user can easily compare the past reading state with the current reading state by listening to his / her voice recording data in time series, and can realize the effect of reading aloud. Can contribute to the improvement of learning motivation.

また、前記記憶部には、前記音声データを前記音声出力部に出力させる際における音声の特性を規定する音声特性データが予め複数記憶されていて、前記音声録音データ確認制御部は、使用者によりデータ入力手段を介して選択された前記音声特性データに基づいた前記音声データを前記音声出力部に出力させることが好ましい。 Further, the storage unit stores in advance a plurality of audio characteristic data that defines audio characteristics when the audio data is output to the audio output unit, and the audio recording data confirmation control unit is controlled by a user. It is preferable that the audio output unit outputs the audio data based on the audio characteristic data selected via the data input means.

これにより、手本となる単位フレーズ対応音声データを使用者の好みに応じて使い分けすることができる。 Thereby, the unit phrase corresponding | compatible audio | voice data used as a model can be selectively used according to a user's liking.

本発明にかかるデジタルコンテンツ再生録音装置の構成を採用することにより、使用者がＤＡＩＳＹデータに代表されるデジタルコンテンツを用いた音読や復唱等の学習をする際において、自分の音読内容や復唱内容と手本との比較が行いやすくなる。これにより、自分の音読内容や復唱内容がどの程度手本に忠実であるかを容易に確認することができ、使用者の音読学習の学習効率を大幅に向上させることが可能になる。また、使用者は本発明にかかるデジタルコンテンツ再生録音装置を用いれば、指導者が不在であっても音読学習を適切かつ効率的に自習することができる。 By adopting the configuration of the digital content playback / recording apparatus according to the present invention, when the user learns reading aloud or repeating using a digital content represented by DAISY data, It becomes easier to compare with the model. As a result, it is possible to easily confirm how faithfully the content of the reading aloud or the content of the recitation is faithful to the model, and the learning efficiency of the reading aloud learning of the user can be greatly improved. In addition, if the user uses the digital content playback / recording apparatus according to the present invention, the user can learn reading aloud appropriately and efficiently even when the instructor is absent.

第１実施形態におけるデジタルコンテンツ再生録音装置の概略構成図である。It is a schematic block diagram of the digital content reproduction | regeneration recording device in 1st Embodiment. 第１実施形態におけるデジタルコンテンツ再生録音装置によるデジタルコンテンツ再生録音方法のフロー図である。It is a flowchart of the digital content reproduction | regeneration recording method by the digital content reproduction | regeneration recording device in 1st Embodiment. フレーズの強調表示状態を示す説明図である。It is explanatory drawing which shows the highlight display state of a phrase. フレーズの表示時間の経過と強調表示状態の変遷を示す説明図である。It is explanatory drawing which shows transition of the display time of a phrase, and transition of a highlight state. 音声録音データ収集制御部による使用者の音読音声の録音のフローを示す説明図である。It is explanatory drawing which shows the flow of recording of the user's voice-reading voice by a voice recording data collection control part. 音声録音データ確認制御部による模範音声データと音声録音データの再生状況を示す説明図である。It is explanatory drawing which shows the reproduction | regeneration condition of model audio | voice data and audio | voice recording data by an audio | voice recording data confirmation control part. 第２実施形態におけるデジタルコンテンツ再生録音装置の概略説明図である。It is a schematic explanatory drawing of the digital content reproduction | regeneration recording device in 2nd Embodiment. 第３実施形態におけるデジタルコンテンツ再生録音装置の概略構成図である。It is a schematic block diagram of the digital content reproduction | regeneration recording device in 3rd Embodiment. 他の実施形態におけるデジタルコンテンツ再生録音装置の概略構成図である。It is a schematic block diagram of the digital content reproduction | regeneration recording device in other embodiment.

以下、本発明にかかるデジタルコンテンツ再生録音装置とデジタルコンテンツ再生録音方法の実施形態について図面に基づいて説明する。 Embodiments of a digital content playback / recording apparatus and a digital content playback / recording method according to the present invention will be described below with reference to the drawings.

（第１実施形態）
図１は本実施形態におけるデジタルコンテンツ再生録音装置の概略構成図であり、図２は本実施形態におけるデジタルコンテンツ再生録音装置によるデジタルコンテンツ再生録音方法のフロー図である。本実施形態においては図１に示すように、いわゆるタブレット端末やスレートパソコンと称されている電子機器によりデジタルコンテンツ再生録音装置１０が構成されている形態に基づいて説明する。デジタルコンテンツ再生録音装置１０のデジタルコンテンツＤＣは半導体等により構成される不揮発性メモリ代表される記憶部１２に記憶されている。デジタルコンテンツ再生録音装置１０がネットワーク接続部１４を有している場合には、ネットワーク接続部を介して接続されたネットワーク上に配設されている記憶部３０に格納されているデジタルコンテンツＤＣをダウンロードやストリーミング等により記憶部１２に恒久的または一時的に記憶させて用いることもできる。この場合の記憶部１２は揮発性メモリを用いることもできる。また、ネットワーク接続部１４の接続形態は有線接続・無線接続の種類は問わず、公知の接続形態を採用することができる。 (First embodiment)
FIG. 1 is a schematic configuration diagram of a digital content playback / recording apparatus according to the present embodiment, and FIG. 2 is a flowchart of a digital content playback / recording method performed by the digital content playback / recording apparatus according to the present embodiment. In the present embodiment, as shown in FIG. 1, a description will be given based on a form in which a digital content reproduction / recording apparatus 10 is configured by an electronic device called a so-called tablet terminal or slate personal computer. The digital content DC of the digital content playback / recording apparatus 10 is stored in a storage unit 12 represented by a non-volatile memory composed of a semiconductor or the like. When the digital content playback / recording apparatus 10 has the network connection unit 14, the digital content DC stored in the storage unit 30 disposed on the network connected via the network connection unit is downloaded. It can also be used by being stored permanently or temporarily in the storage unit 12 by streaming or the like. In this case, the storage unit 12 may be a volatile memory. Moreover, the connection form of the network connection unit 14 can employ a known connection form regardless of the type of wired connection or wireless connection.

ここで、デジタルコンテンツＤＣとは、テキストデータを含むデジタルデータを指すものとする。デジタルコンテンツＤＣの他の具体例としては、テキストデータおよびこのテキストデータに対応する単位フレーズ対応音声データを含むデータ構成の他に、テキストデータおよびこのテキストデータに対応する単位フレーズ対応音声データに加えて他の単位フレーズ対応音声データや画像データ等の各種データを含ませたデータ構成を採用することも可能である。 Here, the digital content DC refers to digital data including text data. As another specific example of the digital content DC, in addition to the text data and the unit phrase corresponding voice data corresponding to the text data, in addition to the data structure including the text data and the unit phrase corresponding voice data corresponding to the text data, It is also possible to adopt a data configuration including various data such as other unit phrase-corresponding audio data and image data.

記憶部１２に記憶されたデジタルコンテンツＤＣは、最初に動作制御部１８により読み出しされ（データ読み出し工程）る。読み出されたデジタルコンテンツＤＣは、単位フレーズ生成部１８Ａとしての動作制御部１８がデジタルコンテンツＤＣ内のテキストデータをフレーズごとに分割する処理を実行する（単位フレーズ生成工程）。なお、テキストデータを単位フレーズごとに分割処理する具体的な方法は、例えば、特開２０１６−１２０９９号公報に開示されているような公知の手法を採用することができるので、ここでの詳細な説明は省略する。 The digital content DC stored in the storage unit 12 is first read by the operation control unit 18 (data reading step). For the read digital content DC, the operation control unit 18 as the unit phrase generation unit 18A executes processing for dividing the text data in the digital content DC into phrases (unit phrase generation step). A specific method for dividing the text data into unit phrases can employ a known method as disclosed in, for example, Japanese Patent Application Laid-Open No. 2006-1299. Description is omitted.

ここで、単位フレーズとは、いわゆる分かち書きで区切ることができる単位の他、句点や読点で区切ることができる単位、または、段落で区切ることができる単位等、任意の区切れ単位で区切られた文章の要素、または文章、若しくは文のかたまりを単位フレーズとして取り扱うことができる。 Here, the unit phrase is a sentence that is delimited by any delimiter unit, such as a unit that can be delimited by so-called split writing, a unit that can be delimited by punctuation or punctuation, or a unit that can be delimited by paragraphs. Elements, sentences, or chunks of sentences can be handled as unit phrases.

単位フレーズ生成部１８Ａによりテキストデータを分割して得たそれぞれの単位フレーズＰは、分割された順番に分割単位フレーズ通し番号付与部１８Ｂである動作制御部１８により分割された単位フレーズＰの夫々に対して通し番号が付与された（通し番号付与工程）後に、単位フレーズＰと単位フレーズＰに対して付与された通し番号とが紐付けされた状態で記憶部１２に記憶される。 Each unit phrase P obtained by dividing the text data by the unit phrase generating unit 18A is divided into the unit phrases P divided by the operation control unit 18 which is the divided unit phrase serial number assigning unit 18B in the divided order. After the serial number is assigned (serial number assigning step), the unit phrase P and the serial number assigned to the unit phrase P are stored in the storage unit 12 in a linked state.

音声データ生成部１８Ｚである動作制御部１８は、単位フレーズＰに対応するテキストデータに基づいて音声合成により単位フレーズ対応音声データを生成する（単位フレーズ対応音声データ生成工程）。テキストデータに基づいた音声合成による単位フレーズ対応音声データの生成方法は公知のものを適用することができるため、ここでは音声合成についての詳細な説明は省略する。本実施形態においては音声データ生成部１８Ｚにより生成された単位フレーズ対応音声データを模範音声データＭＯＤとして単位フレーズＰに対して付与された通し番号と紐付けした状態で記憶部１２に記憶させている。なお、デジタルコンテンツＤＣにテキストデータに対応する音声データが含まれている場合には、単位フレーズ対応音声データ分割部としての動作制御部１８が単位フレーズＰに対応する単位フレーズ対応音声データを分割することで模範音声データＭＯＤを生成するようにしてもよい。すなわち、音声データ生成部１８Ｚによる単位フレーズ対応音声データ生成工程をスキップさせることになる。 The operation control unit 18 that is the voice data generation unit 18Z generates unit phrase-corresponding voice data by voice synthesis based on the text data corresponding to the unit phrase P (unit phrase-corresponding voice data generation step). Since a known method for generating unit phrase-corresponding speech data by speech synthesis based on text data can be applied, a detailed description of speech synthesis is omitted here. In the present embodiment, the unit phrase-corresponding voice data generated by the voice data generation unit 18Z is stored in the storage unit 12 in a state of being associated with the serial number assigned to the unit phrase P as the model voice data MOD. When the digital content DC includes audio data corresponding to the text data, the operation control unit 18 as the unit phrase corresponding audio data dividing unit divides the unit phrase corresponding audio data corresponding to the unit phrase P. Thus, the exemplary voice data MOD may be generated. That is, the unit phrase-corresponding voice data generation step by the voice data generation unit 18Z is skipped.

フレーズ表示時間規定部１８Ｃである動作制御部１８は、分割された単位フレーズＰに付与された通し番号順に、それぞれの単位フレーズＰ内における音素数（ここでは読み仮名の文字数を音素数としている）をカウントする。フレーズ表示時間規定部１８Ｃである動作制御部１８は、予め記憶部１２に記憶されているテキストデータの表示部１６Ａへの表示時間を規定するためのテキスト表示時間基本データＴＨＤとカウントした音素数との積を算出する。さらにフレーズ表示時間規定部１８Ｃである動作制御部１８は、算出した積を標準のフレーズ表示時間ＨＰＴとして（フレーズ表示時間規定工程）、単位フレーズＰの通し番号と紐付けした状態で記憶部１２に記憶させる処理を行う。このようにしてフレーズ表示時間規定部１８Ｃとしての動作制御部１８は、分割された単位フレーズＰの全てに対して標準のフレーズ表示時間ＨＰＴの算出を行う。 The operation control unit 18 which is the phrase display time defining unit 18C calculates the number of phonemes in each unit phrase P (here, the number of characters in the reading kana is used as the number of phonemes) in the order of the serial numbers given to the divided unit phrases P. Count. The operation control unit 18 serving as the phrase display time defining unit 18C includes the text display time basic data THD for defining the display time of the text data stored in the storage unit 12 in advance on the display unit 16A, and the counted phoneme number. The product of is calculated. Further, the operation control unit 18 which is the phrase display time defining unit 18C stores the calculated product as the standard phrase display time HPT (phrase display time defining step) in the storage unit 12 in a state linked to the serial number of the unit phrase P. To perform the process. In this way, the operation control unit 18 as the phrase display time defining unit 18C calculates the standard phrase display time HPT for all of the divided unit phrases P.

テキスト表示時間基本データＴＨＤは、読み仮名の文字にかかわらず１音素（読み仮名１文字）あたりにおける発音時間を規定するデータの他、読み仮名の文字に応じて異なる発音時間が紐付けされた読み仮名ごとの発音時間を規定するデータ等を採用することができる。読み仮名ごとの発音時間を規定するデータを採用した場合、フレーズ表示時間規定部１８Ｃである動作制御部１８は、単位フレーズＰ内における読み仮名の文字の種類と数をそれぞれカウントする処理を実行する。さらには、テキストデータに対応した音声データが記憶部１２に記憶されている場合には、音声データの全音素数に対する再生時間（音声データの長さ）と、単位フレーズＰに分割された後の音声データの音素数に対応する再生時間とを比率により、フレーズ表示時間規定部１８Ｃとしての動作制御部が算出するようにしてもよい。 The text display time basic data THD is data that specifies the pronunciation time per phoneme (one character of the reading) regardless of the character of the reading kana, as well as readings associated with different pronunciation times according to the characters of the reading kana. Data that defines the pronunciation time for each kana can be used. When data defining the pronunciation time for each reading kana is adopted, the operation control unit 18 which is the phrase display time defining unit 18C executes a process of counting the type and number of characters of the reading kana in the unit phrase P. . Furthermore, when the voice data corresponding to the text data is stored in the storage unit 12, the playback time (the length of the voice data) with respect to the total number of phonemes of the voice data and the voice after being divided into unit phrases P The operation control unit as the phrase display time defining unit 18C may calculate the reproduction time corresponding to the number of phonemes of the data based on the ratio.

フレーズ出力制御部１８Ｄとしての動作制御部１８は、分割された単位フレーズＰについて通し番号の順番に、通し番号に紐付けされた標準のフレーズ表示時間ＨＰＴの時間にわたって、それぞれの単位フレーズＰに対応するテキストデータを表示部１６Ａに、単位フレーズ対応音声データであるテキストデータの模範音声データＭＯＤを音声出力部１６Ｂにそれぞれ出力した後、同じ単位フレーズＰにおいて音声出力部１６Ｂへの音声データの出力は行わずに、表示部１６Ａに出力すべき単位フレーズＰに対応するテキストデータを標準のフレーズ表示時間ＨＰＴにわたって出力する処理を実行する（フレーズ出力制御工程）。 The operation control unit 18 as the phrase output control unit 18 </ b> D is a text corresponding to each unit phrase P over the standard phrase display time HPT associated with the serial numbers in the order of the serial numbers for the divided unit phrases P. After the data is output to the display unit 16A and the text data model audio data MOD that is unit phrase-corresponding audio data is output to the audio output unit 16B, the audio data is not output to the audio output unit 16B in the same unit phrase P. In addition, a process of outputting the text data corresponding to the unit phrase P to be output to the display unit 16A over the standard phrase display time HPT is performed (phrase output control step).

なお、単位フレーズＰにおけるテキストデータに対応する単位フレーズ対応音声データとしての模範音声データＭＯＤは、単位フレーズＰにおけるテキストデータを音声データ生成部１８Ｚとしての動作制御部１８が前述のようにＴＴＳ（ＴｅｘｔＴｏＳｐｅｅｃｈ）処理することにより生成する他、音読の指導者によるテキストデータの音読音声をマイク等に代表される音声記録部１７により録音する等して予め記憶部１２に模範音声データＭＯＤとして記憶させておくこともできる。 The model voice data MOD as unit phrase-corresponding voice data corresponding to the text data in the unit phrase P is obtained by using the text data in the unit phrase P by the operation control unit 18 as the voice data generation unit 18Z as described above. In addition to the generation by processing, the voice reading voice of the text data by the voice reading instructor is recorded by the voice recording unit 17 typified by a microphone or the like and stored in the storage unit 12 as the model voice data MOD in advance. You can also keep it.

このとき、表示部１６Ａに出力するテキストデータは、表示対象となっている通し番号が付与された単位フレーズＰだけでなく、表示対象となっている通し番号を含む所要範囲の単位フレーズＰに対応するテキストデータを同時に表示部１６Ａに表示させておくこともできる。この形態を採用した場合には、図３に示すように、表示対象となっている通し番号が紐付けされている単位フレーズＰに対応するテキストデータに対して色付網掛け処理や太文字表示等に代表される強調表示処理を標準のフレーズ表示時間ＨＰＴの時間にわたって行う（フレーズ強調表示工程）ようにしてもよい。 At this time, the text data output to the display unit 16A is not only the unit phrase P to which the serial number to be displayed is assigned, but also the text corresponding to the unit phrase P in the required range including the serial number to be displayed. Data can also be displayed on the display unit 16A at the same time. When this form is adopted, as shown in FIG. 3, for the text data corresponding to the unit phrase P to which the serial number to be displayed is linked, color shading processing, bold display, etc. The highlighting process represented by (1) may be performed over the standard phrase display time HPT (phrase highlighting process).

さらにフレーズ出力制御部１８Ｄとしての動作制御部１８は、図４に示すように、表示部１６Ａへの表示対象であり、第１の強調表示（網掛け処理）がなされている単位フレーズＰに対応するテキストデータに対して、テキストデータを太文字表示する第２の強調表示（他の強調表示）処理を同時に行うようにしてもよい。そしてこの場合において、フレーズ出力制御部１８Ｄとしての動作制御部１８は、標準のフレーズ表示時間ＨＰＴの時間経過と、第２の強調表示を施す範囲とを比例させる（図４内における矢印に示す表示状態の変遷）ようにしてもよい。なお、図４内に示す矢印は標準のフレーズ表示時間ＨＰＴの時間経過方向を示すものである。ここで、第１の強調表示と第２の強調表示を施す範囲は、図４に示す態様の他に、第１の強調表示と第２の強調表示を施す範囲を単位フレーズＰのブロック単位に設定することもできる。 Further, as shown in FIG. 4, the operation control unit 18 as the phrase output control unit 18D corresponds to the unit phrase P that is the display target on the display unit 16A and is subjected to the first highlight display (shading process). The second highlight display (other highlight display) processing for displaying the text data in bold characters may be simultaneously performed on the text data to be performed. In this case, the operation control unit 18 as the phrase output control unit 18D makes the time passage of the standard phrase display time HPT proportional to the range in which the second highlighting is performed (the display indicated by the arrow in FIG. 4). (Transition of state). In addition, the arrow shown in FIG. 4 shows the time passage direction of standard phrase display time HPT. Here, in addition to the mode shown in FIG. 4, the range in which the first highlight display and the second highlight display are performed is the range in which the first highlight display and the second highlight display are performed in block units of the unit phrase P. It can also be set.

また、フレーズ出力制御部１８Ｄとしての動作制御部１８は、データ入力部を介して入力された（または記憶部１２に予め記憶させておいた）フレーズ表示時間調整値に基づいて、標準のフレーズ表示時間ＨＰＴを伸縮させる処理を行う（フレーズ表示時間伸縮工程）機能も有している。 Further, the operation control unit 18 as the phrase output control unit 18D is configured to display a standard phrase based on the phrase display time adjustment value input via the data input unit (or stored in advance in the storage unit 12). It also has a function of performing a process of expanding and contracting the time HPT (phrase display time expanding and contracting step).

本実施形態においては、単位フレーズＰに対応するテキストデータと模範音声データＭＯＤを表示部１６Ａと音声出力部１６Ｂ（ここでは、表示部１６Ａと音声出力部１６Ｂとによりデータ出力部１６が構成されている）にそれぞれ出力した後、単位フレーズＰに対応するテキストデータのみを表示部１６Ａに出力させている間において、使用者に単位フレーズＰに対応するテキストデータの読み上げ（音読）の練習をさせることができる。 In the present embodiment, the text data corresponding to the unit phrase P and the model voice data MOD are displayed on the display unit 16A and the voice output unit 16B (here, the data output unit 16 is configured by the display unit 16A and the voice output unit 16B). To output the text data corresponding to the unit phrase P to the display unit 16A, while the text data corresponding to the unit phrase P is being output to the display unit 16A. Can do.

音声録音データ収集制御部１８Ｅとしての動作制御部１８は、図５に示すように使用者が音読練習をしている間に使用者の音読音声を録音する（音声録音データを記録する）ように音声記録部１７を作動させ、使用者がテキストデータを音読している際の音声データを録音して音声録音データＯＲＤとして収集する。このとき音声録音データ収集制御部１８Ｅは、単位フレーズＰの通し番号と使用者の音声録音データＯＲＤとを紐付けした状態で記憶部１２に記憶させる処理を実行する（音声録音データ収集制御工程）。 As shown in FIG. 5, the operation control unit 18 as the voice recording data collection control unit 18E records the user's reading voice (records voice recording data) while the user is practicing reading. The voice recording unit 17 is operated to record voice data when the user is reading text data aloud and collect it as voice recording data ORD. At this time, the voice recording data collection control unit 18E executes a process of storing the serial number of the unit phrase P and the voice recording data ORD of the user in the storage unit 12 in a linked state (voice recording data collection control step).

ここで、音声記録部１７による音声録音データの収集が標準のフレーズ表示時間ＨＰＴを超えても継続して行われている場合には、フレーズ出力制御部１８Ｄとしての動作制御部１８は、音声記録部１７による音声録音データの収集が終了するまでの間、表示部１６Ａへの単位フレーズＰに対応するテキストデータの表示を継続させるようにすることもできる。 If the voice recording data is continuously collected by the voice recording unit 17 even after the standard phrase display time HPT is exceeded, the operation control unit 18 as the phrase output control unit 18D performs voice recording. The display of the text data corresponding to the unit phrase P on the display unit 16A can be continued until the collection of the voice recording data by the unit 17 is completed.

音声録音データ収集制御部１８Ｅが音声録音データＯＲＤの収集を行った後、音声録音データ確認制御部１８Ｆとしての動作制御部１８は、図６に示すように表示部１６Ａと音声出力部１６Ｂに単位フレーズＰの通し番号順に単位フレーズＰのテキストデータと、テキストデータに対応する単位フレーズ対応音声データとしての模範音声データＭＯＤとを表示部１６Ａと音声出力部１６Ｂとにそれぞれ出力した後に続けて、使用者の音声録音データＯＲＤの音声出力部１６Ｂへの出力を行う（音声録音データ確認制御工程）。 After the voice recording data collection control unit 18E collects the voice recording data ORD, the operation control unit 18 as the voice recording data confirmation control unit 18F has unit units in the display unit 16A and the voice output unit 16B as shown in FIG. After the text data of the unit phrase P and the model voice data MOD as the unit phrase corresponding voice data corresponding to the text data are output to the display unit 16A and the voice output unit 16B, respectively, in order of the serial number of the phrase P, the user Is output to the audio output unit 16B (audio recording data confirmation control step).

ここで、音声録音データ確認制御部１８Ｆとしての動作制御部１８は、使用者の音声録音データＯＲＤを音声出力部１６Ｂに出力させる際において、表示部１６Ａに対して単位フレーズＰに対応するテキストデータの出力状態を停止した状態にすることもできるし、単位フレーズＰに対応するテキストデータの出力状態を維持させることもできる。また、複数の単位フレーズＰに対応するテキストデータを表示部１６Ａに出力させた後に、複数の単位フレーズＰに対応する音声録音データＯＲＤを音声出力部１６Ｂに出力させるようにしてもよい。このように表示部１６Ａおよび音声出力部１６Ｂに出力させる単位フレーズＰは単数であってもよいし複数であってもよい。 Here, the operation control unit 18 serving as the voice recording data confirmation control unit 18F outputs text data corresponding to the unit phrase P to the display unit 16A when outputting the voice recording data ORD of the user to the voice output unit 16B. The output state of the text data corresponding to the unit phrase P can be maintained. Further, after the text data corresponding to the plurality of unit phrases P is output to the display unit 16A, the voice recording data ORD corresponding to the plurality of unit phrases P may be output to the voice output unit 16B. Thus, the unit phrase P to be output to the display unit 16A and the audio output unit 16B may be singular or plural.

本実施形態におけるデジタルコンテンツ再生録音装置１０によれば、使用者にテキストデータの音読の練習を行わせるに先立って、使用者に模範音声データＭＯＤを聞かせた直後に単位フレーズＰごとに単位フレーズＰに対応するテキストデータを表示部１６Ａに表示させながら単位フレーズＰに対応するテキストデータを音読させることができる。これにより使用者は単位フレーズＰに対応するテキストデータの音読の練習を容易に行うことができる。 According to the digital content playback / recording apparatus 10 of the present embodiment, the unit phrase P for each unit phrase P immediately after letting the user hear the model voice data MOD prior to the user practicing reading the text data aloud. The text data corresponding to the unit phrase P can be read aloud while the text data corresponding to is displayed on the display unit 16A. As a result, the user can easily practice reading aloud text data corresponding to the unit phrase P.

さらには、単位フレーズＰごとに使用者の音読音声を音声録音データＯＲＤとして収集することができると共に、単位フレーズＰに対応するテキストデータと模範音声データＭＯＤを表示部１６Ａと音声出力部１６Ｂに出力した後に、これと同じ単位フレーズＰに対応するテキストデータと音声録音データＯＲＤを表示部１６Ａおよび音声出力部１６Ｂに出力させていることにより、使用者は模範音声データＭＯＤに対する自らの音読状態を確認しながら音読学習を進めることができる。 Furthermore, the user's reading voice can be collected as voice recording data ORD for each unit phrase P, and text data and model voice data MOD corresponding to the unit phrase P are output to the display unit 16A and the voice output unit 16B. After that, the text data and the voice recording data ORD corresponding to the same unit phrase P are output to the display unit 16A and the voice output unit 16B, so that the user confirms his / her reading state with respect to the model voice data MOD. You can continue reading aloud while reading.

また、音声出力部１６Ｂが複数配設されている場合においては、音声録音データ確認制御部１８Ｆは一方の音声出力部１６Ｂから単位フレーズＰごとに音読の手本となる模範音声データＭＯＤの出力を実行すると同時に、他方の音声出力部１６Ｂから一方の音声出力部１６Ｂに出力された単位フレーズＰに対応する音声録音データＯＲＤの出力を実行することもできる。このとき、単位フレーズＰに対応するテキストデータを表示部１６Ａに出力させてもよい。このように複数の音声出力部１６Ｂから模範音声データＭＯＤと音声録音データＯＲＤとを同時に出力することにより、音声録音データＯＲＤと模範音声データＭＯＤとの差異を明確にすることができる点において好都合である。 When a plurality of voice output units 16B are provided, the voice recording data confirmation control unit 18F outputs model voice data MOD, which serves as an example of reading aloud for each unit phrase P, from one voice output unit 16B. Simultaneously with the execution, output of the voice recording data ORD corresponding to the unit phrase P output from the other voice output unit 16B to the one voice output unit 16B can also be executed. At this time, text data corresponding to the unit phrase P may be output to the display unit 16A. Thus, it is advantageous in that the difference between the voice recording data ORD and the model voice data MOD can be clarified by simultaneously outputting the model voice data MOD and the voice recording data ORD from the plurality of voice output units 16B. is there.

（第２実施形態）
第１実施形態においては、デジタルコンテンツ再生録音装置１０の動作制御部１８が、デジタルコンテンツＤＣ内のテキストデータを単位フレーズＰごとに分割し、分割した単位フレーズＰの表示部１６Ａへの表示時間を規定する処理を実行している。これに対して本実施形態においては、図７に示すように使用するデジタルコンテンツＤＣがＤＡＩＳＹ規格のデータである場合について説明を行う。なお、本実施形態において第１実施形態と同様に用いることができる構成については、図面および明細書中において第１実施形態で用いた部材番号と同一の番号を付すことにより、ここでの詳細な説明は省略している。 (Second Embodiment)
In the first embodiment, the operation control unit 18 of the digital content reproduction / recording apparatus 10 divides the text data in the digital content DC for each unit phrase P, and sets the display time of the divided unit phrase P on the display unit 16A. The specified process is being executed. On the other hand, in this embodiment, the case where the digital content DC to be used is DAISY standard data as shown in FIG. 7 will be described. In addition, about the structure which can be used similarly to 1st Embodiment in this embodiment, it attaches | subjects the detailed number here by attaching | subjecting the same number as the member number used in 1st Embodiment in drawing and specification. The explanation is omitted.

ＤＡＩＳＹ規格のデジタルコンテンツＤＣは、コンテンツの再生に必要な情報である単位フレーズＰに付与されている通し番号と、それぞれの単位フレーズＰにおける開示時間と終了時間が記載されたｓｍｉｌファイルと、目次やページの移動の制御に関連した情報が記述されたｎｃｃファイルまたはｎｃｘファイルと、表示部１６Ａに表示可能なテキストデータを記述したｈｔｍｌファイルまたはｘｍｌファイルを備えている。 The digital content DC of the DAISY standard includes a serial number given to a unit phrase P that is information necessary for content reproduction, a smil file in which a disclosure time and an end time in each unit phrase P are described, a table of contents and a page An ncc file or ncx file in which information related to the movement control is described, and an html file or xml file in which text data that can be displayed on the display unit 16A are described.

このようにＤＡＩＳＹ規格のデジタルコンテンツＤＣを採用した場合には、動作制御部１８が記憶部１２から読み出したｓｍｉｌファイルとｎｃｃファイルに基づいた処理を実行することにより、単位フレーズ生成部１８Ａ、分割単位フレーズ通し番号付与部１８Ｂ、フレーズ表示時間規定部１８Ｃ、フレーズ出力制御部１８Ｄとしての機能を発揮することになる。 As described above, when the digital content DC of the DAISY standard is adopted, the operation control unit 18 executes processing based on the smil file and the ncc file read from the storage unit 12, thereby generating the unit phrase generation unit 18 </ b> A and the division unit. Functions as the phrase serial number assigning unit 18B, the phrase display time defining unit 18C, and the phrase output control unit 18D are exhibited.

本実施形態においても、表示部１６ＡにデジタルコンテンツＤＣ内のテキストデータを単位フレーズＰごとに表示している間に、動作制御部１８は音声録音データ収集制御部１８Ｅとして、使用者が音読をしている間に使用者の音読音声を録音するようにマイク等に代表される音声記録部１７を作動させ、使用者の音読音声を録音（音読音声録音工程）してもよい。音声録音データＯＲＤは第１実施形態と同様に単位フレーズＰの通し番号に紐付けされた状態で記憶部１２に記憶させておくことができる。 Also in the present embodiment, while the text data in the digital content DC is displayed for each unit phrase P on the display unit 16A, the operation control unit 18 functions as the voice recording data collection control unit 18E so that the user reads aloud. During this time, the voice recording unit 17 represented by a microphone or the like may be operated so as to record the user's voice reading voice, and the user's voice reading voice may be recorded (spoken voice recording step). The voice recording data ORD can be stored in the storage unit 12 in a state linked to the serial number of the unit phrase P as in the first embodiment.

また、動作制御部１８は音声録音データ確認制御部１８Ｆとして、表示部１６Ａと音声出力部１６Ｂに単位フレーズＰの通し番号順に単位フレーズＰのテキストデータとテキストデータの模範音声データＭＯＤとを表示部１６Ａと音声出力部１６Ｂとにそれぞれ出力した後に続けて使用者の音声録音データＯＲＤの音声出力部１６Ｂへの出力を行うようにしてもよい。 Further, the operation control unit 18 serves as the voice recording data confirmation control unit 18F, and displays the text data of the unit phrase P and the model voice data MOD of the text data on the display unit 16A and the voice output unit 16B in the order of the serial number of the unit phrase P. The audio recording data ORD of the user may be output to the audio output unit 16B after being output to the audio output unit 16B.

このように、デジタルコンテンツＤＣとしてＤＡＩＳＹ規格のデータを採用することにより、単位フレーズ生成部１８Ａによるテキストデータの単位フレーズ生成処理と、分割単位フレーズ通し番号付与部１８Ｂによる分割単位フレーズ通し番号付与処理と、フレーズ表示時間規定部１８Ｃによるフレーズ表示時間規定処理を大幅に軽減（または省略）することができる。これにより動作制御部１８が行うデジタルコンテンツＤＣのデータ処理負荷を軽減させることが可能になり、安価な構成のデジタルコンテンツ再生録音装置１０であっても円滑にデジタルコンテンツＤＣを再生することができる点において好都合である。 Thus, by adopting DAISY standard data as the digital content DC, the unit phrase generation process of the text data by the unit phrase generation unit 18A, the division unit phrase serial number assignment process by the division unit phrase serial number assignment unit 18B, the phrase The phrase display time defining process by the display time defining unit 18C can be greatly reduced (or omitted). As a result, the data processing load of the digital content DC performed by the operation control unit 18 can be reduced, and the digital content DC can be smoothly reproduced even by the digital content reproduction / recording apparatus 10 having an inexpensive configuration. Is convenient.

（第３実施形態）
デジタルコンテンツＤＣの中には、テキストデータのみの部分とテキストデータおよびテキストデータに対応した音声データを有する部分とが混在する場合がある。このような場合におけるデジタルコンテンツＤＣには、図８に示すように、記憶部１２にはデジタルコンテンツＤＣの構成内容が記述されたコンテンツ構成内容データＣＣＤが記憶されている。本実施形態においては、記憶部１２にコンテンツ構成内容データＣＣＤが記憶されているデジタルコンテンツ再生録音装置１０についての動作について説明をおこなう。 (Third embodiment)
In digital content DC, there may be a mixture of text data only and text data and a portion having audio data corresponding to the text data. In the digital content DC in such a case, as shown in FIG. 8, the storage unit 12 stores content configuration content data CCD in which the configuration content of the digital content DC is described. In the present embodiment, the operation of the digital content reproduction / recording apparatus 10 in which the content configuration content data CCD is stored in the storage unit 12 will be described.

動作内容切替制御部１８Ｇとしての動作制御部１８は、デジタルコンテンツＤＣに対するデータ処理を開始する際、デジタルコンテンツＤＣのコンテンツ構成内容データＣＣＤを記憶部１２から読み出しすると共にコンテンツ構成内容データＣＣＤの内容を参照する。そして、動作内容切替制御部１８Ｇとしての動作制御部１８は、コンテンツ構成内容データＣＣＤに記述されているデジタルコンテンツＤＣのデータ構成に応じて単位フレーズ生成部１８Ａおよび音声データ生成部１８Ｚの動作を切り替えする処理を実行する。具体的には、以下のとおりである。 When starting the data processing for the digital content DC, the operation control unit 18 as the operation content switching control unit 18G reads the content configuration content data CCD of the digital content DC from the storage unit 12 and the contents of the content configuration content data CCD. refer. Then, the operation control unit 18 as the operation content switching control unit 18G switches the operations of the unit phrase generation unit 18A and the audio data generation unit 18Z according to the data configuration of the digital content DC described in the content configuration content data CCD. Execute the process. Specifically, it is as follows.

動作内容切替制御部１８Ｇが、デジタルコンテンツＤＣにテキストデータに対応した音声データを含んでいないと判断した場合には、単位フレーズ生成部１８Ａに、テキストデータから単位フレーズＰを生成させる。また、動作内容切替制御部１８Ｇは、音声データ生成部１８Ｚに、単位フレーズＰに対応するテキストデータに基づいて音声合成により単位フレーズ対応音声データを生成させる（単位フレーズ対応音声データ生成工程）処理を実行させる。単位フレーズ対応音声データ生成工程より後のデータ処理については、第１実施形態におけるフレーズ表示時間規定工程以降のデータ処理が実行されることになるので、ここでの具体的な説明は省略する。 When the operation content switching control unit 18G determines that the digital content DC does not include audio data corresponding to the text data, the unit phrase generation unit 18A generates the unit phrase P from the text data. Further, the operation content switching control unit 18G causes the speech data generation unit 18Z to generate unit phrase-corresponding speech data by speech synthesis based on the text data corresponding to the unit phrase P (unit phrase-corresponding speech data generation step). Let it run. As for the data processing after the unit phrase-corresponding voice data generation step, the data processing after the phrase display time defining step in the first embodiment is executed, and therefore a specific description thereof is omitted here.

これに対して動作内容切替制御部１８Ｇが、デジタルコンテンツＤＣにテキストデータに対応した音声データを含んでいると判断した場合には、単位フレーズ生成部１８Ａに、テキストデータおよび音声データから単位フレーズＰを生成させる。また、動作内容切替制御部１８Ｇは、音声データ生成部１８Ｚの動作をスキップさせる（音声データ生成スキップ工程）処理を実行させる。そして音声データ生成スキップ工程より後のデータ処理については、第１実施形態のフレーズ表示時間規定工程以降のデータ処理が実行されることになるので、ここでの具体的な説明は省略する。 On the other hand, when the operation content switching control unit 18G determines that the digital content DC includes voice data corresponding to the text data, the unit phrase generation unit 18A sends the unit phrase P from the text data and the voice data. Is generated. In addition, the operation content switching control unit 18G performs a process of skipping the operation of the audio data generation unit 18Z (audio data generation skip step). As for the data processing after the audio data generation skip step, the data processing after the phrase display time defining step of the first embodiment is executed, and therefore a specific description thereof is omitted here.

このように、デジタルコンテンツＤＣのデータ構成内容に基づいて、デジタルコンテンツＤＣに含まれているデータを最大限利用したうえで、動作制御部１８による各種データ処理工程を実行することができる。これにより、デジタルコンテンツ再生録音装置１０のデータ処理負荷を軽減させることができ、短時間でデータ処理を実行させることが可能になる点において好都合である。 As described above, based on the data configuration content of the digital content DC, the data included in the digital content DC can be used to the maximum and various data processing steps by the operation control unit 18 can be executed. This is advantageous in that the data processing load of the digital content playback / recording apparatus 10 can be reduced and the data processing can be executed in a short time.

（他の実施形態）
使用者によるテキストデータの音読音声を録音する音声録音データ収集制御部１８Ｅは、音声出力部１６Ｂに単位フレーズ対応音声データの出力をせずに単位フレーズＰに対応するテキストデータを強調表示させた後、音声記録部１７に入力された最初の音声を音声録音データ収集開始自動トリガーとして、音声記録部１７に音声録音データＯＲＤを収集させる処理を実行してもよい。また、この音声録音データ収集開始自動トリガーに続けて、または音声録音データ収集開始自動トリガーとは独立に、音声記録部１７による音声録音データＯＲＤの収集処理を開始した後、音声記録部１７への音声の無入力状態が予め記憶部１２に記憶されている所要時間（無入力状態継続時間）にわたって継続したときには、これをもって音声録音データ収集停止自動トリガーとし、音声録音データ収集制御部１８Ｅに音声録音データＯＲＤの収集を停止させる処理を実行させるようにしてもよい。 (Other embodiments)
The voice recording data collection control unit 18E that records the voice reading of the text data by the user highlights the text data corresponding to the unit phrase P without outputting the unit phrase corresponding voice data to the voice output unit 16B. The voice recording unit 17 may collect the voice recording data ORD using the first voice input to the voice recording unit 17 as a voice recording data collection start automatic trigger. Further, after the voice recording data collection start automatic trigger is started, or independently of the voice recording data collection start automatic trigger, the voice recording data ORD is started to be collected by the voice recording unit 17, and then the voice recording data 17 is sent to the voice recording unit 17. When the voice non-input state continues for the required time (no-input state duration) stored in the storage unit 12 in advance, this is used as a voice recording data collection stop automatic trigger, and the voice recording data collection control unit 18E performs voice recording. You may make it perform the process which stops collection of data ORD.

さらには、音声録音データ収集停止自動トリガーにより音声記録部１７が記録した音声録音データＯＲＤに対して、音声録音データ収集制御部１８Ｅとしての動作制御部１８は、録音直後のオリジナル音声録音データの終端位置から記憶部１２に予め記憶されている無入力継続時間の範囲までさかのぼった位置までの間を削除（トリミング）して得たデータを音声録音データＯＲＤとして記憶部１２に記憶させるようにしてもよい。これによれば、自動トリガーにより音声録音データＯＲＤを収集した場合であっても、無音部分のない音声録音データＯＲＤを記憶部１２に記憶させることができる点で好都合である。 Furthermore, for the voice recording data ORD recorded by the voice recording unit 17 by the voice recording data collection stop automatic trigger, the operation control unit 18 as the voice recording data collection control unit 18E terminates the original voice recording data immediately after recording. Data obtained by deleting (trimming) data from the position up to the position of the no-input duration time stored in advance in the storage unit 12 may be stored in the storage unit 12 as voice recording data ORD. Good. This is advantageous in that the voice recording data ORD having no silence can be stored in the storage unit 12 even when the voice recording data ORD is collected by an automatic trigger.

また、図９に示すように、デジタルコンテンツ再生録音装置１０の表示部１６Ａに音声録音データ収集操作部としてのトリガーボタン１９を配設してもよい。使用者によるトリガーボタン１９の操作状態を動作制御部１８が検出すると、音声録音データ収集制御部１８Ｅが音声記録部１７による音声録音データＯＲＤの収集の開始および停止させる処理を実行するようにしてもよい。このようにいわゆる手動トリガーによる音声録音データＯＲＤの収集および音声録音データＯＲＤの記憶部１２への記憶処理の開始および停止を使用者の任意のタイミングで実行させることが可能になる点で好都合である。なお、ここでは表示部１６Ａにソフトウェア上のトリガーボタン１９が配設された形態を示しているが、デジタルコンテンツ再生録音装置１０の専用品を採用した場合、専用品の本体に対して物理的なトリガーボタン１９を配設してもよい。 Further, as shown in FIG. 9, a trigger button 19 as an audio recording data collection operation unit may be arranged on the display unit 16A of the digital content reproduction / recording apparatus 10. When the operation control unit 18 detects the operation state of the trigger button 19 by the user, the voice recording data collection control unit 18E may execute a process of starting and stopping the voice recording data ORD collection by the voice recording unit 17. Good. As described above, it is advantageous in that the voice recording data ORD can be collected by a so-called manual trigger and the voice recording data ORD can be started and stopped at any timing of the user. . Here, a form in which a trigger button 19 on software is arranged on the display unit 16A is shown. However, when a dedicated product for the digital content playback / recording apparatus 10 is employed, the physical body of the dedicated product is physically separated. A trigger button 19 may be provided.

また、記憶部１２に音声録音データＯＲＤを記憶させる際においては、特定の単位フレーズＰの通し番号に対応する音声録音データＯＲＤを複数記憶させるようにしてもよい。単位フレーズＰに対して複数の音声録音データＯＲＤを記憶させるときには、音声録音データＯＲＤに時系列データを紐付けした状態で記憶部１２に記憶させておくことが好ましい。使用者または指導者が記憶部１２に記憶されている特定の単位フレーズＰの通し番号に対応する複数の音声録音データＯＲＤから任意の音声録音データＯＲＤを複数選択した場合、音声録音データ確認制御部１８Ｆは、記憶部１２から使用者または指導者により選択された音声録音データＯＲＤを読み出すと共に、時系列順に音声出力部１６Ｂに再生させるようにしてもよい。これにより、使用者の経時的なテキストデータの（文字の）音読習熟度の状態（音読学習の効果）を確認することができる。 Further, when the voice recording data ORD is stored in the storage unit 12, a plurality of voice recording data ORD corresponding to the serial number of a specific unit phrase P may be stored. When storing a plurality of voice recording data ORD for the unit phrase P, it is preferable to store the voice recording data ORD in the storage unit 12 in a state where time series data is linked to the voice recording data ORD. When the user or the instructor selects a plurality of arbitrary voice recording data ORD from a plurality of voice recording data ORD corresponding to the serial number of the specific unit phrase P stored in the storage unit 12, the voice recording data confirmation control unit 18F The voice recording data ORD selected by the user or the instructor may be read from the storage unit 12 and may be reproduced by the voice output unit 16B in chronological order. As a result, it is possible to confirm the state of reading proficiency (of the character) of the text data over time (the effect of learning of reading aloud).

また、予め模範音声データＭＯＤと音声録音データＯＲＤを音声出力部１６Ｂに出力させる際の単位フレーズＰの単位出力数を入力手段により入力または記憶部１２に記憶させておき、使用者や指導者により入力または記憶部１２に記憶されている単位出力数ごとに単位フレーズＰの模範音声データＭＯＤと音声録音データＯＲＤを音声出力部１６Ｂにそれぞれ出力させるようにしてもよい。このようにすることで、使用者の音読習熟度に応じた音読学習をすることができる点において好都合である。 In addition, the number of unit outputs of the unit phrase P when outputting the model voice data MOD and the voice recording data ORD to the voice output unit 16B is input by the input means or stored in the storage unit 12, and the user or the instructor For each unit output number stored in the input or storage unit 12, the model voice data MOD and the voice recording data ORD of the unit phrase P may be output to the voice output unit 16B. By doing in this way, it is convenient in the point that the reading aloud according to a user's reading aloud proficiency can be performed.

また、表示部１６Ａにまたはデジタルコンテンツ再生録音装置１０の専用品の本体に、比較再生開始ボタンを表示または配設させ、使用者が比較再生開始ボタンを操作した際に、動作制御部１８は次のような動作を実行させるようにしてもよい。すなわち、前回の比較再生開始ボタンが操作されてから今回の比較再生開始ボタンが操作されたまでの間にある単位フレーズＰを対象にして、模範音声データＭＯＤと音声録音データＯＲＤを音声出力部１６Ｂに出力させる動作処理である。 Further, when the comparison playback start button is displayed or arranged on the display unit 16A or the dedicated body of the digital content playback / recording apparatus 10, and the user operates the comparison playback start button, the operation control unit 18 Such an operation may be executed. In other words, the model audio data MOD and the audio recording data ORD are converted into the audio output unit 16B for the unit phrase P between the time when the previous comparative reproduction start button is operated and the time when the current comparative reproduction start button is operated. This is an operation process to be output.

また、以上の実施形態においては、分割した単位フレーズＰに対応するテキストデータはデジタルコンテンツＤＣの一部のテキストデータを表示部１６Ａに表示する形態について説明しているが、デジタルコンテンツＤＣのテキストデータの全部を表示部１６Ａに表示させることも可能である。 In the above embodiment, the text data corresponding to the divided unit phrase P is described as a form in which a part of the text data of the digital content DC is displayed on the display unit 16A. However, the text data of the digital content DC is described. Can be displayed on the display unit 16A.

また、音声録音データＯＲＤを音声出力部１６Ｂに出力する際には、単位フレーズＰの通し番号に対応するテキストデータを表示部１６Ａに、これに対応する模範音声データＭＯＤを音声出力部１６Ｂに出力した後に行われているが、模範音声データＭＯＤの音声出力部１６Ｂへの出力をせずに音声録音データＯＲＤの音声出力部１６Ｂへの出力を行うこともできる。また、単位フレーズＰの通し番号に対応するテキストデータの表示部１６Ａへの出力と、これに対応する模範音声データＭＯＤの音声出力部１６Ｂへの出力をせず、音声録音データＯＲＤのみを音声出力部１６Ｂに出力するようにしてもよい。 When outputting the voice recording data ORD to the voice output unit 16B, the text data corresponding to the serial number of the unit phrase P is output to the display unit 16A, and the corresponding model voice data MOD is output to the voice output unit 16B. As described later, the audio recording data ORD can be output to the audio output unit 16B without outputting the exemplary audio data MOD to the audio output unit 16B. Further, the text data corresponding to the serial number of the unit phrase P is not output to the display unit 16A, and the corresponding model audio data MOD is not output to the audio output unit 16B, and only the audio recording data ORD is output to the audio output unit. You may make it output to 16B.

また、記憶部１２には模範音声データＭＯＤを音声出力部１６Ｂに出力させる際における音声の特性が異なる複数の音声特性データを予め記憶させておいてもよい。このような音声特性データを備えた構成により、使用者がデータ入力手段を介して任意の音声特性データを選択すると、音声録音データ確認制御部１８Ｆである動作制御部１８が選択された音声特性データに基づいた模範音声データＭＯＤを音声出力部１６Ｂに出力する。このような構成により使用者の好みに応じた模範音声データＭＯＤを音声出力部１６Ｂに出力させることができるから、音読学習の効率を期待することができる。 Further, the storage unit 12 may store in advance a plurality of audio characteristic data having different audio characteristics when the exemplary audio data MOD is output to the audio output unit 16B. With the configuration including such voice characteristic data, when the user selects arbitrary voice characteristic data via the data input means, the voice control data selected by the operation control unit 18 which is the voice recording data confirmation control unit 18F is selected. Is output to the audio output unit 16B. With such a configuration, the model voice data MOD according to the user's preference can be output to the voice output unit 16B, so that the efficiency of reading aloud can be expected.

以上に実施形態に基づいて本発明について詳細に説明をしたが、本発明の技術的範囲は以上の実施形態に限定されるものではない。例えば、以上の実施形態においては、デジタルコンテンツ再生録音装置１０としてタブレット端末やスレートパソコンを用いた構成について説明しているが、デスクトップパソコンやノートパソコン等のコンピュータを用いた構成であってもよい。このとき、コンピュータの記憶部に本発明にかかるデジタルコンテンツ再生録音方法を実行するデジタルコンテンツ再生録音プログラムを予めインストールしておき、コンピュータの動作制御部にデジタルコンテンツ再生録音プログラムに基づいたデジタルコンテンツの再生録音制御を実行させるようにすればよい。 Although the present invention has been described in detail above based on the embodiments, the technical scope of the present invention is not limited to the above embodiments. For example, in the above embodiment, a configuration using a tablet terminal or a slate personal computer as the digital content playback / recording apparatus 10 has been described, but a configuration using a computer such as a desktop personal computer or a notebook personal computer may be used. At this time, a digital content reproduction / recording program for executing the digital content reproduction / recording method according to the present invention is installed in advance in the storage unit of the computer, and the digital content reproduction based on the digital content reproduction / recording program is installed in the operation control unit of the computer. Recording control may be executed.

本発明は、テキストデータまたはテキストデータおよび単位フレーズ対応音声データを含むデジタルコンテンツＤＣにおいて、テキストデータを単位フレーズＰごとに分割し、分割した単位フレーズＰに対応するテキストデータおよびテキストデータに対応する単位フレーズ対応音声データを模範音声データＭＯＤとしてデータ出力部１６に手本として出力し、手本が出力された直後に使用者の音読音声を音声録音データＯＲＤとして収集した後に、音声録音データＯＲＤを音声出力部１６Ｂに出力することを最大の特徴としている。よって、単位フレーズＰごとに分割する具体的な手法や分割した単位フレーズＰごとの単位フレーズ対応音声データまたは単位フレーズ対応音声データ（模範音声データＭＯＤ）および音声録音データＯＲＤの具体的な再生方法については特に限定されるものではない。 In the digital content DC including text data or text data and unit phrase-corresponding audio data, the present invention divides the text data for each unit phrase P, and the unit corresponding to the text data and the text data corresponding to the divided unit phrase P. The phrase-corresponding voice data is output as a model voice data MOD as a model to the data output unit 16, and immediately after the model is output, the user's voice reading voice is collected as the voice recording data ORD. The greatest feature is to output to the output unit 16B. Therefore, a specific method for dividing each unit phrase P and a specific reproduction method for unit phrase-corresponding voice data or unit phrase-corresponding voice data (exemplary voice data MOD) and voice recording data ORD for each divided unit phrase P Is not particularly limited.

そして、明細書中において説明したそれぞれの実施形態と各種の変形例を適宜組み合わせた形態を採用することも可能である。 And it is also possible to employ | adopt the form which combined each embodiment demonstrated in the specification and various modifications suitably.

１０デジタルコンテンツ再生録音装置
１２記憶部
１４ネットワーク接続部
１６データ出力部
１６Ａ表示部
１６Ｂ音声出力部
１７音声記録部
１８動作制御部
１８Ａ単位フレーズ生成部
１８Ｂ分割単位フレーズ通し番号付与部
１８Ｃフレーズ表示時間規定部
１８Ｄフレーズ出力制御部
１８Ｅ音声録音データ収集制御部
１８Ｆ音声録音データ確認制御部
１８Ｇ動作内容切替制御部
１８Ｚ音声データ生成部
１９トリガーボタン
３０ネットワーク上に配設されている記憶部
ＣＣＤコンテンツ構成内容データ
ＤＣデジタルコンテンツ
ＨＰＴ標準のフレーズ表示時間
ＴＨＤテキスト表示時間基本データ
ＭＯＤ模範音声データ
ＯＲＤ音声録音データ
Ｐ単位フレーズ DESCRIPTION OF SYMBOLS 10 Digital content reproduction | regeneration recording / recording apparatus 12 Memory | storage part 14 Network connection part 16 Data output part 16A Display part 16B Audio | voice output part 17 Audio | voice recording part 18 Operation control part 18A Unit phrase production | generation part 18B Division | segmentation unit phrase serial number assignment part 18C Phrase display time prescription | regulation part 18D Phrase output control unit 18E Audio recording data collection control unit 18F Audio recording data confirmation control unit 18G Operation content switching control unit 18Z Audio data generation unit 19 Trigger button 30 Storage unit CCD content configuration content data DC arranged on the network Digital content HPT Standard phrase display time THD Text display time Basic data MOD Model voice data ORD Voice recording data P Unit phrase

Claims

A storage unit in which digital content including text data is stored;
A display for displaying all or part of the text data;
A unit phrase generator for generating a unit phrase from the text data;
A voice data generation unit that generates voice data by voice synthesis based on the text data corresponding to the unit phrase;
An audio output unit;
A voice recording unit for recording a voice read out by the user as the voice data as voice recording data;
A phrase display time defining unit for defining a phrase display time for displaying the text data corresponding to the unit phrase on the display unit based on the length of the audio data corresponding to the unit phrase;
A phrase output control unit that causes the display unit to highlight the text data corresponding to the unit phrase without outputting the audio data to the audio output unit;
Collecting the voice recording data by the voice recording unit after the text data corresponding to the unit phrase is highlighted on the display unit for the phrase display time without outputting the voice data to the voice output unit And a voice recording data collection control unit for storing the collected voice recording data in the storage unit,
The digital content is output, for each unit phrase, the text data and the audio data corresponding to the unit phrase to the display unit and the audio output unit, and the audio recording data corresponding to the unit phrase is output to the audio A voice recording data confirmation control unit to be output to the output unit;
A digital content playback / recording apparatus comprising:

A storage unit for storing digital data including text data and audio data corresponding to the text data;
A display for displaying all or part of the text data;
A unit phrase generator for generating a unit phrase from the text data and the voice data;
An audio output unit;
A voice recording unit for recording a voice read out by the user as the voice data as voice recording data;
A phrase display time defining unit for defining a phrase display time for displaying the text data corresponding to the unit phrase on the display unit based on the length of the audio data corresponding to the unit phrase;
A phrase output control unit that causes the display unit to highlight the text data corresponding to the unit phrase without outputting the audio data to the audio output unit;
Collecting the voice recording data by the voice recording unit after the text data corresponding to the unit phrase is highlighted on the display unit for the phrase display time without outputting the voice data to the voice output unit And a voice recording data collection control unit for storing the collected voice recording data in the storage unit,
The digital content is output, for each unit phrase, the text data and the audio data corresponding to the unit phrase to the display unit and the audio output unit, and the audio recording data corresponding to the unit phrase is output to the audio A voice recording data confirmation control unit to be output to the output unit;
A digital content playback / recording apparatus comprising:

A storage unit storing text data or digital content including text data and audio data corresponding to the text data; and content configuration content data in which the configuration content of the digital content is described;
A display for displaying all or part of the text data;
A unit phrase generator for generating a unit phrase from the text data or the text data and the audio data;
A voice data generation unit that generates the voice data by voice synthesis based on the text data corresponding to the unit phrase;
An audio output unit;
A voice recording unit for recording a voice read out by the user as the voice data as voice recording data;
A phrase display time defining unit for defining a phrase display time for displaying the text data corresponding to the unit phrase on the display unit based on the length of the audio data corresponding to the unit phrase;
A phrase output control unit that causes the display unit to highlight the text data corresponding to the unit phrase without outputting the audio data to the audio output unit;
Collecting the voice recording data by the voice recording unit after the text data corresponding to the unit phrase is highlighted on the display unit for the phrase display time without outputting the voice data to the voice output unit And a voice recording data collection control unit for storing the collected voice recording data in the storage unit,
The digital content is output, for each unit phrase, the text data and the audio data corresponding to the unit phrase to the display unit and the audio output unit, and the audio recording data corresponding to the unit phrase is output to the audio A voice recording data confirmation control unit to be output to the output unit;
An operation content switching control unit that switches operation content between the unit phrase generation unit and the phrase display time defining unit based on the content configuration content data,
The operation content switching control unit refers to the content configuration content data,
When it is determined that the digital content does not include audio data corresponding to the text data, the unit phrase generation unit generates the unit phrase from the text data, and the audio data generation unit Performing a process of generating the speech data by speech synthesis based on the text data corresponding to a phrase;
If it is determined that the digital content includes audio data corresponding to the text data, the unit phrase generator generates the unit phrase from the text data and the audio data, and the audio data generator A digital content playback / recording apparatus characterized by executing a process of skipping the operation of.

The phrase output control unit
The audio data corresponding to the unit phrase is output to the audio output unit over the phrase display time, and the text data corresponding to the unit phrase is highlighted on the display unit, and then the audio output unit 4. The text data corresponding to the unit phrase is highlighted on the display unit over the phrase display time without outputting voice data. Digital content playback and recording device.

The phrase output control unit
Even if the phrase display time is exceeded, if the voice recording data is continuously collected by the voice recording unit, the voice recording data is not collected by the voice recording unit until the voice recording data is collected. 5. The digital content reproduction / recording apparatus according to claim 1, wherein the display of the text data corresponding to the unit phrase is continued.

The voice recording data collection control unit highlights the text data corresponding to the unit phrase without outputting the voice data to the voice output unit, and then first inputs the voice recording unit. 6. The digital content reproduction / recording apparatus according to claim 1, wherein the voice recording unit collects the voice recording data using voice as a trigger.

The voice recording data collection control unit triggers the voice recording unit to collect the voice recording data, and then the voice recording state to the voice recording unit continues for a required time after executing the process of collecting the voice recording data. The digital content reproduction / recording apparatus according to any one of claims 1 to 6, wherein a process of causing the recording unit to stop collecting the voice recording data is executed.

The voice recording data collection control unit deletes a time range in which no voice input state from the end of the voice recording data continues to the voice recording unit for the voice recording data collected by the voice recording unit The digital content reproduction / recording apparatus according to claim 7, wherein the processing is performed.

The digital content playback / recording apparatus is provided with a voice recording data collection operation unit,
2. The voice recording data collection control unit executes a process of recording the voice recording data in the voice recording unit, triggered by an operation state of the voice recording data collection operation unit by a user. The digital content reproduction / recording apparatus according to any one of?

The voice recording data confirmation control unit outputs the text data and the voice data corresponding to the unit phrase for each unit phrase to the display unit and the voice output unit, and then outputs the unit phrase to the display unit. 10. The voice output data corresponding to the unit phrase is output to the voice output unit in a state where the output state of the text data corresponding to is maintained. The digital content playback / recording apparatus described.

The voice recording data confirmation control unit outputs the voice recording data corresponding to the unit phrase to the voice output unit after outputting the text data corresponding to the unit phrase to the display unit for each unit phrase. The digital content reproduction / recording apparatus according to any one of claims 1 to 10, wherein

The storage unit stores a plurality of the voice recording data corresponding to the unit phrases,
The voice recording data confirmation control unit reads the voice recording data arbitrarily selected by the user from a plurality of the voice recording data corresponding to the specific unit phrase from the storage unit, and The digital content reproduction / recording apparatus according to claim 1, wherein a process of outputting to an output unit is executed.

The storage unit stores in advance a plurality of audio characteristic data for defining audio characteristics when the audio data is output to the audio output unit,
The voice recording data confirmation control unit causes the voice output unit to output the voice data based on the voice characteristic data selected by a user via a data input unit. The digital content reproduction recording device according to any one of the above.