JP2004341811A

JP2004341811A - Information transmitting method, device, and program

Info

Publication number: JP2004341811A
Application number: JP2003137354A
Authority: JP
Inventors: Yumi Tomioka; 由実富岡; Kota Hidaka; 浩太日高; Junji Takeuchi; 順二竹内; Shinya Nakajima; 信弥中嶌
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2003-05-15
Filing date: 2003-05-15
Publication date: 2004-12-02
Anticipated expiration: 2023-05-15
Also published as: JP4027840B2

Abstract

<P>PROBLEM TO BE SOLVED: To draw attention to information or impress information to enable the passive receipt of the information regardless of a delivery medium of information. <P>SOLUTION: A substantial character takes an action such as singing a song of text information of a content, speaking or waving the hand when it enters or exits. After the entering operation is ended, information is transmitted while cooperating the substantial character with an image character to communicate therewith. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は情報を表示する装置に利用する。特に、情報の視聴者に対する情報への注意喚起および情報を印象付ける技術に関する。
【０００２】
【従来の技術】
テレビ、ラジオ、街頭広告等のメディアでは、情報への注意の喚起や、情報を印象付けるための方法として、人気の高い芸能人を起用する、人気の高いデザイナーによるデザインを用いる、効果音を付加する、定評のある制作者・編集者によりコンテンツを制作する、等の方法を用いている。
【０００３】
パソコンを用いて情報を得ようとする場合は、ユーザはアクティブに情報を検索する必要がある。しかし一部、インターネット上の情報を閲覧する際に用いるブラウザとして、エージェントとして合成音声によりキャラクタが喋ったり、ユーザが音声入力をする等の、音声インタフェースを提供する技術が知られている。
【０００４】
合成音声の生成法として、例えば、特許文献１の音声合成方法等が開示されている。しかし、従来の合成音声では、人間に近い自然な音声とは必ずしもいえないという問題があり、その問題を解決するためのインタフェースとして、画面にエージェントを表示する方法が行われている。
【０００５】
音声合成の一つの応用例として、歌声を合成する歌声メッセージ生成・配信方法およびその装置（例えば、特許文献２参照）があり、利用者が希望する声質の歌声を希望する曲に希望する歌詞で合成し、配信する方法が行われている。
【０００６】
注目を集める表示装置として、例えば、特許文献３の壁面移動パフォーマンス装置があり、これは壁面を自在に動き回り、衆目をひきつける効果が高い壁面移動パフォーマンス装置を提供している。
【０００７】
広告配信方法として、例えば、特許文献４の電子広告システムがあり、広告端末に表示された広告を実際目にしている広告閲覧者のみが、広告表示端末に表示される広告データを変更可能にすることで広告閲覧者の広告への関心を高めるために有効な電子広告システムを提供している。
【０００８】
【特許文献１】
特開平６−３０８９９７号公報
【特許文献２】
特開２００２−１３２２８１号公報
【特許文献３】
特開平７−１４０９１９号公報
【特許文献４】
特開２００２−１５０１３０号公報
【特許文献５】
特開平１１−２０２８８４号公報
【特許文献６】
特開平１０−１３９３２３号公報
【０００９】
【発明が解決しようとする課題】
テレビ、ラジオ、街頭広告、パソコン等の情報発信装置および方法において、視聴者に対して、情報への注意を喚起し、さらにその情報を印象付けるという効果の向上は、これまで人手をかけてコンテンツの制作者や編集者の経験に頼り、行われてきた。そのため、時間やコスト等がかかるという問題があった。
【００１０】
本発明では観点を変え、コンテンツの制作や編集そのものではなく、キャラクタの動作により情報への喚起や情報を印象付ける等の効果を向上させ、新規に作成されたコンテンツはもちろん、これまであったコンテンツにおいても適応の範囲を広げ、情報配信等の付加価値の高いサービスを提供することを目的とする。
【００１１】
また、パソコンを用いて情報を得ようとする場合は、ユーザはアクティブに情報を検索する必要があり、受動的に情報を受け取ることは難しい。そのため、本技術では情報の配信媒体を問わず、受動的に情報を受け取ることを可能にすることを目的とする。
【００１２】
さらに、これまで不自然であるとされてきた合成音声に関して、実体のあるキャラクタから音声を出力し、あたかも実体キャラクタそのものが喋っているかのようにみせることによって、より自然にみせることを目的とする。
【００１３】
【課題を解決するための手段】
テレビ、ラジオ、街頭広告、パソコン等、情報を発信する装置や、そこでの情報発信方法は幅広く知られている。このような情報発信装置および方法において、視聴者に対して、情報への注意を喚起し、さらにその情報を印象付けることは重要な課題となる。また、パソコンを用いて情報を得ようとする場合は、ユーザはアクティブに情報を検索する必要がある。
【００１４】
本発明は、このような情報発信装置および方法における、視聴者への注目喚起効果の向上および印象の残存性の向上または情報の受動的な取得を可能にする配信の充実を図った情報発信の装置および方法に関する。
【００１５】
上記目的を達成するための手段として、実体キャラクタと画像キャラクタとを用いる。実体キャラクタが登場や退場の動作をしたり、画像キャラクタが登場や退場の動作を行う。
【００１６】
実体キャラクタが登場退場する際、コンテンツのテキスト情報を歌にしたものを歌ったり、喋ったり、手を振る等の動作をする。登場動作終了後は、実体キャラクタと画像キャラクタとを連携させコミュニケーションをとったりしながら情報を発信する。
【００１７】
これまで不自然であるとされてきた合成音声に関して、実体のあるキャラクタから音声を出力し、あたかも実体キャラクタそのものが喋っているかのように見せ、その実体キャラクタと画像キャラクタとを連携させることによって、これまであった画像キャラクタによる合成音声の読み上げ方法を、より自然に見せる。
【００１８】
すなわち、これまでエージェントが行って来た情報の発信方法に加え、実体キャラクタが登場退場の動作を行い、その際、歌ったり喋ったり、手を振る等の動作をすることにより、視覚的効果を高め、情報への注意を喚起することが可能となる。
【００１９】
実体キャラクタが登場退場する際、コンテンツのテキスト情報を歌にしたものを歌ったり、喋ったり、手を振る等の動作をすることにより、さらに聴覚・視覚的効果を高め、注意を喚起すると同時に情報の内容を印象付けることが可能となる。また、歌ったり喋ったりする方法を用いることにより、音声で情報が発信されるため、視聴者は、テキスト情報を目で追う必要が無くなり、受動的に情報を受け取ることが可能となる。さらに、登場動作終了後は、実体キャラクタと画像キャラクタとを連携させ、より面白い内容となるようにし、情報の内容を印象付けることが可能となる。
【００２０】
これまで不自然であるとされてきた合成音声に関して、実体のあるキャラクタから音声を出力し、あたかも実体キャラクタそのものが喋っているかのように見せ、その実体キャラクタと画像キャラクタとを連携させることによって、これまであった画像キャラクタによる合成音声の読み上げ方法を、より自然に見せることが可能となる。
【００２１】
本発明は情報管理装置からの通知により新着情報を検知した実体キャラクタ制御装置からの信号によって登場し、さらに情報を発信した後退場するという情報の配信方法であり、このように各情報毎に登場したり退場したりすることによって、一件一件の情報への注意を引きつけるという作用を有する。
【００２２】
本発明は、キャラクタが情報を読み上げる情報発信手段であるが、このように情報を発信することにより、例えば、Ｗｅｂブラウザを用いた場合のように、これまでアクティブに収集し取得してきた情報でも、受動的に受け取ることが出来る環境を提供するという作用を有する。
【００２３】
本発明はキャラクタが情報を歌として歌う情報発信手段であるが、その発信する情報の内容によって、スポーツニュースのように歌にするとより楽しめるものもあれば、歌にするのは適切ではないと思われる事故のニュースなどがあり、それらの情報の特徴に合わせて情報発信の手段を選択し、情報をより的確に伝えるという効果を有する。
【００２４】
すなわち、本発明の第一の観点は、情報発信方法であって、本発明の特徴とするところは、情報を収集するステップと、この収集した情報から当該情報のタイトルおよび内容を表す単語または文章を選択するステップと、この選択された単語または文章に基づき当該情報のジャンルを決定するステップと、この決定されたジャンルに基づきテキスト原稿作成用文章例を選択するステップと、この選択された前記テキスト原稿作成用文章例に基づき前記単語または文章を選択するステップにより選択された前記単語または文章の並べ替えを行うステップと、この並べ替えられた前記単語または文章を連結してテキスト原稿を作成するステップと、この作成されたテキスト原稿を音声により読み上げるステップと、この読み上げに反応してキャラクタを動作させるステップとを実行するところにある（請求項１）。
【００２５】
また、前記キャラクタを動作させるステップは、前記ジャンルを決定するステップにより決定されたジャンルに応じて前記キャラクタの動作パターンを選択するステップを実行し、前記読み上げるステップは、前記ジャンルを決定するステップにより決定されたジャンルに応じて読み上げ時の発話パターンを選択するステップを実行することができる（請求項２）。
【００２６】
また、前記キャラクタは複数設けられ、前記読み上げるステップは、前記テキスト原稿の内容を分割し当該複数のキャラクタにそれぞれ割り当てそれぞれ異なる音声により読み上げるステップを実行し、前記キャラクタを動作させるステップは、前記読み上げの内容に応じて当該複数のキャラクタを連携して動作させるステップを実行することができる（請求項３）。
【００２７】
また、複数の前記キャラクタのいずれかはモータにより駆動される人形であり、その他はディスプレイに表示された画像であることができる（請求項４）。
【００２８】
前記文章の並べ替えを行うステップは、前記単語または文章を選択するステップにより選択された前記単語または文章をその属性に基づき分類するステップと、前記単語または文章をその属性に応じてあらかじめ定められた所定の配置位置に挿入するステップとを実行することができる（請求項５）。
【００２９】
また、前記連携して動作させるステップは、前記人形によるキャラクタが前記画像によるキャラクタとなって前記ディスプレイに入り込んだように、あるいは、前記画像によるキャラクタが前記人形によるキャラクタとなって前記ディスプレイから飛び出したように見えるように動作させるステップを実行することができる（請求項６）。
【００３０】
あるいは、前記連携して動作させるステップは、前記複数のキャラクタが互いに対話しているように見えるように動作させるステップを実行することができる（請求項７）。このときに、対話の相手は、人形によるキャラクタ同士であったり、あるはい、画像によるキャラクタ同士であったり、あるいは、画像によるキャラクタと人形によるキャラクタとであったりすることができる。
【００３１】
また、前記読み上げるステップは、旋律を生成するステップと、この旋律にしたがって前記テキスト原稿を歌として読み上げるステップとを実行することができる（請求項８）。
【００３２】
本発明の第二の観点は、情報発信装置であって、本発明の特徴とするところは、情報を収集する手段と、この収集する手段により収集した情報から当該情報のタイトルおよび内容を表す単語または文章を選択する手段と、この選択する手段により選択された単語または文章に基づき当該情報のジャンルを決定する手段と、この決定する手段により決定されたジャンルに基づきテキスト原稿作成用文章例を選択する手段と、この選択する手段により選択された前記テキスト原稿作成用文章例に基づき前記単語または文章を選択する手段により選択された前記単語または文章の並べ替えを行う手段と、この並べ替えを行う手段により並べ替えられた前記単語または文章を連結してテキスト原稿を作成する手段と、この作成する手段により作成されたテキスト原稿を音声により読み上げる手段と、この読み上げる手段の読み上げに反応してキャラクタを動作させる手段とを備えたところにある（請求項９）。
【００３３】
また、前記キャラクタを動作させる手段は、前記ジャンルを決定する手段により決定されたジャンルに応じて前記キャラクタの動作パターンを選択する手段を備え、前記読み上げる手段は、前記ジャンルを決定する手段により決定されたジャンルに応じて読み上げ時の発話パターンを選択する手段を備えることができる（請求項１０）。
【００３４】
また、前記キャラクタは複数設けられ、前記読み上げる手段は、前記テキスト原稿の内容を分割し当該複数のキャラクタにそれぞれ割り当てそれぞれ異なる音声により読み上げる手段を備え、前記キャラクタを動作させる手段は、前記読み上げの内容に応じて当該複数のキャラクタを連携して動作させる手段を備えることができる（請求項１１）。
【００３５】
また、前記文章の並べ替えを行う手段は、前記単語または文章を選択する手段により選択された前記単語または文章をその属性に基づき分類する手段と、前記単語または文章をその属性に応じてあらかじめ定められた所定の配置位置に挿入する手段とを備えることができる（請求項１２）。
【００３６】
また、前記読み上げる手段は、旋律を生成する手段と、この旋律にしたがって前記テキスト原稿を歌として読み上げる手段とを備えることができる（請求項１３）。
【００３７】
本発明の第三の観点は、プログラムであって、本発明の特徴とするところは、情報処理装置にインストールすることにより、その情報処理装置に、本発明の情報発信方法の各ステップを実行させるところにある。あるいは、情報処理装置にインストールすることにより、その情報処理装置に、本発明の情報発信装置の各手段に相応する機能を実現させるところにある（請求項１４、１５）。
【００３８】
本発明のプログラムは記録媒体に記録されることにより、前記情報処理装置は、この記録媒体を用いて本発明のプログラムをインストールすることができる。あるいは、本発明のプログラムを保持するサーバからネットワークを介して直接前記情報処理装置に本発明のプログラムをインストールすることもできる。
【００３９】
これにより、情報処理装置を用いて、コンテンツの制作や編集そのものではなく、キャラクタの動作により情報への喚起や情報を印象付ける等の効果を向上させ、新規に作成されたコンテンツはもちろん、これまであったコンテンツにおいても適応の範囲を広げ、情報配信等の付加価値の高いサービスを提供し、また、情報の配信媒体を問わず、受動的に情報を受け取ることを可能にし、さらに、これまで不自然であるとされてきた合成音声に関して、実体のあるキャラクタから音声を出力し、あたかも実体キャラクタそのものが喋っているかのようにみせることによって、より自然にみせることができる情報発信方法および装置を実現することができる。
【００４０】
【発明の実施の形態】
本発明実施形態の情報発信装置を図１〜図９を参照して説明する。図１は本実施形態のキャラクタによる情報発信装置を示す図である。図２は本実施形態のキャラクタによる情報発信方法の基本手順を示す流れ図である。図３は本実施形態のキャラクタによる情報発信方法の基本手順のうち情報処理手順の詳細を示す流れ図である。図４は本実施形態のキャラクタによる情報発信方法の基本手順のうち情報のジャンルを決定するための辞書を示す図である。図５は本実施形態のキャラクタによる情報発信方法の基本手順のうち並べ替えによる読み上げ用テキスト生成の手順を示す流れ図である。図６は本実施形態のキャラクタによる情報発信方法の基本手順のうち読み上げ用テキストのフラグに埋め込む文章を示す図である。図７は本実施形態のキャラクタによる情報発信方法の基本手順のうちキャラクタ動作決定の詳細な流れを示す図である。図８は本実施形態のキャラクタによる情報発信方法の基本手順のうちキャラクタ動作決定の詳細を各ジャンル毎に示した図である。図９は本実施形態のキャラクタによる情報発信方法のうち実体キャラクタ動作を示した図である。
【００４１】
本実施形態の情報発信装置は、図１に示すように、情報を収集する情報処理部１０と、この情報処理部１０により収集した情報から当該情報のタイトルおよび内容を表す単語または文章を選択する情報選択部１１と、この情報選択部１１により選択された単語または文章に基づき当該情報のジャンルを決定するジャンル決定部１２と、このジャンル決定部１２により決定されたジャンルに基づきテキスト原稿作成用文章例を選択する文章例選択部１３と、この文章例選択部１３により選択された前記テキスト原稿作成用文章例に基づき情報選択部１１により選択された前記単語または文章の並べ替えを行う情報並べ替え部１４と、この情報並べ替え部１４により並べ替えられた前記単語または文章を連結してテキスト原稿を作成するテキスト原稿作成部１５と、このテキスト原稿作成部１５により作成されたテキスト原稿を音声により読み上げるテキスト読み上げ部１６と、このテキスト読み上げ部１６の読み上げに反応してキャラクタを動作させるキャラクタ制御部２とを備えたところにある（請求項９）。
【００４２】
キャラクタ制御部２は、ジャンル決定部１２により決定されたジャンルに応じて前記キャラクタの動作パターンを選択するキャラクタ動作決定部２０を備え、テキスト読み上げ部１６は、ジャンル決定部１２により決定されたジャンルに応じて読み上げ時の発話パターンを選択する発話パターン選択部１７を備える（請求項１０）。
【００４３】
前記キャラクタは複数設けられ、図１の例では、一つは実体キャラクタであり、他の一つは画像キャラクタである。テキスト読み上げ部１６は、前記テキスト原稿の内容を分割し当該複数のキャラクタにそれぞれ割り当てそれぞれ異なる音声により読み上げる読み上げキャラクタ割当部１８を備え、キャラクタ制御部２は、前記読み上げの内容に応じて当該複数のキャラクタを連携して動作させる（請求項１１）。
【００４４】
また、情報並べ替え部１４は、図６に示すように、情報選択部１１により選択された前記単語または文章をその属性に基づき分類し、前記単語または文章をその属性に応じてあらかじめ定められた所定の配置位置に挿入する（請求項１２）。
【００４５】
また、テキスト読み上げ部１６は、旋律を生成する旋律生成部１９を備え、この旋律にしたがって前記テキスト原稿を歌として読み上げることもできる（請求項１３）。
【００４６】
本発明は、汎用の情報処理装置にインストールすることにより、上述の情報発信装置として機能させるプログラムとして実現することができる（請求項１４、１５）。このプログラムは、記録媒体に記録されて情報処理装置にインストールされ、あるいは通信回線を介して情報処理装置にインストールされることにより当該情報処理装置を本実施形態の情報発信装置として機能させることができる。
【００４７】
以下に本発明実施形態を、図面を参照して詳細に説明する。図２に本発明実施形態の基本手順を示す。ステップＳ１の情報処理部で情報を検知し、情報処理し、ステップＳ２のキャラクタ動作決定部でキャラクタ動作を決定し、ステップＳ３の情報発信動作部で情報発信のための動作を行う。なお、図１に示した情報発信装置において、ステップＳ１は情報処理部１で実施され、ステップ２はキャラクタ制御部２で実施され、ステップＳ３は表示部３で実施される。表示部３は画像キャラクタ、実体キャラクタおよびスピーカを備え、ここから情報を発信する。
【００４８】
図２のステップＳ１の情報処理では、情報から必要な箇所を選択し、情報のタイトル、ジャンル、キーワード等を抽出し、抽出したデータを基に情報を並べ替えることにより読み上げ用のテキスト原稿を作成する。図３に図２のステップＳ１の処理手順の詳細を示す。まず、ステップＳ１０１である程度広範囲に情報を収集する。その情報の中から、ステップＳ１０２では必要なものを選択する。ステップＳ１０３では、選択した情報に基づき情報のジャンルを決定する。選択した情報をステップＳ１０４で並べ替えて、ステップＳ１０５で読み上げ用の原稿とする（請求項１）。
【００４９】
ここではステップＳ１０２での必要な情報の選択、ステップＳ１０３でのジャンル決定、ステップＳ１０４での情報の並べ替え、ステップＳ１０５の読み上げ用テキスト原稿の作成を行う例として、インターネット上のニュースサイトの内容を挙げてこの手順を説明する。例えば、代表的なニュースサイトであるアサヒ・コム（ｈｔｔｐ：／／ｗｗｗ．ａｓａｈｉ．ｃｏｍ）を閲覧すると、各ニュースはタイトルと本文および場合によって画像とその画像を説明する短いテキストにより構成されている。タイトルは平均２３．７文字で構成され、本文はおよそ２５０〜４５０文字、２〜３のパラグラフで構成されている（発明者が無作為に選んだアサヒ・コム内の１０個のサイトを調査）。
【００５０】
ステップＳ１では、このような読み物として作られた情報を、例えば読み上げる場合に、読み上げに適した長さのテキストを再構成するための手順を示しているのが図３である。ステップＳ１０２では、このタイトルと本文、また画像の周りのテキストのうち、タイトルからはタイトル全文、本文からは各パラグラフの第一文、画像の周りのテキスト全文を選択する。次にステップＳ１０３では、ステップＳ１０２で選択した文章において、あらかじめ準備した辞書内のキーワードの有無により、この情報のジャンルをスポーツ、お天気、特例である等とそれぞれ決定する。図４に辞書の例を示す。
【００５１】
例えば、スポーツ用辞書の場合、スポーツ、野球、ゴルフ、…等があり、これらに相当する単語があれば情報のジャンルはスポーツであると決定する。いずれの辞書のキーワードも含まれていない場合は、その他としてジャンルを決定する。次にステップＳ１０４ではステップＳ１０３で図５に示す手順でステップＳ１０２で選択した文章を並べ替える。
【００５２】
ステップＳ１０４−０１でテキストファイルを作成し、ステップＳ１０４−０２で冒頭用フラグを書き込み、ステップＳ１０４−０３でタイトル書き込みを行う。ステップＳ１０４−０４では画像の有無を判定する。画像が有る場合はステップＳ１０４−０５に進み画像用フラグを書き込み、ステップＳ１０４−０６で画像付近のテキスト書き込みを行う。画像が無い場合はステップＳ１０４−０７へ進み、パラグラフの有無を判定する。パラグラフが有る場合はステップＳ１０４−０８に進み本文用フラグを書き込み、ステップＳ１０４−０９でパラグラフ中の第一文の書き込みを行う。パラグラフが無い場合は終了処理を行う。以上の手順をフローチャートに示すとおり順次行う。最後にステップＳ１０５で読み上げ用のテキスト原稿を完成する（請求項５）。
【００５３】
ここでは、ステップＳ１０３で決定したこの情報のジャンルごとに、スポーツにはスポーツ用文章、お天気にはお天気用文章、特例とそれ以外のジャンルにはその他用文章を図６に示す文章例を基に各フラグ毎に書き込む。以上の手順で読み上げ用のテキスト原稿を作成する。この手順を用いて例えばスポーツ用の読み上げ用のテキスト原稿を作成すると以下のようになる。
【００５４】
「スポーツのニュースがあるよ。今日はどんなスポーツがあったのかな？」「ヤンキーズ松本、トロントスカイドームで練習」「見てみて、この写真はね、開幕を翌日に控え、チームメートと談笑しながらストレッチで汗を流す松本だよ」「このニュースの内容はね、ヤンキーズの松本は３０日、開幕戦を翌日に控えて、カナダ・トロントのスカイドームで練習試合をした」「それからね、正午から２時間、打撃練習などをしてドームの感触を確かめたんだって」
図７に図２のステップＳ２の処理手順の詳細を示す。ステップＳ２０２では発信する情報の種類に応じてキャラクタの動作を決定し、ステップＳ２０３では発信する情報の種類に応じてキャラクタの喋り方、歌うないし読み上げる等の発話方法を決定する（請求項２）。ステップＳ２０４では発信する情報の種類に応じて掛け合いの方法を決定する。その際、ステップＳ２０１では、ステップＳ１０３で決定した情報のジャンルによって、内容を変化させる。このジャンルと内容の対応例を図８に示し、以下に詳細に述べる。
【００５５】
情報がスポーツである場合、キャラクタの動作は大きくかつ頻度も高く設定し、喋り方は話し言葉とする。歌も併用しテキストの内容を歌にしたものをキャラクタが歌う。掛け合いはボケと突っ込み等コミカルなものとする。情報が天気である場合、キャラクタの動作は中程度とする。喋り方は話し言葉とする。掛け合いは質問形式にし、視聴者がより情報を記憶に残りやすいようにする。情報が特例およびその他のジャンルである場合、キャラクタの動作は小さくし、頻度も低くする。喋り方はアナウンサーのようにし、内容を伝えることに主眼を置く。掛け合いも少なくし、特に特例の場合は掛け合いは全く使わず、交互に内容を読み上げるだけにとどめる。
【００５６】
ここでは、実体キャラクタおよび画像キャラクタの両者に関する処理手順を示している（請求項４）。実体キャラクタ同士による動作、実体キャラクタと画像キャラクタとの組み合わせによる動作、画像キャラクタ同士の動作がそれぞれある。実体キャラクタおよび画像キャラクタのうち、互いに同種類のキャラクタ同士は連動し、実体キャラクタが退場するなどして隠れたことを受けて画像キャラクタが画面の中に登場したり、逆に画像キャラクタが画面に表示されなくなるなどして隠れたことを受けて実体キャラクタが登場したりすることにより、あたかもキャラクタが実世界と画面上を行き来しているかのような動作を行う（請求項６）。
【００５７】
情報を発信する際、キャラクタは登場・退場の動作を行うが、このときキャラクタは手を振ったり、喋ったり、歌を歌ったりしながら登場・退場の動作を行うことにより、情報に対する注意を喚起する。
【００５８】
図９に実体キャラクタの動きを示す。矢印で示されたとおり、キャラクタ本体中の動きＭ１は首、動きＭ２は顔、動きＭ３は口、動きＭ４は手、動きＭ５は胴体、動きＭ６足、および動きＭ７キャラクタの台座の動きを示し、これらの動きを複合的に行い、登場や退場または発話時の発話らしい動作を行い視聴者への情報の注意を喚起する。例えば、ステップＳ１０３で発信する情報がスポーツであると決定された場合、登場時に台座の動きＭ７は速く、手の動きＭ７は手を上げた状態且つ手を振る。登場が終わり、コンテンツを発信する際には、足の動きＭ６で歩いているように見せながら、胴体の動きＭ５で画面の方を向き、同時に腕の動きＭ４でコンテンツを指し示すように手を上げ、首および顔の動きＭ１、Ｍ２を用いてコンテンツを見るようにする。
【００５９】
口の動きＭ３を伴いながらコンテンツの内容を発話し、発話中は首および顔の動きＭ１、Ｍ２により、視聴者の方を見たり、コンテンツの方を向いたりを頻繁に繰り返し、手の動きＭ４胴の動きＭ５足の動きＭ６を加えて身振り手振りをしているような動作を行う。退場時には登場時同様、手の動きＭ７により手を上げた状態且つ手を振ったり、足の動きＭ６で飛び跳ねているように見せたりする。
【００６０】
読み上げまたは歌を歌う際、合成音を用いる場合と、人間が実際に発声した物を使う場合の両者を用いることにより、そのコンテンツの内容にあった形で、より質の高い情報の配信を実現する。
【００６１】
発信する情報の内容が特にマニュアルや、Ｑ＆Ａ（質疑応答）の形である場合、キャラクタ同士に掛け合いをさせる（請求項３、７）。例えば「どうしてそうなるの？」「それはね、〇〇だからだよ」のように話し言葉による会話の形をとることによって、ただ文章のままのマニュアルやＱ＆Ａよりも、分かり易く且つ印象深い内容で情報を提供する。
【００６２】
発信する情報の内容が特に対談である場合、各キャラクタに役割分担をさせることによって、その対談形式の情報を実際の対談と同じ様に受け取ることを可能にする。
【００６３】
動作の決定方法は以下に３点示す。
【００６４】
１）歌の音階の種類により、動作を決定する。情報処理部１で生成された音楽や、あらかじめ準備してあった音楽のうち、キャラクタ制御部２が、選択した音楽の音階の種類によって、動作方法を決定する。
【００６５】
２）テキストから情報を収集し動作の種類を決定する。その際、例えば、特許文献５に示されている合成音声メッセージ編集作成方法を用いて、テキストから情報を収集し動作の種類を決定する。
【００６６】
３）例えば、特許文献５を応用し、動作を決定する。キャラクタの種類を想定し、キャラクタごとに適した辞書を用意し、辞書内の決められた単語の種類によって楽しく、悲しく等の動作を決定する。
【００６７】
例えば、特許文献５や特許文献６の合成音声メッセージ編集作成方法、その装置およびその方法を記録した記録媒体等による合成音声は、その音質から、人間の発声を模したものとしての違和感があり、自然に聞こえないという問題点があった。しかし本発明のキャラクタによる情報発信方法を適用することにより、音を聞く際の注意がキャラクタに向けられることにより分散されたり、もともと作り物であるキャラクタが喋っているように見えるために自然に見えたりする場合がある。
【００６８】
例えば、特許文献２の音声メッセージ生成・配信方法およびその装置を用いて情報処理部１によって準備された読み上げ用テキストに対し、音楽をつけて、歌を生成する（請求項８）。
【００６９】
本発明は、もともとテキストや画像といった、見て認識する視聴情報に対して、それを基に読み上げ用のテキスト原稿を作り、キャラクタに読み上げさせたり、歌わせたりする視聴情報を加えるため、例えば他の事をしているときなどでも、なんとなく音を聞いているだけで受動的に情報を得ることができる。
【００７０】
【発明の効果】
以上述べたように、本発明によれば、情報への喚起や情報を印象付ける等の効果を向上させ、受動的に情報を受け取ることを可能にし、これまで不自然であるとされてきた合成音声を、より自然なものとして提供できる。
【図面の簡単な説明】
【図１】本実施形態のキャラクタによる情報発信装置を示す図。
【図２】本実施形態のキャラクタによる情報発信方法の基本手順を示す流れ図。
【図３】本実施形態のキャラクタによる情報発信方法の基本手順のうち情報処理手順の詳細を示す流れ図。
【図４】本実施形態のキャラクタによる情報発信方法の基本手順のうち情報のジャンルを決定するための辞書を示す図。
【図５】本実施形態のキャラクタによる情報発信方法の基本手順のうち並べ替えによる読み上げ用テキスト生成の手順を示す流れ図。
【図６】本実施形態のキャラクタによる情報発信方法の基本手順のうち読み上げ用テキストのフラグに埋め込む文章を示す図。
【図７】本実施形態のキャラクタによる情報発信方法の基本手順のうちキャラクタ動作決定の詳細な流れを示す図。
【図８】本実施形態のキャラクタによる情報発信方法の基本手順のうちキャラクタ動作決定の詳細を各ジャンル毎に示した図。
【図９】本実施形態のキャラクタによる情報発信方法のうち実体キャラクタ動作を示した図。
【符号の説明】
１情報処理部
２キャラクタ制御部
３表示部
１０情報収集部
１１情報選択部
１２ジャンル決定部
１３文章例選択部
１４情報並べ替え部
１５テキスト原稿作成部
１６テキスト読み上げ部
１７発話パターン選択部
１８読み上げキャラクタ割当部
１９旋律生成部
２０キャラクタ動作決定部[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention is applied to a device for displaying information. In particular, the present invention relates to a technique for alerting a viewer of information to information and impressing information.
[0002]
[Prior art]
In media such as television, radio, and street advertising, as a way of calling attention to information or impressing information, use popular celebrities, use designs by popular designers, and add sound effects. , Content is produced by reputable producers and editors, etc.
[0003]
When trying to obtain information using a personal computer, the user needs to actively search for information. However, as a browser used when browsing information on the Internet, there is known a technology for providing a voice interface, such as a character speaking by a synthetic voice as an agent or a user inputting voice.
[0004]
As a method for generating a synthesized voice, for example, a voice synthesis method disclosed in Patent Document 1 is disclosed. However, the conventional synthesized speech has a problem that it cannot always be said to be a natural sound close to humans, and a method of displaying an agent on a screen has been used as an interface for solving the problem.
[0005]
As one application example of voice synthesis, there is a singing voice message generation / distribution method and apparatus for synthesizing a singing voice (for example, see Patent Literature 2). A method of synthesizing and distributing is being performed.
[0006]
As a display device that attracts attention, for example, there is a wall surface movement performance device disclosed in Patent Document 3, which provides a wall surface movement performance device having a high effect of attracting public attention by freely moving around a wall surface.
[0007]
As an advertisement distribution method, for example, there is an electronic advertisement system disclosed in Patent Literature 4, in which only an advertisement viewer who actually views an advertisement displayed on an advertisement terminal can change advertisement data displayed on the advertisement display terminal. In this way, an electronic advertisement system is provided that is effective for increasing the interest of advertisement viewers in advertisement.
[0008]
[Patent Document 1]
JP-A-6-308997
[Patent Document 2]
JP-A-2002-132281
[Patent Document 3]
JP-A-7-140919
[Patent Document 4]
JP-A-2002-150130
[Patent Document 5]
JP-A-11-202883
[Patent Document 6]
JP-A-10-139323
[0009]
[Problems to be solved by the invention]
In information transmission devices and methods such as televisions, radios, street advertisements, personal computers, etc., the improvement of the effect of calling attention to information to the viewer and impressing the information has been achieved by manually increasing the content. It relies on the experience of its creators and editors. Therefore, there is a problem that it takes time and cost.
[0010]
In the present invention, the viewpoint is changed, not the production or editing of the content itself, but the effect of evoking information or impressing the information by the movement of the character, etc., and improving the effect of newly created content as well as the content which has existed before. It is also an object of the present invention to provide a high value-added service such as information distribution by expanding the range of adaptation.
[0011]
Further, when trying to obtain information using a personal computer, the user must actively search for information, and it is difficult to passively receive information. Therefore, an object of the present technology is to enable passive reception of information regardless of the information distribution medium.
[0012]
Furthermore, with regard to synthesized speech that has been considered to be unnatural, the purpose is to output sound from a substantial character and make it appear more natural by making it appear as if the actual character itself is talking. .
[0013]
[Means for Solving the Problems]
Devices for transmitting information, such as televisions, radios, street advertisements, and personal computers, and information transmission methods therefor are widely known. In such an information transmitting apparatus and method, it is important to call attention to information to a viewer and further impress the information. In addition, when trying to obtain information using a personal computer, the user needs to actively search for information.
[0014]
The present invention relates to an information transmission apparatus and method for transmitting information that enhances the effect of attracting attention to a viewer and improves the persistence of impressions or enhances the distribution that enables passive acquisition of information. Apparatus and method.
[0015]
As means for achieving the above object, a real character and an image character are used. A real character performs appearance and exit operations, and an image character performs appearance and exit operations.
[0016]
When the real character appears and exits, the character performs operations such as singing, speaking, and waving the text information of the content. After the appearance operation is completed, information is transmitted while the real character and the image character are linked to communicate with each other.
[0017]
Regarding synthesized speech that has been regarded as unnatural until now, sound is output from a substantial character, it looks as if the real character itself is talking, and by linking the real character and the image character, The conventional method of reading out a synthesized voice using an image character is shown more naturally.
[0018]
In other words, in addition to the information transmission method that the agent has done so far, the real character performs the appearance and exit action, singing, talking, waving, etc. It is possible to raise the awareness of the information.
[0019]
When the real character appears and leaves, singing, speaking, waving, etc., the text information of the content, the auditory and visual effects are further enhanced to alert and alert Can be impressed. In addition, by using a method of singing and talking, information is transmitted by voice, so that the viewer does not need to follow text information with his / her eyes, and can passively receive information. Further, after the appearance operation is completed, the real character and the image character are linked to each other so that the content becomes more interesting and the content of the information can be impressed.
[0020]
Regarding synthesized speech that has been regarded as unnatural until now, sound is output from a substantial character, it looks as if the real character itself is talking, and by linking the real character and the image character, It is possible to make the conventional method of reading out the synthesized voice by the image character look more natural.
[0021]
The present invention is a method of distributing information that appears by a signal from a real character control device that has detected new arrival information in response to a notification from an information management device, further transmits information, and then exits. It has the effect of attracting attention to each piece of information by dropping or leaving.
[0022]
The present invention is an information transmitting unit in which a character reads out information. By transmitting information in this manner, for example, even if information has been actively collected and acquired so far, such as when using a Web browser, It has the effect of providing an environment that can be passively received.
[0023]
The present invention is an information transmitting means in which a character sings information as a song. However, depending on the content of the information to be transmitted, some characters, such as sports news, can be more enjoyable when made into a song, but it is not appropriate to make a song. There is an effect of selecting information transmission means according to the characteristics of the information, and transmitting the information more accurately.
[0024]
That is, a first aspect of the present invention is an information transmitting method, which is characterized by a step of collecting information, and a word or a sentence representing the title and content of the information from the collected information. Selecting a genre of the information based on the selected word or sentence, selecting a text example for creating a text manuscript based on the determined genre, and selecting the selected text. Rearranging the words or sentences selected in the step of selecting the words or sentences based on the manuscript creation sentence example, and creating a text manuscript by linking the sorted words or sentences. And a step of reading out the created text manuscript by voice, and a character responding to the reading out. There is to be run and operating the motor (claim 1).
[0025]
Further, the step of operating the character includes executing a step of selecting an operation pattern of the character according to the genre determined by the step of determining the genre, and the step of reading out is determined by a step of determining the genre. A step of selecting an utterance pattern at the time of reading out according to the performed genre can be executed (claim 2).
[0026]
Also, a plurality of the characters are provided, and the reading step executes a step of dividing the contents of the text document and assigning the contents to the plurality of characters, and reading them out with different voices. A step of operating the plurality of characters in cooperation in accordance with the content can be executed (claim 3).
[0027]
Further, one of the plurality of characters may be a doll driven by a motor, and the other may be an image displayed on a display.
[0028]
The step of rearranging the sentences is a step of classifying the word or the sentence selected by the step of selecting the word or the sentence based on its attribute, and the step of sorting the word or the sentence according to the attribute. Inserting at a predetermined arrangement position (claim 5).
[0029]
Further, the step of operating in cooperation with the character of the doll has entered the display as a character of the image, or the character of the image has jumped out of the display as a character of the doll. (See claim 6).
[0030]
Alternatively, the step of operating in cooperation may execute the step of operating so that the plurality of characters seem to be interacting with each other (claim 7). At this time, the conversation partner can be a doll character, or yes, an image character, or an image character and a doll character.
[0031]
The reading out step may include a step of generating a melody and a step of reading out the text document as a song according to the melody (claim 8).
[0032]
A second aspect of the present invention is an information transmitting apparatus, which is characterized by a means for collecting information, and a word representing the title and content of the information from the information collected by the collecting means. Or means for selecting a sentence, means for determining the genre of the information based on the word or sentence selected by the selecting means, and selecting a text manuscript creation sentence example based on the genre determined by the determining means Means for reordering the words or sentences selected by the means for selecting the words or sentences on the basis of the text document creation example sentence selected by the selecting means, and performing the sorting Means for creating a text manuscript by connecting the words or sentences rearranged by the means, and Means for reading out a text document by voice, is in place and means for operating the character in response to reading of the spoken means (claim 9).
[0033]
Further, the means for operating the character includes means for selecting an operation pattern of the character according to the genre determined by the means for determining the genre, and the means for reading out is determined by the means for determining the genre. Means for selecting an utterance pattern at the time of reading out according to the genre can be provided (claim 10).
[0034]
The character is provided in plurality, and the reading means includes means for dividing the contents of the text document and allocating to each of the plurality of characters, and reading the text with different voices. The means for operating the character includes the contents of the reading. (Claim 11).
[0035]
Further, the means for rearranging the sentence includes: means for classifying the word or sentence selected by the means for selecting the word or sentence based on its attribute; and means for pre-determining the word or sentence in accordance with the attribute. Means for inserting the device into a predetermined position.
[0036]
The reading means may include a means for generating a melody and a means for reading the text document as a song according to the melody (claim 13).
[0037]
A third aspect of the present invention is a program. A feature of the present invention is that the program is installed in an information processing apparatus to cause the information processing apparatus to execute each step of the information transmitting method of the present invention. There. Alternatively, by installing in an information processing apparatus, the information processing apparatus realizes a function corresponding to each means of the information transmitting apparatus of the present invention (claims 14 and 15).
[0038]
Since the program of the present invention is recorded on a recording medium, the information processing apparatus can install the program of the present invention using the recording medium. Alternatively, the program of the present invention can be directly installed on the information processing apparatus from a server holding the program of the present invention via a network.
[0039]
By using the information processing device, instead of creating or editing the content itself, the effect of evoking information and impressing the information by the movement of the character is improved, and newly created content as well as It expands the scope of adaptation to existing content, provides high value-added services such as information distribution, and enables passive reception of information regardless of the information distribution medium. Realized an information transmission method and device that can output more natural sound by outputting sound from a character with a substance with regard to synthesized speech that has been regarded as natural and making it appear as if the substance character itself is talking. can do.
[0040]
BEST MODE FOR CARRYING OUT THE INVENTION
An information transmission device according to an embodiment of the present invention will be described with reference to FIGS. FIG. 1 is a diagram illustrating an information transmission device using characters according to the present embodiment. FIG. 2 is a flowchart showing the basic procedure of the information transmission method using characters according to the present embodiment. FIG. 3 is a flowchart showing details of the information processing procedure among the basic procedures of the character information transmission method of the present embodiment. FIG. 4 is a diagram showing a dictionary for determining the genre of information in the basic procedure of the information transmission method using characters according to the present embodiment. FIG. 5 is a flowchart showing a procedure for generating a text for reading out by rearranging, among the basic procedures of the information transmission method using characters according to the present embodiment. FIG. 6 is a diagram showing a sentence embedded in the flag of the text for reading out of the basic procedure of the information transmitting method by the character of the present embodiment. FIG. 7 is a diagram showing a detailed flow of character action determination in the basic procedure of the character information transmission method of the present embodiment. FIG. 8 is a diagram showing, for each genre, details of character action determination in the basic procedure of the character information transmission method of the present embodiment. FIG. 9 is a diagram illustrating a substantial character operation in the information transmission method using characters according to the present embodiment.
[0041]
As shown in FIG. 1, the information transmitting device of the present embodiment selects an information processing unit 10 for collecting information and a word or a sentence representing the title and content of the information from the information collected by the information processing unit 10. An information selection unit 11, a genre determination unit 12 that determines the genre of the information based on the word or text selected by the information selection unit 11, and a text original creation text based on the genre determined by the genre determination unit 12 A sentence example selection unit 13 for selecting an example, and information sorting for sorting the words or sentences selected by the information selection unit 11 based on the text example for text original creation selected by the sentence example selection unit 13 Unit 14 and a text for creating a text document by linking the words or sentences rearranged by the information rearranging unit 14 A text preparation unit 15, a text reading unit 16 that reads out a text document prepared by the text preparation unit 15 by voice, and a character control unit 2 that operates a character in response to the reading of the text reading unit 16. (Claim 9).
[0042]
The character control unit 2 includes a character motion determination unit 20 that selects a motion pattern of the character according to the genre determined by the genre determination unit 12. The text-to-speech unit 16 determines the genre determined by the genre determination unit 12. An utterance pattern selection unit 17 is provided for selecting an utterance pattern at the time of reading out in response.
[0043]
A plurality of the characters are provided. In the example of FIG. 1, one is a real character and the other is an image character. The text-to-speech unit 16 includes a text-to-speech character allocating unit 18 that divides the contents of the text document and allocates the divided characters to the plurality of characters, and reads out the texts with different voices. The character is operated in cooperation (claim 11).
[0044]
Also, as shown in FIG. 6, the information rearranging unit 14 classifies the word or the sentence selected by the information selecting unit 11 based on its attribute, and sorts the word or the sentence in advance according to the attribute. It is inserted into a predetermined arrangement position (claim 12).
[0045]
The text-to-speech unit 16 includes a melody generating unit 19 that generates a melody, and can read out the text document as a song according to the melody.
[0046]
The present invention can be realized as a program that functions as the above-described information transmission device by being installed in a general-purpose information processing device (claims 14 and 15). This program is recorded on a recording medium and installed in the information processing device, or installed in the information processing device via a communication line, whereby the information processing device can function as the information transmitting device of the present embodiment. .
[0047]
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. FIG. 2 shows a basic procedure of the embodiment of the present invention. The information is detected and processed by the information processing unit in step S1, the character motion is determined by the character motion determination unit in step S2, and the information transmission operation unit performs an operation for transmitting information in step S3. In the information transmission device shown in FIG. 1, step S1 is performed by the information processing unit 1, step 2 is performed by the character control unit 2, and step S3 is performed by the display unit 3. The display unit 3 includes an image character, a real character, and a speaker, and transmits information from the speaker.
[0048]
In the information processing in step S1 in FIG. 2, a necessary portion is selected from the information, a title, a genre, a keyword, and the like of the information are extracted, and the information is rearranged based on the extracted data to create a text document for reading aloud. I do. FIG. 3 shows the details of the processing procedure of step S1 in FIG. First, information is collected over a wide range to some extent in step S101. In step S102, necessary information is selected from the information. In step S103, the genre of information is determined based on the selected information. The selected information is rearranged in step S104, and the read information is read in step S105 (claim 1).
[0049]
Here, as an example of selecting necessary information in step S102, determining a genre in step S103, rearranging information in step S104, and creating a text manuscript for reading in step S105, the contents of a news site on the Internet are described. This procedure is described below. For example, when browsing a typical news site, Asahi.com (http://www.asahi.com), each news is composed of a title and a main body, and in some cases, an image and a short text explaining the image. . The title consists of an average of 23.7 characters and the body consists of approximately 250 to 450 characters and a few paragraphs (investigating 10 sites in the Asahi.com randomly selected by the inventor). .
[0050]
FIG. 3 shows a procedure for reconstructing a text having a length suitable for reading when information read as such a reading material is read, for example, in step S1. In step S102, of the title, the main text, and the text around the image, the title is selected from the title, the first sentence of each paragraph is selected from the text, and the full text around the image is selected. Next, in step S103, in the text selected in step S102, the genre of this information is determined to be sports, weather, special cases, etc. based on the presence or absence of a keyword in a dictionary prepared in advance. FIG. 4 shows an example of the dictionary.
[0051]
For example, in the case of a sports dictionary, there are sports, baseball, golf,..., Etc., and if there is a word corresponding to these, the genre of information is determined to be sports. If neither dictionary keyword is included, the genre is determined as others. Next, in step S104, the sentences selected in step S102 are rearranged in the procedure shown in FIG. 5 in step S103.
[0052]
A text file is created in step S104-01, a head flag is written in step S104-02, and a title is written in step S104-03. In step S104-04, the presence or absence of an image is determined. If there is an image, the flow advances to step S104-05 to write an image flag, and the text near the image is written in step S104-06. If there is no image, the flow advances to step S104-07 to determine whether there is a paragraph. If there is a paragraph, the flow advances to step S104-08 to write a text flag, and in step S104-09, the first sentence in the paragraph is written. If there is no paragraph, end processing is performed. The above procedure is sequentially performed as shown in the flowchart. Finally, in step S105, a text document for reading out is completed (claim 5).
[0053]
Here, for each genre of this information determined in step S103, sports texts for sports, weather texts for weather, and other texts for special cases and other genres are based on the text examples shown in FIG. Write for each flag. The text manuscript for reading out is created by the above procedure. For example, a text manuscript for reading aloud for sports using this procedure is as follows.
[0054]
"I have sports news. What kind of sports did you have today?""Yankees Matsumoto practice at the Toronto Sky Dome.""Look at this picture. "It's Matsumoto who sweats by stretching.""The content of this news is that the Yankees Matsumoto played a practice game at the Sky Dome in Toronto, Canada, ahead of the opening game on the next day, 30" I practiced hitting for two hours and checked the feel of the dome. "
FIG. 7 shows details of the processing procedure of step S2 in FIG. In step S202, the action of the character is determined in accordance with the type of information to be transmitted, and in step S203, a method of speaking, singing, or reading out the character is determined in accordance with the type of information to be transmitted (claim 2). In step S204, a negotiating method is determined according to the type of information to be transmitted. At this time, in step S201, the content is changed according to the genre of the information determined in step S103. FIG. 8 shows an example of the correspondence between the genre and the content, which will be described in detail below.
[0055]
If the information is sports, the character's action is set to be large and frequently, and the way of speaking is spoken. The character sings a song that combines the contents of the text with a song. Negotiations should be comical, such as blurring and plunging. When the information is the weather, the movement of the character is moderate. Speak in spoken language. Negotiations are in the form of a question, so that the viewer can more easily remember the information. When the information is a special case or another genre, the movement of the character is reduced and the frequency is also reduced. Speak like an announcer and focus on communicating the content. The number of negotiations is reduced, especially in special cases, where negotiations are not used at all and only the contents are read aloud alternately.
[0056]
Here, a processing procedure relating to both a real character and an image character is shown (claim 4). There are an operation by the real characters, an operation by the combination of the real character and the image character, and an operation by the image characters. Of the real character and the image character, characters of the same type are linked to each other, and the image character appears on the screen in response to the real character leaving and hiding, and conversely, the image character appears on the screen. When the real character appears in response to being hidden such as disappearing from the display, an operation is performed as if the character were moving back and forth between the real world and the screen.
[0057]
When transmitting information, the character performs an appearance / leaving action. At this time, the character performs an appearance / leaving action while waving, talking, or singing a song, thereby calling attention to the information. I do.
[0058]
FIG. 9 shows the movement of the real character. As indicated by the arrows, the movement M1 in the character body is the neck, the movement M2 is the face, the movement M3 is the mouth, the movement M4 is the hand, the movement M5 is the body, the movement M6 feet, and the movement of the base of the movement M7 character. These movements are performed in a composite manner, and the appearance, exit, or utterance-like operation at the time of utterance is performed to alert the viewer to the information. For example, when it is determined in step S103 that the information to be transmitted is a sport, the pedestal moves M7 at a high speed at the time of appearance, and the hand movement M7 raises and shakes the hand. When the appearance is over and the content is transmitted, raise the hand so that it looks as if walking with foot motion M6, face the screen with body motion M5, and at the same time point the content with arm motion M4. , The content is viewed using the neck and face movements M1 and M2.
[0059]
The user utters the contents of the content while accompanied by the mouth movement M3. During the utterance, the head and head movements M1 and M2 frequently repeat looking at the viewer and facing the content, and the hand movement M4. The movement of the torso is performed by adding the movement M6 of the trunk M5 and the movement M6 of the foot. At the time of exit, as in the case of appearance, the hand is raised with the hand motion M7 and the hand is waved, or it appears as if it is jumping with the foot motion M6.
[0060]
Realization of higher-quality information in the form that matches the content of the content by using both synthetic speech and the use of actual utterances when reading or singing songs I do.
[0061]
If the content of the information to be transmitted is in the form of a manual or Q & A (question and answer), the characters are negotiated with each other (claims 3 and 7). By taking the form of a spoken dialogue such as "Why is it?" Or "That's why, it's because it's so," information is more comprehensible and impressive than a text-only manual or Q & A. provide.
[0062]
When the content of the information to be transmitted is particularly a dialogue, by assigning roles to the respective characters, it becomes possible to receive the information in the form of the dialogue in the same way as the actual dialogue.
[0063]
The following three methods are used to determine the operation.
[0064]
1) The action is determined according to the type of musical scale. Among the music generated by the information processing unit 1 and the music prepared in advance, the character control unit 2 determines the operation method according to the type of the scale of the selected music.
[0065]
2) Gather information from the text and determine the type of action. At that time, information is collected from the text and the type of operation is determined, for example, by using the synthetic voice message editing and creating method disclosed in Patent Document 5.
[0066]
3) For example, the operation is determined by applying Patent Document 5. Assuming the types of characters, a dictionary suitable for each character is prepared, and a fun, sad, etc. action is determined according to the determined word type in the dictionary.
[0067]
For example, the synthesized voice message editing and creating method of Patent Literature 5 and Patent Literature 6, the synthesized voice by the device and the recording medium on which the method is recorded have a sense of incongruity as imitating human utterance from the sound quality, There was a problem that it could not be heard naturally. However, by applying the information transmission method by the character of the present invention, attention when listening to sound is dispersed by being directed to the character, or the character that is originally made looks natural because it seems to be talking. May be.
[0068]
For example, a song is generated by adding music to the text for reading prepared by the information processing unit 1 using the voice message generation / distribution method and the apparatus of Patent Document 2 (claim 8).
[0069]
The present invention creates a text manuscript for reading aloud based on viewing information originally recognized and recognized, such as text and images, and adds viewing information for causing a character to read or sing a song. Even when you are doing things, you can get information passively just by listening to the sound.
[0070]
【The invention's effect】
As described above, according to the present invention, it is possible to improve the effects of evoking information and impressing information, to enable passive reception of information, and to provide a synthesis that has been considered to be unnatural. Sound can be provided as more natural.
[Brief description of the drawings]
FIG. 1 is an exemplary view showing an information transmission device using characters according to an embodiment.
FIG. 2 is a flowchart showing a basic procedure of an information transmission method by a character according to the embodiment.
FIG. 3 is a flowchart showing details of an information processing procedure among basic procedures of a character information transmission method according to the embodiment;
FIG. 4 is an exemplary view showing a dictionary for determining a genre of information in the basic procedure of the information transmission method using characters according to the embodiment.
FIG. 5 is a flowchart showing a procedure of generating a text for reading out by rearranging among basic procedures of the information transmission method using characters according to the embodiment;
FIG. 6 is a diagram showing a sentence embedded in a flag of the text for reading out of the basic procedure of the information transmission method using characters according to the embodiment.
FIG. 7 is a view showing a detailed flow of character action determination in a basic procedure of a character information transmission method according to the embodiment.
FIG. 8 is a diagram showing, for each genre, details of character action determination in a basic procedure of a character information transmission method according to the embodiment.
FIG. 9 is an exemplary view showing an actual character operation in the information transmission method using characters according to the embodiment;
[Explanation of symbols]
1 Information processing unit
2 Character control unit
3 Display
10 Information Collection Department
11 Information selection section
12 Genre decision section
13 Sentence example selector
14 Information sorting unit
15 Text manuscript creation department
16 Text-to-speech section
17 Utterance pattern selector
18 Speaking character assignment unit
19 Melody generator
20 Character motion decision unit

Claims

Collecting information;
Selecting from the collected information a word or sentence representing the title and content of the information;
Determining the genre of the information based on the selected word or sentence;
Selecting a text manuscript writing example based on the determined genre;
Rearranging the word or sentence selected by the step of selecting the word or sentence based on the selected text original creation sentence example;
Concatenating the sorted words or sentences to create a text manuscript;
Reading out the created text manuscript by voice;
Operating the character in response to the reading aloud.

The step of operating the character executes a step of selecting an operation pattern of the character according to the genre determined by the step of determining the genre,
2. The information transmitting method according to claim 1, wherein the reading step includes a step of selecting an utterance pattern at the time of reading according to the genre determined in the step of determining the genre.

The character is provided in plurality,
The reading aloud step executes a step of dividing the content of the text document and assigning each of the plurality of characters to each of the plurality of characters, and reading each of the plurality of characters with a different voice.
The information transmitting method according to claim 1, wherein the step of operating the character includes the step of operating the plurality of characters in cooperation with each other in accordance with the content of the reading.

4. The information transmitting method according to claim 1, wherein one of the plurality of characters is a doll driven by a motor, and the other is an image displayed on a display.

The step of rearranging the sentences,
Classifying the word or sentence selected by the step of selecting the word or sentence based on its attribute;
Inserting the word or sentence at a predetermined arrangement position determined in advance according to the attribute of the word or sentence.

The step of operating in cooperation is such that the character by the doll has entered the display as a character by the image, or such that the character by the image has jumped out of the display as a character by the doll 5. The information transmitting method according to claim 3, wherein a step of performing a visual operation is performed.

4. The information transmitting method according to claim 3, wherein the step of operating in cooperation includes executing the step of operating the plurality of characters so that they appear to be interacting with each other.

The reading out step includes:
Generating a melody;
Reading out the text original as a song according to the melody.

Means for collecting information;
Means for selecting a word or a sentence representing the title and content of the information from the information collected by the collecting means,
Means for determining the genre of the information based on the word or sentence selected by the selecting means;
Means for selecting a text example for creating a text manuscript based on the genre determined by the determining means;
Means for rearranging the words or sentences selected by the means for selecting the words or sentences based on the text example for text manuscript creation selected by the selecting means;
Means for creating a text document by linking the words or sentences sorted by the means for performing the sorting,
Means for reading out the text document created by the creating means by voice,
Means for operating the character in response to the reading of the reading means.

The means for operating the character includes means for selecting an operation pattern of the character according to the genre determined by the means for determining the genre,
10. The information transmitting apparatus according to claim 9, wherein the reading means includes means for selecting an utterance pattern at the time of reading according to the genre determined by the genre determining means.

The character is provided in plurality,
The reading means includes means for dividing the contents of the text document, assigning the contents to the plurality of characters, and reading the contents with different sounds.
The information transmitting apparatus according to claim 9, wherein the means for operating the character includes means for operating the plurality of characters in cooperation with each other in accordance with the content of the reading.

The means for rearranging the sentences,
Means for classifying the word or sentence selected by the means for selecting the word or sentence based on its attributes;
10. The information transmitting apparatus according to claim 9, further comprising: means for inserting the word or the sentence into a predetermined arrangement position predetermined according to its attribute.

The reading means is
Means for generating a melody;
10. The information transmitting apparatus according to claim 9, further comprising: means for reading out the text original as a song according to the melody.

A program which, when installed in an information processing apparatus, causes the information processing apparatus to execute each step of the information transmission method according to claim 1.

A program which, when installed in an information processing apparatus, causes the information processing apparatus to realize a function corresponding to each unit of the information transmitting apparatus according to any one of claims 9 to 13.