JP2004267433A

JP2004267433A - Information processor, server, program, recording medium for providing voice chat function

Info

Publication number: JP2004267433A
Application number: JP2003061544A
Authority: JP
Inventors: Yusuke Matsuzaki; 祐介松崎; Takashi Aoki; 青木　　隆
Original assignee: Namco Ltd
Current assignee: Namco Ltd
Priority date: 2003-03-07
Filing date: 2003-03-07
Publication date: 2004-09-30

Abstract

PROBLEM TO BE SOLVED: To make a user feel as if he/she is talking as a character in a virtual space in the case of performing voice chat arranging the character in the virtual space. SOLUTION: Voice transforming processing (S103, S105, S107) or voice synthesizing process (S109) which makes the relationship between the virtual space and voice intimate is performed prior to outputting the chat voice (S110). For example, the voice gets differently heard depending on the presence of objects other than the characters arranged in the virtual space, or character's voice made while he/she is moving the virtual space gets heard changing with the movement in real time. Further, the chat voice gets affected by voices other than the chat voice made in the virtual space or affected by the body condition the user really uttering the voice. COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、仮想空間に配置されたキャラクタを介して会話する音声チャットシステムへの適用に適した情報処理装置、サーバおよびそれらを制御するコンピュータプログラムに関する。
【０００２】
【従来の技術】
インターネット上で提供されているチャットサービスは、ユーザから入力されたテキストデータをネットワークを介してリアルタイムに他のユーザに転送することにより、物理的に離れているユーザ同士が文字による会話を楽しめるようにしたサービスである。もともとは、ユーザ同士の直接的なコミュニケーション手段として提案されたものであるが、近年、コンピュータ画面に表示されたキャラクタ同士を会話させるための機能としても利用されるようになった。例えば特許文献１には、３次元仮想現実空間にアバタと呼ばれるユーザの分身キャラクタを配置して、各ユーザが入力した文字を分身キャラクタの発言として他のユーザに伝えるシステムが開示されている。
【０００３】
また、近年、マイク入力された音声をデジタルデータに変換して交換することによって、電話と同様に音声で会話することができる音声チャットサービスも提供されはじめた。
【０００４】
【特許文献１】
特開２００１−３１２７４４号公報
【０００５】
【発明が解決しようとする課題】
前述のように、チャットには、ユーザが自分自身として会話する形態と、分身キャラクタとして会話する形態の２通りがある。前者と後者では、チャットシステムに期待される役割は若干異なる。前者の場合には各ユーザの発言がリアルタイムに正確に伝達されれば十分である。しかし、後者の場合、特にネットワーク・ロールプレイング・ゲームなどでは、正確な情報伝達のみならず、各ユーザが分身キャラクタに十分に感情移入できる雰囲気作りも重要である。
【０００６】
上記特許文献１記載の発明は、チャット文字列の属性を変えることによってアバタ同士の距離感を表現しようとしているが、このような文字による雰囲気作りには限界がある。そこで、本発明は、仮想空間の雰囲気を盛り上げ、臨場感溢れる会話を楽しむことができる音声チャットシステムを提供することを目的とする。さらには、臨場感を重視したことにより起こり得るいくつかの問題点も、合わせて解決する。
【０００７】
【課題を解決するための手段】
本発明は、音声チャットを行う際の臨場感を高めるための手段として、以下に説明するようなキャラクタ制御手段、チャットデータ発信手段、およびチャット音声出力手段を備える情報処理装置を提供する。また、コンピュータをそのような情報処理装置として機能させるプログラムも合わせて提供する。なお、プログラムは、ＤＶＤ、ＣＤ−ＲＯＭ、メモリカードなどのコンピュータ読み取り可能な記録媒体に記録して提供することができる。
【０００８】
キャラクタ制御手段は、複数のユーザによる共有が可能な仮想空間およびその仮想空間に配置されたユーザごとのキャラクタを管理するサーバと交信することにより、その情報処理装置のユーザが仮想空間に配置したキャラクタの行動を制御するための手段である。
【０００９】
チャットデータ発信手段は、情報処理装置のユーザが発した音声を表す第１音声データを取得し、その第１音声データを含むチャットデータを生成して発信する手段である。
【００１０】
チャット音声出力手段は、サーバと交信中の情報処理装置により生成されたチャットデータを取得し、そのチャットデータに含まれる第１音声データを使用して、チャットデータが生成された情報処理装置のユーザが配置したキャラクタの音声を出力する手段である。
【００１１】
本発明が提案する第１の情報処理装置あるいはプログラムでは、チャット音声出力手段は、取得したチャットデータに含まれる第１音声データを、仮想空間に配置されたオブジェクトの音声に係る属性に基づいて変換することにより第２音声データを生成し、その第２音声データが表す音声を前記キャラクタの音声として出力する。但し、変換の結果第１音声データと第２音声データが等しくなる場合もあり得る。オブジェクトの音声に係る属性は、例えば音吸収、音反射、音程変更などであり、仮想空間を構成する際にオブジェクトの他の属性とともに定義される。
【００１２】
言い換えれば、仮想空間内のキャラクタ以外のオブジェクトの存在によって音声の聞こえ方が変わるようにして、ユーザが、視覚のみならず聴覚によっても仮想空間の構造を認識できるようにする。これにより、臨場感を高めることができる。
【００１３】
また、本発明が提案する第２の情報処理装置あるいはプログラムでは、チャット音声出力手段は、取得したチャットデータに含まれる第１音声データを、チャットデータが生成された情報処理装置のユーザが配置したキャラクタと、この情報処理装置のユーザが配置したキャラクタの位置関係の単位時間あたりの変位量、すなわち変位速度に基づいて変換することにより第２音声データを生成し、第２音声データが表す音声を前記キャラクタの音声として出力する。
【００１４】
移動するキャラクタが発する音声をリアルタイムに変化させることにより、ユーザはキャラクタが移動中であることを聴覚によっても認識できるようになるので、ユーザの臨場感を高めることができる。
【００１５】
また、本発明が提案する第３の情報処理装置あるいはプログラムでは、チャットデータ発信手段は、ユーザに装着された所定のセンサによりユーザの身体情報を取得して、その身体情報を含むチャットデータを生成して発信する。また、チャット音声出力手段は、取得したチャットデータに含まれる第１音声データを、チャットデータに含まれる身体情報に基づいて変換することにより第２音声データを生成し、第２音声データが表す音声を前記キャラクタの音声として出力する。
【００１６】
ユーザの身体情報とは、キャラクタを操作するユーザの身体状態の変化を表す情報であり、例えば脈拍、発汗状態、体温などがある。キャラクタを操作するユーザの身体状態に合わせてキャラクタの声を変化させることにより、ユーザとキャラクタとが同化したように感じさせる効果を狙ったものである。
【００１７】
また、本発明が提案する第４の情報処理装置あるいはプログラムでは、チャット音声出力手段は、仮想空間を演出するための効果音を表す効果音データを取得し、第１音声データと効果音データを合成することにより第２音声データを生成し、第２音声データが表す音声を前記キャラクタの音声として出力する。
【００１８】
効果音とチャット音声の出力タイミングが重なった場合に、ユーザが発する音声が仮想空間内で発生する他の音の影響を受けるようにすることで、ユーザはあたかも仮想空間内で声を発しているかのように感じることができる。
【００１９】
さらに、上記各情報処理装置のチャット音声出力手段は、音声を出力する音声出力装置（スピーカ）と情報処理装置のユーザの位置関係に基づいて、音声出力装置ごとに第２音声データを生成することが望ましい。チャット音声をスピーカの配置を生かして出力することで、臨場感をさらに高めることができる。
【００２０】
次に、本発明は音声チャットの新たな機能として、チャット音声の再生機能を提案し、そのような機能を実現するための手段として、次のような情報処理装置とサーバ、さらにはコンピュータをそのような情報処理装置あるいはサーバとして機能させるプログラムを提供する。
【００２１】
本発明が提供する第５の情報処理装置およびプログラムは、第１から第４までの情報処理装置などと同じく、前述のようなキャラクタ制御手段、チャットデータ発信手段、およびチャット音声出力手段を備える。第５の情報処理装置およびプログラムでは、チャット音声出力手段は、各情報処理装置により生成された直後のチャットデータを取得してキャラクタの音声を出力する機能に加え、各情報処理装置により生成されサーバに蓄積保存されたチャットデータを取得してキャラクタの音声を再生する機能を提供する。なお、チャット音声出力手段は、音声データが取得された速度と異なる速度で前記キャラクタの音声を再生できることが望ましい。再生時の音声の早送りなどを可能にするためである。
【００２２】
さらに、チャット音声の再生機能を提供するために、以下のようなキャラクタ制御手段、チャットデータ配信手段、およびチャットデータ検索手段を備えたサーバおよびサーバ用のプログラムを提供する。
【００２３】
キャラクタ制御手段は、複数のユーザによる共有が可能な仮想空間を提供するとともに、仮想空間に各ユーザが配置したキャラクタの行動を、各ユーザの情報処理装置と交信することにより制御する手段である。
【００２４】
チャットデータ配信手段は、各ユーザの情報処理装置から、各ユーザが発した音声を表す音声データを含むチャットデータを受信して、そのチャットデータを所定の記憶媒体に記憶するとともに交信中の複数の情報処理装置に対し配信する手段である。チャットデータに含まれる音声データは、文字データに変換してから記憶媒体に記憶してもよい。
【００２５】
チャットデータ検索手段は、記憶媒体に記憶されたチャットデータの中からユーザが要求するチャットデータを検索し、そのユーザの情報処理装置に送信する手段である。これにより、サーバにアクセスする情報処理装置は、チャット音声を再生することができる。
【００２６】
さらに、本発明は、第６の情報処理装置あるいはプログラムとして、第１から第４までの情報処理装置と同じく、前述のようなキャラクタ制御手段、チャットデータ発信手段、およびチャット音声出力手段を備え、チャット音声出力手段が、この情報処理装置を使用するユーザが配置したキャラクタや出力する音声データが取得された情報処理装置のユーザが配置したキャラクタが有する属性やアイテムに基づいて、キャラクタの音声を出力する装置とプログラムを提供する。この装置あるいはプログラムによれば、ユーザは所定の属性やアイテムを獲得することで、自らの選択により音声の聞こえ方を変化させることができる。ユーザに、好みの聞こえ方を選択させることにより、チャット音声の聞こえ方に関してユーザにストレスを感じさせないようにするためである。
【００２７】
【発明の実施の形態】
以下、本発明の実施の形態について、ネットワークロールプレイングゲームを例にあげて説明する。はじめに、図１を参照して、ネットワークロールプレイングゲームと、その音声チャット機能の概要について説明する。
【００２８】
一般に、ネットワークゲームサービスは、ゲーム会社などが管理するサーバコンピュータにより提供される。本実施の形態のネットワークロールプレイングゲームサービスも、インターネットなどのネットワーク１に接続された１台または複数のコンピュータ（以下、サーバ２とする）により提供される。ゲームサービスの利用者（以下、ユーザと称する）は、通信機能を備えた情報処理装置、例えば家庭用ゲーム機器、パソコン、携帯用ゲーム機器、携帯電話、携帯情報端末などを操作し、ネットワーク１を介してサーバ２にアクセスしてゲームを行う。なお、通信機能は有線に限らず無線通信機能でもよい。図は、２人のユーザ４Ａ、４Ｂが、それぞれ情報処理装置３Ａ，３Ｂを使用してサーバ２にアクセスしている状態を表している。
【００２９】
サーバ２は、土地、建造物、その他多種多様なオブジェクトにより構成される仮想空間５を定義し、その仮想空間５をサーバ２にアクセスしたユーザが共有できるような状態で提供する。ユーザは、サーバ２により提供される仮想空間５に自分の分身に相当するキャラクタを配置して、そのキャラクタを動かすことによって、あたかも自分が仮想空間にいるかのような感覚を楽しむことができる。ユーザは、情報処理装置に接続された操作機器（コントローラ）を操作してサーバ２に所定の指示信号を送ることにより自分のキャラクタを動かすことができる。図は、ユーザ４Ａがコントローラ１１Ａを操作してキャラクタ６Ａを動かし、ユーザ４Ｂがコントローラ１１Ｂを操作してキャラクタ６Ｂを動かした結果、仮想空間においてキャラクタ６Ａとキャラクタ６Ｂが遭遇したところを例示している。
【００３０】
各ユーザが使用する情報処理装置３Ａ、３Ｂには、音声を入力するためのマイクおよび音声を出力するためのスピーカが、内蔵または接続されている。さらに、本実施の形態では、各ユーザは、身体のいずれかの部位に、脈拍を測定するための脈拍センサを装着する。マイク、スピーカ、脈拍センサの形態は、どのような形態であってもよい。例えば、図は、ユーザ４Ａ、４Ｂが、音声入力用マイク７Ａ，７Ｂと音声出力用イヤホン８Ａ，８Ｂを備えたマイク付きヘッドホンを装備し、さらに、頭のこめかみ部分にあたるように脈拍センサ１２Ａ，１２Ｂを装着した状態を例示している。
【００３１】
音声チャットは、マイク７Ａ，７Ｂから入力された音声を表す音声データを含むチャットデータを生成し、ネットワーク１およびサーバ２を介してチャットデータを交換することにより実現される。例えばユーザ４Ｂの発声音９がマイク７Ｂを通して情報処理装置３Ｂに入力されると、情報処理装置３Ｂからサーバ２に発声音９を表す音声データを含むチャットデータが転送される。そのチャットデータはサーバ２により情報処理装置３Ａ，３Ｂに配信され、ユーザ４Ａのイヤホン８Ａ、ユーザ４Ｂのイヤホン８Ｂから音声１０が出力される。
【００３２】
ネットワークロールプレイングゲームでは、音声チャットを行うユーザは、通常分身キャラクタになったつもりで音声を発する。言い換えれば、ユーザ同士の直接的な会話ではなく、キャラクタを介した間接的な会話が行われる。このため、以下の説明では、ユーザがキャラクタとして発声することを、必要に応じて「キャラクタが発声する」などと表現する。
【００３３】
本実施の形態のネットワークロールプレイングゲームの音声チャット機能は、以下に説明するようないくつかの特徴を有する。
【００３４】
第１に、キャラクタが発した音声が、そのキャラクタの周辺にある物体の形状、位置、向き、特性などによって変化して聞こえる。言い換えれば、仮想空間に配置されたキャラクタ以外のオブジェクトの属性が、キャラクタが発した音声の聞こえ方に影響する。各オブジェクトの属性は、仮想空間を構成する時点で定義される。
【００３５】
例えば、図１に例示した仮想空間５では、オブジェクト１３（トンネル）の属性の１つとして、音反響特性が定義されている。この場合、オブジェクト１３付近にいるキャラクタ６Ｃがオブジェクト１３の方を向いて音声を発した場合に、その音声が反響して聞こえる。キャラクタ６Ｃがオブジェクト１３と反対の方向を向いて音声を発した場合には、音声が反響することはない。また、オブジェクト１４（植物）は、音吸収特性を有する。この場合、オブジェクト１４を挟んで会話しようとしたキャラクタ６Ｄとキャラクタ６Ｅは、いずれも発した音声がオブジェクト１４により吸収されてしまうため、互いの声が聞こえず、会話することができない。一方、オブジェクト１３からもオブジェクト１４からも所定距離以上離れた位置にいるキャラクタ６Ａおよび６Ｂは、オブジェクトの影響を受けることなく、普通に会話することができる。
【００３６】
音声に係るオブジェクト属性としては、上記反響、吸収以外にも、例えば発した音声が段階的に小さくなって聞こえる特性や、オクターブ高くなって聞こえる特性など、多種多様な属性を自由に定義することができる。音声に係る属性は、オブジェクト１３の例のように現実世界に近い環境となるように定義してもよいし、オブジェクト１４の例のように非現実的な現象を起こす属性としてもよい。
【００３７】
キャラクタのいる場所によってそのキャラクタが発した音声の聞こえ方が変化するということは、音声が仮想空間の構造と密接に関連しているということに他ならない。これにより、各ユーザは、あるキャラクタが仮想空間のある場所から別の場所に移動したということを、そのキャラクタが発した音声の聞こえ方の変化から感じとることができる。すなわち、従来、視覚でのみ捉えていた事象を、聴覚によっても捉えることになるので、従来よりも臨場感が増す。
【００３８】
また、音声チャットは、本来はコミュニケーションを図るための機能であるが、本実施の形態では、キャラクタを移動させながら独り言を発し、音声の聞こえ方の変化を楽しむこともできる。さらには、会話したくない相手と遭遇した際に、わざと音声が聞こえにくくなる場所にキャラクタを移動させるなど、音声の聞こえ方を利用しながらゲームを進行させることもできる。
【００３９】
次に、第２の特徴について説明する。本実施の形態のゲームでは、会話をしているキャラクタの位置関係が時間の経過とともに変化する場合、音声もまた時間の経過とともにリアルタイムに変化する。例えば、車に乗って走り去ろうとするキャラクタが車を走らせながら発した音声は、その音声を聞くキャラクタとの距離が離れるほどに徐々に小さくなって聞こえる。すなわち、キャラクタ同士の相対的な位置関係のみならず位置関係の変化をも検出して、検出した変化を出力する音声に反映させる。位置関係の変化と音声変化の関係は、上記例のように現実世界に類似した関係としてもよいが、仮想空間特有の非現実的な関係を定義してもよい。例えば発声中のキャラクタが仮想空間内を瞬時移動（ワープ）した場合に、ワープしたタイミングで音声が聞こえなくなったり、あるいは突然音声が聞こえるようになるといった関係を定義することができる。
【００４０】
キャラクタの動きに合わせてそのキャラクタが発した音声の聞こえ方が変化するということもまた、ユーザが発した音声と仮想空間との関係を密にすることに他ならない。上記第１の特徴と同様に、ユーザは、あるキャラクタが仮想空間のある場所から別の場所に移動中であるということを、そのキャラクタが発した音声の聞こえ方の変化から感じとることができる。従来、視覚でのみ捉えていた事象を、聴覚によっても捉えることができるようになるので、臨場感が増す。
【００４１】
次に、第３の特徴について説明する。ロールプレイングゲームでは、通常、場面あるいはキャラクタがとった行動に合わせて、演出のための効果音が出力される。例えば風の音や衝突音などである。これらの効果音は仮想空間と同じく、ゲームを作成し提供する側によって定義される。本実施の形態のロールプレイングゲームでは、キャラクタが音声を発したタイミングが、このような効果音の出力タイミングと重なった場合に、キャラクタの音声の聞こえ方が変化する。言い換えれば、ゲーム制御プログラムにより出力される音声データと、マイク入力により取得された音声データとが、所定の規則にしたがって合成された後に出力される。
【００４２】
例えば、強風の場面では、風の音とキャラクタの声が重なって聞こえることとなるが、この場合には、風の音の重み付けを大きくすることにより、キャラクタの音声が風の音にかき消されて聞こえにくくなるようにする。音声を合成する際の規則は、現実世界に類似する聞こえ方になるような規則としてもよいし、仮想世界特有の非現実的な聞こえ方になるような規則としてもよい。いずれの場合も、種々の規則が考えられることは言うまでもない。
【００４３】
仮想空間の中で発生した音によって、キャラクタが発した音声の聞こえ方が変化するということもまた、ユーザが発した音声と仮想空間との関係を密にすることに他ならない。これにより、音声を発するユーザは、あたかも自分自身が仮想空間内で音声を発しているかのような臨場感を味わうことができ、音声を聞く側のユーザは、仮想空間内のキャラクタが実際に音声を発しているかのような感覚を味わうことができる。
【００４４】
次に、第４の特徴について説明する。本実施の形態のロールプレイングゲームでは、前述のように各ユーザは脈拍センサ１２Ａ，１２Ｂを装着しており、ユーザが発した音声は、ユーザの脈拍に応じて変化する。
【００４５】
例えば、脈拍が極度に高いことが検出された場合には、ユーザ自身が平常通りの声を発していたとしても、その音声を聞く側のユーザには、声が震えたり、高くなったりして聞こえる。さらには、脈拍の高いユーザのキャラクタは、例えば、顔色が赤くなる（あるいは青くなる）など、画面表示も変化する。なお、脈拍センサに代えて発汗センサや温度センサを装着するようにしてもよい。センサは、ユーザの身体状態の変化を検出する目的で装着するものであるため、この目的にかなうセンサであればどのようなものであってもよい。
【００４６】
上述のように、ユーザの身体情報をユーザのキャラクタの発声音に反映させた場合、発声する側のユーザはキャラクタと同化してゲームを楽しむことができる。また、音声を聞く側のユーザは、キャラクタの発声音から、そのキャラクタを操作するユーザの状態、あるいは性格を垣間見ることができる。
【００４７】
次に、上述の音声チャット機能を提供するための手段について説明する。図２はユーザが使用する情報処理装置３の機能について説明するための図である。図に示すように、情報処理装置３は、キャラクタ制御機能１６と、チャットデータ発信機能１７とチャット音声出力機能１８を備える。詳細には、これらの機能は、情報処理装置３に組み込まれる制御プログラムにより実現される。
【００４８】
キャラクタ制御機能１６は、図示されないサーバからネットワーク１を介して仮想空間やキャラクタの配置位置の情報を受信し、受信した情報に基づいてディスプレイ１５に仮想空間の一部の領域を表示する一方、コントローラ１１からの操作入力を受け付けて、サーバに対しキャラクタの行動を指定する情報を送信する機能である。
【００４９】
チャットデータ発信機能１７は、マイク７からの音声入力と、センサ１２からの脈拍情報の入力を受け付けて、それらの情報を含むチャットデータを生成し、ネットワーク１を介してサーバに発信する機能である。チャットデータは、図３に示すように、少なくとも、マイク入力されデジタル化された発声音２１、発声音２１の発声時刻１９、および発声キャラクタ２０の情報の３種類の情報を含むフォーマットとする必要がある。あるいは、発声キャラクタ２０に代えて、ユーザを特定する情報を付加してもよい。また、本実施の形態では、チャットデータには、センサにより取得した脈拍などのユーザ身体情報２２も含まれている。
【００５０】
チャット音声出力機能１８は、サーバ２からネットワーク１を介して転送される音声のデータを受信して、スピーカ８に出力する機能である。サーバ２から転送される音声のデータには、各情報処理装置のチャットデータ発信機能１７により発信されたチャットデータのほか、演出のための効果音を表す演出音データがある。演出音データは、図４に示すように、少なくとも、効果音２４と、効果音２４の出力タイミング２３の情報が含まれている。演出音データは、サーバ２から転送される場合もあるが、予め情報処理装置３が保持している場合もある。
【００５１】
図５は、情報処理装置３のハードウェア構成を表す図である。情報処理装置３は、少なくともＣＰＵ２５，ＲＡＭ２６，通信制御部２７、入出力制御部２８、操作入力制御部２９、表示出力制御部３０、音声入出力制御部３１、センサ入力制御部３２およびそれらを接続するシステムバス３３を備えている。
【００５２】
通信制御部２７はネットワーク１と接続され、サーバ２とのプログラム、データのやりとりを制御する。また、入出力制御部２８は、ＣＤ−ＲＯＭやＤＶＤ３３、メモリカード３４、ハードディスク３５などの記録媒体からのデータの読取りおよびそれらの記録媒体へのデータの書き込みを制御する。操作入力制御部２９は、情報処理装置３に外部接続されたコントローラ１１などの入力機器からのユーザ入力を制御する。受け付けた入力はシステムバス３３を介してＣＰＵ２５に伝達される。表示出力制御部３０は、制御プログラムが出力する画像のディスプレイ１５への表示を制御する。音声入出力制御部３１は、マイク７からの音声入力とスピーカ８への音声出力を制御する。さらに、センサ入力制御部３２は、脈拍センサ１２からのセンサ入力を制御する。
【００５３】
図２の各機能を提供する制御プログラムは、ＣＤ−ＲＯＭやＤＶＤ３３によって提供されるか、またはサーバ２からのダウンロードにより提供される。いずれの場合も、制御プログラムはＲＡＭ２６にロードされ、ＣＰＵ２５によって実行される。ＣＰＵ２５は、制御プログラムに基づいて、操作入力制御部２９、表示出力制御部３０および通信制御部２７との間で指示信号などを交換することによりキャラクタ制御機能１６を実現する。同様に、チャットデータ発信機能１７は、ＣＰＵ２５が制御プログラムに基づいて音声入出力制御部３１、センサ入力制御部３２および通信制御部２７と信号などを交換することにより実現され、チャット音声出力機能１８は、通信制御部２７や音声入出力制御部３１と信号などを交換することにより実現される。
【００５４】
図６は、図２のチャット音声出力機能１８に対応する制御プログラムの処理の概要を表すフローチャートである。図に示すように、制御プログラムは、ステップＳ１０１においてチャットデータを受信すると、まずステップＳ１０２において、そのチャットデータの発信元のユーザに対応するキャラクタの周辺に音声出力に影響する属性を有するオブジェクトが存在するか否かを判定する。仮想空間の構成とキャラクタの配置位置についての情報は、前述のようにキャラクタ制御機能により取得済みであるので、判定はその情報を利用して行うことができる。例えば、音声を発したキャラクタを中心とした所定半径の円状領域、あるいはキャラクタの正面の所定角度の扇型領域内に、音声に係る属性を有するオブジェクトが存在するか否かを判定する。
【００５５】
音声に係る属性を有するオブジェクトが存在した場合には、次にステップＳ１０３においてそのオブジェクト属性に基づいて、チャットデータに含まれる発声音を変換する。例えば、オブジェクトの属性が音反響特性であれば、発声音に対しエコー処理を施した後、処理後の音声データをメモリに保存する。音声に係る属性を有するオブジェクトがキャラクタの周辺に存在しない場合には、ステップＳ１０３の処理は実行しない。
【００５６】
次に、ステップＳ１０４において、音声を発したキャラクタと、この音声出力処理を実行する装置を使用しているユーザのキャラクタとの相対的な位置関係を、仮想空間およびキャラクタ配置の情報に基づいて計算する。さらに、２キャラクタのいずれか、もしくは両方が移動中である場合には、単位時間あたりの位置関係の変化（変位速度）を求め、変位速度が音声に影響する程度の速度か否かを所定の閾値との比較により判定する。
【００５７】
変位速度が閾値以上である場合には、ステップＳ１０５において、変位速度に基づく音声変換処理を行う。ステップＳ１０３において、変換処理が行われていた場合には、メモリに保存されている処理後の音声データに対してさらに変換処理を施す。変位速度が音声に影響しない程度である場合には、ステップＳ１０５の処理は実行しない。
【００５８】
次に、ステップＳ１０６においてチャットデータに脈拍などのユーザ身体情報が含まれているか否かを判定する。音声に影響するようなユーザ身体情報（例えば脈拍が非常に高いという情報）が含まれている場合には、ステップＳ１０７において、そのユーザ身体情報に基づく変換処理を実行する。例えば前述のように、声が震えて聞こえるように発声音を変換する。ステップＳ１０３あるいはＳ１０５において、変換処理が行われていた場合には、メモリに保存されている変換後の音声データに対してさらに変換処理を施す。変換後のデータは再びメモリに保存する。一方、音声に影響するようなユーザ身体情報が含まれていなかった場合には、ステップＳ１０７の処理は実行しない。
【００５９】
次に、ステップＳ１０８において、出力中あるいはこれから出力しようとする効果音があるか否かを演出音データに含まれる出力タイミングの情報に基づいて判定する。効果音がある場合には、ステップＳ１０９において、発声音と効果音を、前述のように所定の重み付けを行うなどして、合成する。合成の方法は、効果音の種類ごとに、予め定義しておくのがよい。ステップＳ１０３、Ｓ１０５あるいはＳ１０７において変換処理が行われていた場合には、メモリに保存されている処理後の音声を効果音と合成する。なお、効果音がない場合には、ステップＳ１０９の処理は行わない。
【００６０】
次に、ステップＳ１１０において変換あるいは合成された音声を出力する。以上の処理により、前述の４つの特徴を備えた音声チャット機能を実現することができる。但し、図６に示したフローチャートは、上記４つの特徴すべてを備えるための処理を示したものであるが、上記各特徴は単独で臨場感を増す効果を奏するものであり、必ずしもすべての特徴を組み合わせる必要はない。
【００６１】
以上に説明したように、音声の聞こえ方は、音声変換や、他の音声との合成により音声データ自体を加工することにより、変化させることができる。一方、音声の聞こえ方は、音声を聞くユーザと音声が出力されるスピーカの位置によっても変わることは経験的に知られている。
【００６２】
そこで、本実施の形態では、上記音声の変換や合成を行う際に、ユーザの右側に配置されるスピーカ用、左側に配置されるスピーカ用というように、スピーカの配置位置ごとに異なる出力用データを生成する。例えば、あるキャラクタが音声を発しながら、情報処理装置を使用するユーザのキャラクタからみて右方向に高速で移動した場合には、左側のスピーカから出力する発声音は音量が段階的に小さくなるようにし、右側のスピーカから出力する発声音は音量が段階的に大きくなるようにする。これにより、発声キャラクタが右方向に高速移動したことが聴覚により実感でき、臨場感が増す。３以上のスピーカが配置されることを想定して、より多くの出力用データを生成するようにしてもよいことは言うまでもない。
【００６３】
次に、臨場感を増すことにより起こり得る問題と、その問題を解決するための手段について説明する。前述の説明からも明らかであるように、音声の聞こえ方を変化させることにより臨場感を増すということと、音声を聞こえやすくするということは、必ずしも両立しない。このため、臨場感を楽しむユーザがいる一方で、音声が聞こえにくいことにストレスを感じるユーザもいる可能性がある。そこで、本実施の形態のネットワークロールプレイングゲームは、すべてのユーザがストレスを感じることなく前述の臨場感を楽しめるよう、いくつかの新たな機能を備える。
【００６４】
図７は、聞き取りにくさのストレスを緩和するための第１の機能について説明するための図である。仮想空間において、ユーザ４Ａのキャラクタ６Ａと他のユーザが操作するキャラクタ６Ｂおよび６Ｃの間には、大音量ノイズを発するオブジェクト３６が配置されている。ユーザ４Ａのヘッドホンのスピーカ８Ａからは、音声１０が出力されているが、オブジェクト３６の影響を受けて音質が悪化しているため、ユーザ４Ａは音声１０が、どのキャラクタの声であるかを判別することができない。
【００６５】
第１の機能は、このようなケースで、発声キャラクタを容易に判別できるようにするための機能である。具体的には、図７に示すように、発声中のキャラクタ６Ｂの周辺に、発声中であることを示すマーク３７を表示する。あるいは、「発声中」などの文字を発声キャラクタ６Ｂの周辺の表示してもよい。さらには、発声キャラクタ本体の色を変化させたり、キャラクタの口を動かすなどしてもよい。これにより、ユーザ４Ａは、音声１０の発生元がキャラクタ６Ｂであることを容易に認識することができる。これにより、例えばキャラクタ６Ａをキャラクタ６Ｂの近くまで移動させて、再度会話を交わすことにより、聞き取り損ねた発言の内容を確認することができる。第１の機能は、図２に示した制御プログラムの構成において、チャット音声出力機能１８とキャラクタ制御機能１６を連携させることにより実現することができる。
【００６６】
ここで、第１の機能では、キャラクタ６Ａがキャラクタ６Ｂに問いかけを行い、キャラクタ６Ｂが再度同じ発言をすることによってはじめて、聞き取り損ねた発言の内容が明らかになる。言い換えれば、キャラクタ６Ｂが発言を繰り返すことを拒んだ場合には、聞き取り損ねた発言の内容を知ることができない。そこで、そのような場合でも、聞き取り損ねた発言の内容を知ることができるように、本実施の形態では、第２の機能として音声再生機能を提供する。
【００６７】
第２の機能として提供する音声再生機能は、ユーザから所定の指示入力があった場合に、キャラクタの過去の発声音を再生する機能である。指示入力のためのユーザインタフェースは種々考えられるが、例えば図８に示すように、画面に音声再生指示のためのメニュー３８を表示する方法が考えられる。図８の例は、キャラクタ６Ｂにカーソル３９を合わせて所定のボタン操作を行うことにより操作メニュー３８を表示させる例である。さらにカーソル３９を操作してメニュー項目の中から所望の指示を選択すれば、過去の発言の一部または全部を再生することができる。第１の機能のみとした場合には、キャラクタ６Ｂを操作するユーザは、多数のキャラクタから再発言を求められた場合に何度も同じ発言を繰り返さなければならないが、第２の機能によれば、キャラクタ６Ｂを操作するユーザは再発言を求められることはない。また、キャラクタ６Ａを操作するユーザ４Ａも、キャラクタ６Ｂを操作するユーザに気兼ねすることなく、知りたい内容を確認することができる。
【００６８】
さらに、本実施の形態では、図９に示すように、キャラクタを指定することなく、仮想空間内の所定の場所（部屋など）で行われた会話を、まとめて再生する機能も提供する。この機能によれば、ユーザが所定のボタン操作を行った場合に操作メニュー４０が画面に現れる。ユーザは、カーソル３９を操作することによりメニュー項目のいずれかを選択し、仮想空間内の所定の場所において、過去に行われた会話の一部あるいは全部を再生することができる。本実施の形態では、多くの会話を短時間に再生できるように、早送り再生機能も提供する。
【００６９】
なお、音声再生機能は、音声を聞き取り損ねた場合の聞き直しに限らず、ゲームに途中から参加したユーザにとっても有用な機能である。例えば図９は、キャラクタ６Ａを操作するユーザがゲームに途中参加し、会話中のキャラクタ６Ｂ〜６Ｄに遭遇した例を示している。このようなケースで、キャラクタ６Ａを操作するユーザは、この場所で過去に行われた会話を再生することにより、３人のキャラクタの間でなされた会話の内容を把握することができる。この場合、キャラクタ６Ａが、状況把握のために多くの質問をして他のユーザを煩わせることがなくなる。
【００７０】
第２の機能を提供するためには、過去の発言に係る音声データをすべて記憶しておく必要がある。聞き直しのみを目的とした再生機能を提供する場合には、情報処理装置側に再生用のデータを蓄積してもよいが、前述のように聞き直し目的に限らず全キャラクタの全発言内容を再生できるようにするためには、サーバ側に再生用のデータを蓄積しておくのがよい。
【００７１】
図１０は、音声再生を行うために必要なサーバ側の機能を示す図である。図に示すように、サーバ２は、仮想空間を提供するとともにキャラクタの配置を管理する仮想空間／キャラクタ制御機能４１と、各情報処理装置からチャットデータを受信して他の情報処理装置に配信するチャットデータ配信機能４２と、各情報処理装置からの要求に応じてデータベース４４を検索することにより要求されたチャットデータを取得して送信するチャットデータ検索機能４３を備える。
【００７２】
チャットデータは、チャットデータ配信機能４２によりデータベース４４に蓄積保存される。この際、チャットデータに含まれる発声時刻、発声キャラクタの情報に加え、発声場所など検索に必要な他の情報が付加される。チャットデータの保存は、受信したままのチャットデータを保存する方法のほか、チャットデータに含まれる音声データを文字データに変換し、発声時刻、発声キャラクタ、発声内容を表す文字データを含むデータとして保存する方法も考えられる。発声内容を文字データとして保存した場合には、音声の再生を要求された場合に、音声と合わせて文字を表示することも可能になる。また、文字情報の検索技術は数多く知られているため、それらの技術を用いれば、発声キャラクタ、発声時刻のみならず、発声内容の検索も可能になる。さらには、近年、音声データを標準音声符号に変換して音声符号の一致、不一致により音声の検索を行う技術も提案されている。したがって、音声データを標準音声符号に変換して保存し、音声符号に基づく検索を行ってもよい。
【００７３】
また、例えばＭＰ３などの標準音声圧縮技術の多くは、音声の早送り再生についても規格を定めている。したがって、チャットデータに含まれる発声音の情報を、標準のデータ形式で保存すれば、前述の早送り機能を提供することができる。なお、検索用データと一括再生や早送り再生用のデータを別個に異なるデータ形式で保存しておき、目的に応じて使い分けてもよい。
【００７４】
図１１は、情報処理装置３により実行される再生処理と、サーバ２により実行される検索処理を表したフローチャートである。図に示すように、情報処理装置３は、ステップＳ２０１において、図８あるいは図９のような操作メニューから音声再生指示の入力を受け付けると、ステップＳ２０２において、キャラクタ、発声時刻、会話がなされた場所など検索のキーワードの情報を含む再生要求をサーバ２に送信する。サーバ２は、ステップＳ３０１において再生要求を受信すると、ステップＳ３０２において、検索キーワードの有無を確認することによって、一部再生が要求されているか、過去の全音声の再生が要求されているかを判定する。検索キーワードが含まれている場合には、ステップＳ３０３において、そのキーワードを使用してデータベース４４の検索を行う。ステップＳ３０４では、検索により取得したチャットデータを再生要求を送信した情報処理装置に送信する。なお、過去の全音声の再生が要求された場合には、データベース４４に保管されているチャットデータを時系列に並べて情報処理装置に送信する。
【００７５】
情報処理装置は、ステップＳ２０３において、サーバ２が送信したチャットデータを受信し、ステップＳ２０４においてそのチャットデータに含まれる音声を再生出力する。
【００７６】
次に、聞き取りにくさのストレスを緩和するための第３の機能について説明する。第３の機能は、キャラクタが所定のアイテムを所持している場合、あるいはキャラクタが所定の属性を有する場合など、キャラクタが所定の条件を満たす場合に、臨場感を出すための音声変換処理あるいは音声合成処理を省略して音声を出力する機能である。
【００７７】
図１２は、第３の機能を提供するための処理を示すフローチャートである。図に示すように、図１２に示すフローチャートは、図６に示したフローチャートのステップＳ１０１の後に、チャットデータを受信したユーザ（音声の聞き手のユーザ）のキャラクタが、指定アイテムを所持しているか否かを判定するステップＳ１１１を追加したものである。指定アイテムは固定的としてもよいが、チャットデータごとに定義してもよい。すなわち、チャットデータに指定アイテムの情報を含めておいてもよい。
【００７８】
ステップＳ１１１において、聞き手のキャラクタが指定アイテムを所持していないと判定した場合には、図６のステップＳ１０２以降の処理を実行する。一方、聞き手のキャラクタが指定アイテムを所持している場合には、図６のステップＳ１０２〜Ｓ１０９までの処理は実行しない。この場合、音声変換や合成処理は行われないので、ステップＳ１１０では、チャットデータに含まれる音声がそのまま出力される。
【００７９】
第３の機能によれば、例えば指定アイテムをトランシーバとした場合、図１３に示すように、トランシーバ４５Ａを所持するキャラクタ６Ａとトランシーバ４５Ｂを所持するキャラクタ６Ｂは、周辺オブジェクトや効果音の影響を受けることなく、常に聞き取りやすい音声で会話することができる。
【００８０】
同様に、図１２のステップＳ１１１においてキャラクタの属性を参照し、属性によって、音声変換あるいは合成の要否を決定してもよい。この場合、例えば、人間属性のキャラクタがロボット属性のキャラクタと会話する際には音声が聞き取りにくくなることがあるが、人間属性のキャラクタ同士が会話するときには常に鮮明な音声で会話できるといった仕様を実現することができる。グループに分かれて対戦を行うタイプのゲームであれば、グループ分けを属性として定義しておくことにより、味方グループのみとチャットできる仕様を実現することもできる。
【００８１】
第３の機能によれば、ユーザは、所定の条件を満たすことによって、従来と同じく発声したままの音声によって会話を行うことができるようになるため、臨場感を優先するユーザと、聞き取りやすさを優先するユーザが、いずれもストレスを感じることなく一緒にゲームを楽しむことができるようになる。
【００８２】
以上に説明したように、本実施の形態のロールプレイングゲームでは、音声チャットを行う際に、仮想空間と密接に関連した音声が出力されるため、ユーザは、視覚のみならず聴覚によっても臨場感を感じることができる。また、臨場感を重視した結果、一時的に音声が聞き取りにくくなることがあるとしても、音声を聞き直す、あるいは鮮明な音声に聞くための工夫が施されているため、ユーザがストレスを感じることはない。
【００８３】
なお、ネットワークロールプレイングゲームを例に説明したが、本発明はキャラクタを介して音声チャットを行うあらゆるシステムに適用可能な技術であることは言うまでもない。
【００８４】
【発明の効果】
本発明の各情報処理装置あるいはプログラムによれば、チャット音声は、仮想空間と音声の関連を密にする処理が施された後に出力されるので、その音声を聞くユーザは、あたかも仮想空間内で会話をしているような臨場感を感じることができる。
【００８５】
また、本発明の他の各情報処理装置あるいはプログラムによれば、チャット音声が聞き取りにくい場合でも、ユーザにストレスを感じさせることがない。
【図面の簡単な説明】
【図１】ネットワークロールプレイングゲームと、その音声チャット機能の概要について説明するための図である。
【図２】本発明の情報処理装置の機能を示す図である。
【図３】チャットデータのフォーマットの一例を表す図である。
【図４】演出音データのフォーマットの一例を表す図である。
【図５】情報処理装置のハードウェア構成を表す図である。
【図６】本発明のプログラムの一実施の形態における処理概要を表すフローチャートである。
【図７】発声キャラクタを判別するための機能について説明するための図である。
【図８】チャット音声のキャラクタごとの再生機能について説明するための図である。
【図９】チャット音声の一括再生機能について説明するための図である。
【図１０】チャット音声の再生に必要なサーバの機能を示す図である。
【図１１】チャット音声を再生する処理の概要を表すフローチャートである。
【図１２】所持アイテムに基づく音声出力処理を表すフローチャートである。
【図１３】所持アイテムに基づく音声出力処理について説明するための図である。
【符号の説明】
２サーバ、３，３Ａ，３Ｂ情報処理装置、４Ａ，４Ｂユーザ、５仮想空間、６Ａ〜６Ｅキャラクタ、７，７Ａ，７Ｂマイク、８，８Ａ，８Ｂスピーカ（イヤホン）、９発声音、１０出力される音声、１１，１１Ａ，１１Ｂコントローラ、１２，１２Ａ，１２Ｂセンサ、１３，１４，３６オブジェクト、３３システムバス、３４メモリカード、３５ハードディスク、３７マーク、３８，４０操作メニュー、３９カーソル、４５Ａ，４５Ｂトランシーバ。[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an information processing apparatus, a server, and a computer program that controls the information processing apparatus suitable for application to a voice chat system in which conversation is performed via characters arranged in a virtual space.
[0002]
[Prior art]
The chat service provided on the Internet allows users who are physically separated to enjoy text-based conversations by transferring text data input from users to other users in real time via the network. Service. Originally proposed as a means for direct communication between users, it has recently been used as a function for allowing characters displayed on a computer screen to have a conversation. For example, Patent Document 1 discloses a system in which a user's surrogate character called an avatar is placed in a three-dimensional virtual reality space, and a character input by each user is transmitted to another user as an utterance of the surrogate character.
[0003]
In recent years, a voice chat service has started to be provided that enables voice conversation similar to a telephone by converting voice inputted by a microphone into digital data and exchanging it.
[0004]
[Patent Document 1]
JP 2001-31744 A
[0005]
[Problems to be solved by the invention]
As described above, there are two types of chat: a mode in which the user has a conversation as himself and a mode in which the user has a conversation as a substitute character. The roles expected of the chat system are slightly different between the former and the latter. In the former case, it is sufficient that each user's speech is accurately transmitted in real time. However, in the latter case, particularly in network role-playing games, etc., it is important not only to accurately transmit information, but also to create an atmosphere in which each user can fully transfer emotions to the characters.
[0006]
The invention described in Patent Document 1 tries to express a sense of distance between avatars by changing attributes of a chat character string, but there is a limit to creating an atmosphere using such characters. Accordingly, an object of the present invention is to provide a voice chat system that can enhance the atmosphere of a virtual space and enjoy a conversation full of realism. Furthermore, some problems that may arise due to the importance of presence are also solved.
[0007]
[Means for Solving the Problems]
The present invention provides an information processing apparatus including character control means, chat data transmission means, and chat voice output means as described below as means for enhancing the sense of presence when performing voice chat. A program for causing a computer to function as such an information processing apparatus is also provided. The program can be provided by being recorded on a computer-readable recording medium such as a DVD, a CD-ROM, or a memory card.
[0008]
The character control means communicates with a server that manages a virtual space that can be shared by a plurality of users and a character for each user placed in the virtual space, so that a character placed by the user of the information processing device in the virtual space. It is a means for controlling the behavior.
[0009]
The chat data transmission means is means for acquiring first voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the first voice data, and transmitting the chat data.
[0010]
The chat voice output means acquires chat data generated by the information processing apparatus in communication with the server, and uses the first voice data included in the chat data to use the user of the information processing apparatus that generated the chat data. Is means for outputting the voice of the arranged character.
[0011]
In the first information processing apparatus or program proposed by the present invention, the chat voice output means converts the first voice data included in the acquired chat data based on the attribute relating to the voice of the object arranged in the virtual space. By doing so, the second voice data is generated, and the voice represented by the second voice data is output as the voice of the character. However, the first audio data and the second audio data may be equal as a result of the conversion. The attributes relating to the sound of the object are, for example, sound absorption, sound reflection, and pitch change, and are defined together with other attributes of the object when configuring the virtual space.
[0012]
In other words, the way the sound is heard changes depending on the presence of an object other than the character in the virtual space, so that the user can recognize the structure of the virtual space not only visually but also by hearing. Thereby, a sense of reality can be enhanced.
[0013]
In the second information processing apparatus or program proposed by the present invention, the chat voice output means arranges the first voice data included in the acquired chat data by the user of the information processing apparatus in which the chat data is generated. The second voice data is generated by converting the character and the positional relationship between the character placed by the user of the information processing apparatus based on the displacement amount per unit time, that is, the displacement speed, and the voice represented by the second voice data is converted into the second voice data. Output as the voice of the character.
[0014]
By changing the sound produced by the moving character in real time, the user can recognize that the character is moving by hearing, so that the user's sense of reality can be enhanced.
[0015]
Further, in the third information processing apparatus or program proposed by the present invention, the chat data transmission means acquires the user's physical information by a predetermined sensor attached to the user and generates chat data including the physical information. Then make a call. The chat voice output means generates second voice data by converting the first voice data included in the acquired chat data based on the physical information included in the chat data, and the voice represented by the second voice data. Is output as the voice of the character.
[0016]
The user's physical information is information representing a change in the physical state of the user who operates the character, and includes, for example, a pulse, a sweating state, and a body temperature. By changing the voice of the character in accordance with the physical state of the user who operates the character, the effect is to make the user and the character feel assimilated.
[0017]
In the fourth information processing apparatus or program proposed by the present invention, the chat sound output means acquires sound effect data representing sound effects for producing a virtual space, and uses the first sound data and sound effect data. The second voice data is generated by the synthesis, and the voice represented by the second voice data is output as the voice of the character.
[0018]
If the output timing of the sound effect and chat voice overlap, the user's voice is affected by other sounds generated in the virtual space, so that the user is speaking in the virtual space. Can feel like.
[0019]
Further, the chat voice output means of each information processing device generates second voice data for each voice output device based on the positional relationship between the voice output device (speaker) that outputs voice and the user of the information processing device. Is desirable. By outputting the chat voice by utilizing the arrangement of the speakers, the sense of reality can be further enhanced.
[0020]
Next, the present invention proposes a chat voice playback function as a new function of voice chat. As means for realizing such a function, the following information processing apparatus and server as well as a computer are used. Provided is a program that functions as an information processing apparatus or server.
[0021]
A fifth information processing apparatus and program provided by the present invention includes the character control means, chat data transmission means, and chat voice output means as described above, as in the first to fourth information processing apparatuses. In the fifth information processing apparatus and program, the chat voice output means is a server generated by each information processing apparatus in addition to the function of acquiring the chat data immediately after being generated by each information processing apparatus and outputting the voice of the character. The function of acquiring the chat data stored and stored in and reproducing the voice of the character is provided. It is desirable that the chat voice output means can reproduce the voice of the character at a speed different from the speed at which the voice data is acquired. This is to enable fast-forwarding of audio during playback.
[0022]
Furthermore, in order to provide a chat voice reproduction function, a server and a server program including the following character control means, chat data distribution means, and chat data search means are provided.
[0023]
The character control means is a means for providing a virtual space that can be shared by a plurality of users and controlling the actions of the characters arranged by each user in the virtual space by communicating with the information processing apparatus of each user.
[0024]
The chat data distribution means receives chat data including voice data representing voice uttered by each user from the information processing apparatus of each user, stores the chat data in a predetermined storage medium, and communicates with the plurality of chat data. It is means for delivering to the information processing apparatus. The voice data included in the chat data may be stored in a storage medium after being converted into character data.
[0025]
The chat data search means is means for searching for chat data requested by the user from the chat data stored in the storage medium and transmitting it to the information processing apparatus of the user. Thereby, the information processing apparatus that accesses the server can reproduce the chat voice.
[0026]
Furthermore, the present invention includes, as the sixth information processing apparatus or program, the character control means, the chat data transmission means, and the chat voice output means as described above, similar to the first to fourth information processing apparatuses. The chat voice output means outputs the voice of the character based on the attributes and items of the character arranged by the user using the information processing apparatus and the character arranged by the user of the information processing apparatus from which the output voice data is acquired. Devices and programs are provided. According to this apparatus or program, the user can change the way the sound is heard by his / her selection by acquiring predetermined attributes and items. This is to prevent the user from feeling stressed about how to listen to the chat voice by allowing the user to select a preferred listening method.
[0027]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, an embodiment of the present invention will be described by taking a network role playing game as an example. First, an overview of a network role playing game and its voice chat function will be described with reference to FIG.
[0028]
In general, the network game service is provided by a server computer managed by a game company or the like. The network role playing game service of the present embodiment is also provided by one or a plurality of computers (hereinafter referred to as server 2) connected to the network 1 such as the Internet. A user of a game service (hereinafter referred to as a user) operates an information processing apparatus having a communication function, for example, a home game machine, a personal computer, a portable game machine, a mobile phone, a portable information terminal, etc. The server 2 is accessed through the game. Note that the communication function is not limited to wired, but may be a wireless communication function. The figure shows a state in which two users 4A and 4B are accessing the server 2 using the information processing apparatuses 3A and 3B, respectively.
[0029]
The server 2 defines a virtual space 5 constituted by land, buildings, and other various objects, and provides the virtual space 5 in a state that can be shared by users who have accessed the server 2. The user can enjoy a feeling as if he / she is in the virtual space by placing a character corresponding to his / herself in the virtual space 5 provided by the server 2 and moving the character. The user can move his / her character by operating a control device (controller) connected to the information processing apparatus and sending a predetermined instruction signal to the server 2. In the figure, as a result of the user 4A operating the controller 11A to move the character 6A and the user 4B operating the controller 11B to move the character 6B, the character 6A and the character 6B encounter each other in the virtual space. .
[0030]
The information processing apparatuses 3A and 3B used by each user have a built-in or connected microphone for inputting sound and a speaker for outputting sound. Further, in the present embodiment, each user wears a pulse sensor for measuring a pulse at any part of the body. The form of the microphone, the speaker, and the pulse sensor may be any form. For example, in the figure, the users 4A and 4B are equipped with microphone-equipped headphones equipped with voice input microphones 7A and 7B and voice output earphones 8A and 8B, and pulse sensors 12A and 12B so as to hit the temples of the head. The state which mounted | wore is illustrated.
[0031]
Voice chat is realized by generating chat data including voice data representing voice input from the microphones 7 A and 7 B and exchanging chat data via the network 1 and the server 2. For example, when the utterance sound 9 of the user 4B is input to the information processing apparatus 3B through the microphone 7B, chat data including voice data representing the utterance sound 9 is transferred from the information processing apparatus 3B to the server 2. The chat data is distributed by the server 2 to the information processing apparatuses 3A and 3B, and the sound 10 is output from the earphone 8A of the user 4A and the earphone 8B of the user 4B.
[0032]
In the network role-playing game, a user who performs voice chat usually utters a voice with the intention of becoming a substitute character. In other words, not a direct conversation between users but an indirect conversation via a character. For this reason, in the following description, when a user utters as a character, it expresses as "a character utters" etc. as needed.
[0033]
The voice chat function of the network role-playing game according to the present embodiment has several features as described below.
[0034]
First, the voice uttered by a character is heard depending on the shape, position, orientation, characteristics, etc. of objects around the character. In other words, the attribute of an object other than the character arranged in the virtual space affects the way in which the voice uttered by the character is heard. The attribute of each object is defined at the time of configuring the virtual space.
[0035]
For example, in the virtual space 5 illustrated in FIG. 1, sound echo characteristics are defined as one of the attributes of the object 13 (tunnel). In this case, when the character 6C in the vicinity of the object 13 makes a sound while facing the object 13, the sound is reverberated and heard. When the character 6C makes a sound in the opposite direction to the object 13, the sound does not reverberate. The object 14 (plant) has sound absorption characteristics. In this case, the character 6D and the character 6E trying to communicate with each other with the object 14 sandwiched between them are absorbed by the object 14 and thus cannot hear each other's voice and cannot talk. On the other hand, the characters 6A and 6B located at a predetermined distance or more from the object 13 and the object 14 can talk normally without being influenced by the object.
[0036]
As object attributes related to sound, in addition to the above-mentioned echo and absorption, various attributes such as a characteristic that the emitted sound can be heard in a stepwise manner and a characteristic that it can be heard in an octave higher can be freely defined. it can. The attribute relating to the sound may be defined so as to be an environment close to the real world as in the example of the object 13, or may be an attribute causing an unrealistic phenomenon as in the example of the object 14.
[0037]
The way in which the voice uttered by the character changes depending on the location of the character is nothing other than that the voice is closely related to the structure of the virtual space. Thus, each user can feel that a certain character has moved from one place in the virtual space to another place from a change in how the voice uttered by the character is heard. In other words, since an event that has been captured only by vision can be captured by hearing, a sense of reality is increased as compared with the past.
[0038]
Voice chat is originally a function for communication, but in the present embodiment, it is also possible to enjoy the change in how the voice is heard by speaking to the character while moving the character. Furthermore, when encountering a partner who does not want to have a conversation, the game can be advanced while utilizing the way the sound is heard, such as moving the character to a place where it is difficult to hear the sound.
[0039]
Next, the second feature will be described. In the game of the present embodiment, when the positional relationship between characters having a conversation changes with the passage of time, the sound also changes in real time with the passage of time. For example, a voice uttered by a character trying to run away while riding a car is gradually reduced as the distance from the character that hears the voice increases. That is, not only a relative positional relationship between characters but also a change in the positional relationship is detected, and the detected change is reflected in the output voice. The relationship between the change in the positional relationship and the change in the sound may be a relationship similar to the real world as in the above example, but an unrealistic relationship unique to the virtual space may be defined. For example, when the character being uttered moves (warps) in the virtual space instantaneously, it is possible to define a relationship such that no sound can be heard or suddenly the sound can be heard at the warped timing.
[0040]
The fact that the way in which the voice uttered by the character changes in accordance with the movement of the character is nothing but the close relationship between the voice uttered by the user and the virtual space. Similar to the first feature, the user can feel that a certain character is moving from one place in the virtual space to another place from a change in how the voice uttered by the character is heard. Conventionally, an event that has been captured only by vision can be captured by auditory sense.
[0041]
Next, the third feature will be described. In role-playing games, sound effects for production are usually output in accordance with the action taken by the scene or character. For example, wind sound or collision sound. These sound effects, like the virtual space, are defined by the side that creates and provides the game. In the role playing game of the present embodiment, when the timing at which the character utters the voice overlaps with the output timing of such a sound effect, the way the character's voice is heard changes. In other words, the sound data output by the game control program and the sound data acquired by the microphone input are output after being synthesized according to a predetermined rule.
[0042]
For example, in a strong wind scene, the sound of the wind and the voice of the character will be heard, but in this case, the weight of the sound of the wind is increased so that the sound of the character is drowned out by the sound of the wind and becomes difficult to hear. . The rule for synthesizing speech may be a rule that makes it sound like the real world, or a rule that makes an unreal sound that is unique to the virtual world. In any case, it goes without saying that various rules can be considered.
[0043]
The fact that the sound heard by the character changes depending on the sound generated in the virtual space is nothing but the close relationship between the sound emitted by the user and the virtual space. As a result, the user who emits the sound can feel the presence as if he / she is speaking in the virtual space, and the user who listens to the sound can actually hear the character in the virtual space. You can enjoy the feeling as if
[0044]
Next, the fourth feature will be described. In the role playing game according to the present embodiment, each user wears the pulse sensors 12A and 12B as described above, and the sound produced by the user changes according to the user's pulse.
[0045]
For example, when it is detected that the pulse is extremely high, even if the user himself / herself utters a normal voice, the voice of the user who listens to the voice may be trembling or high. hear. Furthermore, the user's character with a high pulse changes the screen display, for example, the face color becomes red (or blue). Instead of the pulse sensor, a sweat sensor or a temperature sensor may be attached. Since the sensor is worn for the purpose of detecting a change in the user's physical condition, any sensor that meets this purpose may be used.
[0046]
As described above, when the user's body information is reflected in the utterance sound of the user's character, the uttering user can enjoy the game assimilating with the character. In addition, the user who listens to the voice can catch a glimpse of the state or character of the user who operates the character from the voice of the character.
[0047]
Next, means for providing the above-described voice chat function will be described. FIG. 2 is a diagram for explaining functions of the information processing apparatus 3 used by the user. As illustrated, the information processing apparatus 3 includes a character control function 16, a chat data transmission function 17, and a chat voice output function 18. Specifically, these functions are realized by a control program incorporated in the information processing apparatus 3.
[0048]
The character control function 16 receives information on the virtual space and the arrangement position of the characters from a server (not shown) via the network 1 and displays a partial area of the virtual space on the display 15 based on the received information. 11 is a function of receiving an operation input from 11 and transmitting information specifying the character's action to the server.
[0049]
The chat data transmission function 17 is a function that receives voice input from the microphone 7 and pulse information from the sensor 12, generates chat data including such information, and transmits the chat data to the server via the network 1. . As shown in FIG. 3, the chat data needs to be in a format including at least three types of information, that is, the uttered sound 21 input by the microphone and digitized, the utterance time 19 of the uttered sound 21, and the information of the uttered character 20. is there. Alternatively, information specifying the user may be added in place of the utterance character 20. In the present embodiment, the chat data also includes user physical information 22 such as a pulse acquired by a sensor.
[0050]
The chat voice output function 18 is a function that receives voice data transferred from the server 2 via the network 1 and outputs the voice data to the speaker 8. The voice data transferred from the server 2 includes production sound data representing sound effects for production, in addition to chat data transmitted by the chat data transmission function 17 of each information processing apparatus. As shown in FIG. 4, the effect sound data includes at least information on the sound effect 24 and the output timing 23 of the sound effect 24. The effect sound data may be transferred from the server 2 but may be held in advance by the information processing device 3.
[0051]
FIG. 5 is a diagram illustrating a hardware configuration of the information processing apparatus 3. The information processing apparatus 3 includes at least a CPU 25, a RAM 26, a communication control unit 27, an input / output control unit 28, an operation input control unit 29, a display output control unit 30, a voice input / output control unit 31, a sensor input control unit 32, and a connection between them. The system bus 33 is provided.
[0052]
The communication control unit 27 is connected to the network 1 and controls exchange of programs and data with the server 2. Further, the input / output control unit 28 controls reading of data from a recording medium such as a CD-ROM, DVD 33, memory card 34, and hard disk 35 and writing of data to the recording medium. The operation input control unit 29 controls user input from an input device such as the controller 11 externally connected to the information processing apparatus 3. The accepted input is transmitted to the CPU 25 via the system bus 33. The display output control unit 30 controls display on the display 15 of an image output by the control program. The voice input / output control unit 31 controls voice input from the microphone 7 and voice output to the speaker 8. Further, the sensor input control unit 32 controls sensor input from the pulse sensor 12.
[0053]
A control program that provides each function of FIG. 2 is provided by a CD-ROM or DVD 33 or downloaded from the server 2. In either case, the control program is loaded into the RAM 26 and executed by the CPU 25. The CPU 25 implements the character control function 16 by exchanging instruction signals and the like among the operation input control unit 29, the display output control unit 30, and the communication control unit 27 based on the control program. Similarly, the chat data transmission function 17 is realized by the CPU 25 exchanging signals and the like with the voice input / output control unit 31, the sensor input control unit 32, and the communication control unit 27 based on the control program. Is realized by exchanging signals with the communication control unit 27 and the voice input / output control unit 31.
[0054]
FIG. 6 is a flowchart showing an outline of the processing of the control program corresponding to the chat voice output function 18 of FIG. As shown in the figure, when the control program receives chat data in step S101, first, in step S102, there is an object having an attribute that affects voice output around the character corresponding to the user who sent the chat data. It is determined whether or not to do. Since the information about the configuration of the virtual space and the arrangement position of the character has been acquired by the character control function as described above, the determination can be performed using the information. For example, it is determined whether or not an object having a sound attribute exists in a circular area with a predetermined radius centered on the character that made the sound or a fan-shaped area with a predetermined angle in front of the character.
[0055]
If there is an object having an attribute related to voice, in step S103, the utterance sound included in the chat data is converted based on the object attribute. For example, if the attribute of the object is a sound echo characteristic, after the echo processing is performed on the uttered sound, the processed sound data is stored in the memory. If there is no object having the attribute related to the voice around the character, the process of step S103 is not executed.
[0056]
Next, in step S104, the relative positional relationship between the character that produced the sound and the character of the user who is using the device that executes the sound output process is calculated based on the virtual space and character placement information. To do. Further, when one or both of the two characters are moving, a change in positional relationship (displacement speed) per unit time is obtained, and whether or not the displacement speed is a speed that affects the voice is determined in advance. Judgment is made by comparison with a threshold value.
[0057]
If the displacement speed is greater than or equal to the threshold value, a voice conversion process based on the displacement speed is performed in step S105. If conversion processing has been performed in step S103, the conversion processing is further performed on the processed audio data stored in the memory. If the displacement speed does not affect the sound, the process of step S105 is not executed.
[0058]
Next, in step S106, it is determined whether or not user physical information such as a pulse is included in the chat data. If user physical information that affects voice (for example, information that the pulse is very high) is included, a conversion process based on the user physical information is executed in step S107. For example, as described above, the utterance sound is converted so that the voice can be heard trembling. If conversion processing has been performed in step S103 or S105, conversion processing is further performed on the converted audio data stored in the memory. The converted data is stored again in the memory. On the other hand, if the user physical information that affects the voice is not included, the process of step S107 is not executed.
[0059]
Next, in step S108, it is determined based on the output timing information included in the effect sound data whether there is a sound effect that is being output or is about to be output. If there is a sound effect, in step S109, the uttered sound and the sound effect are synthesized by performing predetermined weighting as described above. The synthesis method is preferably defined in advance for each type of sound effect. If the conversion process has been performed in step S103, S105, or S107, the processed sound stored in the memory is synthesized with the sound effect. If there is no sound effect, the process of step S109 is not performed.
[0060]
Next, the voice converted or synthesized in step S110 is output. With the above processing, a voice chat function having the above-described four features can be realized. However, the flowchart shown in FIG. 6 shows a process for providing all the above four features, but each of the above features has an effect of increasing the sense of reality alone, and all the features are not necessarily included. There is no need to combine them.
[0061]
As described above, how the sound is heard can be changed by processing the sound data itself by sound conversion or synthesis with another sound. On the other hand, it is empirically known that how to hear the sound changes depending on the user who listens to the sound and the position of the speaker from which the sound is output.
[0062]
Therefore, in the present embodiment, when performing the above voice conversion and synthesis, output data that differs depending on the speaker placement position, such as the speaker placed on the right side of the user and the speaker placed on the left side. Is generated. For example, when a certain character emits sound and moves at high speed in the right direction as viewed from the character of the user who uses the information processing apparatus, the volume of the uttered sound output from the left speaker is gradually reduced. The volume of the uttered sound output from the right speaker is increased stepwise. Thereby, it can be felt by hearing that the utterance character has moved in the right direction at high speed, and the sense of reality increases. Needless to say, more output data may be generated assuming that three or more speakers are arranged.
[0063]
Next, problems that may occur due to increased realism and means for solving the problems will be described. As is clear from the above description, increasing the sense of presence by changing the way the sound is heard does not necessarily make it easier to hear the sound. For this reason, while there are users who enjoy a sense of realism, there is a possibility that some users feel stressed that it is difficult to hear sound. Therefore, the network role-playing game according to the present embodiment has some new functions so that all users can enjoy the above-mentioned presence without feeling stress.
[0064]
FIG. 7 is a diagram for explaining a first function for alleviating stress that is difficult to hear. In the virtual space, an object 36 that emits loud sound noise is arranged between the character 6A of the user 4A and the characters 6B and 6C operated by other users. The sound 10 is output from the speaker 8A of the headphone of the user 4A, but the sound quality is deteriorated due to the influence of the object 36. Therefore, the user 4A determines which character the voice 10 is. Can not do it.
[0065]
The first function is a function for making it possible to easily distinguish the voice character in such a case. Specifically, as shown in FIG. 7, a mark 37 indicating that the voice is being spoken is displayed around the character 6B being voiced. Alternatively, characters such as “speaking” may be displayed around the speaking character 6B. Furthermore, the color of the voice character body may be changed, or the mouth of the character may be moved. Accordingly, the user 4A can easily recognize that the sound 10 is generated from the character 6B. Thereby, for example, by moving the character 6 A to the vicinity of the character 6 B and exchanging the conversation again, it is possible to confirm the content of the speech that has been missed. The first function can be realized by linking the chat voice output function 18 and the character control function 16 in the configuration of the control program shown in FIG.
[0066]
Here, in the first function, the character 6A makes an inquiry to the character 6B, and the character 6B makes the same statement again, so that the content of the missed statement becomes clear. In other words, if the character 6B refuses to repeat a statement, the content of the statement that has been missed cannot be known. Therefore, in this case, in this embodiment, an audio playback function is provided as the second function so that the content of the speech that has been missed can be known.
[0067]
The voice reproduction function provided as the second function is a function for reproducing a past utterance sound of a character when a predetermined instruction is input from the user. Various user interfaces for inputting instructions can be considered. For example, as shown in FIG. 8, a method of displaying a menu 38 for instructing voice reproduction on a screen is conceivable. The example of FIG. 8 is an example in which the operation menu 38 is displayed by moving the cursor 39 to the character 6B and performing a predetermined button operation. Furthermore, if a desired instruction is selected from the menu items by operating the cursor 39, a part or all of past utterances can be reproduced. In the case of only the first function, the user who operates the character 6B must repeat the same remarks many times when many characters ask for repetitive remarks. The user who operates the character 6B is not required to restate. Also, the user 4A who operates the character 6A can confirm the content he wants to know without hesitation of the user who operates the character 6B.
[0068]
Furthermore, as shown in FIG. 9, the present embodiment also provides a function of collectively playing back conversations conducted in a predetermined place (such as a room) in the virtual space without specifying a character. According to this function, the operation menu 40 appears on the screen when the user performs a predetermined button operation. The user can select any of the menu items by operating the cursor 39, and can reproduce a part or all of a conversation that has been performed in the past at a predetermined location in the virtual space. In this embodiment, a fast-forward playback function is also provided so that many conversations can be played back in a short time.
[0069]
Note that the audio playback function is useful not only for re-listening when the voice is missed, but also for users who have joined the game from the middle. For example, FIG. 9 shows an example in which a user operating the character 6A participates in the game halfway and encounters characters 6B to 6D in conversation. In such a case, the user who operates the character 6A can grasp the content of the conversation between the three characters by playing back the conversation that has been performed in the past at this place. In this case, the character 6A does not bother other users by asking many questions for grasping the situation.
[0070]
In order to provide the second function, it is necessary to store all audio data related to past utterances. When providing a playback function for the purpose of only re-listening, data for playback may be stored on the information processing device side, but as described above, not only for the purpose of re-listening, In order to enable playback, it is preferable to store playback data on the server side.
[0071]
FIG. 10 is a diagram showing functions on the server side necessary for performing audio reproduction. As shown in the figure, the server 2 provides virtual space and manages a character space / character control function 41 for managing the arrangement of characters, and receives chat data from each information processing device and distributes it to other information processing devices. A chat data distribution function 42 and a chat data search function 43 for acquiring and transmitting the requested chat data by searching the database 44 in response to a request from each information processing apparatus are provided.
[0072]
Chat data is stored and stored in the database 44 by the chat data distribution function 42. At this time, in addition to the utterance time and utterance character information included in the chat data, other information necessary for the search, such as the utterance location, is added. In addition to the method of saving chat data as it is received, chat data can be saved by converting voice data contained in chat data into character data and saving it as data containing the voice time, voice character, and voice data. A way to do this is also possible. When the utterance content is stored as character data, it is also possible to display the characters together with the voice when the reproduction of the voice is requested. In addition, since many techniques for retrieving character information are known, using these techniques makes it possible to retrieve not only the utterance character and utterance time but also the utterance content. Furthermore, in recent years, a technique has also been proposed in which voice data is converted into a standard voice code and a voice search is performed by matching or mismatching of the voice codes. Therefore, the voice data may be converted into a standard voice code and stored, and a search based on the voice code may be performed.
[0073]
For example, many standard audio compression techniques such as MP3 also set standards for fast-forward playback of audio. Therefore, if the utterance information included in the chat data is stored in a standard data format, the aforementioned fast-forward function can be provided. Note that the search data and the data for batch playback or fast-forward playback may be stored separately in different data formats and used separately according to the purpose.
[0074]
FIG. 11 is a flowchart showing a reproduction process executed by the information processing apparatus 3 and a search process executed by the server 2. As shown in the figure, in step S201, when the information processing apparatus 3 receives an input of a voice reproduction instruction from the operation menu as shown in FIG. 8 or FIG. 9, in step S202, the character, the utterance time, and the place where the conversation was made. The reproduction request including the search keyword information is transmitted to the server 2. When the server 2 receives the reproduction request in step S301, in step S302, the server 2 determines whether or not partial reproduction is requested or reproduction of all past audio is requested by confirming the presence or absence of the search keyword. . If a search keyword is included, the database 44 is searched using the keyword in step S303. In step S304, the chat data acquired by the search is transmitted to the information processing apparatus that transmitted the reproduction request. When reproduction of all past audio is requested, the chat data stored in the database 44 is arranged in time series and transmitted to the information processing apparatus.
[0075]
In step S203, the information processing apparatus receives the chat data transmitted by the server 2, and reproduces and outputs the voice included in the chat data in step S204.
[0076]
Next, a third function for alleviating stress that is difficult to hear will be described. The third function is a voice conversion process or voice for giving a sense of reality when the character satisfies a predetermined condition, such as when the character possesses a predetermined item or when the character has a predetermined attribute. This is a function for outputting a sound without synthesizing processing.
[0077]
FIG. 12 is a flowchart illustrating a process for providing the third function. As shown in the figure, the flowchart shown in FIG. 12 shows whether or not the character of the user who received the chat data (the user who listened to the voice) possesses the designated item after step S101 of the flowchart shown in FIG. Step S111 for determining whether or not is added. The specified item may be fixed, but may be defined for each chat data. That is, information on the designated item may be included in the chat data.
[0078]
If it is determined in step S111 that the listener's character does not possess the specified item, the processing from step S102 onward in FIG. 6 is executed. On the other hand, when the listener's character possesses the designated item, the processes from steps S102 to S109 in FIG. 6 are not executed. In this case, since voice conversion and synthesis processing are not performed, in step S110, the voice included in the chat data is output as it is.
[0079]
According to the third function, for example, when the designated item is a transceiver, as shown in FIG. 13, the character 6A carrying the transceiver 45A and the character 6B carrying the transceiver 45B are affected by surrounding objects and sound effects. You can always talk with a voice that is easy to hear.
[0080]
Similarly, in step S111 in FIG. 12, the attribute of the character may be referred to and the necessity of speech conversion or synthesis may be determined based on the attribute. In this case, for example, when a human attribute character has a conversation with a robot attribute character, it may be difficult to hear the voice, but when a human attribute character has a conversation with each other, it is possible to always talk with a clear voice. can do. In the case of a game of a type in which a battle is divided into groups, by defining the grouping as an attribute, it is possible to realize a specification that allows chatting with only the ally group.
[0081]
According to the third function, when the user satisfies a predetermined condition, the user can have a conversation with the voice that is uttered as in the conventional case. Any user who prioritizes can enjoy the game together without feeling stressed.
[0082]
As described above, in the role-playing game according to the present embodiment, when voice chat is performed, voice closely related to the virtual space is output. Can feel. In addition, as a result of emphasizing the sense of presence, even if it may be difficult to hear the sound temporarily, the user feels stressed because the device has been devised to rehearse the sound or listen to a clear sound. There is no.
[0083]
Although the network role playing game has been described as an example, it is needless to say that the present invention is a technique applicable to any system that performs voice chat via a character.
[0084]
【The invention's effect】
According to each information processing apparatus or program of the present invention, the chat voice is output after the processing for making the relation between the virtual space and the voice dense, so that the user who listens to the voice is as if in the virtual space. You can feel a sense of realism like having a conversation.
[0085]
Further, according to each other information processing apparatus or program of the present invention, even when chat voice is difficult to hear, the user is not stressed.
[Brief description of the drawings]
FIG. 1 is a diagram for explaining an outline of a network role-playing game and its voice chat function.
FIG. 2 is a diagram illustrating functions of an information processing apparatus according to the present invention.
FIG. 3 is a diagram illustrating an example of a format of chat data.
FIG. 4 is a diagram illustrating an example of a format of effect sound data.
FIG. 5 is a diagram illustrating a hardware configuration of an information processing apparatus.
FIG. 6 is a flowchart showing an outline of processing in an embodiment of the program of the present invention.
FIG. 7 is a diagram for explaining a function for discriminating a voice character;
FIG. 8 is a diagram for explaining a playback function for each character of chat voice;
FIG. 9 is a diagram for explaining a chat sound batch reproduction function;
FIG. 10 is a diagram illustrating server functions necessary for playing chat voice.
FIG. 11 is a flowchart showing an outline of processing for reproducing chat voice.
FIG. 12 is a flowchart showing audio output processing based on possessed items.
FIG. 13 is a diagram for explaining audio output processing based on possessed items.
[Explanation of symbols]
2 server, 3, 3A, 3B information processing device, 4A, 4B user, 5 virtual space, 6A-6E character, 7, 7A, 7B microphone, 8, 8A, 8B speaker (earphone), 9 vocal sound, 10 output 11, 11A, 11B controller, 12, 12A, 12B sensor, 13, 14, 36 object, 33 system bus, 34 memory card, 35 hard disk, 37 mark, 38, 40 operation menu, 39 cursor, 45A, 45B Transceiver.

Claims

An information processing apparatus that provides a voice chat function,
By controlling a virtual space that can be shared by multiple users and a server that manages the characters for each user placed in the virtual space, the user of the information processing apparatus controls the behavior of the characters placed in the virtual space Character control means for
Chat data transmission means for acquiring first voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the first voice data, and transmitting the chat data;
The character arranged by the user of the information processing apparatus that has acquired chat data generated by the information processing apparatus in communication with the server and uses the first voice data included in the chat data and generated the chat data Chat voice output means for outputting the voice of
The chat voice output means generates second voice data by converting the first voice data included in the acquired chat data based on an attribute relating to voice of an object arranged in the virtual space, and the second voice data is generated. 2. An information processing apparatus that outputs a voice represented by two voice data as the voice of the character.

An information processing apparatus that provides a voice chat function,
By controlling a virtual space that can be shared by multiple users and a server that manages the characters for each user placed in the virtual space, the user of the information processing apparatus controls the behavior of the characters placed in the virtual space Character control means for
Chat data transmission means for acquiring first voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the first voice data, and transmitting the chat data;
The character arranged by the user of the information processing apparatus that has acquired chat data generated by the information processing apparatus in communication with the server and uses the first voice data included in the chat data and generated the chat data Chat voice output means for outputting the voice of
The chat voice output means includes a first voice data included in the acquired chat data, and a positional relationship between a character arranged by a user of the information processing apparatus in which the chat data is generated and a character arranged by the user of the information processing apparatus An information processing apparatus that generates second voice data by performing conversion based on a displacement amount per unit time and outputs the voice represented by the second voice data as the voice of the character.

An information processing apparatus that provides a voice chat function,
By controlling a virtual space that can be shared by multiple users and a server that manages the characters for each user placed in the virtual space, the user of the information processing apparatus controls the behavior of the characters placed in the virtual space Character control means for
Chat data transmission means for acquiring first voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the first voice data, and transmitting the chat data;
The character arranged by the user of the information processing apparatus that has acquired chat data generated by the information processing apparatus in communication with the server and uses the first voice data included in the chat data and generated the chat data Chat voice output means for outputting the voice of
The chat data transmitting means acquires the user's physical information by a predetermined sensor attached to the user, generates and transmits chat data including the physical information,
The chat voice output unit generates second voice data by converting the first voice data included in the acquired chat data based on the physical information included in the chat data, and the second voice data represents the second voice data. An information processing apparatus that outputs voice as the voice of the character.

An information processing apparatus that provides a voice chat function,
By controlling a virtual space that can be shared by multiple users and a server that manages the characters for each user placed in the virtual space, the user of the information processing apparatus controls the behavior of the characters placed in the virtual space Character control means for
Chat data transmission means for acquiring first voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the first voice data, and transmitting the chat data;
The character arranged by the user of the information processing apparatus that has acquired chat data generated by the information processing apparatus in communication with the server and uses the first voice data included in the chat data and generated the chat data Chat voice output means for outputting the voice of
The chat sound output means acquires sound effect data representing sound effects for producing the virtual space, generates second sound data by synthesizing the first sound data and the sound effect data, An information processing apparatus that outputs the voice represented by the second voice data as the voice of the character.

In the information processing apparatus according to any one of claims 1 to 4,
The chat voice output unit generates the second voice data for each voice output device based on a positional relationship between a voice output device that outputs the voice and a user of the information processing device. .

An information processing apparatus that provides a voice chat function,
By controlling a virtual space that can be shared by multiple users and a server that manages the characters for each user placed in the virtual space, the user of the information processing apparatus controls the behavior of the characters placed in the virtual space Character control means for
Chat data transmission means for acquiring voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the voice data, and transmitting the chat data;
The chat data generated by the information processing apparatus in communication with the server is acquired, and the voice of the character arranged by the user of the information processing apparatus in which the chat data is generated using the voice data included in the chat data Chat voice output means for outputting,
The chat voice output means obtains chat data immediately after generated by each information processing apparatus and outputs the voice of the character, and chat data generated by each information processing apparatus and stored and stored in the server. An information processing apparatus that provides a function of acquiring and reproducing the voice of the character.

The information processing apparatus according to claim 6, wherein the chat voice output unit can reproduce the voice of the character at a speed different from a speed at which the voice data is acquired.

A server that provides a voice chat function,
A character control means for providing a virtual space that can be shared by a plurality of users, and controlling the behavior of the character placed by each user in the virtual space by communicating with the information processing device of each user;
Chat data including voice data representing the voice uttered by each user is received from the information processing apparatus of each user, the chat data is stored in a predetermined storage medium, and the plurality of information processing apparatuses in communication Chat data delivery means for delivery to
Chat data search means for searching for chat data requested by the user from chat data stored in the storage medium and transmitting the chat data to the information processing apparatus of the user. A server characterized by enabling playback.

9. The server according to claim 8, wherein the chat data distribution unit converts the voice data included in the chat data into character data, and stores the character data in the predetermined storage medium.

An information processing apparatus that provides a voice chat function,
By controlling a virtual space that can be shared by multiple users and a server that manages the characters for each user placed in the virtual space, the user of the information processing apparatus controls the behavior of the characters placed in the virtual space Character control means for
Chat data transmission means for acquiring voice data representing voice uttered by a user of the information processing apparatus, generating chat data including the voice data, and transmitting the chat data;
The chat data generated by the information processing apparatus in communication with the server is acquired, and the voice of the character arranged by the user of the information processing apparatus in which the chat data is generated using the voice data included in the chat data Chat voice output means for outputting,
The chat voice output means is based on attributes and / or items of a character arranged by a user using the information processing apparatus and / or a character arranged by a user of the information processing apparatus from which output voice data is acquired. An information processing apparatus that outputs voice of the character.

A program that provides a voice chat function.
A character that controls the action of a character placed in the virtual space by a user of the computer by communicating with a virtual space that can be shared by a plurality of users and a server that manages the character for each user placed in the virtual space Control means,
The first voice data representing the voice uttered by the user of the computer is acquired, the chat data transmitting means for generating and transmitting the chat data including the first voice data, and the computer generated by the computer in communication with the server The chat data is acquired, and the first voice data included in the chat data is used to function as a chat voice output means for outputting the voice of the character arranged by the computer user who generated the chat data,
The chat voice output means generates second voice data by converting the first voice data included in the acquired chat data based on an attribute relating to voice of an object arranged in the virtual space, and the second voice data is generated. 2. A program that outputs the voice represented by the voice data as the voice of the character.

A program that provides a voice chat function.
A character that controls the action of a character placed in the virtual space by a user of the computer by communicating with a virtual space that can be shared by a plurality of users and a server that manages the character for each user placed in the virtual space Control means,
The first voice data representing the voice uttered by the user of the computer is acquired, the chat data transmitting means for generating and transmitting the chat data including the first voice data, and the computer generated by the computer in communication with the server The chat data is acquired, and the first voice data included in the chat data is used to function as a chat voice output means for outputting the voice of the character arranged by the computer user who generated the chat data,
The chat voice output means includes a first voice data included in the acquired chat data, and a positional relationship between a character arranged by a computer user who generates the chat data and a character arranged by a computer user who operates the program A program for generating second voice data by converting based on a displacement amount per unit time and outputting the voice represented by the second voice data as the voice of the character.

A program that provides a voice chat function.
A character that controls the action of a character placed in the virtual space by a user of the computer by communicating with a virtual space that can be shared by a plurality of users and a server that manages the character for each user placed in the virtual space Control means,
The first voice data representing the voice uttered by the user of the computer is acquired, the chat data transmitting means for generating and transmitting the chat data including the first voice data, and the computer generated by the computer in communication with the server The chat data is acquired, and the first voice data included in the chat data is used to function as a chat voice output means for outputting the voice of the character arranged by the computer user who generated the chat data,
The chat data transmitting means acquires the user's physical information by a predetermined sensor attached to the user, generates and transmits chat data including the physical information,
The chat voice output unit generates second voice data by converting the first voice data included in the acquired chat data based on the physical information included in the chat data, and the second voice data represents the second voice data. A program for outputting a voice as the voice of the character.

A program that provides a voice chat function.
A character that controls the action of a character placed in the virtual space by a user of the computer by communicating with a virtual space that can be shared by a plurality of users and a server that manages the character for each user placed in the virtual space Control means,
The first voice data representing the voice uttered by the user of the computer is acquired, the chat data transmitting means for generating and transmitting the chat data including the first voice data, and the computer generated by the computer in communication with the server The chat data is acquired, and the first voice data included in the chat data is used to function as a chat voice output means for outputting the voice of the character arranged by the computer user who generated the chat data,
The chat sound output means acquires sound effect data representing sound effects for producing the virtual space, generates second sound data by synthesizing the first sound data and the sound effect data, A program that outputs the voice represented by the second voice data as the voice of the character.

A program according to any one of claims 11 to 14,
The chat voice output unit generates the second voice data for each voice output device based on a positional relationship between a voice output device that outputs the voice and a computer user who operates the program. .

A program that provides a voice chat function.
A character that controls the action of a character placed in the virtual space by a user of the computer by communicating with a virtual space that can be shared by a plurality of users and a server that manages the character for each user placed in the virtual space Control means,
The first voice data representing the voice uttered by the user of the computer is acquired, the chat data transmitting means for generating and transmitting the chat data including the first voice data, and the computer generated by the computer in communication with the server The chat data is acquired, and the first voice data included in the chat data is used to function as a chat voice output means for outputting the voice of the character arranged by the computer user who generated the chat data,
The chat voice output means acquires chat data immediately after being generated by each computer and outputs the voice of the character, and acquires chat data generated by each computer and stored in the server. And a function of reproducing the voice of the character.

The program according to claim 16, wherein the chat voice output unit can play back the voice of the character at a speed different from the speed at which the voice data is acquired.

A program for a server that provides a voice chat function.
A character control means for providing a virtual space that can be shared by a plurality of users, and controlling the behavior of the character placed by each user in the virtual space by communicating with the information processing device of each user,
Chat data including voice data representing the voice uttered by each user is received from the information processing apparatus of each user, the chat data is stored in a predetermined storage medium, and the information processing apparatuses in communication By functioning as chat data distribution means for distributing to the user and chat data requested by the user from chat data stored in the storage medium and functioning as chat data search means for transmitting to the information processing apparatus of the user A server program characterized in that chat information can be reproduced by the information processing apparatus.

19. The program according to claim 18, wherein the chat data distribution unit converts the voice data included in the chat data into character data, and stores the character data in the predetermined storage medium.

A program that provides a voice chat function.
A character that controls the action of a character placed in the virtual space by a user of the computer by communicating with a virtual space that can be shared by a plurality of users and a server that manages the character for each user placed in the virtual space Control means,
Acquires voice data representing voice uttered by a user of the computer, generates chat data including the voice data and transmits the chat data, and acquires chat data generated by the computer in communication with the server The voice data included in the chat data is used to function as chat voice output means for outputting the voice of the character arranged by the computer user who generated the chat data.
The chat voice output means is based on an attribute and / or an item of a character arranged by a user using a computer that operates the program and / or a character arranged by a user of a computer from which audio data to be output is acquired. A program for outputting the voice of the character.

A computer-readable recording medium on which the program according to any one of claims 11 to 20 is recorded.