JP2000285063A

JP2000285063A - Information processor, information processing method and medium

Info

Publication number: JP2000285063A
Application number: JP11092862A
Authority: JP
Inventors: Kiyonobu Kojima; 清信小島; Tsunenori Noma; 恒毅野間
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1999-03-31
Filing date: 1999-03-31
Publication date: 2000-10-13

Abstract

PROBLEM TO BE SOLVED: To enable a user to enjoy chat even when the user is not skilled at operating a keyboard. SOLUTION: A personal computer (PC) 1 serving as a chat client performs speech recognition on input speech and transmits the recognition result, as text data, to a server 2 serving as a chat server. The PC 1 also receives text data transmitted from the server 2, performs speech synthesis based on that text data, and outputs the synthesized speech corresponding to the text data.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an information processing apparatus, an information processing method, and a medium, and in particular to an information processing apparatus, an information processing method, and a medium that allow a user to enjoy chat even without being skilled in operating a keyboard or the like.

[0002]

[Description of the Related Art] A chat system is one example of a tool that lets many users at remote locations communicate easily. In a chat system, text data transmitted from a client logged in to a server is received by the server and transmitted to the other logged-in clients, which allows many users at remote locations to communicate easily with one another.

[0003] Conventionally, chat on a given server was possible only among the users connected to that server. On the Internet, which has recently spread rapidly, however, a chat scheme called IRC (Internet Relay Chat) allows a user who connects to any one of the servers called IRC servers to chat also with clients connected to servers linked to that server. IRC is specified in RFC (Request For Comments) 1459 of the IETF (Internet Engineering Task Force).

[0004]

[Problems to Be Solved by the Invention] In a chat system, text data is transmitted and received between the server and the clients in real time. Conventionally, however, a user had to operate a keyboard to enter text data in order to chat. Users not skilled in keyboard operation therefore found it difficult to enjoy chat. In addition, hiragana entered from the keyboard is hard to read unless it is converted into text mixed with kanji by a kana-kanji conversion system (front-end processor), so users also needed to be skilled in operating the kana-kanji conversion system.

[0005] As a tool that lets many users at remote locations communicate without operating a keyboard, there is, for example, a system that exchanges voice data among many terminals, as disclosed in Japanese Unexamined Patent Publication No. 6-274596.

[0006] However, the system disclosed in Japanese Unexamined Patent Publication No. 6-274596 transmits voice data as is, so the amount of data is large and a wideband transmission path is required. The publication also discloses compressing the voice data before transmission, but sound quality degrades at high compression, and a wideband transmission path is required at low compression.

[0007] The present invention has been made in view of these circumstances, and allows users to communicate easily with one another even over a narrowband transmission path.

[0008]

[Means for Solving the Problems] The information processing apparatus according to claim 1 includes speech recognition means for recognizing input speech and outputting the recognition result as text data, and transmitting means for transmitting the text data, as the speech recognition result, to a server.

[0009] The information processing method according to claim 5 includes a speech recognition step of recognizing input speech and outputting the recognition result as text data, and a transmission step of transmitting the text data, as the speech recognition result, to a server.

[0010] The program that the medium according to claim 6 causes an information processing apparatus to execute includes a speech recognition step of recognizing input speech and outputting the recognition result as text data, and a transmission step of transmitting the text data, as the speech recognition result, to a server.

[0011] The information processing apparatus according to claim 7 includes receiving means for receiving text data transmitted from a server, and speech synthesis means for performing speech synthesis based on the text data from the server and outputting synthesized speech corresponding to that text data.

[0012] The information processing method according to claim 9 includes a receiving step of receiving text data transmitted from a server, and a speech synthesis step of performing speech synthesis based on the text data from the server and outputting synthesized speech corresponding to that text data.

[0013] The program that the medium according to claim 10 causes an information processing apparatus to execute includes a receiving step of receiving text data transmitted from a server, and a speech synthesis step of performing speech synthesis based on the text data from the server and outputting synthesized speech corresponding to that text data.

[0014] In the information processing apparatus according to claim 1, the information processing method according to claim 5, and the medium according to claim 6, input speech is recognized and the recognition result is output as text data. The text data, as the speech recognition result, is then transmitted to a server.

[0015] In the information processing apparatus according to claim 7, the information processing method according to claim 9, and the medium according to claim 10, text data transmitted from a server is received, speech synthesis is performed based on the text data from the server, and synthesized speech corresponding to that text data is output.

[0016]

[Embodiments of the Invention] FIG. 1 shows a configuration example of one embodiment of a chat system to which the present invention is applied.

[0017] In the embodiment of FIG. 1, three personal computers 1-1 to 1-3 and two chat servers 2-1 and 2-2 are interconnected via a communication network 3 such as the Internet.

[0018] Each of the servers 2-1 and 2-2 has an HD (hard disk) 27 (FIG. 3) that stores a chat server program for providing a chat space, that is, a place where users chat. By executing that program, each server functions as a chat server that performs the control needed for personal computers to chat with one another. In other words, the servers 2-1 and 2-2 (hereinafter referred to as the server 2 where they need not be distinguished) provide the personal computers 1-1 to 1-3, connected via the communication network 3, with a chat environment in which their users can chat. Here, the servers 2-1 and 2-2 provide, for example, an IRC chat environment. Chat is therefore possible not only among users connected to the server 2-1 and among users connected to the server 2-2, but also between a user connected to the server 2-1 and a user connected to the server 2-2.

[0019] The personal computers 1-1 to 1-3 (hereinafter referred to as the personal computer 1 where they need not be distinguished) store a chat client program for chatting while sharing the chat space provided by the server 2-1 or 2-2 with other personal computers. Through execution of this chat client program and of the chat server program on the server 2-1 or 2-2, the personal computer 1 displays a chat space for chatting.

[0020] Suppose that the personal computers 1-1 to 1-3 are all connected to the server 2. When text data is entered on, for example, the personal computer 1-1 and transmitted to the server 2 via the communication network 3, the server 2 receives the text data and transmits it to the other personal computers 1-2 and 1-3. Text data entered on the personal computer 1-2 or 1-3 is transmitted to the other personal computers in the same way.

[0021] In this manner, chat takes place among the personal computers 1-1 to 1-3 via the server 2.
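The relay behavior described above, in which the server forwards each client's text to every other connected client, can be illustrated with a minimal in-memory sketch. The class name `ChatRelay`, the client identifiers, and the callback-based delivery are assumptions made for illustration only; the patent describes an IRC-style server communicating over a network, which is not modeled here.

```python
from typing import Callable, Dict, List, Tuple


class ChatRelay:
    """Hypothetical stand-in for the chat server: forwards text to all other clients."""

    def __init__(self) -> None:
        # Maps a client id to a callback that delivers (sender_id, text) to that client.
        self._clients: Dict[str, Callable[[str, str], None]] = {}

    def login(self, client_id: str, deliver: Callable[[str, str], None]) -> None:
        self._clients[client_id] = deliver

    def receive(self, sender_id: str, text: str) -> None:
        # Send the text to every logged-in client except the sender.
        for client_id, deliver in self._clients.items():
            if client_id != sender_id:
                deliver(sender_id, text)


def demo_relay() -> Dict[str, List[Tuple[str, str]]]:
    """Log in three clients, send one message, and return each client's inbox."""
    inboxes: Dict[str, List[Tuple[str, str]]] = {"pc1-1": [], "pc1-2": [], "pc1-3": []}
    relay = ChatRelay()
    for client_id, inbox in inboxes.items():
        # Bind inbox as a default argument so each callback keeps its own list.
        relay.login(client_id, lambda sender, text, inbox=inbox: inbox.append((sender, text)))
    relay.receive("pc1-1", "hello")
    return inboxes
```

Calling `demo_relay()` leaves the sender's inbox empty and places the message in the inboxes of the other two clients, mirroring the flow from the personal computer 1-1 to the personal computers 1-2 and 1-3.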

[0022] Next, FIG. 2 shows a hardware configuration example of the personal computer 1 of FIG. 1.

[0023] The communication I/F (interface) 11 consists of, for example, a modem, a terminal adapter, or a network card, and controls communication via the communication network 3. The ROM (Read Only Memory) 12 stores, for example, a BIOS (Basic Input Output System) program. The CPU (Central Processing Unit) 13 loads application programs stored on the HD 17, such as the chat client program and a voice processing program, into the RAM (Random Access Memory) 14 and executes them, thereby performing processing for chat, speech recognition, speech synthesis, and so on. The RAM 14 temporarily stores programs and data that the CPU 13 needs for its operation.

[0024] The input unit 15 consists of, for example, a keyboard, a mouse, and a microphone, and is used to enter necessary commands and data. The output unit 16 consists of, for example, a display, a speaker, and headphones, and displays predetermined information or outputs it as sound under the control of the CPU 13. The HD 17 stores, in addition to the application programs described above, the program of an OS (Operating System) such as Windows 95 or 98 (trademark) or Linux.

[0025] In the personal computer 1 configured as described above, the CPU 13 loads the OS program stored on the HD 17 into the RAM 14 and executes it. The CPU 13 then executes the application programs stored on the HD 17 under the control of the OS, which carries out the chat processing described above as well as the speech recognition, speech synthesis, and other processing described later.

[0026] Next, FIG. 3 shows a hardware configuration example of the server 2 of FIG. 1.

[0027] As shown in FIG. 3, the server 2 consists of a communication I/F 21 through an HD 27, which are similar to the communication I/F 11 through the HD 17 of the personal computer 1, respectively. The HD 27, however, stores programs that make the server 2 function as a chat server, such as the chat server program.

[0028] In the server 2 configured as described above as well, the CPU 23 executes the application program stored on the HD 27, which carries out the processing needed for personal computers to chat with one another.

[0029] Next, with reference to FIG. 4, the processing performed when the chat client program is executed on the personal computer 1 and the chat server program is executed on the server 2 will be described.

[0030] In FIG. 4, the blocks making up the personal computer 1 are functional blocks realized by executing the chat client program on the personal computer 1, and the blocks making up the server 2 are likewise functional blocks realized by executing the chat server program on the server 2.

[0031] In FIG. 4, the personal computer 1 consists of a chat communication unit 31 and a chat processing unit 32. The chat communication unit 31 and the chat processing unit 32 of the personal computer 1-i (i = 1, 2, 3 in this embodiment) are denoted as the chat communication unit 31-i and the chat processing unit 32-i, respectively.

[0032] Suppose now that the personal computers 1-1 and 1-2 are connected to the server 2 via the communication network 3 and that chat takes place between them. When the user of the personal computer 1-1 enters text by operating the keyboard of the input unit 15, for example, the text data is supplied to the chat processing unit 32-1. The chat processing unit 32-1 applies predetermined processing to the entered text data; for example, when chat is conducted by IRC, the text data is converted into 7-bit visible ASCII character string data. The result is supplied to the chat communication unit 31-1, which transmits the text data from the chat processing unit 32-1 to the server 2 via the communication network 3.
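The patent does not name the conversion that produces 7-bit data. As a rough illustration only, the sketch below assumes ISO-2022-JP, an encoding that represents Japanese text using bytes below 0x80. Note that ISO-2022-JP includes escape control characters, so it is not strictly "visible ASCII"; the function names `to_seven_bit` and `from_seven_bit` are hypothetical.

```python
def to_seven_bit(text: str) -> bytes:
    """Encode text so that every byte fits in 7 bits (assumes ISO-2022-JP)."""
    return text.encode("iso2022_jp")


def from_seven_bit(data: bytes) -> str:
    """Reverse the conversion on the receiving side."""
    return data.decode("iso2022_jp")


message = "こんにちは"
wire = to_seven_bit(message)

# Every byte on the wire is below 0x80, and the text survives a round trip.
seven_bit_only = all(byte < 0x80 for byte in wire)
restored = from_seven_bit(wire)
```

A receiving client would apply the inverse conversion before displaying or synthesizing the text.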

[0033] In the server 2, the chat server function unit 41 receives the text data from the chat communication unit 31-1 and transmits it via the communication network 3 to the other currently connected personal computers (chat clients), that is, all except the personal computer 1-1. In the embodiment of FIG. 4, the only other personal computer connected to the server 2 is the personal computer 1-2, so the chat server function unit 41 transmits the text data from the chat communication unit 31-1 to the personal computer 1-2 via the communication network 3.

[0034] In the personal computer 1-2, the chat communication unit 31-2 receives the text data from the chat server function unit 41 and supplies it to the chat processing unit 32-2. The chat processing unit 32-2 applies the necessary processing to the text data from the chat communication unit 31-2 and supplies it to the output unit 16 of the personal computer 1-2, where it is displayed.

[0035] Conversely, when the user of the personal computer 1-2 enters text by operating the keyboard of the input unit 15, the text data is transmitted to the personal computer 1-1 along the reverse of the route described above.

[0036] That is, in the personal computer 1-2, the entered text data is transmitted to the server 2 via the chat processing unit 32-2, the chat communication unit 31-2, and the communication network 3. In the server 2, the chat server function unit 41 receives the text data from the personal computer 1-2 and transmits it via the communication network 3 to the other currently connected personal computers (chat clients) except the personal computer 1-2, which in the embodiment of FIG. 4 means the personal computer 1-1. In the personal computer 1-1, the chat communication unit 31-1 receives the text data from the chat server function unit 41 and supplies it to the chat processing unit 32-1. The chat processing unit 32-1 applies the necessary processing to the text data from the chat communication unit 31-1 and supplies it to the output unit 16 of the personal computer 1-1, where it is displayed.

[0037] As described above, when only the chat client program is executed on the personal computer 1, the user has to enter the chat text by operating the keyboard, which is cumbersome.

[0038] FIG. 5 therefore shows a functional configuration example of the personal computer 1 that is realized when the voice processing program is executed in addition to the chat client program stored on the HD 17 of the personal computer 1.

[0039] In FIG. 5, the voice input device 51 corresponds to the input unit 15 of FIG. 2 and consists of a microphone or the like. The voice input device 51 converts the speech input to it into an electrical audio signal, A/D-converts that signal, and supplies the resulting digital voice data to the speech recognition device 52. The speech recognition device 52 recognizes the voice data from the voice input device 51 according to a predetermined speech recognition algorithm, such as the HMM (Hidden Markov Model) method, and outputs the recognition result as text data. This text data is supplied to the chat client function unit 30, which corresponds to the chat communication unit 31 and the chat processing unit 32 of the personal computer 1 shown in FIG. 4.

[0040] The speech recognition device 52 applies kana-kanji conversion to the text data of the recognition result as necessary before outputting it.

[0041] The text-to-speech device 53 is supplied, from the chat client function unit 30, with the text data from other personal computers that the server 2 transmits. The text-to-speech device 53 performs speech synthesis based on the text data from the chat client function unit 30, generates synthesized speech data corresponding to that text data in, for example, WAV or AU format, and supplies it to the voice output device 54. The voice output device 54 corresponds to the output unit 16 of FIG. 2 and consists of a speaker or the like. Following the synthesized speech data from the text-to-speech device 53, the voice output device 54 outputs synthesized speech that reads aloud the text data from the server 2.

[0042] Next, the operation of the personal computer 1 configured as shown in FIG. 5 will be described with reference to FIG. 6.

[0043] When a user of the personal computer 1 utters, for example, "Hello" (FIG. 6), the speech is input to the voice input device (microphone) 51 and converted into voice data as an electrical signal. The voice data is supplied to the speech recognition device 52 and recognized there. The speech recognition device 52 further converts the recognition result into text data, applies kana-kanji conversion as necessary, and supplies the text data to the chat client function unit 30. As described with reference to FIG. 4, the chat client function unit 30 transmits the text data from the speech recognition device 52 to the server 2 via the communication network 3. In this way, the user's utterance "Hello" is turned into text data and transmitted to the server 2.

[0044] Text data transmitted from the server 2 via the communication network 3, on the other hand, is received by the chat client function unit 30 as described with reference to FIG. 4 and supplied to the text-to-speech device 53. The text-to-speech device 53 generates synthesized speech corresponding to the text data from the chat client function unit 30 and supplies it to the voice output device 54, which outputs it. For example, when text data such as "Nice weather today, isn't it?" arrives from the server 2, the voice output device 54 outputs the synthesized speech "Nice weather today, isn't it?" (FIG. 6).
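The two paths just described, speech to text to server and server text to synthesized speech, can be sketched as a small piece of glue code. Everything named here is hypothetical: `VoiceChatClient` and its four callbacks stand in for the speech recognition device 52, the text-to-speech device 53, the chat client function unit 30, and the voice output device 54, and the lambdas are placeholder stubs rather than real recognition or synthesis engines.

```python
from typing import Callable, List


class VoiceChatClient:
    """Hypothetical glue between a recognizer, a chat connection, and a synthesizer."""

    def __init__(
        self,
        recognize: Callable[[bytes], str],
        synthesize: Callable[[str], bytes],
        send_to_server: Callable[[str], None],
        play: Callable[[bytes], None],
    ) -> None:
        self._recognize = recognize
        self._synthesize = synthesize
        self._send_to_server = send_to_server
        self._play = play

    def on_speech(self, voice_data: bytes) -> str:
        # Outgoing path: voice data -> recognized text -> server.
        text = self._recognize(voice_data)
        self._send_to_server(text)
        return text

    def on_server_text(self, text: str) -> None:
        # Incoming path: text from server -> synthesized speech -> speaker.
        self._play(self._synthesize(text))


# Stub wiring for illustration: the "recognizer" just decodes bytes and the
# "synthesizer" just tags the text; neither performs real speech processing.
sent: List[str] = []
played: List[bytes] = []
client = VoiceChatClient(
    recognize=lambda voice: voice.decode("utf-8"),
    synthesize=lambda text: b"SYNTH:" + text.encode("utf-8"),
    send_to_server=sent.append,
    play=played.append,
)
client.on_speech("hello".encode("utf-8"))
client.on_server_text("nice weather today")
```

Only text crosses the boundary to the server in either direction, which is the property the embodiment relies on.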

[0045] Next, the screens displayed when the chat client program and the voice processing program are executed on the personal computer 1 will be described with reference to FIGS. 7 and 8.

[0046] When the chat client program is executed on the personal computer 1, the output unit 16 displays a chat window 61 such as those shown in FIGS. 7 and 8.

[0047] Here, it is assumed that chat takes place between, for example, the personal computers 1-1 and 1-2, and the user of the personal computer 1-1 is referred to as user A and the user of the personal computer 1-2 as user B. The window 61 displayed on the output unit 16 of the personal computer 1-1 or 1-2 is referred to as the window 61-1 or 61-2, respectively.

[0048] As shown in FIGS. 7 and 8, the window 61 consists of a chat character display area 62, which displays the content of the chat conducted between the users, and a text input field 63, which displays the latest text entered by the user. The chat character display area 62 and the text input field 63 of the window 61-1 are hereinafter referred to as the chat character display area 62-1 and the text input field 63-1, respectively (FIG. 7), and those of the window 61-2 as the chat character display area 62-2 and the text input field 63-2, respectively (FIG. 8).

[0049] Now, when user A utters, for example, "Nice weather today, isn't it?", the personal computer 1-1 recognizes the speech and applies kana-kanji conversion. The kana-kanji conversion result is displayed in the text input field 63-1, as shown in FIG. 7, and is then transmitted to the server 2 via the communication network 3.

[0050] The server 2 receives the text data "Nice weather today, isn't it?" from the personal computer 1-1 and transmits it to the personal computer 1-2 via the communication network 3. On the personal computer 1-2, the text data "Nice weather today, isn't it?" from the personal computer 1-1 is displayed in the chat character display area 62-2 of the window 61-2 shown on its output unit 16, as shown in FIG. 8. The personal computer 1-2 also performs, for example, text-to-speech synthesis to generate and output synthesized speech corresponding to the text data "Nice weather today, isn't it?".

[0051] As described above, input speech is recognized and the text data of the recognition result is transmitted to the server 2 via the communication network 3, so the user can easily enjoy chat even without being skilled in keyboard operation.

[0052] Because what is exchanged between the personal computer 1 and the server 2 via the communication network 3 is text data, a conventional chat system can be used as is. Consequently, a chat client can chat with the personal computer 1 even if it does not have the voice processing program for speech recognition and speech synthesis described above. Even in that case, the personal computer 1 can still accept text data by voice input and output received text data as synthesized speech.

[0053] Furthermore, because text data rather than voice data is exchanged, transmission can be carried out over a narrow band.
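A back-of-the-envelope comparison makes the narrowband point concrete. The figures below are assumptions chosen for illustration, not values from the patent: uncompressed 8 kHz, 8-bit mono audio, an utterance lasting two seconds, and the same sentence encoded as ISO-2022-JP text.

```python
# Assumed audio parameters (illustrative only, not taken from the patent).
SAMPLE_RATE_HZ = 8000      # telephone-quality sampling rate
BYTES_PER_SAMPLE = 1       # 8-bit mono samples
UTTERANCE_SECONDS = 2.0    # assumed time needed to say the sentence

# Size of the utterance if sent as raw, uncompressed audio.
pcm_bytes = int(SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * UTTERANCE_SECONDS)

# Size of the same sentence if sent as text (encoding is an assumption).
text_bytes = len("今日はいい天気ですね".encode("iso2022_jp"))

# How many times larger the audio payload is than the text payload.
ratio = pcm_bytes / text_bytes
```

Under these assumptions the raw audio is 16,000 bytes while the text is a few dozen bytes, a difference of more than two orders of magnitude.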

[0054] In addition, because synthesized speech corresponding to the text data transmitted from the server 2 is generated and output, the user can chat without looking at the display of the output unit 16. That is, chat is possible even without a display.

[0055] Taken to the extreme, a chat client therefore needs neither keys for entering text nor a display for showing text; a microphone for voice input and a speaker for voice output suffice. A portable terminal such as a mobile phone can thus easily be used as a chat client.

[0056] Text-to-speech synthesis requires phoneme data or syllable data, which serve as the basic units of synthesis, and this data is assumed to be stored on the HD 17 in advance. The data may, however, be downloaded via the communication network 3 instead.

【００５７】 Besides text-to-speech synthesis, speech synthesis can also be performed by, for example, a recording-and-editing method (a method in which previously uttered words and phrases are stored and then concatenated).
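The recording-and-editing method mentioned above can be sketched as a lookup and concatenation of stored clips. This is a minimal illustration, not the implementation described in the specification: the function name, the `clip_store` dictionary, and the representation of a clip as a list of integer samples are all hypothetical.

```python
def synthesize_by_concatenation(words, clip_store):
    """Join pre-recorded clips in the order of the words to be spoken.

    `clip_store` maps each word or phrase to its recorded samples
    (a hypothetical representation chosen for this sketch).
    """
    samples = []
    for word in words:
        if word not in clip_store:
            # No recording exists for this unit, so it cannot be spoken
            # by this method; a real system would need a fallback.
            raise KeyError(f"no recorded clip for {word!r}")
        samples.extend(clip_store[word])
    return samples


# Hypothetical store of recorded units (tiny sample lists for brevity).
clip_store = {"good": [1, 2], "morning": [3, 4, 5]}
output = synthesize_by_concatenation(["good", "morning"], clip_store)
```

The sketch makes the main limitation of this method visible: only words or phrases recorded in advance can be output, whereas text-to-speech synthesis from phoneme or syllable data can handle arbitrary text.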

【００５８】 Next, the operation of the personal computer 1 shown in FIG. 2 as a chat client will be described further with reference to the flowchart of FIG. 9.

【００５９】 When the user operates the input unit 15 so as to execute the chat client program, the CPU 13 loads the chat client program stored in the HD 17 into the RAM 14 and executes it.

【００６０】 As a result, the CPU 13 displays a chat window 61 such as that shown in FIGS. 7 and 8 and, in step S1, performs processing to establish a connection with the server 2 and log in. When the login to the server 2 is complete, the process proceeds to step S2, where it is determined whether the user has spoken. If it is determined in step S2 that the user has not spoken, that is, if no voice has been input to the input unit 15, steps S3 to S6 are skipped and the process proceeds to step S7.

【００６１】 If it is determined in step S2 that the user has spoken, that is, if voice has been input to the input unit 15, the process proceeds to step S3, where the voice is speech-recognized.

【００６２】 Specifically, in step S3, the CPU 13 loads the voice processing program stored in the HD 17 into the RAM 14 and executes it, whereby the user's utterance is speech-recognized. Also in step S3, the speech recognition result is converted into text data and subjected to kana-kanji conversion. The kana-kanji conversion result is then displayed in the text input field 63 of the window 61, and the voice processing program is unloaded from the RAM 14.

【００６３】 The user looks at the speech recognition result (after kana-kanji conversion) displayed in the text input field 63 and, if it contains an error, corrects the error by operating, for example, the keyboard of the input unit 15. In this case, in step S4, the speech recognition result displayed in the text input field 63 is corrected in accordance with the operation of the input unit 15.

【００６４】 The speech recognition result displayed in the text input field 63 can also be corrected by voice input rather than by operating the keyboard of the input unit 15.

【００６５】 Thereafter, the process proceeds to step S5, where it is determined whether the text data displayed in the text input field 63 as the speech recognition result is to be transmitted to the server 2. If it is determined in step S5 that the text data displayed in the text input field 63 is not to be transmitted, that is, for example, if the input unit 15 has not been operated so as to transmit the text data, step S6 is skipped and the process proceeds to step S7.

【００６６】 If it is determined in step S5 that the text data displayed in the text input field 63 is to be transmitted, that is, for example, if the input unit 15 has been operated so as to transmit the text data, the process proceeds to step S6, where the CPU 13 controls the communication I/F 11 to transmit the text data displayed in the text input field 63 to the server 2 via the communication network 3. The process then proceeds to step S7.

【００６７】 In this case, the server 2 receives the text data from the personal computer 1 and transmits it to the other chat clients. Each such chat client displays the text data from the personal computer 1; alternatively, a chat client in which the voice processing program is installed, like the personal computer 1, outputs a synthesized sound corresponding to the text data from the personal computer 1.
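The relaying performed by the server 2 can be sketched as follows. This is a minimal sketch under stated assumptions: the `relay_text` function, the client identifiers, and the use of plain callables as delivery channels are hypothetical and stand in for whatever connection handling a real chat server uses.

```python
def relay_text(sender_id, text, clients):
    """Forward text received from one client to every other logged-in client.

    `clients` maps a client identifier to a callable that delivers text
    to that client (a hypothetical stand-in for a network connection).
    Returns the identifiers of the clients that received the text.
    """
    delivered_to = []
    for client_id, deliver in clients.items():
        if client_id == sender_id:
            continue  # the sender does not receive its own text back here
        deliver(text)
        delivered_to.append(client_id)
    return delivered_to


# Hypothetical clients: each inbox list records what that client received.
inboxes = {"pc1": [], "pc2": [], "pc3": []}
clients = {cid: box.append for cid, box in inboxes.items()}
recipients = relay_text("pc1", "hello", clients)
```

Because the server only forwards text, a receiving client is free to display it, synthesize it as sound, or both, which is why clients with and without the voice processing program can share the same chat.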

【００６８】 In step S7, it is determined whether text data has been transmitted from the server 2. If it is determined that no text data has been transmitted, that is, if the communication I/F 11 has not received text data from the server 2, steps S8 and S9 are skipped and the process proceeds to step S10.

【００６９】 If it is determined in step S7 that text data has been transmitted from the server 2, that is, if the communication I/F 11 has received text data from the server 2, the process proceeds to step S8, where the text data undergoes the necessary processing and is displayed in the chat character display area 62 of the window 61. The process then proceeds to step S9, where the CPU 13 loads the voice processing program stored in the HD 17 into the RAM 14 and executes it, thereby generating and outputting a synthesized sound corresponding to the text data displayed in the chat character display area 62.

【００７０】 The voice processing program is then unloaded from the RAM 14, and the process proceeds to step S10, where it is determined whether to log out. If it is determined in step S10 not to log out, that is, for example, if the input unit 15 has not been operated so as to log out, the process returns to step S2 and the same processing is repeated thereafter.

【００７１】 If it is determined in step S10 to log out, that is, for example, if the input unit 15 has been operated so as to log out, the personal computer 1 logs out of the server 2, disconnects from the server 2, and ends the processing.
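The flow of steps S1 to S10 in FIG. 9 can be sketched as a simple loop. This is an illustrative sketch, not the program of the embodiment: `ScriptedServer`, `run_chat_client`, and the stub recognizer are hypothetical names, the manual correction of step S4 and the send decision of step S5 are omitted (recognized text is always sent unchanged), and the logout decision of step S10 is modeled as reaching the end of a scripted list of utterances.

```python
from collections import deque


class ScriptedServer:
    """Hypothetical stand-in for the connection to the chat server."""

    def __init__(self, incoming):
        self.incoming = deque(incoming)  # text the server will deliver
        self.sent = []                   # text the client transmitted
        self.logged_in = False

    def login(self):
        self.logged_in = True

    def logout(self):
        self.logged_in = False

    def send(self, text):
        self.sent.append(text)

    def poll_text(self):
        return self.incoming.popleft() if self.incoming else None


def run_chat_client(server, utterances, recognize, show, synthesize):
    """One loop pass per entry in `utterances` (None means no speech)."""
    server.login()                        # S1: connect and log in
    for audio in utterances:
        if audio is not None:             # S2: did the user speak?
            text = recognize(audio)       # S3: speech recognition to text
            server.send(text)             # S6: transmit text to the server
        incoming = server.poll_text()     # S7: any text from the server?
        if incoming is not None:
            show(incoming)                # S8: display in the chat area
            synthesize(incoming)          # S9: output as synthesized sound
    server.logout()                       # S10: log out and disconnect


shown, spoken = [], []
server = ScriptedServer(["hello from B"])
run_chat_client(
    server,
    utterances=["a1", None],
    recognize=lambda audio: f"text({audio})",  # stub, not a real recognizer
    show=shown.append,
    synthesize=spoken.append,
)
```

Passing the recognizer, display, and synthesizer in as callables mirrors the separation in the description between the chat client program and the voice processing program, and lets the loop be exercised without any audio hardware.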

【００７２】 In the case described above, the voice processing program is loaded and unloaded as needed; however, the voice processing program may instead be kept resident in the RAM 14 while the chat client program is running.

【００７３】 In the case described above, the personal computer 1 generates a synthesized sound corresponding to the text data transmitted from the server 2. Besides being converted into synthesized sound, however, the text data transmitted from the server 2 can be converted into, for example, FAX (facsimile) data and sent to a FAX machine. The text data transmitted from the server 2 can also be saved.

【００７４】 The processing of the personal computer 1 in such cases will now be described with reference to the flowchart of FIG. 10.

【００７５】 In this case, in step S21, processing to establish a connection with the server 2 and log in is performed, as in step S1 of FIG. 9. The process then proceeds to step S22, where it is determined whether text data has been transmitted from the server 2. If it is determined that no text data has been transmitted, that is, if the communication I/F 11 has not received text data from the server 2, steps S23 to S27 are skipped and the process proceeds to step S28.

【００７６】 If it is determined in step S22 that text data has been transmitted from the server 2, that is, if the communication I/F 11 has received text data from the server 2, the process proceeds to step S23, where the text data undergoes the necessary processing, and then proceeds to step S24.

【００７７】 In step S24, it is determined whether the text data from the server 2 is to be sent to a FAX machine. If it is determined in step S24 that the text data is to be sent to a FAX machine, that is, if the personal computer 1 has been set to send text data by FAX and a FAX number has been set, the process proceeds to step S25, where the text data from the server 2 is converted into FAX data and sent to the set FAX number. The process then proceeds to step S28.

【００７８】 If it is determined in step S24 that the text data from the server 2 is not to be sent to a FAX machine, the process proceeds to step S26, where it is determined whether the personal computer 1 is set to save the text data. If it is determined in step S26 that the personal computer 1 is not set to save the text data from the server 2, step S27 is skipped and the process proceeds to step S28.

【００７９】 On the other hand, if it is determined in step S26 that the personal computer 1 is set to save the text data from the server 2, the process proceeds to step S27, where the text data is stored in the HD 17, and then proceeds to step S28.

【００８０】 In step S28, it is determined whether to log out, as in step S10 of FIG. 9. If it is determined not to log out, the process returns to step S22 and the same processing is repeated thereafter.

【００８１】 If it is determined in step S28 to log out, the personal computer 1 logs out of the server 2, disconnects from the server 2, and ends the processing.
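The branching of steps S24 to S27 in FIG. 10 can be sketched as a single routing function applied to each piece of text received from the server. The function name, its parameters, and the returned labels are hypothetical, and the conversion of text into FAX data in step S25 is left to the supplied `send_fax` callable rather than implemented here.

```python
def route_incoming_text(text, fax_number, save_enabled, send_fax, store):
    """Decide what to do with text received from the server (S24 to S27).

    `fax_number` is None when FAX forwarding is not configured.
    `send_fax` and `store` are caller-supplied callables (hypothetical).
    """
    if fax_number is not None:          # S24: FAX forwarding configured?
        send_fax(fax_number, text)      # S25: convert to FAX data and send
        return "faxed"
    if save_enabled:                    # S26: saving configured?
        store(text)                     # S27: store the text (e.g. on disk)
        return "saved"
    return "ignored"                    # neither option set: go to S28


faxes, saved = [], []
result_fax = route_incoming_text(
    "meeting at 3", "hypothetical-fax-number", True,
    send_fax=lambda number, body: faxes.append((number, body)),
    store=saved.append,
)
result_save = route_incoming_text(
    "see you", None, True,
    send_fax=lambda number, body: faxes.append((number, body)),
    store=saved.append,
)
```

As in the flowchart, FAX forwarding takes precedence: when a FAX number is set, the text is faxed and the save check of step S26 is not reached.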

【００８２】 Here, if the personal computer 1 converts text data input by voice into FAX data, a FAX can be sent without writing characters or the like on paper. Likewise, if the personal computer 1 receives FAX data, performs character recognition using an OCR (Optical Character Reader) or the like, and performs speech synthesis on the character recognition result, the user can grasp the faxed content without looking at it. Furthermore, if, for example, a telephone with an answering machine function, or a center that provides the answering machine function for mobile phones, speech-recognizes voice messages and stores them as text data, less storage capacity is needed than when voice data is stored, which makes it possible to reduce costs.

【００８３】 In the present embodiment, the chat server and the chat client are implemented in software; however, the chat server and the chat client can also be implemented with dedicated hardware.

【００８４】 When application programs such as the chat client program and the voice processing program described above are installed in a computer and executed, they can be provided recorded on the HD 17, on package media such as a floppy disk, a CD-ROM (Compact Disc - ROM), or a DVD (Digital Versatile Disc), or in a semiconductor memory or the like in which programs are stored temporarily or permanently. They can also be provided via a wired or wireless communication network 3, such as a LAN (Local Area Network), the Internet, or a digital satellite link, and via a communication I/F 11, such as a router or modem, that transfers or receives data over such a communication network 3. The term "medium" in this specification means a broad concept that includes all of these.

【００８５】[0085]

[Effects of the Invention] According to the information processing apparatus of claim 1, the information processing method of claim 5, and the medium of claim 6, input voice is speech-recognized and the speech recognition result is output as text data. The text data obtained as the speech recognition result is then transmitted to the server. Consequently, for example, chatting can be performed easily.

【００８６】 According to the information processing apparatus of claim 7, the information processing method of claim 9, and the medium of claim 10, text data transmitted from the server is received, speech synthesis is performed based on the text data from the server, and a synthesized sound corresponding to the text data is output. Consequently, for example, chatting can be performed without looking at a screen display.

[Brief description of the drawings]

FIG. 1 is a diagram showing a configuration example of an embodiment of a chat system to which the present invention is applied.

FIG. 2 is a block diagram showing a hardware configuration example of the personal computer 1 of FIG. 1.

FIG. 3 is a block diagram showing a hardware configuration example of the server 2 of FIG. 1.

FIG. 4 is a diagram for explaining the operation of the chat system of FIG. 1.

FIG. 5 is a block diagram showing a functional configuration example of the personal computer 1 of FIG. 1.

FIG. 6 is a diagram for explaining the operation of the personal computer 1 of FIG. 5.

FIG. 7 is a diagram showing a display screen of the personal computer 1 as a chat client.

FIG. 8 is a diagram showing a display screen of the personal computer 1 as a chat client.

FIG. 9 is a flowchart for explaining the operation of the personal computer 1 of FIG. 1.

FIG. 10 is a flowchart for explaining the operation of the personal computer 1 of FIG. 1.

[Explanation of symbols]

1-1 to 1-3: personal computers; 2-1, 2-2: servers; 3: communication network; 11: communication I/F; 12: ROM; 13: CPU; 14: RAM; 15: input unit; 16: output unit; 17: HD; 21: communication I/F; 22: ROM; 23: CPU; 24: RAM; 25: input unit; 26: output unit; 27: HD; 30: chat client function unit; 31-1, 31-2: chat communication units; 32-1, 32-2: chat processing units; 41: chat server function unit; 51: voice input device; 52: voice recognition device; 53: voice reading device; 54: voice output device; 61-1, 61-2: windows; 62-1, 62-2: chat character display areas; 63-1, 63-2: text input fields

Claims

[Claims]

1. An information processing apparatus that transmits text data to a server, the server receiving text data transmitted from a client and transmitting it to one or more other clients, and that receives text data from the server, the apparatus comprising: voice recognition means for performing voice recognition on input voice and outputting the voice recognition result as text data; transmitting means for transmitting the text data as the voice recognition result to the server; receiving means for receiving text data transmitted from the server; and output means for outputting the text data from the server.

2. The information processing apparatus according to claim 1, further comprising voice synthesis means for performing voice synthesis based on the text data from the server and outputting a synthesized sound corresponding to the text data, wherein the output means outputs the synthesized sound corresponding to the text data.

3. The information processing apparatus according to claim 1, further comprising a correction unit configured to correct text data as a result of the speech recognition.

4. The information processing apparatus according to claim 1, further comprising a display unit for displaying the text data.

5. An information processing method for transmitting text data to a server, the server receiving text data transmitted from a client and transmitting it to one or more other clients, and for receiving text data from the server, the method comprising: a voice recognition step of performing voice recognition on input voice and outputting the voice recognition result as text data; a transmitting step of transmitting the text data as the voice recognition result to the server; a receiving step of receiving text data transmitted from the server; and an output step of outputting the text data from the server.

6. A medium for causing an information processing apparatus to execute a program for performing processing of transmitting text data to a server, the server receiving text data transmitted from a client and transmitting it to one or more other clients, and of receiving text data from the server, the program comprising: a voice recognition step of performing voice recognition on input voice and outputting the voice recognition result as text data; a transmitting step of transmitting the text data as the voice recognition result to the server; a receiving step of receiving text data transmitted from the server; and an output step of outputting the text data from the server.

7. An information processing apparatus that transmits text data to a server, the server receiving text data transmitted from a client and transmitting it to one or more other clients, and that receives text data from the server, the apparatus comprising: input means for inputting the text data; transmitting means for transmitting the input text data to the server; receiving means for receiving text data transmitted from the server; and voice synthesis means for performing voice synthesis based on the text data from the server and outputting a synthesized sound corresponding to the text data.

8. The information processing apparatus according to claim 7, further comprising display means for displaying the text data.

9. An information processing method for transmitting text data to a server, the server receiving text data transmitted from a client and transmitting it to one or more other clients, and for receiving text data from the server, the method comprising: an input step of inputting the text data; a transmitting step of transmitting the input text data to the server; a receiving step of receiving text data transmitted from the server; and a voice synthesis step of performing voice synthesis based on the text data from the server and outputting a synthesized sound corresponding to the text data.

10. A medium for causing an information processing apparatus to execute a program for performing processing of transmitting text data to a server, the server receiving text data transmitted from a client and transmitting it to one or more other clients, and of receiving text data from the server, the program comprising: an input step of inputting the text data; a transmitting step of transmitting the input text data to the server; a receiving step of receiving text data transmitted from the server; and a voice synthesis step of performing voice synthesis based on the text data from the server and outputting a synthesized sound corresponding to the text data.